A reflection on the challenge of evaluating personal AI agents whose value depends heavily on memory, illustrated by the author's experience with the Macaron agent.