Huang, Wei-Chieh, Zhang, Weizhi, Liang, Yueqing, Bei, Yuanchen, Chen, Yankai, Feng, Tao, Pan, Xinyu, Tan, Zhen, Wang, Yu, Wei, Tianxin, Wu, Shanglin, Xu, Ruiyao, Yang, Liangwei, Yang, Rui, Yang, Wooseong, Yeh, Chin-Yuan, Zhang, Hanrong, Zhang, Haozhen, Zhu, Siqi, Zou, Henry Peng, Zhao, Wanjia, Wang, Song, Xu, Wujiang, Ke, Zixuan, Hui, Zheng, Li, Dawei, Wu, Yaozu, He, Langzhou, Wang, Chen, Xu, Xiongxiao, Huang, Baixiang, Tan, Juntao, Heinecke, Shelby, Wang, Huan, Xiong, Caiming, Metwally, Ahmed A., Yan, Jun, Lee, Chen-Yu, Zeng, Hanqing, Xia, Yinglong, Wei, Xiaokai, Payani, Ali, Wang, Yu, Ma, Haitong, Wang, Wenya, Wang, Chengguang, Zhang, Yu, Wang, Xin, Zhang, Yongfeng, You, Jiaxuan, Tong, Hanghang, Luo, Xiao, Sun, Yizhou, Wang, Wei, McAuley, Julian, Zou, James, Han, Jiawei, Yu, Philip S., Shu, Kai
Abstract
The research of artificial intelligence is undergoing a paradigm shift from prioritizing model innovations over benchmark scores towards emphasizing problem definition and rigorous real-world evaluation. As the field enters the "second half," the central challenge becomes real utility in long-horizon, dynamic, and user-dependent environments, where agents face context explosion and must continuously accumulate, manage, and selectively reuse large volumes of information across extended interactions. Memory, with hundreds of papers released this year, therefore emerges as the critical solution to fill the utility gap. In this survey, we provide a unified view of foundation agent memory along three dimensions: memory substrate (internal and external), cognitive mechanism (episodic, semantic, sensory, working, and procedural), and memory subject (agent- and user-centric). We then analyze how memory is instantiated and operated under different agent topologies and highlight learning policies over memory operations. Finally, we review evaluation benchmarks and metrics for assessing memory utility, and outline various open challenges and future directions.
Chinese Translation
人工智能的研究正经历一个范式转变,从优先考虑模型创新和基准分数,转向强调问题定义和严格的现实世界评估。随着该领域进入“下半场”,中心挑战变为在长时间跨度、动态和用户依赖的环境中实现真实的实用性,在这些环境中,智能体面临上下文爆炸,并且必须在延续的交互中不断积累、管理和有选择地重用大量信息。因此,记忆作为填补实用性差距的关键解决方案,今年发布了数百篇相关论文。在本次调查中,我们从三个维度提供了基础智能体记忆的统一视角:记忆基质(内部和外部)、认知机制(情节记忆、语义记忆、感官记忆、工作记忆和程序性记忆)以及记忆主体(以智能体和用户为中心)。随后,我们分析了在不同智能体拓扑结构下记忆的实例化和操作方式,并强调了记忆操作的学习策略。最后,我们回顾了评估记忆实用性的基准和指标,并概述了各种开放挑战和未来方向。