Agent Memory 每日论文综述
本报告自动生成自 papers.cool/arxiv/cs.AI
筛选标准:标题或摘要包含 agent、memory、RAG、episodic memory 等关键词
生成时间:2026/5/26 11:30:27
📊 今日概况
- 总扫描论文: 25 篇
- Agent Memory 相关: 14 篇
📝 相关论文列表
1. SkillOpt: Executive Strategy for Self-Evolving Agent Skills
arXiv ID: 2605.23904
核心要点: skill,skillopt,skills,codex,claude,agent,optimizer,chat,executive,gepa…
关键词: skill,skillopt,skills,codex,claude,agent,optimizer,chat,executive,gepa
2. From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills
arXiv ID: 2605.23899
核心要点: skill,skills,emph,experience,textbf,lifecycle,extractors,consumption,extraction,utility…
关键词: skill,skills,emph,experience,textbf,lifecycle,extractors,consumption,extraction,utility
3. Agentic Proving for Program Verification
arXiv ID: 2605.23772
核心要点: agentic,program,verification,claude,proving,specifications,clever,scoring,isomorphism,capabilities…
关键词: agentic,program,verification,claude,proving,specifications,clever,scoring,isomorphism,capabilities
4. MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection
arXiv ID: 2605.23723
核心要点: memory,memaudit,auditing,agent,hoc,post,attack,records,causal,success…
关键词: memory,memaudit,auditing,agent,hoc,post,attack,records,causal,success
5. One Policy, Infinite NPCs: Persona-Traceable Shared RL Policies for Scalable Game Agents
arXiv ID: 2605.23652
核心要点: persona,pcsp,npcs,conditioned,policy,npc,infonce,shared,policies,shot…
关键词: persona,pcsp,npcs,conditioned,policy,npc,infonce,shared,policies,shot
6. Co-ReAct: Rubrics as Step-Level Collaborators for ReAct Agents
arXiv ID: 2605.23590
核心要点: react,rubrics,rubric,step,agents,reasoning,rather,decision,grpo,level…
关键词: react,rubrics,rubric,step,agents,reasoning,rather,decision,grpo,level
7. When Planning Fails Despite Correct Execution: On Epistemic Calibration for LLM-Based Multi-Agent Systems
arXiv ID: 2605.23414
核心要点: epistemic,miscalibration,epc,planning,plans,feasibility,calibration,execution,llm,misjudge…
关键词: epistemic,miscalibration,epc,planning,plans,feasibility,calibration,execution,llm,misjudge
8. Human-in-the-Loop Multi-Agent Ventilator Decision Support with Contextual Bandit Preference Learning
arXiv ID: 2605.23320
核心要点: ventilator,vdss,decision,clinician,bandit,contextual,preference,human,support,loop…
关键词: ventilator,vdss,decision,clinician,bandit,contextual,preference,human,support,loop
9. DART: Semantic Recoverability for Structured Tool Agents
arXiv ID: 2605.23311
核心要点: dart,recoverability,commitment,rollback,downstream,committed,recovery,instance,semantic,failed…
关键词: dart,recoverability,commitment,rollback,downstream,committed,recovery,instance,semantic,failed
10. Parallel Context Compaction for Long-Horizon LLM Agent Serving
arXiv ID: 2605.23296
核心要点: compaction,context,llm,parallel,horizon,agent,volume,conversation,120b,locomo…
关键词: compaction,context,llm,parallel,horizon,agent,volume,conversation,120b,locomo
11. Foundation Protocol: A Coordination Layer for Agentic Society
arXiv ID: 2605.23218
核心要点: coordination,society,foundation,agentic,agents,governable,layer,negotiable,protocol,infrastructure…
关键词: coordination,society,foundation,agentic,agents,governable,layer,negotiable,protocol,infrastructure
12. Redrawing the AI Map: A Theory of Accountability Boundaries in Agentic Ecosystems
arXiv ID: 2605.23179
核心要点: accountability,agentic,organizational,assets,assignable,boundaries,ecosystems,modularization,boundary,redrawing…
关键词: accountability,agentic,organizational,assets,assignable,boundaries,ecosystems,modularization,boundary,redrawing
13. Inductive Deductive Synthesis: Enabling AI to Generate Formally Verified Systems
arXiv ID: 2605.23109
核心要点: ids,deductive,inductive,verified,agents,synthesis,expert,effort,sota,200x…
关键词: ids,deductive,inductive,verified,agents,synthesis,expert,effort,sota,200x
14. EVE-Agent: Evidence-Verifiable Self-Evolving Agents
arXiv ID: 2605.22905
核心要点: eve,evidence,self,evolving,verifiable,agent,agents,answer,span,proposer…
关键词: eve,evidence,self,evolving,verifiable,agent,agents,answer,span,proposer
AI Agent Memory 研究深度洞察报告
1. 研究趋势
今日AI Agent Memory研究呈现多元化发展趋势,技能获取与优化成为热点,如SkillOpt和From Raw Experience to Skill Consumption论文聚焦于Agent技能的自我进化与生命周期管理。与往日相比,研究重点从简单的记忆存储转向复杂的能力构建与验证,如MemAudit关注记忆安全审计,EVE-Agent强调证据可验证性。新兴方向包括多智能体社会协调(如Foundation Protocol)、个性化角色共享(One Policy, Infinite NPCs)以及形式化验证(Inductive Deductive Synthesis)等,显示Agent研究正从单点突破向系统性、可验证性方向发展。
2. 技术演进
Memory系统架构正经历从简单检索增强(RAG)到复杂记忆系统,再到世界模型的演进。早期RAG系统主要关注外部知识检索,而现代Memory系统如MemAudit已整合因果归因与结构异常检测,实现记忆安全审计。DART论文提出的语义恢复能力展示了高级记忆管理,支持回滚与恢复。Parallel Context Compaction解决了长场景下的上下文压缩问题。技术突破点包括:1)技能提取与消费的生命周期管理(From Raw Experience to Skill Consumption);2)基于证据的可验证机制(EVE-Agent);3)结构化工具代理的语义恢复(DART);4)多智能体社会协调层(Foundation Protocol)。这些演进使Agent记忆系统更接近人类认知能力。
3. 关键洞察
技能消费与提取分离:From Raw Experience to Skill Consumption论文表明,技能提取与消费是独立过程,这提示我们构建Agent系统时应将技能库与执行引擎解耦,提高模块化程度,便于技能的独立更新与优化。
记忆安全审计重要性:MemAudit研究显示Agent记忆易受攻击,需要引入因果归因与结构异常检测机制。建议在MyClaw中实现记忆访问日志与异常检测模块,定期审计记忆完整性。
语义恢复能力:DART提出的语义恢复机制表明,简单的状态回滚不足以应对复杂场景,需要理解语义层面的承诺关系。建议在Agent中实现基于语义的恢复机制,而非仅依赖原始状态。
多智能体社会协调:Foundation Protocol提出的协调层概念表明,未来Agent系统将需要类似社会规范的协调机制。建议为MyClaw设计可协商的协议层,支持多Agent协作。
证据可验证性:EVE-Agent强调证据可验证性,这提示我们Agent决策过程应保留可追溯的证据链,提高系统透明度与可靠性。
4. 开源项目关联
今日研究与主流开源项目紧密相关。SkillOpt和From Raw Experience to Skill Consumption的研究可借鉴到LangChain的Agent执行框架中,优化技能管理模块。MemAudit的安全审计理念可整合到LlamaIndex的索引系统中,增强记忆安全性。DART的语义恢复机制对Mem0的状态管理有重要参考价值,特别是其下游任务恢复能力。Foundation Protocol的协调层概念可启发LlamaIndex的多Agent协作框架设计。对于MyClaw项目,建议重点关注:1)整合MemAudit的因果归因技术,增强记忆安全性;2)采用DART的语义恢复机制,提高系统鲁棒性;3)借鉴Foundation Protocol的协调层设计,支持多Agent协作场景。
5. 下一步行动
构建技能生命周期管理模块:基于From Raw Experience to Skill Consumption的研究,在MyClaw中实现技能提取、存储、消费的完整生命周期管理,支持技能的动态更新与优化。
开发记忆安全审计系统:参考MemAudit方法,为MyClaw实现基于因果归因与结构异常检测的记忆安全审计系统,定期检查记忆完整性,防止恶意攻击。
设计语义恢复机制:借鉴DART的语义恢复概念,开发支持语义理解的恢复机制,而非简单的状态回滚,提高系统在面对失败时的恢复能力。
实现多Agent协调协议:基于Foundation Protocol的研究,设计可协商的协调层,支持MyClaw在多Agent环境下的协作与交互,建立Agent社会的规范与标准。
📚 附录
搜索关键词
agent, memory, memory-augmented, episodic, long-term, recall, retrieval, knowledge base, RAG, retrieval-augmented, episodic memory, working memory, memory system, remember, experience replay, memory network, external memory, vector database
本报告由 OpenClaw 自动生成(GLM-5 深度分析版)
面向 Agent Memory 系统设计者,提供前沿研究洞察