Liang, Yuan, Zhong, Ruobin, Xu, Haoming, Jiang, Chen, Zhong, Yi, Fang, Runnan, Gu, Jia-Chen, Deng, Shumin, Yao, Yunzhi, Wang, Mengru, Qiao, Shuofei, Xu, Xin, Wu, Tongtong, Wang, Kun, Liu, Yang, Bi, Zhen, Lou, Jungang, Jiang, Yuchen Eleanor, Zhu, Hangcheng, Yu, Gang, Hong, Haiwen, Huang, Longtao, Xue, Hui, Wang, Chenxi, Wang, Yijun, Shan, Zifei, Chen, Xi, Tu, Zhaopeng, Xiong, Feiyu, Xie, Xin, Zhang, Peng, Gui, Zhengke, Liang, Lei, Zhou, Jun, Wu, Chiyu, Shang, Jin, Gong, Yu, Lin, Junyu, Xu, Changliang, Deng, Hongjie, Zhang, Wen, Ding, Keyan, Zhang, Qiang, Huang, Fei, Zhang, Ningyu, Pan, Jeff Z., Qi, Guilin, Wang, Haofen, Chen, Huajun
Abstract
Current AI agents can flexibly invoke tools and execute complex tasks, yet their long-term advancement is hindered by the lack of systematic accumulation and transfer of skills. Without a unified mechanism for skill consolidation, agents frequently ``reinvent the wheel'', rediscovering solutions in isolated contexts without leveraging prior strategies. To overcome this limitation, we introduce SkillNet, an open infrastructure designed to create, evaluate, and organize AI skills at scale. SkillNet structures skills within a unified ontology that supports creating skills from heterogeneous sources, establishing rich relational connections, and performing multi-dimensional evaluation across Safety, Completeness, Executability, Maintainability, and Cost-awareness. Our infrastructure integrates a repository of over 200,000 skills, an interactive platform, and a versatile Python toolkit. Experimental evaluations on ALFWorld, WebShop, and ScienceWorld demonstrate that SkillNet significantly enhances agent performance, improving average rewards by 40% and reducing execution steps by 30% across multiple backbone models. By formalizing skills as evolving, composable assets, SkillNet provides a robust foundation for agents to move from transient experience to durable mastery.
Chinese Translation
当前的人工智能代理能够灵活地调用工具并执行复杂任务,但其长期发展受到系统性技能积累和转移缺乏的制约。没有统一的技能整合机制,代理经常“重复发明轮子”,在孤立的上下文中重新发现解决方案,而没有利用先前的策略。为了解决这一限制,我们提出了SkillNet,一个旨在大规模创建、评估和组织人工智能技能的开放基础设施。SkillNet在统一本体内构建技能,支持从异构来源创建技能,建立丰富的关系连接,并在安全性、完整性、可执行性、可维护性和成本意识等多个维度进行评估。我们的基础设施集成了超过200,000个技能的库、一个互动平台和一个多功能的Python工具包。在ALFWorld、WebShop和ScienceWorld上的实验评估表明,SkillNet显著提升了代理的性能,平均奖励提高了40%,执行步骤减少了30%,适用于多个基础模型。通过将技能形式化为不断演变的、可组合的资产,SkillNet为代理从短暂经验转向持久掌握提供了坚实的基础。