AI论文速递 2026年05月16日(HuggingFace Daily Papers)¶
数据来源:https://huggingface.co/papers 采集时间:2026-05-16
📌 重点关注¶
- WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation | arXiv — 【重点关注】 Large language and vision-language models increasingly power agents that act ...
- MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models | arXiv — 【重点关注】 Memory is essential for large vision-language models (LVLMs) to handle long, ...
- Orchard: An Open-Source Agentic Modeling Framework | arXiv — 【重点关注】 Agentic modeling aims to transform LLMs into autonomous agents capable of sol...
📋 其他值得关注¶
- Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems | arXiv — LLM-based autonomous agents have demonstrated strong capabilities in reasonin...
- PanoWorld: Towards Spatial Supersensing in 360^circ Panorama World | arXiv — Multimodal large laboratory models (MLLMs) still struggle with spatial unders...
- RewardHarness: Self-Evolving Agentic Post-Training | arXiv — Evaluating instruction-guided image edits requires rewards that reflect subtl...
- STALE: Can LLM Agents Know When Their Memories Are No Longer Valid? | arXiv — Large Language Model (LLM) agents are increasingly expected to maintain coher...
- Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding | arXiv — Multi-agent pathfinding (MAPF) is a widely used abstraction for multi-robot t...
- ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both | arXiv — Visual reasoning, often interleaved with intermediate visual states, has emer...
- IntentVLA: Short-Horizon Intent Modeling for Aliased Robot Manipulation | arXiv — Robot imitation data are often multimodal: similar visual-language observatio...