AI Paper Digest, 2026-05-14 (HuggingFace Daily Papers)

Source: https://huggingface.co/papers | Collected: 2026-05-14

📌 Highlights

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context | arXiv — [Highlight] Long-context modeling is becoming a core capability of modern large vision-la...
LLM Agents Already Know When to Call Tools -- Even Without Reasoning | arXiv — [Highlight] Tool-augmented LLM agents tend to call tools indiscriminately, even when the ...
PAAC: Privacy-Aware Agentic Device-Cloud Collaboration | arXiv — [Highlight] Large language model (LLM) agents face a structural tension: cloud agents pro...

SleepWalk: A Three-Tier Benchmark for Stress-Testing Instruction-Guided Vision-Language Navigation | arXiv — Vision-Language Models (VLMs) have advanced rapidly in multimodal perception ...
MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning | arXiv — Current interactive LLM agents rely on goal-conditioned stepwise planning, wh...
Covering Human Action Space for Computer Use: Data Synthesis and Benchmark | arXiv — Computer-use agents (CUAs) automate on-screen work, as illustrated by GPT-5.4...
ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging | arXiv — Despite the rapid advancements in large language model (LLM) development, fin...
UniPath: Adaptive Coordination of Understanding and Generation for Unified Multimodal Reasoning | arXiv — Unified multimodal models (UMMs) aim to integrate understanding and generatio...
Context Training with Active Information Seeking | arXiv — Most existing large language models (LLMs) are expensive to adapt after deplo...
Agent-BRACE: Decoupling Beliefs from Actions in Long-Horizon Tasks via Verbalized State Uncertainty | arXiv — Large language models (LLMs) are increasingly deployed on long-horizon tasks ...