AI Paper Digest, May 31, 2026 (HuggingFace Daily Papers)

Source: https://huggingface.co/papers Collected: 2026-05-31

📌 Featured

  1. Verifiable Rewards Beyond Math and Code: Lightweight Corpus-Grounded Process Supervision for Factual Question Answering | arXiv [Featured] Applying reinforcement learning to improve factual accuracy in knowledge-inte... 💡 Uses reinforcement learning to improve factual accuracy; valuable for AI knowledge verification
  2. GenClaw: Code-Driven Agentic Image Generation | arXiv [Featured] Image generation models have evolved from text-conditioned pixel synthesis to... 💡 Code-driven image generation that gives agents a means of visual expression
  3. CoHyDE: Iterative Co-Training of LLM Rewriter & Dense Encoder for Tool Retrieval | arXiv [Featured] Tool retrieval over large API catalogs is a core bottleneck for LLM agents: u... 💡 A new advance in LLM tool retrieval, aimed at a performance bottleneck for agents

📋 Also Worth Noting

  1. Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning | arXiv — Vision-Language Models (VLMs) often struggle with robust 3D spatial reasoning...
  2. UI-KOBE: Knowledge-Oriented Behavior Exploration for Lightweight Graph-Guided GUI Agents | arXiv — Recent advances in mobile GUI agents have shown strong potential for automati...
  3. DynaFLIP: Rethinking Robotics Perception via Tri-Modal-Dynamics Guided Representation | arXiv — Robot manipulation critically depends on perception that preserves the action...
  4. Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation | arXiv — Large Language Models (LLMs) have advanced autonomous agents from deep search...
  5. WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction | arXiv — Multimodal large language models are increasingly deployed as long-horizon ag...
  6. Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection | arXiv — Recent advances in Vision-Language Models (VLMs) have achieved impressive per...
  7. AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security | arXiv — Modern open-world agents such as OpenClaw exhibit powerful cross-environment ...