Skip to content

AI论文速递 2026年05月10日(HuggingFace Daily Papers)

数据来源:https://huggingface.co/papers 采集时间:2026-05-10

📌 重点关注

  1. Audio-Visual Intelligence in Large Foundation Models | arXiv【重点关注】 Audio-Visual Intelligence (AVI) has emerged as a central frontier in artificial intelligence research, focusing on how machines can process and integrate information from both auditory and visual modalities.

💡 多模态智能是AI前沿方向,推动人机交互新边界 2. ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning | arXiv【重点关注】 Reinforcement Learning with Verifiable Rewards (RLVR) enhances reasoning of Large Language Models through innovative negative sample projection and residual learning techniques.

💡 负样本投影提升LLM推理能力,突破思维局限 3. BioTool: A Comprehensive Tool-Calling Dataset for Enhancing Biomedical Capabilities of Large Language Models | arXiv【重点关注】 Despite the success of large language models (LLMs) on general-purpose tasks, specialized applications in biomedicine require domain-specific tools and datasets to achieve optimal performance and reliability.

💡 生物医学工具调用能力提升,专业领域AI突破...

📋 其他值得关注

  1. Think, then Score: Decoupled Reasoning and Scoring for Video Reward Modeling | arXiv — Recent advances in generative video models are increasingly driven by post-tr...
  2. KinDER: A Physical Reasoning Benchmark for Robot Learning and Planning | arXiv — Robotic systems that interact with the physical world must reason about kinem...
  3. CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing | arXiv — Recent advances in large language models have led to strong performance on re...
  4. EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions | arXiv — Multimodal Large Language Models (MLLMs) hold significant promise for revolut...
  5. ReflectDrive-2: Reinforcement-Learning-Aligned Self-Editing for Discrete Diffusion Driving | arXiv — We introduce ReflectDrive-2, a masked discrete diffusion planner with separat...
  6. Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction | arXiv — Modern retrieval systems, whether lexical or semantic, expose a corpus throug...
  7. Turning Drift into Constraint: Robust Reasoning Alignment in Non-Stationary Environments | arXiv — This paper identifies a critical yet underexplored challenge in reasoning ali...