Benchmarks are Not Enough: RAMP for Runtime Assessing of Agentic Models in Production Systems Paper • 2605.27492 • Published 11 days ago • 23
ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning Paper • 2606.03503 • Published 4 days ago • 25
SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks Paper • 2605.31433 • Published 8 days ago • 26
Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation Paper • 2606.04527 • Published 3 days ago • 26
M^3Eval: Multi-Modal Memory Evaluation through Cognitively-Grounded Video Tasks Paper • 2606.05008 • Published 3 days ago • 26
A Local Perturbation Theory for Cross-Domain Interference and Recovery in Multi-Domain RL Paper • 2606.02398 • Published 5 days ago • 27
AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks? Paper • 2606.05080 • Published 3 days ago • 27
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints Paper • 2606.05622 • Published 2 days ago • 34
NITP: Next Implicit Token Prediction for LLM Pre-training Paper • 2605.24956 • Published 13 days ago • 34
NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation Paper • 2606.03159 • Published 4 days ago • 21
RobotValues: Evaluating Household Robots When Human Values Conflict Paper • 2606.03312 • Published 4 days ago • 23
VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding Paper • 2606.05259 • Published 3 days ago • 33
ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time? Paper • 2606.05553 • Published 2 days ago • 42
Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution Paper • 2606.06492 • Published 2 days ago • 52
YoCausal: How Far is Video Generation from World Model? A Causality Perspective Paper • 2605.30346 • Published 9 days ago • 54
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 9 days ago • 57
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published 9 days ago • 60
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 10 days ago • 73