Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2606.15007 • Published 5 days ago • 9
UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer Paper • 2606.16255 • Published 2 days ago • 8
TokenPilot: Cache-Efficient Context Management for LLM Agents Paper • 2606.17016 • Published 2 days ago • 13
OneRank: Unified Transformer-Native Ranking Architecture for Multi-Task Recommendation Paper • 2606.16838 • Published 2 days ago • 15
VisualClaw: A Real-Time, Personalized Agent for the Physical World Paper • 2606.16295 • Published 2 days ago • 21
Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories Paper • 2606.11176 • Published 8 days ago • 101
JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 7 days ago • 161
AlloSpatial: Agentic Harness Framework for Spatial Reasoning in Foundation Models Paper • 2606.08952 • Published 9 days ago • 1
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment Paper • 2606.10747 • Published 8 days ago • 11
Hy-Embodied-0.5-VLA: From Vision-Language-Action Models to a Real-World Robot Learning Stack Paper • 2606.14409 • Published 5 days ago • 11
Skip a Layer or Loop It? Learning Program-of-Layers in LLMs Paper • 2606.06574 • Published 13 days ago • 18
HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry Paper • 2606.14249 • Published 5 days ago • 39
ArogyaSutra: A Multi-Agent Framework for Multimodal Medical Reasoning in Indic Languages Paper • 2606.13572 • Published 6 days ago • 2
HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness Paper • 2606.12882 • Published 6 days ago • 12
MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold Paper • 2606.13376 • Published 6 days ago • 13
N-GRPO: Embedding-Level Neighbor Mixing for Enhanced Policy Optimization Paper • 2606.10768 • Published 8 days ago • 24