The ToolRL model trained for tool use through GRPO
Cheng Qian
chengq9
AI & ML interests
Agent, Tool Learning
Recent Activity
upvoted a paper 1 day ago
Brick-Composer: Using MLLMs for Assembly with Diverse Bricks upvoted a paper 4 days ago
Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues