Chaojun XIAO

xcjthu

17 11 12

https://xcjthu.github.io/

xcjthu

AI & ML interests

NLP、information extraction

Recent Activity

upvoted a paper 8 days ago

Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning

submitted a paper 8 days ago

Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning

upvoted a paper 14 days ago

Rethinking the Role of Efficient Attention in Hybrid Architectures

View all activity

Organizations

upvoted a paper 8 days ago

Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning

Paper • 2606.18831 • Published 15 days ago • 7

submitted a paper to Daily Papers 8 days ago

Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning

Paper • 2606.18831 • Published 15 days ago • 7

upvoted a paper 14 days ago

Rethinking the Role of Efficient Attention in Hybrid Architectures

Paper • 2606.15378 • Published 19 days ago • 18

submitted a paper to Daily Papers 14 days ago

Rethinking the Role of Efficient Attention in Hybrid Architectures

Paper • 2606.15378 • Published 19 days ago • 18

liked 2 datasets about 1 month ago

openbmb/UltraData-SFT-2605

Viewer • Updated May 28 • 12.2M • 38.7k • 356

openbmb/Ultra-FineWeb-L3

Viewer • Updated May 28 • 1.06B • 74.2k • 306

New activity in openbmb/MiniCPM5-1B about 1 month ago

Hindi fine-tune of MiniCPM5-1B now available + GGUF quants

🤯👍 4

#6 opened about 1 month ago by

pankajpandey-dev

经过实际测试，该模型长上下文能力很差，我把llama-server --help输出的内容发给模型，让它列出所有选项，这都做不到

#5 opened about 1 month ago by

chuikingshek

Installation Video and Testing - Step by Step

#1 opened about 1 month ago by

fahdmirzac

liked a model about 1 month ago

openbmb/MiniCPM5-1B-SFT

Text Generation • 1B • Updated May 25 • 30.6k • 33

liked a Space about 1 month ago

MiniCPM5 1B Demo

📉

MiniCPM5-1B-Demo

liked a model about 1 month ago

openbmb/MiniCPM5-1B

Text Generation • 1B • Updated May 26 • 350k • 826

updated a model 8 months ago

openbmb/MiniCPM4-0.5B

Text Generation • 0.4B • Updated Oct 20, 2025 • 44.3k • 78

updated a model 9 months ago

openbmb/MiniCPM4.1-8B

Text Generation • 8B • Updated Oct 24, 2025 • 80k • 391

authored a paper 9 months ago

InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation

Paper • 2509.24663 • Published Sep 29, 2025 • 18

upvoted a paper 9 months ago

InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation

Paper • 2509.24663 • Published Sep 29, 2025 • 18

commented a paper 9 months ago

InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation

Paper • 2509.24663 • Published Sep 29, 2025 • 18 •

upvoted a paper 9 months ago

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 119

liked a model 10 months ago

openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19, 2025 • 7k • 807

New activity in openbmb/MiniCPM4.1-8B 10 months ago

Local Installation Video and Testing - Step by Step

👍❤️ 4

#1 opened 10 months ago by

fahdmirzac

Chaojun XIAO

AI & ML interests

Recent Activity

Organizations

xcjthu's activity

Hindi fine-tune of MiniCPM5-1B now available + GGUF quants

经过实际测试，该模型长上下文能力很差，我把llama-server --help输出的内容发给模型，让它列出所有选项，这都做不到

Installation Video and Testing - Step by Step

MiniCPM5 1B Demo

Local Installation Video and Testing - Step by Step