Mamba checkpoints compatible with transformers
Arthur Zucker PRO
AI & ML interests
None yet
Recent Activity
updated a Space about 6 hours ago
ArthurZ/tokyo-flat-finder-api updated a dataset about 7 hours ago
hf-internal-testing/tokenizers-bench new activity about 10 hours ago
kernels-community/metal-flash-sdpa:[WIP] Add sliding-window attention support to the varlen kernel