Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2205.13147

peper-intention

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Paper • 2504.07615 • Published Apr 10, 2025 • 36

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

Papers - Embeddings - Text - Sentence - Matryoshka

2D Matryoshka Sentence Embeddings

Paper • 2402.14776 • Published Feb 22, 2024 • 8
Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference

Paper • 2404.15420 • Published Apr 23, 2024 • 11
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22, 2024 • 126
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 262
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22, 2024 • 45

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87
Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108
BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16, 2025 • 86
FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16, 2025 • 29

2023 (and before) Papers of the Year

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Paper • 2306.00989 • Published Jun 1, 2023 • 1
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 66
Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 17
Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

Papers - Image - Datasets - ImageNet

All you need is a good init

Paper • 1511.06422 • Published Nov 19, 2015 • 1
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models

Paper • 2404.14507 • Published Apr 22, 2024 • 23
Efficient Transformer Encoders for Mask2Former-style models

Paper • 2404.15244 • Published Apr 23, 2024 • 1
Deep Residual Learning for Image Recognition

Paper • 1512.03385 • Published Dec 10, 2015 • 16

peper-intention

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27
VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Paper • 2504.07615 • Published Apr 10, 2025 • 36

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2, 2025 • 87
Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 108
BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published Apr 16, 2025 • 86
FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16, 2025 • 29

Papers - Embeddings - Text - Sentence - Matryoshka

2D Matryoshka Sentence Embeddings

Paper • 2402.14776 • Published Feb 22, 2024 • 8
Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

2023 (and before) Papers of the Year

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Paper • 2306.00989 • Published Jun 1, 2023 • 1
Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 66
Scalable Diffusion Models with Transformers

Paper • 2212.09748 • Published Dec 19, 2022 • 17
Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 27

XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference

Paper • 2404.15420 • Published Apr 23, 2024 • 11
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22, 2024 • 126
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 262
How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22, 2024 • 45

Papers - Image - Datasets - ImageNet

All you need is a good init

Paper • 1511.06422 • Published Nov 19, 2015 • 1
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models

Paper • 2404.14507 • Published Apr 22, 2024 • 23
Efficient Transformer Encoders for Mask2Former-style models

Paper • 2404.15244 • Published Apr 23, 2024 • 1
Deep Residual Learning for Image Recognition

Paper • 1512.03385 • Published Dec 10, 2015 • 16

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs