R&D Benchmarks 2024-2026
10 open source contributions released in 18 months. Retrieval, OCR, encoders, long-document reasoning. All data, weights, and recipes published openly.
51.7M
HuggingFace Downloads
105 models + 69 datasets
3,871
HuggingFace Likes
across HuggingFace
2,371
GitHub Stars
38 repos
969k
PyPl Downloads
All time
9,347
Crates.io Downloads
7 crates
Retrieval Performance
BEIR vs. Decontaminated BEIR
BIER
Decontaminated BEIR


OCR Table Benchmark
Offenburg University & University of Mannheim


OCR Model Performance
Accuracy vs. Throughput


BrowseComp-plus
BM25
Qwen3-Embed-8B
8B
Reason-ModernColBERT
149M
Reason-ModernColBERT
(get_document)
Agent-ModernColBERT
149M
Agent-ModernColBERT
(get_document)


