Agent-ModernColBERT adds ~10% over Reason-ModernColBERT on BrowseComp-Plus, stays at 149M parameters, and brings GPT-5 + Qwen3-8B-level retrieval performance to a fully open stack.
Meet EDiTh: a synthetic enterprise corpus built to benchmark what public datasets can't touch: 1,004 PDFs in six languages and 36 ground-truth use cases.