Research And Development
R&D Overview
.avif)
Advancing Generative AI through Innovation
The R&D team at LightOn plays a pivotal role in advancing the field of generative AI through continuous innovation and development. Their expertise spans across creating and fine-tuning large language models (LLMs) that form the backbone of the Paradigm platform, a comprehensive AI solution designed for enterprise use. This platform simplifies the integration of generative AI into business workflows, offering both on-premise and cloud options to ensure flexibility and scalability for various business needs.
r&d publicationsRecent R&D Posts

The Retriever You Actually Need
Introducing LateOn and DenseOn, two Apache 2.0 retrievers: SOTA on BEIR, built to generalize.
CTA Title
Lorem Ipsum

Document Intelligence at First Sight
OriOn-Qwen-SR1: Fast Implicit Reasoning for Long Documents
CTA Title
Lorem Ipsum

Open-Source LightOnOCR-2 Just Outscored Claude, GPT-5, Qwen3, Mistral and Mathpix at Table Extraction
The most valuable information in enterprise documents doesn't live in paragraphs. It lives in tables
CTA Title
Lorem Ipsum

The Bloated Retriever Era Is Over
Reason-ModernColBERT tops BrowseComp-Plus, the most rigorous agentic search benchmark, across every metric, with 54× fewer parameters and fewer search calls.
CTA Title
Lorem Ipsum

NOVA: A Guide to Actually Measuring How Your Agent Works on Your Data
Numbers Over Vibes: Building a RAG Evaluation Framework That Actually Works
CTA Title
Lorem Ipsum

Day Zero of Multi-Vector Retrieval
Introducing ColBERT-Zero: late interaction model trained from scratch with PyLate
CTA Title
Lorem Ipsum

Introducing OriOn: the SOTA Long-Context Engine That Powers Agentic Search & Reason
Agentic AI starts with retrieval. It scales with long context.
CTA Title
Lorem Ipsum

LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling
The "Stronger Grep" for Modern Development While AI coding assistants like Claude Code have transformed how code is written, their ability to navigate large codebases efficiently is often limited by keyword-based search.
CTA Title
Lorem Ipsum

