| Feb 18, 2026 | Jina releases their embeddings v5 models, which claim the top spot on MMTEBv2, the default multilingual IR benchmark. The nano model is based on our EuroBERT model, which the main author states is the best small multilingual encoder backbone among all those they experimented with. |
| Jan 23, 2026 | Our work “Should We Still Pretrain Encoders with Masked Language Modeling?” is accepted at ICLR 2026! |
| Jan 2, 2026 | We release the "ViDoRe V3" dataset and paper. |
| Jul 7, 2025 | Our EuroBERT paper “EuroBERT: Scaling Multilingual Encoders for European Languages” is accepted at COLM! |
| Jul 3, 2025 | We release "Should We Still Pretrain Encoders with Masked Language Modeling?" |
| Jun 2, 2025 | We release the "Context is Gold" paper on Contextual Document Embeddings! |
| Apr 24, 2025 | Presented ColPali at the ICLR Poster Session |
| Apr 9, 2025 | Best Paper Award at IEEE SatML |
| Mar 10, 2025 | We release EuroBERT! |
| Mar 10, 2025 | Gave an invited talk at Amazon on ColPali and Retrieval in Vision Space. |
| Jan 23, 2025 | Our work “ColPali: Efficient Document Retrieval with Vision Language Models” is accepted at ICLR! |
| Jan 2, 2025 | We release the "ModernVBERT" model and paper. |
| Oct 15, 2024 | ColPali has been featured in the 2024 edition of the renowned State of AI Report and is listed in the Tech Radar as a top AI innovation to assess. |
| Sep 30, 2024 | The interview I gave to Jakub Zavrel of Zeta Alpha on the topic of Visual Document Retrieval has been released on YouTube and Spotify. |
| Sep 23, 2024 | Our work “Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism” is accepted at TMLR! |
| Aug 19, 2024 | Gave an invited talk at Unbabel on ColPali and Retrieval in Vision Space. |
| Jul 26, 2024 | Invited to the LlamaIndex webinar to talk about ColPali and Document Retrieval in Vision Space. |
| Jul 25, 2024 | The MIT Technology Review has published a featured article on our work on Copyright Traps in LLMs. |
| Jun 21, 2024 | We release “ColPali: Efficient Document Retrieval with Vision Language Models” 👀! |
| Jun 14, 2024 | Gave an invited talk at IBM Research Paris on the topic of CroissantLLM and Large Language Models. |
| May 31, 2024 | Invited talk at Meta Paris |
| May 14, 2024 | Gave an invited talk at Naver Labs Europe on the topic of LLM pretraining and CroissantLLM. |
| May 4, 2024 | Accepted paper at ICML 2024 |
| Feb 13, 2024 | Gave an invited talk at the French government DGFiP and DINUM about LLM pretraining and CroissantLLM. |
| Jan 30, 2024 | We release 🥐 CroissantLLM! |
| Nov 7, 2023 | Paper accepted at EMNLP 2023 (Oral)! |
| Nov 4, 2023 | Presented two papers from the MICS lab at AACL, on automatic metric evaluation and bias mitigation through contrastive training. |
| Jan 15, 2023 | I’ll be working with Pierre Colombo and Illuin Technology on various NLP topics with a particular focus on industrial applications of LLMs. Excited to start! |