Manuel Faysse

Gradient descending my way through life.

prof_pic.jpg

PhD Candidate

Lead Research Scientist

Paris, France

Hey! I am Manu, a 2nd year PhD student working on applied NLP and ML Privacy research, but curious about (way too) many other things!

After pretraining at EPFL with a master’s in Robotics and Data Science, and an awesome research stint with the Computational Privacy Group at Imperial College London, I worked as a Research Scientist at Illuin Technology on various NLP use cases, notably deep multimodal models for Document ML and neural information retrieval.

I am now in my academic finetuning phase as a PhD student at CentraleSupélec (Université Paris Saclay), supervised by the distilled knowledge of Pierre Colombo. My research focuses on industrial applications of large language models, with published works in instruction-tuned model automatic evaluation, bilingual Large Language Model pretraining, as well as model memorization, and confidence estimation techniques for neural information retrieval.

My PhD is funded through the CIFRE French program in collaboration with Illuin Technology, where I currently hold a Lead Research Scientist position, and spend a minor share of my time advising and accompanying various R&D efforts in the LLM and Vision LLM space.

Don’t hesitate to contact me to discuss, or to inquire about potential collaborations or talks !

news

Feb 13, 2024 Gave an invited talk at DGFiP and DINUM
Jan 30, 2024 We release 🥐 CroissantLLM !
Nov 7, 2023 Paper accepted at EMNLP 2023 (Oral) !
Nov 4, 2023 Presented two papers from the MICS lab at AACL, on automatic metric evaluation and bias mitigation through contrastive training.
Jan 15, 2023 Started my PhD at MICS and Illuin Technology !

selected publications

2024

  1. croissant.png
    CroissantLLM: A Truly Bilingual French-English Language Model
    Manuel Faysse, Patrick Fernandes, Nuno M. Guerreiro, and 13 more authors
    2024

2023

  1. gavel.png
    Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications
    Manuel Faysse, Gautier Viaud, Céline Hudelot, and 1 more author
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023