
Clément Dalloux

PhD student in computer science in Rennes (France)
Text mining and information extraction in clinical data


Since December 2016 - PhD student @IRISA, LINKMEDIA team

The IRISA teams in Rennes are hosted by INRIA

My thesis is part of the BigClin project, which aims to develop a new representation of clinical records based on fine-grained semantic annotation, made possible by new NLP tools dedicated to French clinical narratives. The project also addresses distributed systems issues: scalability, management of uncertain data and privacy, stream processing at runtime...
My role in this context is to develop and test NLP methods and tools for processing unstructured clinical data in French. These methods have to be based on algorithms able to capture the semantics of the texts efficiently. Targeted tasks include indexing medical content, text mining, information extraction, detection of uncertainty and negation, etc. These NLP tasks will rely on precise semantic annotation.

May 14-18 2018 @CORIA-TALN-RJC 2018, Rennes, France

In this picture of the organizing committee, I am carefully hidden ;)

I took part in the 2018 CORIA-TALN-RJC conference in three different ways. Before the conference, I reviewed papers for RJC (papers by Master's and PhD students). Then, as a member of the organizing committee, I worked on the website, communication with attendees, social media, etc. During the conference, I guided and informed attendees and handled many other tasks. In addition, on Thursday, I presented a poster on negation detection in French and Brazilian Portuguese.

Nov-Dec 2017 @PUCPR, Curitiba, Brazil


As part of the BigClin/Figtem projects, I spent a month at the Pontifical Catholic University of Paraná in Curitiba, Brazil, where I worked on classification tasks using supervised learning algorithms on clinical data.

2015 - Master's degree internship @LIMICS


During my master's degree internship, I worked on the Accordys project under the supervision of a PhD student. The main task of this internship was to evaluate the performance of document indexing models such as TF-IDF, LSI, LDA, etc. at the document level, in order to compute the similarity between several cases of fetal malformation. To do so, we used a small corpus composed of fetoplacental examination reports written in free text.

2013-2015 - Master's degree in Linguistics @Université Bordeaux-Montaigne

I obtained my master’s degree in Linguistic Research and Applied Computations from Université Bordeaux-Montaigne. I took many courses in linguistic theory (discourse analysis, syntax, semantics, etc.) and applied linguistics (natural language processing, corpus linguistics, etc.).


Grabar, N., Claveau, V., and Dalloux, C. (2018). CAS: French Corpus with Clinical Cases. In Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis (Louhi@EMNLP 2018), Brussels, Belgium, October 31, 2018, pp. 122–128. Link

Dalloux, C., Grabar, N., Claveau, V., and Moro, C. (2018). Portée de la négation : détection par apprentissage supervisé en français et portugais brésilien. 25e Conférence sur le Traitement Automatique des Langues Naturelles (TALN).

Dalloux, C., Claveau, V., and Grabar, N. (2017). Détection de la négation : corpus français et apprentissage supervisé. In SIIM 2017 - Symposium sur l’Ingénierie de l’Information Médicale, Toulouse, France, pp. 1–8. Link

Dalloux, C. (2017). Détection de l’incertitude et de la négation : un état de l’art. In 19es REncontres jeunes Chercheurs en Informatique pour le TAL (RECITAL 2017), pp. 94–107. Link

