the author's ugly face         
Research topics
Ph. D. students
Short bio

While everything on this page is true, it's also a bit outdated. But I keep it for the record, waiting to put something a bit more up to date. A brief update: I am currently head of the Linkmedia research team at Irisa and Inria Rennes. For a view of my current activities, you can check my recent publications on HAL or have a look at my up-to-date resume.

Research topics

My research activities focuses on describing, structuring, indexing, mining and linking multimedia contents. In this general framework, I dedicate particular attention to multimodal statistical modeling, speech recognition and spoken content processing, unsupervised multimedia data mining and automatic link authoring. My research activities builds upon various scientific foundations such as multimedia signal processing, indexing and information retrieval, natural language processing, multimedia data mining and statistical machine learning.

Recently, I have been conducting activities in:

  1. Multimodal video modeling: Integration the audio, visual and eventually text modality into statistical models
  2. Spoken content analysis: Exploiting spoken content for multimedia content processing
  3. Unsupervised multimedia mining: Discovering reoccurring motifs in multimedia streams in a totally unsupervised fashion
I am also quite active in benchmark initiatives with the organization of the French spoken technology evaluations ESTER 2003, 2005, 2009 and the ETAPE 2012 follow-up, the Affect task at MediaEval 2011, 2012 and 2013 and the Spoken Web Search Task at MediaEval 2012.

I participate in several initiatives regarding animation of the scientific communitee: president of the French-speaking Speech Communication Association (AFCP), co-founder of the Speech and Language in Multimedia (SLIM) special interest group of ISCA, co-founder and general chair of the 1st Joint IEEE and ISCA Workshop on Speech, Language and Audio in Multimedia (SLAM), program coordination chair of Interspeech 2013.

Check the texmix demo on navigating broadcast news archives for an idea of what it is I do in spoken content processing.

Recent participation in projects (contribution to the project)

I am currently involved in the following projects

Over the last few years, I have participated to the following projects
Participation in the activities of the MUSCLE European Network of Excellence.

Ph. D. students

Ongoing Ph. D. thesis I am supervising:

Past Ph. D. students:

More Ph. D. in which I have been or I am involved in (but not supervising in any way):

Software development

I actively participated in the development of the following free software toolkits:

These toolkits are the base (with a little help from HTK) of the IRENE broadcast news indexing platform , orginally developped for the French Ester rich transcription evaluation campaign in collaboration with François Yvon. Also check out my free ESTER resources page.

In the framework of the ASR/NLP work group I am coanimating, we have developed several pieces of code related to spoken document analysis. Among others, worth mentioning are:

These toolkits are not open-source freely distributed softwares but we are nevertheless willing to share. Feel free to contact me should you be interested in any of those.

Selected recent publications

Check out my complete list of publications.

Short bio

I obtained a master degree in Applied Mathematics at the Institut National des Sciences Appliquees (INSA Rouen) in 1995 and worked on speech synthesis at ELAN Informatique from 1996 to 1997. I received a Ph. D. in Signal and Image Processing (Toward speech modeling with Markov random fields) at the Ecole National Superieure des Telecommunications (ENST Paris) in 2000. After a one year post-doctoral stay at Irisa, I joined the Audio Visual Speech Technology group at IBM T. J. Watson research center from 2001 to 2002. Since 2002, I am a research fellow at the Centre National pour la Recherche Scientifique (CNRS), working at the Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA). I received the Habilitation à Diriger des Recherches (HDR) de l'Université de Rennes 1, spécialité Informatique, in 2009.

Guillaume Gravier, Irisa, Campus de Beaulieu, 35042 Rennes Cedex, France.
Tel : +33 2 99 84 72 39 / Fax : +33 2 99 84 71 71