Speech recognition

Speech recognition draws on a wide range of skills to enable a computer to interpret the human voice and react to it correctly. The problems to be solved range from basic ones, such as detecting human voice activity, to more challenging ones, such as producing live closed captioning of television broadcasts or determining the topic of a telephone conversation.

Related technologies: Keyword detection, audio content indexing, closed-captioning, text/audio alignment, recognition engine, pronunciation verification, finite-state transducers

CRIM has taken part in competitions and technology evaluations in speech and speaker recognition since 1992. Our results in international evaluation campaigns such as ETAPE 2011 have established our position among the best research teams in our discipline. We actively pursue our research while keeping abreast of the latest developments in our scientific community; a notable example is our participation in the KALDI group, which brings together leading researchers in our field.
