Sections

Home
SiteMap

Personal tools

admin only ->>
Log in

You are here: Home » Research » Projects » Phase II » IM2.AP ( Audio processing )

Address

NCCR IM2
c/o Idiap
Centre du Parc
Rue Marconi 19
Case Postale 592
CH-1920 Martigny
Switzerland

tel. +41 27 721 77 11
fax +41 27 721 77 12

IM2.AP ( Audio processing )

Document Actions

Audio processing

IM2.AP

IP Head: John Dines (IDIAP
Partners: IDIAP, TIK/ETHZ, ICSI.

The IM2 domain is characterised by audio signals captured by lapel and headset microphones, microphone arrays and binaural recordings, when it is frequently non-trivial to identify which speaker or speakers are speaking at a particular time. Automatic Speech Recognition (ASR) is difficult in the meeting environment: beyond the issues arising from far-field microphones and multiple sound sources, speech in meetings is conversational, characterised by phenomena such as disfluencies and incomplete utterances. There are additional challenges arising from a high proportion of accented speech from non-native speakers and a multilingual orientation. Finally, we are concerned with the automatic extraction of metadata, such as speaker identity and ``punctuation'' information.

The major objectives for Phase II of IM2.AP are summarised as follows:

The continued advancement of fundamental techniques in audio processing and their application in applied ASR (especially meeting) environments
Encouraging greater intra- and inter- disciplinary cooperation in making advancements to the state-of-the-art
Investigation and rigourous 'proof-of-concept' of new paradigms in audio processing, in particular, for speech recognition and speaker recognition tasks
Evaluation on more realistic and complex tasks

Last modified 2006-02-03 15:36

News and Press

Klewel competes for the Oscar of Swiss Informatics!

2014-07-24

KeyLemon awarded with the CTI Start-up label

2014-06-16

More...