Skip to content

Sections
Personal tools
You are here: Home » Publications

IM2 Publications

Document Actions


IM2 Publications > 2007



Click here to access the IM2 publications database.



Publications 2002-2006



To access the old publications, please click in the table below:


Phase Name Scientific papers
With peer review
Scientific papers
Without peer review
Books Reports
Phase 2
(2006)
DMA Click here

AP


MPR

MCA

HMI



ISD




BMI



Phase 1
(2002 - 2005)
 ACP

 DI
 DS



 IIR

 IP


 MDM


 MI

 SA

 SP


Scientific papers with peer review

 

IM2 Phase II

 

IM2.AP

 

  • Guillaume Lathoud, Julien Bourgeois, and Jürgen Freudenberger, "Sector-Based Detection for Hands-Free Speech Enhancement in Cars", in "EURASIP Journal on Applied Signal Processing, Special Issue on Advances in Multimicrophone Speech Processing", 2006.
  • Hemant Misra, Jithendra Vepa and Herve Bourlard, "Multi-stream ASR: An oracle perspective", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Herve Bourlard, "Posterior Based Keyword Spotting with A Priori Thresholds", to appear in Interspeech'06 (ICSLP), Pennsylvania, September 2006.
  • Guillermo Aradilla, Jithendra Vepa, and Herve Bourlard, "Using Posterior-Based Features in Template Matching for Speech Recognition", to appear in Interspeech'06 (ICSLP), Pennsylvania, September 2006.
  • John Dines, Jithendra Vepa, Thomas Hain, "The segmentation of multi-channel meeting recordings for automatic speech recognition", to appear in Interspeech'06 (ICSLP), 2006.
  • Johnny Marièthoz and Samy Bengio, "A Max Kernel For Text-Independent Speaker Verification Systems, Second Workshop on Multimodal User Authentication", MMUA, 2006.
  • "Darren Moore, John Dines, Mathew Magimai Doss, Jithendra Vepa, Octavian Cheng and Thomas Hain, ""Juicer, A Weighted Finite State Transducer speech decoder"", in proceedings of MLMI, May 2006.
  • A Weighted Finite State Transducer speech decoder"", in proceedings of MLMI, May 2006."
  • Petr Fousek and Hynek Hymansky, "Towards ASR based on hierarchical posterior-based keyword recognition", to appear in ICASSP 2006, Toulouse, France.
  • Hamed Ketabdar, Jithendra Vepa, Samy Bengio and Herve Boulard, "Using more informative posterior probabilities for speech recognition", in proceedings of ICASSP 2006, Toulouse, France.
  • G. Lathoud, M. Magimai.-Doss and Gerve Bourlard, "Threshold selection for unsupervised detection with an application to microphone arrays", in proceedings of ICASSP 2006, Toulouse, France.
  • Guillermo Aradilla, Jithendra Vepa and Herve Bourlad, "Using Pitch as Prior Knowledge in Template-Based Speech Recognition", in proceedings of ICASSP 2006, Toulouse France.
  • Guillaume Lathoud, Mathew Magimai.-Doss, Bertrand Mesot and Hervé Bourlard, "Unsupervised Spectral Subtraction for Noise-Robust ASR", published in Proceedings of ASRU’06, December 2005.
  • X. Anguera, C. Wooters, and J. Hernando, "Friends and Enemies: A Novel Initialization for Speaker Diarization", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • X. Anguera, C. Wooters, and J. Pardo, "Robust Speaker Diarization for Meetings: ICSI RT06s evaluation system", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • Kofi Boakye and Andreas Stolcke, "Improved Speech Activity Detection Using Cross-Channel Features for Recognition of Multiparty Meetings", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • Ozgur Cetin and Elizabeth Shriberg, "Analysis of Overlaps in Meetings by Dialog Factors, Hot Spots, Speakers, and Collection Site: Insights for Automatic Speech Recognition", to appear in Interspeech'06 ICSLP), Pittsburgh, U.S.A., 2006.
  • Gallardo-Antolin, X. Anguera, and C. Wooters, "Multi-Stream Speaker Diarization Systems for the Meetings Domain", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • Andrew O. Hatch, Sachin Kajarekar, Andreas Stolcke, "Within-class Covariance Normalization for SVM-based Speaker Recognition", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • J. Pardo, X. Anguera, C. Wooters, "Speaker Diarization for Multiple Distant Microphone Meetings: Mixing Acoustic Features And Inter-Channel Time Differences", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • S. Stenchikova, D. Hakkani-Tur, G. Tur., "QASR: Question Answering Using Semantic Roles for Speech Interface", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • M. Zimmerman, D. Hakkani-Tur, J. Fung, N. Mirghafori, L. Gottlieb, E. Shriberg, Y.Liu, "The ICSI+ Muilti-lingual Sentence Segmentation System", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • Kolar J.,Shriberg E., Liu Y., "On Speaker-Specific Prosodic Models for Automatic Dialog Act Segmentation of Multi-Party Meetings", to appear in Interspeech'06 (ICSLP), Pittsburgh, U.S.A., 2006.
  • N. Mirghafori and C. Wooters, "Nuts and Flakes: A Study of Data Characteristics in Speaker Diarization", proceedings of ICASSP, Toulouse, France
  • O. Cetin and E. Shriberg, "Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap", proceedings of ICASSP, Toulouse, France
  • M. Zimmermann, A. Stolcke, E. Shriberg, "Joint Segmentation and Classification of Dialog Acts in Multi-party Meetings", proceedings of ICASSP, Toulouse, France
  • J. Kolar, E. Shriberg, and Y. Liu, "Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings", proceedings of Ninth International Conference on Text, Speech and Dialogue (TSD 2006), Brno, Czech Republic
  • Stolcke, F. Grezl, M.-Y. Hwang, X. Lei, N. Morgan, and D. Vergyri, "Cross-domain and Cross-language Portability of Acoustic Features Estimated by Multilayer Perceptrons", proceedings of IEEE ICASSP, Toulouse, France
  • X. Anguera, C. Wooters and J. Hernando, "Purity Algorithms for Speaker Diarization of Meetings Data", proceedings of ICASSP, Toulouse, France
  • O. Cetin and E. Shriberg, "Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site". 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Washington DC, USA
  • D. Gelbart, W. Hemmert and M. Holmberg, "Automatic Speech Recognition with an Adaptation Model Motivated by Auditory Processing", IEEE Transactions on Speech and Audio Processing, January 2006.
  • Q. Zhu, B. Chen, F. Grezl and N. Morgan, "Improved MLP Structures for Data-Driven Feature Extraction for ASR", proceedings of Eurospeech 2005
  • Pelaez-Moreno, Q. Zhu, B. Chen and N. Morgan, "Automatic Data Selection for MLP-based Feature Extraction for ASR", proceedings of Eurospeech 2005
  • N. Morgan, Q. Zhu, A. Stolcke, K. Sonmez, S. Sivadas, T. Shinozaki, M. Ostendorf, P. Jain, H. Hermansky, D. Ellis, G. Doddington, B. Chen, O. Cetin, H. Bourlard and M. Athineos, "Pushing the Envelope – Aside", IEEE Signal Processing Magazine, Vol. 22 No. 5, pp. 81-88
  • R. Beutler, T. Kaufmann, and B. Pfister, "Integrating a non-probabilistic grammar into large vocabulary continuous speech recognition", in Proceedings of the IEEE ASRU 2005 Workshop, pages 104-109, San Juan (Puerto Rico), November 2005.
  • H. Romsdorfer and B. Pfister, "Character stream parsing of mixed-lingual text", in ISCA Tutorial and Research Workshop on Multilingual Speech and Language Processing (MultiLing 2006), Stellenbosch (South Africa), April 2006.
  • Vivek Tyagi, Hervé Bourlard, and Christian Wellekens, "On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR", in "Speech Communication", 2006.
  • Johnny Mariéthoz and Samy Bengio, "A Unified Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification", in "IEEE Signal Processing Letters, Volume 12", 2005.
  • Fabio Valente, "Infinite Models for Speaker Clustering", in "International Conference on Spoken Language Processing", 2006.
  • Fabio Valente and Hynek Hermansky, "Discriminant linear processing of time-frequency plane", in "International Conference on Spoken Language Processing", 2006.
  • Petr Motlicek, Hynek Hermansky, Harinath Garudadri, and Naveen Srinivasamurthy, "Speech Coding based on Spectral Dynamics", in "Ninth International Conference on TEXT, SPEECH and DIALOGUE (TSD)", 2006.
  • Hamed Ketabdar, Hervé Bourlard, and Samy Bengio, "Hierarchical Multi-Stream Posterior Based Speech Recognition System", in "Proceedings MLMI workshop", 2005.
  • J.-F. Paiement, D. Eck, and S. Bengio, "A Probabilistic Model for Chord Progressions", in "Proceedings of the Sixth International Conference on Music Information Retrieval (ISMIR)", 2005.

 

IM2.AP, IM2.MPR

 

  • D. Zhang, D. Gatica-Perez, S. Bengio, and I. McCowan. "Modeling individual and group actions in meetings with layered HMMs", IEEE Trans. on Multimedia, Jun. 2006.

 

IM2.AP, IM2.VP

 

  • M. Hari-Krishna, D. Gatica-Perez, and I. McCowan, "Speech Enhancement and Recognition in Meetings with an Audio-Visual Sensor Array", submitted to IEEE Trans. on Speech and Audio Processing, Apr. 2006
  • D. Gatica-Perez, G. Lathoud, J.-M. Odobez, and I. McCowan, "Audio-visual Probabilistic Tracking of Multiple Speakers in Meetings", IEEE Trans. on Audio, Speech, and Language Processing, accepted for publication, Mar. 2006.

 

IM2.BMI

 

  • Blankertz, B., Müller, K.-R., Krusienski, D., Schalk, G., Wolpaw, J.R., Schlögl, A., Pfurtscheller, G., Millán, J. del R., Schröder, M., and Birbaumer, N. (2006), "The BCI Competition III: Validating Alternative Approaches to Actual BCI Problems", IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2):153–159. Special Issue on "Brain-Computer Interfaces".
  • Buttfield, A., Ferrez, P.W., and Millán, J. del R. (2006), "Towards a Robust BCI: Error Recognition and Online Learning", IEEE Trans. on Neural Systems and Rehabilitation Engineering, 14(2):164–168. Special Issue on "Brain-Computer Interfaces".
  • Gonzalez Andino, S.L., R. Grave de Peralta, R., Thut, G., Millán, J. del R., Morier, P., and Landis, T. (2006), "Very High Frequency Oscillations (VHFO) as a Predictor of Movement Intentions", NeuroImage, to appear.
  • Gonzalez Andino, S.L., R. Grave de Peralta, R., Pegna, A., Khateb, A., Thut, G., and Landis, T. (2006), "A Glimpse into Your Vision", Human Brain Mapping, to appear.
  • Buttfield, A., Ferrez, P.W., and Millán, J. del R. (2006), "Online Classifier Adaptation in High Frequency EEG", 3rd Int. Brain-Computer Interface Workshop. Graz, Austria.
  • Ferrez, P.W., Galán Moles, F., Buttfield, A., Gonzalez Andino, S.L., Grave de Peralta, R., and Millán, J. del R. (2006), "High Frequency Bands and Estimated Local Field Potentials to Improve Single-Trial Classification of Electroencephalographic Signals", 3rd Int. Brain-Computer Interface Workshop. Graz, Austria.
  • Grave de Peralta, R., Millán, J. del R., Morier, P., and Gonzalez Andino, S.L. (2006), "Accurate Hand Trajectory Prediction by Real and Synthetic EEG", 3rd Int. Brain-Computer Interface Workshop. Graz, Austria.
  • Grave de Peralta, R., Millán, J. del R., Morier, P., and Gonzalez Andino, S.L. (2006), "Theoretical and Experimental Basis for the Development of Direct Noninvasive BCI", 3rd Int. Brain-Computer Interface Workshop. Graz, Austria.
  • Lew, E., Nuttin, M., Ferrez, P.W., Degeest, A., Buttfield, A., Vanacker, G., and Millán, J. del R. (2006), "Non-Invasive Brain Computer Interface for Mental Control of a Simulated Wheelchair", 3rd Int. Brain-Computer Interface Workshop. Graz, Austria.
  • Gonzalez Andino, S.L., Grave de Peralta R., Khateb, A., Pegna, A., Thut, G., and Landis, T. (2006), "A Glimpse into your vision", Poster at 11th Int. Conf. on Functional Mapping of the Human Brain. Florence, Italy.
  • Gonzalez Andino, S.L., Grave de Peralta R., Khateb, A., Pegna, A., and Landis, T. (2006), "Quantifying and Imaging Blindsight", Poster at 11th Int. Conf. on Functional Mapping of the Human Brain. Florence, Italy.
  • Gonzalez Andino, S.L., Grave de Peralta R., Millán, J. del R., and Landis, T. (2006), "Local field potentials estimation from the EEG for the study of neural oscillations and the development of direct non-invasive brain computer interfaces (BCI)", Poster at 11th Int. Conf. on Functional Mapping of the Human Brain. Florence, Italy.
  • Gonzalez Andino, S.L., Grave de Peralta R., Thut, G., Millán, J. del R., Moier, P., and Landis, T. (2006), "Very high frequency oscillations (VHFO) as a predictor of movement intentions", Poster at 11th Int. Conf. on Functional Mapping of the Human Brain. Florence, Italy.
  • Grave de Peralta, R., Hauk, O., and Gonzalez Andino, S.L. (2006), "ANA: The inverse problem, and the Zero Dipole Localization Error", Poster at 11th Int. Conf. on Functional Mapping of the Human Brain. Florence, Italy.
  • Millán, J. del R., Ferrez, P., Gonzalez, S.L., and Grave de Peralta, R. (2006), "Non-invasive accurate prediction of arm movements: A harmless approach to neuroprosthetic devices"; Poster at 11th Int. Conf. on Functional Mapping of the Human Brain. Florence, Italy.
  • Sébastien Marcel and José del R. Millán, "Person Authentication using Brainwaves (EEG) and Maximum A Posteriori Model Adaptation", in "IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Special Issue on Biometrics", 2007.
  • Silvia Chiappa and David Barber, "EEG Classification using Generative Independent Component Analysis", in "Neurocomputing", 2006.

 

IM2.DMA

 

  • Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain McCowan, Wilfried Post, Dennis Reidsma, and Pierre Wellner, "The AMI Meeting Corpus: a Pre-Announcement", in "Machine Learning for Multimodal Interaction: Second International Workshop, MLMI'2005", 2005.
  • Behera, D. Lalanne, R. Ingold (2005), "Influence of Fusion Strategies on Feature-based Identification of Low-resolution Documents", in proc. of ACM Symposium on Document Engineering (DocEng05), Bristol (United Kingdom), November 2-4 2005 , pp. 20-22.
  • Behera, D. Lalanne, R. Ingold (2005), "Enhancement of Layout-based Identification of Low-resolution Documents using Geometrical Color Distribution", in Proc. of 8th International Conference on Document Analysis and Recognition (ICDAR'05), Seoul (Korea), August 29 - September 01 2005, pp. 468-472
  • J.-L. Bloechle, M. Rigamonti, K. Hadjar, D. Lalanne, R. Ingold (2006), "XCDF: A Canonical and Structured Document Format", in Horst Bunke, A. Lawrence Spitz (eds.), LNCS: "7th International Workshop, DAS 2006, Nelson, New Zealand, February 13-15, 2006, Proceedings", Springer-Verlag, vol. 3872, ISBN:3-540-32140-3, 2006, pp. 141-152.
  • J.-L. Bloechle, M. Rigamonti, D. Lalanne and R. Ingold (2006), "XCDF : Un format canonique pour la représentation de documents", CIFED 06, Fribourg, Switzerland, September 18-22, 2006.
  • D. Mekhaldi, D. Lalanne, R. Ingold (2005), "From Searching to Browsing through Multimodal Documents Linking", in Proc. of 8th International Conference on Document Analysis and Recognition (ICDAR'05), Seoul (Korea), August 29 - September 1 2005, pp. 924-928.
  • Popescu-Belis A. (2006), "Résolution des références aux documents dans un corpus de dialogues humains", TALN 2006 (Treizième Conférence sur le Traitement Automatique des Langues Naturelles), Leuven, Belgium, vol.1, p.256-265.
  • M. Rigamonti, J.-L. Bloechle, K. Hadjar, D. Lalanne, R. Ingold (2005), "Towards a Canonical and Structured Representation of PDF Documents through Reverse Engineering", in Proc. of 8th International Conference on Document Analysis and Recognition (ICDAR'05), Seoul (Korea), August 29 - September 01 2005, pp. 1050-1054.

 

IM2.DMA, IM2.HMI

 

  • Evéquoz, F., Rigamonti, M., Lalanne, D. and Ingold, R., "Document inquisitor : un système de validation des structures et d’élicitation de modèles de documents", CIDE 06, Fribourg, Switzerland, September 18-22, 2006.
  • M. Rigamonti, D. Lalanne, F. Evéquoz, R. Ingold (2006), "Browsing Multimedia Archives Through Intra- and Multimodal Cross-Documents Links", in Steve Renals, Samy Bengio (eds.), LNCS: "Machine Learning for Multimodal Interaction: Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers", Springer-Verlag, vol. 3869, ISBN:3-540-32549-2, 2006, pp. 114-125.

 

IM2.DMA, IM2.HMI, IM2.MCA

 

  • Popescu-Belis A. & Lalanne D. (2006), "Detection and Resolution of References to Meeting Documents", in Renals S. & Bengio S., eds., Machine Learning for Multimodal Interaction II, LNCS 3869, Springer-Verlag, Berlin/Heidelberg, p.64-75.

 

IM2.HMI

 

  • Ailomaa, M., Lisowska, A., Melichar, M., Armstrong, S., and Rajman, M., "Archivus: A multimodal system for multimedia meeting browsing and retrieval", ACL/Coling 2006. Sydney, Australia. July 17-21, 2006.
  • Denis Lalanne, Rolf Ingold, Didier von Rotz, Ardhendu Behera, Dalila Mekhaldi, Andrei Popescu-Belis, "Using static documents as structured and thematic interfaces to multimedia meeting archives", in Bourlard H. & Bengio S., eds. (2004), Multimodal Interaction and Related Machine Learning Algorithms, LNCS, Springer-Verlag, Berlin, pp. 87-100.
  • Lisowska, A and Armstrong S., "Multimodal Input for Meeting Browsing and Retrieval Interfaces: Preliminary findings and the role of comfort", MLMI'06 Bethesda, Maryland. May 1-3, 2006.
  • Lisowska A. and Betrancourt M., "Multimodal Input for Meeting Browsing and Retrieval Interfaces: Preliminary Findings", ERGO-IA 2006, Bidart-Biarritz, France. Oct 11-13, 2006.
  • Lisowska A., "Archivus: a multimodal system for browsing and retrieving multimedia meetings", MLMI'06, Bethesda, Maryland. May 1-3, 2006. Demo presentation.
  • D. Mekhaldi, D. Lalanne, R. Ingold (2005), "From Searching to Browsing through Multimodal Documents Linking", in Proc. of 8th International Conference on Document Analysis and Recognition (ICDAR'05), Seoul (Korea), August 29 - September 1 2005, pp. 924-928.
  • Rajman, M., Ailomaa M., Lisowska, A., Melichar M., and Armstrong S., "Extending the Wizard of Oz Technique for Multimodal Language-enabled Systems", LREC 2006, Genoa, Italy. May 24-26, 2006.
  • Popescu-Belis A., Estrella P., King M. & Underwood N, "A model for context-based evaluation of language processing systems and its application to MT evaluation", LREC 2006, Genoa, Italy, p. 691-696.
  • Rigamonti, M., Evéquoz, F., Lalanne, D., Ingold, R., "Measuring the usefulness of static documents for multimedia meeting browsing", MLMI'06, Washington DC, USA (2006). Poster presentation.

 

IM2.MCA

 

  • D. Grangier, F. Monay, and S. Bengio, "A Discriminative Approach for the Retrieval of Images from Text Queries", in "European Conference on Machine Learning (ECML)", 2006.
  • D. Grangier and S. Bengio, "A Neural Network to Retrieve Images from Text Queries", in "International Conference on Artificial Neural Networks (ICANN)", 2006.
  • D. Grangier, F. Monay, and S. Bengio, "Learning to Retrieve Images from Text Queries with a Discriminative Model", in "International Workshop on Adaptive Multimedia Retrieval (AMR)", 2006.
  • Oren Glickman, Ido Dagan, Mikaela Keller, Samy Bengio, and Walter Daelemans, "Investigating Lexical Substitution Scoring for Subtitle Generation", in "Proceedings of the 10th Conference on Computational Natural Language Learning (CoNLL).", 2006.
  • D. Grangier and S. Bengio, "Exploiting Hyperlinks to Learn a Retrieval Model", in "NIPS Workshop on Learning to Rank", 2005.
  • E Bruno and N. Moenne-Loccoz and S. Marchand-Maillet, "Asymmetric Learning and Dissimilarity Spaces for Content-based Retrieval", in International Conference on Image and Video Retrieval (CIVR 2006), Tempe, AZ, 2006
  • Maria Georgescul, Alexander Clark and Susan Armstrong (2006), "Word Distributions for Thematic Segmentation in a Support Vector Machine Approaches", The Tenth Conference on Computational Natural Language Learning (CoNLL-X), New York, USA, 2006.
  • Maria Georgescul, Alexander Clark and Susan Armstrong (2006), "An Analysis of Quantitative Aspects in the Evaluation of Thematic Segmentation Algorithms", The 7th SIGdial Workshop on Discourse and Dialogue, Sydney, 2006.
  • B. Janvier and E. Bruno and S. Marchand-Maillet and T. Pun, "Performance evaluation of a contextual news story segmentation algorithm", in Proceedings of SPIE Electronic Imaging 2006, Multimedia Content Analysis, Management, and Retrieval 2006 (EI122), San Jose, CA, 2006.
  • Serhiy Kosinov and Stephane Marchand-Maillet and Thierry Pun, "Countering the false positive projection effect in nonlinear asymmetric classification", in The IEEE Symposium on Signal Processing and Information Technology (ISSPIT\'05), Athens, Greece, December, 2005.
  • S. Kosinov and S. Marchand-Maillet, "Visual object categorization with indefinite kernels in discriminant analysis", in Proceedings of SPIE Electronic Imaging 2006, Multimedia Content Analysis, Management, and Retrieval 2006 (EI122), San Jose, CA, USA, 2006.
  • N. Moenne-Loccoz and E Bruno and S. Marchand-Maillet, "Local Feature Trajectories for Efficient Event-Based Indexing of Video Sequences", in International Conference on Image and Video Retrieval (CIVR 2006), Tempe, AZ, 2006
  • Popescu-Belis A. & Georgescul M. (2006), "TQB: Accessing Multimedia Data Using a Transcript-based Query and Browsing Interface", LREC 2006 (Fourth International Conference on Language Resources and Evaluation), Genoa, Italy, p.1560-1565.
  • Alessandro Vinciarelli and Jean-Marc Odobez, "Application of Information Retrieval Technologies to Presentation Slides", IEEE Transactions on Multimedia, to appear in october 2006.
  • Alessandro Vinciarelli. "Indexation de documents manuscrits", (invited paper). Colloque International Francophone sur l'Ecrit et le Document, Fribourg (Switzerland), 2006.
  • Alessandro Vinciarelli, "Sociometry Based Multiparty Audio Recordings Summarization", proceedings of 18th International Conference on Pattern Recognition, Hong Kong, August 2006.
  • Alessandro Vinciarelli, "Sociometry Based Multiparty Audio Recording Segmentation", proceedings of IEEE International Conference on Multimedia and Expo, Toronto, July 2006.

 

IM2.MPR

 

  • Norman Poh, Alvin Martin, and Samy Bengio, "Performance Generalization in Biometric Authentication Using Joint User-Specific and Sample Bootstraps", in "IEEE Pattern Analysis and Machine intelligence", 2007.
  • Christos Dimitrakakis, "Nearly optimal exploration-exploitation decision thresholds", in "Int. Conf. on Artificial Neural Networks (ICANN)", 2006
  • Mikaela Keller, Samy Bengio, and Siew Yeung Wong, "Benchmarking Non-Parametric Statistical Tests", in "Advances in Neural Information Processing Systems, NIPS 18. MIT Press", 2005.
  • P. Besson, M. Kunt, T. Butz and J. Thiran, "A multimodal approach to extract optimized audio features for speaker detection", 13th European Signal Processing Conference (EUSIPCO2005), Antalya, Turkey, September 2005
  • Arsic, R. Vilagut and J. Thiran, "Automatic extraction of geometric lip features with application to multi-modal speaker identification", Proc. of ICME 2006, Toronto, Canada, July 2006
  • M. Gurban and J. Thiran, "Multimodal Speaker Localization in a Probabilistic Framework", Proceedings of EUSIPCO, Florence, Italy, 2006
  • Arsic and J. Thiran, "Mutual information eigenlips for audio-visual speech recognition", 14th European Signal Processing Conference (EUSIPCO), Florence, Italy, 2006
  • D. Gatica-Perez, "Analyzing Group Interactions in Conversations: a Review", in Proc. IEEE Int. Conf. on Multisensor Fusion and Integration for Intelligent Systems (MFI), invited paper, Heidelberg, Sep. 2006.
  • D. Zhang, D. Gatica-Perez, D. Roy, and S. Bengio, "Modeling Interaction from E-Mail Communication", in Proc. IEEE Int. Conf. on Multimedia (ICME), Toronto, Jul. 2006. (collaboration with MIT Media Lab, USA)
  • D. Zhang, D. Gatica-Perez, S. Bengio, and D. Roy, "Learning Influence among Interacting Markov Chains", in Proc. Neural Information Processing Systems (NIPS), Vancouver, Dec. 2005. (collaboration with MIT Media Lab, USA)
  • D. Gatica-Perez, D. Zhang, and S. Bengio, "Extracting Information from Multimedia Meeting Collections", in Proc. ACM Int. Conf. on Multimedia, Workshop on Multimedia Information Retrieval (ACM MM MIR), invited paper, Singapore, Nov. 2005.
  • B. Jensen, N. Tomatis, Laetitia Mayor, A. Drygajlo, R. Siegwart, "Robots Meet Humans - Interaction in Public Spaces", IEEE Transactions on Industrial Electronics, vol. 52, No. 6, Dec. 2005, pp. 1530-1546.
  • Joaquin Gonzalez-Rodriguez, Andrzej Drygajlo, Daniel Ramos-Castro, Marta Garcia-Gomar, Javier Ortega-Garcia, "Robust Estimation, Interpretation and Assessment of Likelihood Ratios in Forensic Speaker Recognition", invited paper, Computer Speech and Language, vol. 20, 2006, pp. 331-355.
  • Alexander, D. Dessimoz, F. Botti, and A. Drygajlo, "Aural and Automatic Forensic Speaker Recognition in Mismatched Conditions", The International Journal of Speech, Language and the Law, vol. 12, Dec. 2005, pp. 214-234.
  • J. Richiardi, A. Drygajlo, P. Prodanov, "Confidence and Reliability Measures in Speaker Verification", invited paper, Journal of the Franklin Institute, to be published, 2006.
  • P. Prodanov, A. Drygajlo, J. Richiardi, A. Alexander, "Grounding in Multimodal Service Robot Conversational System Using Graphical Models", Intelligent Service Robotics, Springer, to be published, 2006.
  • L. Peotta and P. Vandergheynst, "Matching Pursuit with Block Incoherent Dictionaries", IEEE Transactions on Signal Processing, In Press
  • Tosic, P. Frossard and P. Vandergheynst, "Progressive coding of 3D objects based on overcomplete decompositions", IEEE Transactions on Circuits and Systems for Video Technology, In Press
  • M. Arcienega, A. Alexander, P. Zimmermann, A. Drygajlo, "A Bayesian Network Approach Combining Pitch and Spectral Envelope Features to Reduce Channel Mismatch in Speaker Verification and Forensic Speaker Recognition", InterSpeech 2005, Lisbon, Portugal, Sept. 4-8, 2005.
  • K. Kryszczuk, A. Drygajlo, "On Face Image Quality Measures", 2nd Workshop on Multimodal User Authentication (MMUA 06), Toulouse, France, May 11-12, 2006.
  • J. Richiardi, P. Prodanov, A. Drygajlo, "Speaker Verification with Confidence and Reliability Measures", IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP 2006), Toulouse, May 14-19, 2006.
  • D. Dessimoz, J. Richiardi, C. Champod, A. Drygajlo, "Multimodal Biometrics for Identity Documents", 4th European Academy of Forensic Science Conference (EAFS 2006), Helsinki, June 13-16, 2006.
  • K. Kryszczuk, A. Drygajlo, "Singular Point Detection in Fingerprints using Quadrant Change Information", 18th Int. Conf. on Pattern Recognition (ICPR 2006), Hong Kong, 20-24 August 2006.
  • K. Kryszczuk, A. Drygajlo, "On Combining Evidence for Reliability Estimation in Face Verification", 14th European Signal Processing Conference (EUSIPCO 2006), Florence, Italy, Sept. 4-8, 2006.
  • G. Monaci and P. Vandergheynst, "Audiovisual Gestalts", CVPR Workshop on Perceptual Organization in Computer Vision, June 2006
  • P. Jost, S. Lesage, P. Vandergheynst and R. Gribonval, "MOTIF: an efficient algorithm for learning translation invariant dictionaries", Proceedings IEEE ICASSP06, May 2006
  • L. Granai and P. Vandergheynst, "Sparse Approximation by Linear Programming using an L1 Data-Fidelity Term", Proc. of Workshop on Signal Processing with Adaptative Sparse Structured Representations, November 2005
  • P. Jost, S. Lesage, P. Vandergheynst and R. Gribonval, "Learning redundant dictionaries with translation invariance property: the MoTIF algorithm", Proc. of Workshop on Signal Processing with Adaptative Sparse Structured Representations, November 2005
  • S. Marcel, J. Mariéthoz, Y. Rodriguez and F. Cardinaux, "Bi-Modal Face and Speech Authentication: a BioLogin: Demonstration System", Workshop on Multimodal User Authentication, MMUA. 2006
  • Y. Rytsar, S. Voloshynovskiy, O. Koval, F. Deguillaume, E. Topak, S. Startchik and T. Pun, "Tangible interactive system for document browsing and visualisation of multimedia data", in Proceedings of SPIE Photonics West, Electronic Imaging 2006, Multimedia Content Analysis, Management, and Retrieval 2006 (EI122), San Jose, USA, January 15-19 2006.
  • O. Koval, S. Voloshynovskiy and T. Pun, "Laplacian channel state estimation for state dependent channels", in Proceedings of 27-th Symposium on information theory in the Benelux, Noordwijk, The Netherlands, June 8-9, 2006, 2006.
  • R. Villán, S. Voloshynovskiy, O. Koval, J.E. Vila-Forcén, E. Topak, F. Deguillaume, Y. Rytsar and T. Pun, "Text Data-Hiding for Digital and Printed Documents: Theoretical and Practical Considerations", in Proceedings of SPIE-IS&T Electronic Imaging 2006, Security, Steganography, and Watermarking of Multimedia Contents VIII, San Jose, USA, January 15-19 2006.
  • Humm, J. Hennebert and R. Ingold, "Gaussian Mixture Models for CHASM Signature Verification", 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), Washington DC (USA), May 1-3 2006, accepted for publication.
  • Humm, J. Hennebert and R. Ingold, "Scenario and Survey of Combined Handwriting and Speech Modalities for User Authentication", 6th International Conference on Recent Advances in Soft Computing (RASC), Canterbury (UK), July 10-12 2006, accepted for publication.
  • Wahl, J. Hennebert, A. Humm and R. Ingold, "Generation and Evaluation of Brute-Force Signature Forgeries", International Workshop on Multimedia Content Representation, Classification and Security (IWMRCS), Istanbul, (TR), September 11-13 2006, accepted for publication.
  • J. Hennebert, A. Humm et R. Ingold, "Vérification d'Identité par Ecriture et Parole Combinées", Conférence Internationale Francophone sur l'Ecrit et le Document (CIFED), Fribourg, September 18-21 2006, accepted for publication.
  • J. Mariéthoz and S. Bengio, "A max kernel for text-independent speaker verification systems", in Second Workshop on Multimodal User Authentication, MMUA, 2006.
  • N. Poh and S. Bengio, "Chimeric users to construct fusion classifiers in biometric authentication tasks: An investigation", in IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2006.
  • N. Poh, S. Bengio, and A. Ross, "Revisiting Doddington's zoo: A systematic method to assess user-dependent variabilities", in Second Workshop on Multimodal User Authentication, MMUA, 2006.
  • R. Gribonval and P. Vandergheynst, "On the exponential convergence of Matching Pursuits in quasi-incoherent dictionaries", IEEE Transactions on Information Theory, Vol. 52, No 1, pp. 255-261, January 2006
  • P. Jost, P. Vandergheynst and P. Frossard, "Tree-Based Pursuit: Algorithm and Properties", IEEE Transactions on Signal Processing, In Press
  • Monaci, O. Divorra Escoda and P. Vandergheynst, "Analysis of Multimodal Sequences Using Geometric Video Representations", Signal Processing, In Press
  • J. Richiardi, A. Drygajlo, A. Palacios-Venin, R. Ludvig, O. Genton, L. Houmgny, "A Distributed Multimodal Biometric Authentication Framework", 3rd COST 275 Workshop Biometrics on the Internet, Hatfield, UK, Oct. 27-28, 2005.
  • Dumas, C. Pugin, J. Hennebert, D. Petrovska-Delacrétaz, A. Humm, F. Evéquoz, R. Ingold and D. Von Rotz, "MyIdea - Multimodal Biometrics Database, Description of Acquisition Protocols", Third COST 275 Workshop (COST 275), Hatfield (UK), October 27-28 2005, pp. 59-62.
  • Yves Grandvalet and Johnny Mariethoz and Samy Bengio, "A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification," Advances in Neural Information Processing Systems, NIPS 15, ftp://ftp.idiap.ch/pub/papers/2005/grandvalet-nips-2005.pdf
  • N. Poh and S. Bengio, "Database, protocol and tools for evaluating score-level fusion algorithms in biometric authentication", Pattern Recognition, 39(2):223-233, 2006.

 

 

IM2.MPR, IM2.VP

 

  • M. Liwicki, A. Schlapbach, H. Bunke, S. Bengio, J. Mariéthoz, and J. Richiardi, "Writer identification for smart meeting room systems", in H. Bunke and A. L. Spitz, editors, Document Analysis Systems VII: 7th International Workshop, DAS, Lecture Notes in Computer Science, volume 3872, pages 186-195. Springer-Verlag, 2006.

 

IM2.VP

 

  • Conrad Sanderson, Samy Bengio, and Yongsheng Gao, "On transforming statistical models for non-frontal face verification", in "Pattern Recognition (in press)", 2005.
  • Francesco Camastra, Marco Spinetti, and Alessandro Vinciarelli, "Cursive Character Challenge: a New Database for Machine Learning and Pattern Recognition", in "Proceedings of International Conference on Pattern Recognition (ICPR)", 2006.
  • Florent Monay, Pedro Quelhas, Jean-Marc Odobez, and Daniel Gatica-Perez, "Integrating co-occurrence and spatial contexts on patch-based scene segmentation", in "Beyond Patches Workshop, in conjunction with CVPR", 2006.
  • Andrzej Pronobis and Barbara Caputo, "The More you Learn, the Less you Store: Memory-Controlled Incremental SVM", in "Proceedings of International Cognitive Vision Workshop (ICVW) 2006)", 2006.
  • Tatiana Tommasi, Elisabetta La Torre, and Barbara Caputo, "Kernel Methods for Melanoma Recognition", in "Proceedings of Workshop on Computer Vision Approaches to Medical Image Analysis (CVAMIA) 2006)", 2006.
  • Agnès Just, Yann Rodriguez, and Sébastien Marcel, "Hand Posture Classification and Recognition using the Modified Census Transform", in "IEEE Int. Conf. on Automatic Face and Gesture Recognition (AFGR)", 2006.
  • Florent Monay, Pedro Quelhas, Daniel Gatica-Perez, and Jean-Marc Odobez, "Constructing visual models with a latent space approach", in "the Springer series of Lecture Notes in Computer Science", 2006.
  • Sileye O. Ba and Jean Marc Odobez, "Evaluation of Multiple Cues Head Pose Tracking Algorithm in Indoor Environments", in "International Conference on Multimedia & Expo ICME 2005", 2005.
  • Pedro Quelhas, Florent Monay, Jean-Marc Odobez, Daniel Gatica-Perez, Tinne Tuytelaars, and Luc Van Gool, "Modeling Scenes with Local Descriptors and Latent Aspects", in "IEEE Int. Conf. on Computer Vision", 2005.
  • G. Antonini, S. Venegas, M. Bierlaire and J. Thiran, "Behavioral priors for detection and tracking of pedestrians in video sequences", international Journal of Computer Vision, Vol. 69, No 2, pp. 159 - 180, August 2006
  • Zimmermann, M., Chappelier, J.-C., Bunke, H., "Offline grammar-based recognition of handwritten sentences", IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 18, No. 5, May 2006, 818 – 821
  • G. Antonini and J. Thiran, "Counting pedestrians in video sequences using trajectory clustering", accepted in IEEE Transactions on Circuits and Systems for Video Technology, October 2005
  • Liwicki, M., Bunke, H., "Handwriting recognition of whiteboard notes - studying the influence of training set size and type", accepted for publication in Int. Journal of Pattern Recognition and Art. Intelligence
  • M. Bray, E. Koller-Meier, N.N. Schraudolph and L. Van Gool, "Fast stochastic optimization for articulated structure tracking", image and Vision Computing, 2006, in press
  • S. Ba and J.-M. Odobez, "A Rao-Blackwellized Mixed State Particle Filter for Head Pose Tracking", in Proc. of ACM ICMI Multimodal Multiparty Meeting Processing (ICMI-MMMP) Workshop, Trento, Italy, Oct 2005.
  • R. Kehl, M. Bray and L. Van Gool, "Markerless Full Body Tracking by Integrating Multiple Cues", PHI'05 Workshop in Conjunction with ICCV 2005, October 2005.
  • K. Smith, S. Schreiber, I. Potucek, V. Beran, G. Rigoll, and D. Gatica-Perez, "2-D Multi-Person Tracking: a Comparative Study in AMI Meetings", in Proc. Workshop on Classification of Events, Activities and Relationships (CLEAR), Southampton, Apr. 2006
  • S.O. Ba and J.M Odobez, "Head Pose Tracking and Focus of Attention Recognition Algorithms in Meeting Rooms", in Proc. Workshop on Classification of Events, Activities and Relationships (CLEAR), Southampton, Apr. 2006
  • S. Ba and J.M. Odobez, "A Study on Focus of Attention Modelling using Head Pose", in Proc. 3rd Joint Workshop on Machine Learning for Multimodal Interaction (MLMI), Washington DC, May 2006
  • M. Al-Hames, T. Hain, J. Cernocky, S. Schreiber, M. Poel, R. Muller, S. Marcel, D. van Leeuwen, J. Odobez, S. Ba, H. Bourlard, F. Cardinaux, D. Gatica-Perez, A. Janin, P. Motlicek, S. Reiter, S. Renals, J. van Rest, R. Rienks, G. Rigoll, K. Smith, A. Thean, and P. Zemcik, "Audio-Visual Processing in Meetings: Seven Questions and Current AMI Answers", in Proc. Workshop on Machine Learning for Multimodal Interaction (MLMI), Washington DC, May 2006
  • Herbert Bay, Tinne Tuytelaars, Luc Van Gool, "SURF: Speeded Up Robust Features", proceedings of the ninth European Conference on Computer Vision, May 2006
  • K. Smith, P. Quelhas, and D. Gatica-Perez, "Detecting Abandoned Luggage Items in a Public Space", in Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Workshop on Performance Evaluation of Tracking and urveillance (CVPR-PETS), New York, Jun. 2006
  • L. Van Gool, T. Jaeggli and E. Koller-Meier, "Combining Sample-Based and Analytic Density Propagation for Monocular Tracking", learning, Representation and Context for Human Sensing in Video (workshop in conjunction with CVPR), New York, June, 2006
  • Thomas, V. Ferrari, B. Leibe, T. Tuytelaars, B. Schiele, and L. Van Gool, "Towards Multi-View Object Class Detection", IEEE Conference on Computer Vision and Pattern Recognition (CVPR'06), 2006
  • T. Jaeggli, E. Koller-Meier, L. Van Gool, "Monocular Tracking with a Mixture of View-Dependent Learned Models", IV Conference on Articulated Motion and Deformable Objects (AMDO 2006), July 2006
  • Effrosyni Kokiopoulou and Pascal Frossard, "Distributed SVM applied to image classification", to appear in IEEE ICME, Toronto, July 2006
  • Bertolami, R., Bunke, H., "Diversity analysis for ensembles of word sequence recognisers", accepted for Joint Int. Workshops on Structural and Syntactic Pattern Recognition and Statistical Techniques in Pattern Recognition, Hong Kong, August 2006
  • Fasel and D. Gatica-Perez, "Rotation-Invariant Neoperceptron", international Conference on Pattern Recognition (ICPR 2006), August 2006
  • Philipp Zehnder, Esther Koller-Meier, Luc Van Gool, "Efficient, Simultaneous Detection of Multiple Object Classes", 18th International Conference on Pattern Recognition (ICPR 2006), August 2006
  • Spillmann, B., Neuhaus, M., Bunke, H., Pekalska, E., Duin, B., "Transforming strings to vector spaces using prototype selection", accepted for Joint Int. Workshops on Structural and Syntactic Pattern Recognition and Statistical Techniques in Pattern Recognition, Hong Kong, August 2006
  • Neuhaus, M., Riesen, K., Bunke, H., "Fast suboptimal algorithms for the computation of graph edit distance", accepted for Joint Int. Workshops on Structural and Syntactic Pattern Recognition and Statistical Techniques in Pattern Recognition, Hong Kong, August 2006
  • Neuhaus, M., Bunke, H., "A random walk kernel derived from graph edit distance, accepted for Joint Int. Workshops on Structural and Syntactic Pattern Recognition and Statistical Techniques", in Pattern Recognition, Hong Kong, August 2006
  • Bertolami, R., Bunke, H., "Early feature stream integration versus decision level combination in a multiple classifier system for text line recognition", accepted for Int. Conference on Pattern Recognition, Hong Kong, Aug 2006
  • Schlapbach, A., Bunke, H., "Off-line writer identification using Gaussian mixture models", accepted for Int. Conference on Pattern Recognition, Hong Kong, Aug 2006
  • Neuhaus, M., Bunke, H., "A convolution kernel for error-tolerant graph matching", accepted for Int. Conference on Pattern Recognition, Hong Kong, Aug 2006
  • Liwicki, M., Scherz, M., Bunke, H., "Word segmentation of on-line handwritten text lines", accepted for Int. Conference on Pattern Recognition, Hong Kong, Aug 2006
  • G. Antonini, M. Sorci, M. Bierlaire and J. Thiran, "Discrete Choice Models for Static Facial Expression Recognition", proceedings ACIVS, Antwerp, Belgium, 2006
  • Y. Rodriguez and S. Marcel, "Face Authentication Using Adapted Local Binary Pattern Histograms", ECCV06.
  • G. Heusch, Y. Rodriguez and S. Marcel, "Local Binary Patterns as an Image Preprocessing for Face Authentication", AFGR06.
  • V. Popovici, J. Meynet and J. Thiran, "Anisotropic Gaussian Filters for Face Class Modeling", to appear in Proc. of 2nd International Conference on Intelligent Computer Communication and Processing (ICCP), 2006
  • Effrosyni Kokiopoulou and Pascal Frossard, "Classification-Specific Feature Sampling for Face Recognition", to appear in IEEE MMSP, Victoria, BC, October 2006
  • Effrosyni Kokiopoulou and Pascal Frossard, "Pattern Detection by Distributed Feature Extraction", to appear in IEEE ICIP, Atlanta, October 2006
  • J.-M. Odobez, D. Gatica-Perez, and S. Ba, "Embedding Motion in Model-Based Stochastic Tracking", IEEE Trans. on Image Processing, accepted for publication, Nov. 2005.
  • Neuhaus, M., Bunke, H., "Automatic learning of cost functions for graph edit distance", accepted for publication in Information Sciences
  • Neuhaus. M., Bunke, H., "Edit distance based kernel functions for structural pattern Classification", accepted for publication in Pattern Recognition
  • Bertolami, R., Zimmermann, M., Bunke, H., "Rejection strategies for offline handwritten text line recognition", accepted for publication in Pattern Recognition Letters

 

IM2 Phase I

 

IM2.ACP

 

  • Neuhaus. M., Bunke, H., "Self-organizing maps for learning the edit costs in graph matching", IEEE Trans. Systems, Man, and Cybernetics, 35(3), 503 – 514, 2005.
  • Bunke, H., Irniger, Ch., Neuhaus, M., "Novel developments in graph matching", Proc. 10th Iberoamerican Congress on Pattern Recognition, Havana, Cuba, 2005.
  • Neuhaus, M., Bunke, H., "Graph-based Multiple Classifier Systems - A Data Level Fusion Approach", Proc. 13th International Conference on Image Analysis and Processing, ICIAP, Cagliari, 2005.
  • Bunke, H., Irniger, Ch., Neuhaus, M., "Graph Matching – Challenges and Potential Solutions", Proc. 13th Int. Conference on Image Analysis and Processing, ICIAP, Cagliari, Italy, 2005.
  • Neuhaus, M., Bunke, H., "A Graph Matching Based Approach to Fingerprint Classification Using Directional Variance", Proc. 5th Int. Conf. on Audio- and Video-Based Biometric Person Authentication, 2005.
  • Neuhaus, M., Bunke, H.,"Edit distance based kernel functions for attributed graph matching", in Brun, L., Vento, M., (eds.): Graph-Based Representations in Pattern Recognition, Springer, LNCS 3434, 352-361, 2005.
  • Serrau, A., Marcialis, G. L., Bunke, H., Roli, F., "An experimental comparison of fingerprint classification methods using graphs", in Brun, L., Vento, M., (eds.): Graph-Based Representations in Pattern Recognition, Springer, LNCS 3434, 2005.
  • S. Bengio, "Multimodal speech processing using asynchronous hidden markov models", Information Fusion, 5(2):81--89, 2004.
  • S. Bengio, J. Marithoz, and M. Keller, "The expected performance curve", in International Conference on Machine Learning, ICML, Workshop on ROC Analysis in Machine Learning, 2005.
  • Mohamed F. BenZeghiba and Hervé Bourlard, "Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition", in International Conference on Spoken Language Processing (ICSLP 2004), Juju Island, Korea, 2004.(IDIAP­RR 04­23)
  • Conrad Sanderson and Kuldip K. Paliwal, "Identity verification using speech and face information", Digital Signal Processing, 14(5):449--480, 2004.
  • Fabien Cardinaux, Conrad Sanderson, and Samy Bengio, "User Authentication via Adapted Statistical Models of Face Images", IEEE Transaction on Signal Processing, 2005. (IDIAP­RR 04­38)
  • Norman Poh and Samy Bengio, "Noise­robust multi­stream fusion for text-independent speaker authentication", in The Speaker and Recognition Workshop, 2004.
  • Norman Poh and Samy Bengio, "Towards predicting optimal subsets of base-experts in biometric authentication task",iIn S. Bengio and H. Bourlard, editors, Machine Learning for Multimodal Interactions: First International Workshop, MLMI, Lecture Notes in Computer Science, volume LNCS 3361, pages 159--172, 2004.
  • Norman Poh and Samy Bengio, "Database, protocol and tools for evaluating score­level fusion algorithms in biometric authentication", in Fifth Int'l. Conf. Audio­ and Video­Based Biometric Person Authentication AVBPA, 2005.
  • Norman Poh and Samy Bengio, "Eer of fixed and trainable fusion classifiers: A theoretical study with application to biometric authentication tasks", in Sixth International Workshop on Multiple Classifier System (MCS2005), 2005.
  • Norman Poh and Samy Bengio, "F­ratio client­dependent normalisation on biometric authentication tasks", in Proceedings of the 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP­ 05), pages I--721--724, 2005.
  • Norman Poh and Samy Bengio, "How do correlation and variance of base­experts affect fusion in biometric authentication tasks?", IEEE Trans. on Signal Processing, 2005.
  • Norman Poh and Samy Bengio, "Improving fusion with margin­derived confidence in biometric authentication tasks", in Fifth Int'l. Conf. Audio­ and Video­Based Biometric Person Authentication AVBPA, 2005.
  • Norman Poh and Samy Bengio, "A novel approach to combining client­dependent and confidence information in multimodal biometric", in Fifth Int'l. Conf. Audio­ and Video­Based Biometric Person Authentication AVBPA, 2005.
  • V. Popovici, J.­P. Thiran, Y. Rodriguez, and S. Marcel, "On performance evaluation of face detection and localization algorithms", in 17th International Conference on Pattern Recognition, ICPR2004, Cambridge, United Kingdom, 2004.
  • C. Sanderson, F. Cardinaux, and S. Bengio, "On accuracy/robustness/complexity trade­offs in face verification", in IEEE International Conference on Information Technology and Applications, ICITA, 2005.
  • M. Gerber and B. Pfister, "Quasi text-independent speaker verification with neural networks", MLMI'05 Workshop, Edinburgh (United Kingdom), July 2005.
  • J. Richiardi, P. Prodanov, A. Drygajlo, "A Probabilistic Measure of Modality Reliability in Speaker Verification", IEEE Int. Conf. Acoustics, Speech and Signal Processing (ICASSP 2005), Philadelphia, March 19-23, 2005, pp. 709-712, (winner of the Student Paper Contest)
  • P. Prodanov, A. Drygajlo, "Bayesian Networks Based Multi-modality Fusion for Error Handling in Human-Robot Dialogues under Noisy Conditions", Speech Communication, Elsevier, Special Issue on Error Handling in Spoken Dialogue Systems, vol. 45, pp. 231-248, March 2005.
  • H. Ketabdar, J. Richiardi, A. Drygajlo, "Global Feature Selection for On-line Signature Verification”, 12 Biennal Conf. of the International Graphonomics Society (IGS 2005), Salerno, Italy, June 26-29, 2005, pp. 59-63, (Award for the best graduate student presentation in the field of Motor Control, (common paper EPFL-IDIAP)).
  • K. Kryszczuk, A. Drygajlo, "Gradient-Based Image Segmentation for Face Recognition Robust to Directional Illumination", to be published in Int. Conf. on Visual Communications and Image Processing (VCIP 2005), Beijing, China, July 12-15, 2005.
  • K. Kryszczuk, A. Drygajlo, "Addressing the Vulnerabilities of Likelihood-Ratio Based Face Verification", to be published in Audio- and Video-based Biometric Person Authentication (AVBPA 2005), New York, July 20-22, 2005.
  • J. Richiardi, H. Ketabdar, A. Drygajlo, "Local and Global Feature Selection for On-line Signature Verification", to be published in the 8th IAPR International Conference on Document Analysis and Recognition (ICDAR 2005), Seoul, Korea, August 29-September 1, 2005, (common paper EPFL-IDIAP).
  • K. Kryszczuk, J. Richiardi, P. Prodanov, A. Drygajlo, "Error Handling In Multiple Classifier Biometric Systems Using Reliability Measures", to be published in the 13th European Signal Processing Conference (EUSIPCO 2005), Antalya, Turkey, September 4-8, 2005.
  • Conrad Sanderson and Samy Bengio, "Statistical Transformations of Frontal Models for Non-Frontal Face Verification", in Proceedings of the IEEE International Conference on Image Processing (ICIP), 2004.
  • Fabien Cardinaux, Conrad Sanderson, and Samy Bengio, "Face Verification Using Adapted Generative Models", in The 6th International Conference on Automatic Face and Gesture Recognition, FG2004, 2004.
  • S. Bengio and J. Mariéthoz, "The Expected Performance Curve: a New Assessment Measure for Person Authentication", in "Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop", 2004
  • Conrad Sanderson and Kuldip K. Paliwal, "Structurally noise resistant classifier for multi-modal person verification", in "Pattern Recognition Letters", 2003
  • S. Bengio and J. Mariéthoz, "A Statistical Significance Test for Person Authentication", in "Proceedings of Odyssey 2004: The Speaker and Language Recognition Workshop", 2004
  • Norman Poh and Samy Bengio, "Why Do Multi-Stream, Multi-Band and Multi-Modal Approaches Work on Biometric User Authentication Tasks?", in "Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04)", 2004
  • Conrad Sanderson and Samy Bengio, "Statistical Transformations of Frontal Models for Non-Frontal Face Verification", in "Proceedings of the IEEE International Conference on Image Processing (ICIP)", 2004
  • Norman Poh and Samy Bengio, "Noise-Robust Multi-Stream Fusion for Text-Independent Speaker Authentication", in "The Speaker and Recognition Workshop", 2004
  • Norman Poh, Conrad Sanderson, and Samy Bengio, "An Investigation of Spectral Subband Centroids for Speaker Authentication", in "Int'l Conf. on Biometric Authentication", 2004
  • Fabien Cardinaux, Conrad Sanderson, and Samy Bengio, "Face Verification Using Adapted Generative Models", in "The 6th International Conference on Automatic Face and Gesture Recognition, FG2004", 2004
  • Sébastien Marcel, "A Symmetric Transformation for LDA-based Face Verification", in "Proceedings of the 6th International Conference on Automatic Face and Gesture Recognition", 2004
  • Mohamed F. BenZeghiba and Hervé Bourlard, "Confidence Measures in Multiple pronunciations Modeling For Speaker Verification", in "Proceedings of the 2004 IEEE International Conference on Acoustics Speech, and Signal Processing (ICASSP-04)", 2004
  • Conrad Sanderson and Samy Bengio, "Augmenting Frontal Face Models for Non-Frontal Verification", in "Proceedings of the 2003 Workshop on Multimodal User Authentication (MMUA'03)", 2003
  • Norman Poh and Samy Bengio, "Non-Linear Variance Reduction Techniques in Biometric Authentication", in "Workshop on Multimodal User Authentication", 2003
  • El Hannani and D. Petrovska-Delacretaz and G. Chollet, "Linear and Non-Linear Fusion of ALISP-based and GMM systems for Text-independent Speaker Verification", proc. of ODYSSEY04, The Speaker and Language Recognition Workshop, Toledo, Spain, May31-June3, 2004
  • El Hannani and D. Petrovska-Delacretaz and R. Blouet and G. Chollet, " Segmental Score Fusion for Text-independent Speaker Verification", Workshop on Multimodal User Authentication", Santa Barbara, CA, 74-79, December 11-12, 2003
  • U. Niesen and B. Pfister, " Speaker verification by means of ANNs", proceedings of the ESANN'04, Bruges (Belgium), April 2004
  • "Neuhaus, M., Bunke, H., ""An error-tolerant approximate matching algorithm for attributed planar graphs and its application to fingerprint classification"" , in Fred, A. et al. (eds.): Structural, Syntactic, and Statistical Pattern Recognition, Proc. Joint IAPR Int. Workshops SSPR and
  • SPR, Springer LNCS 3138, 180 - 189, 2004"
  • K. Kryszczuk, A. Drygajlo, "Color Correction for Face Detection Based on Human Visual Perception Metaphor", Workshop on Multimodal User Authentication, Santa Barbara, CA, USA, pp. 138-143, Dec. 11-12, 2003
  • J. Richiardi, J. Fierrez-Aguilar, J. Ortega-Garcia, A. Drygajlo, "On-line Signature Verification Resilience to Packet Loss in IP Networks", 2nd COST 275 Workshop – Biometrics on the Internet , Vigo, Spain, pp. 11-16, March 25-26, 2004
  • Drygajlo, D. Meuwly, and A. Alexander, “Statistical Methods and Bayesian Interpretation of Evidence in Forensic Automatic Speaker Recognition”, Eurospeech 2003, Geneva, Switzerland, pp. 689–692, 2003
  • Alexander, F. Botti, and A. Drygajlo, “Handling Mismatch in Corpus-Based Forensic Speaker Recognition”, Odyssey 2004, The Speaker and Language Recognition Workshop, Toledo, , pp. 69–74, Spain, May 2004
  • F. Botti., A. Alexander, and A. Drygajlo, “An interpretation framework for the evaluation of evidence in forensic automatic speaker recognition with limited suspect data”, Odyssey 2004, The Speaker and Language Recognition Workshop, pp. 63–68 , Toledo, Spain, 2004
  • Drygajlo, P. Prodanov, G. Ramel, M. Meisser, R. Siegwart, "On Developing a Voice Enabled Interface for Interactive Tour-Guide Robots", invited paper, Advanced Robotics, Japan, vol. 17, no. 7, pp. 599-616, 2003
  • Drygajlo, "Speech Coding and Recognition in Noisy Environments for Communication Terminals", chapter in J. Tasic(, et al. (Eds), "Intelligent Integrated Media Communication Techniques", Kluwer Academic Publishers, pp. 337-359, Boston 2003
  • P. Prodanov, A. Drygajlo, “Bayesian Networks for Spoken Dialogue Management in Multimodal Systems of Tour-Guide Robots”, Eurospeech 2003, Geneva, Switzerland, pp. 1057-1060, 2003
  • P. Prodanov, A. Drygajlo, “Multimodal Interaction Management for Tour-Guide Robots Using Bayesian Networks”, 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2002), Las Vegas, USA, Oct. 27-31, 2003
  • P. Prodanov, A. Drygajlo, “Bayesian Networks Based Signal Fusion for User Goal Identification in Human-Robot Dialogues”, 6th COST 276 Workshop on Information and Knowledge Management for Integrated Media Communication, Thessaloniki, Greece, May 6-7, 2004
  • J. Czyz, S. Bengio, C. Marcel, and L. Vandendorpe, "Scalability Analysis of Audio-Visual Person Identity Verification", in "4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA", 2003
  • Norman Poh, Sébastien Marcel, and Samy Bengio, "Improving Face Authentication Using Virtual Samples", in "IEEE International Conference on Acoustics, Speech, and Signal Processing", 2003
  • S. Bengio, C. Marcel, S. Marcel, and J. Mariéthoz,"Confidence Measures for Multimodal Identity Verification", in "Information Fusion", 2002.
  • Fabien Cardinaux, Conrad Sanderson, and Sébastien Marcel, "Comparison of MLP and GMM Classifiers for Face Verification on XM2VTS", in "4th International Conference on AUDIO- and VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION", 2003.
  • Le, Q. and Bengio, S., "Client Dependent GMM-SVM Models for Speaker Verification", in "International Conference on Artificial Neural Networks, ICANN/ICONIP 2003", 2003.
  • Sanderson, S. Bengio, H. Bourlard, J. Mariethoz, R. Collobert, M. F. BenZeghiba, F. Cardinaux, and S. Marcel, "Speech & Face Based Biometric Authentication at IDIAP", in "Proceedings of the 2003 IEEE International Conference on Multimedia \& Expo (ICME-03)", 2003
  • Mohamed F. BenZeghiba and Hervé Bourlard, "Hybrid HMM/ANN and GMM Combination for User-Customized Password Speaker Verification", in "Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03)", 2003
  • Conrad Sanderson and Samy Bengio, "Robust Features for Frontal Face Authentication in Difficult Image Conditions", in "Proceedings of 4th International Conference on Audio- and Video-based Biometric Person Authentication (AVBPA-03)", 2003.
  • Conrad Sanderson and Kuldip K. Paliwal, "Noise Resistant Audio-Visual Verification via Structural Constraints", in "Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03)", 2003.
  • Sébastien Marcel, Christine Marcel, and Samy Bengio, "A State-of-the-art Neural Network for Robust Face Verification", in "Proceedings of the COST275 Workshop on The Advent of Biometrics on the Internet", 2002.
  • Mariéthoz, J. and Bengio, S., "A Comparative Study of Adaptation Methods for Speaker Verification", in " International Conference on Spoken Language Processing ICSLP", 2002
  • Sébastien Marcel and Samy Bengio, "Improving Face Verification using Skin Color Information", in "Proceedings of the 16th International Conference on Pattern Recognition", 2002.
  • Fabien Cardinaux and Sébastien Marcel,"Face Verification using MLP and SVM", in "XI Journees NeuroSciences et Sciences pour l'Ingenieur (NSI 2002)", 2002.
  • Beat Pfister and Rene Beutler, "Estimating the Weight of Evidence in Forensic Speaker Verification" Speech Processing Group Computer engineering and Networks Laboratory, ETH Zurich, to appear in Proceedings of Eurospeech 2003, Septembre 1-4 Geneva, Switzerland 2003.
  • Hertel, C., Bunke, H. "A set of novel features for writer identification", in J. Kittler, M.S. Nixon (eds.): Audio-and Video-Based Biometric Person Authentication, Proc. 4th Int. Conference AVBPA, 2003, 679 - 687
  • Neuhaus, M., Bunke, H. "Self-organizing graph edit distance", in E. Hancock, M. Vento (eds.): Graph Based Representations in Pattern Recognition, Proc. 4th Int. Workshop BR2003, Springer, LNCS 2726, 83 - 94
  • Bunke, H., "Graph-based tools for data mining and machine learning", in P. Perner, A. Rosenfeld (eds.): Machine Learning and Data Mining in Pattern Recognition, Proc. 3rd Int. Conference, Springer LNAI 2734, 7 - 19, 2003
  • Mohamed F. BenZeghiba and Hervé Bourlard, "User-Customized Password Speaker Verification based on HMM/ANN and GMM Models'', IDIAP-RR 02-10, 2002, Proc. Intl Conf. on Spoken Language Processing (ICSLP), Denver, Sep. 2002

 

IM2.ACP, IM2.MI

 

  • S. Bengio, "Multimodal Authentication using Asynchronous HMMs", in "4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA", 2003.

 

IM2.DI (ex AP)

 

  • Denis Lalanne, Rolf Ingold, "Structuring multimedia archives with static documents", ERCIM News No. 62 "Multimedia Informatics", July 2005.
  • Denis Lalanne, Rolf Ingold, " Documents statiques et multimodalité, L'alignement temporel pour structurer des archives multimédias de réunions ", Document numérique Vol.8 N° 4/2004, " Temps et Documents ", Lavoisier, pp.65-89, 02-2005.
  • Dalila Mekhaldi, Denis Lalanne and Rolf Ingold, "Unity is Strength: Coupling media for thematic segmentation", DAS 2004, 6th IAPR Workshop on Document Analysis Systems, LNCS 3163, Springer-Verlag, Berlin/Heidelberg, pp. 559, 2004.
  • Ardhendu Behera, Denis Lalanne and Rolf Ingold, "Combining Color and Layout Features for the Identification of Low-resolution Documents", International Journal of Signal Processing Volume 2 Number 1 2005 ISSN:1304-4478, pp. 7-14, 2005
  • Rigamonti M., Lalanne D., Evequoz F., Ingold R., "Browsing multimedia archives through implicit and explicit cross-modal links", 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI'05), 11-13 July 2005.(accepted)
  • Dalila Mekhaldi, Denis Lalanne, and Rolf Ingold, "From Searching to Browsing through Multimodal Documents Linking", Proceedings of 8th International Conference on Document Analysis and Recognition (ICDAR2005), Seoul, Korea, 2005.
  • Ardhendu Behera, Denis Lalanne, and Rolf Ingold, "Color, an Inevitable Feature for Identifying Low-resolution Documents", Proceedings of 8th International Conference on Document Analysis and Recognition (ICDAR2005), Seoul, Korea, 2005.
  • Maurizio Rigamonti, Jean-Luc Bloechle, Karim Hadjar, Denis Lalanne, and Rolf Ingold, "Towards a Canonical and Structured Representation of PDF Documents through Reverse Engineering", Proceedings of 8th International Conference on Document Analysis and Recognition (ICDAR2005), Seoul, Korea, 2005.
  • Hadjar K., Ingold, R., “Logical labeling of Arabic Newspapers using Artificial Neural Nets”. Proceedings of the 8th International Conference on Document Analysis and Recognition, Seoul, Korea, August 2005.
  • Behera, D. Lalanne, and R. Ingold, "Color and Layout-based Identification of Documents Captured from Low-resolution Handheld Devices", International Conference on Pattern Recognition and Computer Vision 2005, Proceedings of the second World Enformatica Congress, WEC'05, ISBN 975-98458-3-0, Vol 1, pp. 51-55, Istanbul, Turkey, 25-27 February 2005
  • Dalila Mekhaldi, Denis Lalanne, Rolf Ingold, "Using Bi-modal Alignment and Clustering Techniques for Documents and Speech Thematic Segmentations", in Thirteenth Conference on Information and Knowledge Management CIKM 2004, pp. 69-77, Washington D.C., U.S.A, November 8-13, 2004.
  • Ardhendu Behera, Denis Lalanne and Rolf Ingold. "Visual Signature based Identification of Low-resolution Document Images". The ACM Symposium on Document Engineering 2004, Milwaukee, Wisconsin, USA, pp. 178-187, October 28-30, 2004.
  • Dalila Mekhaldi, Denis Lalanne, Rolf Ingold. "Thematic Segmentation of Meetings Through Document/Speech Alignment", in ACM Multimedia 2004, 12th Annual Conference, New York City, Columbia University, pp. 804-811, October 10-16, 2004.
  • Dalila Mekhaldi, Denis Lalanne. "Thematic Alignment Of Documents With Recorded Speech", in doctoral symposium of ACM Multimedia 2004, 12th Annual Conference, New York City, Columbia University, pp. 973-974, October 10-16, 2004.
  • Dalila Mekhaldi, Denis Lalanne and Rolf Ingold, "Thematic Alignment of recorded speech with documents", DocEng 2003, ACM Symposium on Document Engineering, ACM Press pp. 52-54, Grenoble, France, 2003,
  • Karim Hadjar, Maurizio Rigamonti, Denis Lalanne and Rolf Ingold, "Xed: a new tool for eXtracting hidden structures from Electronic Documents", International Workshop on Document Image Analysis for Libraries, IEEE Computer Society, pp. 212-224, PARC, Palo Alto, California, USA, 2004,
  • Maurizio Rigamonti, Karim Hadjar, Denis Lalanne et Rolf Ingold, "XED, Un outil pour l'extraction et l'analyse de documents PDF", Huitième Colloque International Francophone sur l'Ecrit et le Document 2004 La Rochelle, France, pp. 85-90
  • Éric Clavier, Sébastien Adam, Pierre Héroux, Maurizio Rigamonti et Jean-Marc Ogier, "DocMining : Une plate-forme de conception de systèmes d'analyse de documents.", CIFED 2004, pp. 97-102, La Rochelle, France, 21-25 June 2004
  • Behera A., Lalanne D., Ingold R. "Looking at projected documents: Event detection & document identification", the 2004 IEEE International Conference on multimedia and expo (ICME 2004). Taipei, Taiwan, June 2004
  • Dalila Mekhaldi, Denis Lalanne and Rolf Ingold, "Unity is Strength: Coupling media for thematic segmentation", DAS 2004, 6th IAPR International Workshop on Document Analysis Systems, University of Florence, Italy, September 8-10, 2004
  • Andrei Popescu-Belis and Denis Lalanne, “References to Documents (Ref2doc): Reference Resolution Over a Restricted Domain”, ACL 2004 Workshop on Reference Resolution and its Applications, Barcelona, Spain, 2004
  • Karim HADJAR and Rolf Ingold, "Physical Layout Analysis of Complex Structured Arabic Documents using Artificial Neural Nets", 6th IAPR International Workshop on Document Analysis Systems (DAS'2004), Florence (Italy), 08-10 September 2004
  • S. Adam, M. Rigamonti, E. Clavier, J.-M. Ogier, E. Trupin and K. Tombre, "DocMining: A Document Analysis System Builder", 6th IAPR International Workshop on Document Analysis Systems (DAS'2004), Florence (Italy), september 2004
  • Dalila Mekhaldi, Denis Lalanne, Rolf Ingold, "Meeting Thematic Segmentation Through Document/Speech Alignment", in ACM Multimedia 2004, 12th Annual Conference, October 10-16, 2004, New York City, Columbia University
  • Dalila Mekhaldi, Denis Lalanne, "Thematic Alignment Of Documents With Recorded Speech", in doctoral symposium of ACM Multimedia 2004, 12th Annual Conference, New York City, Columbia University, October 10-16, 2004
  • Ardhendu Behera, Denis Lalanne and Rolf Ingold, "Visual Signature based Identification of Low-resolution Document Images", in ACM Symposium on Document Engineering 2004, Milwaukee, Wisconsin, USA, October 28-30, 2004
  • Denis Lalanne, Stéphane Sire et al."A research agenda for assessing the utility of document annotations in multimedia databases of meeting recordings", 3rd International Workshop on Multimedia Data and Document Engineering (MDDE-2003). September 8th 2003, Berlin, Germany, in conjunction with VLDB-2003.
  • Denis Lalanne, Dalila Mekhaldi and Rolf Ingold. "Talking about documents: revealing a missing link to multimedia meeting archives" in Document Recognition and Retrieval XI conference, IS&T/SPIE's International Symposium on Electronic Imaging 2004, 18-22 January 2004 in San Jose, CA USA.

 

IM2.DI (ex AP), IM2.MDM

 

  • Popescu-Belis A. & Lalanne D., "Detection and Resolution of References to Meeting Documents", in Renals S. & Bengio S., eds., Machine Learning for Multimodal Interaction II, LNCS, Springer-Verlag, Berlin/Heidelberg, 12 p., 2005. (in press)
  • Popescu-Belis Andrei and Lalanne Denis, "Reference Resolution over a Restricted Domain: References to Documents", ACL 2004 Workshop on Reference Resolution and its Applications, Barcelona, Spain, p.71-78, 2004.
  • Popescu-Belis A., Clark A., Georgescul M., Lalanne D. & Zufferey S., "Shallow Dialogue Processing Using Machine Learning Algorithms (or not)", in Bengio S. & Bourlard H., eds., Machine Learning for Multimodal Interaction, LNCS 3361, Springer-Verlag, Berlin/Heidelberg, p.277-290, 2005.
  • Popescu-Belis A. & Lalanne D., "Ref2doc: Reference Resolution over a Restricted Domain", proc. of ACL 2004 Workshop on Reference Resolution and its Applications, Barcelona, Spain, 8 p.11, 2004
  • Denis Lalanne, Rolf Ingold, Didier von Rotz, Ardhendu Behera, Dalila Mekhaldi, Andrei Popescu-Belis (in press) - "Using static documents as structured and thematic interfaces to multimedia meeting archives". In Bourlard H. & Bengio S., eds., Multimodal Interaction and Related Machine Learning Algorithms, LNCS, Springer-Verlag, Berlin, 8 p, 2004

 

IM2.DS

 

  • J. Tarraga, R.D. Hersch, ``Parallel File Striping onOptical Jukebox Servers'', in Proceedings IEEE International Conference on Multimedia and Expo, Lausanne, Switzerland, August 2002

 

IM2.IIR

 

  • Eric Bruno and Nicolas Moenne-Loccoz and Stéphane Marchand Maillet, "Interactive Video Retrieval based on Multimodal Dissimilarity Representation", in Proceedings of the 1st Workshop on Machine Learning Techniques for Processing Multimedia Content, MLMM'05, Bonn, Germany, August, 2005
  • Éric Bruno and Nicolas Moenne-Loccoz and Stéphane Marchand-Maillet, "Unsupervised Event Discrimination Based on Nonlinear Temporal Modelling of Activity", in Pattern Analysis and Application (PAA), 2005. (to appear)
  • Eric Bruno, Nicolas Moenne-Loccoz, and Stéphane Marchand Maillet, "Learning user queries in multimodal dissimilarity spaces", in Proceedings of the 3rd International Workshop on Adaptive Multimedia Retrieval, AMR'05, Glasgow, UK, July 2005.
  • Bruno Janvier, Éric Bruno, Stéphane Marchand-Maillet, and Thierry Pun, "Information-theoretic temporal segmentation of videos and applications: multiscale keyframe selection and transition detection", Multimedia Tools and Applications, 2005. (to appear).
  • B. Janvier, E. Bruno, S. Marchand-Maillet, and T. Pun, "Semantic Segmentation of Video Collections using Boosted Random Fields", MLMI'05, 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, 11-13 July 2005.
  • Bruno Janvier and Nicolas Moenne-Loccoz and Stéphane Marchand Maillet and Thierry Pun, "A contextual model for semantic video structuring", in 13th European Signal Processing Conference, EUSICO'05, Antalya, Turkey, September, 2005
  • S. Kosinov and S. Marchand-Maillet, "Hierarchical Ensemble Learning For Multimedia Categorization And Autoannotation", Proceedings of the 2004 IEEE Signal Processing Society Workshop (MLSP 2004), pp. 645-654, São Luís, Brazil, Sept. 2004.
  • S. Kosinov and S. Marchand-Maillet, "Multimedia autoannotation via hierarchical semantic ensembles", Proceedings of the Int. Workshop on Learning for Adaptable Visual Systems (LAVS 2004), Cambridge, UK, 2004.
  • S. Kosinov and S. Marchand-Maillet, "Evaluation of distance-based discriminant analysis and its kernelized extension in visual object recognition", Proceedings of the 7th International on signalimage processing and pattern recognition (UkrObraz 2004), Kijiv, Ukraine, 2004.
  • S. Kosinov, I. Titov and S. Marchand Maillet, "Large Margin Multiple Hyperplane Classification for Content-based Multimedia Retrieval", Proceedings of the 1st Workshop on Machine Learning Techniques for Processing Multimedia Content, MLMM'05, Bonn, Germany, August 2005.
  • S. Marchand-Maillet and E. Bruno, "Collection Guiding: A new framework for handling large multimedia collections", Proceedings of the First Workshop on Audio-visual Content And Information Visualization In Digital Librairies, AVIVDiLib05, Cortona, Italy, 2005.
  • S. Marchand-Maillet and E. Bruno, "Characterising optimal structures in multimedia collections for enhanced exploration", MLMI'05, 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Edinburgh UK, 11-13 July 2005.
  • N. Moënne-Loccoz, E. Bruno and S. Marchand-Maillet, "Interactive Partial Matching of Video Sequences in Large Collections", In IEEE International Conference on Image Processing, Genova, Italy, 11-14 September 2005.
  • Nicolas Moënne-Loccoz and Bruno Janvier and Stéphane Marchand-Maillet and Eric Bruno, "Handling Temporal Heterogeneous Data for Content-Based Management of Large Video Collections", in Multimedia Tools and Applications, 2005. (to appear)
  • Nicolas Moenne-Loccoz and Éric Bruno and Stéphane Marchand-Maillet, "Knowledge-based Detection of Events in Video Streams from Salient Regions of Activities", in Pattern Analysis and Applications (PAA), special issue Video Based Event Detection, 2005. (to appear)
  • Nicolas Moenne-Loccoz, Eric Bruno, and Stéphane Marchand Maillet, "Interactive retrieval of video sequences from local feature dynamics", in Proceedings of the 3rd International Workshop on Adaptive Multimedia Retrieval, AMR'05, Glasgow, UK, July 2005.
  • Nicolas Moenne-Loccoz, Bruno Janvier, Stéphane Marchand-Maillet, and Éric Bruno, "An integrating framework for the management of video collections", in First Workshop on Machine Learning and Multimodal Interaction, volume MLMI'04, Lecture Notes in Computer Science 3361, Martigny, Switzerland, 2005
  • Alessandro Vinciarelli, "Noisy Text Categorization", IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005. (accepted for publication)
  • Alessandro Vinciarelli, "Application of Information Retrieval Techniques to Single Writer Documents", Pattern Recognition Letters, 2005. (accepted for publication)
  • Nabil Daddaoua, Alessandro Vinciarelli and Jean-Marc Odobez, "OCR Based Slide Retrieval", International Conference on Document Analysis and Recognition", Seoul, 2005. (to be presented)
  • David Grangier and Alessandro Vinciarelli, "Effect of Segmentation Method on Video Retrieval Performance", IEEE International Conference on Multimedia and Expo, Amsterdam, 2005. (to be presented)
  • Alessandro Vinciarelli, "Effect of Recognition Errors on Information Retrieval Performance", Proceedings of 9th International Workshop on Frontiers in Handwriting Recognition, pp. 275-279, Tokyo, 2004.
  • Arsic, N. Marina and J.-P. Thiran, "Impact of sample sizes on information theoretic measures for audio-visual signal processing", in Proc. of 13th International Conference on Signal Processing (EUSIPCO), Antalya, Turkey, September 2005.
  • M. Keller and S. Bengio, "A Neural Network for Text Representation", in International Conference on Artificial Neural Networks, ICANN, 2005.
  • Vinciarelli A., "Noisy Text Categorization", in Proceedings of International Conference on Pattern Recognition (ICPR), 2004.
  • Vinciarelli, A., "Noisy Text Categorization", in "Proceedings of International Conference on Pattern Recognition (ICPR)", 2004
  • Vinciarelli, A., "Effect of Recognition Errors on Information Retrieval Performance", in "Proceedings of International Workshop on Frontiers in Handwriting Recognition", 2004
  • B. Janvier, E. Bruno, T. Pun, S. Marchand-Maillet, "Information theoretic framework for temporal segmentation of videos and Applications", MTAP - Multimedia Tools and Applications, Kluwer, Special Issue, Selected extended papers from CBMI'03, 3rd Int. Workshop on Content-Based Multimedia Indexing, IRISA, Rennes, France, September 22-24, 2003
  • S. Marchand-Maillet and E. Bruno, "Exploiting User Interaction for Semantic Content-based Image Retrieval", in "Trends and Advances in Content-Based Image and Video Retrieval", R. Veltkamp, L Shapiro, J. Malik, H.-P. Kriegel Eds. LCNS, Springer. 2003 (to appear - Invited contribution)
  • Marchand-Maillet, H. Müller, W. Müller, T. Pun, "Was leisten automatische inhaltbasierte Bildsuchsysteme?, The reality of automatic content-based image retrieval systems", in Suchbilder, W. Ernst, S. Heidenreich, and U. Holl Eds., Kulturverlag Kadmos Publ., 2003
  • H. Müller, W. Müller, S. Marchand-Maillet, D. McG. Squire, T. Pun, "A framework for benchmarking in visual information retrieval", Int. Journal on Multimedia Tools and Applications (Kluwer), Special Issue on Multimedia Information Retrieval, p. 55-73, 21, 2, 2003
  • E. Bruno, S. Marchand-Maillet, "Non-linear temporal modeling for motion-based video overviewing", in European Conference on Content-based Multimedia Indexing (CBMI03), Rennes France, September, 2003.
  • E. Bruno, S. Marchand-Maillet, "Prédiction Temporelle de Descripteurs Visuels pour la Mesure de Similarité entre Vidéos", in 19eme Colloque sur le traitement du signal et des images (GRETSI’03), ENST Paris, France, September, 2003.
  • B. Janvier, E. Bruno, S. Marchand-Maillet, T. Pun, "Information-theoretic framework for the joint temporal partitioning and representation of video data", CBMI'03, 3rd Int. Workshop on Content-Based Multimedia Indexing, IRISA, Rennes, France, September 22-24, 2003
  • C. Jelmini and S. Marchand-Maillet, "OWL-based reasoning with retractable inference", in RIAO 2004, conference on Coupling approaches, coupling media and coupling languages for information retrieval, Avignon, France, April 2004
  • S. Kosinov, S. Marchand-Maillet, T. Pun, "Iterative majorization approach to distance-based discriminant analysis", in: Classification: The Ubiquitous Challenge (Gesellschaft für Klassification, GfKl'04), Dortmund, Germany, March 9-11, 2004
  • S. Kosinov and S. Marchand-Maillet, "Distance-based Discriminant Analysis for pattern classification", Workshop EUNITE 2004. Aachen, Germany, June 2004 (invited paper).
  • S. Kosinov, S. Marchand-Maillet, "Global knowledge learning for multimedia autoannotation", CPR Workshshop LAVS, Cambridge, UK, August 2004
  • S. Kosinov, S. Marchand-Maillet, "Hierarchical ensemble learning for multimedia categorization and autoannotation", IEEE Machine Learning and Signal Processing, Brazil, September 2004
  • S. Kosinov, S. Marchand-Maillet, T.Pun, "Visual object categorization using distance-based discriminant analysis", 4th Int. Workshop on Multimedia Data and Document Engineering (MDDE-04), July 2nd 2004, In conjunction with IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, Washington, DC, USA, 27th June - 2nd July, 2004
  • S. Marchand-Maillet, "Visual Collection Guiding. Dagsthul Seminar on Content-based Retrieval", Schloss Dagsthul, Germany, Jan. 2004
  • N.Moenne-Loccoz, E. Bruno, S. Marchand-Maillet, "Video Content Representation as Salient Regions of Activity", International Conference on Image and Video Retrieval (CIVR 2004), Dublin, IE, July 2004.
  • N. Moenne-Loccoz, B. Janvier, E. Bruno, S. Marchand-Maillet, "Managing Video Collections at Large", ACM SIGMOD Workshop on “Computer Vision meets Databases”. Paris, France, June 2004
  • H. Müller, A.Geissbuhler, S. Marchand-Maillet, "Extension to the Multimedia Retrieval Markup Language: A communication protocol for content-based image retrieval", in European Conference on Content-based Multimedia Indexing (CBMI03), Rennes France, September, 2003
  • H. Müller, A Geissbühler, S. Marchand-Maillet and P. Clough, "Benchmarking Image Retrieval Applications", in Visual Information Systems (VIS2004). San Francisco, 8-10 September 2004 (invited).
  • V. Pallotta, A. Ballim, S. Marchand-Maillet and A. Lisowska, "Towards Meeting Information Sytems: Meeting Knowledge Management", in International Conference on Enterprise Information Sytems (ICEIS 04), Porto, Portugal, 14-17 April 2004
  • Bruno Janvier, Eric Bruno, Stephane Marchand-Maillet and Thierry Pun, "Information-theoretic framework for the joint temporal partioning and representation of video data", in European Conference on Content-based Multimedia Indexing (CBMI 03), Rennes France, September 2003.
  • Eric Bruno and Stephane Marchand-Maillet, "Non-linear temporal modeling for motion-based video overviewing", in European Conference on Content-based Multimedia Indexing (CBMI 03), Rennes France, September 2003
  • Henning Muller, Antoine Geissbuhler and Stephane Marchand-Maillet, "Extension to the Multimedia Retrieval Markup Language: A communication protocol for content-based image retrieval", in European Conference on Content-based Multimedia Indexing (CBMI 03), Rennes France, September 2003.
  • H.Müller, T. Pun, D. McG. Squire, "Learning from user behavior in image retrieval: application of the market basket analysis", Int. J. of Comp. Vision, Special Issue on Content-Based Image Retrieval, 56, 1/2, 65-77, Jan. - Feb. 2004
  • Carlo Jelmini, Stephane Marchand-Maillet, "DEVA: an extensible ontology-based annotation model for visual document collections", In Proceedings of {SPIE} Photonics West, Electronic Imaging 2002, Internet Imaging {IV}, Santa Clara, CA, USA, 2003.
  • S. Marchand-Maillet, H. Müller, W. Müller, T. Pun, "Automatisierte, inhaltbasierte Image-retrieval-Systeme, The reality of automatic content-based image retrieval systems", Suchbilder book, S. Heidenreich, to appear, 2003.
  • S. Marchand-Maillet and E. Bruno, "From Visual Models to Semantic Visual Information Retrieval", Special session at the IEEE Int. Conf. on Image Processing (ICIP 2003), Barcelona, Spain, Sept. 2003
  • S. Marchand-Maillet and E. Bruno, "Exploiting User Interaction for Semantic Content-based Image Retrieval|, in "Trends and Advances in Content-Based Image and Video Retrieval", R. Veltkamp, L Shapiro, J. Malik, H.-P. Kriegel Eds. LCNS, Springer. 2003 (to appear - Invited contribution).
  • E. Bruno, S. Marchand-Maillet "Prédiction Temporelle de Descripteurs Visuels pour la Mesure de Similarité en­tre Vidéos", in 19eme Colloque sur le traitement du signal et des images (GRETSI’03), ENST Paris, France, Sep­tember, 2003.
  • P. Roth, D. Richoz, T. Pun, " A multimodal system for the non-visual exploration of digital pictures", Interact 2003, 9th ICIP TC13 Int. Conf. on Human-Computer Interaction, Zuerich, Switzerland, Sept. 1-5, 2003
  • H. Müller, W. Müller, S. Marchand-Maillet, D. Squire, T. Pun. "A Framework for Benchmarking in Visual Information Retrieval'', In International Journal on Multimedia Tools and Applications (Special Issue on Multimedia Information Retrieval), 2002 (to appear),
  • S. Marchand-Maillet, "Construction of a Formal Multimedia Benchmark'', In Proceedings of the European Signal Processing Conference (EUSIPCO2002)}, Toulouse, France, 2002.
  • H. Müller, S. Marchand-Maillet, T. Pun. "The Truth About Corel - Evaluation in Image Retrieval'', in Proceedings of The Challenge of Image and Video Retrieval (CIVR2002), London, UK, 2002 . (to appear)
  • T. Louchnikova, S. Marchand-Maillet, "Flexible Image Decomposition for Multimedia Indexing and Retrieval'', in Proceedings of SPIE Photonics West, Electronic Imaging 2002, Internet Imaging III, San Jose, USA, 2002
  • "T. Pfund, S. Marchand-Maillet, ""A Dynamic Multimedia Annotation Tool'', in Proccedings of SPIE Photonics West, Electronic Imaging 2002, Internet
  • Imaging III, San Jose, USA, 2002"

 

IM2.IIR, IM2.MI

 

  • D. Gatica-Perez, I. McCowan, M. Barnard, S. Bengio and H. Bourlard, "On Automatic Annotation of Meeting Databases", IEEE International Conference on Image Processing, ICIP, 2003

 

IM2.IIR, IM2.SA

 

  • F. Monay and D. Gatica-Perez, PLSA-based Image Auto-Annotation: Constraining the Latent Space, in "Proc. ACM Int. Conf. on Multimedia (ACM MM)", 2004.
  • T. Butz and J. Thiran, From Error Probability to Information Theoretic (Multi-Modal) Signal Processing, Signal Processing, Vol. 85, No 5, pp. 875-902, 2005
  • Daniel Gatica-Perez, Alexander Loui, and Ming-Ting Sun, "Finding Structure in Home Videos by Probabilistic Hierarchical Clustering", in "IEEE Transactions on Circuits and Systems for Video Technology", 2003
  • Datong Chen and Jean-Marc Odobez, "Monte Carlo Video Text Segmentation, in ICIP 2003,
  • Daniel Gatica-Perez, Ming-Ting Sun, and Alexander Loui,"Probabilistic Home Video Structuring: Feature Selection and Performance Evaluation'', IDIAP-RR 02-11, 2002, Proc. IEEE International Conference on Image Processing, 2002

 

IM2.IIR, IM2.SP

 

  • J. Ajmera, I. McCowan, and H. Bourlard, "An Online Audio Indexing System", in "", 2004

 

IM2.IP

 

  • Wellner, P., Flynn, M., Tucker, S., Whittaker, S., "A Meeting Browser Evaluation Test", Conference on Human Factors in Computing Systems (CHI), Portand, Oregon, USA, 2nd-7th April 2005.
  • "Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Mael Guillemot, Thomas Hain, Jaroslav Kadlec, Vasilis Karaiskos, Wessel Kraaij, Melissa Kronenthal, Guillaume Lathoud, Mike Lincoln, Agnes Lisowska, Iain McCowan, Wilfried Post, Dennis Reidsma, and Pierre
  • Wellner, ""The AMI Meeting Corpus: A Pre-Announcement"", In proceedings of MLMI05: 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms 11-13 July 2005."

 

IM2.IP, IM2.DI (ex AP), IM2.ACP, IM2.DS, IM2.IIR, IM2.MDM, IM2.MI, IM2.SA, IM2.SP, IM2.VE

 

  • Mael Guillemot, Pierre Wellner, Daniel Gatica-Perez & Jean-Marc Odobez, "A Hierarchical Keyframe User Interface for Browsing Video over the Internet", Ninth IFIP TC13 International Conference on Human-Computer Interaction (Interact 2003), Zurich, September 2003.
  • McCowan and S. Bengio and D. Gatica-Perez and G. Lathoud and F. Monay and D. Moore and P. Wellner and H. Bourlard, "Modeling Human Interaction in Meetings", ICASSP'03

 

IM2.IP, IM2.IIR, IM2.MI

 

  • Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan and Guillaume Lathoud, "Multimodal Group Action Clustering in Meetings", in "ACM 2nd International Workshop on Video Surveillance & Sensor Networks in conjunction with 12th ACM International Conference on Multimedia", 2004

 

IM2.IP, IM2.MDM, IM2.SP

 

  • N. Morgan, D. Baron, S. Bhagat, H. Carvey, R. Dhillon, J. Edwards, D. Gelbart, A. Janin, A. Krupski, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters, "Meetings about meetings: research at ICSI on speech in multiparty conversations", ICASSP-2003, Hong Kong, April 2003.

 

IM2.IP, IM2.SP

 

  • J. Ajmera, I. McCowan, and H. Bourlard, "Robust Speaker Change Detection", in "IEEE Signal Processing Letters", 2003
  • J. Ajmera, G. Lathoud, and I. McCowan, "Clustering And Segmenting Speakers And Their Locations In Meetings", in "ICASSP", 2004
  • Guillaume Lathoud, Iain A. McCowan, and Jean-Marc Odobez, "Unsupervised Location-Based Segmentation of Multi-Party Speech", in "Proceedings of the 2004 ICASSP-NIST Meeting Recognition Workshop", 2004
  • D. Moore and I. McCowan, "Microphone Array Speech Recognition : Experiments on Overlapping Speech in Meetings", in "Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03)", 2003.
  • Guillaume Lathoud, Iain A. McCowan, and Darren C. Moore, "Segmenting Multiple Concurrent Speakers Using Microphone Arrays", in "Proceedings of Eurospeech 2003", September 2003.
  • G. Lathoud and I. McCowan, "Location Based Speaker Segmentation", in "Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03)", 2003

 

IM2.MDM

 

  • Popescu-Belis A., "Compte rendu de Dialogue homme-machine multimodal", Landragin F., T.A.L. : Traitement automatique de la langue, vol. 46, n. 1, p.17-20, 2004.
  • Rajman, Martin and Bui, Trung H. and Rajman, Andréa and Seydoux, Florian and Trutnev, Alex and Quarteroni, Silvia, "Assessing the usability of a dialogue management system designed in the framework of a rapid dialogue prototyping methodology", ACTA ACUSTICA united with ACUSTICA, the Journal of the European Acoustics Association (EAA): International Journal on Acoustics. Vol. 90, no. 6 pp. 1096-1111 ISSN 1610-1928 S. Hirzel Verlag- Stuttgart, Nov./Dec. 2004.
  • Melichar, M., Lisowska, A. Armstrong, S. & Rajman, M., "Rapid Multimodal Dialogue Design: Application in a Multimodal Meeting Retrieval and Browsing System", Joint AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Edinburgh, UK, 2005.
  • Ailomaa, M., Vladimir Kadlec, Jean-Cedric Chappelier, and Martin Rajman, "Efficient Processing of Extra-grammatical Sentences: Comparing and Combining two approaches to Robust Stochastic parsing". Proc. of the International Symposium on Applied Stochastic Models and Data Analysis (ASMDA2005). Brest, France, May 17-20, 2005.
  • Ailomaa, M., Vladimir Kadlec, Jean-Cédric Chappelier, and Martin Rajman, “Robust Stochastic Parsing: Comparing two approaches for Processing Extra-grammatical Sentences”, Proc. of the 15th Nordic Conference of Computational Linguistics (NODALIDA2005). Joensuu, Finland, May 20-21, 2005.
  • Martin Rajman, Pierre Andrews, Maria del Mar Perez Almenta, and Florian Seydoux, "Conceptual document indexing using a large scale semantic dictionary providing a concept hierarchy", Proc. of the International Symposium on Applied Stochastic Models and Data Analysis (ASMDA2005). Brest, France, May 17-20, 2005.
  • Melichar, M., Cenek, P., Rajman, M., "The architecture and design principle of EPFL dialogue management system" Text Speech and Dialogue 2005, Karlovy Vary, Czech Republic, 2005
  • Quarteroni , S., Rajman, M., Melichar, M., "Introducing reset patterns: An extension to a rapid dialogue prototyping methodology", the IEE International Workshop on Intelligent Environments, University of Essex, Colchester, UK. 28-29 June 2005.
  • Georgescul, M., Clark A. & Armstrong S., "Using support vector machines for thematic text segmentation", PASCAL Workshop on Machine Learning, Support Vector Machines, and Large Scale Optimization, Wissenschaftszentrum Schloß Thurnau, Germany.
  • Rajman, M., Bui, T.H., Rajman, A., Seydoux, F., Trutnev, A., Quarteroni, S., "Assessing the usability of a dialogue management system designed in the framework of a rapid dialogue prototyping methodology", Acta Acustica united with Acustica, to appear in 2004
  • Antoine Rozenknop, Jean-Cédric Chappelier, Martin Rajman, "Polynomial Discriminant Tree Substitution Grammars" in revue Traitement Automatique des Langues (TAL), 44(3), Paris, to appear in 2004
  • Antoine Rozenknop, Jean-Cédric Chappelier, and Martin Rajman, "Discriminative Models of SCFG and STSG", Proceedings of the 7th International Conference on Text, Speech Dialogue (TSD 2004), P. Sojka, I. Kopecek K. Pala (Eds.), Lecture Notes in Computer Science, Springer-Verlag, Berlin Heidelberg New York, Brno, Czech Republic, September 8-11, 2004
  • T. H. Bui, M. Rajman, and M. Melichar, “Rapid Dialogue Prototyping Methodology”, Proceedings of the 7th International Conference on Text, Speech Dialogue (TSD 2004), P. Sojka, I. Kopecek K. Pala (Eds.), Lecture Notes in Computer Science, Springer-Verlag, Berlin Heidelberg New York, Brno, Czech Republic, September 8-11, 2004
  • Pallotta V., Ghorbel H., Ruch P. & Coray G., "An Argumentative Annotation Schema for Meeting Discussions", LREC'2004 (Fourth International Conference on Language Resources and Evaluation), Lisbon, Portugal, vol.III, p.1003-1006, 2004
  • Lisowska A., Rajman M., Bui T.H. (in press), " ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings", in Bourlard H. & Bengio S., eds., Multimodal Interaction and Related Machine Learning Algorithms, LNCS, Springer-Verlag, Berlin, 2004
  • Lonneke van der Plas, Vincenzo Pallotta, Martin Rajman, Hatem Ghorbel, "Automatic Keyword Extraction from Spoken Text. A Comparison of two Lexical Resources : EDR & WordNet", Proceedings of the Fourth International Conference on Language Resource and Evaluation (LREC'2004), Lisbon, vol.VI, p.2205-2208, June, 2004
  • Pallotta V., Ghorbel H., Ballim A., Lisowska A., & Marchand-Maillet S., "Towards Meeting Information Systems: Meeting Knowledge Management", proceedings of ICEIS '04 (6th International Conference on Enterprise Information Systems), p.464-469, Porto, Portugal, April 14-17 2004
  • Ghorbel, H., Coray G., Collet O., "L'alignement des documents médiévaux", Hermès Document Numérique, numéro spécial sur Numérisation et patrimoine. vol 7/3-4 pp.27-45, 2003
  • Clark A. and Thollard F., "PAC-learnability of Probabilistic Deterministic Finite State Automata", Journal of Machine Learning Research, n. 5, p.473-497, May 2004
  • Cartoni B. Bouillon P., Alphonse Y., Lehmann S, "Automatisation of the Activity of Term Collection in Different Languages", proc. of LREC'2004 (Fourth International Conference on Language Resources and Evaluation), Lisbon, Portugal, 2004
  • Clark A. & Popescu-Belis A., "Multi-level Dialogue Act Tags", proc. of SIGDIAL'04 (5th SIGdial Workshop on Discourse and Dialogue), Cambridge, MA, USA, p.163-170, 2004
  • Lisowska A., Popescu-Belis A. & Armstrong S., "User Query Analysis for the Specification and Evaluation of a Dialogue Processing and Retrieval System", proc. of LREC'2004 (Fourth International Conference on Language Resources and Evaluation), Lisbon, Portugal, vol.III, p.993-996, 2004
  • Popescu-Belis A., "Abstracting a Dialog Act Tagset for Meeting Processing", proc. of LREC'2004 (Fourth International Conference on Language Resources and Evaluation), Lisbon, Portugal, vol.IV, p.1415-1418, 2004
  • Popescu-Belis A., Rigouste L., Salmon-Alt S. & Romary L., "Online Evaluation of Coreference Resolution", proc. of LREC'2004 (Fourth International Conference on Language Resources and Evaluation), Lisbon, Portugal, vol.IV, p.1507-1510, 2004
  • Popescu-Belis A., Georgescul M., Clark A. & Armstrong S., "Building and Using a Corpus of Shallow Dialogue Annotated Meetings", proc. of LREC'2004 (Fourth International Conference on Language Resources and Evaluation), Lisbon, Portugal, vol.IV, p.1451-1454, 2004
  • Van der Plas L., Pallotta V., Rajman M. & Ghorbel H., "Automatic Keyword Extraction from Spoken Text. A Comparison of Two Lexical Resources: EDR and WordNet", proc. of LREC'2004 (Fourth International Conference on Language Resources and Evaluation), Lisbon, Portugal, vol.VI, p.2205-2208, 2004
  • Zufferey S. & Popescu-Belis A., "Towards Automatic Disambiguation of Discourse Markers: the Case of 'Like' ", proc. of SIGDIAL'04 (5th SIGdial Workshop on Discourse and Dialogue), Cambridge, MA, USA, p.63-71, 2004
  • "Proceedings of the Second International Workshop on Generative Approaches to the Lexicon", P. Bouillon and K. Kanzaki (eds), May 15-17 2003, Geneva, Switzerland
  • Popescu-Belis A. "Evaluation-Driven Design of a Robust Reference Resolution System", Natural Language Engineering, vol. 9, n. 2, p.1-26. 2003
  • Armstrong S., Clark A., Coray G., Georgescul M., Pallotta V., Popescu-Belis A., Portabella D., Rajman M. & Starlander M. "Natural Language Queries on Natural Language Data: a Database of Meeting Dialogues", NLDB 2003 (8th International Conference on Applications of Natural Language to Information Systems), Burg/Cottbus, Germany. 2003
  • Pallotta V., "A computational dialectics approach to meeting tracking and understanding, in Materiali Linguistici" , Franco Angeli, October 2003
  • Clark, A. "Preprocessing very noisy text. Workshop on Shallow Processing of Large Corpora" Corpus Linguistics 2003, Lancaster, UK. http://issco-www.unige.ch/staff/clark/im2mdmwp.html
  • Palotta, V. "Tutorial on Computational Dialogue Models", EACL, Budapest, Hungary 2003
  • Zufferey S. & Popescu-Belis A. "Automating the corpus- based study of the acquisition of pragmatic markers", Eighth International Pragmatics Conference, Toronto, Canada, 2003.
  • Popescu-Belis A. "Compte rendu de Multimodality in Language and Speech Systems", Granstrom B., House D. & Karlsson I., eds., 2002. In T.A.L: Traitement automatique de la langue, vol. 44, n. 1, 2003.
  • Rajman M., Rajman A., Seydoux F., Trutnev A. "Prototypage rapide et évaluation de modèles de dialogue finalisés", Actes de la 10eme conférence sur le Traitement Automatique des Langues Naturelles (TALN 2003), Batz-sur-Mer, France, June 2003.
  • Rajman M., Rajman A., Seydoux F., Trutnev A. "Assessing the usability of a dialogue management system designed in the framework of a rapid dialogue prototyping methodology", Proc. of the 1st ISCA Workshop on Auditory Quality of Systems, Mont Cenis, Germany, April, 2003.
  • Seydoux F., Trutnev A., Rajman M. "Dialogue Management with weak speech recognition : a pragmatic approach", ISCA workshop on Error handling in dialogue systems, , p.133-138, Chateau-d'Oex, Switzerland, August 2003.
  • Zufferey S."Discourse markers in dialogue: relevance-theoretic analysis and corpus-based validation", 8th IPrA Conference (International Pragmatics Association), Toronto, Canada.2003
  • "V. Claveau, P. Sébillot, C. Fabre and P. Bouillon,  «Learning Semantic Lexicons from a Part-of-Speech and Semantically Tagged,  «Corpus using Inductive Logic Programming'', in Journal of Machine Learning Research (JMLR)},
  • Special issue on inductive logic programming, 2002, to appear"
  • Bouillon, Pierrette, Claveau, Vincent, Fabre, Cécile and Sébillot, Pascale, "Acquisition of Qualia Elements from Corpora: Evaluation of a Symbolic Learning Method'', in Proceedings of LREC 2002 (Language Resources and Engineering Conference), Las Palmas, Spain, 2002
  • Clark, Alexander, "Memory-Based Learning of Morphology with Stochastic Transducers'', in Proceedings of ACL 2002 (Association for Computational Linguistics), Philadelphia, PA,USA, 2002
  • Popescu-Belis, Andrei, "The Use of Referring Cases and of Individual/Categorial Salience to Represent Discourse Referents in Narrative Texts'', in Proceedings of DAARC-4 (Discourse Anaphora and Anaphor Resolution Colloquium), Lisbon,Portugal, 2002
  • Rayner, Manny and Bouillon, Pierrette, "A flexible Speech to Speech Phrasebook Translator'', in Proceedings of ACL-02 Workshop on Speech-to-Speech Translation: Algorithms and Systems, Philadelphia, PA, USA, 2002
  • Hovy, E., King, M. and Popescu-Belis, A., "Computer-aided Specification of Quality Models for Machine Translation Evaluation'', in Proceedings of LREC02, p.1239-1246, Las Palmas de Gran Canaria, Spain, 2002
  • Popescu-Belis, A., Armstrong, S. and Robert, G.,"Secure Electronic Dictionary Distribution Through the DicoPro Server'', in Proceedings of LREC 2002, p.1144-1149, Las Palmas de Gran Canaria, Spain, 2002
  • Hovy, E., King, M. and Popescu-Belis, A.,`"An Introduction to MT Evaluation'', in Workbook of the LREC 2002 Workshop on Machine Translation Evaluation: Human Evaluators Meet Automated Metrics, p.1-7, Las Palmas de Gran Canaria, Spain, 2002
  • Dabbadie, M., Hartley, A., King, M., Miller, K.J., Mustafa El Hadi, W.,Popescu-Belis, A., Reeder, F. and Vanni, M., "A Hands-On Study of the Reliability and Coherence of Evaluation Metrics'', in Workbook of the LREC 2002 Workshop on Machine Translation Evaluation: Human Evaluators Meet Automated Metrics, p.8-16, Las Palmas de Gran Canaria, Spain, 2002
  • Franck Thollard and Alexander Clark, "Apprentissage d'Automates Probabilistes Déterministes'', Proceedings of CAp 2002
  • Franck Thollard and Alexander Clark, "Détection de Groupes Nominaux par Inférence Grammaticale Probabiliste'', Proceedings of CAp 2002

 

IM2.MDM, IM2.SP

 

  • D. Hillard, M. Ostendorf, and E. Shriberg, "Detection Of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data", Proc. HLT-NAACL Conference, Edmonton, Canada, May 2003.
  • B. Wrede and E. Shriberg, "Spotting "Hot Spots" in Meetings: Human Judgements and Prosodic Cues", EUROSPEECH 2003, Geneva, September 2003 (In Press)
  • S. Bhagat, H. Carvey, E. Shriberg, "Automatically Generated Prosodic Cues to Lexically Ambiguous Dialog Acts in Multiparty Meetings", ICPhS 2003, Barcelona, August 2003.

 

IM2.MI

 

  • Mark Barnard and Jean-Marc Odobez, "Robust playfield segmentation using map adaptation", in Proc. 17th International Conference on Pattern Recognition (ICPR 2004), Cambridge, United Kingdom, August 2004. (IDIAPRR 03-77)
  • S. Bengio and H. Bourlard, "Multi channel sequence processing", in J. Winkler, N. Lawrence, and M. Niranjan, editors, The Shefield Machine Learning Workshop, volume LNCS 3635, Springer-Verlag, 2005.
  • Silvia Chiappa and David Barber, "Generative independent component analysis for eeg classification", in European Symposium on Artificial Neural Networks ESANN, 2005.
  • Silvia Chiappa and David Barber, "Generative temporal ica for classification in asynchronous bci systems", in The 2nd International IEEE EMBS Conference On Neural Engineering, 2005. (IDIAP-RR 05-08)
  • Christos Dimitrakakis and Samy Bengio, "Boosting hmms with an application to speech recognition", in IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2004. (IDIAP-RR 03-41)
  • Christos Dimitrakakis and Samy Bengio, "Online policy adaptation for ensemble classifiers", in 12th European Symposium on Artificial Neural Networks, ESANN 04, 2004. (IDIAP-RR 03-69)
  • Christos Dimitrakakis and Samy Bengio, "Boosting word error rates", in IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP, 2005. (IDIAP-RR 04-49)
  • P.W. Ferrez and J. del R. Millán, "You are wrong! automatic detection of interaction errors from brain waves", in Proceedings of the 19th International Joint Conference on Artificial Intelligence, Edinburgh, UK, August 2005.
  • D. Gatica-Perez, I. McCowan, D. Zhang, and S. Bengio, "Detecting group interest-level in meetings", in IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2005.
  • R. Grave de Peralta Menendez, S. Gonzalez Andino, L. Perez, P.W. Ferrez, and J. del R. Millán, "Non-invasive estimation of local field potentials for neuroprosthesis control", Cognitive Processing, Special Issue on Motor Planning in Humans and Neuroprosthesis Control, 2005.
  • McCowan, D. Gatica-Perez, S. Bengio, G. Lathoud, M. Barnard, and D. Zhang, "Automatic analysis of multimodal group actions in meetings", IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004. (to appear)
  • J. del R. Millán, "Interfaces cerebrales", Mente y Cerebro, 2005.
  • J.-F. Paiement, D. Eck, S. Bengio, and D. Barber, "A graphical model for chord progressions embedded in a psychoacoustic space", in Proceedings of the 22nd International Conference on Machine Learning, 2005. (IDIAP-RR 05-33)
  • Marc Al-Hames, Alfred Dielmann, Daniel Gatica-Perez, Stephan Reiter, Steve Renals, and Dong Zhang, "Multimodal Integration for Meeting Group Action Segmentation and Recognition", MLMI, July, 2005. (IDIAP-RR 31)
  • Dong Zhang, Daniel Gatica-Perez, and Samy Bengio, "Semi-supervised Meeting Event Recognition with Adapted HMMs", in Prof. IEEE ICME, July, 2005. (IDIAP-RR 15)
  • Dong Zhang, Daniel Gatica-Perez, and Samy Bengio, "Semi-supervised Adapted HMMs for Unusual Event Detection", in Prof. IEEE CVPR, June, 2005. (IDIAP-RR 80)
  • J. del R. Millán, F. Renkens, J. Mouri no, and W. Gerstner, "Brain-Actuated Interaction", in "Artificial Intelligence", 2004
  • R. Collobert, Y. Bengio, and S. Bengio, "Scaling Large Learning Problems with Hard Parallel Mixtures", in "International Journal on Pattern Recognition and Artificial Intelligence (IJPRAI)", 2003
  • J. del R. Millán, "On the Need for On-Line Learning in Brain-Computer Interfaces", in "Proceedings of the International Joint Conference on Neural Networks", 2004
  • S. Chiappa and S. Bengio, "HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems", in "European Symposium on Artificial Neural Networks ESANN", 2004
  • R. Collobert and S. Bengio, "Links Between Perceptrons, MLPs and SVMs", in "International Conference on Machine Learning, ICML", 2004
  • Christos Dimitrakakis and Samy Bengio, "Online Policy Adaptation for Ensemble Classifiers", in "12th European Symposium on Artificial Neural Networks, ESANN 04", 2004
  • R. Collobert and S. Bengio, "A Gentle Hessian for Efficient Gradient Descent", in "IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP", 2004
  • J. del R. Millán, "Restoring Locomotion with a Thought Controlled Mobile Robot", in "Proceedings of the 4th Forum of European Neuroscience", 2004
  • J. del R. Millán, F. Renkens, J. Mouri no, and W. Gerstner, "Non-Invasive Brain-Actuated Control of a Mobile Robot", in "Proceedings of the 18th International Joint Conference on Artificial Intelligence", 2003
  • F. Cincotti, A. Scipione, A. Tiniperi, D. Mattia, M.G. Marciani, J. del R. Millán, S. Salinari, L. Bianchi, and F. Babiloni, "Comparison of different feature classifiers for brain computer interfaces", in "Proceedings of the 1st International IEEE EMBS Conference on Neural Engineering", 2003
  • T. I. Alecu, S. Voloshynovskiy, T. Pun, "EEG cortical imaging: a vector field approach for Laplacian denoising and missing data estimation", ISBI 2004, 2004 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, Arlington, VA, USA, April 15-18, 2004
  • T. I. Alecu, S. Voloshynovskiy, T. Pun, "Regularized two-step brain activity reconstruction from spatio-temporal EEG data", SPIE Int. Symp. Optical Science and Technol., Conf. Image Reconstruction from Incomplete Data III (AM320), Denver, Colorado, USA, 2-6 August 2004
  • T. I. Alecu, S. Voloshynovskiy, T. Pun, "Localization properties of an EEG sensor system: lower bounds and optimality", EUSIPCO-2004, 12th Eur. Signal Proc. Conf., Vienna, Austria, Sept. 7-10, 2004
  • Grave de Peralta Menendez, R, Murray, M. M., Michel, C.M., Martuzzi, R., Gonzalez Andino S.L., "Electrical Neuroimaging Based on Biophysical Constraints", Neuroimage 21, 527–539, 2004
  • Grave de Peralta Menendez, R, Murray, M., Gonzalez Andino S., "Improving the Performance of Linear Inverse Solutions by Inverting the Resolution Matrix", IEEE Transactions on Biomedical Engineering. In press
  • Elly Gysels, José del R. Millán, Silvia Chiappa, Patrick Celka, "Studying Phase Synchrony for Classification of Mental Tasks in Brain Machine Interfaces", accepted for 14th Conference of the International Society for Brain Electromagnetic Topography, November 19-23, Santa Fe, New Mexico, 2003
  • Millán, J. del R, "Adaptive brain interfaces for communication and control", Proc. of the 10th Int. Conf. on Human-Computer Interaction (HCII-2003). Special Session on "User Interfaces for the Age of the Disappearing Computer", Crete, Greece. June 2003. Invited paper
  • Millán, J. del R., Renkens, F., Mouriño, J., &, Gerstner, W., "Non-invasive brain-actuated control of a mobile robot", Proc. of the Int. Joint Conf. on Artificial Intelligence (IJCAI-03). Acapulco, Mexico. August 2003.
  • Science, vol. 299, 24/01/03, pp. 496-499. Focus article on BCI reviewing Millán's work as one of the world's key researchers in the field.
  • J. Kronegg, T. Alecu, T. Pun, "Information theoretic bit-rate optimization for average trial protocol Brain-Computer Interfaces", HCI International 2003, 10th International Conference on Human-Computer Interaction, Crete, Greece, June 22-27, 2003,
  • J. del R. Millán and J. Mouri no, "Asynchronous BCI and Local Neural Classifiers: An Overview of the Adaptive Brain Interface Project", in IEEE Trans. on Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interface Technology", 2003.
  • J. del R. Millán, "Adaptive Brain Interfaces", in "Communications of the ACM", vol. 46, n. 3, pp. , 2003. Invited paper.
  • Collobert, R., Bengio, S., and Bengio, Y., "A Parallel Mixture of SVMs for Very Large Scale Problems", in "Neural Computation", 2002.
  • Collobert, R., Bengio, Y., and Bengio, S., "Scaling Large Learning Problems with Hard Parallel Mixtures", in "International Workshop on Pattern Recognition with Support Vector Machines, SVM'2002", 2002.
  • J. Mouri no, S. Chiappa, R. Jané, and J. del R. Millán, "Evolution of the Mental States Operating a Brain-Computer Interface", in "Proceedings of the International Federation for Medical and Biological Engineering", 2002.
  • R. Grave de Peralta Menendez, S. Gonzalez Andino, J. Millan, T. Pun, C. M. Michel, "Direct non-invasive brain computer interfaces", Human Brain Mapping 2003, June 18-22, 2003, New York, USA
  • Francesco Camastra and Alessandro Vinciarelli, "Estimating the Intrinsic Dimension of Data with a Fractal-Based Method'', IDIAP-RR-02-02, 2002, accepted for publication in IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), to appear

 

IM2.MI, IM2.IP

 

  • Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan, and Guillaume Lathoud, "Modeling Individual and Group Actions in Meetings: a Two-Layer HMM Framework",in "the Second IEEE Workshop on Event Mining: Detection and Recognition of Events in Video, In Association with CVPR", 2004

 

IM2.MI, IM2.SA

 

  • Mark Barnard, Jean-Marc Odobez, and Samy Bengio, "Multi-Modal Audio-Visual Event Recognition for Football Analysis", IDIAP-RR 03-12, 2003.
  • Francesco Camastra and Alessandro Vinciarelli, "Combining Neural Gas and Learning Vector Quantization for Cursive Character Recognition, in "Neurocomputing", 2003.
  • Gunter, S., Bunke, H., "Generating classifier ensembles from multiple prototypes and its application to handwriting recognition'', in Roli, F., Kittler, J. (Eds.) Multiple Classifier Systems, Proceedings MCS 2002, LNCS 2364, Springer, pp.179-188, 2002

 

IM2.MI, IM2.SA, IM2.SP

 

  • D. Gatica-Perez, G. Lathoud, I. McCowan, J.-M. Odobez, and D. Moore, "Audio-Visual Speaker Tracking with Importance Particle Filters", Proc. IEEE ICIP, Barcelona, Sept 2003.
  • Torsten Butz and Jean-Philippe Thiran, "Feature Space Mutual Information in Video-Speech Sequences", IEEE ICME 2002; Lausanne; Switzerland; 2002, Vol. 2 , pp. 361364. http://ltswww.epfl.ch/~brain/publications/torsten/conferences/2002/icme/icme.pdf

 

IM2.MI, IM2.SP

 

  • Hamed Katebdar, Jithendra Vepa, Samy Bengio and Herve Bourlard, "Developing and Enhancing Posterior based Speech Recognition Systems", INTERSPEECH 2005, Lisbon, Portugal, Sept. 2005. (accepted)
  • Hamed Katebdar, Herve Bourlard and Samy Bengio, "Hierarchical Multi-Stream Posterior based Speech Recognition System", Proceedings of MLMI’05 workshop, Edinburgh, U.K., July, 2005.
  • S. Bengio, "An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition", in "Advances in Neural Information Processing Systems, NIPS 15", 2003
  • S. Bengio, "Multimodal Speech Processing Using Asynchronous Hidden Markov Models", in "Information Fusion", 2003.
  • T.A.Stephenson, M.Magimai-Doss, H.Bourlard, "Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition,'' accepted for publication in Intl. Conf. on Pattern Recognition} (ICPR 2002), August 11-15, Quebec City, 2002
  • Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard, "Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition'', IDIAP-RR 02-25, 2002, Seventh International Conference on Spoken Language Processing (ICSLP 2002), September 2002
  • Todd A. Stephenson, Jaume Escofet, Mathew Magimai-Doss, an Hervé Bourlard, "Dynamic Bayesian Network Based Speech Recognition with Pitch and Energy as Auxiliary Variables'', IDIAP-RR 02-24, 2002, 2002 IEEE International Workshop on Neural Networks for for Signal Processing (NNSP 2002), September 2002

 

IM2.SA

 

  • G. Monaci and P. Vandergheynst, "Learning structured dictionaries for image representation", Proc. of IEEE International Conference on Image Processing (ICIP '04), Singapore, pp. 2351-2354, October 2004.
  • G. Monaci, P. Jost and P. Vandergheynst, "Image compression with learnt tree-structured dictionaries", Proc. of IEEE International Workshop on Multimedia Signal Processing (MMSP '04), Siena, pp. 35-38, September 2004.
  • Günter, S., Bunke, H., "Off-line cursive handwriting recognition using multiple classifier systems - on the influence of vocabulary, ensemble, and training set size", Optics and Lasers in Engineering 43, p 437 – 454, 2005.
  • Varga, T., Bunke, H., "Off-line handwritten word recognition using synthetic training data produced by means of a geometrical distortion model", Int. Journal of Pattern Recognition and Art. Intelligence, Vol. 18, No. 7, 1285 – 1302, 2004.
  • Günter, S., Bunke, H., "Multiple classifier systems in off-line handwritten word recognition – on the influence of training set and vocabulary size", Int. Journal of Pattern Recognition and Art. Intelligence, Vol. 18, No. 7, p 1303 – 1320, 2004.
  • Günter, S., Bunke, H., "Handwritten word recognition using classifier ensembles generated from multiple prototypes", Int. Journal of Pattern Recognition and Art. Intelligence, Vol. 18, No. 5, p 957 – 974, 2004.
  • Bertolami, R., Bunke, H., "Ensemble methods for handwritten text line recognition systems", Proc. IEEE Int. Conference on Systems, Man, and Cybernetics, Hawaii, 2005
  • Liwicki, M., Bunke, H., "Handwriting Recognition of Whiteboard Notes", Proc. 12th Conference of the International Graphonomics Society, p 118 – 122, 2005.
  • Varga, T., Kilchhofer, D., Bunke, H., "Template-based Synthetic Handwriting Generation for the Training of Recognition Systems", Proc. 12th Conference of the International Graphonomics Society, p 206 – 211, 2005.
  • Liwicki, M., Bunke, H., "Enhancing Training Data for Handwriting Recognition of Whiteboard Notes with Samples from a Different Database", Proc. 8th Int. Conf. on Document Analysis and Recognition, 2005
  • Liwicki, M., Bunke, H., "IAM-OnDB – an On-Line English Sentence Database Acquired from Handwritten Text on a Whiteboard", Proc. 8th Int. Conf. on Document Analysis and Recognition, 2005
  • Varga, T., Bunke, H., "Tree Structure for Word Extraction from Handwritten Text Lines", Proc. 8th Int. Conf. on Document Analysis and Recognition, 2005
  • Bertolami, R., Bunke, H., "Multiple Handwritten Text Recognition Systems Derived from Specific Integration of a Language Model", Proc. 8th Int. Conf. on Document Analysis and Recognition, 2005
  • Günter, S., Bunke, H., "Ensembles of classifiers for handwritten word recognition specialized on individual handwriting styles", in Marinai, S., Dengel, A. (eds.): Document Analysis Systems IV, Proc. 6th Int. Workshop, Springer LNCS 3163, p 286 – 297, 2004.
  • Günter, S., Bunke, H., "Combination of three classifiers with different architectures for handwritten word recognition", Proc. 9th Int. Workshop on Frontiers in Handwriting Recognition, p 63 – 68, 2004.
  • Zimmermann, M., Bunke, H., "N-gram language models for offline handwritten text recognition", Proc. 9th Int. Workshop on Frontiers in Handwriting Recognition, p 203 – 208, 2004.
  • Varga, T., Bunke, H., "Comparing natural and synthetic training data for off-line cursive handwritten text recognition", Proc. 9th Int. Workshop on Frontiers in Handwriting Recognition, p 221 – 225, 2004.
  • Pekalska, E., Duin, R., Günter, S., Bunke, H., "On not making dissimilarities Euclidean", in Fred, A. et al. (eds.): Structural, Syntactic, and Statistical Pattern Recognition, Proc. Joint IAPR Int. Workshops SSPR and SPR, Springer LNCS 3138, p 1145 – 1154, 2004.
  • Günter, S., Bunke, H., "An evaluation of ensemble methods in handwritten word recognition based on feature selection", Proc. 17th Int. Conference on Pattern Recognition, Vol I, p 388 – 392, 2004.
  • Varga, T., Bunke, H., "Off-line handwritten textline recognition using a mixture of natural and synthetic training data", Proc. 17th Int. Conference on Pattern Recognition, Vol II, p 545 – 549, 2004.
  • Zimmermann, M., Bunke, H., "Optimizing the integration of a statistical language model in HMM based offline handwritten text Recognition", Proc. 17th Int. Conference on Pattern Recognition, Vol II, p 541 – 544, 2004.
  • M. Bray, E. Koller-Meier, P. Mueller, L. Van Gool and N.N. Schraudolph, "Stochastic Optimization for High-Dimensional Tracking in Dense Rage Maps", IEE journal in 'Vision, Image & Signal Processing', Will be published in 2005.
  • M. Bray, E. Koller-Meier, and L. Van Gool, "Smart particle filtering for high-dimensional tracking", Computer Vision and Image Understanding (to be published)
  • Griesser, T. P. Koninckx and L. Van Gool, "Adaptive Real-Time 3D Acquisition and Contour Tracking within a Multiple Structured Light System", 12th Pacific Conference on Computer Graphics and Applications (PG2004), October 2004.
  • Griesser and L. Van Gool, "A High-Speed Adaptive Multi-Module Structured Light Scanner", CapTech 2004, December 2004.
  • Griesser and L. Van Gool, "RTSyncNet - A flexible Real-Time Synchronisation Network for Cluster based Vision- and Graphics-Architectures", Proceedings of the second conference on visual information engineering (VIE 2005), April 2005.
  • D. Roth, P. Doubek and L. Van Gool, "Bayesian Pixel Classification for Human Tracking", IEEE Workshop on Motion and Video Computing (MOTION), January 2005.
  • T. Jaeggli, G. Caenen, R. Fransens and L. Van Gool, "Analysis of Human Locomotion Based on Partial Measurements", Proceedings of IEEE Motion 2005, January 2005.
  • T. Jaeggli, T. P. Koninckx and L. Van Gool, "Model-based Sparse 3D Reconstruction for Online Body Tracking", Proceedings of IST/SPIE's 17th annual symposium on electronic imaging - videometrics VIII, January 2005.
  • Fasel, F. Monay, and D. Gatica-Perez, "Latent Semantic Analysis of Facial Action Codes for Automatic Facial Expression Recognition", International Conference on Multimedia, Workshop on Video Surveillance and Sensor Networks (ACM MM-VSSN), October 2004.

  • D. Serby, E. Koller-Meier and L. Van Gool, "Probabilistic Object Tracking Using Multiple Features", International Conference on Pattern Recognition (ICPR04), August 2004.
  • R. Kehl, M. Bray, L. Van Gool, "Full Body Tracking from Multiple Views using Stochastic Sampling", International Conference on Computer Vision and Pattern Recognition, San Diego, June 2005.
  • Geys and L. Van Gool, "Hierarchical coarse to fine Depth Estimation for Realistic View Interpolation", 3DIM, Ottawa, June 2005.
  • G. Caenen, R. Fransens, L. Van Gool, "Analysis of Human Locomotion Based on Partial Measurements", IEEE Motion Workshop, Breckenridge, 2005.
  • V. Popovici, S. Bengio and J. Thiran, "Kernel Matching Pursuit for Large Datasets", Pattern Recognition, 2005. (in press)
  • J. Meynet, V. Popovici and J. Thiran, "Mixture of SVMs for Face Class Modeling, MLMI'04", Proceedings of the Workshop on Machine Learning for Multimodal Information, September 2004.
  • J. Meynet, V. Popovici and J. Thiran, "Face Class Modeling Using Mixture of SVMs", in Proceedings of International Conference on Image Analysis and recognition, ICIAR 2004, Porto, Portugal, September 2004.
  • G. Antonini and J. Thiran, "Trajectories clustering in ICA space: an application to automatic counting of pedestrians in video sequences", Advanced Concepts for Intelligent Vision Systems, ACIVS 2004, Brussels, Belgium, September 2004.
  • G. Antonini, S. Venegas, J. Thiran and M. Bierlaire, "A discrete choice pedestrian behavior model for pedestrian detection in visual tracking systems", Advanced Concepts for Intelligent Vision Systems, ACIVS 2004, Brussels, Belgium, September 2004.
  • S. Venegas, G. Antonini, J. Thiran and M. Bierlaire, "Bayesian Integration of a Discrete Choice Pedestrian Behavioral Model and Image Correlation Techniques for Automatic Multi Object Tracking", 2004 IEEE International Conference on Image Processing, ICIP2004, Singapore, October 2004.
  • J. Meynet, V. Popovici, M. Sorci and J. Thiran, Combining SVMs for Face Class Modeling, 13th European Signal Processing Conference - EUSIPCO, 2005.
  • M. Gurban and J. Thiran, "Audio-visual speech recognition with a hybrid SVM-HMM system", 13th European Signal Processing Conference - EUSIPCO 2005, 2005.
  • D. Biliotti, G. Antonini and J. Thiran, "Multi-layer hierarchical clustering of pedestrian trajectories for automatic counting of people in video sequences", Proceedings of the IEEE Workshop on Motion and Video Computing (WACV/MOTION’05), January 2005.
  • X. Bresson, P. Vandergheynst and J. Thiran, "Multiscale Active Contours", Proceedings of 5th International Conference on Scale Space and PDE methods in Computer Vision, Hofgeismar, Germany, 2005.
  • P. Zehnder, E. Koller-Meier, R. Fransens and L. Van Gool, "A Hierarchical System for Recognition, Tracking and Pose Estimation", Cognitive Vision Systems, Springer, pp. 329-340, 2005.
  • J.-M. Odobez, D. Gatica-Perez et S. Ba, "Utilisation du Mouvement Visuel en Suivi par Filtrage Particulaire", Traitement du Signal, 2005. (in press)
  • (*) D. Gatica-Perez, G. Lathoud, J.-M. Odobez, and I. McCowan, "Multimodal Multispeaker Prob-abilistic Tracking in Meetings", in Proc. ICMI, Trento, Oct. 2005 (in press)
  • S. Ba and J.-M. Odobez, "Evaluation of Multiple Cue Head Pose Estimation Algorithms in Natural Environements", in Proc. IEEE ICME, Amsterdam, Jul. 2005.
  • (*) I. McCowan, M. Hari-Krishna, D. Gatica-Perez, D. Moore, and S. Ba, "Speech Acquisition in Meetings with an Audio-Visual Sensor Array", in Proc. IEEE ICME, Amsterdam, Jul. 2005.
  • K. Smith, D. Gatica-Perez, and J.-M. Odobez, "Using Particles to Track Varying Numbers of Interacting People", in Proc. IEEE CVPR, San Diego, June 2005.
  • K. Smith, D. Gatica-Perez, J.-M. Odobez, and S. Ba, "Evaluating Multi-Object Tracking, in Proc. IEEE CVPR-EEMCV", San Diego, June 2005.
  • (*) D. Gatica-Perez, J.-M. Odobez, S. Ba, K. Smith, and G. Lathoud, "Tracking People in Meetings with Particles", in Proc. WIAMIS, Montreux, Apr. 2005.
  • F. Deguillaume, Y. Rytsar, S. Voloshynovskiy and T. Pun, "Data-hiding based text document security and automatic processing", IEEE International Conference on Multimedia & Expo (ICME) 2005, Amsterdam, The Netherlands, July 6-8 2005.
  • O. Koval, S. Voloshynovskiy, F. Deguillaume, F. Perez-Gonzalez and T. Pun, "Robustness improvement of known-host-state watermarking using host statistics", Proceedings of SPIE Photonics West, Electronic Imaging 2005, Security, Steganography, and Watermarking of Multimedia Contents VII (EI120), San Jose, USA, January 16-20 2005.
  • Y. Rytsar, S.Voloshynovskiy, O.Koval, F.Deguillaume, S.Startchik, and T. Pun, "Document interactive navigation in multimodal databases", MLMI'05, 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms,Edinburgh, UK, 11-13 July 2005
  • E. Topak, S. Voloshynovskiy, O. Koval, M.K. Mihcak and T. Pun, "Security analysis of robust data hiding with geometrically structured codebooks", Proceedings of SPIE Photonics West, Electronic Imaging 2005, Security, Steganography, and Watermarking of Multimedia Contents VII (EI120), San Jose, USA, January 16-20 2005.
  • E. Topak, S. Voloshynovskiy, O. Koval, J. E. Vila-Forcén and T. Pun, "On Security of Geometrically-Robust Data-Hiding", WIAMIS 2005, 6th International Workshop on Image Analysis for Multimedia Interactive Services, Montreux, Switzerland, April 13-15 2005.

  • J.E. Vila-Forcén, S. Voloshynovskiy, O. Koval, F. Pérez-González and T. Pun, "Worst case additive attack against quantization-based data-hiding methods", Proceedings of SPIE Photonics West, Electronic Imaging 2005, Security, Steganography, and Watermarking of Multimedia Contents VII (EI120), San Jose, USA, January 16-20 2005.
  • J.E. Vila-Forcen, S. Voloshynovskiy, O. Koval and T. Pun, "Asymmetric spread spectrum data-hiding for Laplacian host data", IEEE International Conference on Image Processing, Genova, Italy, 11-14 September 2005.
  • R. Villán, S. Voloshynovskiy, O. Koval and T. Pun, "Multilevel 2D Bar Codes: Towards High Capacity Storage Modules for Multimedia Security and Management", Proceedings of SPIE Photonics West, Electronic Imaging 2005, Security, Steganography, and Watermarking of Multimedia Contents VII (EI120), San Jose, USA, January 16-20 2005.
  • S. Voloshynovskiy, O. Koval, F. Pérez-González, M. Kivanc Mihcak, J.E. Vila-Forcén and T. Pun, "Data-hiding with partially available side information", EUSIPCO 2005, 13th European Signal Processing Conference, Antalya, September 4-8 2005.
  • S. Voloshynovskiy, P. Comesana, O. Koval, E. Topak, J. E. Vila Forcen and T. Pun, "On reversibility of random binning techniques: multimedia perspectives", 9th IFIP TC-6 TC-11 CMS 2005, Conf. on Communications and Multimedia Security, 19-21, Salzburg, Austria, September 2005..
  • S. Voloshynovskiy, O. Koval, F. Pérez-González, K. Mihcak and T. Pun, "Data-hiding with host state at the encoder and partial side information at the decoder", IEEE Transactions on Signal Processing, 2005. (to appear)
  • S. Voloshynovskiy, O. Koval and T. Pun, "Image denoising based on the edge-process model", Signal Processing, 2005. (to appear)
  • Luis Pérez-Freire, Fernando Pérez-González, and Sviatoslav Voloshinovskiy, "An accurate analysis of scalar quantization-based data-hiding", IEEE Transactions on Information Forensics and Security, 2005. (to appear)
  • J. Wagner and P. Frossard, "Playback Delay Optimization in Scalable Video Streaming", Proceedings of the IEEE International Conference on Multimedia and Expo, July 2005.
  • J. Wagner and P. Frossard, "Playback Delay and Buffering Optimization in Scalable Video Broadcasting", Proceedings of First International Conference on Multimedia Services Access Networks (MSAN), June 2005 [Invited Paper]
  • L. Granai, E. Maggio, L. Peotta and P. Vandergheynst, Hybrid Video Coding based on Bidimensional Matching Pursuit, EURASIP - Journal on Applied Signal Processing, Vol. 2004, No 17, pp. 2705-2714, December 2004
  • Bogdanova, P. Vandergheynst, J. Antoine, L. Jacques and M. Morvidone, "Stereographic Wavelet Frames on the Sphere", Applied and Computational Harmonic Analysis, 2005. (in press)
  • R. Gribonval, R. Figueras i Ventura and P. Vandergheynst, "A simple test to check the optimality of a sparse signal approximation", EURASIP Signal Processing, Special Issue on Sparse Approximations in Signal and Image Processing, 2005. (accepted)
  • L. Peotta, L. Granai and P. Vandergheynst, "Image compression using an edge adapted redundant dictionary and wavelets Signal", EURASIP Signal Processing, Special Issue on Sparse Approximations in Signal and Image Processing, 2005. (accepted)
  • R. Figueras i Ventura, P. Vandergheynst and P. Frossard, "Low rate and flexible image coding with redundant representations", IEEE Transactions on Image Processing, February 2005. (accepted)
    • Rahmoune, P. Vandergheynst and P. Frossard, "Flexible Motion-Adaptive Video Coding with Redundant Expansions", IEEE Transactions on Circuits and Systems for Video Technology, May 2005. (accepted)
  • S. Bilavarn, E. Debes, P. Vandergheynst and J. Diguet, "Processor Enhancements for Media Streaming Applications", Journal of VLSI Signal Processing-Systems for Signal, Image, and Video Technology, Vol. 41, No 2, September 2005. (accepted)
  • R. Gribonval, R. Figueras i Ventura and P. Vandergheynst, "A simple test to check the optimality of sparse signal approximations", Acoustics, Speech, and Signal Processing 2005, Proceedings. (ICASSP '05). IEEE International Conference, March 2005
  • Tosic, P. Frossard and P. Vandergheynst, "Progressive low bit rate coding of simple 3D objects with Matching Pursuit", Proceedings of the IEEE Data Compression Conference, March 2005
  • M. Flierl and P. Vandergheynst, "Video Coding with Motion-Compensated Temporal Transforms and Side Information", Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, March 2005.
    • Rahmoune, P. Vandergheynst and P. Frossard, "Scalable Motion-Adaptive Video Coding with Redundant Representations", Proceedings of the Picture Coding Symposium, PCS, December 2004.
  • M. Flierl and P. Vandergheynst, "Inter-Resolution Transform for Spatially Scalable Video Coding", Proceedings of the Picture Coding Symposium, December 2004
  • K. Smith and D. Gatica-Perez, "Order matters: a distributed sampling method for multi-object tracking", in Proc. BMVC, London, Sep. 2004.
  • S. Ba and J.-M. Odobez, "A Probabilistic Framework for Joint Head Tracking and Pose Esti-mation", in Proc. ICPR, Cambridge, Aug. 2004.
  • J.-M. Odobez and D. Gatica-Perez, "Motion in Model-Based Stochastic Tracking", in Proc. ICPR, Cambridge, Aug. 2004.
  • (*) D. Gatica-Perez, G. Lathoud, I. McCowan, and J.-M. Odobez, "A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking", in Proc. IEEE ICCV-WOMTEC, Nice, Oct. 2003.
  • (*) D. Gatica-Perez, G. Lathoud, I. McCowan, J.-M. Odobez, and D. Moore, "Audio-Visual Speaker Tracking with Importance Particle Filters", in Proc. IEEE ICIP, Barcelona, Sep. 2003.
  • J.-M. Odobez, S. Ba, and D. Gatica-Perez, "An Implicit Motion Likelihood for Tracking with Particle Filters", in Proc. BMVC, Norwich, Sep. 2003.
  • J.-M. Odobez and Sileye Ba, "A new model for visual tracking based on particle filters", in Proc. GRETSI, Paris, Sep. 2003.
  • Datong Chen and Jean-Marc Odobez, "Video Text Recognition using Sequential Monte Carlo and Error Voting Methods", in Pattern Recognition Letters", 2005. (accepted)
  • Datong Chen, Jean-Marc Odobez, and Jean-Philippe Thiran, "Monte Carlo Video Text Segmentation", in Int. Journal Of Pattern Recognition and Artificial Intelligence", 2005. (accepted)
  • Datong Chen, Jean-Marc Odobez, and Hervé Bourlard, "Text Detection and Recognition in Images and Videos", in "Pattern Recognition", 2004.
  • Datong Chen, Jean-Marc Odobez, and Jean-Philippe Thiran, "A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Contrast Independent Features and Machine Learning Methods", in Signal Processing: Image Communication, 2004.
  • Sileye O. Ba and Jean Marc Odobez, "Evaluation of Head Pose Tracking Algorithm in Indoor Environments", in International Conference on Multimedia & Expo ICME 2005), 2005.
  • Just, O. Bernier, and S. Marcel, "Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures", in Proc. of the sixth International Conference on Automatic Face and Gesture Recognition, 2004.
  • Pedro Quelhas and Jean-Marc Odobez, "Fusion of Structural and Color Local Descriptors for Enhanced Object Recognition", in Proceedings IEEE WIAMIS 2004 (5th International Workshop on Image Analysis for Multimedia Interactive Services), Lisboa, Portugal, 21-23 April 2004.
  • Dong Zhang, S. Z. Li, and Daniel Gatica-Perez, "Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces", in the International Conference on Pattern Recognition (ICPR), 2004.
  • Pozdnoukhov and S. Bengio, "Tangent Vector Kernels for Invariant Image Classification with SVMs", in 17th Int. Conf. Pattern Recognition (ICPR), 2004.
  • A.Vinciarelli, S.Bengio, and H.Bunke, "Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models", in "IEEE Transactions on Pattern Analysis and Machine Intelligence", 2004
  • Kevin Smith, "Order Matters: A Distributed Sampling Method for Multi-Object Tracking", in British Machine Vision Conference (BMVC), 2004
  • V. Popovici, J.-P. Thiran, Y. Rodriguez, and S. Marcel, "On Performance Evaluation of Face Detection and Localization Algorithms", in "17th International Conference on Pattern Recognition, ICPR2004", 2004
  • Daniel Gatica-Perez, Napat Triroj, Jean-Marc Odobez, Alexander Loui, and Ming-Ting Sun, "Assessing Scene Structuring in Consumer Videos", in "Int. Conf. on Image and Video Retrieval (CIVR)", 2004
  • J-M. Odobez and S. Ba, "Modélisation implicite du mouvement en suivi par filtrage de Monte Carlo séquentiel ", in "GRETSI conference, Signal and Image Processing ", 2003
  • Pedro Quelhas and James Boyce,"Vessel Segmentation and Branching Detection using an Adaptive Profile Kalman Filter in Retinal Blood Vessel Structure Analysis", in "Pattern Recognition and Image Analysis: First Iberian Conference, IbPRIA 2003, Springer-Verlag LNCS", 2003
  • Pedro Quelhas and Jean-Marc Odobez, "Fusion of Structural and Color Local Descriptors for Enhanced Object Recognition", in "Proceedings IEEE WIAMIS 2004(5th International Workshop on Image Analysis for Multimedia Interactive Services), 21-23 April, 2004, Lisboa, Portugal", 2004
  • Just, O. Bernier, and S. Marcel, "Recognition of Isolated Complex Mono- and Bi-Manual 3D Hand Gestures", in "Proc. of the sixth International Conference on Automatic Face and Gesture Recognition", 2004
  • K. Nummiaro, E. Koller-Meier, L. Van Gool, "An adaptive color-based particle filter", Image and vision computing, vol. 21, no. 1, pp. 99-110, Dec. 2003
  • K. Nummiaro, E. Koller-Meier, T. Svoboda, D. Roth, L. Van Gool, "Color-based object tracking in multi-camera environments", lecture notes in computer science, vol. 2781, pp. 591-599, 2003 (Proceedings 25th pattern recognition symposium - DAGM’03), Magdeburg, Germany, September 10-12, 2003
  • K. Seidel and J. Martinec, "Remote Sensing in Snow Hydrology", Remote Sensing in Snow Hydrology, K. Seidel and J. Martinec, ed., Springer Verlag, pp. 1-150, 2003
  • Kalberer, G.A.[Gregor A.], Müller, P.[Pascal], Van Gool, L.[Luc], "Visual speech, a trajectory in viseme space", IJIST(13), No. 1, 2003, pp. 74-84
  • G. A. Kalberer, P. Müller and L. Van Gool, "Modeling and Synthesis of Visual Speech in 3D. 3D Modeling and Animation: Synthesis and Analysis Techniques for the Human Body", edited by N. Sarris, M. Strintzis, IDEA Group Inc., 2003
  • T. Goedemé, M. Nuttin, T. Tuytelaars, L. Van Gool, "Vision based intelligent wheel chair control : the role of vision and inertial sensing in topological navigation", Journal of robotic systems, vol. 21, no. 2, pp. 85-94, 2004
  • F. Mindru, T. Tuytelaars, L. Van Gool, T. Moons, "Moment invariants for recognition under changing viewpoint and illumination, Computer vision and image understanding", vol. 94, pp. 3-27, 2004
  • Matthieu Bray, Esther Koller-Meier, Pascal Muller, Nicol N. Schraudolph and Luc Van Gool, "Stochastic Optimization for High-Dimensional Tracking in Dense Range Maps", IEE Journal of Vision, Image & Signal Processing (in press)
  • Vittorio Ferrari, Tinne Tuytelaars and Luc Van Gool, "Simultaneous Object Recognition and Segmentation by Image Exploration", LNCS 3021, pp. 40-54, 2004
  • Pascal Muller, Gregor A. Kalberer, Marc Proesmans and Luc Van Gool, "Realistic speech animation based on observed 3D face dynamics", IEE Journal of Vision, Image & Signal Processing (in press)
  • L. Van Gool, M. Pollefeys, M. Proesmans, A. Zalesny, "Modelling Sagalassos : creation of a 3D archaeological virtual site, Images and artefacts of the ancient world", Bowman A.K. and Brady J.M., eds., September 2004, Oxford University Press/British Academy, London, UK (in press)
  • Florica Mindru, Tinne Tuytelaars, Luc Van Gool, and Theo Moons, "Moment invariants for recognition under changing viewpoint and illumination", Computer Vision and Image Understanding, Vol. 94, No. 1-3 pp. 3-27. 2004, Special Issue: Colour for Image Indexing and Retrieval, Eds. Theo Gevers, Graham Finlayson, and Raimondo Schettini
  • Tinne Tuytelaars and Luc Van Gool, "Matching widely separated views based on affine invariant regions", Int. J. Computer Vision, Vol~59, No.~1, pp.~61-85, 2004
  • M. Osian, L. Van Gool, "Video shot characterization", Workshop on TREC video retrieval evaluation - TRECVID 2003, Gaithersburg, Maryland, USA, November 17-18, 2003,
  • Strecha, F. Verbiest, M. Vergauwen, L. Van Gool, "Shape from video v.s. still images", proceedings Conference on optical 3-D measurement techniques, vol. 2, pp. 168-175, Zürich, Switzerland, September 22-25, 2003
  • Strecha, T. Tuytelaars, L. Van Gool, "Dense matching of multiple wide-baseline views", proceedings 9th IEEE international conference on computer vision, ICCV 2003, vol. 2, pp. 1194-1201, Nice, France, October 13-16, 2003,
  • R. Fransens, J. De Prins, L. Van Gool, "SVM-based nonparametric discriminant analysis, an application to face detection", Proceedings 9th IEEE international conference on computer vision, ICCV 2003, vol. 2, pp. 1289-1296, Nice, France, October 13-16, 2003
  • G. Bianchi, M. Harders and G. Székely, "Mesh Topology Identification for Mass-Spring Models", Medical Image Computing and Computer-Assisted Intervention MICCAI 2003, November 2003
  • J. Cosmas, T. Itegaki, D. Green, N. Joseph, L. Van Gool, A. Zalesny, D. Vanrintel, F. Leberl, M. Grabner, K. Schindler, K. Karner, M. Gervautz, S. Hynst, M. Waelkens, M. Vergauwen, M. Pollefeys, K. Cornelis, T. Vereenooghe, R. Sablatnig, M. Kampel, P. Axell, E. Meyns, "Providing multimedia tools for recording, reconstruction, visualization and database storage/access of archaeological excavations", VAST, November 2003, pp. 183-192
  • Neubeck, A. Zalesny, L. Van Gool, "Cut primed smart-copying", The 3rd international workshop on texture analysis an synthesis, ICCV 2003 Nice, France, October 2003, pp. 71-76, 17
  • T. Jaeggli, T.P. Koninckx, L. Van Gool, "Online 3D Acquisition and Model Integration", IEEE International Workshop on Projector-Camera Systems - ICCV03, cdrom proc, Nice, France, October 12, 2003
  • N. Kogo, R. Fransens, G. Van Belle, J. Wagemans, L. Van Gool, "End-stopped cue detection for subjective surface reconstruction," 26th European conference on visual perception, Sept. 03
  • H. Shao, T. Svoboda, V. Ferrari, T. Tuytelaars, & L. Van Gool, "Fast indexing for image retrieval based on local appearance with re-ranking", International Conference on Image Processing, Sept. 2003
  • T. Goedemé, T. Tuytelaars, L. Van Gool, "Fast wide baseline matching for visual navigation", IEEE computer society conference on computer vision and pattern recognition - CVPR2004, Washington, DC, USA, June 27 - July 2, 2004
  • R. Fransens, C. Strecha and L. Van Gool, "A probabilistic approach to optical flow based super-resolution", IEEE computer society conference on computer vision and pattern recognition - CVPR2004, Washington, DC, USA, June 27 - July 2, 2004
  • M. Osian, T. Tuytelaars and L. Van Gool, "Fitting superellipses to incomplete contours", IEEE computer society conference on computer vision and pattern recognition - CVPR2004, Washington, DC, USA , June 27 - July 2, 2004,
  • K. Ozden, K. Cornelis, L. Van Eycken and L. Van Gool, "Reconstructing 3D independent motion using non-accidentalness", IEEE computer society conference on computer vision and pattern recognition - CVPR2004, Washington, DC, USA, June 27 - July 2, 2004
  • G. Caenen and L. Van Gool, "Maximum response filters for texture analysis", IEEE computer society conference on computer vision and pattern recognition - CVPR2004, Washington, DC, USA, June 27 - July 2, 2004
  • T. Tuytelaars and L. Van Gool, "Synchronizing video sequences", IEEE computer society conference on computer vision and pattern recognition - CVPR2004, Washington, DC, USA, June 27 - July 2, 2004
  • Thomas P. Koninckx, Tobias Jaeggli, Luc Van Gool, "Adaptive scanning for online 3D model acquisition", CVPR, 2004
  • Vittorio Ferrari, Tinne Tuytelaars, Luc Van Gool, "Integrating multiple model views for object recognition", CVPR, 2004
  • Vittorio Ferrari, Tinne Tuytelaars, Luc Van Gool "Retrieving Objects From Videos Based on Affine Regions", to appear in European Signal Processing conference (EUSIPCO), Vienna, Austria, September 2004
  • Matthieu Bray, Esther Koller-Meier, Nicol N. Schraudolph, Luc Van Gool, "Stochastic Meta-descent for tracking articulated structures", CVPR, 2004
  • L. Van Gool , M. Waelkens b, P. Mueller c, T. Vereenooghe b, M. Vergauwen, "Photo-realistic and detailed 3D-modeling: The Antonine Nymphaeum at Sagalassos (Turkey)", CAA, 2004
  • P. Mueller, T. Vereenooghe, M. Vergauwen, L. Van Gool, and M. Waelkens, "Photo-realistic and detailed 3D modeling: the Antonine nymphaeum at Sagalassos (Turkey)", Proc. Conf. on Computer Applications and Quantitative Methods to Archaeology (CAA), Prato, Italy, 13-17 April, 2004
  • L. Van Gool, M. Waelkens, P. Mueller, T. Vereenooghe, and M. Vergauwen, "Total recall: a plea for realism in models of the past", Proc. ISPRS Conf., Istanbul, july 2004 (DVD proceedings in our library, but paper version on sale...)
  • L. Van Gool (Invited Paper), "Modeling from images - Sagalassos' nymphaeum as a showcase", Proc. ISPRS Conf., Istanbul, july 2004
  • P. Doubek, I. Geys, T. Svoboda and L. Van Gool, "Cinematographic Rules Ap[lied to a Camera Network", Omnivis2004, The fifth Workshop on Omnidirectional Vision, Camera Networks and Non-Classical Cameras, May 2004
  • G. A. Kalberer, P. Mueller and L. Van Gool, "Animation Pipeline: Realistic Speech Based on Observed 3D Face Dynamics", 1st European Conference on Visual Media Production (CVMP), March 2004
  • M. Bray, E. Koller-Meier, P. Mueller, L. Van Gool and N. N. Schraudolph, "3D Hand Tracking by Rapid Stochastic Gradient Descent Using a Skinning Model", 1st European Conference on Visual Media Production (CVMP), March 2004
  • "T. Koninckx, A. Griesser and L. Van Gool,  «Real-time Range Scanning of Deformable Surfaces by Adaptively Coded Structured Light «, Fourth International Conference on 3-D Digital Imaging and Modeling (3DIM03), October 2003
  • Neubeck, A. Zalesny and L. Van Gool, "Cut-primed smart copying», Texture 2003 Workshop in conjunction with ICCV 2003, Oktober 2003"
  • Ph. Zehnder, "A hierarchical system for recognition, tracking and pose estimation", Martigny, JOINT AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms, 22. June 04,
  • V. Popovici and J. Thiran, "Pattern Recognition using Higher-Order Local Autocorrelation Coefficients", Pattern Recognition Letters, Vol. 25, No 10, pp. 1107-1113, July 2004
  • Chen, J. Odobez and J. Thiran, "A localization/verification scheme for finding text in images and video frames based on contrast independent features and machine learning method", Image Communication, Vol. 19, No 3, pp. 205-217, March 2004
  • "T. Butz, P. Hagmann, E. Tardif, R. Meuli and J. Thiran, ""A new brain segmentation framework"", Lecture Notes in Computer Science, Vol. 2879, pp. 586-593, November 2003"
  • "V. Popovici and J. Thiran, ""Adaptive Kernel Matching Pursuit for Pattern Classification"", Proceedings of International Conference on Artificial Intelligence and Applications, February 2004"
  • N. Aspert, T. Ebrahimi and P. Vandergheynst, "Non-linear subdivision using local spherical coordinates", Computer Aided Geometric Design, Vol. 20, No 3, pp. 165-187, 2003
  • P. Frossard, P. Vandergheynst, R. Figueras i Ventura and M. Kunt, "A Posteriori Quantization of Progressive Matching Pursuit Streams", IEEE Transactions on Signal Processing, Vol. 52, No 2, pp. 525-535, February 2004
  • Petrovic, O. Divorra Escoda and P. Vandergheynst, "Multiresolution segmentation of natural images: From linear to non-linear scale-space representations", IEEE Transactions on Image Processing, Accepted for publication, July 2004
  • R. Figueras i Ventura, P. Vandergheynst and P. Frossard, "Low rate and scalable image coding with redundant representations", IEEE Transactions on Image Processing, Accepted for publication, 2004
  • P. Jost, P. Frossard and P. Vandergheynst, "Redundant Image Representations in Security Applications", International Conference on Image Processing, October 2004
  • G. Monaci and P. Vandergheynst, "Learning structured dictionaries for image representation", in proceedings of IEEE ICIP 2004, Singapore, October 2004
  • Bogdanova, P. Vandergheynst, J. Antoine, L. Jacques and M. Morvidone, "Discrete Wavelet Frames on the Sphere", in proceedings of EUSIPCO 2004, Vienne, Austria, September 2004
  • G. Monaci, P. Jost and P. Vandergheynst, "Image compression with learnt tree-structured dictionaries", in proceedings of IEEE MMSP 2004, Siena, September 2004
  • M. Flierl and P. Vandergheynst, "Distributed Coding of Dynamic Scenes with Motion-Compensated Wavelets", proc. of the IEEE International Workshop on Multimedia Signal Processing, September 2004
  • O. Divorra Escoda and P. Vandergheynst, "A Bayesian Approach to Video Expansions on Parametric Over-Complete 2-D Dictionaries", International Workshop on Multimedia Signal Processing, September 2004
  • L. Granai and P. Vandergheynst, "Sparse decomposition over multi-component redundant dictionaries", Multimedia Signal Processing (MMSP04), Workshop on, September 2004
  • Rahmoune, P. Vandergheynst and P. Frossard, "MP3D: Highly Scalable Video Coding Scheme Based on Matching Pursuit", Proceedings of IEEE ICASSP, May 2004
  • R. Figueras i Ventura, P. Vandergheynst, P. Frossard and A. Cavallaro, "Color image scalable coding with Matching Pursuit", Proceedings of IEEE ICASSP, April 2004
  • M. Flierl, P. Vandergheynst and B. Girod, "Video Coding with Lifted Wavelet Transforms and Complementary Motion-Compensated Signals", Proc. of the SPIE Conf. on Visual Communications and Image Processing, Proc. of the SPIE Conf. on Visual Communications and Image Processing, Vol. 5308, January 2004
  • Petrovic and P. Vandergheynst, "Multiscale Variational Approach to Simultaneous Image Regularization and Segmentation", Proceedings of the 3rd International Symposium on Image and Signal Processing and Analysis, September 2003
  • S. Voloshynovskiy, F. Deguillaume, O. Koval and T. Pun, "Information-Theoretic Data-Hiding: Recent Achievements and Open Problems", Accepted to International Journal of Image and Graphics, Special Issue on Image Data Hiding (Invited paper)
  • O. Koval, S.Voloshynovskiy, F. Perez-Gonzalez, F. Deguillaume, and T. Pun, "Spread Spectrum Watermarking for Real Images: Is Everything So Hopeless?", XIIth European Signal Processing Conference EUSIPCO 2004, Vienna, Austria, September 07 - 10, 2004
  • O. Koval, S.Voloshynovskiy, F. Perez-Gonzalez, F. Deguillaume, and T. Pun, "Quantization-based watermarking performance improvement using host statistics: additive noise attack", ACM Multimedia and Security Workshop Magdeburg, Germany, September 20-21, 2004
  • L. Perez-Freire, F. Perez-Gonzalez and S. Voloshynovskiy, "Revealing the true achievable rates of scalar Costa scheme", IEEE International Workshop on Multimedia Signal Processing, Siena, Italy, Sept 29 – Oct 1, 2004
  • P. Roth, T. Pun, "A multimodal system for the non-visual exploration of digital pictures", Interact 2003 "Bringing the Bits togETHer", 9th ICIP TC13 Int. Conf. on Human-Computer Interaction, Zuerich, Switzerland, Sept. 1-5, 2003
  • Y. Rytsar, S. Voloshynovskiy and T. Pun, "Metadata Representation for Semantic-Based Multimedia Security and Management", Workshop on Metadata for Security, International Federated Conferences OTM '03, Catania, Italy, 3-7 November 2003
  • Topak, S. Voloshynovskiy, O. Koval and T. Pun, "Capacity-security analysis of repetitive watermarking", XIIth European Signal Processing Conference EUSIPCO 2004, Vienna, Austria, September 07 - 10, 2004
  • J. Vila, O. Koval and S. Voloshynovskiy, "Facial Image Compression Using Overcomplete Transforms", SPIE Electronic Imaging 2004, Visual Communications and Image Processing, San Jose, California, USA, 18–22 January 2004
  • J. Vila, S.Voloshynovskiy, O. Koval, F. Perez-Gonzalez and T. Pun, "Worst case additive attack against quantization-based watermarking techniques", IEEE International Workshop on Multimedia Signal Processing, Siena, Italy, Sept 29 – Oct 1, 2004
  • S. Voloshynovskiy. O. Koval, F. Deguillaume and T. Pun, "Visual communications with side information via distributed printing channels: extended multimedia and security perspectives", Conf. on Security, Steganography, and Watermarking of Multimedia Contents VI, Special session "Document security", S. Voloshynovskiy, chair, part of the IST/SPIE Symposium on Electronic Imaging 2004, San Jose, CA, USA, paper EI 5306-43, Jan 18-22, 2004
  • Günter, S., Bunke, H., "Optimization of weights in a multiple classifier handwritten word recognition system using a genetic algorithm", Electronic Letters of Computer Visison and Image Analysis, ELCVIA, Vol 3, No 1, 25 – 44, 2004
  • Vinciarelli, A., Bengio, S., Bunke, H., "Offline recognition of unconstrained handwritten texts using HMMs and statistical language models", IEEE Trans. PAMI 26, 709 – 720, 2004
  • Günter, S., Bunke, H., "Feature selection algorithms for the generation of multiple classifier systems and their application to handwritten word recognition", Pattern Recognition Letters 25, 2004, 1323 – 1336, 2004
  • Günter, S., Bunke, H., "HMM-based handwritten word recognition: on the optimization of the number of states, training iterations and Gaussian components", Pattern Recognition 37, 2069 - 2079, 2004
  • Günter, S., Bunke, H., "Fast feature selection in an HMM-based multiple classifier system for handwriting recognition", in Michaelis, B., Krell, G. (eds.): DAGM 2003, Springer LNCS 2781, 289 - 296, 2003
  • Günter, S., Bunke, H., "Off-line cursive handwriting recognition - on the influence of training set and vocabulary size in mutliple classifier systems", Proc. of the 11th Conf. of the Int. Graphonomics Society, 196 - 199, 2003
  • Varga, T., Bunke, H., "Effects of training set expansion in handwriting recognition using synthetic data", Proc. of the 11th Conf. of the Int. Graphonomics Society, 200 – 203, 2003
  • Günter, S., Bunke, H., "Ensembles of classifiers derived from multiple prototypes and their application to handwriting recognition", in Roli, F.,Kittler, J., Windeatt, T. (Eds.): Multiple Classifier Systems, Proc. 5th Int. Workshop, MCS 2004, Springer Verlag, LNCS 3077, 314 - 323, 2004
  • Zimmermann, M., Bunke, H., "Optimizing the integration of a statistical language model in HMM based offline handwritten text recognition, Proc. 17th Int. Conference on Pattern Recognition, Vol II, 541 - 544, 2004
  • Schlapbach, A., Bunke, H., "Off-line handwriting identification using HMM based recognizers", Proc. 17th Int. Conference on Pattern Recognition, Vol II, 654 - 658, 2004
  • Varga, T., Bunke, H., "Off-line handwritten textline recognition using a mixture of natural and synthetic training data", Proc. 17th Int. Conference on Pattern Recognition, Vol II, 545 - 549, 2004
  • Neuhaus, M., Bunke,H., "A probabilistic approach to learning costs for graph edit distance", ICPR Proc. 17th Int. Conference on Pattern Recognition, Vol III, 389 - 393, 2004
  • Günter, S., Bunke, H., " An evaluation of ensemble methods in handwritten word recognition based on feature selection" , Proc. 17th Int. Conference on Pattern Recognition, Vol I, 388 - 392, 2004
  • Zimmermann, M., Bertolami, R., Bunke, H., "Rejection strategies for offline handwritten sentence recognition", Proc. 17th Int. Conference on Pattern Recognition, Vol II, 550 - 553, 2004
  • Günter, S., Bunke, H.: "Evaluation of classical and novel ensemble methods for handwritten word recognition", in Fred, A. et al. (eds.): Structural, Syntactic, and Statistical Pattern Recognition, Proc. Joint IAPR Int. Workshops SSPR and SPR, Springer LNCS 3138, 583 - 573, 2004
  • Irniger, Ch., Bunke, H., "Decision tree structures for graph database filtering", in Fred, A. et al. (eds.): Structural, Syntactic, and Statistical Pattern Recognition, Proc. Joint IAPR Int. Workshops SSPR and SPR, Springer LNCS 3138, 66 - 75, 2004
  • K. Nummiaro, E. Koller-Meier, T. Svoboda, D.Roth, and L. Van Gool, "Color-Based Object Tracking in Multi-Camera Environments", Symposium for Pattern Recognition of the DAGM, Springer LNCS Series, Magdeburg, Sep 2003
  • Bunke, H, "Recognition of cursive Roman handwriting - past, present and future", Proc. 7th Int.Conference on Document Analysis and Recognition, Edinburgh, 2003, 448 - 459
  • Zimmermann, M., Chappelier, J.-C., Bunke, H, "Parsing N-best lists of handwritten sentences", Proc. 7th Int. Conference on Document Analysis and Recognition, Edinburgh, 2003, 572 - 576
  • Varga, T., Bunke, H., "Generation of synthetic training data for an HMM-based handwriting recognition system", Proc. 7th Int. Conference on Document Analysis and Recognition, Edinburgh, 2003, 618 - 622
  • Vinciarelli, A., Bengio, S., Bunke, H., "Offline recognition of large vocabulary cursive handwritten text", Proc. 7th Int. Conference on Document Analysis and Recognition (ICDAR), Edinburgh, 2003, 1101 - 1105
  • Günter, S., Bunke, H. "Optimizing the number of states, training iterations and Gaussians in an HMM-based handwritten word recognizer", Proc. 7th Int. Conference on Document Analysis and Recognition, Edinburgh, 2003, 472 - 476
  • Günter, S., Bunke, H., "Ensembles of classifiers for handwritten word recognition", Int. Journal on Document Analysis and Recognition, Vol. 5, No. 4, 2003, 224 - 232
  • V. Popovici, J.-Ph. Thiran, "Face Class Modeling in Eigenfaces Space", Proc. of 3rd Int. Workshop on Pattern Recognition in Information Systems (PRIS), p.38--45, Anger, France, 2003
  • Popovici, J.-Ph. Thiran, "Face Detection Using an SVM Trained in Eigenfaces Space", Proc. 4th Int. Conf. on Audio- and Video-Based Person Authentication (AVBPA), p.111--118, Surrey, UK, 2003, Springer-Verlag, LNCS 2688
  • Guenter, S., Bunke, H., "Fast Feature Selection in an HMM-based Multiple Classifier System for Handwriting Recognition", DAGM'03 Symposium, Magdeburg, Sept. 10-12, 2003.
  • Oscar Divorra Escoda and Pierre Vandergheynst, "Video Coding Using a Deformation Compensation Algorithm Based on Adaptive Matching Pursuit Image Decompositions", Proc. of IEEE ICIP, Barcelona, Catalonia, September 2003
  • Lorenzo Peotta, Lorenzo Granai and Pierre Vandergheynst, "Very low bit rate image coding using redundant dictionaries", Proc. of SPIE 48th Annual Meeting, San Diego, August 2003.
  • Torsten Butz and Jean-Philippe Thiran, "From Error Probability to Information Theoretical Signal Processing", submitted to IEEE Trans. on Pattern Analysis and Machine Intelligence (IEEE PAMI)
  • Torsten Butz, Patric Hagman, Eric Tardif, Reto Meuli, Jean-Philippe Thiran "A New Brain Segmentation Framework", accepted for publication in MICCAI 2003, Montreal, Canada.
  • S. Voloshynovskiy, O. Koval, F. Deguillaume, and T. Pun, "Data hiding capacity-security analysis for real images based on stochastic non-stationary geometrical models", Proceedings of SPIE Photonics West, Electronic Imaging 2003, Image and Video Communications and Processing V, Santa Clara, USA, 2003. Available from: http://vision.unige.ch/publications/postscript/2003/VoloshynovskiyKovalDeguillaumePun_SPIE2003.pdf
  • Oscar Divorra Escoda and Pierre Vandergheynst, "Locally Temporal Adaptive Transform Scheme for Sub-band Video Coding", In Proceedings of IEEE ICASSP, April 2003.
  • G. Antonini, V. Popovici, J.-Ph. Thiran, "Independent Component Analysis and Support Vector Machines for Face Feature Extraction", to appear in Intl. Conf. on Audio- and Video-Based Person Authentication (AVBPA), Surrey, UK, 2003, Springer-Verlag, LNCS 2688
  • S. Voloshynovskiy, O. Koval and T. Pun, "Wavelet-based image denoising using non-stationary stochastic geometrical image priors", Proceedings of SPIE, Photonics West, Electronic Imaging 2003, Image and Video Communications and Processing V, Santa Clara, USA, 2003.
  • Datong Chen, Jean-Marc Odobez, and Hervé Bourlard, "Text Detection and Recognition in Images and Videos", in "to appear in Pattern Recognition", 2003.
  • Fasel, B. and Luettin, J., "Automatic Facial Expression Analysis: A Survey", in "Pattern Recognition", 2003.
  • A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Datong Chen, Jean-Marc Odobez, and Jean-Philippe Thiran, "Contrast Independent Features and Machine Learning Methods", in "Signal Processing: Image Communication", 2003.
  • F. Monay and D. Gatica-Perez, "On Image Auto-Annotation with Latent Space Models", in "Proc. ACM Int. Conf. on Multimedia (ACM MM)", 2003.
  • J-M. Odobez, S. Ba, and D. Gatica-Perez, "An Implicit Motion Likelihood for Tracking with Particle Filters", in "British Machine Vision Conference (BMVC)", 2003.
  • J-M. Odobez, D. Gatica-Perez, and M. Guillemot, "Spectral Structuring of Home Videos", in "International Conference on Image and Video Retrieval (CIVR'03)", 2003.
  • M. Guillemot, P. Wellner, D. Gatica-Perez, and J-M.Odobez, "A Hierarchical Keyframe User Interface for Browsing Video over the Internet, in "Proceedings of the 9th International Conference on Human-Computer Interaction (INTERACT-2003)", 2003.
  • J-M. Odobez, D. Gatica-Perez, and M. Guillemot, "Video Shot Clustering using Spectral Methods", in "3rd Workshop on Content-Based Multimedia Indexing (CBMI)", 2003.
  • Bierlaire M., Antonini G. and Weber M. "Behavioural dynamics for pedestrians", 10th International Conference on Travel Behaviour Research, Lucerne, August 2003
  • Fasel, B., "Facial Expression Analysis using Shape and Motion Information Extracted by Convolutional Neural Networks, in "International IEEE Workshop on Neural Networks for Signal Processing (NNSP 02)", 2002
  • Fasel, B., "Head-Pose Invariant Facial Expression Recognition using Convolutional Neural Networks", in "International IEEE Conference on Multimodal Interfaces (ICMI 02)", 2002.
  • Fasel, B., "Mutliscale Facial Expression Recognition using Convolutional Neural Networks", in "Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP 02)", 2002
  • Daniel Gatica-Perez and Ming-Ting Sun, "Object Localization in Metric Spaces for Video Linking", in "IEEE Workshop on Motion and Video Computing", 2002.
  • Fasel, B., "Robust Face Analysis using Convolutional Neural Networks", in "Proceedings of the International Conference on Pattern Recognition (ICPR 02)", 2002.
  • Helmers, M., Bunke, H. "Generation and use of synthetic training data in cursive handwriting recognition", in F.J. Perales et al. (eds.): Pattern Recognition and Image Analysis, Proc. 1st Iberian Conference IbPRIA, Springer, LNCS 2652, 2003, 336 -345
  • Guenter, S., Bunke, H. "New boosting algorithms for classification problems with large number of classes applied to a handwritten word recognition task", in T. Windeatt, F. Roli (eds.): Multiple Classifier Systems, Proc. 4th Int. Workshop, Springer, LNCS 2709, 2003, 329 - 335
  • P. Roth, J. Kronegg, T. Pun, "Rendering digital images accessible for blind computer users", 10th International Conference on Human-Computer Interaction, HCI International 2003, June 22-27, 2003, Crete, Greece.
  • Y. Rytsar, S. Voloshynovskiy, F. Ehrler, T. Pun, "Interactive segmentation with hidden object-based annotations: toward smart media}, SPIE Electronic Imaging 2004, Storage and Retrieval Methods and Applications for Multimedia, San Jose, CA, USA, Jan. 24-29, 2004
  • S. Voloshynovskiy, F. Deguillaume, O. Koval and T. Pun, "Information-Theoretic Data-Hiding for Public Network Security, Services Control and Secure Communications", Proceedings of TELSIKS2003, Nis, Yugoslavia, October, 2003. (invited presentation)
  • "Tinne Tuytelaars, Andreas Turina, and Luc Van Gool, ""Non-combinatorial Detection of Regular Repetitions under Perspective Skew'', accepted for publication in IEEE Trans. on
  • Pattern Analysis and Machine Intelligence, in a Special Issue on grouping,"
  • Alessandro Vinciarelli, "A survey on Off-Line Cursive Word Recognition,'' Pattern Recognition, Vol 35, no. 7, pp 1433-1446, 2002,
  • Alessandro Vinciarelli and Samy Bengio, "Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition,'' Pattern Recognition Letters, Vol 23, no. 8, pp 905-916, 2002
  • Vlad Popovici and Jean-Philippe Thiran, "High-order Autocorrelations for Pattern Recognition'', Proc. ICIP 2001,Thessaloniki, Greece, pp. 724-727, October 2001
  • V. Popovici, J.-Ph. Thiran, "PCA in Autocorrelation Space'', to appear in Proceedings of International Conference on Pattern Recognition, Quebec City, Canada, August 2002
  • V. Popovici, J.-Ph. Thiran, "Pattern Recognition using Higher-Order Local Autocorrelation Coefficients'', in IEEE Workshop on Neural Networks for Signal Processing, Martigny, Switzerland, September 2002
  • S. Voloshynovskiy, O. Koval and T. Pun, "Wavelet-based image denoising using non-stationary stochastic geometrical image priors'', Proceedings of SPIE Photonics West, Electronic Imaging 2003, Image and Video Communications and Processing V, Santa Clara, USA, 2003. (submitted),
  • Gunter, S., Bunke, H., "A new combination scheme for HMM-based classifiers and its application to handwriting recognition'', in Proceedings of 16th Int. Conference on Pattern Recognition}, Quebec City, Aug 2002
  • "Gunter, S., Bunke, H., ""Creation of classifier ensembles for handwritten word recognition using feature selection algorithms'' in Proceedings of 8th Int. Workshop Frontiers in Handwriting
  • Recognition, Niagara Falls, Aug 2002,"
  • Alessandro Vinciarelli and Samy Bengio, "Writer adaptation techniques in HMM based Off-Line Cursive Script Recognition,'' Proceedings of 8th International Workshop on Frontiers in Handwriting Recognition, Niagara on the Lake, 2002
  • Alessandro Vinciarelli and Samy Bengio, "Offline Cursive Word Recognition using Continuous Density Hidden Markov Models trained with PCA or ICA Features,'' Proceedings of the International Conference on Pattern Recognition (ICPR),Québec, August 2002
  • Daniel Gatica-Perez and Ming-Ting Sun, "Linking Objects in Videos by Importance Sampling'', IDIAP-RR 02-20, 2002, Proc. IEEE International Conference on Multimedia and Expo, Lausanne, 2002
  • Datong Chen, Jean-Marc Odobez, and Herve Bourlard, "Text Segmentation and Recognition in Complex Background Based on Markov Random Field'', IDIAP-RR 02-17, 2002, Proc. Intl. Conference on Pattern Recognition (ICPR), Québec, August 2002

 

IM2.SA, IM2.ACP

 

  • Schlapbach, A., Bunke, H., "Using HMM based recognizers for writer identification and verification", Proc. 9th Int. Workshop on Frontiers in Handwriting Recognition, p 167–172, 2004.
  • Schlapbach, A., Bunke, H., "Writer Identification Using an HMM-Based Handwriting Recognition System: To Normalize the Input or Not?", Proc. 12th Conference of the International Graphonomics Society, p 138–142, 2005.
  • Schlapbach, A., Kilchherr, V., Bunke, H., "Improving Writer Identification by Means of Feature Selection and Extraction", Proc. 8th Int. Conf. on Document Analysis and Recognition, 2005.
  • Conrad Sanderson and Kuldip K. Paliwal, "Fast features for face authentication under illumination direction changes", in "Pattern Recognition Letters", 2003
  • S. Marcel and Y. Rodriguez, "Boosting Pixel-based Classifiers for Face Verification", in "Biometric Authentication Workshop of the 8th European Conference on Computer Vision, BIOAW2004", 2004
  • E. Bailly-Baillière, S. Bengio, F. Bimbot, M. Hamouz, J. Kittler, J. Mariéthoz, J. Matas, K. Messer, V. Popovici, F. Porée, B. Ruiz, and J.-P.Thiran, "The BANCA Database and Evaluation Protocol", in "4th International Conference on Audio- and Video-Based Biometric Person Authentication, AVBPA", 2003
  • Sanderson and K. K. Paliwal, "Fast features for face authentication under illumination direction changes", Pattern Recognition Letters, Vol. 24, No. 14, 2003. pp. 2409-2419.

 

IM2.SP

 

  • Andrew C. Morris, Viktoria Maier and Phil Green, "From WER and RIL to MER and WIL: improved evaluation measures for connected speech recognition", in International Conference on Spoken Language Processing (ICSLP), Jeju Island, Korea, 2004
  • Petr Fousek, Petr Svojanovsky, Frantisek Grezl, and Hynek Hermansky, "New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments", in Proceedings of International Conference on Spoken Language Processing (ICSLP), 2004
  • Marios Athineos, Hynek Hermansky, and Daniel P.W. Ellis, "PLP^2: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns", in Proceedings of the 2004 SAPA Workshop, 2004.
  • Jithendra Vepa and Simon King, "Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis", in Proceedings of ICSLP, 2004.
  • Guillaume Lathoud and Iain A. McCowan, "A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays", in Proceedings of the 2004 SAPA Workshop, 2004.
  • Herve Bourlard, Samy Bengio, Mathew Magimai Doss, Qifeng Zhu, Bertrand Mesot, Nelson Morgan, "Towards using hierarchical posteriors for flexible automatic speech recognition systems", in Proceedings DARPA EARS (Effective, Affordable, Reusable, Speech-to-text) Rich Transcription (RT’04) Workshop,IBM Palisades, NY., 7-10 November 2004. (IDIAP-RR 04-58 2004)
  • H. Hermansky, "Stochastic Techniques in Deriving Perceptual Knowledge", Proceedings Workshop on Statistical and Perceptual Audio Processing SAPA'04, Jeju, Korea, Oct 2004.
  • Misra, H., Ikbal, S., Sivadas, S., Bourlard, H., and Hermansky, H., “Multi-Resolution Spectral Entropy Feature for Robust ASR,” Proceedings IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Philadelphia (PA), March 19-23, 2005.
  • Ikbal, S., Bourlard, H., and Magimai.-Doss, M. “HMM/ANN based Spectral Peak Location Estimation for Noise Robust Speech recognition”, Proceedings IEEE Intl. Conf. on Acoustics, Speech, and Signal Processing, Philadelphia (PA), March 19-23, 2005.
  • Michael McGreevy, "Pseudo-syntactic language modeling for disfluent speech recognition", in Tenth Australian Conference of Speech Technology, Sydney, December 2004.
  • Guillaume Lathoud and Mathew Magimai.-Doss, "A Sector-Based, Frequency-Domain Approach to Detection and Localization of Multiple Speakers", in Proceedings of ICASSP 2005.
  • Thomas Hain, Lukas Burget, John Dines, Giulia Garau, Martin Karafiat, Mike Lincoln, Iain McCowan, Darren Moore, Vincent Wan, Roeland Ordelman and Steve Renals. “The 2005 AMI System for the Transcription of Speech in Meetings”, in NIST Spring 2005 Rich Transcription Workshop, (Edinburgh, Scotland), 2005.
  • Thomas Hain, Lukas Burget, John Dines, Iain McCowan, Martin Karafiat, Mike Lincoln, Darren Moore, Giulia Garau, Vincent Wan, Roeland Ordelman, and Steve Renals, “The Development of the AMI System for the Transcription of Speech in Meetings”, in Proceedings of MLMI’05 Workshop, Edinburgh, UK, 2005.
  • Thomas Hain, John Dines, Giulia Garau, Martin Karafiat, Darren Moore, Vincent Wan, Roeland Ordelman and Steve Renals. “Transcription of Conference Room Meetings: an Investigation”, INTERSPEECH 2005, Lisbon, Portugal, 2005. (accepted)
  • Guillermo Aradilla, Jithendra Vepa and Herve Bourlard, "Improving Speech Recognition using a Data-Driven Approach", INTERSPEECH 2005, Lisbon, Portugal, Sept. 2005. (accepted)
  • Hemant Misra, and Herve Bourlard, “Spectral Entropy Feature in Full-Combination Multi-Stream for Robust ASR”, INTERSPEECH’2005, Lisbon, Portugal, 2005. (accepted)
  • Mikko Lehtonen, Petr Fousek and Hynek Hermansky, Hierarchical approach for spotting keywords, Proceedings of MLMI’05 workshop, Edinburgh, U.K., July, 2005.
  • Hynek Hermansky, Petr Fousek and Mikko Lehtonen , "The Role of Speech in Multimodal Human-Computer Interaction (Towards reliable rejection of non-keyword input)", Proceedings of International Conference on Text, Speech and Dialogue, Springer Verlag 2005. (Invited Keynote Paper)
  • Hynek Hermansky and Petr Fousek, "Multi-resolution RASTA filtering for TANDEM-based ASR", INTERSPEECH 2005, Lisbon, Portugal, Sept. 2005. (accepted)
  • Guillaume Lathoud, Julien Bourgeois, and Jürgen Freudenberger, "Multichannel Speech Enhancement in Cars: Explicit vs. Implicit Adaptation Control", in Proceedings of HSCMA 2005, 2005.
  • J. Ang, Y. Liu and E. Shriberg, "Automatic Dialog Act Segmentation and Classification in Multiparty Meetings", 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, PA 2005.
  • O. Cetin and M. Ostendorf, "Multi-rate and variable-rate", 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, PA, 2005.
  • Y. Liu, A. Stolcke, E. Shriberg, and M. Harper, "Using Conditional Random Fields For Sentence Boundary Detection in Speech", Proceedings of ACL 2005, Michigan, 2005.
  • Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Comparing HMM, Maximum Entropy, and Conditional Random Fields for Disfluency Detection", Eurospeech 2005, Lisboa, 2005. (in press)
  • Venkataraman, Y. Liu, E. Shriberg, and A. Stolcke, "Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?", INTERSPEECH 2005, Lisbon, Portugal, Sept. 2005. (accepted)
  • Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendort, M. Tomalin, P. Woodland, and M. Harper, "Structural Metadata Research in the EARS Program", 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, PA, 2005.
  • Y. Chen, Q. Zhu, N. Morgan, "Tonotopic Multi-Layered Perceptron: A Neural Network for Learning", 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, Philadelphia, PA, 2005.
  • Wooters, N. Mirghafori, A. Stolcke, T. Pirinen, I Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. Ostendor, "The 2004 ICSI-SRI-UW Meeting Recognition System", in Proceedings of the Joint AMI/Pascal/IM2/M4 Workshop on Meeting Recognition. 2004.
  • N. Mirghafori, A. Stolcke, C. Wooter, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin and M. Ostendorf, "From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System", Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.
  • Bigi, Y. Huang and R. De Mori, "Vocabulary and Language Model Adaptation using Information Retrieval", Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.
  • Chen, Q. Zhu and N. Morgan, "Learning Long-Term Temporal Features in LVCSR Using Neural Networks", Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.
  • Q. Zhu, B. Chen, N. Morgan and A. Stolcke, "On using MLP features in LVCSR", Proceedings of International Conference on Spoken Language Processing, Jeju, Korea, October 2004.
  • R. Beutler, T. Kaufmann, and B. Pfister, "Using rule-based knowledge to improve LVCSR", in Proceedings of ICASSP'05, pages 829-832, Philadelphia, March 2005.
  • H. Romsdorfer and B. Pfister, "Multi-context rules for phonological processing in polyglot TTS synthesis", in Proceedings of Interspeech 2004 - ICSLP, pages 737-740, Jeju Island (Korea), October 2004.
  • H. Romsdorfer, B. Pfister, and R. Beutler, "A mixed-lingual phonological component which drives the statistical prosody control of a polyglot TTS synthesis system", in S. Bengio and H. Bourlard, editors, Machine Learning for Multimodal Interaction, pages 263-276. Springer-Verlag Heidelberg, January 2005.
  • H. Romsdorfer and B. Pfister, "Phonetic labeling and segmentation of mixed-lingual prosody databases", INTERSPEECH 2005, Lisbon, Portugal, Sept. 2005. (accepted)
  • F. de Wet, K. Weber, L. Boves, B. Cranen, S. Bengio, and H. Bourlard, Evaluation of Formant-Like Features for Automatic Speech Recognition, in "Journal of the Acoustical Society of America (JASA)", 2004
  • Julien Bourgeois, Jürgen Freudenberger, and Guillaume Lathoud, "Implicit Control of Noise Canceller for Speech Enhancement", in Proceedings of INTERSPEECH 2005, 2005.
  • Guillaume Lathoud, Mathew Magimai.-Doss, and Bertrand Mesot, "A Spectrogram Model for Enhanced Source Localization and Noise-Robust ASR", in Proceedings of INTERSPEECH 2005, 2005.
  • S. Ikbal, M. Magimai.-Doss, H. Misra, and H. Bourlard, "Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR", in Proceedings of the INTERSPEECH-ICSLP-04, 2004.
  • Marios Athineos, Hynek Hermansky, and Daniel P.W. Ellis, "LP-TRAP: Linear predictive temporal patterns", in International Conference on Spoken Language Processing ICSLP-04, Jeju, Korea, Oct 2004.
  • McCowan and H. Bourlard, "Microphone Array Post-filter based on Noise Field Coherence", in "IEEE Transactions on Speech and Audio Processing", 2003
  • Mathew Magimai.-Doss, Todd A. Stephenson, Shajith Ikbal, and Hervé Bourlard, "Modelling Auxiliary Features in Tandem Systems", in "Proceedings of ICSLP", 2004
  • Hemant Misra, Shajith Ikbal, Hervé Bourlard, and Hynek Hermansky, "Spectral Entropy Based Feature for Robust ASR", in "Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)", 2004
  • S. Ikbal, H. Misra, H. Bourlard, and H. Hermansky, "Phase AutoCorrelation (PAC) features in Entropy based Multi-Stream for Robust Speech Recognition", in "Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04)", 2004
  • Sunil Sivadas and Hynek Hermansky, "On Use of Task Independent Training Data in Tandem Feature Extraction", in "Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-04)", 2004
  • Christos Dimitrakakis and Samy Bengio, "Boosting HMMs with an application to speech recognition", in "IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP", 2004
  • Mathew Magimai.-Doss, Samy Bengio, and Hervé Bourlard, "Joint Decoding for Phoneme-Grapheme Continuous Speech Recognition", in "Proceedings of ICASSP", 2004
  • Mathew Magimai.-Doss, Todd A. Stephenson, Hervé Bourlard, and Samy Bengio, "Phoneme-Grapheme Based Speech Recognition System", in "Proceedings of IEEE ASRU", 2003
  • Vivek Tyagi, Iain McCowan, Herve Bourlard, and Hemant Misra, "Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR", in "IEEE ASRU", 2003
  • Hynek Hermansky, "TRAP-TANDEM: Data-driven extraction of temporal features from speech", in "large part published in Proceedings of ASRU-2003", 2003
  • S. Ikbal, M. Magimai.-Doss, H. Misra, and H. Bourlard, "Spectro-Temporal Activity Pattern (STAP) Features for Noise Robust ASR", in "Proceedings of the INTERSPEECH-ICSLP-04", 2004
  • S. Ikbal, H. Misra, S. Sivadas, H. Hermansky, and H. Bourlard, "Entropy Based Combination of Tandem Representations for Noise Robust ASR", in "Proceedings of the INTERSPEECH-ICSLP-04", 2004
  • Werner Hemmert, Marcus Holmberg, and David Gelbart, "Auditory-based Automatic Speech Recognition", to appear in Proc. ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, Jeju, Korea, October 2004
  • Barry Chen, Qifeng Zhu, and Nelson Morgan, "Learning Long-Term Temporal Features in LVCSR Using Neural Networks", to appear in Proc. Intl. Conf. Spoken Language Processing, Jeju, Korea, October 2004
  • Q. Zhu, B. Chen, N. Morgan, and A. Stolcke, " On using MLP features in LVCSR", to appear in Proc. Intl. Conf. Spoken Language Processing, Jeju, Korea, October 2004
  • Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, B. Peskin, and M. Harper, "The ICSI-SRI-UW Metadata Extraction System", to appear in Proc. Intl. Conf. Spoken Language Processing, Jeju, Korea, October 2004
  • Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Using Machine Learning to Cope with Imbalanced Classes in Natural Speech: Evidence from Sentence Boundary and Disfluency Detection", to appear in Proc. Intl. Conf. Spoken Language Processing, Jeju, Korea, October 2004
  • N. Mirghafori, A. Stolcke, C. Wooters, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, & M. Ostendorf, "From Switchboard to Meetings: Development of the 2004 ICSI-SRI-UW Meeting Recognition System", to appear in Proc. Intl. Conf. Spoken Language Processing, Jeju, Korea, October 2004
  • T. Pirinen and J. Yli-Hietanen, "Time delay based failure-robust direction of arrival estimation ", to appear in IEEE SAM 2004, Sitges, Barcelona, Spain, July 2004
  • M. Galley, K. McKeown, J. Hirschberg, and E. Shriberg, "Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies", to appear in Proc. ACL, Barcelona, July 2004
  • Y. Liu, A. Stolcke, E. Shriberg, and M. Harper, "Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech", to appear in Proc. Conf. on Empirical Methods in Natural Language Processing, Barcelona, Spain, July 2004
  • K. Boakye and B. Peskin, "Text-Constrained Speaker Recognition on a Text-Independent Task", Odyssey 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain, June 2004
  • Janin, J. Ang, S. Bhagat, R. Dhillon, J. Edwards, J. Macias-Guarasa, N. Morgan, B. Peskin, E. Shriberg, A. Stolcke, C. Wooters, B. Wrede, "The ICSI Meeting Project: Resources and Research", NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, May 2004
  • Stolcke, C. Wooters, N. Mirghafori, T. Pirinen, I. Bulyko, D. Gelbart, M. Graciarena, S. Otterson, B. Peskin, & M. Ostendorf, "Progress in Meeting Recognition: The ICSI-SRI-UW Spring 2004 Evaluation System", NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, May 2004
  • T. Pirinen, J. Yli-Hietanen, P. Pertilä and A. Visa, "Detection and compensation of sensor malfunction in time delay based direction of arrival estimation", In Proc. IEEE ISCAS, Vancouver, May 2004
  • D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu, and E. Shriberg, "Improving Automatic Sentence Boundary Detection with Confusion Networks", Proc. HLT-NAACL Conference, April-May 2004, Boston
  • Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey, "The ICSI Meeting Recorder Dialog Act (MRDA) Corpus", Proc. HLT-NAACL SIGDIAL Workshop, April-May 2004, Boston
  • N. Morgan, B. Y. Chen, Q. Zhu, and A. Stolcke, "TRAPping Conversational Speech: Extending TRAP/Tandem approaches to conversational telephone speech recognition", Proc. IEEE ICASSP, Montreal, May 2004
  • N. Mirghafori and M. Hebert, " Parameterization of the Score Threshold for a Text-Dependent Adaptive Speaker Verification System", Proc. IEEE ICASSP, Montreal, May 2004
  • M. Hebert and N. Mirghafori, " Desperately Seeking Impostors: Data-Mining for Competitive Impostor Testing in a Text-Dependent Speaker Verification System", Proc. IEEE ICASSP, Montreal, May 2004
  • E. Shriberg and A. Stolcke, "Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing", Proc. International Conference on Speech Prosody, Nara, Japan, March 2004
  • E. Shriberg and A. Stolcke, "Prosody Modeling for Automatic Speech Recognition and Understanding", to appear in Mathematical Foundations of Speech and Language Modeling, M. Johnson, M. Ostendorf, S. Khudanpur, R. Rosenfeld (eds.), Volume 138 in IMA Volumes in Mathematics and its Applications, pp. 105-114, Springer-Verlag
  • Wrede and E. Shriberg: "The Relationship Between Dialogue Acts and Hot Spots in Meetings", Proc. IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands, Dec. 2003
  • Y. Liu, E. Shriberg, and A. Stolcke, "Automatic disfluency identification in conversational speech using multiple knowledge sources", EUROSPEECH 2003, Geneva, September 2003
  • Chen, S. Chang, and S. Sivadas, "Learning Discriminative Temporal Patterns in Speech: Development of Novel TRAPS-Like Classifiers", EUROSPEECH 2003, Geneva, September 2003
  • L. Docio-Fernandez, D. Gelbart, and N. Morgan, "Far-field ASR on Inexpensive Microphones", EUROSPEECH 2003, Geneva, September 2003
  • P. Somervuo, B. Chen, Q. Zhu, "Feature Transformations and Combinations for Improving ASR Performance", EUROSPEECH 2003, Geneva, September 2003
  • Romsdorfer, B. Pfister, and R. Beutler, " A mixed-lingual phonological component in polyglot TTS synthesis", MLMI Workshop, Martigny, Switzerland, June 2004
  • Romsdorfer and B. Pfister, " Multi-context rules for phonological processing in polyglot TTS synthesis", in Proceedings of Interspeech 2004 - ICSLP, Jeju Island (Korea), October 2004
  • Chuck Wooters Nikki Mirghafori Andreas Stolcke Tuomo Pirinen Ivan Bulyko Dave Gelbart Martin Graciarena Scott Otterson Barbara Peskin Mari Ostendorf, "The 2004 ICSI-SRI-UW Meeting Recognition System",Proceedings of the Joint AMI/PASCAL/IM2/M4 workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLIM), Martigny, Switzerland, June 21-23. to appear in 2004
  • Qifeng Zhu Barry Chen Nelson Morgan Andreas Stolcke, "Tandem Connectionist Feature Extraction for Conversational Speech Recognition", Proceedings of the Joint AMI/PASCAL/IM2/M4 workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLIM), Martigny, Switzerland, June 21-23. to appear in 2004
  • Barry Chen Qifeng Zhu Nelson Morgan, "Long-Term Temporal Features for Conversational Speech Recognition", Proceedings of the Joint AMI/PASCAL/IM2/M4 workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLIM), Martigny, Switzerland, June 21-23. to appear in 2004
  • Hemant Misra, Herve Bourlard, and Vivek Tyagi, "New entropy based combination rules in HMM/ANN multi-stream ASR," in Proceedings of IEEE International Conference on Acoustic, Speech and Signal Processing, Hong Kong, Apr. 2003.
  • Vivek Tyagi, Iain McCowan, Herve Bourlard, and Hemant Misra, " On factorizing spectral dynamics for robust speech recognition," accepted for Eurospeech, Geneva, Switzerland, Sep. 2003.
  • Mathew Magimai.-Doss, Todd A. Stephenson, and Hervé Bourlard, " Using pitch frequency information in speech recognition", accepted for Eurospeech, Geneva, Switzerland, Sep. 2003.
  • Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard, "Speech recognition with auxiliary information," accepted for publication in IEEE Transaction on Speech and Audio Processing
  • Lapidot, I. and Guterman, H., "Dichotomy Between Clustering Performance and Minimum Distortion in Piecewise-Dependent-Data (PDD) Clustering", to be published in IEEE Signal Processing Letters", 2003
  • S. Ikbal, H. Misra, and H. Bourlard, "Phase AutoCorrelation (PAC) derived Robust Speech Features", in Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03)", Hong Kong 2003.
  • Hemant Misra and Andrew C. Morris, "Confusion Matrix Based Entropy Correction in Multi-stream Combination", in "Proceedings of Eurospeech", Geneva, Switzerland, Sep. 2003.
  • Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard, "Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks", in "Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-03)", 2003.
  • Pfister and H. Romsdorfer, "Mixed-Lingual Text Analysis for Polyglot TTS Synthesis", Eurospeech 2003.
  • R. Beutler and B. Pfister, "Integrating Statistical and Rule-Based Knowledge for Continuous German Speech Recognition", Eurospeech 2003
  • Todd A. Stephenson, Mathew Magimai-Doss, and Hervé Bourlard, "Auxiliary Variables in Conditional Gaussian Mixtures for Automatic Speech Recognition", in "Seventh International Conference on Spoken Language Processing (ICSLP 2002)", 2002
  • "Jitendra Ajmera and Charles Wooters, "A Robust Speaker Clustering Algorithm", in ""IEEE Automatic Speech Recognition Understanding Workshop (ASRU)"", 2003"
  • "Shajith Ikbal and Hynek Hermansky and Herve Bourlard, ""Nonlinear Spectral Transformations for Robust Speech Recognition"", in Proceedings of the IEEE ASRU WORKSHOP 2003. St. Thomas, U.S. Virgin Islands, 2003."
  • Pujol, P., Hagen, A., Bourlard, H., and Nadeu, C., "Comparison and combination of features in hybrid HMM/MLP and HMM/GMM speech recognition", accepted for publication in IEEE Transaction of Speech and Audio Processing, 2003.
  • "Weber, K., Ikbal, S., Bengio, S., and Bourlard, H., ""Robust speech recognition and feature extraction using HMM2"", in Computer, Speech and Langauge, vol. 17, no. 2-3, April-July 2003, pp. 195-212, Academic Press"
  • S. Moeller and H. Bourlard, "Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model'', to be published in Speech Communication, also IDIAP Research Report RR01-17,
  • Ajmera, J., McCowan, I., and Bourlard, H., "Speech/Music Discrimination Using Entropy and Dynamism Features in a HMM Classification Famework,'' to be published in Speech Weber, K., Ikbal, S., Bengio, S., and Bourlard, H.,``Robust Speech Recognition and Feature Extraction Using HMM2,'' to be published in Computer, Speech, and Language, Academic Press, also IDIAP Research Report RR-01-42,
  • Weber, K., Ikbal, S., Bengio, S., and Bourlard, H., "Robust Speech Recognition and Feature Extraction Using HMM2,'' to be published in Computer, Speech, and Language, Academic Press, also IDIAP Research Report RR-01-42,
  • Ajmera, J., McCowan, I., and Bourlard, H., "Robust HMM-Based Speech/Music Segmentation,'' in Proceedings of IEEE Intl. Conf.on Acoustics, Speech, and Signal Processing, Orlando, Florida, pp. I.297-300, May 13-17, 2002,
  • S. Ikbal, K. Weber, and H. Bourlard, "Speaker Normalization using HMM2'', IDIAP-RR 02-15, 2002, Proc. IEEE Workshop on Neural Networks for Signal Processing, Martigny (CH), Sep. 4-6, 2002
  • Andrew C. Morris, Simon Payne, and Hervé Bourlard, "Low cost duration modelling for noise robust speech recognition'', IDIAP-RR 02-08, 2002, in Intl Conf. on Spoken Language Processing (ICSLP), Denver, Sep. 2002,
  • J. Ajmera, H. Bourlard, I. Lapidot, and I. McCowan, "Unknown-Multiple Speaker clustering using HMM'', IDIAP-RR 02-07, 2002, Proc. Intl Conf. on Spoken Language Processing (ICSLP), Denver, Sep. 2002,
  • K. Weber, F. de Wet, B. Cranen, L. Boves, S. Bengio, and H. Bourlard, "Evaluation of Formant-Like Features for ASR'', IDIAP-RR 02-04, 2002, Proc. Intl Conf. on Spoken Language Processing (ICSLP), Denver, Sep. 2002
  • P. Pujol, S. Pol, A. Hagen, and and H. Bourlard, "Comparison and Combination of Rasta-PLP and FF Features in a Hybrid HMM/MLP Speech Recognition System,'' in Proc. Intl. Conf. of Spoken Language Processing (ICSLP'02), Denver, September 2002

 

IM2.SP, IM2.DI (ex AP)

 

  • Mohamed F. BenZeghiba and Hervé Bourlard, "On the combination of speech and speaker recognition", Eurospeech, 2003.
  • McCowan, A. Morris, and H. Bourlard, "Robust Speech Recognition with Small Microphone Arrays using the Missing Data Approach'', IDIAP-RR 02-09, 2002, Proc. Intl Conf. on Spoken Language Processing (ICSLP), Denver, Sep.2002,

 

IM2.SP, IM2.SA

 

  • (*) Guillaume Lathoud, Jean-Marc Odobez, and Daniel Gatica-Perez, "AV16.3: an Audio-Visual Corpus for Speaker Localization and Tracking", in Proceedings of the 2004 MLMI Workshop, S. Bengio and H. Bourlard Eds, Springer Verlag, 2005.
  • D. Gatica-Perez, G. Lathoud, I. McCowan, J.-M. Odobez." A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking", IEEE ICCV Workshop on Multimedia Technologies in E-Learning and Collaboration (WOMTEC), Nice, Oct. 2003.

 

 

Scientific papers without peer review

 

IM2 Phase II


IM2.DMA

 

  • Didier von Rotz, David Bourillot, Omar Abou Khaled, Rudolf Scheurer, Denis Lalanne, Rolf Ingold, Jean-Yves Le Meur, Thomas Baron (2006), "SMAC - Smart Multimedia Archive for Conferences", Flash Informatique FI1/06, Ecole Polytechnique Fédérale de Lausanne, février 2006, ISSN 1420-7192 , pp. 3-10. (http://ditwww.epfl.ch/publications-spip/IMG/pdf/1-6-page3.pdf)

 

IM2.DI (ex AP)

 

  • Denis Lalanne, Didier von Rotz, Rolf Ingold, "IM2.DI, Intégration de documents dans des archives multimédias de réunions", Flash Informatique FI2/05, Ecole Polytechnique Federale de Lausanne, ISSN 1420-7192, février 2005.
  • Omar ABOU KHALED, Rudolf SCHEURER, Denis LALANNE, Rolf INGOLD, Jean-Yves Le Meur, "Smart Multimedia Archive for Conferences (S.M.A.C.)", Flash Informatique FI2/05, Ecole Polytechnique Federale de Lausanne, ISSN 1420-7192, février 2005.
  • Denis Lalanne, Rolf Ingold, (in press), "Documents et multimodalité, Vecteurs thématiques et structurés vers des archives multimédia", numéro spécial de la revue scientifique et technique "Document numérique" sur le  thème "Temps et Documents", Service éditorial Hermès
  • Rolf Ingold, "Analyse et reconnaissance d'images de documents'', dans ``Techniques de l'ingénieur - Traité d'informatique'', H 7068, à paraître

 

IM2.SP

 

  • R. Beutler, "Improve Continuous Speech Recognition thru Linguistic Knowledge", February 2003, COST 278 workshop Barcelona

 

IM2.IIR

 

  • J. Hammerton, M. Osborne, S. Armstrong, W. Daelemans, "Introduction to the Special Issue on Machine Learning Approaches to Shallow Parsing'',  in The Journal of Machine Learning Research - Special Issue on Machine Learning Approaches to Shallow Parsing, J. Hammerton, M. Osborne, S. Armstrong, W. Daelemans (eds.)

 

 

Books

 

IM2 Pase II

 

IM2.BMI

 

  • Ferrez, P.W. and Millán, J. del R. (forthcoming), "Error-Related EEG Potentials in Brain-Computer Interfaces", In G. Dornhege et al. (eds.), Towards Brain-Computing Interfacing. Cambridge, MA: MIT Press.
  • Grave de Peralta, R., Gonzalez Andino, S.L., Ferrez, P.W., and Millán, J. del R. (forthcoming), "Non-Invasive Estimates of Local Field Potentials for Brain-Computer Interfaces", In G. Dornhege et al. (eds.), Towards Brain-Computing Interfacing. Cambridge, MA: MIT Press.
  • Millán, J. del R., Buttfield, A., Vidaurre, C., Krauledat, M., Schögl, A., Shenoy, P., Blankertz, B., Rao, R.P.N., Cabeza, R., Pfurtscheller, G., and Müller, K.-R. (forthcoming), "Adaptation in Brain-Computer Interfaces", In G. Dornhege et al. (eds.), Towards Brain-Computing Interfacing. Cambridge, MA: MIT Press.
  • Millán, J. del R., Ferrez, P.W., and Buttfield, A. (forthcoming), "The IDIAP Brain-Computer Interface: An Asynchronous Multi-Class Approach", In G. Dornhege et al. (eds.), Towards Brain-Computing Interfacing. Cambridge, MA: MIT Press.

 

IM2.MCA

 

  • Manny Rayner, Beth Ann Hockey, and Pierette Bouillon (2006), "Putting Linguistics into Speech Recognition", CSLI Publications, Stanford, CA.

 

IM2.MPR

 

  • J. Thiran, "Fusion and Fission of multimodal information", in SIMILAR Dreams: mul-timodal interfaces in our future life, UCL - Presses Universitaires de Louvain, pp. 29-42, 2005.
  • Drygajlo, "Reconnaissance vocale et sécurité", Chapter 6 in F. Leprévost, T. Ebrahimi, B. Warusel (Eds), "Enjeux de la sécurité multimédia", Traité IC2, série Informatique et systèmes d'information, Hermes Science and Lavoisier, Paris 2006, pp. 157-172.
  • S. Renals and S. Bengio, editors,  "Machine Learning for Multimodal Interaction: Second International Workshop", MLMI'2005. volume 3869 of Lecture Notes in Computer Science. Springer-Verlag, 2006.

 

IM2 Phase I

 

IM2.ACP

 

  • Schenker, A., Kandel, A., Bunke, H., Last, M., "Graph Theoretic Techniques for Web Content Mining", World Scientific, 2005.
  • Basu, M., Bunke, H., Del Bimbo, A. (eds.), "Syntactic and Structural Pattern Recognition", Special Section of IEEE Trans. Pattern Analysis and Machine Intelligence, 27(7), 2005.
  • F. Bimbot, J.F Bonastre, C. Fredouille, G. Gravier I. Magrin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-Garcia, D. Petrovska-Delactrétaz, and D. A. Reynolds. "A tutorial on Text-Independent Speaker Verification". Eurasip Journal on Applied Signal Processing, Volume 2004, No. 4, pp. 430-451, 1 April 2004.
  • Drygajlo, "Man-Machine Voice Enabled Interfaces", chapter in J. Tasic(, et al. (Eds), "Intelligent Integrated Media Communication Techniques", Kluwer Academic Publishers, Boston, pp. 305-336, 2003

 

IM2.DI (ex AP)

 

  • Rolf Ingold, Christine Vanoirbeek, "Document Analysis Revisited for Web Documents", in Apostolos Antonacopoulos and Jianying Hu (eds.), Web Document Analysis, Challenges and Opportunities, World Scientific Publishing Co. Pte. Ltd., pp.315-331, 2003

 

IM2.DI (ex AP), IM2.MDM

 

  • Andrei Popescu-Belis, Alexander Clark, Maria Georgescul, Sandrine Zufferey & Denis Lalanne (in press) "Shallow Dialogue Processing Using Machine Learning Algorithms (or not)", In Bourlard H. & Bengio S., eds., Multimodal Interaction and Related Machine Learning Algorithms, LNCS, Springer-Verlag, Berlin, 8 p, 2004

 

IM2.IP, IM2.DI (ex AP), IM2.ACP, IM2.DS, IM2.IIR, IM2.MDM, IM2.MI, IM2.SA, IM2.SP, IM2.VE

 

  • S. Bengio and H. Bourlard, editors. Machine Learning for Multimodal Inter- action: First International Workshop, MLMI'2004, volume 3361 of Lecture Notes in Computer Science. Springer-Verlag Heidelberg, 2005. (all publications refering to this book are under "Scientific papers with peer review" for 2004 and 2005 annual progress reports)

 

IM2.MI

 

  • J. del R. Millan, "Brain-computer interfaces,'' to appear in {\it The Handbook of Brain Theory and Neural Networks}, M. A. Arbib (Ed.), Bradford Books, The MIT Press, 2002,

 

IM2.MI, IM2.SP

 

  • Bourlard,H, and Bengio,S., "Hidden Markov Models and other Finite State Automata for Sequence Processing,'' to appearin The Handbook of Brain Theory and Neural Networks, M. A. Arbib (Ed.), Bradford Books, The MIT Press, 2002,

 

IM2.SA

 

  • J. Antoine, R. Murenzi, P. Vandergheynst and S. Ali, "Two-Dimensional Wavelets and their Relatives", September 2004.
  • G. A. Kalberer, P. Mueller and L. Van Gool, "Modeling and Synthesis of Realistic Visual Speech in 3D", 3D Modeling and Animation: Synthesis and Analysis Techniques for the Human Body, N. Sarris and M. G. Strintzis, Ed.,  IDEA Group Inc.,  pp. 266-294, 2004.
  • JP Antoine, S.T. Ali, R. Murenzi and P. Vandergheynst, "Wavelets and their relatices", Cambridge university press, in press, expected publication August 2004

 

IM2.SP

 

  • Morgan, N., Bourlard, H., and Hermanksy, H., "`Speech Recognition and the Auditory Perspective", chapter in "Speech Processing in the Auditory System'' eds. S. Greenberg and W. Ainsworth, Springer-Verlag, 2004
  • H. Bourlard, S. Bengio, and K. Weber, "Towards Robust and Adaptive Speech Recognition Models'',  IDIAP-RR 02-01, 2002, to be published in Mathematical Foundations of Speech Processing and Recognition}, Institute for Mathematics and its Applications (IMA) Series, Eds. Mari Ostendorf, Sanjeev Khudanpur, and Roni Rosenfeld, Springer-Verlag,

 

Reports

 

IM2 phase II

 

IM2.AP

 

  • Guillaume Lathoud, "Further Applications of Sector-Based Detection and Short-Term Clustering". IDIAP-RR 06-26, 2006.
  • R. Eklund, R. Bates, C. Kuyper, E. Willingham, and E. Shriberg, "The Annotation and Analysis of Importance in Meetings". ICSI Technical Report TR-06-003, June, 2006
  • T. Kaufmann, "Evaluation von Grammatikformalismen in Hinblick auf die Anwendung in der Spracherkennung", Zwischenbericht zum Nationalfonds-Projekt 105211-104078/1: Rule-Based Language Model for Speech Recognition. Institut TIK, ETH Zürich, September 2005.
  • M. Gerber, "Evaluation of neural networks as a distance measure", report to the NCCR Project IM2.ACP. Institut TIK, ETH Zürich, December 2005.
  • B. Pfister und R. Beutler, "Improving Speech Recognition thru Linguistics", Schlussbericht für das Projekt COST 278. Institut TIK, ETH Zürich, Februar 2006.
  • Hemant Misra, "Multi-stream Processing for Noise Robust Speech Recognition", IDIAP-RR 06-28, 2006.
  • Joseph Keshet, Samy Bengio, Dan Chazan, Shai Shalev-Shwartz, and Yoram Singer, "Discriminative Kernel-Based Phoneme Sequence Recognition", IDIAP-RR 06-14, 2006.
  • Bertrand Mesot and David Barber, "Switching Linear Dynamical Systems for Noise Robust Speech Recognition", IDIAP-RR 06-08, 2006.
  • T. Cemgil, B. Kappen, and D. Barber, "A Generative Model for Music Transcription", IDIAP-RR 05-89, 2005.
  • David Barber, "Efficient Kalman Smoothing for Harmonic State-Space Models", IDIAP-RR 05-87, 2005.
  • D. Grangier and S. Bengio, "A Discriminative Decoder for the Recognition of Phoneme Sequences", IDIAP-RR 05-67, 2005.
  • Johnny Mariéthoz and Samy Bengio, "Can a Professional Imitator Fool a GMM-Based Speaker Verification System?", IDIAP-RR 05-61, 2005.
  • Hari Krishna Maganti, Jithendra Vepa, and Hervé Bourlard, "Continuous Microphone Array Speech Recognition on Wall Street Journal Corpus", IDIAP-RR 05-47, 2005.
  • Mohamed Faouzi BenZeghiba, "Joint Speech and Speaker Recognition", IDIAP-RR 05-28, 2005.
  • Hamed Ketabdar, Jithendra Vepa, Samy Bengio, and Hervé Bourlard, "Developing and Enhancing Posterior Based Speech Recognition Systems", IDIAP-RR 05-23, 2005.
  • Mathew Magimai.-Doss, John Dines, Hervé Bourlard, and Hynek Hermansky, "Improving Continuous Speech Recognition System Performance with Grapheme Modelling", IDIAP-RR 05-16, 2005.
  • Guillaume Lathoud and Mathew Magimai.-Doss Bertrand Mesot, "A Frequency-Domain Silence Noise Model", IDIAP-RR 05-13, 2005.

 

IM2.AP, IM2.DMA

 

  • Mike Lincoln, Iain McCowan, Jithendra Vepa, and Hari Krishna Maganti, "The Multi-Channel Wall Street Journal Audio Visual Corpus (MC-WSJ-AV): Specification and Initial Experiments", IDIAP-RR 05-69, 2005.

 

IM2.AP, IM2.VP

 

  • H.-K. Maganti and D. Gatica-Perez, "Speaker localization for microphone-array-based ASR: the effects of accuracy on overlapping speech", IDIAP Research Report, May 2006

 

IM2.BMI

 

  • Ferran Galán, Francesc Oliva, Joan Guàrdia, Pierre Ferrez, and José del R. Millán, "Detecting Intentional Mental Transitions in an Asynchronous BCI", IDIAP-RR 06-43, 2006.
  • Silvia Chiappa and David Barber, "Bayesian Linear Gaussian State Space Models for Biosignal Decomposition", IDIAP-RR 05-84, 2005.

 

IM2.DMA

 

  • M. Guillemot, B. Crettol & P. Wellner (2006), "From Meeting Recordings to Web Distribution: Description of the Process", IDIAP-Com 05-05, January 2006.
  • Popescu-Belis A. (2005), "Dialogue Acts: One or More Dimensions? ISSCO Working Paper n. 62, Version 2, November 2005, University of Geneva, 30 p.
  • Popescu-Belis A., Estrella P., King M. & Underwood N. (2005), "Towards Automatic Generation of Evaluation Plans for Context-based MT Evaluation", ISSCO Working Paper n. 64, August 2005, University of Geneva, 18 p.

 

IM2.HMI, IM2.ISD

 

  • Gerwin van Doorn, Mike Flynn, and Pierre Wellner, "Overlapped meeting playback", MLMI'06, Bethesda, Maryland. May 1-3, 2006.  Demo presentation

 

IM2.MCA

 

  • Perrow, Mike and Barber, David, "Probabilistic Tagging of Unstructured Genealogical Records", IDIAP-RR 05-86, 2005.
  • Dong Zhang, Daniel Gatica-Perez, Deb Roy, and Samy Bengio, "Modeling Interactions from Email Communication", IDIAP-RR 05-51, 2005.
  • Serge Kosinov (PhD thesis, supervised by S. Marchand-Maillet), Dec. 2005, "Machine learning approach to semantic augmentation of multimedia documents for efficient access and retrieval", Computer Science Department, University of Geneva. Jury: Dr. Samy Bengio, IDIAP, Switzerland, Prof. Matthieu Cord, ENSEA, France, Dr. Eric Bruno, Prof. Thierry Pun, Dr. Stéphane Marchand-Maillet
  • Nicolas Moënne-Loccoz (PhD thesis, supervised by S. Marchand-Maillet), Dec. 2005, "Dynamique des composantes visuelles pour la gestion de documents video par le contenu", Computer Science Department, University of Geneva. Jury: Dr. Cordelia Schmid, INRIA, France, Philippe Joly, IRIT, France, Dr. Jean-Marc Odobez, IDIAP, Switzerland, Prof. Thierry Pun, Dr. Stéphane Marchand-Maillet.
  • A.Peregoudov, A.Vinciarelli and H.Bourlard, "Towards using slide information to enhance speech transcription of meetings", IDIAP Technical Report IDIAP-RR-06-01 (2006)
  • R. Vilagut, I.Arsic and J.-P.Thiran, "Visual speaker identification using lip features", Master thesis, December 2005.
  • A.Vinciarelli, "Speakers role recognition in multiparty audio recordings", IDIAP Technical Report IDIAP-RR-06-35 (2006)

 

IM2.MPR

 

  • M. Gurban and J. Thiran, "An information theoretic perspective on multimodal signal processing", EPFL-ITS technical report No 2005-38, December 2005
  • R. Rienks, D. Zhang, D. Gatica-Perez, and W. Post, "Detection and Application of Influence Rankings in Small Group Meetings", IDIAP Research Report, May 2006 (collaboration with University of Twente and TNO, The Netherlands).
  • D. Zhang, D. Gatica-Perez, S. Bengio, and D. Roy, "The Team-Player Influence Model", IDIAP Research Report, Apr. 2006 (collaboration with MIT Media Lab, USA)
  • D. Joshi and D. Gatica-Perez, "Finding Groups of People in Google News", IDIAP Research Report, Dec. 2005 (collaboration with Penn State University, USA).
  • Humm, J. Hennebert and R. Ingold, "Combined Handwriting and Speech Modalities for User Authentication", Internal Publication of the Department of Informatics, University of Fribourg, 06-05, March 2006.
  • Wahl, J. Hennebert, A. Humm and R. Ingold, "A novel method to generate Brute-Force Signature Forgeries", Internal Publication of the Department of Informatics, University of Fribourg, 06-09, June 2006.
  • Johnny Mariethoz and Samy Bengio, "A Kernel Trick For Sequences Applied to Text-Independent Speaker Verification Systems", IDIAP Research Report 05-77, 2005. ftp://ftp.idiap.ch/pub/reports/2005/mariethoz-idiap-rr-05-77.pdf
  • N. Poh and S. Bengio, "Using Chimeric Users to Construct Fusion, "Classifiers in Biometric Authentication Tasks: An Investigation", IDIAP Research Report 05-59, 2005. ftp://ftp.idiap.ch/pub/reports/2005/rr05-59.pdf
  • Renato Villán, Sviatoslav Voloshynovskiy, Oleksiy Koval and Thierry Pun, "Multilevel 2D Bar Codes: Towards High Capacity Storage Modules for Multimedia Security and Management", IEEE Transactions on Information Forensics and Security, 2006. (submitted)

 

IM2.VP

 

  • Sébastien Marcel, Yann Rodriguez and Guillaume Heusch, "On the Recent Use of Local Binary Patterns for Face Authentication", IDIAP-RR 06-34, 2006.
  • Fabien Cardinaux, "Face Authentication Based on Local Features and Generative Models", IDIAP-RR 05-85, 2005.
  • Yann Rodriguez, Fabien Cardinaux, Samy Bengio, and Johnny Mariéthoz, "Measuring the Performance of Face Localization Systems", IDIAP-RR 05-53, 2005.
  • Tiffany Sauquet, Yann Rodriguez, and Sebastien Marcel, "Multiview Face Detection", IDIAP-RR 05-49, 2005.
  • Guillaume Heusch, Fabien Cardinaux and Sébastien Marcel, "Efficient Diffusion-based Illumination Normalization for Face Verification", IDIAP-RR 05-46, 2005.
  • Bunke, H., Dickinson, P., Kraetzl, M., "Theoretical and algorithmic framework for hypergraph matching", Int. Conference Image Analysis and Processing, ICIAP, Cagliari, Italy, 2005
  • Bunke, H., Dickinson, P., Humm, A., Irniger, Ch., Kraetzl, M., "Computer network monitoring and abnormal even detection using graph matching and multidimensional scaling", Industrial Conference on Data Mining,  ICDM, Leipzig, Germany, 2006
  • Pekalska, E., Harol A., Duin, R.P.W., Spillmann, B., Bunke, H., "Non-Euclidean or non-metric measures can be informative", Joint Workshop on Structural and Syntactic Pattern Recognition, and Statistical Techniques in Pattern Recognition, S&SSPR, Hong Kong, 2006 (based on joint work between Technical University of Delft and the University of Bern)
  • Spillmann, B., Neuhaus, M., Bunke, H., "Multiple classifier systems for embedded string patterns", Int. Workshop on Artificial Neural Networks in Pattern Recognition,  ANNPR, Reisensburg Castle, Germany, 2006.
  • S. Ba and J.-M. Odobez, "From Head Pose to Focus of Attention: a Study in Meetings", IDIAP Research Report, Jun. 2006.
  • K. Smith, S. Ba, J.-M. Odobez, and D. Gatica-Perez, "Tracking the Multi-Person Wandering Visual Focus of Attention", IDIAP Research Report, Oct. 2005.
  • J. Keomany and S. Marcel, "Active Shape Models using Local Binary Patterns", IDIAP Research Report, 2006
  • M. Sorci, G. Antonini and J. Thiran, "Relevant Component Analysis for static facial expression classification", EPFL-ITS Technical Report, 2005
  • J. Meynet, V. Popovici and J. Thiran, "Face Detection with Mixtures of Boosted Discriminant Features", EPFL-ITS Technical Report, 2005.
  • J. Meynet, V. Popovici and J. Thiran, "Face Detection with Boosted Gaussian Features", submitted to Pattern Recognition, 2006
  • J. Meynet, V. Popovici and J. Thiran, "Mixtures of Boosted Classifiers for Frontal Face Detection", submitted to Pattern Recognition, 2006
  • Maurer & Billard, "Exact Estimate of the Capacity and Improved Learning Rule of anIsing-like Model for Storing Semantically Correlated Patterns", Submitted, Neural Networks, 2006

 

IM2 Phase I

 

IM2.ACP

 

  • Neuhaus, M., Bunke, H., "Graph Matching - Exact and Error-tolerant Methods and the Automatic Learning of Edit Costs", submitted.
  • Schlapbach. A., Bunke, H., "A Writer Identification and Verification System Using HMM Based Recognizers", submitted.
  • Neuhaus, M., Bunke, H., "Edit distance based kernel functions for structural pattern classification", submitted.
  • Neuhaus, M., Bunke, H.: "Automatic learning of cost functions for graph edit distance", submitted.
  • Conrad Sanderson, Fabien Cardinaux, and Samy Bengio, "On Performance / Robustness / Complexity Trade­O#s in Face Verification", IDIAP­RR 74, 2004.
  • Conrad Sanderson and Kuldip K. Paliwal, "On the Use of Speech and Face Information for Identity Verification", IDIAP­RR 10, March 2004.
  • Fabien Cardinaux, "Local Features and 1D­HMMs for Fast and Robust Face Authentication", IDIAP­RR 17, 2005. (Submitted to British Machine Vision Conference (BMVC) 2005).
  • Guillaume Heusch, Fabien Cardinaux, and Sebastien Marcel, "Lighting normalization algorithms for face verification", IDIAP­COM 03, 2005.
  • Johnny Mariethoz and Samy Bengio, "A new speech recognition baseline system for numbers 95 version 1.3 based on torch",  IDIAP­RR 16, 2004.
  • Mikaela Keller, Johnny Mariethoz, and Samy Bengio, "Significance Tests for bizarre Measures in 2­Class Classification Tasks", IDIAP­RR 34, 2004.
  • Jerome Kowalczyk, "Une application de reconnaissance du locuteur : le user­customized password speaker verification", IDIAP­COM 04, 2004.
  • Norman Poh and Samy Bengio, "Improving single modal and multi­modal biometric authentication using f­ratio client­dependent normalisation", IDIAP­RR 52, 2004.
  • Norman Poh and Samy Bengio. An investigation of f­ratio client­dependent normalisation on biometric authentication tasks. IDIAP­RR 46, 2004.
  • Norman Poh and Samy Bengio, "A study of the effects of score normalisation prior to fusion in biometric authentication tasks", IDIAP­RR 69, 2004.
  • Norman Poh and Samy Bengio. Can chimeric persons be used in multi­modal biometric authentication experiments?", IDIAP­RR 20, 2005.
  • M. Gerber, "Evaluation von Vektorquantisierungsmethoden für das Finden lautlich ähnlicher Abschnitte in Sprachsignalen", Report Nr. 1, NCCR Projekt IM2.ACP, Institut TIK, ETH Zurich, December 2004.
  • M. Gerber, "Evaluation of features and algorithms to find common subsegments", Report Nr. 2, NCCR Project IM2.ACP, Institut TIK, ETH Zurich, January 2005.
  • Bruno Dumas, Catherine Pugin, Jean Hennebert, Dijana Petrovska-Delacrétaz, Rolf Ingold, Andreas Humm, Florian Evéquoz and Didier Von Rotz, “MyIDea – Multimodal Biometrics Database, Description Of Acquisition Protocols”, submitted to the Third COST275 workshop on Biometrics over the Internet.
  • B. Dumas, F. Evequoz, J. Hennebert, A. Humm, R. Ingold, D. Petrovska-Delacretaz, C. Pugin and D. Von Rotz, “MyIDea – Sensors Specifications and Acquisition Protocol“, Internal Publication of the Informatics Department, n° 05-12, University of Fribourg, June 2005
  • Norman Poh and Samy Bengio, "Compensating User-Specific Information with User-Independent Information in Biometric , Authentication Tasks", IDIAP-RR 05-44, 2005.
  • Norman Poh and Samy Bengio, "Towards Explaining the Success (Or Failure) of Fusion in Biometric Authentication", IDIAP-RR 05-43, 2005.
  • Norman Poh and Samy Bengio, "Can Chimeric Persons Be Used in Multimodal Biometric Authentication Experiments?", IDIAP-RR 05-20, 2005.
  • Johnny Mariéthoz and Samy Bengio, "A Bayesian Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification", IDIAP-RR 04-62, 2004.
  • Norman Poh and Samy Bengio, "Improving Single Modal and Multimodal Biometric Authentication Using F-ratio Client-Dependent Normalisation", IDIAP-RR 04-52, 2004.
  • J. Richiardi, A. Drygajlo, A. Palacios-Venin, R. Ludvig, O. Genton, L. Houmgny, "A Distributed Multimodal Biometric Authentication Framework", in 3rd COST 275 workshop "Biometrics on the Internet", Hatfield, Oct 27-28, 2005. (Submitted)
  • K. Kryszczuk, A. Drygajlo, "Robust Method of Reference Point Localization in Fingerprints",  3rd COST 275 Workshop “Biometrics on the Internet”, Hatfield, UK, Oct. 27-28, 2005. (submitted)
  • Norman Poh and Samy Bengio, "Evidences of Equal Error Rate Reduction in Biometric Authentication Fusion", IDIAP-RR 04-43, 2004
  • Mohamed F. BenZeghiba and Hervé Bourlard, "User-Customized Password Speaker Verification Using Multiple Reference and Background Models", IDIAP-RR 04-41, 2004
  • Fabien Cardinaux, Conrad Sanderson, and Samy Bengio, "User Authentication via Adapted Generative Models of Face Images", IDIAP-RR 04-38, 2004
  • Marc Saban and Conrad Sanderson, "On Local Features for Face Verification", IDIAP-RR 04-36, 2004
  • Norman Poh and Samy Bengio, "Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task", IDIAP-RR 04-17, 2004
  • S. Marcel, "Improving Face Verification using Symmetric Transformation", IDIAP-RR 03-68, 2003
  • Johnny Mariéthoz and Samy Bengio, "An Alternative To Silence Removal For Text-Independent Speaker Verification", IDIAP-RR 03-51, 2003
  • Norman POH Hoon Thian, Samy Bengio, "Towards Predicting Optimal Subsets of Base-Experts in Biometric Authentication Task", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Plamen Prodanov, Andrzej Drygajlo, "Multimodal Signal Fusion for User Goal Identification in Human-Robot Interaction Using Bayesian Networks", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • N. Poh, C. Sanderson and S. Bengio. Spectral Subband Centroids, "An Alternative Features for Speaker Verification", IDIAP Technical Report RR-03-26, 2003.
  • Norman Poh and Samy Bengio, "Variance Reduction Techniques in Biometric Authentication", IDIAP-RR 03-17, 2003.
  • Norman Poh and Samy Bengio, "Non-Linear Variance Reduction Techniques in Biometric Authentication", IDIAP-RR 03-26, 2003.
  • Christine Marcel, "Multimodal Identity Verification at IDIAP", IDIAP-Com 03-04, 2003.
  • Conrad Sanderson, "Speech Processing \& Text-Independent Automatic Person Verification", IDIAP-Com
  • Lapidot, I., "Self-Organizing-Maps With BIC For Speaker Clustering", IDIAP-RR 02-60, 2002.
  • Sébastien Marcel, "Robust Face Verification using Skin Color and Neural Networks", IDIAP-RR 02-49, 2002
  • Mohamed F. BenZeghiba and Hervé Bourlard, "User-Customized Password HMM Based Speaker Verification", IDIAP-RR 02-35, 2002.
  • Conrad Sanderson and Kuldip K. Paliwal, "Information Fusion and Person Verification Using Speech \& Face Information", IDIAP-RR 02-33, 2002.
  • Mohamed F. BenZeghiba and Hervé Bourlard, "User-Customized Password Speaker Verification based on HMM/ANN and GMM Models", IDIAP-RR 02-10, 2002.
  • V. Lemaire and F. Clérot, "SOM-Based Clustering for On-Line Fraud Behavior Classification: a Case Study'', IDIAP-RR 02-30, 2002,
  • "Samy Bengio, and Jerzy Korczak, Norman Poh, ""A Multi-sample Multi-source Model for Biometric Authentication'', IDIAP-RR
  • 02-14, 2002,"
  • F. Porée, J.  Mariéthoz, S. Bengio, and F. Bimbot, "The BANCA Database and Experimental Protocol for Speaker Verification'', IDIAP-RR 02-13, 2002,
  • "Quan Le and Samy Bengio, ""Hybrid generative-discriminative models for speech and speaker recognition'',  IDIAP-RR 02-06,
  • 2002"
  • S. Bengio, F.  Bimbot, J. Mariéthoz, V. Popovici, F. Porée, E. Bailly-Baillière, G. Matas, and B. Ruiz, "Experimental Protocol on the BANCA Database'',  IDIAP-RR 02-05, 2002,

 

IM2.ACP, IM2.MI

 

  • Mikaela Keller, Johnny Mariéthoz, and Samy Bengio, "Significance Tests for Bizarre Measures in 2-Class Classification Tasks", IDIAP-RR 04-34, 2004
  • Norman Poh and Samy Bengio, "How Do Correlation and Variance of Base-Experts Affect Fusion in Biometric Authentication Tasks?", IDIAP-RR 04-18, 2004
  • Conrad Sanderson and Kuldip K. Paliwal, "On the Use of Speech and Face Information for Identity Verification", IDIAP-RR 04-10, 2004
  • Sutapa Sarangi, "Enhanced Performance of Multimodal Biometric Systems by Confidence Estimation", IDIAP-Com 03-05, 2003

 

IM2.ACP, IM2.SP

 

  • Mohamed Faouzi BenZeghiba, Hervé Bourlard, "Combination od Speech and Speaker Recognition", poster presentation at MLMI'04 in Martigny Switzerland, June 2004

 

IM2.DI (ex AP)

 

  • Denis Lalanne, Agnes Lisowska, Eric Bruno, Mike Flynn, Maria Georgescul, Maël Guillemot, Bruno Janvier, Stéphane Marchand-Maillet, Mirek Melichar, Nicolas Moenne-Loccoz, Andrei Popescu-Belis, Martin Rajman, Maurizio Rigamonti, Didier von Rotz, Pierre Wellner.. "The IM2 Multimodal Meeting Browser Family", Technical report, Fribourg, March 2005.
  • Didier von Rotz, " Smart Meeting Minutes, une plate-forme permettant d'enregistrer, archiver, annoter et consulter des réunions", rapport final du projet CCTI numéro TI 07-02, November 2004.
  • Denis Lalanne, «SMAC, Smart Multimedia Archive for Conferences », Specifications v5.0, Denis Lalanne, 24th November 2004.
  • Stephane Sire and Denis Lalanne, "Smart Meeting Minutes Application specification”, Technical Report, August 2002
  • Denis Lalanne and Stephane Sire, "Analysis of end-user requirements and sample queries ", Technical Report, 2003
  • Denis Lalanne and Fuad Rahman, Building Digital libraries that reach users. Report of DIAL'04 discussion group on User experience, PARC, Palo Alto, CA, 2004
  • Jean-Philippe Zbinden, "PDA & Services multimédias", projet de diplôme 2003, University of Applied Sciences
  • Cristobal Martin & Christophe Pouly, "Système distribué d'enregistrement audio-vidéo", University of Applied Sciences, Fribourg, 2003
  • Catherine Pugin, "Smart Meeting Minutes Viewer", rapport projet de Bachelor, UniFr, 2003
  • Christoph Ehret, "Smart Meeting Minutes Writer", rapport projet de Bachelor, UniFr, 2003
  • Laurence Fidanza, "Smart Meeting Minutes Organizer", rapport projet de Bachelor, UniFr, 2003
  • Gay-Crosier Benoit & De Bruyne Tom, "Java networking using Bluetooth on Pocket PC", University of Applied Sciences, Fribourg, 2003
  • Laurent Prin & Patrick Terreaux, "Intégration d'un participant externe à une réunion par vidéo conférence", University of Applied Sciences, Fribourg, 2003
  • Dalila Mekhaldi, Denis Lalanne, Rolf Ingold. "Document/Speech Thematic Segmentation Through Their Alignment", in Thirteenth Conference on Information and Knowledge Management CIKM 2004
  • Denis Lalanne, Rolf Ingold, “STRUCTURING MULTIMEDIA ARCHIVES WITH STATIC DOCUMENTS”,  International Workshop on Image, Video, and Audio Retrieval and Mining on October 25-26, 2004, Université de Sherbrooke, Canada
  • Dalila Mekhaldi, Denis Lalanne, "Thematic Alignment of Documents with Recorded Speech", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Ardhendu Behera, Denis Lalanne, "Looking at Meeting Documents: Events Detection and Documents Identification", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • "Smart Meeting Minutes Application specification'', Technical Report, August 2002,

 

IM2.DI (ex AP), IM2.IP, IM2.MDM

 

  • Lalanne D., Lisowska A., Bruno E., Flynn M., Georgescul M., Guillemot M., Janvier B., Marchand-Maillet S., Melichar M., Moenne-Loccoz N., Popescu-Belis A., Rajman M., Rigamonti M., von Rotz D. & Wellner P. (2005) - "The IM2 Multimodal Meeting Browser Family", IM2 Technical Report, 17 p, March 2005.

 

IM2.DI (ex AP), IM2.MDM

 

  • Andrei Popescu-Belis, Denis Lalanne, "Resolution of References to Document Elements in Meeting Dialogues", poster presentation at MLMI'04 in Martigny Switzerland, June 2004

 

IM2.IIR

 

  • Alessandro Vinciarelli and Jean-Marc Odobez, "Application of Information Retrieval Technologies to Presentation Slides", IDIAP RR-05-36, 2005.
  • D. Grangier and S. Bengio, Inferring Document Similarity from Hyper-links, IDIAP-RR 05-21, 2005.
  • D. Grangier and A. Vinciarelli, "Noisy Text Clustering", IDIAP-RR 04-31, 2004
  • A.Vinciarelli, "Handwritten document retrieval", IDIAP-RR 04-12, 2004
  • Mikaela Keller and Samy Bengio, "Theme Topic Mixture Model: A Graphical Model for Document Representation", IDIAP-RR 04-05, 2004
  • D. Grangier and A. Vinciarelli, "Making Retrieval Faster Through Document Clustering", IDIAP-RR 04-02, 2004
  • Mikaela Keller and Samy Bengio, "Textual Data Representation", IDIAP-RR 03-74, 2003
  • Alessandro Vinciarelli, and Hervé Bourlard, "Information Retrieval on Noisy Text, David Grangier", IDIAP-Com 03-08, 2003
  • T. Bouzerda, C. Jelmini, S. Marchand-Maillet, "A flexible framework for the development of XML protocols: Applications to MRML", technical report number No. 03.03, Computer Vision and Multimedia Laboratory, Computing Centre, University of Geneva, rue General Dufour, 24, CH-1211, Geneva, Switzerland, June, 2003
  • S. Marchand-Maillet, "Collection Guiding: 1- Principles", technical report number No. 03.06, Computer Vision and Multimedia Laboratory, Computing Centre, University of Geneva, rue General Dufour, 24, CH-1211, Geneva, Switzerland, 2003.
  • S. Marchand-Maillet, "Collection Guiding: 2- Theoretical developments", technical report number No. 04.01, Computer Vision and Multimedia Laboratory, Computing Centre, University of Geneva, rue General Dufour, 24, CH-1211, Geneva, Switzerland, 2004.
  • N. Moenne-Loccoz, "Video content decomposition for efficient indexing", technical report number No. 04.02, Computer Vision and Multimedia Laboratory, Computing Centre, University of Geneva, rue General Dufour, 24, CH-1211, Geneva, Switzerland, 2004
  • E. Bruno, N. Moenne-Loccoz , S. Marchand-Maillet. Unsupervised Event Discrimination Based on Nonlinear Temporal Modeling of Activity Content. Submitted to the Pattern Analysis and Applications Journal (PAA), Special issue on “Video based event detection”. January 2004
  • N. Moenne-Loccoz, E. Bruno and S. Marchand-Maillet.. Knowledge-based Event Detection in Video Streams from Salient Regions of Activity.  Submitted to the Pattern Analysis and Applications Journal (PAA), Special issue on “Video based event detection”. January 2004
  • Bruno Janvier, Carlo Jelmini, Sergei Kosinov, Nicolas Moënne-Loccoz, Eric Bruno, Stéphane Marchand-Maille, "Integrated Multimedia Information Management", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • David Grangier, "Noisy Text Clustering", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Stephane Marchand-Maillet "Meeting Record Modelling for Enhanced Browsing", Tech. Rep. 03.01, Computer Vision and Multimedia Laboratory, Computing Centre, University of Geneva, rue General Dufour, 24, CH-1211 Geneva, Switzerland, March 2003
  • Stephane Marchand-Maillet, "MRML: Steps towards version 2". Tech. Rep. 03.02, Computer Vision and Multimedia Laboratory, Computing Centre, University of Geneva, rue General Dufour, 24, CH-1211 Geneva, Switzerland, March 2003.
  • Iain McCowan, Jitendra Ajmera, and Darren Morre, "An Online System for Automatic Annotation of Audio Documents", IDIAP-RR 03-39, 2003
  • Tayeb Bouzerda, Carlo Jelmini and Stephane Marchand-Maillet, "A flexible framework for the development of XML protocols: Applications to MRML", Technical Report 03.03, Viper group, Computer Vision and Multimedia Lab. University of Geneva, June 2003
  • Jean-Marc Odobez and Datong Chen, "Video Text Recognition Based on Markov Random Field and grayscale consistency constraint'', IDIAP-RR 02-18, 2002,

 

IM2.IIR, IM2.MI

 

  • McCowan, D. Gatica-Perez, S. Bengio, and G. Lathoud, "Automatic Analysis of Multimodal Group Actions in Meetings", IDIAP-RR 03-27, 2003.

 

IM2.IIR, IM2.SA

 

  • Mark Barnard and Jean-Marc Odobez, "Sports Event Recognition using Layered HMMs", IDIAP-RR 05-07, 2005.
  • Datong Chen, "Text detection and recognition in images and video sequences", IDIAP-RR 03-44, 2003

 

IM2.IIR, IM2.SP

 

  • McCowan, D. Moore, J. Dines, D. Gatica-Perez, M. Flynn, P. Wellner, and H. Bourlard, "On the Use of Information Retrieval Measures for Speech Recognition Evaluation", IDIAP-RR 04-73, 2004.
  • J. Ajmera, I. McCowan, and H. Bourlard, "Robust Audio Segmentation", IDIAP-RR 04-35, 2004
  • Guillaume Lathoud, Iain A. McCowan, Jean-Marc Odobez, "Unsupervised Location-Based Segmentation of Multi-Party Speech", poster presentation at MLMI'04 in Martigny Switzerland, June 2004

 

IM2.IP

 

  • M. Flynn and P. Wellner, "In Search of a Good BET", IDIAP-Com 03-11, 2003
  • McCowan, D. Gatica-Perez, and S. Bengio, "Meeting Data Collection Specifications", IDIAP-Com 03-10, 2003
  • Oleksandr DRUTSKYY, Hassina BOUNIF, "A Multimodal Database Framework for Multimedia Meeting Annotations", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Hassina BOUNIF, "Predictive database schema evolution", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Mike Flynn, Mael Guillemot, Pierre Wellner, "Poster and Demo: The Ferret Meeting Browser", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Darren Moore, Mike Flynn, Frank Formaz, Daniel Gatica-Perez, Mael Guillemot, Olivier Masson, Iain McCowan, Pierre Wellner, "A Technical Overview of IDIAP Infrastructure for the Acquisition and Distribution of Multimodal Meeting Data", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • McCowan and D. Moore, "Small Microphone Array: Algorithms and Hardware", IDIAP-Com 03-07, 2003.
  • D. Moore, "The IDIAP Smart Meeting Room", IDIAP-Com, 02-07,2002

 

IM2.IP, IM2.IIR

 

  • P. Wellner, M. Flynn, and M. Guillemot, "Browsing Recorded Meetings with Ferret", IDIAP-RR 04-32, 2004
  • Iain McCowan, Jitendra Ajmera, and Darren Morre, "An Online System for Automatic Annotation of Audio Documents", IDIAP-RR 03-39, 2003

 

IM2.IP, IM2.IIR, IM2.MI

 

  • McCowan, D. Gatica-Perez, S. Bengio, and H. Bourlard, "Towards Computer Understanding of Human Interactions", IDIAP-RR 03-45, 2003

 

IM2.IP, IM2.MDM

 

  • Agnes Lisowska, Guillaume Lathoud, Joanne Moore, "Transcribing Meetings: Applying Speech Transcription Methods Developed at ICSI to the New Meeting Corpus Recorded at IDIAP", poster presentation at MLMI'04 in Martigny Switzerland, June 2004

 

IM2.IP, IM2.MI, IM2.SA, IM2.SP

 

  • Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan and Guillaume Lathoud, "Modeling Individual and Group Actions in Meetings With Layered HMMs", IDIAP-RR 04-33, 2004

 

IM2.MDM

 

  • Popescu-Belis A., "Dialogue Acts: One or More Dimensions?", ISSCO Working Paper n. 62, January 2005, University of Geneva, 13 p., 2005.
  • Ailomaa, M. Alphonse, Y., Ghorbel, H. Kadlec, V., Lisowska, A., Rajman, M., Trutnev, A.,  "Natural Language Processing in Archivus: An Overview",  IM2.MDM Technical report 14. March 2005.
  • Pierre Andrews and Martin Rajman, "Thematic Annotation: extracting concepts out of documents", EPFL Technical report IC/2004/68. Lausanne. August 9th, 2004
  • Silvia Quarteroni and Martin Rajman, "Introducing Reset Patterns: an Extension to a Rapid Dialogue Prototyping Methodology", EPFL Technical report IC/2004/58. Lausanne. July 9th, 2004
  • Ruch P., Chichester G., Cohen G., Coray G., Ehrler F., Ghorbel, H., Müller H., Pallotta V., "Report on the TREC 2003 Experiment", In Genomic Track, TREC 2003
  • Georgescul M. & Popescu-Belis A., "Database and TQB Demonstrator: Installation Manual", Project Report IM2.MDM-10, (forthcoming)
  • Lisowska, A., "Multimodal Interface Design for the Multimodal Meeting Domain: Preliminary Indications from a Query Analysis Study", Project Report IM2.MDM-11, 30 pages, November 2003
  • Clark A. & Popescu-Belis A., "Multilevel Dialogue Acts and Feature Selection", Project Report IM2.MDM-12, WP.STAT Deliverable, 19 pages, March 2004
  • Popescu-Belis A. & Palacio E., "Data formatting and conversion procedures: summary of results", Project Report IM2.MDM-13, (forthcoming)
  • Silvia Quarteroni, Martin Rajman, "Introducing Reset Patterns: an extension to a Rapid Dialogue Prototyping Methodology", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Philipp Frei, Harald Romsdorfer, Beat Pfister, "Strong and weak assimilated mixed-lingual text analysis: a comparison", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Carlo Jelmini, Stéphane Marchand-Maillet, "Ontology Reasoning for Multimedia Semantic Retrieval", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Andrei Popescu-Belis, Maria Georgescul, "TQB: a Transcript-based Query and Browsing Interface for Meetings", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Popescu-Belis, "Resources, Tools and Projects for Multimodal Dialogue Understanding and Management: a Web-based Review", Report IM2.MDM-06, June 2003.
  • M. Georgescul, "SOAP Webservice for querying the IM2.MDM dialogue database: Java package documentation", April 2003.
  • Pallotta V. and Todirascu A. "Ontologies and Information Extraction", Proceedings of the Eurolan 2003 Workshop, Bucharest, Romania August 2003.
  • Vincenzo Pallotta, "Computational Dialogue Models", March 2003.
  • Alexander Clark, "Machine Learning Approaches to Shallow Parsing: A Literature Review," March 2003.
  • Andrei Popescu-Belis, Alexander Clark, Maria Georgescul, Marianne Starlander, Sandrine Zufferey, "A Thematic Bibliography on Dialogue Processing", June 2003.
  • Andrei Popescu-Belis, "Shallow Dialogue Analysis: Definition, Annotation, Visualisation",  July 2003.
  • Vincenzo Pallotta, Hatem Ghorbel, "Argumentative Segmentation and Annotation Guidelines", June 2003.
  • Andrei Popescu-Belis, "Dialogue act tagsets for meeting understanding: an abstraction based on the DAMSL", Switchboard and ICSI-MR tagsets, September 2003.

 

IM2.MI

 

  • Mark Barnard and Jean-Marc Odobez, "Sports event recognition using layered hmms", IDIAP-RR 07, 2005.
  • S. Chiappa and S. Bengio, "Sequence classifcation with input-output hidden markov models", IDIAP-RR 13, 2004.
  • Ronan Collobert, "Large Scale Machine Learning", PhD thesis, Universite de Paris VI, 2004.
  • Ronan Collobert, "Large scale machine learning", IDIAP-RR 42, IDIAP, 2004.
  • Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan, and Guillaume Lathoud, "Modeling Individual and Group Actions in Meetings With Layered HMMs", IDIAP-RR 33, Martigny, Switzerland, 2004. (submitted for publication)
  • J. del R. Millán, Silvia Chiappa, "Eeg-based bci systems and idiap eeg database", IDIAP-RR 64, 2003.
  • Mikaela Keller, Samy Bengio, and Siew Yeung Wong, "Surprising Outcome While Benchmarking Statistical Tests", IDIAP-RR 05-38, 2005.
  • Alexei Pozdnoukhov and Samy Bengio, "A Kernel Classifier for Distributions", IDIAP-RR 05-32, 2005.
  • D. Gatica-Perez, G. Lathoud, J.-M. Odobez, and I. McCowan, "Audio-visual probabilistic tracking of multiple speakers in meetings", IDIAP-RR 05-27, 2005.
  • Yves Grandvalet, Johnny Mariéthoz, and Samy Bengio, "A Probabilistic Interpretation of SVMs with an Application to Unbalanced Classification", IDIAP-RR 05-26, 2005.
  • S. Bengio, "Joint Training of Multi-Stream HMMs", IDIAP-RR 05-22, 2005.
  • David Barber, "Variational Information Maximization for Population Coding", IDIAP-RR 04-85, 2004.
  • Felix Agakov and David Barber, "Variational Information Maximization in Gaussian Channels", IDIAP-RR 04-88, 2004.
  • David Barber, "The Auxiliary Variable Trick for deriving Kalman Smoothers", IDIAP-RR 04-87, 2004.
  • Felix Agakov and David Barber, "An Auxiliary Variational Method", IDIAP-RR 04-86, 2004.
  • Christos Dimitrakakis and Samy Bengio, "Estimates of Parameter Distributions for Optimal Action Selection", IDIAP-RR 04-72, 2004.
  • David Barber, "Are two Classifiers performing equally? A treatment using Bayesian Hypothesis Testing", IDIAP-RR 04-57, 2004.
  • Alexei Pozdnoukhov and Samy Bengio, "Invariances in Kernel Methods: From Samples to Objects", IDIAP-RR 04-56, 2004.
  • Ronan Collobert, "Large Scale Machine Learning", IDIAP-RR 04-42, 2004
  • S. Chiappa and S. Bengio, "Sequence Classification with Input-Output Hidden Markov Models", IDIAP-RR 04-13, 2004
  • Ronan Collobert and Samy Bengio, "Links between Perceptrons, MLPs and SVMs", IDIAP-RR 04-06, 2004
  • S. Bengio  M. Keller, and J. Mariéthoz, "The Expected Performance Curve", IDIAP-RR 03-85, 2003
  • Silvia Chiappa and José del R. Millán, "EEG-based BCI Systems and IDIAP EEG Database", IDIAP-RR 03-64, 2003
  • S. Chiappa and S. Bengio, "HMM and IOHMM Modeling of EEG Rhythms for Asynchronous BCI Systems", IDIAP-RR 03-49, 2003
  • Quan Le and Samy Bengio, "Noise Robust Discriminative Models", IDIAP-RR 03-40, 2003
  • J. Kronegg, S. Voloshynovskiy, T. Pun, "Brain-computer interface model: upper-capacity bound, signal-to-noise estimation, and optimal number of symbols", submitted, Jan. 2004.
  • Alexei Pozdnoukhov, Samy Bengio, "Introducing invariances into SVM algorithms by Tangent Vector Kernels", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Silvia Chiappa, Samy Bengio, "Sequence Classification with Input-Output Hidden Markov Models", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Christos Dimitrakakis, Samy Bengio, "Methods for applying boosting to HMMs for speech recognition", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Mikaela Keller, Johnny Mariéthoz, Samy Bengio, "Performance Measures for 2-Class Classification Tasks with a desired Range of Operating Points", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Teodor Alecu, Sviatoslav Voloshynovskiy, Thierry Pun, "Regularized two-step brain activity reconstruction from spatio-temporal EEG data", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Ronan Collobert, Samy Bengio, "MLP=SVM^2", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Jean-François Paiement, David Barber, Samy Bengio, "A Graphical Model for Music Analysis and Generation", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • S. Chiappa and S. Bengio, "Nonlinear Analysis of Cognitive and Motor-related EEG Signals",  IDIAP-RR 03-14, 2003.
  • Yoshua Bengio and Jean-Sébastien Senécal, "Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model", IDIAP-RR 03-35, 2003.
  • J. del R. Millán, "On the Need for On-Line Learning in Brain-Computer Interfaces", IDIAP-RR 03-30, 2003
  • Pozdnoukhov and S. Bengio, "From Samples to Objects in Kernel Methods", IDIAP-RR 03-29, 2003.
  • R. Collobert and S. Bengio, "A New Margin-Based Criterion for Efficient Gradient Descent", IDIAP-RR 03-16, 2003.
  • A.Pozdnoukhov, "The analysis of kernel ridge regression learning algorithm", IDIAP-RR 02-54, 2002.
  • V. Lemaire, "Bagging Using the VMSE Cost Function, IDIAP-RR 02-27, 2002.
  • T. Alecu, "The inverse problem: solutions and resolutions", Comp. Science Dpt / CVML, Report 2003.04, September 2003
  • J. Kronegg, "Capacity study of the memoryless channel with additive independant Gaussian noise and its application to brain-computer interfaces", Comp. Science Dpt / CVML, Report 2003.05, September 2003
  • "Christos Dimtirakakis and Samy Bengio, ""Online Policy
  • Adaptation for Ensemble Algorithms'', IDIAP-RR 02-28, 2002,"
  • V. Lemaire, "Bagging Using the VMSE Cost Function'', IDIAP-RR-02-27, 2002,
  • S. Bengio, "An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition'', IDIAP-RR 02-26, 2002,
  • Nicolas Gilardi, Samy Bengio, and Mikhail Kanevski, "Estimation of Conditional Distributions using Gaussian Mixture Models'', IDIAP-RR 02-03, 2002,

 

IM2.MI, IM2.IP

 

  • R. Collobert, S. Bengio, and J. Mariéthoz, "Torch: a modular machine learning software library", IDIAP-RR 02-46, 2002.
  • S. Bengio, "An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition", IDIAP-RR 02-26, 2002.

 

IM2.MI, IM2.SA

 

  • Julien Tiphaigne and Sebastien Marcel, "A video package for Torch",  IDIAP-Com 04-02, 2004

 

IM2.MI, IM2.SP

 

  • David Barber and Bertrand Mesot, "Construction and comparison of approximations for switching linear gaussian state space models", IDIAP-RR 05-06, 2005.
  • David Barber, "A Stable Switching Kalman Smoother", IDIAP-RR 04-89, 2004.

 

IM2.SA

 

  • L. Granai and P. Vandergheynst, "Sparse Approximation by Linear Programming: Measuring the Error with the $\ell_1$ Norm", No TR-ITS-2005.015, June 2005.
  • G. Monaci, O. Divorra Escoda and P. Vandergheynst, "Analysis of Multimodal Sequences Using Geometric Video Representations", No TR-ITS-2005.017, June 2005.
  • P. Jost, P. Vandergheynst and P. Frossard, "Tree-Based Pursuit: Algorithm and Properties", No 2005-13, May 2005.
  • Manic, R. Figueras i Ventura, M. Flierl and P. Vandergheynst, "An improved decoding scheme for Matching Pursuit Streams", No TR-ITS-2005.011, April 2005.
  • Rahmoune, P. Vandergheynst and P. Frossard, "Sparse Approximation Using M-Term Pursuits with Applications to Image and Video Compression", No 2005.03, January 2005.
  • Rahmoune, P. Vandergheynst and P. Frossard, "The M-Term Pursuit for Image Representation and Compression", No 2005.04, January 2005.
  • Rahmoune, P. Vandergheynst and P. Frossard, "Scalable Video Representation and Coding Using Sparse Approximation", No 2005.05, January 2005.
  • O. Divorra Escoda and P. Vandergheynst, "An Analysis of Temporal Adaptivity in 3D Wavelet Video Coding", No 06/2005, January 2005.
  • P. Frossard and P. Vandergheynst, "Unequal Error Protection of Atomic Image Streams", No TR-ITS-2005.007, January 2005.
  • R. Figueras i Ventura, O. Divorra Escoda and P. Vandergheynst, "A Matching Pursuit Full Search Algorithm for Image Approximations", No 31/2004, December 2004.
  • O. Divorra Escoda, L. Granai and P. Vandergheynst, "On the Use of A Priori Information for Sparse Signal Approximations", No 23/2004, November 2004.
  • O. Divorra Escoda, M. Flierl and P. Vandergheynst, "Intra-Adaptive Motion-Compensated Lifted Wavelets for Video Coding", No 2004-27, November 2004.
  • G. Monaci, O. Divorra Escoda and P. Vandergheynst, "Multimodal Analysis Using Redundant Parametric Decompositions", No TR-ITS-2004.024, October 2004.
  • O. Divorra Escoda, P. Vandergheynst and M. Bierlaire, "Video Representation Using Greedy Approximations Over Redundant Parametric Dictionaries", No ITS-2004.019, September 2004.
  • O. Divorra Escoda, L. Granai and P. Vandergheynst, "On the Use of A Priori Information for Sparse Signal Representations", No 18/2004, September 2004.
  • Günter, S., "Vergleich von Erkennungsmethoden", Technical Report IAM TR-04-001, 2004.
  • B.Fasel, "Automatic Face Analysis with Unsupervised Convolutional Neural Networks", Computer Vision Lab, ETH Zuerich, RR No 268, September 2004.
  • S. Ba and J.-M. Odobez, "A Rao-Blackwellized Mixed State Particle Filter for Head Pose Esti-mation", IDIAP Research Report RR-05-35, June 2005.
  • (*)  Gatica-Perez, G. Lathoud, J.-M. Odobez, and I. McCowan, "Audio-visual Probabilistic Tracking of Multiple Speakers in Meetings", submitted to IEEE Trans. on Speech and Audio Processing, Jun. 2005.
  • J.-M. Odobez, D. Gatica-Perez, and S. Ba, "Embedding Motion in Model-Based Stochastic Tracking", submitted to Trans. Image Processing, Oct. 2004, under second round of review.
  • R. Villan, S. Voloshynovskiy, F. Deguillaume, Y. Rytsar, O. Koval, E. Topak, E. Rivera and T. Pun, "A theoretical framework for data-hiding in digital text documents", 9th IFIP TC-6 TC-11 CMS 2005, Conf. on Communications and Multimedia Security, 19-21 September 2005, Salzburg, Austria. Subm.
  • Just and S. Marcel, "Two-Handed Gesture Recognition", IDIAP-RR 05-24, 2005.
  • Jean-Marc Odobez and Daniel Gatica-Perez, "Motion likelihood and proposal modeling in Model-Based Stochastic Tracking", IDIAP-RR 04-61, 2004.
  • Just, O. Bernier, and S. Marcel, "HMM and IOHMM for the Recognition of Mono- and Bi-Manual 3D Hand Gestures", IDIAP-RR 04-39, 2004
  • Just, S. Marcel, O. Bernier, and J.E. Viallet, "Reconnaissance de gestes 3D bi-manuels", IDIAP-RR 03-79, 2003
  • Sileye Ba and Jean-Marc Odobez, "A Probabilistic Framework for Joint Head Tracking and Pose Estimation", IDIAP-RR 03-78, 2003
  • Mark Barnard and Jean-Marc Odobez, "Robust Playfield Segmentation using MAP Adaptation", IDIAP-RR 03-77, 2003
  • Pozdnoukhov and S. Bengio, "Tangent Vector Kernels for Invariant Image Classification with SVMs", IDIAP-RR 03-75, 2003
  • Jean-Marc Odobez, Daniel Gatica-Perez, and Sileye Ba, "Embedding Motion in Model-Based Stochastic Tracking", IDIAP-RR 03-72, 2003
  • Dong Zhang, S. Z. Li, and Daniel Gatica-Perez, "Real-Time Face Detection Using Boosting Learning in Hierarchical Feature Spaces", IDIAP-RR 03-70, 2003
  • Alessandro Vinciarelli, :Offline Cursive Handwriting: From Word To Text Recognition", IDIAP-RR 03-24, 2003
  • O. Koval, “Distributed Single Source Coding”, SIMILAR WP6&9 meeting, EPF-Lausanne, May 25-26, 2004
  • J. Vila, “Distributed Single Source Coding with Side Information” at ETH Zentrum, IM2.SA Workshop, April 23, 200
  • Guenter, S., "Vergleich von Erkennungsmethoden", IAM TR-04-001, 2004
  • S. Voloshynovskiy, O. Koval, T. Pun, "Image denoising based on the edge process model, Subm. to Elsevier Science, Signal Processing, April 2004.
  • S. Voloshynovskiy, O. Koval, K. Mihcak, T. Pun, "Data hiding capacity analysis for real images based on stochastic non-stationary models", Subm. to IEEE Trans. Image Processing, June 2004
  • S. Voloshynovskiy, O. Koval, F. Perez-Gonzalez, K. Mihcak, T. Pun, Data hiding with host state at the encoder and partial side information at the decoder, Subm. to IEEE Trans. Signal Processing, June 2004
  • R. Gribonval and P. Vandergheynst, "On the exponential convergence of matching pursuit in quasi-incoherent dictionaries", submitted to IEEE Transactions on Information Theory, 2004
  • Ivana Arsic, Jean-Phillipe Thiran, "Information Theoretic Feature Selection for Lipreading", poster presentation at MLMI'04 in Martigny  Switzerland, June 2004
  • Mark Barnard, Jean-Marc Odobez, "Robust Playfield Segmentation using MAP Adaptation" ,poster presentation at MLMI'04 in Martigny  Switzerland, June 2004
  • J.E. Vila Forcen, S. Voloshynovskiy, O. Koval, T. Pun, "Distributed single source coding framework for passport photo images", poster presentation at MLMI'04 in Martigny  Switzerland, June 2004
  • Pedro Quelhas, Jean-Marc Odobez, "Fusion of Structural and Color Local Descriptors for Enhanced Object Recognition", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Kevin Smith, Daniel Gatica-Perez, "A Distributed Sampling Method for Multi-Object Tracking", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Petr Doubek, Luc Van Gool, "Viewpoint Selection in Multi-Camera Setup", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Mihai Osian, Tinne Tuytelaars, Luc Van Gool, Fitting super-ellipses to incomplete contours", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Sileye O. Ba, Jean-Marc Odobez, "Joint Head Tracking and Pose Estimation with Multiple Cues and Model Adaptation", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Sylvain Calinon, Aude Billard, "Gesture Recognition and Reproduction for a Humanoid Robot using Hidden Markov Models", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Vlad Popovici, Jean-Philippe Thiran, Yann Rodriguez, Sebastien Marcel, "Assessing the performance of face detection and localization algorithms", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Florent Monay, Daniel Gatica-Perez, "PLSA-based Image Auto-Annotation: Constraining the Latent Space", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • David Leroux, "EPFL USB Camera", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Nicolas Moënne-Loccoz, Eric Bruno, Stéphane Marchand-Maillet, "Salient Decomposition of the Visual Content of Video Shot", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Vlad Popovici, Jean-Philippe Thiran, "Adaptive Dictionary for Kernel Matching Pursuit", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Datong Chen and Jean-Marc Odobez, "Video Text Segmentation Using Particle Filters", IDIAP-RR 03-43, 2003
  • A Localization/Verification Scheme for Finding Text in Images and Video Frames Based on Datong Chen, Jean-Marc Odobez, and Jean-Philippe Thiran, "Contrast Independent Features and Machine Learning Methods", IDIAP-RR 03-42, 2003
  • S. Ikbal, H. Hermansky, and H. Bourlard, "Nonlinear Spectral Transformations for Robust Speech Recognition", IDIAP-RR 03-36, 2003
  • A.Vinciarelli, S.Bengio and H.Bunke, "Offline Recognition of Unconstrained Handwritten Texts Using HMMs and Statistical Language Models", IDIAP-RR 03-22, 2003.
  • Datong Chen, Jean-Marc Odobez, and Hervé Bourlard, "Text Detection and Recognition in Images and Videos", IDIAP-RR 02-61, 2002.
  • J-M. Odobez, D. Gatica-Perez, and M. Guillemot, "On Spectral Methods and the Structuring of Home Videos", IDIAP-RR 02-55, 2002.
  • Sébastien Marcel, "Evaluation Protocols and Comparative Results for the Triesch Hand Posture Database", IDIAP-RR 02-50, 2002
  • Sébastien Marcel, "Gestures for Multi-Modal Interfaces: A Review", IDIAP-RR 02-34, 2002.
  • Vinciarelli, A. and Bengio, S., "Transforming the feature vectors to improve HMM based cursive word recognition systems", IDIAP-RR 02-32, 2002.
  • Datong Chen and Jean-Marc Odobez, "A New Method of Contrast Normalization for Verification of Extracted Video Text Having Complex Backgrounds", IDIAP-RR 02-16, 2002.
  • Frederic Kottelat and Jean-Marc Odobez, "Audio- Video Person Clustering In Video Databases", IDIAP-RR 03-46, 2003
  • "Daniel Gatica-Perez, Alexander Loui, and Ming-Ting Sun, ""Finding Structure in Consumer Videos by Probabilistic
  • Hierarchical Clustering'', IDIAP-RR 02-22, 2002, submitted to IEEE Trans. on Circuits and Systems for Video Technology"
  • Datong Chen and Jean-Marc Odobez, "Comparison of Support Vector Machine and Neural Network for Text Texture Verification'', IDIAP-RR~02-19, 2002

 

IM2.SA, IM2.ACP

 

  • S. Marcel, P. Jost, P. Vandergheynst and J. Thiran, "Face Authentication using Client-specific Matching Pursuit", IDIAP-RR 04-78, December 2004
  • Y. Rodriguez, F. Cardinaux, S. Bengio, and J. Mariéthoz, "Estimating the Quality of Face Localization for Face Verification", IDIAP-RR 04-07, 2004
  • Conrad Sanderson and Samy Bengio, "Statistical Transformation Techniques for Face Verification Using Faces Rotated in Depth", IDIAP-RR 04-04, 2004
  • S. Marcel, "Face Verification using LDA and MLP on the BANCA database", IDIAP-RR 03-66, 2003
  • Tamas Varga, Horst Bunke, "Training of Handwriting Recognition Systems Using Synthetic Data", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Andreas Schlapbach, Horst Bunke, "Using HMM Based Recognizers for Writer Identification and Verification", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Michel Neuhaus, Horst Bunke, "A Probabilistic Model for Learning the Edit Operation Costs in Graph Matching", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Yann Rodriguez, Fabien Cardinaux, Samy Bengio, Johnny Mariéthoz, "Estimating the Quality of Face Localization for Face Verification", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • C. Sanderson, "Face Processing & Frontal Face Verification", IDIAP Research Report 03-20, Martigny, Switzerland, April 2003.

 

IM2.SP

 

  • S. Bengio and H. Bourlard, Multi Channel Sequence Processing, IDIAP-RR 05-04, 2005.
  • Iain McCowan, Maganti Hari Krishna, Daniel Gatica-Perez, Darren Moore, and Sileye Ba, Speech Acquisition in Meetings with an Audio-Visual Sensor Array, IDIAP-RR 05-03, 2005.
  • T. Kaufmann. Ein HPSG-System zur Anwendung in der Spracherkennung. Zwischenbericht zum Nationalfonds-Projekt 105211-104078/1:Rule-Based Language Model for Speech Recognition. Institut TIK, ETH Zürich, Januar 2005.
  • B. Pfister und R. Beutler. Improving Speech Recognition thru Linguistics. Jahresbericht 2004 für das Projekt COST 278. Institut  TIK, ETH Zürich, Januar 2005.
  • Guillaume Lathoud, Mathew Magimai.-Doss, Bertrand Mesot, and Hervé Bourlard, Unsupervised Spectral Substraction for Noise-Robust ASR, IDIAP-RR 05-42,
  • D. Vandromme, Harmonic Plus Noise Model for Concatenative Speech Synthesis, IDIAP-RR 05-37, 2005.
  • S. Ikbal, Nonlinear Feature Transformations for Noise Robust Speech Recognition, IDIAP-RR 04-70, 2004.
  • Mathew Magimai.-Doss, John Dines, Hervé Bourlard, and Hynek Hermansky, Phoneme vs Grapheme Based Automatic Speech Recognition, IDIAP-RR 04-48, 2004.
  • Vivek Tyagi, Hervé Bourlard, and Christian Wellekens, On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR, IDIAP-RR 05-09, 2005.
  • S. Ikbal, H. Misra, H. Bourlard, and H. Hermansky, "Phase AutoCorrelation (PAC) Features for Noise Robust ASR", IDIAP-RR 04-40
  • Petr Fousek, Petr Svojanovsky, Frantisek Grezl, and Hynek Hermansky, "New Nonsense Syllables Database -- Analyses and Preliminary ASR Experiments", IDIAP-RR 04-29, 2004
  • Mathew Magimai.-Doss and Hervé Bourlard, "On the Adequacy of Baseform Pronunciations and Pronunciation Variants", IDIAP-RR 04-27, 2004
  • Jithendra Vepa and Simon King, "Subjective Evaluation of Join Cost Functions Used in Unit Selection Speech Synthesis", IDIAP-RR 04-26, 2004
  • Mohamed F. BenZeghiba and Hervé Bourlard, "Posteriori Probabilities and Likelihoods Combination for Speech and Speaker Recognition", IDIAP-RR 04-23, 2004
  • Guillermo Aradilla, John Dines, and Sunil Sivadas, "Using RASTA in task independent TANDEM feature extraction", IDIAP-RR 04-22, 2004
  • Mathew Magimai.-Doss, Todd A. Stephenson, Shajith Ikbal, and Hervé Bourlard, "Modelling Auxiliary Features in Tandem Systems", IDIAP-RR 04-21, 2004
  • Johnny Mariéthoz and Samy Bengio, "A New Speech Recognition Baseline System for Numbers 95 Version 1.3 Based on Torch", IDIAP-RR 04-16, 2004
  • Guillaume Lathoud and Iain A. McCowan, "A Sector-Based Approach for Localization of Multiple Speakers with Microphone Arrays", IDIAP-RR 04-15, 2004
  • Guillaume Lathoud, Iain A. McCowan, and Jean-Marc Odobez, "Short-Term Spatio-Temporal Clustering of Sporadic and Concurrent Events", IDIAP-RR 04-14, 2004
  • Hynek Hermansky and Hervé Bourlard, "Some Emerging Concepts in Speech Recognition.", IDIAP-RR 03-82, 2003
  • Hynek Hermansky and Nelson Morgan, "Show What You Know: Musings on the Reporting of Negative Results in Speech Recognition Research", IDIAP-RR 03-81, 2003
  • Katrin Weber, "HMM Mixtures (HMM2) for Robust Speech Recognition", IDIAP-RR 03-34, 2003
  • Vivek Tyagi and Herve Bourlard, "On Multi-scale Fourier Transform Analysis of Speech Signals", IDIAP-RR 03-33, 2003
  • Todd Andrew Stephenson, "Speech Recognition with Auxiliary Information", IDIAP-RR 03-28, 2003
  • F. de Wet, K. Weber, L. Boves, B. Cranen, S. Bengio, and H. Bourlard, "Evaluation of formant-like features for automatic speech recognition", IDIAP-RR 03-08, 2003
  • Ait-Hassou Aissa, "HMM inference towards flexible speech recognition", IDIAP-Com 03-03, 2003
  • R. Dhillon, S. Bhagat, H. Carvey, and E. Shriberg, "Meeting Recorder Project: Dialog Act Labeling Guide", ICSI Technical Report TR-04-002
  • H. Hermansky and N. Morgan, "Show what you know: musings on the reporting of negative results in speech recognition research ", Journal of Negative Results in Speech and Audio Sciences 2004 Issue
  • U. Glavitsch, "Speaker Normalization With Respect to Fo: a Perceptual Approac", IM2.SP Project Report. TIK/ETH Zurich, December 2003
  • B. Pfister und R. Beutler, "Improving Speech Recognition thru Linguistics", Jahresbericht 2003 für das Projekt COST 278, Institut TIK, ETH Zürich, Januar 2004
  • P. Frei, "Untersuchung der Aussprache englischer Einschlüsse in spanischen Texten", Technischer Bericht Nr. 1 zum KTI-Projekt Nr. 6233.1 SUS-ET. Institut TIK, ETH Zürich, April 2004
  • H. Romsdorfer, "An Approach to an Improved Segmentation of Speech Signals for the Training of Statistical Prosody Models", Technischer Bericht Nr. 2 zum KTI-Projekt Nr. 6233.1 SUS-ET, Institut TIK, ETH Zürich, May 2004
  • René Beutler, Tobias Kaufmann, Beat Pfister, "Can grammars improve speech recognition accuracy?", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Viktoria Maier, Hynek Hermansky, "Perception of Synthetic Consonant-Vowel Stimuli", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Guillermo Aradilla, John Dines, Sunil Sivadas, "Using RASTA in task independent TANDEM feature extraction", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Petr Fousek, Petr Svojanovsky, Frantisek Grezl, Hynek Hermansky, "LDC Nonsense Syllables Corpus - Analyses and First Recognition Experiments", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Michael McGreevy, "Pseudo-syntactic Language Modeling for Disfluent Speech", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Mathew Magimai Doss, Auxiliary Sources of Knowledge for Automatic Speech Recognition", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Shajith Ikbal, Hemant Misra, Mathew Magimai.-Doss, Hervé Bourlard, Hynek Hermansky, "Noise Robust Speech Recognition: PAC and STAP features", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Marios Athineos, Hynek Hermansky, Dan Ellis, "Autoregressive modeling of spectro-temporal patterns", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • H. Misra, S. Sivadas, S. Ikbal, H. Bourlard, H. Hermansky, "Multi-Resolution Spectral Entropy Feature for Robust ASR", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Jaume Escofet Carmona and Todd A. Stephenson, "Automatic Speech Recognition using Dynamic Bayesian Networks with the Energy as an Auxiliary Variable", IDIAP-RR 03-18, 2003
  • Mathew Magimai.-Doss, Todd A. Stephenson, Hervé Bourlard, and Samy Bengio, "Phoneme-Grapheme Based Speech Recognition System", IDIAP-RR 03-37, 2003.
  • Todd A. Stephenson,"Conditional Gaussian Mixtures",  IDIAP-RR 03-11, 2003.
  • B. Pfister, H. Romsdorfer und K. Boesefeldt Hess, "Analyse englischer Einschlüsse in deutschem Text", Oktober 2002
  • B. Pfister, E. Wehrli et al., "Lexical and Syntactic Analysis of Mixed-Lingual Sentences for Text-to-Speech", Final Report of SNSF Project No 21-59396.99. TIK/ETHZ, November 2002
  • B. Pfister und R. Beutler, "Improving Speech Recognition thru Linguistics", Jahresbericht 2002 für das Projekt COST 278, TIK/ETHZ, Januar 2003
  • H. Romsdorfer, "Mixed-lingual Morpho-Syntactic Analysis of the SVOX Text-to-Speech System", March 2003
  • Christos Dimitrakakis and Samy Bengio, "Boosting HMMs with an application to speech recognition", 2003.
  • D. Moore, "TODE: A Decoder for Continuous Speech Recognition", IDIAP-Com 02-09, 2002.
  • Mathew Magimai.-Doss, Todd A. Stephenson, and Hervé Bourlard, "Modelling auxiliary information (pitch frequency) in hybrid HMM/ANN based ASR systems", IDIAP-RR 02-62, 2002
  • Astrid Hagen and Andrew C. Morris, "Recent advances in the multi-stream HMM/ANN hybrid approach to noise robust ASR", IDIAP-RR 02-57, 2002.
  • Lapidot, I., "What is Better: GMM of Two Gaussians or Two Clusters With One Gaussian?", IDIAP-RR 02-56, 2002
  • Lapidot, I. and Morris, A., "Extended BIC Criterion for Model Selection",  IDIAP-RR 02-42, 2002.
  • Hemant Misra, Hervé Bourlard, and Vivek Tyagi, "Entropy-Based Multi-Stream Combination", IDIAP-RR 02-31, 2002
  • Andrew C. Morris, "Noise PDF transformation in secondary feature processing", IDIAP-RR 02-29, 2002
  • McCowan and H. Bourlard, "Generalised Microphone Array Post-filter based on Noise Field Coherence'',  IDIAP-RR 01-40, submitted to IEEE Trans. on Signal Processing
  • J. Ajmera, H. Bourlard, and I. Lapidot, Improved Unknown-Multiple Speaker clustering using HMM'', IDIAP-RR 02-23, 2002,
  • F. de Wet, K. Weber, L. Boves, B. Cranen, S. Bengio, and H. Bourlard, "Evaluation of formant-like features for automatic speech recognition,'' submitted to Journal of the Acoustical Society of America (JASA)}, 2002

 

IM2.SP, IM2.SA

 

  • Guillaume Lathoud, Jean-Marc Odobez, Daniel Gatica-Perez, "AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking", poster presentation at MLMI'04 in Martigny Switzerland, June 2004
  • Dong Zhang, Daniel Gatica-Perez, Samy Bengio, Iain McCowan, Guillaume Lathoud, "Modeling Multimodal Group Actions with Layered Approaches", poster presentation at MLMI'04 in Martigny Switzerland, June 2004

Last modified 2008-07-14 16:26
 

Powered by Plone