Alex Waibel

Affiliations:
  • Karlsruhe Institute of Technology, Department of Informatics, Germany
  • Carnegie Mellon University, Computer Science Department, USA


According to our database, Alex Waibel authored at least 586 papers between 1982 and 2024.

Awards

IEEE Fellow

IEEE Fellow 2015, "For contributions to neural network based speech recognition and translation and multimodal interfaces".

Bibliography

2024
Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS.
CoRR, 2024

Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck.
CoRR, 2024

Accent conversion using discrete units with parallel data synthesized from controllable accented TTS.
CoRR, 2024

Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems.
CoRR, 2024

Handling Numeric Expressions in Automatic Speech Recognition.
CoRR, 2024

Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024.
CoRR, 2024

SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading.
CoRR, 2024

Continuously Learning New Words in Automatic Speech Recognition.
CoRR, 2024

ConVoiFilter: A Case Study of Doing Cocktail Party Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Synthetic Conversations Improve Multi-Talker ASR.
Proceedings of the IEEE International Conference on Acoustics, 2024

SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Unconstrained face mask and face-hand interaction datasets: building a computer vision system to help prevent the transmission of COVID-19.
Signal Image Video Process., June, 2023

A survey on computer vision based human analysis in the COVID-19 era.
Image Vis. Comput., February, 2023

Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models.
CoRR, 2023

Convoifilter: A case study of doing cocktail party speech recognition.
CoRR, 2023

Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow.
CoRR, 2023

Train Global, Tailor Local: Minimalist Multilingual Translation into Endangered Languages.
CoRR, 2023

Towards Efficient Simultaneous Speech Translation: CUNI-KIT System for Simultaneous Track at IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

KIT's Multilingual Speech Translation System for IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Towards continually learning new languages.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multimodal Error Correction with Natural Language and Pointing Gestures.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Face-Dubbing++: LIP-Synchronous, Voice Preserving Translation Of Videos.
Proceedings of the IEEE International Conference on Acoustics, 2023

SYNTACC : Synthesizing Multi-Accent Speech By Weight Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2023

AdapITN: A Fast, Reliable, and Dynamic Adaptive Inverse Text Normalization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Modular Design of a Front-End and Back-End Speech-to-Speech Translation Application for Psychiatric Treatment of Refugees.
Proceedings of the IEEE Global Humanitarian Technology Conference, 2023

End-to-End Evaluation for Low-Latency Simultaneous Speech Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Language-agnostic Code-Switching in End-To-End Speech Recognition.
CoRR, 2022

Code-Switching without Switching: Language Agnostic End-to-End Speech Translation.
CoRR, 2022

CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022.
CoRR, 2022

Short-Term Word-Learning in a Dynamically Changing Environment.
CoRR, 2022

CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Effective combination of pretrained models - KIT@IWSLT2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Adaptive multilingual speech recognition with pretrained models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Accent Conversion using Pre-trained Model and Synthesized Data from Voice Conversion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Error-correction and extraction in request dialogs.
Proceedings of the 5th International Conference on Natural Language and Speech Processing, 2022

Interactive Multimodal Robot Dialog Using Pointing Gesture Recognition.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Alpha Matte Generation from Single Input for Portrait Matting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Exposure Correction Model to Enhance Image Quality.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Where did I leave my keys? - Episodic-Memory-Based Question Answering on Egocentric Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Deep Episodic Memory for Verbalization of Robot Experience.
IEEE Robotics Autom. Lett., 2021

Active Learning for Massively Parallel Translation of Constrained Text into Low Resource Languages.
CoRR, 2021

CAGAN: Text-To-Image Generation with Combined Attention GANs.
CoRR, 2021

Family of Origin and Family of Choice: Massively Parallel Lexiconized Iterative Pretraining for Severely Low Resource Machine Translation.
CoRR, 2021

A Computer Vision System to Help Prevent the Transmission of COVID-19.
CoRR, 2021

Unsupervised Transfer Learning in Multilingual Neural Machine Translation with Cross-Lingual Word Embeddings.
CoRR, 2021

Text and Synthetic Data for Domain Adaptation in End-to-End Speech Recognition.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Multilingual Speech Translation KIT @ IWSLT2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Efficient Weight Factorization for Multilingual Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Super-Human Performance in Online Low-Latency Recognition of Conversational Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

ELITR Multilingual Live Subtitling: Demo and Strategy.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

CAGAN: Text-To-Image Generation with Combined Attention Generative Adversarial Networks.
Proceedings of the Pattern Recognition - 43rd DAGM German Conference, DAGM GCPR 2021, Bonn, Germany, September 28, 2021

Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Low Latency ASR for Simultaneous Speech Translation.
CoRR, 2020

Toward Cross-Domain Speech Recognition with End-to-End Models.
CoRR, 2020

German-Arabic Speech-to-Speech Translation for Psychiatric Diagnosis.
Proceedings of the Fifth Arabic Natural Language Processing Workshop, 2020

DaCToR: A Data Collection Tool for the RELATER Project.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Removing European Language Barriers with Innovative Machine Translation Technology.
Proceedings of the 1st International Workshop on Language Technology Platforms, 2020

Towards Stream Translation: Adaptive Computation Time for Simultaneous Machine Translation.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

KIT's IWSLT 2020 SLT Translation System.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

Relative Positional Encoding for Speech Recognition and Direct Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

High Performance Sequence-to-Sequence Model for Streaming Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving Sequence-To-Sequence Speech Recognition Training with On-The-Fly Data Augmentation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Gun Source and Muzzle Head Detection.
Proceedings of the Imaging and Multimedia Analytics in a Web and Mobile World 2020, 2020

Incorporating External Annotation to improve Named Entity Translation in NMT.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

ELITR: European Live Translator.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

Detailed Analysis of Different Strategies for Phrase Table Adaptation in SMT.
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020

2019
Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation.
Trans. Assoc. Comput. Linguistics, 2019

Bimodal Speech Emotion Recognition Using Pre-Trained Language Models.
CoRR, 2019

Low-Resource Machine Translation using Interlinear Glosses.
CoRR, 2019

Learning Shared Encoding Representation for End-to-End Speech Recognition Models.
CoRR, 2019

Improving Zero-shot Translation with Language-Independent Constraints.
Proceedings of the Fourth Conference on Machine Translation, 2019

Fluent Translations from Disfluent Speech in End-to-End Speech Translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

KIT's Submission to the IWSLT 2019 Shared Task on Text Translation.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

The IWSLT 2019 KIT Speech Translation System.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

An Interactive Indoor Drone Assistant.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Very Deep Self-Attention Networks for End-to-End Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Connecting Humans with Humans: Multimodal, Multilingual, Multiparty Mediation.
Proceedings of the International Conference on Multimodal Interaction, 2019

Neural Codes to Factor Language in Multilingual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Incremental processing of noisy user utterances in the spoken language understanding task.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Paraphrases as Foreign Languages in Multilingual Neural Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Self-Attentional Models for Lattice Inputs.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Multimodal dialogue processing for machine translation.
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Language Processing, Software, Commercialization, and Emerging Directions, 2019

2018
Open Source Toolkit for Speech to Text Translation.
Prague Bull. Math. Linguistics, 2018

Multi-task learning to improve natural language understanding.
CoRR, 2018

A Hierarchical Approach to Neural Context-Aware Modeling.
CoRR, 2018

Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2018.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Building Real-Time Speech Recognition Without CMVN.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Towards Fluent Translations From Disfluent Speech.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Automated Evaluation of Out-of-Context Errors.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

BULBasaa: A Bilingual Basaa-French Speech Corpus for the Evaluation of Language Documentation Tools.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

KIT's IWSLT 2018 SLT Translation System.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

An End-to-End Goal-Oriented Dialog System with a Generative Natural Language Response Generation.
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018

Subword and Crossword Units for CTC Acoustic Models.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Self-Attentional Acoustic Models.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Low-Latency Neural Speech Translation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Term Extraction via Neural Sequence Labeling a Comparative Evaluation of Strategies Using Recurrent Neural Networks.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Neural Language Codes for Multilingual Acoustic Models.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Exploring Ctc-Network Derived Features with Conventional Hybrid System.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multilingual Adaptation of RNN Based ASR Systems.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

KIT Lecture Translator: Multilingual Speech Translation with One-Shot Learning.
Proceedings of the COLING 2018, 2018

Towards one-shot learning for rare-word translation with external experts.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018

Robust and Scalable Differentiable Neural Computer for Question Answering.
Proceedings of the Workshop on Machine Reading for Question Answering@ACL 2018, 2018

Parameter Optimization for CTC Acoustic Models in a Less-resourced Scenario: An Empirical Study.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
Transcribing against time.
Speech Commun., 2017

Phonemic and Graphemic Multilingual CTC Based Speech Recognition.
CoRR, 2017

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2017.
Proceedings of the Second Conference on Machine Translation, 2017

The QT21 Combined Machine Translation System for English to Latvian.
Proceedings of the Second Conference on Machine Translation, 2017

Improved Speaker Adaptation by Combining I-vector and fMLLR with Deep Bottleneck Networks.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Language Adaptive Multilingual CTC Speech Recognition.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Toward Robust Neural Machine Translation for Noisy Input Sequences.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

KIT's Multilingual Neural Machine Translation systems for IWSLT 2017.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

The 2017 KIT IWSLT Speech-to-Text Systems for English and German.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Effective Strategies in Zero-Shot Neural Machine Translation.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Domain-independent Punctuation and Segmentation Insertion.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor.
Proceedings of the Advanced Social Interaction with Agents, 2017

Comparison of Decoding Strategies for CTC Acoustic Models.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Enhancing Backchannel Prediction Using Word Embeddings.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

NMT-Based Segmentation and Punctuation Insertion for Real-Time Spoken Language Translation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Towards phoneme inventory discovery for documentation of unwritten languages.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Keynote Talk.
Proceedings of the 5th International Conference on Human Agent Interaction, 2017

Neural Lattice-to-Sequence Models for Uncertain Inputs.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

DBLSTM based multilingual articulatory feature extraction for language documentation.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Analyzing Neural MT Search and Model Performance.
Proceedings of the First Workshop on Neural Machine Translation, 2017

2016
Using Factored Word Representation in Neural Network Language Models.
Proceedings of the First Conference on Machine Translation, 2016

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2016.
Proceedings of the First Conference on Machine Translation, 2016

Lecture Translator - Speech translation framework for simultaneous lecture translation.
Proceedings of the Demonstrations Session, 2016

Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Evaluation of the KIT Lecture Translation System.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Integrating Encyclopedic Knowledge into Neural Language Models.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Audio Segmentation for Robust Real-Time Speech Recognition Based on Neural Networks.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

The 2016 KIT IWSLT Speech-to-Text Systems for English and German.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Multilingual Disfluency Removal using NMT.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Adaptation and Combination of NMT Systems: The KIT Translation Systems for IWSLT 2016.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Towards Improving Low-Resource Speech Recognition Using Articulatory and Language Features.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Towards an Open-Domain Social Dialog System.
Proceedings of the Dialogues with Social Robots, 2016

Unsupervised Phoneme Segmentation of Previously Unseen Languages.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Dynamic Transcription for Low-Latency Speech Translation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Language Adaptive DNNs for Improved Low Resource Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

An empirical exploration of CTC acoustic models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Lightly Supervised Quality Estimation.
Proceedings of the COLING 2016, 2016

Pre-Translation for Neural Machine Translation.
Proceedings of the COLING 2016, 2016

Training Deep Neural Networks for Reverberation Robust Speech Recognition.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Language Feature Vectors for Resource Constraint Speech Recognition.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Growing a Deep Neural Network Acoustic Model with Singular Value Decomposition.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Phoneme Boundary Detection using Deep Bidirectional LSTMs.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Personalized News Event Retrieval for Small Talk in Social Dialog Systems.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Using Tweets as "Ice-Breaking" Sentences in a Social Dialog System.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
ListNet-based MT Rescoring.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

The KIT-LIMSI Translation System for WMT 2015.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

The Karlsruhe Institute of Technology Translation Systems for the WMT 2015.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Evaluation of Crowdsourced User Input Data for Spoken Dialog Systems.
Proceedings of the SIGDIAL 2015 Conference, 2015

Using language adaptive deep neural networks for improved multilingual speech recognition.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

The 2015 KIT IWSLT speech-to-text systems for English and German.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Multifeature modular deep neural network acoustic models.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Source discriminative word lexicon for translation disambiguation.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

The KIT translation systems for IWSLT 2015.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Punctuation insertion for real-time spoken language translation.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Gaussian free cluster tree construction using deep neural network.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Combination of NN and CRF models for joint detection of punctuation and disfluencies.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Using Neural Networks for Data-Driven Backchannel Prediction: A Survey on Input Features and Training Techniques.
Proceedings of the Human-Computer Interaction: Interaction Technologies, 2015

Stripping Adjectives: Integration Techniques for Selective Stemming in SMT Systems.
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015

2014
Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff.
Trans. Assoc. Comput. Linguistics, 2014

The Karlsruhe Institute of Technology Translation Systems for the WMT 2014.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

EU-BRIDGE MT: Combined Machine Translation.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

The KIT-LIMSI Translation System for WMT 2014.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

A Neural Network Keyword Search System for Telephone Speech.
Proceedings of the Speech and Computer - 16th International Conference, 2014

On-the-fly user modeling for cost-sensitive correction of speech transcripts.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Manual Analysis of Structurally Informed Reordering in German-English Machine Translation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Rule-based preordering on multiple syntactic levels in statistical machine translation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

The KIT translation systems for IWSLT 2014.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Improving in-domain data selection for small in-domain sets.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

The 2014 KIT IWSLT speech-to-text systems for English, German and Italian.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Lexical translation model using a deep neural network architecture.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Combined spoken language translation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Machine translation of multi-party meetings: segmentation and disfluency removal strategies.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Extracting translation pairs from social network content.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Multilingual deep bottle neck features: a study on language selection and training techniques.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

A World without Barriers: Connecting the World across Languages, Distances and Media.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Training time reduction and performance improvements from multilingual techniques on the BABEL ASR task.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multilingual shifting deep bottleneck features for low-resource ASR.
Proceedings of the IEEE International Conference on Acoustics, 2014

Optimization of Neural Network Language Models for keyword search.
Proceedings of the IEEE International Conference on Acoustics, 2014

Tight Integration of Speech Disfluency Removal into SMT.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Combining techniques from different NN-based language models for machine translation.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

2013
Training speech translation from audio recordings of interpreter-mediated communication.
Comput. Speech Lang., 2013

Joint WMT 2013 Submission of the QUAERO Project.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

An MT Error-Driven Discriminative Word Lexicon using Sentence Structure Features.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

The Karlsruhe Institute of Technology Translation Systems for the WMT 2013.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Combining Word Reordering Methods on different Linguistic Abstraction Levels for Statistical Machine Translation.
Proceedings of the Seventh Workshop on Syntax, 2013

Segmentation of Telephone Speech Based on Speech and Non-speech Models.
Proceedings of the Speech and Computer - 15th International Conference, 2013

Optimizing deep bottleneck feature extraction.
Proceedings of the 2013 IEEE RIVF International Conference on Computing and Communication Technologies, 2013

Measuring the Structural Importance through Rhetorical Structure Index.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

The 2013 KIT Quaero speech-to-text system for French.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

Maximum entropy language modeling for Russian ASR.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

The 2013 KIT IWSLT speech-to-text systems for German and English.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Analyzing the potential of source sentence reordering in statistical machine translation.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

Incremental unsupervised training for university lecture recognition.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

The KIT translation systems for IWSLT 2013.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

EU-BRIDGE MT: text translation of talks in the EU-BRIDGE project.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

CRF-based disfluency detection using semantic features for German to English spoken language translation.
Proceedings of the 10th International Workshop on Spoken Language Translation: Papers, 2013

Efficient speech transcription through respeaking.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Slightly Supervised Adaptation of Acoustic Models on Captioned BBC Weather Forecasts.
Proceedings of the First Workshop on Speech, 2013

Modular combination of deep neural networks for acoustic modeling.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A real-world system for simultaneous translation of German lectures.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Learning discriminative basis coefficients for eigenspace MLLR unsupervised adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Subspace mixture model for low-resource speech recognition in cross-lingual settings.
Proceedings of the IEEE International Conference on Acoustics, 2013

Warped Minimum Variance Distortionless Response based bottle neck features for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2013

Extracting deep bottleneck features using stacked auto-encoders.
Proceedings of the IEEE International Conference on Acoustics, 2013

Models of tone for tonal and non-tonal languages.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

DNN acoustic modeling with modular multi-lingual feature extraction networks.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Letter N-Gram-based Input Encoding for Continuous Space Language Models.
Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality, 2013

2012
Parallel Phrase Scoring for Extra-large Corpora.
Prague Bull. Math. Linguistics, 2012

The Karlsruhe Institute of Technology Translation Systems for the WMT 2012.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Joint WMT 2012 Submission of the QUAERO Project.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

The KIT Lecture Corpus for Speech Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Continuous space language models using restricted Boltzmann machines.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

The KIT translation systems for IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Evaluation of interactive user corrections for lecture transcription.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

The KIT-NAIST (contrastive) English ASR system for IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Segmentation and punctuation prediction in speech language translation using a monolingual translation system.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Unsupervised vocabulary selection for real-time speech recognition of lectures.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Blind dereverberation of sinusoid signals using PLL-based combined phase and amplitude analysis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A hybrid phonotactic language identification system with an SVM back-end for simultaneous lecture translation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Wider Context by Using Bilingual Language Models in Machine Translation.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

The Karlsruhe Institute of Technology Translation Systems for the WMT 2011.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Joint WMT Submission of the QUAERO Project.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Unsupervised Vocabulary Selection for Domain-Independent Simultaneous Lecture Translation.
Proceedings of Machine Translation Summit XIII: Papers, 2011

The 2011 KIT English ASR system for the IWSLT evaluation.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Using Wikipedia to translate domain-specific terms in SMT.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

The KIT English-French translation systems for IWSLT 2011.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Unsupervised vocabulary selection for simultaneous lecture translation.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

The 2011 KIT QUAERO speech-to-text system for Spanish.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Advances on spoken language translation in the Quaero program.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

TriS: A Statistical Sentence Simplifier with Log-linear Models and Margin-based Discriminative Training.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

2010
The Karlsruhe Institute for Technology Translation System for the ACL-WMT 2010.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Speech translators for humanitarian projects.
Proceedings of the 2nd Workshop on Spoken Language Technologies for Under-Resourced Languages, 2010

Jibbigo: Speech-to-speech translation on mobile devices.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Tools for Collecting Speech Corpora via Mechanical-Turk.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

Spoken news queries over the world wide web.
Proceedings of the 2010 International Workshop on Searching Spontaneous Conversational Speech, 2010

The KIT translation system for IWSLT 2010.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Real-time spoken language identification and recognition for speech-to-speech translation.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Rapid development of speech translation using consecutive interpretation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Named-entity projection and data-driven morphological decomposition for field maintainable speech-to-speech translation systems.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Spoken language translation from parallel speech audio: Simultaneous interpretation as SLT training data.
Proceedings of the IEEE International Conference on Acoustics, 2010

Towards social integration of humanoid robots by conversational concept learning.
Proceedings of the 10th IEEE-RAS International Conference on Humanoid Robots, 2010

Domain Adaptation in Statistical Machine Translation using Factored Translation Models.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010

2009
Computers in the Human Interaction Loop.
Proceedings of the Computers in the Human Interaction Loop, 2009

Beyond CHIL.
Proceedings of the Computers in the Human Interaction Loop, 2009

Consolidation-Based Speech Translation and Evaluation Approach.
IEICE Trans. Inf. Syst., 2009

The Universität Karlsruhe Translation System for the EACL-WMT 2009.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Incremental Adaptation of Speech-to-Speech Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Human translations guided language discovery for ASR systems.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Multimodal Interfaces in Support of Human-Human Interaction.
Proceedings of the Gesture in Embodied Communication and Human-Computer Interaction, 2009

End-to-End Evaluation in Simultaneous Translation.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

Automatic translation from parallel speech: Simultaneous interpretation as MT training data.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Pronunciation modeling for dialectal Arabic speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Spoken language translation.
IEEE Signal Process. Mag., 2008

A dialogue approach to learning object descriptions and semantic categories.
Robotics Auton. Syst., 2008

Towards human translations guided language discovery for ASR systems.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Simultaneous machine translation of German lectures into English: Investigating research challenges for the future.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Modelling multimodal user ID in dialogue.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Confidence based multimodal fusion for person identification.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Probabilistic integration of sparse audio-visual cues for identity tracking.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Communicating Unknown Words in Machine Translation.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Simultaneous German-English lecture translation.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

Speech Processing in Support of Human-Human Communication (Invited Paper).
Proceedings of the ISUC 2008, 2008

Lightly supervised acoustic model training on EPPS recordings.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Class-based statistical machine translation for field maintainable speech-to-speech translation.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Stream decoding for simultaneous spoken language translation.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Extracting clues from human interpreter speech for spoken language translation.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Enabling Multimodal Human-Robot Interaction for the Karlsruhe Humanoid Robot.
IEEE Trans. Robotics, 2007

Far-Field Speaker Recognition.
IEEE Trans. Speech Audio Process., 2007

Simultaneous translation of lectures and speeches.
Mach. Transl., 2007

Translation Model Pruning via Usage Statistics for Statistical Machine Translation.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Estimating phrase pair relevance for translation model pruning.
Proceedings of Machine Translation Summit XI: Papers, 2007

The CMU-UKA statistical machine translation systems for IWSLT 2007.
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007

Computer-supported human-human multilingual communication.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Behavior models for learning and receptionist dialogs.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Speech Translation Enhanced ASR for European Parliament Speeches - On the Influence of ASR Performance on Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Continuous Electromyographic Speech Recognition with a Multi-Stream Decoding Architecture.
Proceedings of the IEEE International Conference on Acoustics, 2007

Consolidation based speech translation.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
A Pattern Learning Approach to Question Answering Within the Ephyra Framework.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Speech-to-Speech Translation Services for the Olympic Games 2008.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

A Robot Learns to Know People - First Contacts of a Robot.
Proceedings of the KI 2006: Advances in Artificial Intelligence, 2006

The CMU-UKA syntax augmented machine translation system for IWSLT-06.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

The UKA/CMU statistical machine translation system for IWSLT 2006.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

Sub-word unit based non-audible speech recognition using surface electromyography.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Towards continuous speech recognition using surface electromyography.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Optimizing components for handheld two-way speech translation for an English-Iraqi Arabic system.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

A multilingual expectations model for contextual utterances in mixed-initiative spoken dialogue.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Dynamic extension of a grammar-based dialogue system: constructing an all-recipes knowing robot.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Multimodal estimation of user interruptibility for smart mobile telephones.
Proceedings of the 8th International Conference on Multimodal Interfaces, 2006

Directing Attention in Online Aggregate Sensor Streams via Auditory Blind Value Assignment.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Articulatory Feature Classification using Surface Electromyography.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Open Domain Speech Recognition & Translation: Lectures and Speeches.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A Flexible Online Server for Machine Translation Evaluation.
Proceedings of the 11th Annual conference of the European Association for Machine Translation, 2006

2005
CHIL - Computers in the Human Interaction Loop.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2005), 2005

Low Cost Portability for Statistical Machine Translation based on N-gram Coverage.
Proceedings of Machine Translation Summit X: Papers, 2005

The CMU statistical machine translation system for IWSLT 2005.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Low cost Portability for statistical machine translation based on n-gram frequency and TF-IDF.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Document driven machine translation enhanced ASR.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Clarification questions to improve dialogue flow and speech recognition in spoken dialogue systems.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Temporal ICA for classification of acoustic events in a kitchen environment.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Rapid porting of ASR-systems to mobile devices.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Spontaneous speech consolidation for spoken language applications.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

The connector: facilitating context-aware communication.
Proceedings of the 7th International Conference on Multimodal Interfaces, 2005

Automatically Transcribing Meetings using Distant Microphones.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Classifying user environment for mobile applications using linear autoencoding of ambient audio.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Whispery Speech Recognition using Adapted Articulatory Features.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

CHIL computing to overcome techno-clutter.
Proceedings of the 2005 joint conference on Smart objects and ambient intelligence, 2005

Adaptation of the translation model for statistical machine translation based on information retrieval.
Proceedings of the 10th EAMT Conference: Practical applications of machine translation, 2005

Augmenting a statistical translation system with a translation memory.
Proceedings of the 10th EAMT Conference: Practical applications of machine translation, 2005

Bilingual Word Spectral Clustering for Statistical Machine Translation.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005

Training and Evaluating Error Minimization Decision Rules for Statistical Machine Translation.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005

Learning a Log-Linear Model with Bilingual Phrase-Pair Features for Statistical Machine Translation.
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, 2005

Clustering and Classifying Person Names by Origin.
Proceedings of the Proceedings, 2005

2004
Automatic detection and recognition of signs from natural scenes.
IEEE Trans. Image Process., 2004

Speaker adaptation with all-pass transforms.
Speech Commun., 2004

A Thai Speech Translation System for Medical Dialogs.
Proceedings of the Demonstration Papers at HLT-NAACL 2004, 2004

Improving Named Entity Translation Combining Phonetic and Semantic Similarities.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Interpreting BLEU/NIST Scores: How Much Improvement do We Need to Have a Better System?
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Language Model Adaptation for Statistical Machine Translation Based on Information Retrieval.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

The ISL statistical translation system for spoken language translation.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

The ISL EDTRL system.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

Towards named entity extraction and translation in spoken language translation.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

Natural human-robot interaction using speech, head pose and gestures.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Speech translation: past, present and future.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Worldwide ongoing activities on multilingual speech to speech translation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Adaptation for soft whisper recognition using a throat microphone.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Tight coupling of speech recognition and dialog management - dialog-context dependent grammar weighting for speech recognition.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Integrating thumbnail features for speech recognition using conditional exponential models.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Towards language portability in statistical speech translation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Minimum Kullback-Leibler distance based multivariate Gaussian feature adaptation for distant-talking speech recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Performance comparisons of all-pass transform adaptation with maximum likelihood linear regression.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Phrase Pair Rescoring with Term Weighting for Statistical Machine Translation.
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 2004

Large Vocabulary Audio-Visual Speech Recognition Using the Janus Speech Recognition Toolkit.
Proceedings of the Pattern Recognition, 26th DAGM Symposium, August 30, 2004

Improving Statistical Machine Translation in the Medical Domain using the Unified Medical Language system.
Proceedings of the COLING 2004, 2004

2003
Extracting named entity translingual equivalence with limited resources.
ACM Trans. Asian Lang. Inf. Process., 2003

A Statistical Approach to Automatic Speech Summarization.
EURASIP J. Adv. Signal Process., 2003

Efficient Optimization for Bilingual Sentence Alignment Based on Linear Regression.
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond, 2003

Speechalator: Two-Way Speech-to-Speech Translation in Your Hand.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

The CMU statistical machine translation system.
Proceedings of Machine Translation Summit IX: Papers, 2003

Minimum variance distortionless response on a warped frequency scale.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speechalator: two-way speech-to-speech translation on a consumer PDA.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Integrating multilingual articulatory features into speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Calibration of a Hybrid Camera Network.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Comparison of acoustic model adaptation techniques on non-native speech.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

SMaRT: the Smart Meeting Room Task at ISL.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Multilingual articulatory features.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Maximum mutual information speaker adapted training with semi-tied covariance matrices.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Effective Phrase Translation Extraction from Alignment Models.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

Automatic Extraction of Named Entity Translingual Equivalence Based on Multi-Feature Cost Minimization.
Proceedings of the Workshop on Multilingual and Mixed-language Named Entity Recognition, 2003

2002
Modeling focus of attention for meeting indexing based on multiple cues.
IEEE Trans. Neural Networks, 2002

Automatic Detection of Signs with Affine Transformation.
Proceedings of the 6th IEEE Workshop on Applications of Computer Vision (WACV 2002), 2002

Automatic sign translation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Compensating for hyperarticulation by modeling articulatory properties.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A flexible stream architecture for ASR using articulatory features.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Interlingua based statistical machine translation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Phonetic speaker identification.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A Robust Approach for Recognition of Text Embedded in Natural Scenes.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

A PDA-Based Sign Translator.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Towards Universal Speech Recognition.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Flexi-Modal and Multi-Machine User Interfaces.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Integrating Emotional Cues into a Framework for Dialogue Management.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Automatic detection and translation of text from natural scenes.
Proceedings of the IEEE International Conference on Acoustics, 2002

Efficient language model lookahead through polymorphic linguistic context assignment.
Proceedings of the IEEE International Conference on Acoustics, 2002

Experiments on distant-talking speech recognition in meeting room using extended MAM.
Proceedings of the IEEE International Conference on Acoustics, 2002

On maximum mutual information speaker-adapted training.
Proceedings of the IEEE International Conference on Acoustics, 2002

Speaker identification using multilingual phone strings.
Proceedings of the IEEE International Conference on Acoustics, 2002

Automatic speech summarization applied to English broadcast news speech.
Proceedings of the IEEE International Conference on Acoustics, 2002

Improvements in Non-Verbal Cue Identification Using Multilingual Phone Strings.
Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems@ACL 2002, 2002

2001
Multimodal error correction for speech user interfaces.
ACM Trans. Comput. Hum. Interact., 2001

Language-independent and language-adaptive acoustic modeling for speech recognition.
Speech Commun., 2001

The ISL View4You Broadcast News Transcription System.
Int. J. Speech Technol., 2001

Online handwriting recognition: the NPen++ recognizer.
Int. J. Document Anal. Recognit., 2001

An automatic sign recognition and translation system.
Proceedings of the 2001 workshop on Perceptive user interfaces, 2001

Towards Automatic Sign Translation.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Advances in meeting recognition.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Activity detection for information access to oral communication.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Architecture and Design Considerations in NESPOLE!: a Speech Translation System for E-commerce Applications.
Proceedings of the First International Conference on Human Language Technology Research, 2001

LingWear: A Mobile Tourist Information System.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Experiments on cross-language acoustic modeling.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Model-combination-based acoustic mapping.
Proceedings of the IEEE International Conference on Acoustics, 2001

Advances in automatic meeting record creation and access.
Proceedings of the IEEE International Conference on Acoustics, 2001

The ISL evaluation system for Verbmobil-II.
Proceedings of the IEEE International Conference on Acoustics, 2001

Speaker compensation with sine-log all-pass transforms.
Proceedings of the IEEE International Conference on Acoustics, 2001

Estimating focus of attention based on gaze and sound.
Proceedings of the Auditory-Visual Speech Processing, 2001

2000
Multilinguality in speech and spoken language systems.
Proc. IEEE, 2000

The Janus-III Translation System: Speech-to-Speech Translation in Multiple Domains.
Mach. Transl., 2000

Towards Unrestricted Lip Reading.
Int. J. Pattern Recognit. Artif. Intell., 2000

End to end evaluation of the ISL View4You broadcast news transcription and retrieval system.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Multimodal Meeting Tracker.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Shallow Discourse Genre Annotation in CallHome Spanish.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Streamlining the front end of a speech recognizer.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

New developments in automatic meeting transcription.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Phone dependent modeling of hyperarticulated effects.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

The effects of room acoustics on MFCC speech parameter.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A naïve de-lambing method for speaker identification.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Application of LDA to speaker recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Dialogue management for multimodal user registration.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Simultaneous Tracking of Head Poses in a Panoramic View.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Growing Gaussian Mixture Models for Pose Invariant Face Recognition.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Towards a Multimodal Meeting Record.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Specialized acoustic models for hyperarticulated speech.
Proceedings of the IEEE International Conference on Acoustics, 2000

Polyphone decision tree specialization for language adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2000

Strategies for automatic segmentation of audio data.
Proceedings of the IEEE International Conference on Acoustics, 2000

Segmenting Hands of Arbitrary Color.
Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2000), 2000

Face Recognition in a Meeting Room.
Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2000), 2000

DIASUMM: Flexible Summarization of Spontaneous Dialogues in Unrestricted Domains.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

Minimizing Word Error Rate in Textual Summaries of Spoken Language.
Proceedings of the 6th Applied Natural Language Processing Conference, 2000

1999
Stochastically-based semantic analysis for machine translation.
Comput. Speech Lang., 1999

From Gaze to Focus of Attention.
Proceedings of the Visual Information and Information Systems, 1999

Translation systems under the C-STAR framework.
Proceedings of Machine Translation Summit VII, 1999

Multimodal people ID for a multimedia meeting browser.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Modeling focus of attention for meeting indexing.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Smart Sight: A Tourist Assistant System.
Proceedings of the Third International Symposium on Wearable Computers (ISWC 1999), 1999

Progress in automatic meeting transcription.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Towards spontaneous speech recognition for on-board car navigation and information systems.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Mandarin large vocabulary speech recognition using the globalphone database.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Unsupervised training of a speech recognizer: recent experiments.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Navigating German cities by spontaneous French queries.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Modeling and efficient decoding of large vocabulary conversational speech.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Selection criteria for hypothesis driven lexical adaptation.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Model-Based and Empirical Evaluation of Multimodal Interactive Error Correction.
Proceedings of the CHI '99 Conference on Human Factors in Computing Systems: The CHI is the Limit, 1999

Face translation: A multimodal translation agent.
Proceedings of the Auditory-Visual Speech Processing, 1999

1998
Linear discriminant - a new criterion for speaker normalization.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Fast decoding for statistical machine translation.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

On the influence of hyperarticulated speech on recognition performance.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Language independent and language adaptive large vocabulary speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

An interlingua based on domain actions for machine translation of task-oriented dialogues.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Unsupervised training of a speech recognizer using TV broadcasts.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Reducing the OOV rate in broadcast news speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

The interactive systems labs view4you video indexing system.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Phonetic-distance-based hypothesis driven lexical adaptation for transcribing multilingual broadcast news.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Conversational speech systems for on-board car navigation and assistance.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Probabilistic dialogue act extraction for concept based multilingual translation systems.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Effective structural adaptation of LVCSR systems to unseen domains using hierarchical connectionist acoustic models.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Experiments in automatic meeting transcription using JRTK.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Recognition of music types.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Serbo-Croatian LVCSR on the dictation and broadcast news domain.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Hierarchies of neural networks for connectionist speech recognition.
Proceedings of the 6th European Symposium on Artificial Neural Networks, 1998

Visual Tracking for Multimodal Human Computer Interaction.
Proceedings of the CHI '98 Conference on Human Factors in Computing Systems, 1998

Interactive error repair for an online handwriting interface.
Proceedings of the CHI 98 Conference Summary on Human Factors in Computing Systems, 1998

Real-Time Face and Facial Feature Tracking and Applications.
Proceedings of the Auditory-Visual Speech Processing, 1998

A Modular Approach to Spoken Language Translation for Large Domains.
Proceedings of the Machine Translation and the Information Soup, 1998

Using Chunk Based Partial Parsing of Spontaneous Speech in Unrestricted Domains for Reducing Word Error Rate in Speech Recognition.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Modeling with Structures in Statistical Machine Translation.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Growing Semantic Grammars.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Skin-Color Modeling and Adaptation.
Proceedings of the Computer Vision, 1998

1997
Janus: A System for Translation of Conversational Speech.
Künstliche Intell., 1997

A Model-Based Gaze Tracking System.
Int. J. Artif. Intell. Tools, 1997

Speaker normalization and speaker adaptation - a combination for conversational speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Statistical analysis of dialogue structure.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Exploiting repair context in interactive error recovery.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Fast bootstrapping of LVCSR systems with multilingual phoneme sets.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Japanese LVCSR on the spontaneous scheduling task with JANUS-3.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Dialogue strategies guiding users to their communicative goals.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Recognition of conversational telephone speech using the JANUS speech engine.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Multimodal interfaces for multimedia information agents.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Janus-III: speech-to-speech translation in multiple languages.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Context-dependent hybrid HME/HMM speech recognition using polyphone clustering decision trees.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Verbmobil: the combination of deep and shallow processing for spontaneous speech translation.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Decoding Algorithm in Statistical Machine Translation.
Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, 1997

1996
Interactive Translation of Conversational Speech.
Computer, 1996

Multimodal Interfaces.
Artif. Intell. Rev., 1996

A real-time face tracker.
Proceedings of Third IEEE Workshop on Applications of Computer Vision, 1996

Adaptively Growing Hierarchical Mixtures of Experts.
Proceedings of the Advances in Neural Information Processing Systems 9, 1996

JANUS-II: towards spontaneous Spanish speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Word clustering with parallel spoken language corpora.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Interactive recovery from speech recognition errors in speech user interfaces.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Dictionary learning for spontaneous speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Class phrase models for language modelling.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Translation of conversational speech with JANUS-II.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Dialogue processing in a conversational speech translation system.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Recognition of spelled names over the telephone.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Recognizing emotion in speech.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Learning to parse spontaneous speech.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Focus of attention: Towards low bitrate video tele-conferencing.
Proceedings of the 1996 International Conference on Image Processing, 1996

JANUS-II-translation of spontaneous conversational speech.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

LVCSR-based language identification.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

End-to-End Evaluation in JANUS: A Speech-to-speech Translation System.
Proceedings of the Dialogue Processing in Spoken Language Systems, 1996

Search in a Learnable Spoken Language Parser.
Proceedings of the 12th European Conference on Artificial Intelligence, 1996

Multi-lingual Translation of Spontaneously Spoken Language in a Limited Domain.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

FeasPar - A Feature Structure Parser Learning to Parse Spoken Language.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

JANUS: multi-lingual translation of spontaneous speech in limited domain.
Proceedings of the Conference of the Association for Machine Translation in the Americas, 1996

1995
The challenge of spoken language systems: research directions for the nineties.
IEEE Trans. Speech Audio Process., 1995

Translation and interpretation of spontaneous speech.
Proceedings of Machine Translation Summit V, 1995

Integrating spelling into spoken dialogue recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Speeding up the score computation of HMM speech recognizers with the bucket voronoi intersection algorithm.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Integrating different learning approaches into a multilingual spoken language translation system.
Proceedings of the Connectionist, 1995

NPen++: a writer independent, large vocabulary on-line cursive handwriting recognition system.
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995

Concept-based speech translation.
Proceedings of the 1995 International Conference on Acoustics, 1995

Toward movement-invariant automatic lip-reading and speech recognition.
Proceedings of the 1995 International Conference on Acoustics, 1995

Knowing who to listen to in speech recognition: visually guided beamforming.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Introduction Structured Connectionist Systems.
Mach. Learn., 1994

Recovering From Parser Failures: A Hybrid Statistical/Symbolic Approach.
CoRR, 1994

The Use of Dynamic Writing Information in a Connectionist On-Line Cursive Handwriting Recognition System.
Proceedings of the Advances in Neural Information Processing Systems 7, 1994

Inferring linguistic structure in spoken language.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Towards better language models for spontaneous speech.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Improving recognizer acceptance through robust, natural speech repair.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

See me, hear me: integrating automatic speech recognition and lip-reading.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Combining bitmaps with dynamic writing information for on-line handwriting recognition.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994

JANUS 93: towards spontaneous speech translation.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Learning state-dependent stream weights for multi-codebook HMM speech recognition systems.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Learning complex output representations in connectionist parsing of spoken language.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Future Directions.
Proceedings of the First Conference of the Association for Machine Translation in the Americas, 1994

1993
A neural fuzzy training approach for improving speech recognition.
Syst. Comput. Jpn., 1993

Machine Translation.
Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993

Recent advances in JANUS: a speech translation system.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Detection and transcription of new words.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Speaker-independent connected letter recognition with a multi-state time delay neural network.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Tuning by doing: flexibility through automatic structure optimization.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Bimodal sensor integration on the example of 'speechreading'.
Proceedings of International Conference on Neural Networks (ICNN'93), San Francisco, CA, USA, March 28, 1993

Application oriented automatic structuring of time-delay neural networks for high performance character and speech recognition.
Proceedings of International Conference on Neural Networks (ICNN'93), San Francisco, CA, USA, March 28, 1993

Improving the MS-TDNN for word spotting.
Proceedings of the IEEE International Conference on Acoustics, 1993

Multi-speaker/speaker-independent architectures for the multi-state time delay neural network.
Proceedings of the IEEE International Conference on Acoustics, 1993

Improving connected letter recognition by lipreading.
Proceedings of the IEEE International Conference on Acoustics, 1993

Multi-modal HCI: combination of gesture and speech recognition.
Proceedings of the Human-Computer Interaction, 1993

1992
Integrated phoneme and function word architecture of hidden control neural networks for continuous speech recognition.
Speech Commun., 1992

The Meta-Pi Network: Building Distributed Knowledge Representations for Robust Multisource Pattern Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 1992

Performance Through Consistency: MS-TDNN's for Large Vocabulary Continuous Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 5, NIPS Conference, Denver, Colorado, USA, November 30, 1992

Connected Letter Recognition with a Multi-State Time Delay Neural Network.
Proceedings of the Advances in Neural Information Processing Systems 5, NIPS Conference, Denver, Colorado, USA, November 30, 1992

A hybrid neural network, dynamic programming word spotter.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

Testing generality in JANUS: a multi-lingual speech translation system.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

PARSEC: a structured connectionist parsing system for spoken language.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
JANUS: Speech-to-Speech Translation Using Connectionist and Non-Connectionist Techniques.
Proceedings of the Advances in Neural Information Processing Systems 4, 1991

Multi-State Time Delay Networks for Continuous Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 4, 1991

Continuous Speech Recognition with the Connectionist Viterbi Training Procedure: A Summary of Recent Work.
Proceedings of the Artificial Neural Networks, 1991

Integrated phoneme-function word architecture of hidden control neural networks for continuous speech recognition.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Evaluation of speaker-independent phoneme recognition on TIMIT database using TDNNs.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Time-delay neural networks embedding time alignment: a performance analysis.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Recent work in continuous speech recognition using the connectionist viterbi training procedure.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

A connectionist model for dialog processing.
Proceedings of the 1991 International Conference on Acoustics, 1991

JANUS: a speech-to-speech translation system using connectionist and symbolic processing strategies.
Proceedings of the 1991 International Conference on Acoustics, 1991

Continuous speech recognition using linked predictive neural networks.
Proceedings of the 1991 International Conference on Acoustics, 1991

Integrating time alignment and neural networks for high performance continuous speech recognition.
Proceedings of the 1991 International Conference on Acoustics, 1991

Learning the architecture of neural networks for speech recognition.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
A novel objective function for improved phoneme recognition using time-delay neural networks.
IEEE Trans. Neural Networks, 1990

Spotting Phonemes and Syllables for Continuous Speech Recognition Using Time-Delay Neural Networks.
Syst. Comput. Jpn., 1990

A time-delay neural network architecture for isolated word recognition.
Neural Networks, 1990

Continuous Speech Recognition by Linked Predictive Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 3, 1990

The Tempo 2 Algorithm: Adjusting Time-Delays By Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 3, 1990

Speech recognition using sub-phoneme recognition neural network.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Phoneme-based word recognition by neural network - a step toward large vocabulary recognition.
Proceedings of the IJCNN 1990, 1990

Speaker-independent phoneme recognition on TIMIT database using integrated time-delay neural networks (TDNNs).
Proceedings of the IJCNN 1990, 1990

Large vocabulary recognition using linked predictive neural networks.
Proceedings of the 1990 International Conference on Acoustics, 1990

Robust connectionist parsing of spoken language.
Proceedings of the 1990 International Conference on Acoustics, 1990

The Meta-Pi network: connectionist rapid adaptation for high-performance multi-speaker phoneme recognition.
Proceedings of the 1990 International Conference on Acoustics, 1990

Connectionist Viterbi training: a new hybrid method for continuous speech recognition.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
Modularity and scaling in large phonemic neural networks.
IEEE Trans. Acoust. Speech Signal Process., 1989

Phoneme recognition using time-delay neural networks.
IEEE Trans. Acoust. Speech Signal Process., 1989

Modular Construction of Time-Delay Neural Networks for Speech Recognition.
Neural Comput., 1989

Incremental Parsing by Modular Recurrent Connectionist Networks.
Proceedings of the Advances in Neural Information Processing Systems 2, 1989

Connectionist Architectures for Multi-Speaker Phoneme Recognition.
Proceedings of the Advances in Neural Information Processing Systems 2, 1989

A Connectionist Parser Aimed at Spoken Language.
Proceedings of the First International Workshop on Parsing Technologies, 1989

Fast back-propagation learning methods for large phonemic neural networks.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

Consonant recognition by modular construction of large phonemic time-delay neural networks.
Proceedings of the IEEE International Conference on Acoustics, 1989

Spotting Japanese CV-syllables and phonemes using time-delay neural networks.
Proceedings of the IEEE International Conference on Acoustics, 1989

1988
Consonant Recognition by Modular Construction of Large Phonemic Time-Delay Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 1, 1988

Phoneme recognition: neural networks vs. hidden Markov models.
Proceedings of the IEEE International Conference on Acoustics, 1988

Noise reduction using connectionist models.
Proceedings of the IEEE International Conference on Acoustics, 1988

1987
Learned phonetic discrimination using connectionist networks.
Proceedings of the European Conference on Speech Technology, 1987

Prosodic knowledge sources for word hypothesization in a continuous speech recognition system.
Proceedings of the IEEE International Conference on Acoustics, 1987

1986
Recognition of lexical stress in a continuous speech understanding system - A pattern recognition approach.
Proceedings of the IEEE International Conference on Acoustics, 1986

1985
A coarse phonetic knowledge source for template independent large vocabulary word recognition.
Proceedings of the IEEE International Conference on Acoustics, 1985

1984
Suprasegmentals in very large vocabulary isolated word recognition.
Proceedings of the IEEE International Conference on Acoustics, 1984

1982
Performance trade-offs in search techniques for isolated word speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1982

