Katrin Kirchhoff

Orcid: 0000-0002-6645-6030

Affiliations:
  • Amazon
  • University of Washington, Seattle, USA


According to our database, Katrin Kirchhoff authored at least 135 papers between 1996 and 2024.

Bibliography

2024
CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation.
CoRR, 2024

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models.
CoRR, 2024

SpeechVerse: A Large-scale Generalizable Audio Language Model.
CoRR, 2024

AutoGluon-Multimodal (AutoMM): Supercharging Multimodal AutoML with Foundation Models.
CoRR, 2024

DeAL: Decoding-time Alignment for Large Language Models.
CoRR, 2024

SpeechGuard: Exploring the Adversarial Robustness of Multi-modal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Metric-Driven Approach to Conformer Layer Pruning for Efficient ASR Inference.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Mask the Bias: Improving Domain-Adaptive Generalization of CTC-Based ASR with Internal Language Model Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Self-Supervised Speech Representation Learning: A Review.
IEEE J. Sel. Top. Signal Process., 2022

Device Directedness with Contextual Cues for Spoken Dialog Systems.
CoRR, 2022

Exploration of Language-Specific Self-Attention Parameters for Multilingual End-to-End Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Personalization of CTC Speech Recognition Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Directed speech separation for automatic speech recognition of long form conversational speech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Domain Prompts: Towards memory and compute efficient domain adaptation of ASR systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Contextual Acoustic Barge-In Classification for Spoken Dialog Systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Representation Learning Through Cross-Modal Conditional Teacher-Student Training For Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Enhancing Contrastive Learning with Temporal Cognizance for Audio-Visual Representation Generation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Listen, Know and Spell: Knowledge-Infused Subword Modeling for Improving ASR Performance of OOV Named Entities.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech.
CoRR, 2021

Efficient domain adaptation of language models in ASR systems using Prompt-tuning.
CoRR, 2021

ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling.
CoRR, 2021

"What's The Context?" : Long Context NLM Adaptation for ASR Rescoring in Conversational Agents.
CoRR, 2021

Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Adapting Long Context NLM for ASR Rescoring in Conversational Agents.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Speaker-Conversation Factorial Designs for Diarization Error Analysis.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Neural Inverse Text Normalization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Transformer-Transducers for Code-Switched Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Remember the Context! ASR Slot Error Correction Through Memorization.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Robust Prediction of Punctuation and Truecasing for Medical ASR.
CoRR, 2020

Grapheme-to-Phoneme Transduction for Cross-Language ASR.
Proceedings of the Statistical Language and Speech Processing, 2020

BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Multimodal Semi-Supervised Learning Framework for Punctuation Prediction in Conversational Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Continual Learning for Multi-Dialect Acoustic Models.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Contextualized Acoustic Representations for Semi-Supervised Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Unsupervised Translation Disambiguation for Cross-Domain Statistical Machine Translation.
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020

Masked Language Model Scoring.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Pseudolikelihood Reranking with Masked Language Models.
CoRR, 2019

Contextual Phonetic Pretraining for End-to-end Utterance-level Language and Speaker Recognition.
CoRR, 2019

Simple, Fast, Accurate Intent Classification and Slot Labeling.
CoRR, 2019

Simple, Fast, Accurate Intent Classification and Slot Labeling for Goal-Oriented Dialogue Systems.
Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019

Multi-Stream Network with Temporal Attention for Environmental Sound Classification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Speech Audio Super-Resolution for Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Self-attention Networks for Connectionist Temporal Classification in Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Development of machine translation technology for assisting health communication: A systematic review.
J. Biomed. Informatics, 2018

Context Models for OOV Word Translation in Low-Resource Languages.
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

2017
SVitchboard-II and FiSVer-I: Crafting high quality and low complexity conversational English speech corpora using submodular function optimization.
Comput. Speech Lang., 2017

2016
Graph-Based Semisupervised Learning for Acoustic Modeling in Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Novel Front-End Features Based on Neural Graph Embeddings for DNN-HMM and LSTM-CTC Acoustic Modeling.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Medical Text Simplification by Medical Trainees: A Feasibility Study.
Proceedings of the 2016 IEEE International Conference on Healthcare Informatics, 2016

Crowdsourced Evaluation of Medical Texts Simplified by Medical Trainees.
Proceedings of the AMIA 2016, 2016

Unsupervised Resolution of Acronyms and Abbreviations in Nursing Notes Using Document-Level Context Models.
Proceedings of the Seventh International Workshop on Health Text Mining and Information Analysis, 2016

2015
Syntactic and Semantic Features For Code-Switching Factored Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Modeling workflow to design machine translation applications for public health practice.
J. Biomed. Informatics, 2015

Exploiting Out-of-Domain Data Sources for Dialectal Arabic Statistical Machine Translation.
CoRR, 2015

Morphological Modeling for Machine Translation of English-Iraqi Arabic Spoken Dialogs.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Machine Assisted Translation of Health Materials to Chinese: An Initial Evaluation.
Proceedings of the MEDINFO 2015: eHealth-enabled Health, 2015

SVitchboard II and fiSVer i: high-quality limited-complexity corpora of conversational English speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Evaluating Groupware Prototypes with Discount Methods.
Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, 2015

Acoustic modeling with neural graph embeddings.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
A conjoint analysis framework for evaluating user preferences in machine translation.
Mach. Transl., 2014

Features for factored language models for code-Switching speech.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

Graph-based semi-supervised acoustic modeling in DNN-based speech recognition.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Combining recurrent neural networks and factored language models during decoding of code-Switching speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Comparing approaches to convert recurrent neural networks into backoff language models for efficient decoding.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Submodular subset selection for large-scale speech training data.
Proceedings of the IEEE International Conference on Acoustics, 2014

Unsupervised submodular subset selection for speech data.
Proceedings of the IEEE International Conference on Acoustics, 2014

Submodularity for Data Selection in Machine Translation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
Using Document Summarization Techniques for Speech Data Subset Selection.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Integrated post-editing and translation management for lay user communities.
Proceedings of the 2nd Workshop on Post-editing Technology and Practice, 2013

Graph-based semi-supervised learning for phone and segment classification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Classification of developmental disorders from speech signals using submodular feature selection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Submodular feature selection for high-dimensional acoustic score spaces.
Proceedings of the IEEE International Conference on Acoustics, 2013

A web-based collaborative translation management system for public health workers.
Proceedings of the 2013 ACM SIGCHI Conference on Human Factors in Computing Systems, 2013

2012
Spectrum Identification using a Dynamic Bayesian Network Model of Tandem Mass Spectra.
Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, 2012

Evaluating User Preferences in Machine Translation Using Conjoint Analysis.
Proceedings of the 16th Annual conference of the European Association for Machine Translation, 2012

2011
Application of statistical machine translation to public health information: a feasibility study.
J. Am. Medical Informatics Assoc., 2011

Semi-supervised ranking for document retrieval.
Comput. Speech Lang., 2011

Phonetic Classification Using Controlled Random Walks.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
Hand Gestures in Disambiguating Types of You Expressions in Multiparty Meetings.
Proceedings of the SIGDIAL 2010 Conference, 2010

Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation.
Proceedings of the COLING 2010, 2010

2009
Introduction to the Special Issue on Processing Morphologically Rich Languages.
IEEE Trans. Speech Audio Process., 2009

Graph-based Learning for Statistical Machine Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

The University of Washington machine translation system for IWSLT 2009.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

Communicative gestures in coreference identification in multiparty meetings.
Proceedings of the 11th International Conference on Multimodal Interfaces, 2009

2008
The University of Washington Machine Translation System for ACL WMT 2008.
Proceedings of the Third Workshop on Statistical Machine Translation, 2008

Learning to rank with partially-labeled data.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

Development of the SRI/nightingale Arabic ASR system.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Beyond Log-Linear Models: Boosted Minimum Error Rate Training for N-best Re-ranking.
Proceedings of the ACL 2008, 2008

2007
Bridging the gap between human and automatic speech recognition.
Speech Commun., 2007

Data-Driven Graph Construction for Semi-Supervised Graph-Based Learning in NLP.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Semi-automatic error analysis for large-scale statistical machine translation.
Proceedings of Machine Translation Summit XI: Papers, 2007

The University of Washington machine translation system for the IWSLT 2007 competition.
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007

Attention shift decoding for conversational speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

OOV detection by joint word/phone lattice alignment.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Graph-based learning for phonetic classification.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Recent innovations in speech-to-text transcription at SRI-ICSI-UW.
IEEE Trans. Speech Audio Process., 2006

Morphology-based language modeling for conversational Arabic speech recognition.
Comput. Speech Lang., 2006

Graphical Model Representations of Word Lattices.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Factored Neural Language Models.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

The University of Washington machine translation system for IWSLT 2006.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

The Vocal Joystick.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Lexicon Acquisition for Dialectal Arabic Using Transductive Learning.
Proceedings of the EMNLP 2006, 2006

Phrase-Based Backoff Models for Machine Translation of Highly Inflected Languages.
Proceedings of the EACL 2006, 2006

Ambiguity Reduction for Machine Translation: Human-Computer Collaboration.
Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, 2006

2005
Cross-dialectal data sharing for acoustic modeling in Arabic speech recognition.
Speech Commun., 2005

Error-correction detection and response generation in a spoken dialogue system.
Speech Commun., 2005

The Vocal Joystick: A Voice-Based Human-Computer Interface for Individuals with Motor Impairments.
Proceedings of the HLT/EMNLP 2005, 2005

Development of a conversational telephone speech recognizer for Levantine Arabic.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Genetic triangulation of graphical models for speech and language processing.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Landmark-Based Speech Recognition: Report of the 2004 Johns Hopkins Summer Workshop.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Improved Language Modeling for Statistical Machine Translation.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005

POS Tagging of Dialectal Arabic: A Minimally Supervised Approach.
Proceedings of the Workshop on Computational Approaches to Semitic Languages, 2005

2004
Morphology-based language modeling for Arabic speech recognition.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Cross-dialectal acoustic data sharing for Arabic speech recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Automatic Learning of Language Model Structure.
Proceedings of the COLING 2004, 2004

2003
Generalized rules for combination and joint training of classifiers.
Pattern Anal. Appl., 2003

Factored Language Models and Generalized Parallel Backoff.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Multi-stream language identification using data-driven dependency selection.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Novel approaches to Arabic speech recognition: report from the 2002 Johns-Hopkins Summer Workshop.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Combining acoustic and articulatory feature information for robust speech recognition.
Speech Commun., 2002

Low-resource noise-robust feature post-processing on Aurora 2.0.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

The 2001 GMTK-based SPINE ASR system.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Mixed-memory Markov models for Automatic Language Identification.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Multi-stream statistical n-gram modeling with application to automatic language identification.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Speech analysis by rule extraction from trained artificial neural networks.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Directed graphical models of classifier combination: application to phone recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Conversational speech recognition using acoustic and articulatory input.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Robust speech recognition using articulatory information.
PhD thesis, 1999

Dynamic classifier combination in hybrid speech recognition systems using utterance-level confidence values.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1996
Phonologisch strukturierte HMMs zur automatischen Spracherkennung.
Proceedings of the Natural Language Processing and Speech Technology, 1996

Syllable-level desynchronisation of phonetic features for speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

