Mari Ostendorf

Affiliations:
  • University of Washington


According to our database1, Mari Ostendorf authored at least 291 papers between 1988 and 2024.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2005, "For contributions to statistical modeling of speech signals.".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
CorrectionLM: Self-Corrections with SLM for Dialogue State Tracking.
CoRR, 2024

Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue.
CoRR, 2024

Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models.
CoRR, 2024

Encode Once and Decode in Parallel: Efficient Transformer Decoding.
CoRR, 2024

OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023
InSCIt: Information-Seeking Conversations with Mixed-Initiative Interactions.
Trans. Assoc. Comput. Linguistics, 2023

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations.
CoRR, 2023

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Selective Annotation Makes Language Models Better Few-Shot Learners.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Binding Language Models in Symbolic Languages.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Leveraging Multiple Sources in Automatic African American English Dialect Detection for Adults and Children.
Proceedings of the IEEE International Conference on Acoustics, 2023

One Embedder, Any Task: Instruction-Finetuned Text Embeddings.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Building blocks for complex tasks: Robust generative event extraction for radiology reports under domain shifts.
Proceedings of the 5th Clinical Natural Language Processing Workshop, 2023

2022
Spoken language interaction with robots: Recommendations for future research.
Comput. Speech Lang., 2022

Generalizing through Forgetting - Domain Generalization for Symptom Event Extraction in Clinical Notes.
CoRR, 2022

Automatic Dialect Density Estimation for African American English.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Leveraging Prosody for Punctuation Prediction of Spontaneous Speech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Unsupervised Learning of Hierarchical Conversation Structure.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

In-Context Learning for Few-Shot Dialogue State Tracking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Annotating social determinants of health using active learning, and characterizing determinants using neural event extraction.
J. Biomed. Informatics, 2021

Extracting COVID-19 diagnoses and symptoms from clinical text: A new annotated corpus and neural event extraction framework.
J. Biomed. Informatics, 2021

Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Revisiting Parity of Human vs. Machine Conversational Speech Transcription.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Dialogue State Tracking with a Language Model using Schema-Driven Prompting.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Representations for Question Answering from Documents with Tables and Text.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

A Controllable Model of Grounded Response Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Extracting Summary Knowledge Graphs from Long Documents.
CoRR, 2020

Analysis of Disfluency in Children's Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Mining Effective Negative Training Samples for Keyword Spotting.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Novel Corpus With Detailed Annotations of Social Determinants of Health.
Proceedings of the AMIA 2020, 2020

2019
Region Proposal Network Based Small-Footprint Keyword Spotting.
IEEE Signal Process. Lett., 2019

Giving Attention to the Unexpected: Using Prosody Innovations in Disfluency Detection.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A general framework for information extraction using dynamic span graphs.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A Dynamic Speaker Model for Conversational Interactions.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Disfluencies and Human Speech Transcription Errors.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

On the Role of Style in Parsing Speech with Neural Models.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Automated Essay Scoring with Discourse-Aware Neural Models.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019

Automatic Identification of Social Determinants of Health from Clinical Records.
Proceedings of the AMIA 2019, 2019

2018
Conversation Modeling on Reddit Using a Graph-Structured LSTM.
Trans. Assoc. Comput. Linguistics, 2018

Low-Rank RNN Adaptation for Context-Aware Language Modeling.
Trans. Assoc. Comput. Linguistics, 2018

Robust cross-domain disfluency detection with pattern match networks.
CoRR, 2018

Scientific Relation Extraction with Selectively Incorporated Concept Embeddings.
CoRR, 2018

Real-Time Prediction of the Duration of Distribution System Outages.
CoRR, 2018

Asynchronous Speech Recognition Affects Physician Editing of Notes.
Appl. Clin. Inform., 2018

The UWNLP system at SemEval-2018 Task 7: Neural Relation Extraction Model with Selectively Incorporated Concept Embeddings.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Community Member Retrieval on Social Media Using Textual Information.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Sounding Board: A User-Centric and Content-Driven Social Chatbot.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Training Augmentation with Adversarial Examples for Robust Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Domain Adversarial Training for Accented Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Estimating Linguistic Complexity for Science Texts.
Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications@NAACL-HLT 2018, 2018

Using Neural Multi-task Learning to Extract Substance Abuse Information from Clinical Notes.
Proceedings of the AMIA 2018, 2018

Personalized Language Model for Query Auto-Completion.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
An Open Letter to the Members of the IEEE Industrial Electronics Technical Community.
IEEE Trans. Ind. Informatics, 2017

Joint Modeling of Text and Acoustic-Prosodic Cues for Neural Parsing.
CoRR, 2017

Improving Context Aware Language Models.
CoRR, 2017

Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads.
CoRR, 2017

Scientific Information Extraction with Semi-supervised Neural Tagging.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Factored Neural Network Model for Characterizing Online Discussions in Vector Space.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Language Based Mapping of Science Assessment Items to Skills.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

Automatically Detecting Likely Edits in Clinical Notes Created Using Automatic Speech Recognition.
Proceedings of the AMIA 2017, 2017

2016
Using Pronunciation-Based Morphological Subword Units to Improve OOV Handling in Keyword Search.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

LSTM based Conversation Models.
CoRR, 2016

Deep Reinforcement Learning with a Combinatorial Action Space for Predicting and Tracking Popular Discussion Threads.
CoRR, 2016

Learning Latent Local Conversation Modes for Predicting Community Endorsement in Online Discussions.
CoRR, 2016

Continuous-Space Language Processing: Beyond Word Embeddings.
Proceedings of the Statistical Language and Speech Processing, 2016

Phonological Pun-derstanding.
Proceedings of the NAACL HLT 2016, 2016

Disfluency Detection Using a Bidirectional LSTM.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Domain Adaptation of Recurrent Neural Networks for Natural Language Understanding.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Characterizing the Language of Online Communities and its Relation to Community Reception.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Deep Reinforcement Learning with a Natural Language Action Space.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Hierarchical Character-Word Models for Language Identification.
Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media, 2016

Learning Latent Local Conversation Modes for Predicting Comment Endorsement in Online Discussions.
Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media, 2016

A Neural Model for Language Identification in Code-Switched Tweets.
Proceedings of the Second Workshop on Computational Approaches to Code Switching@EMNLP 2016, 2016

2015
A Sparse Plus Low-Rank Exponential Language Model for Limited Resource Scenarios.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Exponential Language Modeling Using Morphological Features and Multi-Task Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Leveraging Twitter for Low-Resource Conversational Speech Language Modeling.
CoRR, 2015

Deep Reinforcement Learning with an Unbounded Action Space.
CoRR, 2015

Data Selection With Fewer Words.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Unediting: Detecting Disfluencies Without Careful Transcripts.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Aligning Sentences from Standard Wikipedia to Simple Wikipedia.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Learning phrase patterns for ASR name error detection using semantic similarity.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Investigating the role of 'yeah' in stance-dense conversation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Talking to the crowd: What do people react to in online discussions?
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

What Your Username Says About You.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Open-Domain Name Error Detection using a Multi-Task RNN.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014
Editorial: Expanding the Technical Reach of our Transactions.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Recognition of stance strength and polarity in spontaneous speech.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Effective data-driven feature learning for detecting name errors in automatic speech recognition.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Multi-domain disfluency and repair detection.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Learning phrase patterns for text classification using a knowledge graph and unlabeled data.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Relating automatic vowel space estimates to talker intelligibility.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Manipulating stance and involvement using collaborative tasks: an exploratory comparison.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Domain adaptation for parsing in automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Subword-based modeling for handling OOV words inkeyword spotting.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Graph-Based Query Strategies for Active Learning.
IEEE Trans. Speech Audio Process., 2013

Learning Phrase Patterns for Text Classification.
IEEE Trans. Speech Audio Process., 2013

Atypical Prosodic Structure as an Indicator of Reading Level and Text Difficulty.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

A sequential repetition model for improved disfluency detection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Exceptions in language as learned by the multi-factor sparse plus low-rank language model.
Proceedings of the IEEE International Conference on Acoustics, 2013

"Can you give me another word for hyperbaric?": Improving speech translation using targeted clarification questions.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
A Message from the Vice President of Publications on New Developments in Signal Processing Society Publications.
IEEE Trans. Signal Process., 2012

Editorial Message From the Vice President of Publications on New Developments in Signal Processing Society Publications.
IEEE Trans. Image Process., 2012

Joint reranking of parsing and word recognition with automatic segmentation.
Comput. Speech Lang., 2012

Using syntactic and confusion network structure for out-of-vocabulary word detection.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Semi-supervised learning for text classification using feature affinity regularization.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

A Sparse Plus Low Rank Maximum Entropy Language Model.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Detecting targets of alignment moves in multiparty discussions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Low Rank Language Models for Small Training Sets.
IEEE Signal Process. Lett., 2011

Identifying targets for syntactic simplification.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2011

Analyzing conversations using rich phrase patterns.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Question Detection in Spoken Conversations Using Textual Conversations.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Detecting authority bids in online discussions.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Extracting Phrase Patterns with Minimum Redundancy for Unsupervised Speaker Role Classification.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Automatic Generation of Personalized Annotation Tags for Twitter Users.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Unsupervised broadcast conversation speaker role labeling.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Building A Highly Accurate Mandarin Speech Recognizer With Language-Independent Technologies and Language-Dependent Modules.
IEEE Trans. Speech Audio Process., 2009

Expected dependency pair match: predicting translation quality with expected syntactic structure.
Mach. Transl., 2009

A machine learning approach to reading level assessment.
Comput. Speech Lang., 2009

Improving robustness of MLLR adaptation with speaker-clustered regression class trees.
Comput. Speech Lang., 2009

Analysis of vocabulary difficulty using Wiktionary.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2009

Classifying Factored Genres with Part-of-Speech Histograms.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Improving the recognition of names by document-level clustering.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Transcribing human-directed speech for spoken language processing.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Freshman design: A signal-processing approach.
Proceedings of the IEEE International Conference on Acoustics, 2009

Filtering web text to match target genres.
Proceedings of the IEEE International Conference on Acoustics, 2009

Acoustic-based pitch-accent detection in speech: Dependence on word identity and insensitivity to variations inword usage.
Proceedings of the IEEE International Conference on Acoustics, 2009

Part-of-speech histograms for genre classification of text.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Speech segmentation and spoken document processing.
IEEE Signal Process. Mag., 2008

Speech technology and information access [In the Spotlight].
IEEE Signal Process. Mag., 2008

Cross-validation and aggregated EM training for robust parameter estimation.
Comput. Speech Lang., 2008

Modeling Vocal Interaction for Text-Independent Participant Characterization in Multi-Party Conversation.
Proceedings of the SIGDIAL 2008 Workshop, 2008

Non-segmental duration feature extraction for prosodic classification.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Parsing-based objective functions for speech recognition in translation applications.
Proceedings of the IEEE International Conference on Acoustics, 2008

Punctuating speech for information extraction.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Multirate Coupled Hidden Markov Models and Their Application to Machining Tool-Wear Classification.
IEEE Trans. Signal Process., 2007

Web resources for language modeling in conversational speech recognition.
ACM Trans. Speech Lang. Process., 2007

Symbolic phonetic features for modeling of pronunciation variation.
Speech Commun., 2007

Text simplification for language learners: a corpus analysis.
Proceedings of the Workshop on Speech and Language Technology in Education, 2007

Modeling Vocal Interaction for Text-Independent Classification of Conversation Type.
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007

Problem-Sensitive Response Generation in Human-Robot Dialogs.
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007

iROVER: Improving System Combination with Classification.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Automatic acoustic segmentation for speech recognition on broadcast recordings.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Improving speech translation with automatic boundary prediction.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Cross-Validation EM Training for Robust Parameter Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Word-Level Tone Modeling for Mandarin Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

Cross-Site and Intra-Site ASR System Combination: Comparisons on Lattice and 1-Best Methods.
Proceedings of the IEEE International Conference on Acoustics, 2007

Efficient use of overlap information in speaker diarization.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Building a highly accurate Mandarin speech recognizer.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Recent innovations in speech-to-text transcription at SRI-ICSI-UW.
IEEE Trans. Speech Audio Process., 2006

Enriching speech recognition with automatic detection of sentence boundaries and disfluencies.
IEEE Trans. Speech Audio Process., 2006

Parse Structure and Segmentation for Improving speech Recognition.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Impact of Automatic Comma Prediction on POS/Name Tagging of speech.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

Agreement/Disagreement Classification: Exploiting Unlabeled Data using Contrast Classifiers.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

The ISL RT-06S Speech-to-Text System.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

SParseval: Evaluation Metrics for Parsing Speech.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Assessing the reading level of web pages.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Speaker clustered regression-class trees for MLLR adaptation.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Improved tone modeling for Mandarin broadcast news speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Advances in lecture recognition: the ISL RT-06s evaluation system.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Compensating for Word Posterior Estimation Bias in Confusion Networks.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Pushing the envelope - aside [speech recognition].
IEEE Signal Process. Mag., 2005

A quantitative assessment of the importance of tone in mandarin speech recognition.
IEEE Signal Process. Lett., 2005

Error-correction detection and response generation in a spoken dialogue system.
Speech Commun., 2005

Improving out-of-vocabulary name resolution.
Comput. Speech Lang., 2005

Effective Use of Prosody in Parsing Conversational Speech.
Proceedings of the HLT/EMNLP 2005, 2005

Data sampling for improved speech recognizer training.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Leveraging speaker-dependent variation of adaptation.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Incorporating tone-related MLP posteriors in the feature representation for Mandarin ASR.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Using symbolic prominence to help design feature subsets for topic classification and clustering of natural human-human conversations.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Human language technology: opportunities and challenges.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Web-Data Augmented Language Models for Mandarin Conversational Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Structural metadata research in the EARS program.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

DBN-Based Multi-stream Models for Mandarin Toneme Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Multi-Rate and Variable-Rate Modeling of Speech At Phone and Syllable Time Scales.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Reading Level Assessment Using Support Vector Machines and Statistical Language Models.
Proceedings of the ACL 2005, 2005

A Quantitative Analysis of Lexical Differences Between Genders in Telephone Conversations.
Proceedings of the ACL 2005, 2005

2004
Adaptive language modeling with varied sources to cover new vocabulary items.
IEEE Trans. Speech Audio Process., 2004

Combining Multiple Clustering Systems.
Proceedings of the Knowledge Discovery in Databases: PKDD 2004, 2004

Detecting Structural Metadata with Decision Trees and Transformation-Based Learning.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Parsing Conversational Speech Using Enhanced Segmentation.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Improving Automatic Sentence Boundary Detection with Confusion Networks.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

The 2004 ICSI-SRI-UW Meeting Recognition System.
Proceedings of the Machine Learning for Multimodal Interaction, 2004

Progress on Mandarin conversational telephone speech recognition.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

From switchboard to meetings: development of the 2004 ICSI-SRI-UW meeting recognition system.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

The ICSI-SRI-UW metadata extraction system.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Multi-rate hidden Markov models and their application to machining tool-wear classification.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Multilevel Classification of Milling Tool Wear with Confidence Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Acoustic model clustering based on syllable structure.
Comput. Speech Lang., 2003

Parameter reduction schemes for loosely coupled HMMs.
Comput. Speech Lang., 2003

Detection Of Agreement vs. Disagreement In Meetings: Training With Unlabeled Data.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Getting More Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Cross-stream observation dependencies for multi-stream speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Graceful degradation of speech recognition performance over packet-erasure networks.
IEEE Trans. Speech Audio Process., 2002

Efficient integrated response generation from multiple targets using weighted finite state transducers.
Comput. Speech Lang., 2002

The 2001 GMTK-based SPINE ASR system.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Text normalization with varied data sources for conversational speech language modeling.
Proceedings of the IEEE International Conference on Acoustics, 2002

Robust splicing costs and efficient search with BMM Models for concatenative speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Normalization of non-standard words.
Comput. Speech Lang., 2001

Obituary: M. W. Macon 1969-2001.
Comput. Speech Lang., 2001

Improving Information Extraction by Modeling Errors in Speech Recognizer Output.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Graceful degradation of speech recognition performance over lossy packet networks.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Improved word confidence estimation using long range features.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Unit selection for speech synthesis using splicing costs with weighted finite state transducers.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Joint use of dynamical classifiers and ambiguity plane features.
Proceedings of the IEEE International Conference on Acoustics, 2001

Joint prosody prediction and unit selection for concatenative speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
ML parameter estimation of a multiscale stochastic process using the EM algorithm.
IEEE Trans. Signal Process., 2000

Variable n-grams and extensions for conversational speech language modeling.
IEEE Trans. Speech Audio Process., 2000

Robust information extraction from automatically generated speech transcriptions.
Speech Commun., 2000

Editorial: New developments at CSL.
Comput. Speech Lang., 2000

Obituary: J. Allen 1934-2000.
Comput. Speech Lang., 2000

Integrating a context-dependent phrase grammar in the variable n-gram framework.
Proceedings of the IEEE International Conference on Acoustics, 2000

Use of higher level linguistic structure in acoustic modeling for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

Hidden Markov models for monitoring machining tool-wear.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
A dynamical system model for generating fundamental frequency for speech synthesis.
IEEE Trans. Speech Audio Process., 1999

Modeling long distance dependence in language: topic mixtures versus dynamic cache models.
IEEE Trans. Speech Audio Process., 1999

Reducing the effects of linear channel distortion on continuous speech recognition.
IEEE Trans. Speech Audio Process., 1999

Joint lexicon, acoustic unit inventory and model design.
Speech Commun., 1999

Relevance weighting for combining multi-domain data for n-gram language modeling.
Comput. Speech Lang., 1999

Robust information extraction from spoken language data.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A new metric for stochastic language model evaluation.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Predicting gradient F0 variation: pitch range and accent prominence.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
A comparison of constrained trajectory segment models for large vocabulary speech recognition.
IEEE Trans. Speech Audio Process., 1998

Automatic detection of sentence boundaries and disfluencies based on recognized words.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

SABLE: a standard for TTS markup.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Prosody prediction for speech synthesis using transformational rule-based learning.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Using automatically-derived acoustic sub-word units in large vocabulary speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
Using out-of-domain data to improve in-domain language models.
IEEE Signal Process. Lett., 1997

Prosodic and lexical indications of discourse structure in human-machine interactions.
Speech Commun., 1997

HMM topology design using maximum likelihood successive state splitting.
Comput. Speech Lang., 1997

Variable n-gram language modeling and extensions for conversational speech.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Modeling dependency in adaptation of acoustic models using multiscale tree processes.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Transforming out-of-domain estimates to improve in-domain language models.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Adaptation of polynomial trajectory segment models for large vocabulary speech recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

A Multi-level Model for Recognition of Intonation Labels.
Proceedings of the Computing Prosody, 1997

1996
From HMM's to segment models: a unified view of stochastic modeling for speech recognition.
IEEE Trans. Speech Audio Process., 1996

Prediction of abstract prosodic labels for speech synthesis.
Comput. Speech Lang., 1996

Modeling disfluencies in conversational speech.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Modeling long distance dependence in language: topic mixtures vs. dynamic cache models.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Maximum likelihood successive state splitting.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

A dependence tree model of phone correlation.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Design of a speech recognition system based on acoustically derived segmental units.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
The challenge of spoken language systems: research directions for the nineties.
IEEE Trans. Speech Audio Process., 1995

Parameter estimation of dependence tree models using the EM algorithm.
IEEE Signal Process. Lett., 1995

A dynamical system model for recognizing intonation patterns.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Lattice-based search strategies for large vocabulary speech recognition.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Automatic labeling of prosodic patterns.
IEEE Trans. Speech Audio Process., 1994

Maximum likelihood clustering of Gaussians for speech recognition.
IEEE Trans. Speech Audio Process., 1994

A Hierarchical Stochastic Model for Automatic Prediction of Prosodic Boundary Location.
Comput. Linguistics, 1994

A dynamical system model for generating F0 for synthesis.
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

Segment-Based Acoustic Models for Continuous Speech Recognition.
Proceedings of the Human Language Technology, 1994

Evaluating the Use of Prosodic Information in Speech Recognition and Understanding.
Proceedings of the Human Language Technology, 1994

Language Modeling with Sentence-Level Mixtures.
Proceedings of the Human Language Technology, 1994

1993
ML estimation of a stochastic linear system with the EM algorithm and its application to speech recognition.
IEEE Trans. Speech Audio Process., 1993

Parse scoring with prosodic information: an analysis/synthesis approach.
Comput. Speech Lang., 1993

rosody/Parse Scoring and its Application in ATIS.
Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993

On The Use Of Tied-Mixture Distributions.
Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993

Probabilistic parse scoring with prosodic information.
Proceedings of the IEEE International Conference on Acoustics, 1993

A comparison of trajectory and mixture modeling in segment-based word recognition.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
Context modeling with the stochastic segment model.
IEEE Trans. Signal Process., 1992

Fast algorithms for phone classification and recognition using segment-based models.
IEEE Trans. Signal Process., 1992

Probabilistic Parse Scoring Based on Prosodic Phrasing.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Recognition Using Classification and Segmentation Scoring.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Weight Estimation for N-Best Rescoring.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Parse scoring with prosodic information.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

TOBI: a standard for labeling English prosody.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Factors affecting pitch accent placement.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Automatic recognition of intonational features.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

A Bayesian approach to speaker adaptation for the stochastic segment model.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Use of Prosody in Syntactic Disambiguation: An Analysis-by-Synthesis Approach.
Proceedings of the Speech and Natural Language, 1991

The Use of Prosody in Syntactic Disambiguation.
Proceedings of the Speech and Natural Language, 1991

Integration of Diverse Recognition Methodologies Through Reevaluation of N-Best Sentence Hypotheses.
Proceedings of the Speech and Natural Language, 1991

Session 6: Demonstrations and Videotapes of Speech and Natural Language Technologies.
Proceedings of the Speech and Natural Language, 1991

Automatic recognition of prosodic phrases.
Proceedings of the 1991 International Conference on Acoustics, 1991

A dynamical system approach to continuous speech recognition.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Evaluating the Use of Prosodic Information in Speech Recognition and Understanding.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

Fast Search Algorithms for Connected Phone Recognition Using the Stochastic Segment Model.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

The use of relative duration in syntactic disambiguation.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Markov modeling of prosodic phrase structure.
Proceedings of the 1990 International Conference on Acoustics, 1990

Joint quantizer design and parameter estimation for discrete hidden Markov models.
Proceedings of the 1990 International Conference on Acoustics, 1990

Isolated word intonation recognition using hidden Markov models.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
A stochastic segment model for phoneme-based continuous speech recognition.
IEEE Trans. Acoust. Speech Signal Process., 1989

Prosody and Parsing.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989

Segment-Based Acoustic Models with Multi-level Search Algorithms for Continuous Speech Recognition.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989

Improvements in the Stochastic Segment Model for Phoneme Recognition.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989

1988
Stochastic segment modelling using the estimate-maximize algorithm [speech recognition].
Proceedings of the IEEE International Conference on Acoustics, 1988


  Loading...