Karen Livescu

Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Robust Speech Representation Learning for Thousands of Languages.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing.

[BibT_eX]

[DOI]

Freda Shi

Giambattista Parascandolo

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

On the Evaluation of Speech Foundation Models for Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

Bartlomiej Bojanowski

Christopher D. Manning

Daniel Moseguí González

Eunice Engefu Manyasi

Evgenii Zheltonozhskii

Fanyue Xia

Fatemeh Siar

Fernando Martínez-Plumed

Giorgio Mariani

Gloria Wang

Gonzalo Jaimovitch-López

Jaime Fernández Fisac

Jascha Sohl-Dickstein

José Hernández-Orallo

Karthik Gopalakrishnan

Lidia Contreras Ochando

Louis-Philippe Morency

María José Ramírez-Quintana

Michael I. Ivanitskiy

Neta Gur-Ari Krakover

Nitish Shirish Keskar

Pablo Antonio Moreno Casares

Pegah Alipoormolabashi

Shyamolima (Shammie) Debnath

Sneha Priscilla Makini

Yadollah Yaghoobzadeh

Trans. Mach. Learn. Res., 2023

Self-Supervised Video Transformers for Isolated Sign Language Recognition.

[BibT_eX]

[DOI]

Marcelo Sandoval-Castañeda

Yanhong Li

Marcelo Sandoval-Castañeda

CoRR, 2023

TTIC's Submission to WMT-SLT 23.

[BibT_eX]

[DOI]

Proceedings of the Eighth Conference on Machine Translation, 2023

Context-Aware Fine-Tuning of Self-Supervised Speech Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Comparative Layer-Wise Analysis of Self-Supervised Speech Models.

[BibT_eX]

[DOI]

Ankita Pasad

Proceedings of the IEEE International Conference on Acoustics, 2023

Toward Joint Language Modeling for Speech Units and Text.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Audio-Visual Neural Syntax Acquisition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Few-Shot Spoken Language Understanding Via Joint Speech-Text Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Self-Supervised Speech Representation Learning: A Review.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2022

Editorial Editorial of Special Issue on Self-Supervised Learning for Speech and Audio Processing.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2022

TTIC's WMT-SLT 22 Sign Language Translation System.

[BibT_eX]

[DOI]

Proceedings of the Seventh Conference on Machine Translation, 2022

On the Use of External Data for Spoken Named Entity Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

SLUE: New Benchmark Tasks For Spoken Language Understanding Evaluation on Natural Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Baked-in State Probing.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Open-Domain Sign Language Translation Learned from Online Video.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing.

[BibT_eX]

[DOI]

Freda Shi

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Searching for fingerspelled content in American Sign Language.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Chess as a Testbed for Language Model State Tracking.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

On Generalization in Coreference Resolution.

[BibT_eX]

[DOI]

CoRR, 2021

Learning Chess Blindfolded: Evaluating Language Models on State Tracking.

[BibT_eX]

[DOI]

CoRR, 2021

Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Acoustic Span Embeddings for Multilingual Query-by-Example Search.

[BibT_eX]

[DOI]

Yushi Hu

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Learning Speech Models from Multi-Modal Data.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Fingerspelling Detection in American Sign Language.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Layer-Wise Analysis of a Self-Supervised Speech Representation Model.

[BibT_eX]

[DOI]

Ankita Pasad

Ju-Chieh Chou

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Substructure Substitution: Structured Data Augmentation for NLP.

[BibT_eX]

[DOI]

Haoyue Shi

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings.

[BibT_eX]

[DOI]

Puyuan Peng

CoRR, 2020

A Cross-Task Analysis of Text Span Representations.

[BibT_eX]

[DOI]

Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Multilingual Jointly Trained Acoustic and Written Word Embeddings.

[BibT_eX]

[DOI]

Yushi Hu

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Pre-Training of Bidirectional Speech Encoders via Masked Reconstruction.

[BibT_eX]

[DOI]

Qingming Tang

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

On the Role of Supervision in Unsupervised Constituency Parsing.

[BibT_eX]

[DOI]

Haoyue Shi

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

PeTra: A Sparsely Supervised Memory Model for People Tracking.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Discrete Latent Variable Representations for Low-Resource Text Classification.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Semantic Speech Retrieval With a Visually Grounded Model of Untranscribed Speech.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Pre-training on high-resource speech recognition improves low-resource speech-to-text translation.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Fingerspelling Recognition in the Wild With Iterative Visual Attention.

[BibT_eX]

[DOI]

Aurora Martinez Del Rio

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Acoustically Grounded Word Embeddings for Improved Acoustics-to-word Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Semantic Query-by-example Speech Search Using Visual Grounding.

[BibT_eX]

[DOI]

Aristotelis Anastassiou

Proceedings of the IEEE International Conference on Acoustics, 2019

Visually Grounded Neural Syntax Acquisition.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Hierarchical Multitask Learning for CTC-based Speech Recognition.

[BibT_eX]

[DOI]

Kalpesh Krishna

Shubham Toshniwal

CoRR, 2018

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

American Sign Language Fingerspelling Recognition in the Wild.

[BibT_eX]

[DOI]

Aurora Martinez Del Rio

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Low-Resource Speech-to-Text Translation.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Acoustic Feature Learning Using Cross-Domain Articulatory Measurements.

[BibT_eX]

[DOI]

Qingming Tang

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Study of All-Convolutional Encoders for Connectionist Temporal Classification.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Variational Sequential Labelers for Semi-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017

End-to-End Neural Segmental Models for Speech Recognition.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2017

Lexicon-free fingerspelling recognition from video: Data, models, and signer adaptation.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

Semantic keyword spotting by learning from images and speech.

[BibT_eX]

[DOI]

CoRR, 2017

Joint Modeling of Text and Acoustic-Prosodic Cues for Neural Parsing.

[BibT_eX]

[DOI]

CoRR, 2017

Learning to Embed Words in Context for Syntactic Tasks.

[BibT_eX]

[DOI]

Lifu Tu

Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Acoustic Feature Learning via Deep Variational Canonical Correlation Analysis.

[BibT_eX]

[DOI]

Qingming Tang

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Visually Grounded Learning of Keyword Prediction from Untranscribed Speech.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multi-view Recurrent Neural Acoustic Word Embeddings.

[BibT_eX]

[DOI]

Wanjia He

Proceedings of the 5th International Conference on Learning Representations, 2017

Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

An embedded segmental K-means model for unsupervised segmentation and clustering of speech.

[BibT_eX]

[DOI]

Sharon Goldwater

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Speech Production in Speech Technologies: Introduction to the CSL Special Issue.

[BibT_eX]

[DOI]

Frank Rudzicz

Mark Hasegawa-Johnson

Jeff A. Bilmes

Comput. Speech Lang., 2016

Articulatory feature-based pronunciation modeling.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2016

Towards Universal Paraphrastic Sentence Embeddings.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Learning Representations, 2016

Deep Variational Canonical Correlation Analysis.

[BibT_eX]

[DOI]

Honglak Lee

CoRR, 2016

Large-Scale Approximate Kernel Canonical Correlation Analysis.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Learning Representations, 2016

On Deep Multi-View Representation Learning: Objectives and Optimization.

[BibT_eX]

[DOI]

CoRR, 2016

Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches.

[BibT_eX]

[DOI]

CoRR, 2016

Jointly learning to align and convert graphemes to phonemes with neural attention models.

[BibT_eX]

[DOI]

Shubham Toshniwal

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

End-to-end training approaches for discriminative segmental models.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Discriminative acoustic word embeddings: Tecurrent neural network-based approaches.

[BibT_eX]

[DOI]

Pranava Swaroop Madhyastha

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Mapping Unseen Words to Task-Trained Embedding Spaces.

[BibT_eX]

[DOI]

Mohit Bansal

Proceedings of the 1st Workshop on Representation Learning for NLP, 2016

Triphone State-Tying via Deep Canonical Correlation Analysis.

[BibT_eX]

[DOI]

Hao Tang

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Efficient Segmental Cascades for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Nonparametric Canonical Correlation Analysis.

[BibT_eX]

[DOI]

Tomer Michaeli

Proceedings of the 33nd International Conference on Machine Learning, 2016

Deep convolutional acoustic word embeddings using word-pair side information.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Signer-independent fingerspelling recognition with deep neural network adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Charagram: Embedding Words and Sentences via Character n-grams.

[BibT_eX]

[DOI]

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015

From Paraphrase Database to Compositional Paraphrase Model and Back.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2015

Deep Multilingual Correlation for Improved Word Embeddings.

[BibT_eX]

[DOI]

Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

On Deep Multi-View Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 32nd International Conference on Machine Learning, 2015

Unsupervised learning of acoustic features via deep canonical correlation analysis.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Discriminative segmental cascades for feature-rich phone recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Stochastic optimization for deep CCA via nonlinear orthogonal iterations.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual Allerton Conference on Communication, 2015

2014

Reconstruction of articulatory measurements with smoothed low-rank matrix completion.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Revisiting Word Neighborhoods for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, 2014

A comparison of training approaches for discriminative segmental models.

[BibT_eX]

[DOI]

Hao Tang

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Multi-view learning with supervision for transformed bottleneck features.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Tailoring Continuous Word Representations for Dependency Parsing.

[BibT_eX]

[DOI]

Mohit Bansal

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013

Discriminative training of WFST factors with application to pronunciation modeling.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Deep Canonical Correlation Analysis.

[BibT_eX]

[DOI]

Proceedings of the 30th International Conference on Machine Learning, 2013

Fingerspelling Recognition with Semi-Markov Conditional Random Fields.

[BibT_eX]

[DOI]

Taehwan Kim

Proceedings of the IEEE International Conference on Computer Vision, 2013

Discriminative articulatory models for spoken term detection in low-resource conversational settings.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Multi-view CCA-based acoustic features for phonetic recognition across speakers and domains.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

Subword Modeling for Automatic Speech Recognition: Past, Present, and Emerging Approaches.

[BibT_eX]

[DOI]

Florian Metze

IEEE Signal Process. Mag., 2012

American sign language fingerspelling recognition with phonological feature-based tandem models.

[BibT_eX]

[DOI]

Taehwan Kim

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Discriminative spoken term detection with limited data.

[BibT_eX]

[DOI]

Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Kernel CCA for multi-view learning of acoustic features using articulatory measurements.

[BibT_eX]

[DOI]

Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Discriminatively learning factorized finite state pronunciation models from dynamic Bayesian networks.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Stochastic optimization for PCA and PLS.

[BibT_eX]

[DOI]

Proceedings of the 50th Annual Allerton Conference on Communication, 2012

Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach.

[BibT_eX]

[DOI]

Hao Tang

Joseph Keshet

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011

Articulatory Feature Classification Using Nearest Neighbors.

[BibT_eX]

[DOI]

Arild Brandrud Næss

Rohit Prabhavalkar

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Nearest Neighbors with Learned Distances for Phonetic Frame Classification.

[BibT_eX]

[DOI]

John Labiak

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Lexical access experiments with context-dependent articulatory feature-based models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

A factored conditional random field model for articulatory feature forced transcription.

[BibT_eX]

[DOI]

Rohit Prabhavalkar

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Audio-visual anticipatory coarticulation modeling by human and machine.

[BibT_eX]

[DOI]

Louis H. Terry

Janet B. Pierrehumbert

Aggelos K. Katsaggelos

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Modeling pronunciation variation with context-dependent articulatory feature decision trees.

[BibT_eX]

[DOI]

Samuel R. Bowman

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Multistream Articulatory Feature-Based Models for Visual Speech Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2009

Multi-view clustering via canonical correlation analysis.

[BibT_eX]

[DOI]

Proceedings of the 26th Annual International Conference on Machine Learning, 2009

On the phonetic information in ultrasonic microphone signals.

[BibT_eX]

[DOI]

Bo Zhu

Proceedings of the IEEE International Conference on Acoustics, 2009

Multi-view learning of acoustic features for speaker recognition.

[BibT_eX]

[DOI]

Mark Stoehr

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008

Invited talk: Phonological Models in Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Tenth Meeting of ACL Special Interest Group on Computational Morphology and Phonology, 2008

2007

Articulatory feature classifiers trained on 2000 hours of telephone speech.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop.

[BibT_eX]

[DOI]

Özgür Çetin

Mark Hasegawa-Johnson

Stephen Dawson-Haggerty

Proceedings of the IEEE International Conference on Acoustics, 2007

Manual Transcription of Conversational Speech at the Articulatory Feature Level.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPS.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

An Asynchronous DBN for Audio-Visual speech Recognition.

[BibT_eX]

[DOI]

Kate Saenko

Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

2005

Feature-based pronunciation modeling for automatic speech recognition.

[BibT_eX]

[DOI]

PhD thesis, 2005

Pronunciation modeling using a finite-state transducer representation.

[BibT_eX]

[DOI]

Speech Commun., 2005

Visual Speech Recognition with Loosely Synchronized Feature Streams.

[BibT_eX]

[DOI]

Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Production domain modeling of pronunciation for visual speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Landmark-Based Speech Recognition: Report of the 2004 Johns Hopkins Summer Workshop.

[BibT_eX]

[DOI]

Mark Hasegawa-Johnson

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Feature-based Pronunciation Modeling for Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Feature-based pronunciation modeling with trainable asynchrony probabilities.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

Hidden feature models for speech recognition using dynamic Bayesian networks.

[BibT_eX]

[DOI]

Jeff A. Bilmes

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Structurally discriminative graphical models for automatic speech recognition - results from the 2001 Johns Hopkins Summer Workshop.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Segment-based recognition on the phonebook task: initial results and observations on duration modeling.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

Lexical modeling of non-native speech for automatic speech recognition.

[BibT_eX]

[DOI]