Karen Livescu

ORCID: 0000-0003-4962-946X

According to our database, Karen Livescu authored at least 146 papers between 2000 and 2024.

Bibliography

2024
What Do Self-Supervised Speech Models Know About Words?
Trans. Assoc. Comput. Linguistics, 2024

Speech Recognition for Analysis of Police Radio Communication.
CoRR, 2024

Approaching Deep Learning through the Spectral Dynamics of Weights.
CoRR, 2024

DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding.
CoRR, 2024

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models.
CoRR, 2024

ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets.
CoRR, 2024

Self-Supervised Speech Representations are More Phonetic than Semantic.
CoRR, 2024

SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale.
CoRR, 2024

UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Generative Context-Aware Fine-Tuning of Self-Supervised Speech Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

AV2WAV: Diffusion-Based Re-Synthesis from Continuous Self-Supervised Features for Audio-Visual Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Robust Speech Representation Learning for Thousands of Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

On the Evaluation of Speech Foundation Models for Spoken Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
Trans. Mach. Learn. Res., 2023

Self-Supervised Video Transformers for Isolated Sign Language Recognition.
CoRR, 2023

TTIC's Submission to WMT-SLT 23.
Proceedings of the Eighth Conference on Machine Translation, 2023

Context-Aware Fine-Tuning of Self-Supervised Speech Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Comparative Layer-Wise Analysis of Self-Supervised Speech Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Toward Joint Language Modeling for Speech Units and Text.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Audio-Visual Neural Syntax Acquisition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Few-Shot Spoken Language Understanding Via Joint Speech-Text Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Self-Supervised Speech Representation Learning: A Review.
IEEE J. Sel. Top. Signal Process., 2022

Editorial of Special Issue on Self-Supervised Learning for Speech and Audio Processing.
IEEE J. Sel. Top. Signal Process., 2022

TTIC's WMT-SLT 22 Sign Language Translation System.
Proceedings of the Seventh Conference on Machine Translation, 2022

On the Use of External Data for Spoken Named Entity Recognition.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

SLUE: New Benchmark Tasks For Spoken Language Understanding Evaluation on Natural Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

Baked-in State Probing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Open-Domain Sign Language Translation Learned from Online Video.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Searching for fingerspelled content in American Sign Language.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Chess as a Testbed for Language Model State Tracking.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
On Generalization in Coreference Resolution.
CoRR, 2021

Learning Chess Blindfolded: Evaluating Language Models on State Tracking.
CoRR, 2021

Whole-Word Segmental Speech Recognition with Acoustic Word Embeddings.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Acoustic Span Embeddings for Multilingual Query-by-Example Search.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Learning Speech Models from Multi-Modal Data.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Fingerspelling Detection in American Sign Language.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Layer-Wise Analysis of a Self-Supervised Speech Representation Model.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Substructure Substitution: Structured Data Augmentation for NLP.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
A Correspondence Variational Autoencoder for Unsupervised Acoustic Word Embeddings.
CoRR, 2020

A Cross-Task Analysis of Text Span Representations.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Multilingual Jointly Trained Acoustic and Written Word Embeddings.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Pre-Training of Bidirectional Speech Encoders via Masked Reconstruction.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

On the Role of Supervision in Unsupervised Constituency Parsing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

PeTra: A Sparsely Supervised Memory Model for People Tracking.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Discrete Latent Variable Representations for Low-Resource Text Classification.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Semantic Speech Retrieval With a Visually Grounded Model of Untranscribed Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Pre-training on high-resource speech recognition improves low-resource speech-to-text translation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Fingerspelling Recognition in the Wild With Iterative Visual Attention.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Acoustically Grounded Word Embeddings for Improved Acoustics-to-word Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

Semantic Query-by-example Speech Search Using Visual Grounding.
Proceedings of the IEEE International Conference on Acoustics, 2019

Visually Grounded Neural Syntax Acquisition.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Hierarchical Multitask Learning for CTC-based Speech Recognition.
CoRR, 2018

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

American Sign Language Fingerspelling Recognition in the Wild.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Low-Resource Speech-to-Text Translation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Acoustic Feature Learning Using Cross-Domain Articulatory Measurements.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Study of All-Convolutional Encoders for Connectionist Temporal Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Variational Sequential Labelers for Semi-Supervised Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
End-to-End Neural Segmental Models for Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2017

Lexicon-free fingerspelling recognition from video: Data, models, and signer adaptation.
Comput. Speech Lang., 2017

Semantic keyword spotting by learning from images and speech.
CoRR, 2017

Joint Modeling of Text and Acoustic-Prosodic Cues for Neural Parsing.
CoRR, 2017

Learning to Embed Words in Context for Syntactic Tasks.
Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Acoustic Feature Learning via Deep Variational Canonical Correlation Analysis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Visually Grounded Learning of Keyword Prediction from Untranscribed Speech.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multi-view Recurrent Neural Acoustic Word Embeddings.
Proceedings of the 5th International Conference on Learning Representations, 2017

Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

An embedded segmental K-means model for unsupervised segmentation and clustering of speech.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Speech Production in Speech Technologies: Introduction to the CSL Special Issue.
Comput. Speech Lang., 2016

Articulatory feature-based pronunciation modeling.
Comput. Speech Lang., 2016

Towards Universal Paraphrastic Sentence Embeddings.
Proceedings of the 4th International Conference on Learning Representations, 2016

Deep Variational Canonical Correlation Analysis.
CoRR, 2016

Large-Scale Approximate Kernel Canonical Correlation Analysis.
Proceedings of the 4th International Conference on Learning Representations, 2016

On Deep Multi-View Representation Learning: Objectives and Optimization.
CoRR, 2016

Discriminative Acoustic Word Embeddings: Recurrent Neural Network-Based Approaches.
CoRR, 2016

Jointly learning to align and convert graphemes to phonemes with neural attention models.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

End-to-end training approaches for discriminative segmental models.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Discriminative acoustic word embeddings: Recurrent neural network-based approaches.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Mapping Unseen Words to Task-Trained Embedding Spaces.
Proceedings of the 1st Workshop on Representation Learning for NLP, 2016

Triphone State-Tying via Deep Canonical Correlation Analysis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Efficient Segmental Cascades for Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Nonparametric Canonical Correlation Analysis.
Proceedings of the 33rd International Conference on Machine Learning, 2016

Deep convolutional acoustic word embeddings using word-pair side information.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Signer-independent fingerspelling recognition with deep neural network adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Charagram: Embedding Words and Sentences via Character n-grams.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
From Paraphrase Database to Compositional Paraphrase Model and Back.
Trans. Assoc. Comput. Linguistics, 2015

Deep Multilingual Correlation for Improved Word Embeddings.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

On Deep Multi-View Representation Learning.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Unsupervised learning of acoustic features via deep canonical correlation analysis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Discriminative segmental cascades for feature-rich phone recognition.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Stochastic optimization for deep CCA via nonlinear orthogonal iterations.
Proceedings of the 53rd Annual Allerton Conference on Communication, 2015

2014
Reconstruction of articulatory measurements with smoothed low-rank matrix completion.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Revisiting Word Neighborhoods for Speech Recognition.
Proceedings of the 2014 Joint Meeting of SIGMORPHON and SIGFSM, 2014

A comparison of training approaches for discriminative segmental models.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Multi-view learning with supervision for transformed bottleneck features.
Proceedings of the IEEE International Conference on Acoustics, 2014

Tailoring Continuous Word Representations for Dependency Parsing.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Discriminative training of WFST factors with application to pronunciation modeling.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Deep Canonical Correlation Analysis.
Proceedings of the 30th International Conference on Machine Learning, 2013

Fingerspelling Recognition with Semi-Markov Conditional Random Fields.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Discriminative articulatory models for spoken term detection in low-resource conversational settings.
Proceedings of the IEEE International Conference on Acoustics, 2013

Multi-view CCA-based acoustic features for phonetic recognition across speakers and domains.
Proceedings of the IEEE International Conference on Acoustics, 2013

Fixed-dimensional acoustic embeddings of variable-length segments in low-resource settings.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Subword Modeling for Automatic Speech Recognition: Past, Present, and Emerging Approaches.
IEEE Signal Process. Mag., 2012

American sign language fingerspelling recognition with phonological feature-based tandem models.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Discriminative spoken term detection with limited data.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Kernel CCA for multi-view learning of acoustic features using articulatory measurements.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Discriminatively learning factorized finite state pronunciation models from dynamic Bayesian networks.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Stochastic optimization for PCA and PLS.
Proceedings of the 50th Annual Allerton Conference on Communication, 2012

Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Articulatory Feature Classification Using Nearest Neighbors.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Nearest Neighbors with Learned Distances for Phonetic Frame Classification.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Lexical access experiments with context-dependent articulatory feature-based models.
Proceedings of the IEEE International Conference on Acoustics, 2011

A factored conditional random field model for articulatory feature forced transcription.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Audio-visual anticipatory coarticulation modeling by human and machine.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Modeling pronunciation variation with context-dependent articulatory feature decision trees.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Multistream Articulatory Feature-Based Models for Visual Speech Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Multi-view clustering via canonical correlation analysis.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

On the phonetic information in ultrasonic microphone signals.
Proceedings of the IEEE International Conference on Acoustics, 2009

Multi-view learning of acoustic features for speaker recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Invited talk: Phonological Models in Automatic Speech Recognition.
Proceedings of the Tenth Meeting of ACL Special Interest Group on Computational Morphology and Phonology, 2008

2007
Articulatory feature classifiers trained on 2000 hours of telephone speech.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop.
Proceedings of the IEEE International Conference on Acoustics, 2007

Manual Transcription of Conversational Speech at the Articulatory Feature Level.
Proceedings of the IEEE International Conference on Acoustics, 2007

An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling.
Proceedings of the IEEE International Conference on Acoustics, 2007

Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
An Asynchronous DBN for Audio-Visual Speech Recognition.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

2005
Feature-based pronunciation modeling for automatic speech recognition.
PhD thesis, 2005

Pronunciation modeling using a finite-state transducer representation.
Speech Commun., 2005

Visual Speech Recognition with Loosely Synchronized Feature Streams.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Production domain modeling of pronunciation for visual speech recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Landmark-Based Speech Recognition: Report of the 2004 Johns Hopkins Summer Workshop.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Feature-based Pronunciation Modeling for Speech Recognition.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Feature-based pronunciation modeling with trainable asynchrony probabilities.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003
Hidden feature models for speech recognition using dynamic Bayesian networks.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Structurally discriminative graphical models for automatic speech recognition - results from the 2001 Johns Hopkins Summer Workshop.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Segment-based recognition on the phonebook task: initial results and observations on duration modeling.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Lexical modeling of non-native speech for automatic speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

