James R. Glass
Orcid: 0000-0002-3097-360XAffiliations:
- Massachusetts Institute of Technology (MIT), CSAIL, Cambridge, MA, USA
According to our database1,
James R. Glass
authored at least 421 papers
between 1985 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on dl.acm.org
On csauthors.net:
Bibliography
2024
Decoding on Graphs: Faithful and Sound Reasoning on Knowledge Graphs through Generation of Well-Formed Chains.
CoRR, 2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models.
CoRR, 2024
CoRR, 2024
Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer.
CoRR, 2024
CoRR, 2024
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation.
CoRR, 2024
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024
What, When, and Where? Self-Supervised Spatio- Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Found in the middle: Calibrating Positional Attention Bias Improves Long Context Utilization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
CoRR, 2023
CoRR, 2023
PCFG-Based Natural Language Interface Improves Generalization for Controlled Text Generation.
Proceedings of the The 12th Joint Conference on Lexical and Computational Semantics, 2023
Proceedings of the 8th Workshop on Representation Learning for NLP, 2023
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023
On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
IEEE J. Sel. Top. Signal Process., 2022
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-Level Cross-Lingual Speech Representation.
IEEE J. Sel. Top. Signal Process., 2022
Developing a Series of AI Challenges for the United States Department of the Air Force.
CoRR, 2022
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification.
CoRR, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Speak: A Toolkit Using Amazon Mechanical Turk to Collect and Validate Speech Audio Recordings.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Repetition Assessment for Speech and Language Disorders: A Study of the Logopenic Variant of Primary Progressive Aphasia.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
CoRR, 2021
CoRR, 2021
PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation.
CoRR, 2021
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Exposure Bias versus Self-Recovery: Are Distortions Really Incremental for Autoregressive Text Generation?
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Analyzing the Forgetting Problem in Pretrain-Finetuning of Open-domain Dialogue Response Models.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Int. J. Comput. Vis., 2020
Constructing a Knowledge Graph from Unstructured Documents without External Alignment.
CoRR, 2020
CoRR, 2020
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning.
CoRR, 2020
Comput. Linguistics, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Pair Expansion for Learning Multilingual Semantic Embeddings Using Disjoint Visually-Grounded Speech Audio Datasets.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
What Does an End-to-End Dialect Identification Model Learn About Non-Dialectal Information?
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
A Systematic Characterization of Sampling Algorithms for Open-ended Language Generation.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Audio-Visual Calibration with Polynomial Regression for 2-D Projection Using SVD-PHAT.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 3rd Clinical Natural Language Processing Workshop, 2020
2019
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Deep Learning for Database Mapping and Asking Clarification Questions in Dialogue Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Trans. Assoc. Comput. Linguistics, 2019
ACM J. Data Inf. Qual., 2019
Inf. Process. Manag., 2019
Mix-review: Alleviate Forgetting in the Pretrain-Finetune Framework for Neural Language Generation Models.
CoRR, 2019
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models.
CoRR, 2019
Team QCRI-MIT at SemEval-2019 Task 4: Propaganda Analysis Meets Hyperpartisan News Detection.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Multi-Task Ordinal Regression for Jointly Predicting the Trustworthiness and the Leading Political Ideology of News Media.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Integrating Video Retrieval and Moment Detection in a Unified Corpus for Video Question Answering.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Noise-tolerant Audio-visual Online Person Verification Using an Attention-based Neural Network Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2019
Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
A Factorial Deep Markov Model for Unsupervised Disentangled Representation Learning from Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Subword Regularization and Beam Search Decoding for End-to-end Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Explicit Alignment of Text and Speech Encodings for Attention-Based End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Improving Neural Language Models by Segmenting, Attending, and Predicting the Future.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
A Low-Power Speech Recognizer and Voice Activity Detector Using Deep Neural Networks.
IEEE J. Solid State Circuits, 2018
A Study of the Complexity and Accuracy of Direction of Arrival Estimation Methods Based on GCC-PHAT for a Pair of Close Microphones.
CoRR, 2018
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System.
CoRR, 2018
Disentangling by Partitioning: A Representation Learning Framework for Multimodal Sensory Data.
CoRR, 2018
Convolutional Neural Networks and Language Embeddings for End-to-End Dialect Recognition.
CoRR, 2018
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018
On Training Recurrent Networks with Truncated Backpropagation Through time in Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Frame-Level Speaker Embeddings for Text-Independent Speaker Recognition and Analysis of End-to-End Model.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Convolutional Neural Networks for Dialogue State Tracking without Pre-Trained Word Vectors or Semantic Dictionaries.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Convolutional Neural Network and Language Embeddings for End-to-End Dialect Recognition.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
A Study of Enhancement, Augmentation and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 24th International Conference on Pattern Recognition, 2018
Exploiting Convolutional Neural Networks for Phonotactic Based Dialect Identification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Convolutional Neural Networks and Multitask Strategies for Semantic Mapping of Natural Language Input to a Structured Database.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Learning Word Representations with Cross-Sentence Dependencyfor End-to-End Co-reference Resolution.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Bidirectional Backpropagation: Towards Biologically Plausible Error Signal Transmission in Neural Networks.
CoRR, 2017
Unsupervised Learning of Disentangled and Interpretable Representations from Sequential Data.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
14.4 A scalable speech recognizer with deep-neural-network acoustic models and voice-activated power gating.
Proceedings of the 2017 IEEE International Solid-State Circuits Conference, 2017
Character-Based Embedding Models and Reranking Strategies for Understanding Natural Language Meal Descriptions.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
An Environmental Feature Representation for Robust Speech Recognition and for Environment Identification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017
Semantic mapping of natural language input to database entries via convolutional neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
MIT-QCRI Arabic dialect identification system for the 2017 multi-genre broadcast challenge.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Unsupervised domain adaptation for robust speech recognition via variational autoencoder-based data augmentation.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
CoRR, 2016
Large-Scale Machine Translation between Arabic and Hebrew: Available Corpora and Initial Results.
CoRR, 2016
A Character-level Convolutional Neural Network for Distinguishing Similar Languages and Dialects.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Development of the MIT ASR system for the 2016 Arabic Multi-genre Broadcast Challenge.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016
SLS at SemEval-2016 Task 3: Neural-based Approaches for Ranking in Community Question Answering.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016
Proceedings of the 1st Workshop on Representation Learning for NLP, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Exploiting Depth and Highway Connections in Convolutional Recurrent Deep Neural Networks for Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Prediction-adaptation-correction recurrent neural networks for low-resource language speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the COLING 2016, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Trans. Assoc. Comput. Linguistics, 2015
IEEE J. Solid State Circuits, 2015
A Situationally Aware Voice-commandable Robotic Forklift Working Alongside People in Unstructured Outdoor Environments.
J. Field Robotics, 2015
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015
VectorSLU: A Continuous Word Vector Approach to Answer Selection in Community Question Answering Systems.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015
Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
On using heterogeneous data for vehicle-based speech recognition: A DNN-based approach.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the 2014 IEEE International Conference on Solid-State Circuits Conference, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Graph-based re-ranking using acoustic feature similarity between search results for spoken term detection on low-resource languages.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Speech recognition without a lexicon - bridging the gap between graphemic and phonetic systems.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Extracting deep neural network bottleneck features using low-rank matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2014
Speech feature denoising and dereverberation via deep autoencoders for noisy reverberant speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
A Study of using Syntactic and Semantic Structures for Concept Segmentation and Labeling.
Proceedings of the COLING 2014, 2014
Proceedings of the 36th Annual Meeting of the Cognitive Science Society, 2014
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2014
2013
IEEE Trans. Speech Audio Process., 2013
IEEE Trans. Speech Audio Process., 2013
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Mispronunciation detection via dynamic time warping on deep belief network-based posteriorgrams.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 11th International Conference on Information Science, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Fast spoken query detection using lower-bound Dynamic Time Warping on Graphical Processing Units.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Handling uncertain observations in unsupervised topic-mixture language model adaptation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Evaluation of multi-level context-dependent acoustic model for large vocabulary speaker adaptation tasks.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012
2011
A Piecewise Aggregate Approximation Lower-Bound Estimate for Posteriorgram-Based Dynamic Time Warping.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Robust Voice Activity Detector for Real World Applications Using Harmonicity and Modulation Frequency.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Introduction to the Issue on Speech Processing for Natural Interaction With Intelligent Environments.
IEEE J. Sel. Top. Signal Process., 2010
Combining missing-feature theory, speech enhancement, and speaker-dependent/-independent modeling for speech separation.
Comput. Speech Lang., 2010
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Unsupervised Speaker Adaptation based on the Cosine Similarity for Text-Independent Speaker Verification.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
Proceedings of the International Conference on Language Resources and Evaluation, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
A voice-commandable robotic forklift working alongside humans in minimally-prepared outdoor environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the 5th ACM/IEEE International Conference on Human Robot Interaction, 2010
2009
Updated MINDS report on speech recognition and understanding, Part 2 [DSP Education].
IEEE Signal Process. Mag., 2009
Developments and directions in speech recognition and understanding, Part 1 [DSP Education].
IEEE Signal Process. Mag., 2009
IEEE Trans. Pattern Anal. Mach. Intell., 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Discriminative training of hierarchical acoustic models for large vocabulary continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009
Proceedings of the 27th International Conference on Human Factors in Computing Systems, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
N-gram Weighting: Reducing Training Data Mismatch in Cross-Domain Language Model Estimation.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008
Proceedings of the ACL 2008, 2008
2007
IEEE Trans. Speech Audio Process., 2007
An Implementation of Rational Wavelets and Filter Design for Phonetic Classification.
IEEE Trans. Speech Audio Process., 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Noise Robust Phonetic Classificationwith Linear Regularized Least Squares and Second-Order Features.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Proceedings of the ACL 2007, 2007
2006
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006
A Comparative Study of Methods for Handheld Speaker Verification in Realistic Noisy Conditions.
Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Flexible Multi-Stream Framework for Speech Recognition using Multi-Tape Finite-State Transducers.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the EMNLP 2006, 2006
2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Automatic Processing of Audio Lectures for Information Retrieval: Vocabulary Selection and Language Modeling.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004
A segment-based audio-visual speech recognizer: data collection, development, and initial experiments.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004
A Framework for Developing Conversational User Interfaces.
Proceedings of the Computer-Aided Design of User Interfaces IV, 2004
2003
Comput. Speech Lang., 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Segment-based recognition on the phonebook task: initial results and observations on duration modeling.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
2000
IEEE Trans. Speech Audio Process., 2000
Guest editorial introduction to the special issue on language modeling and dialogue systems.
IEEE Trans. Speech Audio Process., 2000
A flexible, scalable finite-state transducer architecture for corpus-based concatenative speech synthesis.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Data collection and performance evaluation of spoken dialogue systems: the MIT experience.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
Heterogeneous lexical units for automatic speech recognition: preliminary investigations.
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
Evaluation methodology for a telephone-based conversational system.
Proceedings of the First International Conference on Language Resources and Evaluation, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
MUSE: a scripting language for the development of interactive speech analysis and recognition tools.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
1996
Multilingual human-computer interactions: from information access to language learning.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
1995
Speech Commun., 1995
1994
Speech Communication, 1994
Proceedings of the Human Language Technology, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
A comparative study of signal representations and classification techniques for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1993
1992
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
Proceedings of the Second International Conference on Spoken Language Processing, 1992
1991
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications) - RIAO 1991, 3rd International Conference, Universitad Autonoma de Barcelona, Spain, April 2, 1991
Proceedings of the Speech and Natural Language, 1991
Proceedings of the Speech and Natural Language, 1991
The MIT ATIS system; preliminary development, spontaneous speech data collection, and performance evaluation.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991
Automatic learning of lexical representations for sub-word unit based speech recognition systems.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991
Integration of speech recognition and natural language processing in the MIT VOYAGER system.
Proceedings of the 1991 International Conference on Acoustics, 1991
1990
Proceedings of the Advances in Neural Information Processing Systems 3, 1990
Proceedings of the Advances in Neural Information Processing Systems 3, 1990
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990
Proceedings of the First International Conference on Spoken Language Processing, 1990
Detection and classification of phonemes using context-independent error back-propagation.
Proceedings of the First International Conference on Spoken Language Processing, 1990
Proceedings of the 1990 International Conference on Acoustics, 1990
Proceedings of the 1990 International Conference on Acoustics, 1990
1989
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Philadelphia, 1989
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Cape Cod, 1989
Proceedings of the IEEE International Conference on Acoustics, 1989
1988
PhD thesis, 1988
Proceedings of the IEEE International Conference on Acoustics, 1988
1986
Proceedings of the IEEE International Conference on Acoustics, 1986
1985
Proceedings of the IEEE International Conference on Acoustics, 1985