David F. Harwath
Orcid: 0000-0003-0206-0253Affiliations:
- Massachusetts Institute of Technology, Cambridge, USA (PhD 2018)
According to our database1,
David F. Harwath
authored at least 75 papers
between 2012 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario.
CoRR, 2024
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
CoRR, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
SpeechCLIP+: Self-Supervised Multi-Task Representation Learning for Speech Via Clip and Speech-Image Data.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Integrating Self-Supervised Speech Model with Pseudo Word-Level Targets from Visually-Grounded Speech Model.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
CoRR, 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Mode.
CoRR, 2023
Proceedings of the 11th International IEEE/EMBS Conference on Neural Engineering, 2023
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Style-transfer based Speech and Audio-visual Scene understanding for Robot Action Sequence Acquisition from Videos.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023
A Dataset for Foreground Speech Analysis With Smartwatches In Everyday Home Environments.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
2022
Automated detection of foreground speech with wearable sensing in everyday home environments: A transfer learning approach.
CoRR, 2022
Self-Supervised Representation Learning for Speech Using Visual Grounding and Masked Language Modeling.
CoRR, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Speak: A Toolkit Using Amazon Mechanical Turk to Collect and Validate Speech Audio Recordings.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Int. J. Comput. Vis., 2020
CoRR, 2020
Pair Expansion for Learning Multilingual Semantic Embeddings Using Disjoint Visually-Grounded Speech Audio Datasets.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
2018
Vision as an Interlingua: Learning Multilingual Semantic Embeddings of Untranscribed Speech.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Choosing useful word alternates for automatic speech recognition correction interfaces.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Speech recognition without a lexicon - bridging the gap between graphemic and phonetic systems.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
2013
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Topic identification based extrinsic evaluation of summarization techniques applied to conversational speech.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012