Shang-Wen Li
Orcid: 0000-0003-0656-9874Affiliations:
- Apple Inc., Cupertino, CA, USA
- Amazon, Seattle, WA, USA (former)
- Massachusetts Institute of Technology, Cambridge, USA (PhD 2017)
According to our database1,
Shang-Wen Li
authored at least 75 papers
between 2010 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Trans. Mach. Learn. Res., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
SpeechDPR: End-To-End Spoken Passage Retrieval For Open-Domain Spoken Question Answering.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond.
CoRR, 2023
CoRR, 2023
CoRR, 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Mode.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Disentangled Training with Adversarial Examples for Robust Small-Footprint Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Findings of the 2023 ML-Superb Challenge: Pre-Training And Evaluation Over More Languages And Beyond.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
IEEE J. Sel. Top. Signal Process., 2022
CoRR, 2022
Superb @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
CoRR, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Zero-shot Generalization in Dialog State Tracking through Generative Question Answering.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
2020
CoRR, 2020
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Style Attuned Pre-Training and Parameter Efficient Fine-Tuning for Spoken Language Understanding.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 3rd Clinical Natural Language Processing Workshop, 2020
2017
PhD thesis, 2017
2016
Proceedings of the Social Computing, 2016
Proceedings of the 16th IEEE International Conference on Advanced Learning Technologies, 2016
2015
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015
Structuring lectures in massive open online courses (MOOCs) for efficient learning by linking similar sections and predicting prerequisites.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 15th IEEE International Conference on Advanced Learning Technologies, 2015
Proceedings of the 15th IEEE International Conference on Advanced Learning Technologies, 2015
2014
Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology, 2014
2013
An Experimental Analysis on Integrating Multi-Stream Spectro-Temporal, Cepstral and Pitch Information for Mandarin Speech Recognition.
IEEE Trans. Speech Audio Process., 2013
2011
Improved Tonal Language Speech Recognition by Integrating Spectro-Temporal Evidence and Pitch Information with Properly Chosen Tonal Acoustic Units.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Multi-stream spectro-temporal and cepstral features based on data-driven hierarchical phoneme clusters.
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Improved phoneme recognition by integrating evidence from spectro-temporal and cepstral features.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010