Yuan Gong
Orcid: 0000-0002-4537-0078Affiliations:
- Massachusetts Institute of Technology, Cambridge, MA, USA
According to our database1,
Yuan Gong
authored at least 37 papers
between 2017 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation.
CoRR, 2024
Automatic Prediction of Amyotrophic Lateral Sclerosis Progression using Longitudinal Speech Transformer.
CoRR, 2024
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation.
CoRR, 2024
DASS: Distilled Audio State Space Models are Stronger and More Duration-Scalable Learners.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification.
CoRR, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation.
CoRR, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2020
Dataset, June, 2020
IEEE Signal Process. Lett., 2020
2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Protecting Voice Controlled Systems Using Sound Source Identification Based on Acoustic Cues.
Proceedings of the 27th International Conference on Computer Communication and Networks, 2018
Automatic Autism Spectrum Disorder Detection Using Everyday Vocalizations Captured by Smart Devices.
Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018
Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018
2017
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, CA, USA, October 23, 2017
Proceedings of the 2017 IEEE International Conference on Healthcare Informatics, 2017