Tom Ko
Orcid: 0000-0002-5324-8961
According to our database1,
Tom Ko
authored at least 60 papers
between 2008 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent.
CoRR, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Proceedings of the 20th International Conference on Spoken Language Translation, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
CoRR, 2021
Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation.
CoRR, 2021
An Investigation of Positional Encoding in Transformer-based End-to-end Speech Recognition.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
A Meta-Learning Approach for User-Defined Spoken Term Classification with Varying Classes and Examples.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
An Encoder-Decoder Based Audio Captioning System with Transfer and Reinforcement Learning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021
2020
AutoSpeech 2020: The Second Automated Machine Learning Challenge for Speech Classification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
IEEE Trans. Speech Audio Process., 2013
2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
A Fully Automated Derivation of State-Based Eigentriphones for Triphone Modeling with No Tied States Using Regularization.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Automatic estimation of decoding parameters using large-margin iterative linear programming.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
Min-max discriminative training of decoding parameters using iterative linear programming.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008