TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights.
CoRR, 2024
Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Instruction-Following Speech Recognition.
CoRR, 2023
Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness.
CoRR, 2023
The effect of snow damage on self-organization in a primary subtropical evergreen broadleaved forest in Southwest China.
Ecol. Informatics, 2022
Unsupervised Data Selection via Discrete Speech Representation for ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Improving the Fusion of Acoustic and Text Representations in RNN-T.
Proceedings of the IEEE International Conference on Acoustics, 2022
Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition.
CoRR, 2021
Exploring Targeted Universal Adversarial Perturbations to End-to-End ASR Models.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Improving Streaming Automatic Speech Recognition with Non-Streaming Model Distillation on Unsupervised Data.
Proceedings of the IEEE International Conference on Acoustics, 2021
Uncertainty Estimation with Infinitesimal Jackknife, Its Distribution and Mean-Field Approximation.
CoRR, 2020
A Large Scale Speech Sentiment Corpus.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Speech Sentiment Analysis via Pre-Trained Features from End-to-End ASR Models.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Kernel Approximation Methods for Speech Recognition.
J. Mach. Learn. Res., 2019
Hyper-parameter Tuning under a Budget Constraint.
CoRR, 2019
Hyper-parameter Tuning under a Budget Constraint.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Learning compact recurrent neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
A comparison between deep neural nets and kernel acoustic models for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets.
CoRR, 2014
Selecting β-Divergence for Nonnegative Matrix Factorization by Score Matching.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2012, 2012