2025

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights.

[DOI]

Aiwei Liu

Haoping Bai

Albin Madappally Jose

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics.

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

TIS-DPO: Token-level Importance Sampling for Direct Preference Optimization With Estimated Weights.

[DOI]

Albin Madappally Jose

CoRR, 2024

Apple Intelligence Foundation Language Models.

[DOI]

Albin Madappally Jose

Hannah Gillis Coleman

CoRR, 2024

Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation.

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Instruction-Following Speech Recognition.

[DOI]

CoRR, 2023

Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness.

[DOI]

CoRR, 2023

2022

The effect of snow damage on self-organization in a primary subtropical evergreen broadleaved forest in Southwest China.

[DOI]

Palingamoorthy Gnanamoorthy

Ecol. Informatics, 2022

Unsupervised Data Selection via Discrete Speech Representation for ASR.

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR.

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving the Fusion of Acoustic and Text Representations in RNN-T.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition.

[DOI]

CoRR, 2021

Exploring Targeted Universal Adversarial Perturbations to End-to-End ASR Models.

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving Streaming Automatic Speech Recognition with Non-Streaming Model Distillation on Unsupervised Data.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Uncertainty Estimation with Infinitesimal Jackknife, Its Distribution and Mean-Field Approximation.

[DOI]

Zhiyun Lu

Eugene Ie

Fei Sha

CoRR, 2020

A Large Scale Speech Sentiment Corpus.

[DOI]

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Speech Sentiment Analysis via Pre-Trained Features from End-to-End ASR Models.

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Kernel Approximation Methods for Speech Recognition.

[DOI]

Avner May

Alireza Bagheri Garakani

J. Mach. Learn. Res., 2019

Hyper-parameter Tuning under a Budget Constraint.

[DOI]

Zhiyun Lu

Chao-Kai Chiang

Fei Sha

CoRR, 2019

Hyper-parameter Tuning under a Budget Constraint.

[DOI]

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

2016

Learning compact recurrent neural networks.

[DOI]

Zhiyun Lu

Vikas Sindhwani

Tara N. Sainath

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A comparison between deep neural nets and kernel acoustic models for speech recognition.

[DOI]

Zhiyun Lu

Dong Guo

Alireza Bagheri Garakani

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2014

How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets.

[DOI]

Zhiyun Lu

Avner May

Kuan Liu

Alireza Bagheri Garakani

CoRR, 2014

2012

Selecting β-Divergence for Nonnegative Matrix Factorization by Score Matching.

[DOI]

Zhiyun Lu

Zhirong Yang

Erkki Oja

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2012, 2012