Jinchuan Tian

Orcid: 0000-0002-2129-471X

According to our database1, Jinchuan Tian authored at least 29 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music.
CoRR, 2024

SpoofCeleb: Speech Deepfake Detection and SASV In The Wild.
CoRR, 2024

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech.
CoRR, 2024

Preference Alignment Improves Language Model-Based TTS.
CoRR, 2024

Text-To-Speech Synthesis In The Wild.
CoRR, 2024

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models.
CoRR, 2024

ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets.
CoRR, 2024

The Interspeech 2024 Challenge on Speech Processing Using Discrete Units.
CoRR, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
CoRR, 2024

UniAudio: Towards Universal Audio Generation with Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Robust Speech Representation Learning for Thousands of Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Integrating Lattice-Free MMI Into End-to-End Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

UniAudio: An Audio Foundation Model Toward Universal Audio Generation.
CoRR, 2023

HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec.
CoRR, 2023

The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Bayes Risk Transducer: Transducer with Controllable Alignment Prediction.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Bayes Risk CTC: Controllable CTC Alignment in Sequence-to-Sequence Tasks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Improving Mandarin End-to-End Speech Recognition With Word N-Gram Language Model.
IEEE Signal Process. Lett., 2022

LAE: Language-Aware Encoder for Monolingual and Multilingual ASR.
CoRR, 2022

Integrate Lattice-Free MMI into End-to-End Speech Recognition.
CoRR, 2022

Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

LAE: Language-Aware Encoder for Monolingual and Multilingual ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Consistent Training and Decoding for End-to-End Speech Recognition Using Lattice-Free MMI.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency.
CoRR, 2021

2020
A Random Gossip BMUF Process for Neural Language Modeling.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020


  Loading...