Yao Qian
Orcid: 0000-0003-1855-9630
According to our database1,
Yao Qian
authored at least 144 papers
between 2001 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
GDNet: a low-light image enhancement network based on Ghost-Block and unique image decomposition.
J. Supercomput., January, 2025
2024
RDGMEF: a multi-exposure image fusion framework based on Retinex decompostion and guided filter.
Neural Comput. Appl., July, 2024
GAN-GA: infrared and visible image fusion generative adversarial network based on global awareness.
Appl. Intell., July, 2024
EgeFusion: Towards Edge Gradient Enhancement in Infrared and Visible Image Fusion With Multi-Scale Transform.
IEEE Trans. Computational Imaging, 2024
Pattern Recognit., 2024
DANT-GAN: A dual attention-based of nested training network for infrared and visible image fusion.
Digit. Signal Process., 2024
CoRR, 2024
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers.
CoRR, 2024
CoRR, 2024
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations.
CoRR, 2024
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction.
CoRR, 2023
CoRR, 2023
CoRR, 2023
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data.
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
DATA2VEC-SG: Improving Self-Supervised Learning Representations for Speech Generation Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
IEEE J. Sel. Top. Signal Process., 2022
Deploying self-supervised learning in the wild for hybrid automatic speech recognition.
CoRR, 2022
A Comprehensive Study on Self-Supervised Distillation for Speaker Representation Learning.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Improving Self-Supervised Learning for Speech Recognition with Intermediate Layer Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2022
Optimizing Alignment of Speech and Language Latent Spaces for End-To-End Speech Recognition and Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2022
Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction.
Proceedings of the IEEE International Conference on Acoustics, 2022
Wav2vec-Switch: Contrastive Learning from Original-Noisy Speech Pairs for Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training.
Proceedings of the IEEE International Conference on Acoustics, 2022
Large-Scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
CoRR, 2021
Automated Scoring of Spontaneous Speech from Young Learners of English Using Transformers.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Automatic Detection of Word-Level Reading Errors in Non-native English Speech Based on ASR Output.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Spoken Language Understanding of Human-Machine Conversations for Language Learning Applications.
J. Signal Process. Syst., 2020
Discriminative Transfer Learning for Optimizing ASR and Semantic Labeling in Task-Oriented Spoken Dialog.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2019
A Bipolar-Input Thermoelectric Energy-Harvesting Interface With Boost/Flyback Hybrid Converter and On-Chip Cold Starter.
IEEE J. Solid State Circuits, 2019
To Trust, or Not to Trust? A Study of Human Bias in Automated Video Interview Assessments.
CoRR, 2019
Scoring Interactional Aspects of Human-Machine Dialog for Language Learning and Assessment using Text Features.
Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019
An 84% Peak Efficiency Bipolar-Input Boost/Flyback Hybrid Converter With MPPT and on-Chip Cold Starter for Thermoelectric Energy Harvesting.
Proceedings of the IEEE International Solid- State Circuits Conference, 2019
Automatic Detection of Off-Topic Spoken Responses Using Very Deep Convolutional Neural Networks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Automated Estimation of Oral Reading Fluency During Summer Camp e-Book Reading with MyTurnToRead.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the Adjunct of the 2019 International Conference on Multimodal Interaction, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Application of an Automatic Plagiarism Detection System in a Large-scale Assessment of English Speaking Proficiency.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019
Using Very Deep Convolutional Neural Networks to Automatically Detect Plagiarized Spoken Responses.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Native Language Identification from Raw Waveforms Using Deep Convolutional Neural Networks with Attentive Pooling.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
An On-Chip Transformer-Based Self-Startup Hybrid SIDITO Converter for Thermoelectric Energy Harvesting.
IEEE Trans. Circuits Syst. II Express Briefs, 2018
Exploring End-To-End Attention-Based Neural Networks For Native Language Identification.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
A Prompt-Aware Neural Network Approach to Content-Based Scoring of Non-Native Spontaneous Speech.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Automatic Turn-Level Language Identification for Code-Switched Spanish-English Dialog.
Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018
From Speech Signals to Semantics - Tagging Performance at Acoustic, Phonetic and Word Levels.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Improvements to an Automated Content Scoring System for Spoken CALL Responses: the ETS Submission to the Second Spoken CALL Shared Task.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
A SIDIDO DC-DC Converter With Dual-Mode and Programmable-Capacitor-Array MPPT Control for Thermoelectric Energy Harvesting.
IEEE Trans. Circuits Syst. II Express Briefs, 2017
Proceedings of the 6th International Workshop on Child Computer Interaction, 2017
Using an Automated Content Scoring Engine for Spoken CALL Responses: The ETS submission for the Spoken CALL Challenge.
Proceedings of the 7th ISCA International Workshop on Speech and Language Technology in Education, 2017
Improving Sub-Phone Modeling for Better Native Language Identification with Non-Native English Speech.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Bidirectional LSTM-RNN for Improving Automated Assessment of Non-Native Children's Speech.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017
Exploring ASR-free end-to-end modeling to improve spoken language understanding in a cloud-based dialog system.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Improving native language (L1) identifation with better VAD and TDNN trained separately on native and non-native English corpora.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
Speech Commun., 2016
Improving DNN-Based Automatic Recognition of Non-native Children Speech with Adult Speech.
Proceedings of the 5th Workshop on Child Computer Interaction, 2016
Learning Distributed Word Representations For Bidirectional LSTM Recurrent Neural Network.
Proceedings of the NAACL HLT 2016, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Noise and Metadata Sensitive Bottleneck Features for Improving Speaker Recognition with Non-Native Speech Input.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
A comparison of ASR and human errors for transcription of non-native spontaneous speech.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers.
Speech Commun., 2015
A Unified Tagging Solution: Bidirectional LSTM Recurrent Neural Network with Word Embedding.
CoRR, 2015
Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network.
CoRR, 2015
An improved DNN-based approach to mispronunciation detection and diagnosis of L2 learners' speech.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015
Sequence generation error (SGE) minimization based deep neural networks training for text-to-speech synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Using bidirectional lstm recurrent neural networks to learn high-level abstractions of sequential features for automated scoring of non-native spontaneous speech.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Proceedings of the 2014 IEEE International Conference on Robotics and Biomimetics, 2014
4.3 An 87%-peak-efficiency DVS-capable single-inductor 4-output DC-DC buck converter with ripple-based adaptive off-time control.
Proceedings of the 2014 IEEE International Conference on Solid-State Circuits Conference, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
A new Neural Network based logistic regression classifier for improving mispronunciation detection of L2 language learners.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
A DNN-based acoustic modeling of tonal language and its application to Mandarin pronunciation training.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
IEEE Trans. Speech Audio Process., 2013
A new preprocessing algorithm and local binary pattern based facial expression recognition.
Proceedings of the 2013 IEEE/SICE International Symposium on System Integration, 2013
A new DNN-based high quality pronunciation evaluation for computer-aided language learning (CALL).
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE 10th International Conference on ASIC, 2013
2012
Proceedings of the Mobile HCI '12, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
A unified trajectory tiling approach to high quality TTS and cross-lingual voice transformation.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
Improved Prosody Generation by Maximizing Joint Probability of State and Longer Units.
IEEE Trans. Speech Audio Process., 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Automatic prosody prediction and detection with Conditional Random Field (CRF) models.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
An HMM Trajectory Tiling (HTT) Approach to High Quality TTS - Microsoft Entry to Blizzard Challenge 2010.
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010
2009
A Cross-Language State Sharing and Mapping Approach to Bilingual (Mandarin-English) TTS.
IEEE Trans. Speech Audio Process., 2009
A Multi-Space Distribution (MSD) and two-stream tone modeling approach to Mandarin speech recognition.
Speech Commun., 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Improved prosody generation by maximizing joint likelihood of state and longer units.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
Tone-enhanced generalized character posterior probability (GCPP) for Cantonese LVCSR.
Comput. Speech Lang., 2008
Modeling and Generating Tone Contour with Phrase Intonation for Mandarin Chinese Speech.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2004
ACM Trans. Asian Lang. Inf. Process., 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
Int. J. Comput. Linguistics Chin. Lang. Process., 2001
Proceedings of the IEEE International Conference on Acoustics, 2001