Zhizheng Wu
ORCID: 0009-0001-1192-9857
Affiliations:
- Chinese University of Hong Kong-Shenzhen (CUHK-SZ), Shenzhen, China
- Meta (former)
- JD.com (former)
- Apple (former)
- University of Edinburgh, UK (former)
- Microsoft Research Asia (former)
- Nanyang Technological University, Singapore (Ph.D., 2015)
According to our database, Zhizheng Wu authored at least 108 papers between 2008 and 2024.
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities.
CoRR, 2024
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation.
CoRR, 2024
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation.
CoRR, 2024
CoRR, 2024
CoRR, 2024
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder.
CoRR, 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
CoRR, 2024
CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing.
CoRR, 2024
Comput. Graph., 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
An Initial Investigation of Neural Replay Simulator for Over-The-Air Adversarial Perturbations to Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
IEEE Signal Process. Lett., 2023
IEEE Signal Process. Lett., 2023
Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion.
CoRR, 2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
2021
Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE J. Sel. Top. Signal Process., 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Improving Trajectory Modelling for DNN-Based Speech Synthesis by Using Stacked Bottleneck Features and Minimum Generation Error Training.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
On the study of replay and voice conversion attacks to text-dependent speaker verification.
Multim. Tools Appl., 2016
Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Trajectory Error Training.
CoRR, 2016
Spoofing detection under noisy conditions: a preliminary investigation and an initial database.
CoRR, 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Waveform Generation Based on Signal Reshaping for Statistical Parametric Speech Synthesis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the Blizzard Challenge 2016, Cupertino, CA, USA, September 16, 2016, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
On the use of I-vectors and average voice model for voice conversion without parallel data.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Predicting articulatory movement from text using deep architecture with stacked bottleneck features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
2015
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015
IEEE Trans. Inf. Forensics Secur., 2015
Speech Commun., 2015
Multim. Tools Appl., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Automatic speaker verification spoofing and countermeasures (ASVspoof 2015): introductory talk by the organizers.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Deep neural network context embeddings for model selection in rich-context HMM synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Deep neural networks employing Multi-Task Learning and stacked bottleneck features for speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Proceedings of the Handbook of Biometric Anti-Spoofing, 2014
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A comparative study of spectral transformation techniques for singing voice synthesis.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
2012
Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion.
IEEE Signal Process. Lett., 2012
Detecting Converted Speech and Natural Speech for anti-Spoofing Attack in Speaker Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
Improved Prosody Generation by Maximizing Joint Probability of State and Longer Units.
IEEE Trans. Speech Audio Process., 2011
2010
Automatic prosody prediction and detection with Conditional Random Field (CRF) models.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Improved prosody generation by maximizing joint likelihood of state and longer units.
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
Modeling and Generating Tone Contour with Phrase Intonation for Mandarin Chinese Speech.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008