Zhiyao Duan
Orcid: 0000-0002-8334-9974Affiliations:
- University of Rochester, USA
- Northwestern University, USA (former)
According to our database1,
Zhiyao Duan
authored at least 125 papers
between 2007 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Measure by Measure: Measure-Based Automatic Music Composition with Modern Staff Notation.
Trans. Int. Soc. Music. Inf. Retr., January, 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition.
CoRR, 2024
A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection.
CoRR, 2024
CoRR, 2024
CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection.
CoRR, 2024
CoRR, 2024
Scoring Intervals using Non-Hierarchical Transformer For Automatic Piano Transcription.
CoRR, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Trans. Int. Soc. Music. Inf. Retr., January, 2023
EDMSound: Spectrogram Based Diffusion Models for Efficient and High-Quality Audio Synthesis.
CoRR, 2023
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
IEEE Trans. Multim., 2022
Trans. Int. Soc. Music. Inf. Retr., 2022
ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm.
CoRR, 2022
Predicting Global Head-Related Transfer Functions From Scanned Head Geometry Using Deep Learning and Compact Representations.
CoRR, 2022
A Data-Driven Methodology for Considering Feasibility and Pairwise Likelihood in Deep Learning Based Guitar Tablature Transcription Systems.
CoRR, 2022
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
DyViSE: Dynamic Vision-Guided Speaker Embedding for Audio-Visual Speaker Diarization.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022
Proceedings of the 32nd IEEE International Workshop on Machine Learning for Signal Processing, 2022
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
A Study of The Robustness of Raw Waveform Based Speaker Embeddings Under Mismatched Conditions.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
IEEE Signal Process. Lett., 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
BeatNet: CRNN and Particle Filtering for Online Joint Beat, Downbeat and Meter Tracking.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Don't Look Back: An Online Beat Tracking Method Using RNN and Enhanced Particle Filtering.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Speaker Attractor Network: Generalizing Speech Separation to Unseen Numbers of Sources.
IEEE Signal Process. Lett., 2020
Do not look back: an online beat tracking method using RNN and enhanced particle filtering.
CoRR, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 20th International Conference on New Interfaces for Musical Expression, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the CHIIR '20: Conference on Human Information Interaction and Retrieval, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Creating a Multitrack Classical Music Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications.
IEEE Trans. Multim., 2019
Trans. Int. Soc. Music. Inf. Retr., 2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
IEEE Signal Process. Mag., 2019
IEEE J. Sel. Top. Signal Process., 2019
Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis.
CoRR, 2019
CoRR, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
IEEE Signal Process. Lett., 2018
Speech Commun., 2018
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Skeleton Plays Piano: Online Generation of Pianist Body Movements from MIDI Performance.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018
Joint Speaker Diarization and Recognition Using Convolutional and Recurrent Neural Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Visualization and Interpretation of Siamese Style Convolutional Neural Networks for Sound Search by Vocal Imitation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Unsupervised Learning Approach to Feature Analysis for Automatic Speech Emotion Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Latent Variable Analysis and Signal Separation, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Vocal Imitation Set: a dataset of vocally imitated sound events using the AudioSet ontology.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018
2017
IEEE Signal Process. Lett., 2017
Enhanced multiclass SVM with thresholding fusion for speech-based emotion classification.
Int. J. Speech Technol., 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
See and listen: Score-informed association of sound tracks to players in chamber music performance videos.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Creating A Musical Performance Dataset for Multimodal Music Analysis: Challenges, Insights, and Applications.
CoRR, 2016
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
Proceedings of the 4th Workshop on Sentiment Analysis where AI meets Psychology (SAAIP 2016) co-located with 25th International Joint Conference on Artificial Intelligence (IJCAI 2016), 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Rotational reset strategy for online semi-supervised NMF-based speech enhancement for long recordings.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Proceedings of the 25th IEEE International Workshop on Machine Learning for Signal Processing, 2015
Proceedings of the 25th IEEE International Workshop on Machine Learning for Signal Processing, 2015
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
A novel cepstral representation for timbre modeling of sound sources in polyphonic mixtures.
Proceedings of the IEEE International Conference on Acoustics, 2014
2012
Speech Enhancement by Online Non-negative Spectrogram Decomposition in Non-stationary Noise Environments.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the Latent Variable Analysis and Signal Separation, 2012
2011
IEEE J. Sel. Top. Signal Process., 2011
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions.
IEEE Trans. Speech Audio Process., 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009
2008
Unsupervised Single-Channel Music Source Separation by Average Harmonic Structure Modeling.
IEEE Trans. Speech Audio Process., 2008
Proceedings of the ISMIR 2008, 2008
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
2007
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Proceedings of the 2007 International Computer Music Conference, 2007