Rongzhi Gu

Orcid: 0000-0003-1861-9170

According to our database1, Rongzhi Gu authored at least 40 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The Sound Demixing Challenge 2023 - Cinematic Demixing Track.
Trans. Int. Soc. Music. Inf. Retr., January, 2024

ReZero: Region-Customizable Sound Extraction.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

MuCodec: Ultra Low-Bitrate Music Codec.
CoRR, 2024

Gull: A Generative Multifunctional Audio Codec.
CoRR, 2024

Fast Random Approximation of Multi-Channel Room Impulse Response.
Proceedings of the IEEE International Conference on Acoustics, 2024

A Unified Geometry-Aware Source Localization and Separation Framework for AD-HOC Microphone Array.
Proceedings of the IEEE International Conference on Acoustics, 2024

Improving Music Source Separation with Simo Stereo Band-Split Rnn.
Proceedings of the IEEE International Conference on Acoustics, 2024

SECap: Speech Emotion Captioning with Large Language Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

The Sound Demixing Challenge 2023 - Cinematic Demixing Track.
CoRR, 2023

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty.
CoRR, 2023

High Fidelity Speech Enhancement with Band-split RNN.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Learnable Sparse Filterbank for Speaker Verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention.
Proceedings of the IEEE International Conference on Acoustics, 2022

Learning Decoupling Features Through Orthogonality Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain.
IEEE Signal Process. Lett., 2021

Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency.
CoRR, 2021

Text Anchor Based Metric Learning for Small-Footprint Keyword Spotting.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Effective Phase Encoding for End-To-End Speaker Verification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

3D Spatial Features for Multi-Channel Target Speech Separation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Multi-Modal Multi-Channel Target Speech Separation.
IEEE J. Sel. Top. Signal Process., 2020

Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation.
CoRR, 2020

Audio-Visual Multi-Channel Recognition of Overlapped Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Speaker Embedding with Long Short Term Centroid Learning for Text-Independent Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Context-adaptive Gaussian Attention for Text-independent Speaker Verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
End-to-End Multi-Channel Speech Separation.
CoRR, 2019

Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Logistic Similarity Metric Learning via Affinity Matrix for Text-Independent Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Speaker-discriminative Embedding Learning via Affinity Matrix for Short Utterance Speaker Verification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Alleviate Cross-chunk Permutation through Chunk-level Speaker Embedding for Blind Speech Separation.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2017
Interaction Data Detection System to Upgrade Brick and Mortar Shops: Metrics Allow Offline Shops to Compete with Online Retailers.
IEEE Consumer Electron. Mag., 2017

Learning a robust DOA estimation model with acoustic vector sensor cues.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017


  Loading...