Shengkui Zhao

According to our database1, Shengkui Zhao authored at least 46 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions.
CoRR, 2024

Towards Audio Codec-based Speech Separation.
CoRR, 2024

Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis.
CoRR, 2024

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2024

SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance.
Proceedings of the IEEE International Conference on Acoustics, 2024

Are Soft Prompts Good Zero-Shot Learners for Speech Recognition?
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention.
CoRR, 2023

ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MossFormer: Pushing the Performance Limit of Monaural Speech Separation Using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions.
Proceedings of the IEEE International Conference on Acoustics, 2023

D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network Using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
FRCRN: Boosting Feature Representation Using Frequency Recurrence for Monaural Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022

End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram.
Proceedings of the IEEE International Conference on Acoustics, 2021

Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Fast Learning for Non-Parallel Many-to-Many Voice Conversion with Residual Star Generative Adversarial Networks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2017
On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A novel sparse model for multi-source localization using distributed microphone array.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation.
EURASIP J. Adv. Signal Process., 2016

Large region acoustic source mapping: A generalized sparse constrained deconvolution approach.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

An expectation-maximization eigenvector clustering approach to direction of arrival estimation of multiple speech sources.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
ITEM: Immersive Telepresence for Entertainment and Meetings - A Practical Approach.
IEEE J. Sel. Top. Signal Process., 2015

Learning to estimate reverberation time in noisy and reverberant rooms.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Large region acoustic source mapping using movable arrays.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A learning-based approach to direction of arrival estimation in noisy and reverberant environments.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Teleimmersive Audio-Visual Communication Using Commodity Hardware [Applications Corner].
IEEE Signal Process. Mag., 2014

Underdetermined direction of arrival estimation using acoustic vector sensor.
Signal Process., 2014

New Variable Step-Sizes Minimizing Mean-Square Deviation for the LMS-Type Algorithms.
Circuits Syst. Signal Process., 2014

ITEM: Immersive Telepresence for Entertainment and Meetings - A Practical Approach.
CoRR, 2014

A new auxiliary-vector algorithm with conjugate orthogonality for speech enhancement.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Robust DOA estimation of multiple speech sources.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Spatialized audio multiparty teleconferencing with commodity miniature microphone array.
Proceedings of the ACM Multimedia Conference, 2013

2012
Adaptive fast finite-time multiple-surface sliding control for a class of uncertain non-linear systems.
Int. J. Model. Identif. Control., 2012

A Fast-Converging Adaptive Frequency-Domain MVDR Beamformer for Speech Enhancement.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Real-time implementation and performance optimization of 3D sound localization on GPUs.
Proceedings of the 2012 Design, Automation & Test in Europe Conference & Exhibition, 2012

2010
Nonlinear image restoration using recurrent radial basis function network.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

2009
Performance analysis and enhancements of adaptive algorithms and their applications
PhD thesis, 2009

Stability and Convergence Analysis of Transform-Domain LMS Adaptive Filters With Second-Order Autoregressive Process.
IEEE Trans. Signal Process., 2009

Variable step-size LMS algorithm with a quotient form.
Signal Process., 2009

A generalized data windowing scheme for adaptive conjugate gradient algorithms.
Signal Process., 2009

2008
Comments on "Adaptive multiple-surface sliding control for non-autonomous systems with mismatched uncertainties".
Autom., 2008

2006
Modified LMS and NLMS Algorithms with a New Variable Step Size.
Proceedings of the Ninth International Conference on Control, 2006

Sliding Mode Control of Fuzzy Dynamic Systems.
Proceedings of the Ninth International Conference on Control, 2006


  Loading...