Riemannian acceleration with preconditioning for symmetric eigenvalue problems.
Numerische Mathematik, February, 2025
CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR.
CoRR, February, 2025
EPIC: A Provable Accelerated Eigensolver Based on Preconditioning and Implicit Convexity.
SIAM J. Matrix Anal. Appl., 2025
Self-Supervised Audio Teacher-Student Transformer for Both Clip-Level and Frame-Level Tasks.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
A preconditioned inverse iteration with an improved convergence guarantee.
CoRR, 2024
A randomized small-block Lanczos method for large-scale null space computations.
CoRR, 2024
RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Fine-Tune the Pretrained ATST Model for Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
Frame-Wise Streaming end-to-end Speaker Diarization with Non-Autoregressive Self-Attention-Based Attractors.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
A locally optimal preconditioned Newton-Schur method for symmetric elliptic eigenvalue problems.
Math. Comput., June, 2023
Convergence Analysis of Newton-Schur Method for Symmetric Elliptic Eigenvalue Problem.
SIAM J. Numer. Anal., February, 2023
Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks.
CoRR, 2023
Mixhead: Breaking the low-rank bottleneck in multi-head attention language models.
Knowl. Based Syst., 2022
RCT: Random consistency training for semi-supervised sound event detection.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Multi-chain Fudan-CCDC model for COVID-19 - a revisit to Singapore's case.
Quant. Biol., 2020