2025
A Soft-Contrastive Pseudo Learning Approach Toward Open-World Forged Speech Attribution.
IEEE Trans. Inf. Forensics Secur., 2025
Introducing Euclidean distance optimization into Softmax loss under neural collapse.
Pattern Recognit., 2025
Cross-domain redundancy exploration by a deep encoder-decoder network for speech steganography.
J. Inf. Secur. Appl., 2025
Invertible generative speech hiding with normalizing flow for secure IoT voice.
Internet Things, 2025
A transformer-based deep learning approach for recognition of forgery methods in spoofing speech attribution.
Appl. Soft Comput., 2025
Bayesian Nonparametric Clustering for Source Counting with a Small Aperture Microphone Array.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
2024
An attack-agnostic defense method against adversarial attacks on speaker verification by fusing downsampling and upsampling of speech signals.
Inf. Sci., 2024
Target speaker filtration by mask estimation for source speaker traceability in voice conversion.
Eng. Appl. Artif. Intell., 2024
Noise-robust voice conversion using adversarial training with multi-feature decoupling.
Eng. Appl. Artif. Intell., 2024
Scale-aware dual-branch complex convolutional recurrent network for monaural speech enhancement.
Comput. Speech Lang., 2024
Multi-Speaker Localization in the Circular Harmonic Domain on Small Aperture Microphone Arrays Using Deep Convolutional Networks.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Alternating Direction Method of Multipliers for Convolutive Non-Negative Matrix Factorization.
IEEE Trans. Cybern., December, 2023
Self-distillation object segmentation via pyramid knowledge representation and transfer.
Multim. Syst., October, 2023
A unified speech enhancement approach to mitigate both background noises and adversarial perturbations.
Inf. Fusion, July, 2023
2022
Spatial-temporal slowfast graph convolutional network for skeleton-based action recognition.
IET Comput. Vis., 2022
Waveform level adversarial example generation for joint attacks against both automatic speaker verification and spoofing countermeasures.
Eng. Appl. Artif. Intell., 2022
Fun2Vec: a Contrastive Learning Framework of Function-level Representation for Binary.
CoRR, 2022
A New Adjacency Matrix Configuration in GCN-based Models for Skeleton-based Action Recognition.
CoRR, 2022
Perception-guided generative adversarial network for end-to-end speech enhancement.
Appl. Soft Comput., 2022
A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
A Novel Method to Evaluate the Privacy Protection in Speaker Anonymization.
Proceedings of the Artificial Intelligence and Security - 8th International Conference, 2022
2021
When Automatic Voice Disguise Meets Automatic Speaker Verification.
IEEE Trans. Inf. Forensics Secur., 2021
A Survey on the Development of Self-Organizing Maps for Unsupervised Intrusion Detection.
Mob. Networks Appl., 2021
Entropy-Defined Direct Batch Growing Hierarchical Self-Organizing Mapping for Efficient Network Anomaly Detection.
IEEE Access, 2021
2020
On the Complementary Role of DNN Multi-Level Enhancement for Noisy Robust Speaker Recognition in an I-Vector Framework.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2020
Direct Batch Growth Hierarchical Self-Organizing Mapping Based on Statistics for Efficient Network Intrusion Detection.
IEEE Access, 2020
Joint Decision of Anti-Spoofing and Automatic Speaker Verification by Multi-Task Learning With Contrastive Loss.
IEEE Access, 2020
2019
Detection of People With Camouflage Pattern Via Dense Deconvolution Network.
IEEE Signal Process. Lett., 2019
Spectra Restoration of Bone-Conducted Speech via Attention-Based Contextual Information and Spectro-Temporal Structure Constraint.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2019
Multi-Feature Fusion Network for Salient Region Detection.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2019
Parallel Feature Network For Saliency Detection.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2019
Noise Robust Speaker Recognition Based on Adaptive Frame Weighting in GMM for i-Vector Extraction.
IEEE Access, 2019
Statistics-Enhanced Direct Batch Growth Self-Organizing Mapping for Efficient DoS Attack Detection.
IEEE Access, 2019
A BLSTM and WaveNet-Based Voice Conversion Method With Waveform Collapse Suppression by Post-Processing.
IEEE Access, 2019
Robust Hierarchical Learning for Non-Negative Matrix Factorization With Outliers.
IEEE Access, 2019
The Influence of Clipping on the Performance of a Low Bit Rate Parametric Speech Coder.
Proceedings of the 12th International Congress on Image and Signal Processing, 2019
Improving the Spectra Recovering of Bone-Conducted Speech via Structural SIMilarity Loss Function.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Voice Conversion by Dual-Domain Bidirectional Long Short-Term Memory Networks with Temporal Attention.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Deep Neural Network Based Monaural Speech Enhancement with Low-Rank Analysis and Speech Present Probability.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2018
A Spectra-Based Equalization-Generation Combined Framework for Throat Microphone Speech Enhancement.
IEEE Access, 2018
Experimental study on speech enhancement using DNN with perceptual weighting.
Proceedings of the 4th International Conference on Communication and Information Processing, 2018
2017
A Video Salient Region Detection Framework Using Spatiotemporal Consistency Optimization.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017
Semi-Supervised Speech Enhancement Combining Nonnegative Matrix Factorization and Robust Principal Component Analysis.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017
An Improved Supervised Speech Separation Method Based on Perceptual Weighted Deep Recurrent Neural Networks.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017
Joint Optimization of Perceptual Gain Function and Deep Neural Networks for Single-Channel Speech Enhancement.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017
Off-Grid Frequency Estimation with Random Measurements.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017
2016
Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Unsupervised Learning of Continuous Density HMM for Variable-Length Spoken Unit Discovery.
IEICE Trans. Inf. Syst., 2016
Automatic Model Order Selection for Convolutive Non-Negative Matrix Factorization.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016
Online Convolutive Non-Negative Bases Learning for Speech Enhancement.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016
Improved Semi-Supervised NMF Based Real-Time Capable Speech Enhancement.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016
A Perceptually Motivated Approach for Speech Enhancement Based on Deep Neural Network.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016
Speech Enhancement Using Non-negative Low-Rank Modeling with Temporal Continuity and Sparseness Constraints.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016
Joint Optimization of a Perceptual Modified Wiener Filtering Mask and Deep Neural Networks for Monaural Speech Separation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016
Mask estimate through Itakura-Saito nonnegative RPCA for speech enhancement.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Perceptual improvement of deep neural networks for monaural speech enhancement.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Joint optimization of audible noise suppression and deep neural networks for single-channel speech enhancement.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016
Adaptive extraction of repeating non-negative temporal patterns for single-channel speech enhancement.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Speech Enhancement Under Low SNR Conditions Via Noise Estimation Using Sparse and Low-Rank NMF with Kullback-Leibler Divergence.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
A stable approach for model order selection in nonnegative matrix factorization.
Pattern Recognit. Lett., 2015
Speech Enhancement Combining NMF Weighted by Speech Presence Probability and Statistical Model.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2015
Cramer-Rao Bounds for Compressive Frequency Estimation.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2015
Speech enhancement based on robust NMF solved by alternating direction method of multipliers.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015
Supervised Multi-scale Locality Sensitive Hashing.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015
SegBOMP: An efficient algorithm for block non-sparse signal recovery.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015
Unsupervised monaural speech enhancement using robust NMF with low-rank and sparse constraints.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015
2013
Joint training of non-negative Tucker decomposition and discrete density hidden Markov models.
Comput. Speech Lang., 2013
2012
Large Scale Graph Regularized Non-Negative Matrix Factorization With ℓ<sub>1</sub> Normalization Based on Kullback-Leibler Divergence.
IEEE Trans. Signal Process., 2012
Tri-factorization learning of sub-word units with application to vocabulary acquisition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
A two-layer non-negative matrix factorization model for vocabulary discovery.
Proceedings of the 2011 Symposium on Machine Learning in Speech and Language Processing, 2011
Image pattern discovery by using the spatial closeness of visual code words.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Unsupervised vocabulary discovery using non-negative matrix factorization with graph regularization.
Proceedings of the IEEE International Conference on Acoustics, 2011