Jibin Yang

IEEE Trans. Inf. Forensics Secur., 2025

Introducing Euclidean distance optimization into Softmax loss under neural collapse.

[DOI]

Pattern Recognit., 2025

Cross-domain redundancy exploration by a deep encoder-decoder network for speech steganography.

[DOI]

J. Inf. Secur. Appl., 2025

Invertible generative speech hiding with normalizing flow for secure IoT voice.

[DOI]

Internet Things, 2025

A transformer-based deep learning approach for recognition of forgery methods in spoofing speech attribution.

[DOI]

Appl. Soft Comput., 2025

Bayesian Nonparametric Clustering for Source Counting with a Small Aperture Microphone Array.

[DOI]

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

An attack-agnostic defense method against adversarial attacks on speaker verification by fusing downsampling and upsampling of speech signals.

[DOI]

Inf. Sci., 2024

Target speaker filtration by mask estimation for source speaker traceability in voice conversion.

[DOI]

Eng. Appl. Artif. Intell., 2024

Noise-robust voice conversion using adversarial training with multi-feature decoupling.

[DOI]

Eng. Appl. Artif. Intell., 2024

Scale-aware dual-branch complex convolutional recurrent network for monaural speech enhancement.

[DOI]

Comput. Speech Lang., 2024

Multi-Speaker Localization in the Circular Harmonic Domain on Small Aperture Microphone Arrays Using Deep Convolutional Networks.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Alternating Direction Method of Multipliers for Convolutive Non-Negative Matrix Factorization.

[DOI]

IEEE Trans. Cybern., December, 2023

Self-distillation object segmentation via pyramid knowledge representation and transfer.

[DOI]

Multim. Syst., October, 2023

A unified speech enhancement approach to mitigate both background noises and adversarial perturbations.

[DOI]

Yihao Li

Inf. Fusion, July, 2023

2022

Spatial-temporal slowfast graph convolutional network for skeleton-based action recognition.

[DOI]

IET Comput. Vis., 2022

Waveform level adversarial example generation for joint attacks against both automatic speaker verification and spoofing countermeasures.

[DOI]

Eng. Appl. Artif. Intell., 2022

Fun2Vec: a Contrastive Learning Framework of Function-level Representation for Binary.

[DOI]

CoRR, 2022

A New Adjacency Matrix Configuration in GCN-based Models for Skeleton-based Action Recognition.

[DOI]

CoRR, 2022

Perception-guided generative adversarial network for end-to-end speech enhancement.

[DOI]

Yihao Li

Appl. Soft Comput., 2022

A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing.

[DOI]

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

A Novel Method to Evaluate the Privacy Protection in Speaker Anonymization.

[DOI]

Proceedings of the Artificial Intelligence and Security - 8th International Conference, 2022

2021

When Automatic Voice Disguise Meets Automatic Speaker Verification.

[DOI]

IEEE Trans. Inf. Forensics Secur., 2021

A Survey on the Development of Self-Organizing Maps for Unsupervised Intrusion Detection.

[DOI]

Mob. Networks Appl., 2021

Entropy-Defined Direct Batch Growing Hierarchical Self-Organizing Mapping for Efficient Network Anomaly Detection.

[DOI]

IEEE Access, 2021

2020

On the Complementary Role of DNN Multi-Level Enhancement for Noisy Robust Speaker Recognition in an I-Vector Framework.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2020

Direct Batch Growth Hierarchical Self-Organizing Mapping Based on Statistics for Efficient Network Intrusion Detection.

[DOI]

IEEE Access, 2020

Joint Decision of Anti-Spoofing and Automatic Speaker Verification by Multi-Task Learning With Contrastive Loss.

[DOI]

IEEE Access, 2020

2019

Detection of People With Camouflage Pattern Via Dense Deconvolution Network.

[DOI]

IEEE Signal Process. Lett., 2019

Spectra Restoration of Bone-Conducted Speech via Attention-Based Contextual Information and Spectro-Temporal Structure Constraint.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2019

Multi-Feature Fusion Network for Salient Region Detection.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2019

Parallel Feature Network For Saliency Detection.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2019

Noise Robust Speaker Recognition Based on Adaptive Frame Weighting in GMM for i-Vector Extraction.

[DOI]

IEEE Access, 2019

Statistics-Enhanced Direct Batch Growth Self-Organizing Mapping for Efficient DoS Attack Detection.

[DOI]

IEEE Access, 2019

A BLSTM and WaveNet-Based Voice Conversion Method With Waveform Collapse Suppression by Post-Processing.

[DOI]

IEEE Access, 2019

Robust Hierarchical Learning for Non-Negative Matrix Factorization With Outliers.

[DOI]

IEEE Access, 2019

The Influence of Clipping on the Performance of a Low Bit Rate Parametric Speech Coder.

[DOI]

Proceedings of the 12th International Congress on Image and Signal Processing, 2019

Improving the Spectra Recovering of Bone-Conducted Speech via Structural SIMilarity Loss Function.

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Voice Conversion by Dual-Domain Bidirectional Long Short-Term Memory Networks with Temporal Attention.

[DOI]

Xiaokong Miao

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Multi-task learning of deep neural networks for joint automatic speaker verification and spoofing detection.

[DOI]

Jiakang Li

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

Deep Neural Network Based Monaural Speech Enhancement with Low-Rank Analysis and Speech Present Probability.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2018

A Spectra-Based Equalization-Generation Combined Framework for Throat Microphone Speech Enhancement.

[DOI]

IEEE Access, 2018

Experimental study on speech enhancement using DNN with perceptual weighting.

[DOI]

Proceedings of the 4th International Conference on Communication and Information Processing, 2018

2017

A Video Salient Region Detection Framework Using Spatiotemporal Consistency Optimization.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Semi-Supervised Speech Enhancement Combining Nonnegative Matrix Factorization and Robust Principal Component Analysis.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

An Improved Supervised Speech Separation Method Based on Perceptual Weighted Deep Recurrent Neural Networks.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Joint Optimization of Perceptual Gain Function and Deep Neural Networks for Single-Channel Speech Enhancement.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Off-Grid Frequency Estimation with Random Measurements.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

2016

Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Unsupervised Learning of Continuous Density HMM for Variable-Length Spoken Unit Discovery.

[DOI]

IEICE Trans. Inf. Syst., 2016

Automatic Model Order Selection for Convolutive Non-Negative Matrix Factorization.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016

Online Convolutive Non-Negative Bases Learning for Speech Enhancement.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016

Improved Semi-Supervised NMF Based Real-Time Capable Speech Enhancement.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016

A Perceptually Motivated Approach for Speech Enhancement Based on Deep Neural Network.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016

Speech Enhancement Using Non-negative Low-Rank Modeling with Temporal Continuity and Sparseness Constraints.

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Joint Optimization of a Perceptual Modified Wiener Filtering Mask and Deep Neural Networks for Monaural Speech Separation.

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Mask estimate through Itakura-Saito nonnegative RPCA for speech enhancement.

[DOI]

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Perceptual improvement of deep neural networks for monaural speech enhancement.

[DOI]

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Joint optimization of audible noise suppression and deep neural networks for single-channel speech enhancement.

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Adaptive extraction of repeating non-negative temporal patterns for single-channel speech enhancement.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Speech Enhancement Under Low SNR Conditions Via Noise Estimation Using Sparse and Low-Rank NMF with Kullback-Leibler Divergence.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2015

A stable approach for model order selection in nonnegative matrix factorization.

[DOI]

Pattern Recognit. Lett., 2015

Speech Enhancement Combining NMF Weighted by Speech Presence Probability and Statistical Model.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2015

Cramer-Rao Bounds for Compressive Frequency Estimation.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2015

Speech enhancement based on robust NMF solved by alternating direction method of multipliers.

[DOI]

Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Supervised Multi-scale Locality Sensitive Hashing.

[DOI]

Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

SegBOMP: An efficient algorithm for block non-sparse signal recovery.

[DOI]

Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Unsupervised monaural speech enhancement using robust NMF with low-rank and sparse constraints.

[DOI]

Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

2013

Joint training of non-negative Tucker decomposition and discrete density hidden Markov models.

[DOI]

Comput. Speech Lang., 2013

2012

Large Scale Graph Regularized Non-Negative Matrix Factorization With ℓ<sub>1</sub> Normalization Based on Kullback-Leibler Divergence.

[DOI]

IEEE Trans. Signal Process., 2012

Tri-factorization learning of sub-word units with application to vocabulary acquisition.

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

A two-layer non-negative matrix factorization model for vocabulary discovery.

[DOI]

Proceedings of the 2011 Symposium on Machine Learning in Speech and Language Processing, 2011

Image pattern discovery by using the spatial closeness of visual code words.

[DOI]

Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Unsupervised vocabulary discovery using non-negative matrix factorization with graph regularization.

[DOI]