Hongbin Suo

According to our database, Hongbin Suo authored at least 36 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.





The Whu Wake Word Lipreading System for the 2024 Chat-Scenario Chinese Lipreading Challenge.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Outlier-aware Inlier Modeling and Multi-scale Scoring for Anomalous Sound Detection via Multitask Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SEF-Net: Speaker Embedding Free Target Speaker Extraction Network.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Robust Audio Anti-spoofing Countermeasure with Joint Training of Front-end and Back-end Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Task-Agnostic Structured Pruning of Speech Representation Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multi-channel multi-speaker transformer for speech recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Low-complexity Multi-Channel Speaker Extraction with Pure Speech Cues.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Reformulating Speaker Diarization As Community Detection With Emphasis On Topological Structure.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

BeamTransformer: Microphone Array-based Overlapping Speech Detection.
CoRR, 2021

Investigation of Spatial-Acoustic Features for Overlapping Speech Detection in Multiparty Meetings.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Real-Time Speaker Diarization System Based on Spatial Spectrum.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Cam: Context-Aware Masking for Robust Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Phonetically-Aware Coupled Network For Short Duration Text-Independent Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Autoencoder-Based Semi-Supervised Curriculum Learning for Out-of-Domain Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Towards a Fault-Tolerant Speaker Verification System: A Regularization Approach to Reduce the Condition Number.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Maximum A Posteriori Linear Regression for language recognition.
Expert Syst. Appl., 2012

Low-dimensional representation of Gaussian mixture model supervector for language recognition.
EURASIP J. Adv. Signal Process., 2012

Factor analysis of Laplacian approach for speaker recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

Language recognition with language total variability.
Proceedings of the 2011 International Conference on Innovative Computing and Cloud Computing, 2011

Speaker recognition using the resynthesized speech via spectrum modeling.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Using a Kind of Novel Phonotactic Information for SVM Based Speaker Recognition.
IEICE Trans. Inf. Syst., 2009

Approximate Decision Function and Optimization for GMM-UBM Based Speaker Verification.
IEICE Trans. Inf. Syst., 2009

An LVCSR Based Reading Miscue Detection System Using Knowledge of Reference and Error Patterns.
IEICE Trans. Inf. Syst., 2009

Automatic Singing Performance Evaluation for Untrained Singers.
IEICE Trans. Inf. Syst., 2009

WAPS: An Audio Program Surveillance System for Large Scale Web Data Stream.
Proceedings of the Web Information Systems and Mining, International Conference, 2009

A Novel Fuzzy-Based Automatic Speaker Clustering Algorithm.
Proceedings of the Advances in Neural Networks, 2009

Robust Speaker Clustering Using Affinity Propagation.
IEICE Trans. Inf. Syst., 2008

Melody Track Selection Using Discriminative Language Model.
IEICE Trans. Inf. Syst., 2008

Automatic Language Identification with Discriminative Language Characterization Based on SVM.
IEICE Trans. Inf. Syst., 2008

Using SVM as Back-End Classifier for Language Identification.
EURASIP J. Audio Speech Music. Process., 2008

Speaker Recognition using a Kind of Novel Phonotactic Information.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Spoken language identification using score vector modeling and support vector machine.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

The Design of Backend Classifiers in PPRLM System for Language Identification.
Proceedings of the Third International Conference on Natural Computation, 2007
