Jiqing Han
Orcid: 0000-0002-4297-4300Affiliations:
- Harbin Institute of Technology, School of Computer Science and Technology, China
According to our database1,
Jiqing Han
authored at least 141 papers
between 1997 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Sound Activity-Aware Based Cross-Task Collaborative Training for Semi-Supervised Sound Event Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Capturing High-Level Semantic Correlations via Graph for Multimodal Sentiment Analysis.
IEEE Signal Process. Lett., 2024
Mutual Information-based Representations Disentanglement for Unaligned Multimodal Language Sequences.
CoRR, 2024
Contrastive Loss Based Frame-Wise Feature Disentanglement for Polyphonic Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024
Modeling Quasi-Periodic Dependency via Self-Supervised Pre-Training for Respiratory Sound Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Task-driven common subspace learning based semantic feature extraction for acoustic event recognition.
Expert Syst. Appl., December, 2023
Biomed. Signal Process. Control., 2023
Mutual Information-based Embedding Decoupling for Generalizable Speaker Verification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Personality-aware Training based Speaker Adaptation for End-to-end Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Using Auxiliary Tasks In Multimodal Fusion of Wav2vec 2.0 And Bert for Multimodal Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Time-Weighted Frequency Domain Audio Representation with GMM Estimator for Anomalous Sound Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Sentiment Knowledge Enhanced Self-supervised Learning for Multimodal Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
2022
IEEE Signal Process. Lett., 2022
CoRR, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Exploring attention mechanisms based on summary information for end-to-end automatic speech recognition.
Neurocomputing, 2021
Semantic feature extraction based on subspace learning with temporal constraints for acoustic event recognition.
Digit. Signal Process., 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Model-Agnostic Fast Adaptive Multi-Objective Balancing Algorithm for Multilingual Automatic Speech Recognition Model Training.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Capturing Temporal Dependencies Through Future Prediction for CNN-Based Audio Classifiers.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Nonnegative Matrix Factorization Based Transfer Subspace Learning for Cross-Corpus Speech Emotion Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
A Joint Framework of Denoising Autoencoder and Generative Vocoder for Monaural Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Learning Temporal Relations from Semantic Neighbors for Acoustic Scene Classification.
IEEE Signal Process. Lett., 2020
Circuits Syst. Signal Process., 2020
La Furca: Iterative Context-Aware End-to-End Monaural Speech Separation Based on Dual-Path Deep Parallel Inter-Intra Bi-LSTM with Attention.
CoRR, 2020
FurcaNeXt: End-to-End Monaural Speech Separation with Dynamic Gated Dilated Temporal Convolutional Networks.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
ATReSN-Net: Capturing Attentive Temporal Relations in Semantic Neighborhood for Acoustic Scene Classification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Speech Separation Based on Multi-Stage Elaborated Dual-Path Deep BiLSTM with Auxiliary Identity Loss.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Self-Supervised Adversarial Multi-Task Learning for Vocoder-Based Monaural Speech Enhancement.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
A bilevel framework for joint optimization of session compensation and classification for speaker identification.
Digit. Signal Process., 2019
A Multi-Task Learning Framework for Overcoming the Catastrophic Forgetting in Automatic Speech Recognition.
CoRR, 2019
CoRR, 2019
FurcaNeXt: End-to-end monaural speech separation with dynamic gated dilated temporal convolutional networks.
CoRR, 2019
FurcaNet: An end-to-end deep gated convolutional, long short-term memory, deep neural networks for single channel speech separation.
CoRR, 2019
CoRR, 2019
Abnormal heart sound detection using temporal quasi-periodic features and long short-term memory without segmentation.
Biomed. Signal Process. Control., 2019
IEEE Access, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Deep Attention Gated Dilated Temporal Convolutional Networks with Intra-Parallel Convolutional Modules for End-to-End Monaural Speech Separation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
End-to-End Monaural Speech Separation with Multi-Scale Dynamic Weighted Gated Dilated Convolutional Pyramid Network.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Cross-Corpus Speech Emotion Recognition Using Semi-Supervised Transfer Non-Negative Matrix Factorization with Adaptation Regularization.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Convolutional Grid Long Short-Term Memory Recurrent Neural Network for Automatic Speech Recognition.
Proceedings of the Neural Information Processing - 26th International Conference, 2019
Furcax: End-to-end Monaural Speech Separation Based on Deep Gated (De)convolutional Neural Networks with Adversarial Example Training.
Proceedings of the IEEE International Conference on Acoustics, 2019
Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Efficient general sparse denoising with non-convex sparse constraint and total variation regularization.
Digit. Signal Process., 2018
Investigation of Monaural Front-End Processing for Robust ASR without Retraining or Joint-Training.
CoRR, 2018
Biomed. Signal Process. Control., 2018
Unsupervised Temporal Feature Learning Based on Sparse Coding Embedded BoAW for Acoustic Event Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
A Compact and Discriminative Feature Based on Auditory Summary Statistics for Acoustic Scene Classification.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Deep Neural Network Based Discriminative Training for I-Vector/PLDA Speaker Verification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Expert Syst. Appl., 2017
Heart sound classification based on scaled spectrogram and partial least squares regression.
Biomed. Signal Process. Control., 2017
Speaker Verification via Estimating Total Variability Space Using Probabilistic Partial Least Squares.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Learning Deep Neural Network Based Kernel Functions for Small Sample Size Classification.
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Towards Heart Sound Classification Without Segmentation Using Convolutional Neural Network.
Proceedings of the Computing in Cardiology, 2017
2016
IEEE Trans. Signal Process., 2016
IEEE Signal Process. Lett., 2016
Int. J. Pattern Recognit. Artif. Intell., 2016
Neurocomputing, 2016
Towards heart sound classification without segmentation via autocorrelation feature and diffusion maps.
Future Gener. Comput. Syst., 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Abnormal Heart Sounds detection based on the Scaled Time-Frequency Representation and Feature Selection.
Proceedings of the Computing in Cardiology, CinC 2016, Vancouver, 2016
2015
Inf. Sci., 2015
Digit. Signal Process., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Confidence Measure Based on Context Consistency Using Word Occurrence Probability and Topic Adaptation for Spoken Term Detection.
IEICE Trans. Inf. Syst., 2014
Digit. Signal Process., 2014
Sparse Representation with Optimized Learned Dictionary for Robust Voice Activity Detection.
Circuits Syst. Signal Process., 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Robust minimum statistics project coefficients feature for acoustic environment recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
ACM Trans. Intell. Syst. Technol., 2013
Identification of Objectionable Audio Segments Based on Pseudo and Heterogeneous Mixture Models.
IEEE Trans. Speech Audio Process., 2013
Audio Segment Classification Using Online Learning Based Tensor Representation Feature Discrimination.
IEEE Trans. Speech Audio Process., 2013
Statistical voice activity detection based on sparse representation over learned dictionary.
Digit. Signal Process., 2013
Proceedings of the IJCAI 2013, 2013
Case based reasoning solution to the problem of sustained learning in keyword spotting.
Proceedings of the IEEE International Conference on Acoustics, 2013
Upper and lower bounds for approximation of the Kullback-Leibler divergence between Hidden Markov models.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Int. J. Pattern Recognit. Artif. Intell., 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
IEEE Signal Process. Lett., 2011
J. Signal Inf. Process., 2011
Voice activity detection based on conjugate subspace matching pursuit and likelihood ratio test.
EURASIP J. Audio Speech Music. Process., 2011
Online Learning for Classification of Low-rank Representation Features and Its Applications in Audio Segment Classification
CoRR, 2011
CoRR, 2011
Heterogeneous mixture models using sparse representation features for applause and laugh detection.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011
Real-World Speech/Non-Speech Audio Classification Based on Sparse Representation Features and GPCs.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the Neural Information Processing - 18th International Conference, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Compensation of partly reliable components for band-limited speech recognition with missing data techniques.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Comput. Animat. Virtual Worlds, 2010
Int. J. Pattern Recognit. Artif. Intell., 2010
Compensation of signal with erasures via sparse representation into its significant subspace.
Proceedings of the 10th International Conference on Information Sciences, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Voice Activity Detection Based on Complex Exponential Atomic Decomposition and Likelihood Ratio Test.
Proceedings of the 20th International Conference on Pattern Recognition, 2010
2009
Speaker identification and verification from audio coded speech in matched and mismatched conditions.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2009
Proceedings of the Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2009), 2009
2008
Text-independent Speaker Identification Based on MAP Channel Compensation and Pitch-dependent Features.
Proceedings of the 2008 International Conference on Information & Knowledge Engineering, 2008
2007
Automatic conversion from lexical words to prosodic words for mandarin text-to-speech system.
Int. J. Speech Technol., 2007
2006
J. Comput. Res. Dev., 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
Modifying Spectral Envelope to Synthetically Adjust Voice Quality and Articulation Parameters for Emotional Speech Synthesis.
Proceedings of the Affective Computing and Intelligent Interaction, 2005
2002
Proceedings of the MICAI 2002: Advances in Artificial Intelligence, 2002
2001
Robust Speech Recognition Method Based on Discriminative Environment Feature Extraction.
J. Comput. Sci. Technol., 2001
2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1999
Pattern Recognit., 1999
1998
Discriminative learning of additive noise and channel distortions for robust speech recognition.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
1997
Relative mel-frequency cepstral coefficients compensation for robust telephone speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997