Jianwu Dang
ORCID: 0000-0002-9237-4821
Affiliations:
- Tianjin University, Tianjin Key Laboratory of Cognitive Computing and Application, College of Intelligence and Computing, China
- Institut de la Communication Parlée (ICP), French National Centre for Scientific Research (CNRS), France (2002-2003)
- Japan Advanced Institute of Science and Technology, JAIST, Japan
- Shizuoka University, Japan (PhD 1992)
According to our database, Jianwu Dang authored at least 327 papers between 1994 and 2025.
Bibliography
2025
DSTCNet: Deep Spectro-Temporal-Channel Attention Network for Speech Emotion Recognition.
IEEE Trans. Neural Networks Learn. Syst., January, 2025
HC-APNet: Harmonic Compensation Auditory Perception Network for low-complexity speech enhancement.
Speech Commun., 2025
Expert Syst. Appl., 2025
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
ZMM-TTS: Zero-Shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-Supervised Discrete Speech Representations.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Adversarial Domain Generalized Transformer for Cross-Corpus Speech Emotion Recognition.
IEEE Trans. Affect. Comput., 2024
Robust voice activity detection using an auditory-inspired masked modulation encoder based convolutional attention network.
Speech Commun., 2024
Knowl. Based Syst., 2024
Enriching Multimodal Sentiment Analysis through Textual Emotional Descriptions of Visual-Audio Content.
CoRR, 2024
Progressive Residual Extraction based Pre-training for Speech Representation Learning.
CoRR, 2024
VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing.
CoRR, 2024
Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition.
CoRR, 2024
AIMDiT: Modality Augmentation and Interaction via Multimodal Dimension Transformation for Emotion Recognition in Conversations.
CoRR, 2024
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios.
CoRR, 2024
Proceedings of the Natural Language Processing and Chinese Computing, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Speech Commun., November, 2023
Appl. Intell., November, 2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
CFDRN: A Cognition-Inspired Feature Decomposition and Recombination Network for Dysarthric Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
IEEE Signal Process. Lett., 2023
CoRR, 2023
SDNet: Stream-attention and Dual-feature Learning Network for Ad-hoc Array Speech Separation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Discrimination of the Different Intents Carried by the Same Text Through Integrating Multimodal Information.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Improving Zero-shot Cross-domain Slot Filling via Transformer-based Slot Semantics Fusion.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Local and Global Context Modeling with Relation Matching Task for Dialog Act Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2023
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Multi-Modal Sarcasm Detection Based on Cross-Modal Composition of Inscribed Entity Relations.
Proceedings of the 35th IEEE International Conference on Tools with Artificial Intelligence, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Time-Domain Speech Enhancement Assisted by Multi-Resolution Frequency Encoder and Decoder.
Proceedings of the IEEE International Conference on Acoustics, 2023
Speech and Noise Dual-Stream Spectrogram Refine Network With Speech Distortion Loss For Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Leveraging Positional-Related Local-Global Dependency for Synthetic Speech Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023
VF-Taco2: Towards Fast and Lightweight Synthesis for Autoregressive Models with Variation Autoencoder and Feature Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2023
Brain Network Features Differentiate Intentions from Different Emotional Expressions of the Same Text.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Augmenting Affective Dependency Graph via Iterative Incongruity Graph Learning for Sarcasm Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
IEICE Trans. Inf. Syst., November, 2022
Constructing Accurate and Efficient Deep Spiking Neural Networks With Double-Threshold and Augmented Schemes.
IEEE Trans. Neural Networks Learn. Syst., 2022
Toward Efficient Processing and Learning With Spikes: New Approaches for Multispike Learning.
IEEE Trans. Cybern., 2022
Speech Commun., 2022
Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition.
Speech Commun., 2022
Emotion Recognition With Multimodal Transformer Fusion Framework Based on Acoustic and Lexical Information.
IEEE Multim., 2022
Context- and Knowledge-Aware Graph Convolutional Network for Multimodal Emotion Recognition.
IEEE Multim., 2022
Detection of Brain Network Communities During Natural Speech Comprehension From Functionally Aligned EEG Sources.
Frontiers Comput. Neurosci., 2022
Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling.
EURASIP J. Audio Speech Music. Process., 2022
MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation.
CoRR, 2022
Heterogeneous Graph Neural Networks using Self-supervised Reciprocally Contrastive Learning.
CoRR, 2022
IEEE Access, 2022
Proceedings of the DDAM@MM 2022: Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia, 2022
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Adaptive Attention Network with Domain Adversarial Training for Multi-Accent Speech Recognition.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
MIMO-DoAnet: Multi-channel Input and Multiple Outputs DoA Network with Unknown Number of Sound Sources.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
TopicKS: Topic-driven Knowledge Selection for Knowledge-grounded Dialogue Generation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Monaural Speech Enhancement Based on Spectrogram Decomposition for Convolutional Neural Network-sensitive Feature Extraction.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Finer-grained Modeling units-based Meta-Learning for Low-resource Tibetan Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Global Signal-to-noise Ratio Estimation Based on Multi-subband Processing Using Convolutional Neural Network.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Improve emotional speech synthesis quality by learning explicit and implicit representations with semi-supervised training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the International Joint Conference on Neural Networks, 2022
An Improved Stimulus Reconstruction Method for EEG-Based Short-Time Auditory Attention Detection.
Proceedings of the Neural Information Processing - 29th International Conference, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Multi-Stage Graph Representation Learning for Dialogue-Level Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
Cache: Modeling Contribution-Aware Context Hierarchically for Long-Range Dialogue State Tracking.
Proceedings of the IEEE International Conference on Acoustics, 2022
Compressing Transformer-Based ASR Model by Task-Driven Loss and Attention-Based Multi-Level Feature Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2022
Using Multiple Reference Audios and Style Embedding Constraints for Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network.
Proceedings of the 30th European Signal Processing Conference, 2022
2021
Robust Environmental Sound Recognition With Sparse Key-Point Encoding and Efficient Multispike Learning.
IEEE Trans. Neural Networks Learn. Syst., 2021
IEEE Trans. Knowl. Data Eng., 2021
A Tibetan Language Model That Considers the Relationship Between Suffixes and Functional Words.
IEEE Signal Process. Lett., 2021
Neural Networks, 2021
Multi-resolution modulation-filtered cochleagram feature for LSTM-based dimensional emotion recognition from speech.
Neural Networks, 2021
Replay attack detection using variable-frequency resolution phase and magnitude features.
Comput. Speech Lang., 2021
Exploiting Explicit and Inferred Implicit Personas for Multi-turn Dialogue Generation.
Proceedings of the Natural Language Processing and Chinese Computing, 2021
A Sentiment Similarity-Oriented Attention Model with Multi-task Learning for Text-Based Emotion Recognition.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021
Dialogue Act Recognition using Branch Architecture with Attention Mechanism for Imbalanced Data.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
An Eye-tracking Study of Transposed-letter Effect in English Word Recognition by Mandarin Speakers.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Frequency-specific Brain Network Dynamics during Perceiving Real Words and Pseudowords.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Joint Feature Enhancement and Speaker Recognition with Multi-Objective Task-Oriented Network.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Domain-Specific Multi-Agent Dialog Policy Learning in Multi-Domain Task-Oriented Scenarios.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Time-Frequency Representation Learning with Graph Convolutional Network for Dialogue-Level Speech Emotion Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
TacoLPCNet: Fast and Stable TTS by Conditioning LPCNet on Mel Spectrogram Predictions.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Metric Learning Based Feature Representation with Gated Fusion Model for Speech Emotion Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
Proceedings of the Neural Information Processing - 28th International Conference, 2021
Proceedings of the Neural Information Processing - 28th International Conference, 2021
Exploring Effective Speech Representation via ASR for High-Quality End-to-End Multispeaker TTS.
Proceedings of the Neural Information Processing - 28th International Conference, 2021
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021
CONSK-GCN: Conversational Semantic- and Knowledge-Oriented Graph Convolutional Network for Multimodal Emotion Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Multimodal Emotion Recognition with Capsule Graph Convolutional Based Representation Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2021
Robust Voice Activity Detection Using a Masked Auditory Encoder Based Convolutional Neural Network.
Proceedings of the IEEE International Conference on Acoustics, 2021
Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
Improving Naturalness and Controllability of Sequence-to-Sequence Speech Synthesis by Learning Local Prosody Representations.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Domain-Adversarial Autoencoder with Attention Based Feature Level Fusion for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Learning Language and Speaker Information for Code-Switch Speech Synthesis with Limited Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Study on Simultaneous Estimation of Glottal Source and Vocal Tract Parameters by ARMAX-LF Model for Speech Analysis/Synthesis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Hierarchical Prosody Analysis Improves Categorical and Dimensional Emotion Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Zero-shot Domain Adaptation with Inference Relation Paths for Spoken Language Understanding.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Towards Efficient Processing and Learning with Spikes: New Approaches for Multi-Spike Learning.
CoRR, 2020
Speech Emotion Recognition Using 3D Convolutions and Attention-Based Sliding Recurrent Networks With Auditory Front-Ends.
IEEE Access, 2020
Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Temporal Attention Convolutional Network for Speech Emotion Recognition with Latent Representation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Staged Knowledge Distillation for End-to-End Dysarthric Speech Recognition and Speech Attribute Transcription.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Segment-Level Effects of Gender, Nationality and Emotion Information on Text-Independent Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the Neural Information Processing - 27th International Conference, 2020
Hierarchical Interactive Matching Network for Multi-turn Response Selection in Retrieval-Based Chatbots.
Proceedings of the Neural Information Processing - 27th International Conference, 2020
Adversarial Shared-Private Attention Network for Joint Slot Filling and Intent Detection.
Proceedings of the Neural Information Processing - 27th International Conference, 2020
Investigation of Effectively Synthesizing Code-Switched Speech Using Highly Imbalanced Mix-Lingual Data.
Proceedings of the Neural Information Processing - 27th International Conference, 2020
Integrating Group Homophily and Individual Personality of Topics Can Better Model Network Communities.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020
Proceedings of the ICCAI '20: 2020 6th International Conference on Computing and Artificial Intelligence, 2020
A Hierarchical Model for Dialog Act Recognition Considering Acoustic and Lexical Context Information.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Spectrograms Fusion with Minimum Difference Masks Estimation for Monaural Speech Dereverberation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Temporal-Spatial-Spectral Investigation of Brain Network Dynamics in Human Speech Perception.
Proceedings of the Brain Informatics - 13th International Conference, 2020
A Multi-subject Temporal-spatial Hyper-alignment Method for EEG-based Neural Entrainment to Speech.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Topic Enhanced Sentiment Spreading Model in Social Networks Considering User Interest.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
Working Memory-Driven Neural Networks with a Novel Knowledge Enhancement Paradigm for Implicit Discourse Relation Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Int. J. Mach. Learn. Cybern., 2019
Story co-segmentation of Chinese broadcast news using weakly-supervised semantic similarity.
Neurocomputing, 2019
IEICE Trans. Inf. Syst., 2019
Combination of links and node contents for community discovery using a graph regularization approach.
Future Gener. Comput. Syst., 2019
EURASIP J. Audio Speech Music. Process., 2019
Robust Environmental Sound Recognition with Sparse Key-point Encoding and Efficient Multi-spike Learning.
CoRR, 2019
Exploration of Complementary Features for Speech Emotion Recognition Based on Kernel Extreme Learning Machine.
IEEE Access, 2019
Implicit Discourse Relation Recognition via a BiLSTM-CNN Architecture With Dynamic Chunk-Based Max Pooling.
IEEE Access, 2019
An integrated system for robust gender classification with convolutional restricted Boltzmann machine and spiking neural network.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019
CNN-BLSTM Based Question Detection from Dialogs Considering Phase and Context Information.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Environment-Dependent Attention-Driven Recurrent Convolutional Neural Network for Robust Speech Enhancement.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Acoustic and Articulatory Study of Ewe Vowels: A Comparative Study of Male and Female.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
A Spiking Neural Network with Distributed Keypoint Encoding for Robust Sound Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2019
Time-Frequency Deep Representation Learning for Speech Emotion Recognition Integrating Self-attention.
Proceedings of the Neural Information Processing - 26th International Conference, 2019
A Fast Convolutional Self-attention Based Speech Dereverberation Method for Robust Speech Recognition.
Proceedings of the Neural Information Processing - 26th International Conference, 2019
Proceedings of the International Conference on Multimodal Interaction, 2019
Proceedings of the 2019 3rd International Conference on Digital Signal Processing, 2019
NVSRN: A Neural Variational Scaling Reasoning Network for Initiative Response Generation.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019
Replay Attack Detection Using Magnitude and Phase Information with Attention-based Adaptive Filters.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Robust Sound Event Classification with Local Time-Frequency Information and Convolutional Neural Networks.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Text and Time Series, 2019
Emotional Contagion-Based Social Sentiment Mining in Social Networks by Introducing Network Communities.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019
Dimensional Emotion Recognition from Speech Using Modulation Spectral Features and Recurrent Neural Networks.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Effective Training End-to-End ASR systems for Low-resource Lhasa Dialect of Tibetan Language.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Acoustic Attributes of Citation Tones in Standard Chinese Produced by Prelingually Deaf Adults.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Sensors, 2018
Multim. Tools Appl., 2018
Unsupervised measure of Chinese lexical semantic similarity using correlated graph model for news story segmentation.
Neurocomputing, 2018
Incorporating network structure with node contents for community detection on large networks using deep learning.
Neurocomputing, 2018
Autoencoder Based Community Detection with Adaptive Integration of Network Topology and Node Contents.
Proceedings of the Knowledge Science, Engineering and Management, 2018
Investigation of the Comprehension Process during Silent Reading based on Eye Movements.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Manifold-based incremental community detection method for online speaker identification.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Replay Attacks Detection Using Phase and Magnitude Features with Various Frequency Resolutions.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Distant-talking Speech Recognition Based on Multi-objective Learning using Phase and Magnitude-based Feature.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Pitch Synchronized Relative Phase with Peak Error Detection For Noise-robust Speaker Recognition.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Revealing Spatiotemporal Brain Dynamics of Speech Production Based on EEG and Eye Movement.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Convolutional Neural Network with Spectrogram and Perceptual Features for Speech Emotion Recognition.
Proceedings of the Neural Information Processing - 25th International Conference, 2018
Proceedings of the Neural Information Processing - 25th International Conference, 2018
Auditory-Inspired End-to-End Speech Emotion Recognition Using 3D Convolutional Recurrent Neural Networks Based on Spectral-Temporal Representation.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018
A Feature Fusion Method Based on Extreme Learning Machine for Speech Emotion Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018
Interaction-Aware Topic Model for Microblog Conversations through Network Embedding and User Attention.
Proceedings of the 27th International Conference on Computational Linguistics, 2018
Implicit Discourse Relation Recognition using Neural Tensor Network with Interactive Attention and Sparse Learning.
Proceedings of the 27th International Conference on Computational Linguistics, 2018
Robust Detection of Link Communities in Large Social Networks by Exploiting Link Semantics.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
Multimodal sensory fusion for soccer robot self-localization based on long short-term memory recurrent neural network.
J. Ambient Intell. Humaniz. Comput., 2017
Simulation of heat conduction in fluids on GPU with particle method.
Comput. Syst. Sci. Eng., 2017
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017
Particle Interaction Adaptivity and Absorbing Boundary Conditions in the Lagrangian Particle Aeroacoustic Model.
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017
Global Monitoring of Dynamic Functional Interactions in the Brain During Chinese Verbs Perception.
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017
Proceedings of the Studies on Speech Production - 11th International Seminar, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 29th IEEE International Conference on Tools with Artificial Intelligence, 2017
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Exploiting the Tibetan Radicals in Recurrent Neural Network for Low-Resource Language Models.
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Proceedings of the Neural Information Processing - 24th International Conference, 2017
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Pseudo-pitch-synchronized phase information extraction and its application for robust speaker recognition.
Proceedings of the IEEE 6th Global Conference on Consumer Electronics, 2017
Speech emotion recognition using multichannel parallel convolutional recurrent neural networks based on gammatone auditory filterbank.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
The acoustic characteristics of tone 3 in standard Chinese produced by prelingually deaf adults.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
J. Signal Process. Syst., 2016
Multim. Tools Appl., 2016
Mapping ultrasound-based articulatory images and vowel sounds with a deep neural network framework.
Multim. Tools Appl., 2016
Sketch4Image: a novel framework for sketch-based image retrieval based on product quantization with coding residuals.
Multim. Tools Appl., 2016
Audio-visual speech recognition integrating 3D lip information obtained from the Kinect.
Multim. Syst., 2016
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016
Tongue performance in articulating Mandarin apical syllables by prelingual deaf adults using ultrasonic technology: Two case studies.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Relationship between perception and production of English vowels by Chinese English learners.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Voice activity detection based on sequential Gaussian mixture model with maximum likelihood criterion.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Study on the relation of fundamental and formant frequencies for affective speech synthesis.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
The singing voice before and after vocal warm-up by students of Chinese national singing.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Effects of Subglottal-Coupling and Interdental-Space on Formant Trajectories During Front-to-Back Vowel Transitions in Chinese.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Investigations into vowel and consonant structures in articulatory and auditory spaces using Laplacian eigenmaps.
Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016
Proceedings of the Chinese Lexical Semantics - 17th Workshop, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Investigation on acoustic modeling with different phoneme set for continuous Lhasa Tibetan recognition based on DNN method.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
A new method of acceleration measurement for observing tongue movement in ultrasound image during speech production.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
Strength of syllabic influences on articulation in Mandarin Chinese and French: Insights from a motor control approach.
J. Phonetics, 2015
An empirical study of phonetic transfer in English monophthong learning by Tibetan (Lhasa) speakers.
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015
Proceedings of the 2015 International Conference Oriental COCOSDA held jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
The perception of English vowel contrasts by Chinese EFL learners and native English speakers.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
A lip protrusion mechanism examined by magnetic resonance imaging and finite element modeling.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015
Vocal responses to frequency modulated composite sinewaves via auditory and vibrotactile pathways.
Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, 2015
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Chinese opera genre classification based on multi-feature fusion and extreme learning machine.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Automatic tongue contour tracking in ultrasound sequences without manual initialization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Investigation of relation between speech perception and production based on EEG source reconstruction.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
2014
Detection of speaker individual information using a phoneme effect suppression method.
Speech Commun., 2014
Frontiers Comput. Sci., 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014
Proceedings of the 2014 International Conference on Asian Language Processing, 2014
Proceedings of the 2014 International Conference on Asian Language Processing, 2014
2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the International Conference on Machine Learning and Cybernetics, 2013
Emotional McGurk Effect? A Cross-Cultural Investigation on Emotion Expression under Vocal and Facial Conflict.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2013
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
2012
Efficient feature extraction of speaker identification using phoneme mean F-ratio for Chinese.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, 2012
Mandarin vowel synthesis based on 2D and 3D vocal tract model by finite-difference time-domain method.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
An anisotropic diffusion filter for reducing speckle noise of ultrasound images based on separability.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
An investigation of dependencies between frequency components and speaker characteristics based on phoneme mean F-ratio contribution.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
IEEE ACM Trans. Audio Speech Lang. Process., 2011
Proceedings of the 17th International Congress of Phonetic Sciences, 2011
2010
IEEE Trans. Speech Audio Process., 2010
Investigation of muscle activation in speech production based on an articulatory model.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Discrimination between natural and unnatural articulations based on articulatory structure.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2010
2009
IEICE Trans. Inf. Syst., 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification.
Speech Commun., 2008
A model based investigation of activation patterns of the tongue muscles for vowel production.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
IEICE Trans. Inf. Syst., 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Physiological Feature Extraction for Text Independent Speaker Identification using Non-Uniform Subband Processing.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007
2006
Communication Between Speech Production and Perception Within the Brain-Observation and Simulation.
J. Comput. Sci. Technol., 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
2004
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
2002
Estimation of vocal tract shapes from speech sounds with a physiological articulatory model.
J. Phonetics, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
1996
An improved vocal tract model of vowel production implementing piriform resonance and transvelar nasal coupling.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
1994
A physiological model of speech production and the implication of tongue-larynx interaction.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994