Int. J. Intell. Robotics Appl., September, 2024

Mechanism analysis and suppression control strategy of frictional impact for humanoid robots.

[DOI]

Int. J. Intell. Robotics Appl., September, 2024

A Feasible Scheme for the Backward Transmission in the Three-User X Channel with Reciprocal Propagation Delay.

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2024

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer.

[DOI]

CoRR, 2024

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.

[DOI]

CoRR, 2024

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis.

[DOI]

CoRR, 2024

Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback.

[DOI]

CoRR, 2024

Asynchronous and Segmented Bidirectional Encoding for NMT.

[DOI]

CoRR, 2024

A Paradigm for Generating Operational Seamless Land Surface Temperature Products.

[DOI]

Proceedings of the IGARSS 2024, 2024

Efficient Reinforcement Learning via Decoupling Exploration and Utilization.

[DOI]

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline.

[DOI]

Mark Hasegawa-Johnson

Laureano Moro-Velázquez

Najim Dehak

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

Diffsound: Discrete Diffusion Model for Text-to-Sound Generation.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Optimistic and Pessimistic Actor in RL: Decoupling Exploration and Utilization.

[DOI]

CoRR, 2023

Benchmarking Large Language Models on CMExam - A comprehensive Chinese Medical Exam Dataset.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS.

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model.

[DOI]

Laureano Moro-Velázquez

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Masked Spectrogram Prediction for Self-Supervised Audio Pre-Training.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Convergence analysis of the nontrivial stationary solution of the memristor-based neural networks with reaction-diffusion terms.

[DOI]

Proceedings of the 15th International Conference on Advanced Computational Intelligence, 2023

2022

Singular Configuration Analysis and Singularity Avoidance with Application in an Intelligent Robotic Manipulator.

[DOI]

Sensors, 2022

A Truck-Borne System Based on Cold Atom Gravimeter for Measuring the Absolute Gravity in the Field.

[DOI]

Sensors, 2022

Multi-Irradiance: A Method for Simultaneous Measurement of the Temperature and Spectral Emissivity of High-Temperature Targets in SWIR.

[DOI]

Sensors, 2022

Adaptive recurrent cerebellar error observer for robust dynamic biped walking.

[DOI]

Hao Zhang

Int. J. Robotics Autom., 2022

A Two-student Learning Framework for Mixed Supervised Target Sound Detection.

[DOI]

CoRR, 2022

Calibrate and Refine! A Novel and Agile Framework for ASR Error Robust Intent Detection.

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection.

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving Target Sound Extraction with Timestamp Information.

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Mutual Learning Framework for Few-Shot Sound Event Detection.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

A Mixed Supervised Learning Framework For Target Sound Detection.

[DOI]

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Detect What You Want: Target Sound Detection.

[DOI]

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021

Detect what you want: Target Sound Detection.

[DOI]

CoRR, 2021

Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model.

[DOI]

CoRR, 2021

Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency.

[DOI]

CoRR, 2021

Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification.

[DOI]

Dongchao Yang

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification.

[DOI]

Wenwu Wang

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation.

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Contrastive Self-Supervised Learning for Text-Independent Speaker Verification.

[DOI]

Haoran Zhang

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

A Global-Local Attention Framework for Weakly Labelled Audio Tagging.

[DOI]

Wenwu Wang

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information.

[DOI]

Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter- and Intra-modality Attention.

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Modeling Label Dependencies for Audio Tagging With Graph Convolutional Network.

[DOI]

IEEE Signal Process. Lett., 2020

Finite-time stabilization of periodic orbits for under-actuated biped walking with hybrid zero dynamics.

[DOI]

Commun. Nonlinear Sci. Numer. Simul., 2020

Environmental Sound Classification with Parallel Temporal-Spectral Attention.

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Acoustic Scene Classification with Spectrogram Processing Strategies.

[DOI]

DaDing Chong

Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019

Learning discriminative and robust time-frequency representations for environmental sound classification.

[DOI]

CoRR, 2019

What Affects the Performance of Convolutional Neural Networks for Audio Event Classification.

[DOI]

Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019

2017

Gait generation and control of biped robot with moving torso based on virtual constraint.

[DOI]

Hao Zhang

Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics, 2017

2016

Omnidirectional walking based on preview control for biped robots.

[DOI]

Chengju Liu