Jing Shi
Orcid: 0000-0003-3225-7145Affiliations:
- Chinese Academy of Sciences, Institute of Automation, Research Center for Brain-inspired Intelligence, Beijing, China
According to our database1,
Jing Shi
authored at least 35 papers
between 2016 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction.
Mach. Intell. Res., February, 2024
ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
IEEE CAA J. Autom. Sinica, 2023
A dilemma of ground truth in noisy speech separation and an approach to lessen the impact of imperfect training data.
Comput. Speech Lang., 2023
Mixture of personality improved Spiking actor network for efficient multi-agent cooperation.
CoRR, 2023
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.
CoRR, 2023
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Train from scratch: Single-stage joint training of speech separation and recognition.
Comput. Speech Lang., 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
2021
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem.
CoRR, 2021
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
Proceedings of the International Joint Conference on Neural Networks, 2021
Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and a Small Step.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.
CoRR, 2020
Audio-visual Speech Separation with Adversarially Disentangled Visual Representation.
CoRR, 2020
Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2019
Concept learning through deep reinforcement learning with memory-augmented neural networks.
Neural Networks, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Listen, Think and Listen Again: Capturing Top-down Auditory Attention for Speaker-independent Speech Separation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2016
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016
Proceedings of the COLING 2016, 2016