Jing Shi

Orcid: 0000-0003-3225-7145

Affiliations:

Chinese Academy of Sciences, Institute of Automation, Research Center for Brain-inspired Intelligence, Beijing, China

According to our database¹, Jing Shi authored at least 35 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction.

[BibT_eX]

[DOI]

Mach. Intell. Res., February, 2024

ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

VLP: A Survey on Vision-language Pre-training.

[BibT_eX]

[DOI]

Int. J. Autom. Comput., 2023

Local-to-Global Causal Reasoning for Cross-Document Relation Extraction.

[BibT_eX]

[DOI]

IEEE CAA J. Autom. Sinica, 2023

A dilemma of ground truth in noisy speech separation and an approach to lessen the impact of imperfect training data.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2023

ViLaS: Integrating Vision and Language into Automatic Speech Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

Mixture of personality improved Spiking actor network for efficient multi-agent cooperation.

[BibT_eX]

[DOI]

CoRR, 2023

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.

[BibT_eX]

[DOI]

CoRR, 2023

Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Matching-Based Term Semantics Pre-Training for Spoken Patient Query Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Train from scratch: Single-stage joint training of speech separation and recognition.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2022

Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021

Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem.

[BibT_eX]

[DOI]

CoRR, 2021

Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.

[BibT_eX]

[DOI]

Chenda Li

Jing Shi

Wangyou Zhang

Aswin Shanmugam Subramanian

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2021

Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and a Small Step.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Recent Developments on Espnet Toolkit Boosted By Conformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Aswin Shanmugam Subramanian

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.

[BibT_eX]

[DOI]

Aswin Shanmugam Subramanian

Wangyou Zhang

CoRR, 2020

Audio-visual Speech Separation with Adversarially Disentangled Visual Representation.

[BibT_eX]

[DOI]

CoRR, 2020

Neural Speaker Diarization with Speaker-Wise Chain Rule.

[BibT_eX]

[DOI]

CoRR, 2020

Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker-Conditional Chain Model for Speech Separation and Extraction.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Concept learning through deep reinforcement learning with memory-augmented neural networks.

[BibT_eX]

[DOI]

Neural Networks, 2019

Which Ones Are Speaking? Speaker-Inferred Model for Multi-Talker Speech Separation.

[BibT_eX]

[DOI]

Jing Shi

Jiaming Xu

Bo Xu

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

Learning to activate logic rules for textual reasoning.

[BibT_eX]

[DOI]

Neural Networks, 2018

Improving Speech Separation with Adversarial Network and Reinforcement Learning.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Distilled Binary Neural Network for Monaural Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Listen, Think and Listen Again: Capturing Top-down Auditory Attention for Speaker-independent Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2016

Ensemble of Feature Sets and Classification Methods for Stance Detection.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Hierarchical Memory Networks for Answer Selection on Unknown Words.

[BibT_eX]

[DOI]

Proceedings of the COLING 2016, 2016

Jing Shi

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...