Jing Shi

ORCID: 0000-0003-3225-7145

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, Research Center for Brain-inspired Intelligence, Beijing, China


According to our database, Jing Shi authored at least 35 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction.
Mach. Intell. Res., February, 2024

ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
VLP: A Survey on Vision-language Pre-training.
Int. J. Autom. Comput., 2023

Local-to-Global Causal Reasoning for Cross-Document Relation Extraction.
IEEE CAA J. Autom. Sinica, 2023

A dilemma of ground truth in noisy speech separation and an approach to lessen the impact of imperfect training data.
Comput. Speech Lang., 2023

ViLaS: Integrating Vision and Language into Automatic Speech Recognition.
CoRR, 2023

Mixture of personality improved Spiking actor network for efficient multi-agent cooperation.
CoRR, 2023

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages.
CoRR, 2023

Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Matching-Based Term Semantics Pre-Training for Spoken Patient Query Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Train from scratch: Single-stage joint training of speech separation and recognition.
Comput. Speech Lang., 2022

Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem.
CoRR, 2021

Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
Proceedings of the International Joint Conference on Neural Networks, 2021

Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and a Small Step.
Proceedings of the IEEE International Conference on Acoustics, 2021

Recent Developments on Espnet Toolkit Boosted By Conformer.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.
CoRR, 2020

Audio-visual Speech Separation with Adversarially Disentangled Visual Representation.
CoRR, 2020

Neural Speaker Diarization with Speaker-Wise Chain Rule.
CoRR, 2020

Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker-Conditional Chain Model for Speech Separation and Extraction.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Concept learning through deep reinforcement learning with memory-augmented neural networks.
Neural Networks, 2019

Which Ones Are Speaking? Speaker-Inferred Model for Multi-Talker Speech Separation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Learning to activate logic rules for textual reasoning.
Neural Networks, 2018

Improving Speech Separation with Adversarial Network and Reinforcement Learning.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Distilled Binary Neural Network for Monaural Speech Separation.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Listen, Think and Listen Again: Capturing Top-down Auditory Attention for Speaker-independent Speech Separation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2016
Ensemble of Feature Sets and Classification Methods for Stance Detection.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

Hierarchical Memory Networks for Answer Selection on Unknown Words.
Proceedings of the COLING 2016, 2016

