Cong Liu

Orcid: 0009-0003-0328-423X

Affiliations:
  • IFLYTEK Research, Hefei, China
  • Microsoft Research Asia, Beijing, China (former)
  • University of Science and Technology of China, Hefei, China (former)


According to our database1, Cong Liu authored at least 37 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Dynamic facial expression recognition with pseudo-label guided multi-modal pre-training.
IET Comput. Vis., February, 2024

2023
1DFormer: Learning 1D Landmark Representations via Transformer for Facial Landmark Tracking.
CoRR, 2023

MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation.
CoRR, 2023

SHINE: Syntax-augmented Hierarchical Interactive Encoder for Zero-shot Cross-lingual Information Extraction.
CoRR, 2023

X-Adv: Physical Adversarial Object Attacks against X-ray Prohibited Item Detection.
Proceedings of the 32nd USENIX Security Symposium, 2023

Handwritten Chemical Structure Image to Structure-Specific Markup Using Random Conditional Guided Decoder.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

JiuZhang 2.0: A Unified Chinese Pre-trained Language Model for Multi-task Mathematical Problem Solving.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Self-Supervised Audio-Visual Speech Representations Learning by Multimodal Self-Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Multimodal Information Based Speech Processing (Misp) 2022 Challenge: Audio-Visual Diarization And Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Summary on the Multimodal Information Based Speech Processing (MISP) 2022 Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

MADNet: Maximizing Addressee Deduction Expectation for Multi-Party Conversation Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
RCIT: An RSVP-Based Concealed Information Test Framework Using EEG Signals.
IEEE Trans. Cogn. Dev. Syst., 2022

InterHT: Knowledge Graph Embeddings by Interaction between Head and Tail Entities.
CoRR, 2022

Few-shot X-ray Prohibited Item Detection: A Benchmark and Weak-feature Enhancement Network.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

The First Multimodal Information Based Speech Processing (Misp) Challenge: Data, Tasks, Baselines And Results.
Proceedings of the IEEE International Conference on Acoustics, 2022

Wider & Closer: Mixture of Short-channel Distillers for Zero-shot Cross-lingual Named Entity Recognition.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
A Deep Analysis of Speech Separation Guided Diarization Under Realistic Conditions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2018
Fast and Robust Detection of Anatomical Landmarks Using Cascaded 3D Convolutional Networks Guided by Linear Square Regression.
Proceedings of the Biometric Recognition - 13th Chinese Conference, 2018

2017
Nonrecurrent Neural Structure for Long-Term Dependence.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

2016
Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition.
EURASIP J. Adv. Signal Process., 2016

2015
Feedforward Sequential Memory Networks: A New Structure to Learn Long-term Dependency.
CoRR, 2015

Multi-task deep neural network acoustic models with model adaptation using discriminative speaker identity for whisper recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments.
Proceedings of the Latent Variable Analysis and Signal Separation, 2015

A unified speaker-dependent speech separation and enhancement system based on deep neural networks.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

2013
A cluster-based multiple deep neural networks method for large vocabulary continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Incoherent training of deep neural networks to de-correlate bottleneck features for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Investigation of deep neural networks (DNN) for large vocabulary continuous speech recognition: Why DNN surpasses GMMS in acoustic modeling.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

2011
Trust Region-Based Optimization for Maximum Mutual Information Estimation of HMMs in Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

2010
Phonetic clustering based confidence measure for embedded speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

A bounded trust region optimization for discriminative training of HMMS in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
A trust region based optimization for maximum mutual information estimation of HMMS in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
A Constrained Line Search Optimization Method for Discriminative Training of HMMs.
IEEE Trans. Speech Audio Process., 2008

Exploiting Non-Target Region Information for Confidence Measure Based on Bayesian Information Criterion.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

2007
A Constrained Line Search Optimization for Discriminative Training in Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2007

A constrained line search approach to general discriminative HMM training.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007


  Loading...