Jing Huang

Affiliations:
  • JD AI Research and Platform, Mountain View, CA, USA
  • IBM Thomas J. Watson Research Center, Yorktown Heights, NY, USA
  • Cornell University, Ithaca, NY, USA (PhD 1998)


According to our database1, Jing Huang authored at least 80 papers between 1997 and 2022.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
SpanDrop: Simple and Effective Counterfactual Learning for Long Sequences.
CoRR, 2022

Video2StyleGAN: Encoding Video in Latent Space for Manipulation.
CoRR, 2022

Cross-modal Contrastive Distillation for Instructional Activity Anticipation.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Improving Time Sensitivity for Question Answering over Temporal Knowledge Graphs.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Joint System-Wise Optimization for Pipeline Goal-Oriented Dialog System.
CoRR, 2021

Conversational AI Systems for Social Good: Opportunities and Challenges.
CoRR, 2021

Entity and Evidence Guided Document-Level Relation Extraction.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Variance-reduced First-order Meta-learning for Natural Language Processing Tasks.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Graph Ensemble Learning over Multiple Dependency Trees for Aspect-level Sentiment Classification.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Inductive Learning on Commonsense Knowledge Graph Completion.
Proceedings of the International Joint Conference on Neural Networks, 2021

Multi-hop Attention Graph Neural Networks.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Semantic Categorization of Social Knowledge for Commonsense Question Answering.
Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing, 2021

Open Temporal Relation Extraction for Question Answering.
Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Direct Multi-hop Attention based Graph Neural Network.
CoRR, 2020

Entity and Evidence Guided Relation Extraction for DocRED.
CoRR, 2020

Graph Sequential Network for Reasoning over Sequences.
CoRR, 2020

SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation Using Optimally Smoothed Spectral Mapping.
Proceedings of the Interspeech 2020, 2020

Improving Neural Language Generation with Spectrum Control.
Proceedings of the 8th International Conference on Learning Representations, 2020

Speaker-Invariant Affective Representation Learning via Adversarial Training.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Select, Answer and Explain: Interpretable Multi-Hop Reading Comprehension over Multiple Documents.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Zero-Shot Text-to-SQL Learning with Auxiliary Task.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Improving Graph Attention Networks with Large Margin-based Constraints.
CoRR, 2019

Selective Attention Based Graph Convolutional Networks for Aspect-Level Sentiment Classification.
CoRR, 2019

Relation Module for Non-answerable Prediction on Question Answering.
CoRR, 2019

Multiple instance learning with graph neural networks.
CoRR, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019

Towards adversarial learning of speaker-invariant representation for speech emotion recognition.
CoRR, 2019

Speaker Diarization with Lexical Information.
Proceedings of the Interspeech 2019, 2019


Multi-Stride Self-Attention for Speech Recognition.
Proceedings of the Interspeech 2019, 2019

Cross-lingual Text-independent Speaker Verification Using Unsupervised Adversarial Discriminative Domain Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Speaker Embedding Learning with Multi-level Pooling for Text-independent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Relation Module for Non-Answerable Predictions on Reading Comprehension.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

End-to-End Structure-Aware Convolutional Networks for Knowledge Base Completion.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2017
Interpretable Convolutional Neural Networks with Dual Local and Global Attention for Review Rating Prediction.
Proceedings of the Eleventh ACM Conference on Recommender Systems, 2017

2013
State of the art discriminative training of subspace constrained Gaussian mixture models in big training corpora.
Proceedings of the IEEE International Conference on Acoustics, 2013

Audio-visual deep learning for noise robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Multi-View and Multi-Objective Semi-Supervised Learning for HMM-Based Automatic Speech Recognition.
IEEE Trans. Speech Audio Process., 2012

Affine invariant sparse maximum a posteriori adaptation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Front-end feature transforms with context filtering for speaker adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2011

Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Sparse Maximum A Posteriori adaptation.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2009
Automatic Speech Recognition.
Proceedings of the Computers in the Human Interaction Loop, 2009

Combined discriminative training for multi-stream HMM-based audio-visual speech recognition.
Proceedings of the INTERSPEECH 2009, 2009

Acoustic fall detection using Gaussian mixture models and GMM supervectors.
Proceedings of the IEEE International Conference on Acoustics, 2009

Long-time span acoustic activity analysis from far-field sensors in smart homes.
Proceedings of the IEEE International Conference on Acoustics, 2009

Improved decision trees for multi-stream HMM-based audio-visual continuous speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2007
Detection, diarization, and transcription of far-field lecture speech.
Proceedings of the INTERSPEECH 2007, 2007

Improving speaker diarization for CHIL lecture meetings.
Proceedings of the INTERSPEECH 2007, 2007

The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
The IBM RT06s Evaluation System for Speech Activity Detection in CHIL Seminars.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

The IBM Rich Transcription Spring 2006 Speech-to-Text System for Lecture Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

2005
Improving lip-reading with feature space transforms for multi-stream audio-visual speech recognition.
Proceedings of the INTERSPEECH 2005, 2005

Discriminatively trained features using fMPE for multi-stream audio-visual speech recognition.
Proceedings of the INTERSPEECH 2005, 2005

Rapid Feature Space Speaker Adaptation for Multi-Stream HMM-Based Audio-Visual Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004
Audio-visual speech recognition using an infrared headset.
Speech Commun., 2004

Towards practical deployment of audio-visual speech recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Editorial.
EURASIP J. Adv. Signal Process., 2003

Automatic Hierarchical Color Image Classification.
EURASIP J. Adv. Signal Process., 2003

Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Towards automatic transcription of large spoken archives - English ASR for the MALACH project.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Improving audio-visual speech recognition with an infrared headset.
Proceedings of the AVSP 2003, 2003

2002
Automatic speech recognition performance on a voicemail transcription task.
IEEE Trans. Speech Audio Process., 2002

Maximum entropy model for punctuation annotation from speech.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Extracting caller information from voicemail.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Information Extraction from Voicemail.
Proceedings of the Association for Computational Linguistic, 2001

2000
Recent improvements in speech recognition performance on large vocabulary conversational speech (voicemail and switchboard).
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Multistage coarticulation model combining articulatory, formant and cepstral features.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Spatial Color Indexing and Applications.
Int. J. Comput. Vis., 1999

Recent improvements in voicemail transcription.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A study of adaptation techniques on a voicemail transcription task.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
An Automatic Hierarchical Image Classification Scheme.
Proceedings of the 6th ACM International Conference on Multimedia '98, 1998

Spatial Color Indexing and Applications.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

1997
Combining Supervised Learning with Color Correlograms for Content-Based Image Retrieval.
Proceedings of the Fifth ACM International Conference on Multimedia '97, 1997

Image Indexing Using Color Correlograms.
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997


  Loading...