We stand with Ukraine

We stand with Ukraine

Jing Huang

Affiliations:

JD AI Research and Platform, Mountain View, CA, USA
IBM Thomas J. Watson Research Center, Yorktown Heights, NY, USA
Cornell University, Ithaca, NY, USA (PhD 1998)

According to our database¹, Jing Huang authored at least 81 papers between 1997 and 2023.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

On csauthors.net:

Bibliography

2023

PragmatiCQA: A Dataset for Pragmatic Question Answering in Conversations.

[DOI]

,

,

Christopher D. Manning

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

SpanDrop: Simple and Effective Counterfactual Learning for Long Sequences.

[DOI]

,

,

CoRR, 2022

Video2StyleGAN: Encoding Video in Latent Space for Manipulation.

[DOI]

,

,

,

,

CoRR, 2022

Cross-modal Contrastive Distillation for Instructional Activity Anticipation.

[DOI]

,

,

,

,

,

,

Proceedings of the 26th International Conference on Pattern Recognition, 2022

Improving Time Sensitivity for Question Answering over Temporal Knowledge Graphs.

[DOI]

,

,

,

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Joint System-Wise Optimization for Pipeline Goal-Oriented Dialog System.

[DOI]

,

,

,

,

CoRR, 2021

Conversational AI Systems for Social Good: Opportunities and Challenges.

[DOI]

,

,

,

,

CoRR, 2021

Entity and Evidence Guided Document-Level Relation Extraction.

[DOI]

,

,

,

,

Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Variance-reduced First-order Meta-learning for Natural Language Processing Tasks.

[DOI]

,

,

,

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Graph Ensemble Learning over Multiple Dependency Trees for Aspect-level Sentiment Classification.

[DOI]

,

,

,

,

,

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Inductive Learning on Commonsense Knowledge Graph Completion.

[DOI]

,

,

,

,

,

Proceedings of the International Joint Conference on Neural Networks, 2021

Multi-hop Attention Graph Neural Networks.

[DOI]

,

,

,

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Semantic Categorization of Social Knowledge for Commonsense Question Answering.

[DOI]

,

,

,

Kathleen R. McKeown

,

Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing, 2021

Open Temporal Relation Extraction for Question Answering.

[DOI]

,

,

,

,

,

Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling.

[DOI]

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Direct Multi-hop Attention based Graph Neural Network.

[DOI]

,

,

,

CoRR, 2020

Entity and Evidence Guided Relation Extraction for DocRED.

[DOI]

,

,

,

CoRR, 2020

Graph Sequential Network for Reasoning over Sequences.

[DOI]

,

,

,

CoRR, 2020

SkipConvNet: Skip Convolutional Neural Network for Speech Dereverberation Using Optimally Smoothed Spectral Mapping.

[DOI]

Vinay Kothapally

,

,

Shahram Ghorbani

,

John H. L. Hansen

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving Neural Language Generation with Spectrum Control.

[DOI]

,

,

,

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Speaker-Invariant Affective Representation Learning via Adversarial Training.

[DOI]

,

,

,

Shrikanth Narayanan

,

Panayiotis G. Georgiou

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding.

[DOI]

,

,

,

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Select, Answer and Explain: Interpretable Multi-Hop Reading Comprehension over Multiple Documents.

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Zero-Shot Text-to-SQL Learning with Auxiliary Task.

[DOI]

Shuaichen Chang

,

,

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Improving Graph Attention Networks with Large Margin-based Constraints.

[DOI]

,

,

,

CoRR, 2019

Selective Attention Based Graph Convolutional Networks for Aspect-Level Sentiment Classification.

[DOI]

,

,

,

,

,

CoRR, 2019

Relation Module for Non-answerable Prediction on Question Answering.

[DOI]

,

,

,

,

CoRR, 2019

Multiple instance learning with graph neural networks.

[DOI]

,

,

,

CoRR, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.

[DOI]

CoRR, 2019

Towards adversarial learning of speaker-invariant representation for speech emotion recognition.

[DOI]

,

,

,

,

CoRR, 2019

Speaker Diarization with Lexical Information.

[DOI]

,

,

,

,

,

Panayiotis G. Georgiou

,

Shrikanth Narayanan

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Stride Self-Attention for Speech Recognition.

[DOI]

,

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Cross-lingual Text-independent Speaker Verification Using Unsupervised Adversarial Discriminative Domain Adaptation.

[DOI]

,

,

John H. L. Hansen

Proceedings of the IEEE International Conference on Acoustics, 2019

Deep Speaker Embedding Learning with Multi-level Pooling for Text-independent Speaker Verification.

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2019

Relation Module for Non-Answerable Predictions on Reading Comprehension.

[DOI]

,

,

,

,

Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs.

[DOI]

,

,

,

,

,

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

End-to-End Structure-Aware Convolutional Networks for Knowledge Base Completion.

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2017

Interpretable Convolutional Neural Networks with Dual Local and Global Attention for Review Rating Prediction.

[DOI]

,

,

,

Proceedings of the Eleventh ACM Conference on Recommender Systems, 2017

2013

State of the art discriminative training of subspace constrained Gaussian mixture models in big training corpora.

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2013

Audio-visual deep learning for noise robust speech recognition.

[DOI]

,

Brian Kingsbury

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Multi-View and Multi-Objective Semi-Supervised Learning for HMM-Based Automatic Speech Recognition.

[DOI]

,

,

Jen-Tzung Chien

IEEE Trans. Speech Audio Process., 2012

Affine invariant sparse maximum a posteriori adaptation.

[DOI]

,

,

Steven J. Rennie

,

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Front-end feature transforms with context filtering for speaker adaptation.

[DOI]

,

Karthik Visweswariah

,

,

Proceedings of the IEEE International Conference on Acoustics, 2011

Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition.

[DOI]

,

,

Jen-Tzung Chien

Proceedings of the IEEE International Conference on Acoustics, 2011

Sparse Maximum A Posteriori adaptation.

[DOI]

,

,

,

Steven J. Rennie

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2009

Automatic Speech Recognition.

[DOI]

Gerasimos Potamianos

,

,

Matthias Wölfel

,

,

Etienne Marcheret

,

,

,

John W. McDonough

,

Javier Hernando

,

,

Proceedings of the Computers in the Human Interaction Loop, 2009

Combined discriminative training for multi-stream HMM-based audio-visual speech recognition.

[DOI]

,

Karthik Visweswariah

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Acoustic fall detection using Gaussian mixture models and GMM supervectors.

[DOI]

,

,

Gerasimos Potamianos

,

Mark Hasegawa-Johnson

Proceedings of the IEEE International Conference on Acoustics, 2009

Long-time span acoustic activity analysis from far-field sensors in smart homes.

[DOI]

,

,

,

Gerasimos Potamianos

Proceedings of the IEEE International Conference on Acoustics, 2009

Improved decision trees for multi-stream HMM-based audio-visual continuous speech recognition.

[DOI]

,

Karthik Visweswariah

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2007

Detection, diarization, and transcription of far-field lecture speech.

[DOI]

,

Etienne Marcheret

,

Karthik Visweswariah

,

,

Gerasimos Potamianos

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Improving speaker diarization for CHIL lecture meetings.

[DOI]

,

Etienne Marcheret

,

Karthik Visweswariah

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings.

[DOI]

,

Etienne Marcheret

,

Karthik Visweswariah

,

Gerasimos Potamianos

Proceedings of the Multimodal Technologies for Perception of Humans, 2007

The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings.

[DOI]

,

Etienne Marcheret

,

Karthik Visweswariah

,

,

Gerasimos Potamianos

Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006

The IBM RT06s Evaluation System for Speech Activity Detection in CHIL Seminars.

[DOI]

Etienne Marcheret

,

Gerasimos Potamianos

,

Karthik Visweswariah

,

Proceedings of the Machine Learning for Multimodal Interaction, 2006

The IBM Rich Transcription Spring 2006 Speech-to-Text System for Lecture Meetings.

[DOI]

,

Martin Westphal

,

Stanley F. Chen

,

,

,

,

,

,

,

Gerasimos Potamianos

Proceedings of the Machine Learning for Multimodal Interaction, 2006

2005

Improving lip-reading with feature space transforms for multi-stream audio-visual speech recognition.

[DOI]

,

Karthik Visweswariah

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Discriminatively trained features using fMPE for multi-stream audio-visual speech recognition.

[DOI]

,

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Rapid Feature Space Speaker Adaptation for Multi-Stream HMM-Based Audio-Visual Speech Recognition.

[DOI]

,

Etienne Marcheret

,

Karthik Visweswariah

Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004

Audio-visual speech recognition using an infrared headset.

[DOI]

,

Gerasimos Potamianos

,

Jonathan Connell

,

Chalapathy Neti

Speech Commun., 2004

Towards practical deployment of audio-visual speech recognition.

[DOI]

Gerasimos Potamianos

,

Chalapathy Neti

,

,

Jonathan H. Connell

,

,

,

Etienne Marcheret

,

,

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Editorial.

[DOI]

,

Mukund Padmanabhan

,

Savitha Srinivasan

EURASIP J. Adv. Signal Process., 2003

Automatic Hierarchical Color Image Classification.

[DOI]

,

,

EURASIP J. Adv. Signal Process., 2003

Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives.

[DOI]

Bhuvana Ramabhadran

,

,

Upendra V. Chaudhari

,

Giridharan Iyengar

,

Harriet J. Nock

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Towards automatic transcription of large spoken archives - English ASR for the MALACH project.

[DOI]

Bhuvana Ramabhadran

,

,

Michael Picheny

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Improving audio-visual speech recognition with an infrared headset.

[DOI]

,

Gerasimos Potamianos

,

Chalapathy Neti

Proceedings of the AVSP 2003, 2003

2002

Automatic speech recognition performance on a voicemail transcription task.

[DOI]

Mukund Padmanabhan

,

,

,

Brian Kingsbury

,

IEEE Trans. Speech Audio Process., 2002

Maximum entropy model for punctuation annotation from speech.

[DOI]

,

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model.

[DOI]

,

,

Ramesh Gopinath

,

Brian Kingsbury

,

,

Karthik Visweswariah

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

Extracting Caller Information from Voicemail.

[DOI]

,

,

Mukund Padmanabhan

Proceedings of the Information Retrieval Techniques for Speech Applications [this book is based on the workshop "Information Retrieval Techniques for Speech Applications", 2001

Information Extraction from Voicemail.

[DOI]

,

,

Mukund Padmanabhan

Proceedings of the Association for Computational Linguistic, 2001

2000

Recent improvements in speech recognition performance on large vocabulary conversational speech (voicemail and switchboard).

[DOI]

,

Brian Kingsbury

,

,

Mukund Padmanabhan

,

,

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Multistage coarticulation model combining articulatory, formant and cepstral features.

[DOI]

,

,

,

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999

Spatial Color Indexing and Applications.

[DOI]

,

,

,

,

Int. J. Comput. Vis., 1999

Recent improvements in voicemail transcription.

[DOI]

Mukund Padmanabhan

,

,

,

,

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A study of adaptation techniques on a voicemail transcription task.

[DOI]

,

Mukund Padmanabhan

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998

An Automatic Hierarchical Image Classification Scheme.

[DOI]

,

,

Proceedings of the 6th ACM International Conference on Multimedia '98, 1998

Spatial Color Indexing and Applications.

[DOI]

,

,

,

Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

1997

Combining Supervised Learning with Color Correlograms for Content-Based Image Retrieval.

[DOI]

,

,

Proceedings of the Fifth ACM International Conference on Multimedia '97, 1997

Image Indexing Using Color Correlograms.

[DOI]

,

,

,

,

Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

Loading...