Pascale Fung

Orcid: 0000-0002-0628-7132

Affiliations:
  • Hong Kong University of Science and Technology


According to our database, Pascale Fung authored at least 310 papers between 1991 and 2024.

Awards

IEEE Fellow 2015, "For contributions to human-machine interactions".

Bibliography

2024
Measuring Political Bias in Large Language Models: What Is Said and How It Is Said.
CoRR, 2024

LLMs Are Few-Shot In-Context Low-Resource Language Learners.
CoRR, 2024

Subobject-level Image Tokenization.
CoRR, 2024

2023
Survey of Hallucination in Natural Language Generation.
ACM Comput. Surv., December, 2023

Confucius, cyberpunk and Mr. Science: comparing AI ethics principles between China and the EU.
AI Ethics, May, 2023

IndoRobusta: Towards Robustness Against Diverse Code-Mixed Indonesian Local Languages.
CoRR, 2023

InstructTODS: Large Language Models for End-to-End Task-Oriented Dialogue Systems.
CoRR, 2023

Towards Mitigating Hallucination in Large Language Models via Self-Reflection.
CoRR, 2023

Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models.
CoRR, 2023

Survey of Social Bias in Vision-Language Models.
CoRR, 2023

Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition.
CoRR, 2023

Instruct-Align: Teaching Novel Languages with to LLMs through Alignment-based Cross-Lingual Instruction.
CoRR, 2023

Learn What NOT to Learn: Towards Generative Safety in Chatbots.
CoRR, 2023

InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Query-Focused Meeting Summarization with Query-Relevant Knowledge.
Proceedings of the Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023, 2023

PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Diverse and Faithful Knowledge-Grounded Dialogue Generation via Sequential Posterior Inference.
Proceedings of the International Conference on Machine Learning, 2023

Improving Fairness and Robustness in End-to-End Speech Recognition Through Unsupervised Clustering.
Proceedings of the IEEE International Conference on Acoustics, 2023

RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Towards Mitigating LLM Hallucination via Self Reflection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Contrastive Learning for Inference in Dialogue.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Mitigating Framing Bias with Polarity Minimization Loss.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Context Generation Improves Open Domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Which One Are You Referring To? Multimodal Object Identification in Situated Dialogue.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: EACL 2023, 2023

Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Generating Hashtags for Short-form Videos with Guided Signals.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

RHO: Reducing Hallucination in Open-domain Dialogues with Knowledge Grounding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023


2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources.
CoRR, 2022

RHO ($ρ$): Reducing Hallucination in Open-domain Dialogues with Knowledge Grounding.
CoRR, 2022

Casual Conversations v2: Designing a large consent-driven dataset to measure algorithmic bias and robustness.
CoRR, 2022

Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values.
CoRR, 2022

Every picture tells a story: Image-grounded controllable stylistic story generation.
CoRR, 2022

Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands.
CoRR, 2022

NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages.
CoRR, 2022

AiSocrates: Towards Answering Ethical Quandary Questions.
CoRR, 2022

VScript: Controllable Script Generation with Audio-Visual Presentation.
CoRR, 2022

CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition.
CoRR, 2022

Clozer: Adaptable Data Augmentation for Cloze-style Reading Comprehension.
Proceedings of the 7th Workshop on Representation Learning for NLP, 2022

Factuality Enhanced Language Models for Open-Ended Text Generation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

NeuS: Neutral Multi-News Summarization for Mitigating Framing Bias.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Korean Language Modeling via Syntactic Guide.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

VScript: Controllable Script Generation with Visual Presentation.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

QA4QG: Using Question Answering to Constrain Multi-Hop Question Generation.
Proceedings of the IEEE International Conference on Acoustics, 2022

ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide Association Study.
Proceedings of the 21st Workshop on Biomedical Language Processing, 2022

QAConv: Question Answering on Informative Conversations.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Integrating Question Rewrites in Conversational Question Answering: A Reinforcement Learning Approach.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022

Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Read before Generate! Faithful Long Form Question Answering with Machine Reading.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

How Long Is Enough? Exploring the Optimal Intervals of Long-Range Clinical Note Language Modeling.
Proceedings of the 13th International Workshop on Health Text Mining and Information Analysis, 2022

Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters.
Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

2021
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation.
CoRR, 2021

NER-BERT: A Pre-trained Model for Low-Resource Entity Tagging.
CoRR, 2021

Confucius, Cyberpunk and Mr. Science: Comparing AI ethics between China and the EU.
CoRR, 2021

Few-Shot Bot: Prompt-Based Learning for Dialogue Systems.
CoRR, 2021

Language Models are Few-shot Multilingual Learners.
CoRR, 2021

Greenformer: Factorization Toolkit for Efficient Deep Neural Networks.
CoRR, 2021

Nora: The Well-Being Coach.
CoRR, 2021

Retrieval-Free Knowledge-Grounded Dialogue Response Generation with Adapters.
CoRR, 2021

Weakly-supervised Multi-task Learning for Multimodal Affect Recognition.
CoRR, 2021

Dynamically Addressing Unseen Rumor via Continual Learning.
CoRR, 2021

Mitigating Media Bias through Neutral Article Generation.
CoRR, 2021

Towards Few-Shot Fact-Checking via Perplexity.
CoRR, 2021

ERICA: An Empathetic Android Companion for Covid-19 Quarantine.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Assessing Political Prudence of Open-domain Chatbots.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

X2Parser: Cross-Lingual and Cross-Domain Framework for Task-Oriented Compositional Semantic Parsing.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Preserving Cross-Linguality of Pre-trained Models via Continual Learning.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

BiToD: A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

On Unifying Misinformation Detection.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Towards Few-shot Fact-Checking via Perplexity.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Multimodal End-to-End Sparse Model for Emotion Recognition.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Ethical and Technological Challenges of Conversational AI.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Continual Learning in Task-Oriented Dialogue Systems.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Zero-Shot Dialogue State Tracking via Cross-Task Transfer.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Fight for 4230 at CheckThat! 2021: Domain-Specific Preprocessing and Pretrained Model for Ranking Claims by Check-Worthiness.
Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

Improve Query Focused Abstractive Summarization by Incorporating Answer Relevance.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Continual Mixed-Language Pre-Training for Extremely Low-Resource Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

CAiRE in DialDoc21: Data Augmentation for Information Seeking Dialogue System.
Proceedings of the 1st Workshop on Document-grounded Dialogue and Conversational Question Answering, 2021

Are Multilingual Models Effective in Code-Switching?
Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching, 2021

On the Importance of Word Order Information in Cross-lingual Sequence Labeling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

CrossNER: Evaluating Cross-Domain Named Entity Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

The Adapter-Bot: All-In-One Controllable Conversational Model.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Model Generalization on COVID-19 Fake News Detection.
Proceedings of the Combating Online Hostile Posts in Regional Languages during Emergency Situation, 2021

2020
A Study on the Autoregressive and non-Autoregressive Multi-label Learning.
CoRR, 2020

Dimsum @LaySumm 20: BART-based Approach for Scientific Document Summarization.
CoRR, 2020

EmoGraph: Capturing Emotion Correlations using Graph Networks.
CoRR, 2020

Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems.
CoRR, 2020

Misinformation Has High Perplexity.
CoRR, 2020

CAiRE-COVID: A Question Answering and Multi-Document Summarization System for COVID-19 Research.
CoRR, 2020

Exploring Fine-tuning Techniques for Pre-trained Cross-lingual Models via Continual Learning.
CoRR, 2020

Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection.
CoRR, 2020

Variational Transformers for Diverse Response Generation.
CoRR, 2020

XPersona: Evaluating Multilingual Personalized Chatbot.
CoRR, 2020

Do We Need Word Order Information for Cross-lingual Sequence Labeling.
CoRR, 2020

Attention over Parameters for Dialogue Systems.
CoRR, 2020

Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction.
Comput. Linguistics, 2020

Kungfupanda at SemEval-2020 Task 12: BERT-Based Multi-Task Learning for Offensive Language Detection.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Zero-Resource Cross-Domain Named Entity Recognition.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

Getting To Know You: User Attribute Extraction from Dialogues.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Learning Fast Adaptation on Cross-Accented Speech Recognition.
Proceedings of the Interspeech 2020, 2020

IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Modality-Transferable Emotion Embeddings for Low-Resource Multimodal Emotion Recognition.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Lightweight and Efficient End-To-End Speech Recognition Using Low-Rank Transformer.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving Spoken Question Answering Using Contextualized Word Representation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Generating Empathetic Responses by Looking Ahead the User's Sentiment.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Dimsum @LaySumm 20.
Proceedings of the First Workshop on Scholarly Document Processing, 2020

MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

CAiRE-COVID: A Question Answering and Query-focused Multi-Document Summarization System for COVID-19 Scholarly Information Management.
Proceedings of the 1st Workshop on NLP for COVID-19@EMNLP 2020, Online, December 2020, 2020

Multi-hop Question Generation with Graph Convolutional Network.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Plug-and-Play Conversational Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Learning Knowledge Bases with Parameters for Task-Oriented Dialogue Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Cross-lingual Spoken Language Understanding with Regularized Representation Alignment.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Meta-Transfer Learning for Code-Switched Speech Recognition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Attention-Informed Mixed-Language Training for Zero-Shot Cross-Lingual Task-Oriented Dialogue Systems.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CAiRE: An End-to-End Empathetic Chatbot.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Learning to Classify the Wrong Answers for Multiple Choice Question Answering (Student Abstract).
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
On the Effectiveness of Low-Rank Matrix Factorization for LSTM Model Compression.
CoRR, 2019

CAiRE: An End-to-End Empathetic Chatbot.
CoRR, 2019

HappyBot: Generating Empathetic Dialogue Responses by Improving User Experience Look-ahead.
CoRR, 2019

Towards Universal End-to-End Affect Recognition from Multilingual Speech by ConvNets.
CoRR, 2019

Exploring Perceived Emotional Intelligence of Personality-Driven Virtual Agents in Handling User Challenges.
Proceedings of the World Wide Web Conference, 2019

Incorporating Word and Subword Units in Unsupervised Machine Translation Using Language Model Rescoring.
Proceedings of the Fourth Conference on Machine Translation, 2019

CAiRE_HKUST at SemEval-2019 Task 3: Hierarchical Attention for Dialogue Emotion Classification.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Team yeon-zi at SemEval-2019 Task 4: Hyperpartisan News Detection by De-noising Weakly-labeled Data.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Learning Multilingual Meta-Embeddings for Code-Switching Named Entity Recognition.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

Modality-based Factorization for Multimodal Fusion.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

A Submodular Feature-Aware Framework for Label Subset Selection in Extreme Classification Problems.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A Novel Repetition Normalized Adversarial Reward for Headline Generation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Incorporate User Representation for Personal Question Answer Selection Using Siamese Network.
Proceedings of the IEEE International Conference on Acoustics, 2019

Learning Comment Generation by Leveraging User-generated Data.
Proceedings of the IEEE International Conference on Acoustics, 2019

Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Hierarchical Meta-Embeddings for Code-Switching Named Entity Recognition.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Zero-shot Cross-lingual Dialogue Systems with Transferable Latent Variables.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

MoEL: Mixture of Empathetic Listeners.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Human-Agent Communication: Connecting Research and Development in HCI and AI.
Proceedings of the Companion Publication of the 2019 ACM Conference on Computer Supported Cooperative Work and Social Computing, 2019

Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Personalizing Dialogue Agents via Meta-Learning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Exploring Social Bias in Chatbots using Stereotype Knowledge.
Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019, 2019

Understanding the Shades of Sexism in Popular TV Series.
Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019, 2019

Generalizing Question Answering System with Pre-trained Language Model Fine-tuning.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

GlobalTrait: Personality Alignment of Multilingual Word Embeddings.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Modality-based Factorization for Multimodal Fusion.
CoRR, 2018

Towards End-to-end Automatic Code-Switching Speech Recognition.
CoRR, 2018

Learn to Code-Switch: Data Augmentation using Copy Mechanism on Language Modeling.
CoRR, 2018

Investigating Audio, Visual, and Text Fusion Methods for End-to-End Automatic Personality Prediction.
CoRR, 2018

Cross-domain Dialogue Policy Transfer via Simultaneous Speech-act and Slot Alignment.
CoRR, 2018

Emo2Vec: Learning Generalized Emotion Representation by Multi-task Training.
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, 2018

PlusEmo2Vec at SemEval-2018 Task 1: Exploiting emotion knowledge from emoji and #hashtags.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

End-to-End Dynamic Query Memory Network for Entity-Value Independent Task-Oriented Dialog.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Attention-Based LSTM for Psychological Stress Detection from Spoken Language Using Distant Supervision.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Reducing Gender Bias in Abusive Language Detection.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Improving Large-Scale Fact-Checking using Decomposable Attention Models and Lexical Tagging.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Investigating Audio, Video, and Text Fusion Methods for End-to-End Automatic Personality Prediction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Bilingual Character Representation for Efficiently Addressing Out-of-Vocabulary Words in Code-Switching Named Entity Recognition.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

2017
Fine Grained Knowledge Transfer for Personalized Task-oriented Dialogue Systems.
CoRR, 2017

Adapting a Virtual Agent to User Personality.
Proceedings of the Advanced Social Interaction with Agents, 2017

Nora the Empathetic Psychologist.
Proceedings of the Interspeech 2017, 2017

Bilingual Word Embeddings for Cross-Lingual Personality Recognition Using Convolutional Neural Nets.
Proceedings of the Interspeech 2017, 2017

Emojive! Collecting Emotion Data from Speech and Facial Expression Using Mobile Game App.
Proceedings of the Interspeech 2017, 2017

A Note Based Query By Humming System Using Convolutional Neural Network.
Proceedings of the Interspeech 2017, 2017

A first look into a Convolutional Neural Network for speech emotion detection.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Perceived Emotional Intelligence in Virtual Agents.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

Zara Returns: Improved Personality Induction and Adaptation by an Empathetic Virtual Agent.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

One-step and Two-step Classification for Abusive Language Detection on Twitter.
Proceedings of the First Workshop on Abusive Language Online, 2017

2016
Multimodal deep neural nets for detecting humor in TV sitcoms.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Zara The Supergirl: An Empathetic Personality Recognition System.
Proceedings of the Demonstrations Session, 2016

A Long Short-Term Memory Framework for Predicting Humor in Dialogues.
Proceedings of the NAACL HLT 2016, 2016

A Machine Learning based Music Retrieval and Recommendation System.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Deep Learning of Audio and Language Features for Humor Prediction.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Zara: An Empathetic Interactive Virtual Agent.
Proceedings of the Interspeech 2016, 2016

Computational Approaches to Linguistic Code Switching.
Proceedings of the Interspeech 2016, 2016

Predicting humor response in dialogues from TV sitcoms.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Real-Time Speech Emotion and Sentiment Recognition for Interactive Dialogue Systems.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Zara: A Virtual Interactive Dialogue System Incorporating Emotion, Sentiment and Personality Recognition.
Proceedings of the COLING 2016, 2016

Towards Empathetic Human-Robot Interactions.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2016

2015
HLTC-HKUST: A Neural Network Paraphrase Classifier using Translation Metrics, Semantic Roles and Lexical Similarity Features.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

A comparison between a DNN and a CRF disfluency detection and reconstruction system.
Proceedings of the INTERSPEECH 2015, 2015

2014
Discriminatively Trained Sparse Inverse Covariance Matrices for Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Efficient Sparse Banded Acoustic Models for Speech Recognition.
IEEE Signal Process. Lett., 2014

A Hindi-English Code-Switching Corpus.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Co-Training for Classification of Live or Studio Music Recordings.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Code switch language modeling with Functional Head Constraint.
Proceedings of the IEEE International Conference on Acoustics, 2014

Acoustic modeling for hindi speech recognition in low-resource settings.
Proceedings of the International Conference on Audio, 2014

Language Modeling with Functional Head Constraint for Code Switching Speech Recognition.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Overview for the First Shared Task on Language Identification in Code-Switched Data.
Proceedings of the First Workshop on Computational Approaches to Code Switching@EMNLP 2014, 2014

2013
Sparse Inverse Covariance Matrices for Low Resource Speech Recognition.
IEEE Trans. Speech Audio Process., 2013

Cross-Lingual Language Modeling for Low-Resource Speech Recognition.
IEEE Trans. Speech Audio Process., 2013

Discriminatively trained sparse inverse covariance matrices for low resource acoustic modeling.
Proceedings of the INTERSPEECH 2013, 2013

Language modeling for mixed language speech recognition using weighted phrase extraction.
Proceedings of the INTERSPEECH 2013, 2013

Multimodal music emotion classification using AdaBoost with decision stumps.
Proceedings of the IEEE International Conference on Acoustics, 2013

Improved mixed language speech recognition using asymmetric acoustic model and language model with code-switch inversion constraints.
Proceedings of the IEEE International Conference on Acoustics, 2013

These words are music to my ears: Recognizing music emotion from lyrics using AdaBoost.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Identification of live or studio versions of a song via supervised learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Mining Parallel Documents Using Low Bandwidth and High Precision CLIR from the Heterogeneous Web.
Proceedings of the Building and Using Comparable Corpora, 2013

2012
Active learning with semi-automatic annotation for extractive speech summarization.
ACM Trans. Speech Lang. Process., 2012

Automatic Parliamentary Meeting Minute Generation Using Rhetorical Structure Modeling.
IEEE Trans. Speech Audio Process., 2012

Using English Acoustic Models for Hindi Automatic Speech Recognition.
Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing, 2012

Personalized music emotion classification via active learning.
Proceedings of the second international ACM workshop on Music information retrieval with user-centered and multimodal strategies, 2012

A Multilingual Natural Stress Emotion Database.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

A Mandarin-English Code-Switching Corpus.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Sparse banded precision matrices for low resource speech recognition.
Proceedings of the INTERSPEECH 2012, 2012

Low-resource speech recognition with automatically learned sparse inverse covariance matrices.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Phrase-level transduction model with reordering for spoken to written language transformation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Cross-Lingual Language Modeling with Syntactic Reordering for Low-Resource Speech Recognition.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Code-Switch Language Model with Inversion Constraints for Mixed Language Speech Recognition.
Proceedings of the COLING 2012, 2012

2011
Mining Parallel Documents Using Low Bandwidth and High Precision CLIR from the Heterogeneous Web.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

A Cross Gender and Cross Lingual Study on Acoustic Features for Stress Recognition in Speech.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

Automatic minute generation for parliamentary speech using conditional random fields.
Proceedings of the IEEE International Conference on Acoustics, 2011

Asymmetric acoustic modeling of mixed language speech.
Proceedings of the IEEE International Conference on Acoustics, 2011

Rare Word Translation Extraction from Aligned Comparable Documents.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Active Learning of Extractive Reference Summaries for Lecture Speech Summarization.
Proceedings of the 2nd Workshop on Building and Using Comparable Corpora: from Parallel to Non-parallel Corpora, 2011

2010
Chinese Machine Translation.
Proceedings of the Handbook of Natural Language Processing, Second Edition., 2010

Extractive Speech Summarization Using Shallow Rhetorical Structure Modeling.
IEEE Trans. Speech Audio Process., 2010

Learning deep rhetorical structure for extractive speech summarization.
Proceedings of the IEEE International Conference on Acoustics, 2010

A Rhetorical Syntax-Driven Model for Speech Summarization.
Proceedings of the COLING 2010, 2010

Unsupervised Synthesis of Multilingual Wikipedia Articles.
Proceedings of the COLING 2010, 2010

2009
Semantic Roles for SMT: A Hybrid Two-Pass Model.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Can Semantic Role Labeling Improve SMT?
Proceedings of the 13th Annual conference of the European Association for Machine Translation, 2009

Extractive speech summarization by active learning.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Multilingual spoken language processing.
IEEE Signal Process. Mag., 2008

RSHMM++ for extractive lecture speech summarization.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Using output probability distribution for OOV word rejection.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Rhetorical-State Hidden Markov Models for extractive speech summarization.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Learning bilingual semantic frames: shallow semantic parsing vs. semantic role projection.
Proceedings of the 11th Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages: Papers, 2007

Speech Summarization Without Lexical Features for Mandarin Broadcast News.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

A comparative study on speech summarization of broadcast news and lecture speech.
Proceedings of the INTERSPEECH 2007, 2007

Improving lecture speech summarization using rhetorical information.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

A Mandarin lecture speech transcription system for speech summarization.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
One story, one flow: Hidden Markov Story Models for multilingual multidocument summarization.
ACM Trans. Speech Lang. Process., 2006

Aligning word senses using bilingual corpora.
ACM Trans. Asian Lang. Inf. Process., 2006

Automatic Learning of Chinese English Semantic Structure Mapping.
Proceedings of the 2006 IEEE ACL Spoken Language Technology Workshop, 2006

HKUST/MTS: A Very Large Scale Mandarin Telephone Speech Corpus.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Multi-accent Chinese speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

Robust Word Sense Translation by EM Learning of Frame Semantics.
Proceedings of the ACL 2006, 2006

2005
Guest Editors' Introduction: Machine Learning in Speech and Language Technologies.
Mach. Learn., 2005

Acoustic and phonetic confusions in accented speech recognition.
Proceedings of the INTERSPEECH 2005, 2005

Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi-Comparable Corpora.
Proceedings of the Natural Language Processing, 2005

2004
State-dependent phonetic tied mixtures with pronunciation modeling for spontaneous speech recognition.
IEEE Trans. Speech Audio Process., 2004

A maximum-entropy Chinese parser augmented by transformation-based learning.
ACM Trans. Asian Lang. Inf. Process., 2004

Translation Disambiguation in Mixed Language Queries.
Mach. Transl., 2004

Pronunciation Modeling for Spontaneous Mandarin Speech Recognition.
Int. J. Speech Technol., 2004

Unsupervised Learning of a Chinese Spontaneous and Colloquial Speech Lexicon with Content and Filler Phrase Classification.
Int. J. Speech Technol., 2004

Using N-best lists for Named Entity Recognition from Chinese Speech.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Automatic Construction of an English-Chinese Bilingual FrameNet.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

A system for Mandarin short phrase recognition on portable devices.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Development of a Chinese telephony conversational corpus for speech processing [speech recognition applications].
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

A grammar-based Chinese to English speech translation system for portable devices.
Proceedings of the INTERSPEECH 2004, 2004

Mining Very-Non-Parallel Corpora: Parallel Sentence and Lexicon Extraction via Bootstrapping and E.
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 2004

Multi-level Bootstrapping For Extracting Parallel Sentences From a Quasi-Comparable Corpus.
Proceedings of the COLING 2004, 2004

BiFrameNet: Bilingual Frame Semantics Resource Construction by Cross-lingual Induction.
Proceedings of the COLING 2004, 2004

2003
Modeling partial pronunciation variations for spontaneous Mandarin speech recognition.
Comput. Speech Lang., 2003

Automatic phone set extension with confidence measure for spontaneous speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Flooring the observation probability for robust ASR in impulsive noise.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Triphone model reconstruction for Mandarin pronunciation variations.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Mandarin Pronunciation Modeling Based on CASS Corpus.
J. Comput. Sci. Technol., 2002

Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Model partial pronunciation variations for spontaneous Mandarin speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Identifying Concepts Across Languages: A First Step towards a Corpus-based Approach to Automatic Ontology Alignment.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

2001
Modeling pronunciation variation using context-dependent weighting and b/s refined acoustic modeling.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Estimating pronunciation variations from acoustic likelihood score for HMM reconstruction.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Automatic generation of pronunciation lexicons for Mandarin spontaneous speech.
Proceedings of the IEEE International Conference on Acoustics, 2001

CLEF 2001 Bilingual Task: Simple Dictionary-Based Query Translation.
Proceedings of the Working Notes for CLEF 2001 Workshop co-located with the 5th European Conference on Digital Libraries (ECDL 2001), 2001

2000
SALSA 3.0: A multilingual based speech Web Browser.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Principal mixture speaker adaptation for improved continuous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

An LLR-based technique for frame selection for GMM-based text-independent speaker identification.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

MLLR-based accent model adaptation without accented data.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Modelling pronunciation variations in spontaneous Mandarin speech.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

CASS: a phonetically transcribed corpus of Mandarin spontaneous speech.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Residual noise compensation for robust speech recognition in nonstationary noise.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Liftered forward masking procedure for robust digits recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A monolingual semantic decoder based on word sense disambiguation for mixed language understanding.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Decision tree-based triphones are robust and practical for Mandarin speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

MAP-based cross-language adaptation augmented by linguistic knowledge: from English to Chinese.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Fast accent identification and accented speech recognition.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

A more efficient and optimal LLR for decoding and verification.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Mixed Language Query Disambiguation.
Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 1999

1998
SALSA version 1.0: a speech-based web browser for Hong Kong English.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A Statistical View on Bilingual Lexicon Extraction: From Parallel Corpora to Non-parallel Corpora.
Proceedings of the Machine Translation and the Information Soup, 1998

An IR Approach for Translating New Words from Nonparallel, Comparable Texts.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1997
A Technical Word- and Term-Translation Aid Using Noisy Parallel Corpora across Language Groups.
Mach. Transl., 1997

Finding Terminology Translations from Non-parallel Corpora.
Proceedings of the Fifth Workshop on Very Large Corpora, 1997

1996
Domain word translation by space-frequency analysis of context length histograms.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora.
Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, 1995

Compiling Bilingual Lexicon Entries From a Non-Parallel English-Chinese Corpus.
Proceedings of the Third Workshop on Very Large Corpora, 1995

1994
Statistical Augmentation of a Chinese Machine-Readable Dictionary.
CoRR, 1994

K-vec: A New Approach for Aligning Parallel Texts.
Proceedings of the 15th International Conference on Computational Linguistics, 1994

Improving Chinese Tokenization With Linguistic Filters On Statistical Lexical Acquisition.
Proceedings of the 4th Applied Natural Language Processing Conference, 1994

Aligning Noisy Parallel Corpora Across Language Groups: Word Pair Feature Matching by Dynamic Time Warping.
Proceedings of the First Conference of the Association for Machine Translation in the Americas, 1994

1993
The estimation of powerful language models from small and large corpora.
Proceedings of the IEEE International Conference on Acoustics, 1993

The BBN/HARC spoken language understanding system.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
BBN BYBLOS and HARC February 1992 ATIS Benchmark Results.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Design and performance of HARC, the BBN spoken language understanding system.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

1991
Unsupervised speaker normalization by speaker Markov model converter for speaker-independent speech recognition.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

