Yunbo Cao

ORCID: 0009-0005-2558-5206

According to our database, Yunbo Cao authored at least 116 papers between 2002 and 2024.

Bibliography

2024
Pretraining without wordpieces: learning over a vocabulary of millions of words.
Int. J. Mach. Learn. Cybern., September, 2024

Unifying Token- and Span-level Supervisions for Few-shot Sequence Labeling.
ACM Trans. Inf. Syst., January, 2024

DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

SkillNet-X: A Multilingual Multitask Model with Sparsely Activated Skills.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

Large Language Models are not Fair Evaluators.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Automatic Context Pattern Generation for Entity Set Expansion.
IEEE Trans. Knowl. Data Eng., December, 2023

Cognitive Diagnosis-Based Personalized Exercise Group Assembly via a Multi-Objective Evolutionary Algorithm.
IEEE Trans. Emerg. Top. Comput. Intell., June, 2023

IEKM: A Model Incorporating External Keyword Matrices.
CoRR, 2023

Making Large Language Models Better Reasoners with Alignment.
CoRR, 2023

Large Language Models are not Fair Evaluators.
CoRR, 2023

Bi-Drop: Generalizable Fine-tuning for Pre-trained Language Models via Adaptive Subnetwork Optimization.
CoRR, 2023

DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade.
CoRR, 2023

Finding Similar Exercises in Retrieval Manner.
CoRR, 2023

TQ-Net: Mixed Contrastive Representation Learning For Heterogeneous Test Questions.
CoRR, 2023

QURG: Question Rewriting Guided Context-Dependent Text-to-SQL Semantic Parsing.
Proceedings of the PRICAI 2023: Trends in Artificial Intelligence, 2023

Contextual Similarity is More Valuable Than Character Similarity: An Empirical Study for Chinese Spell Checking.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service Chatlog.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Read Then Respond: Multi-granularity Grounding Prediction for Knowledge-Grounded Dialogue Generation.
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023

Soft Language Clustering for Multilingual Model Pre-training.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

G³R: A Graph-Guided Generate-and-Rerank Framework for Complex and Cross-domain Text-to-SQL Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Enhancing Continual Relation Extraction via Classifier Decomposition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service Chatlog.
CoRR, 2022

Instance Segmentation for Chinese Character Stroke Extraction, Datasets and Benchmarks.
CoRR, 2022

Less is More: Rethinking State-of-the-art Continual Relation Extraction Models with a Frustratingly Easy but Effective Approach.
CoRR, 2022

Contextual Similarity is More Valuable than Character Similarity: Curriculum Learning for Chinese Spell Checking.
CoRR, 2022

HPT: Hierarchy-aware Prompt Tuning for Hierarchical Text Classification.
CoRR, 2022

SmartSales: Sales Script Extraction and Analysis from Sales Chatlog.
CoRR, 2022

An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

A Non-Hierarchical Attention Network with Modality Dropout for Textual Response Generation in Multimodal Dialogue Systems.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Contrastive Learning with Prompt-derived Virtual Semantic Prototypes for Unsupervised Sentence Embedding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

DualNER: A Dual-Teaching framework for Zero-shot Cross-lingual Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

CQR-SQL: Conversational Question Reformulation Enhanced Context-Dependent Text-to-SQL Parsers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

HPT: Hierarchy-aware Prompt Tuning for Hierarchical Text Classification.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Learning Robust Representations for Continual Relation Extraction via Adversarial Class Augmentation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DialogUSR: Complex Dialogue Utterance Splitting and Reformulation for Multiple Intent Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Learning from the Dictionary: Heterogeneous Knowledge Guided Fine-tuning for Chinese Spell Checking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

AiM: Taking Answers in Mind to Correct Chinese Cloze Tests in Educational Applications.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

A Prerequisite Attention Model for Knowledge Proficiency Diagnosis of Students.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Knowledge-Sensed Cognitive Diagnosis for Intelligent Education Platforms.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Introducing Response Time into Guessing and Slipping for Cognitive Diagnosis.
Proceedings of the Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners' and Doctoral Consortium, 2022

Pre-training and Fine-tuning Neural Topic Model: A Simple yet Effective Approach to Incorporating External Knowledge.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Hierarchical Curriculum Learning for AMR Parsing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Seeking Patterns, Not just Memorizing Procedures: Contrastive Learning for Solving Math Word Problems.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

The Past Mistake is the Future Wisdom: Error-driven Contrastive Probability Optimization for Chinese Spell Checking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Type-Driven Multi-Turn Corrections for Grammatical Error Correction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

G4: Grounding-guided Goal-oriented Dialogues Generation with Multiple Documents.
Proceedings of the Second DialDoc Workshop on Document-grounded Dialogue and Conversational Question Answering, 2022

2021
Exploring Student Representation For Neural Cognitive Diagnosis.
CoRR, 2021

Correlation-Guided Representation for Multi-Label Text Classification.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

A Divide-And-Conquer Approach for Multi-label Multi-hop Relation Detection in Knowledge Base Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Diversity and Consistency: Exploring Visual Question-Answer Pair Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

LANA: Towards Personalized Deep Knowledge Tracing Through Distinguishable Interactive Sequences.
Proceedings of the 14th International Conference on Educational Data Mining, 2021

Enhancing Dialogue-based Relation Extraction by Speaker and Trigger Words Prediction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Enhancing Label Correlation Feedback in Multi-Label Text Classification via Multi-Task Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Dialogue Response Selection with Hierarchical Curriculum Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Improving BERT with Syntax-aware Local Attention.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

A Unified Multi-Task Learning Framework for Joint Extraction of Entities and Relations.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Exploiting Unlabeled Data via Partial Label Assignment for Multi-Class Semi-Supervised Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

MMKE: A Multi-Model Knowledge Extraction System from Unstructured Texts.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

RadarMath: An Intelligent Tutoring System for Math Education.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
The SPPD System for Schema Guided Dialogue State Tracking Challenge.
CoRR, 2020

LARQ: Learning to Ask and Rewrite Questions for Community Question Answering.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

Asking Effective and Diverse Questions: A Machine Reading Comprehension based Framework for Joint Entity-Relation Extraction.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Entity Relative Position Representation Based Multi-head Selection for Joint Entity and Relation Extraction.
Proceedings of the Chinese Computational Linguistics - 19th China National Conference, CCL 2020, Hainan, China, October 30, 2020

2018
Overview of the NLPCC 2018 Shared Task: Spoken Language Understanding in Task-Oriented Dialog Systems.
Proceedings of the Natural Language Processing and Chinese Computing, 2018

egoStellar: Visual Analysis of Anomalous Communication Behaviors from Egocentric Perspective.
Proceedings of the New Trends in Computer Technologies and Applications, 2018

Mention and Entity Description Co-Attention for Entity Disambiguation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
A Statistical Framework for Product Description Generation.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

2014
Collective Tweet Wikification based on Semi-supervised Graph Regularization.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
An error driven approach to query segmentation.
Proceedings of the 22nd International World Wide Web Conference, 2013

The MSR.KM System for Entity Linking at TAC 2013.
Proceedings of the Sixth Text Analysis Conference, 2013

Learning a Replacement Model for Query Segmentation with Consistency in Search Logs.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

2012
A Lazy Learning Model for Entity Linking using Query-Specific Information.
Proceedings of the COLING 2012, 2012

2011
Re-ranking question search results by clustering questions.
J. Assoc. Inf. Sci. Technol., 2011

A structural support vector method for extracting contexts and answers of questions from online forums.
Inf. Process. Manag., 2011

MSRA at TAC 2011: Entity Linking.
Proceedings of the Fourth Text Analysis Conference, 2011

I2R-NUS-MSRA at TAC 2011: Entity Linking.
Proceedings of the Fourth Text Analysis Conference, 2011

Leveraging Unlabeled Data to Scale Blocking for Record Linkage.
Proceedings of the IJCAI 2011, 2011

Learning to Suggest Questions in Online Forums.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Microsoft Research Asia with Redmond at the NTCIR-8 Community QA Pilot Task.
Proceedings of the 8th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2010

Automatic extraction of web data records containing user-generated content.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
A Structural Support Vector Method for Extracting Contexts and Answers of Questions from Online Forums.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Learning to recommend questions based on user ratings.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Competitor Mining with the Web.
IEEE Trans. Knowl. Data Eng., 2008

Recommending questions using the mdl-based tree cut model.
Proceedings of the 17th International Conference on World Wide Web, 2008

Understanding and Summarizing Answers in Community-Based Question Answering Services.
Proceedings of the COLING 2008, 2008

Searching Questions by Identifying Question Topic and Question Focus.
Proceedings of the ACL 2008, 2008

A Probabilistic Model for Fine-Grained Expert Search.
Proceedings of the ACL 2008, 2008

Question Utility: A Novel Static Ranking of Question Search.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Web page title extraction and its application.
Inf. Process. Manag., 2007

Research on Enterprise Track of TREC 2007 at SJTU APEX Lab.
Proceedings of The Sixteenth Text REtrieval Conference, 2007

Using Social Annotations to Smooth the Language Model for IR.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

Low-Quality Product Review Detection in Opinion Summarization.
Proceedings of the EMNLP-CoNLL 2007, 2007

Searching Documents Based on Relevance and Type.
Proceedings of the Advances in Information Retrieval, 2007

Using social annotations to improve language model for information retrieval.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

2006
A Supervised Learning Approach to Search of Definitions.
J. Comput. Sci. Technol., 2006

Automatic extraction of titles from general documents using machine learning.
Inf. Process. Manag., 2006

Research on Expert Search at Enterprise Track of TREC 2006.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

Adapting ranking SVM to document retrieval.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

CoMiner: An Effective Algorithm for Mining Competitors from the Web.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

Mining Latent Associations of Objects Using a Typed Mixture Model--A Case Study on Expert/Expertise Mining.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

Cost-Sensitive Learning of SVM for Ranking.
Proceedings of the Machine Learning: ECML 2006, 2006

A Supervised Learning Approach to Entity Search.
Proceedings of the Information Retrieval Technology, 2006

2005
Ranking definitions with supervised learning methods.
Proceedings of the 14th international conference on World Wide Web, 2005

Research on Expert Search at Enterprise Track of TREC 2005.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Title extraction from bodies of HTML documents and its application to web page retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Email data cleaning.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Automatic extraction of titles from general documents using machine learning.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2005

A new approach to intranet search based on information extraction.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

2003
Using Bilingual Web Data to Mine and Rank Translations.
IEEE Intell. Syst., 2003

Uncertainty Reduction in Collaborative Bootstrapping: Measure and Algorithm.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

2002
Base Noun Phrase Translation Using Web Data and the EM Algorithm.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

