Yu Zhang

Orcid: 0000-0003-0540-6758

Affiliations:

Texas A&M University, Department of Computer Science & Engineering, TX, USA
University of Illinois at Urbana-Champaign, Department of Computer Science, IL, USA (former)

According to our database¹, Yu Zhang authored at least 81 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction.

[BibT_eX]

[DOI]

CoRR, May, 2026

Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

Mahantesh Halappanavar

CoRR, February, 2026

Improving Scientific Document Retrieval with Academic Concept Index.

[BibT_eX]

[DOI]

CoRR, January, 2026

Knowledge Homophily in Large Language Models.

[BibT_eX]

[DOI]

Utkarsh Sahu

Zhisheng Qi

Mahantesh Halappanavar

Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

Rigorizing Retrieval-augmented Generation with Structured Knowledge Intelligence (6 Hrs).

[BibT_eX]

[DOI]

Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, 2026

MemeBridge: A Dataset for Benchmarking and Mitigating the Bidirectional Cultural Gap in Meme Interpretation.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

ReviewGrounder: Improving Review Substantiveness with Rubric-Guided, Tool-Integrated Agents.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use.

[BibT_eX]

[DOI]

CoRR, October, 2025

Curriculum Reinforcement Learning from Easy to Hard Tasks Improves LLM Reasoning.

[BibT_eX]

[DOI]

CoRR, June, 2025

RM-R1: Reward Modeling as Reasoning.

[BibT_eX]

[DOI]

CoRR, May, 2025

Chain-of-Factors Paper-Reviewer Matching.

[BibT_eX]

[DOI]

Proceedings of the ACM on Web Conference 2025, 2025

Improving Scientific Document Retrieval with Concept Coverage-based Query Set Generation.

[BibT_eX]

[DOI]

Proceedings of the Eighteenth ACM International Conference on Web Search and Data Mining, 2025

Machine Learning on Graphs in the Era of Generative Artificial Intelligence.

[BibT_eX]

[DOI]

Mahantesh Halappanavar

Jiliang Tang

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

SKnow-LLM Workshop: Structured Knowledge for Large Language Models.

[BibT_eX]

[DOI]

Vassilis N. Ioannidis

Leman Akoglu

Danai Koutra

Huzefa Rangwala

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Protein Large Language Models: A Comprehensive Survey.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Internal and External Impacts of Natural Language Processing Papers.

[BibT_eX]

[DOI]

Yu Zhang

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

A Unified Taxonomy-Guided Instruction Tuning Framework for Entity Set Expansion and Taxonomy Expansion.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs.

[BibT_eX]

[DOI]

CoRR, 2024

Bridging Text Data and Graph Data: Towards Semantics and Structure-aware Knowledge Discovery.

[BibT_eX]

[DOI]

Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Ontology Enrichment for Effective Fine-grained Entity Typing.

[BibT_eX]

[DOI]

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

The MAPLE Benchmark for Graph Mining.

[BibT_eX]

[DOI]

Dataset, February, 2023

The MAPLE Benchmark for Graph Mining.

[BibT_eX]

[DOI]

Dataset, February, 2023

The MAPLE Benchmark for Scientific Literature Tagging.

[BibT_eX]

[DOI]

Dataset, February, 2023

"Why Should I Review This Paper?" Unifying Semantic, Topic, and Citation Factors for Paper-Reviewer Matching.

[BibT_eX]

[DOI]

CoRR, 2023

Learning Multiplex Embeddings on Text-rich Networks with One Text Encoder.

[BibT_eX]

[DOI]

CoRR, 2023

PromptClass: Weakly-Supervised Text Classification with Prompting Enhanced Noise-Robust Self-Training.

[BibT_eX]

[DOI]

CoRR, 2023

Chain-of-Skills: A Configurable Model for Open-domain Question Answering.

[BibT_eX]

[DOI]

CoRR, 2023

The Effect of Metadata on Scientific Literature Tagging: A Cross-Field Cross-Model Study.

[BibT_eX]

[DOI]

Proceedings of the ACM Web Conference 2023, 2023

Tutorials at The Web Conference 2023.

[BibT_eX]

[DOI]

Krishnaram Kenthapadi

Behrooz Omidvar-Tehrani

Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts.

[BibT_eX]

[DOI]

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation.

[BibT_eX]

[DOI]

Proceedings of the 2023 SIAM International Conference on Data Mining, 2023

Weakly Supervised Multi-Label Classification of Full-Text Scientific Papers.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Pretrained Language Representations for Text Understanding: A Weakly-Supervised Perspective.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Heterformer: Transformer-based Deep Node Representation Learning on Heterogeneous Text-Rich Networks.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2023

Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Mining Structures from Massive Texts by Exploring the Power of Pre-trained Language Models.

[BibT_eX]

[DOI]

Yu Zhang

Yunyi Zhang

Jiawei Han

Proceedings of the Proceedings 26th International Conference on Extending Database Technology, 2023

Chain-of-Skills: A Configurable Model for Open-Domain Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Patton: Language Model Pretraining on Text-Rich Networks.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Heterogeneous Network Representation Learning: A Unified Framework With Survey and Benchmark.

[BibT_eX]

[DOI]

IEEE Trans. Knowl. Data Eng., 2022

Heterformer: A Transformer Architecture for Node Representation Learning on Heterogeneous Text-Rich Networks.

[BibT_eX]

[DOI]

CoRR, 2022

Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification.

[BibT_eX]

[DOI]

Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations.

[BibT_eX]

[DOI]

Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

MotifClass: Weakly Supervised Text Classification with Higher-order Metadata Information.

[BibT_eX]

[DOI]

Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Generating Training Data with Language Models: Towards Zero-Shot Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Adapting Pretrained Representations for Text Mining.

[BibT_eX]

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Entity Set Co-Expansion in StackOverflow.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Big Data, 2022

REACTCLASS: Cross-Modal Supervision for Subword-Guided Reactant Entity Classification.

[BibT_eX]

[DOI]

Danielle Cherrice Loving

Heng Ji

Martin D. Burke

Jiawei Han

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

2021

MATCH: Metadata-Aware Text Classification in A Large Hierarchy.

[BibT_eX]

[DOI]

Proceedings of the WWW '21: The Web Conference 2021, 2021

Hierarchical Metadata-Aware Document Categorization under Weak Supervision.

[BibT_eX]

[DOI]

Proceedings of the WSDM '21, 2021

Simulating Online Social Response: A Stimulus/Response Perspective.

[BibT_eX]

[DOI]

Proceedings of the Winter Simulation Conference, 2021

On the Power of Pre-Trained Text Representations: Models and Applications in Text Mining.

[BibT_eX]

[DOI]

Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Partially-Typed NER Datasets Integration: Connecting Practice to Theory.

[BibT_eX]

[DOI]

CoRR, 2020

Heterogeneous Network Representation Learning: Survey, Benchmark, Evaluation, and Beyond.

[BibT_eX]

[DOI]

CoRR, 2020

Multiscale online media simulation with SocialCube.

[BibT_eX]

[DOI]

Boleslaw K. Szymanski

Comput. Math. Organ. Theory, 2020

Discriminative Topic Mining via Category-Name Guided Text Embedding.

[BibT_eX]

[DOI]

Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Minimally Supervised Categorization of Text with Metadata.

[BibT_eX]

[DOI]

Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding.

[BibT_eX]

[DOI]

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Textual Evidence Mining via Spherical Heterogeneous Information Network Embedding.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Pattern-enhanced Named Entity Recognition with Distant Supervision.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019

Cross-type biomedical named entity recognition with deep multi-task learning.

[BibT_eX]

[DOI]

Bioinform., 2019

Integrating Local Context and Global Cohesiveness for Open Information Extraction.

[BibT_eX]

[DOI]

Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Distantly Supervised Biomedical Named Entity Recognition with Dictionary Expansion.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

Diversifying seeds and audience in social influence maximization.

[BibT_eX]

[DOI]

Yu Zhang

Proceedings of the ASONAM '19: International Conference on Advances in Social Networks Analysis and Mining, 2019

2018

Integrating Local Context and Global Cohesiveness for Open Information Extraction.

[BibT_eX]

[DOI]

CoRR, 2018

Open Information Extraction with Global Structure Constraints.

[BibT_eX]

[DOI]

Proceedings of the Companion Proceedings of the The Web Conference 2018, 2018

Weakly-supervised Relation Extraction by Pattern-enhanced Embedding Learning.

[BibT_eX]

[DOI]

Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Pattern Discovery for Wide-Window Open Information Extraction in Biomedical Literature.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

PENNER: Pattern-enhanced Nested Named Entity Recognition in Biomedical Literature.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

Open Information Extraction with Meta-pattern Discovery in Biomedical Literature.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

2017

Overcoming Limited Supervision in Relation Extraction: A Pattern-enhanced Distributional Representation Approach.

[BibT_eX]

[DOI]

CoRR, 2017

Top-K Influential Nodes in Social Networks: A Game Perspective.

[BibT_eX]

[DOI]

Yu Zhang

Yan Zhang

Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Yu Zhang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...