Yu Zhao

Orcid: 0000-0002-0074-091X

Affiliations:
  • Nankai University, College of Computer Science, Tianjin, China


According to our database, Yu Zhao authored at least 49 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Dark Side of Modalities: Reinforced Multimodal Distillation for Multimodal Knowledge Graph Reasoning.
CoRR, July, 2025

Hyper-modal Imputation Diffusion Embedding with Dual-Distillation for Federated Multimodal Knowledge Graph Completion.
CoRR, June, 2025

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly.
CoRR, May, 2025

Noiser: Bounded Input Perturbations for Attributing Large Language Models.
CoRR, April, 2025

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression.
CoRR, March, 2025

ZS-MNET: A zero-shot learning based approach to multimodal named entity typing.
Neural Networks, 2025

ME3A: A Multimodal Entity Entailment framework for multimodal Entity Alignment.
Inf. Process. Manag., 2025

Grammar induction from visual, speech and text.
Artif. Intell., 2025

Multimodal Graph-Based Variational Mixture of Experts Network for Zero-Shot Multimodal Information Extraction.
Proceedings of the ACM on Web Conference 2025, 2025

Multimodal Knowledge Graph Error Detection with Disentanglement VAE and Multi-Grained Triplet Confidence.
Proceedings of the ACM on Web Conference 2025, 2025

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Are We Done with MMLU?
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Structured Packing in LLM Training Improves Long Context Utilization.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Retrieval-style In-context Learning for Few-shot Hierarchical Text Classification.
Trans. Assoc. Comput. Linguistics, 2024

Analysing the Residual Stream of Language Models Under Knowledge Conflicts.
CoRR, 2024

A Simple and Effective L₂ Norm-Based Strategy for KV Cache Compression.
CoRR, 2024

A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning.
CoRR, 2024

The Hallucinations Leaderboard - An Open Effort to Measure Hallucinations in Large Language Models.
CoRR, 2024

Contrast then Memorize: Semantic Neighbor Retrieval-Enhanced Inductive Multimodal Knowledge Graph Completion.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Synergistic Dual Spatial-aware Generation of Image-to-text and Text-to-image.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

The ACM Multimedia 2024 Visual Spatial Description Grand Challenge.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SpeechEE: A Novel Benchmark for Speech Event Extraction.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

XFashion: Character Animation Generation via Facial-enhanced and Granularly Controlling.
Proceedings of the 5th International Workshop on Human-centric Multimedia Analysis, 2024

TimeR⁴: Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

A Simple and Effective L₂ Norm-Based Strategy for KV Cache Compression.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MCIL: Multimodal Counterfactual Instance Learning for Low-resource Entity-based Multimodal Information Extraction.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Bring Invariant to Variant: A Contrastive Prompt-based Framework for Temporal Knowledge Graph Forecasting.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Look before You Leap: Dual Logical Verification for Knowledge-based Visual Question Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Analysing The Impact of Sequence Composition on Language Model Pre-Training.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MELOV: Multimodal Entity Linking with Optimized Visual Features in Latent Space.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Harnessing Holistic Discourse Features and Triadic Interaction for Sentiment Quadruple Extraction in Dialogues.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Non-parametric, Nearest-neighbor-assisted Fine-tuning for Neural Machine Translation.
CoRR, 2023

Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

HITSZ TMG at ICASSP 2023 SPGC Shared Task: Leveraging Pre-Training and Distillation Method for Title Generation with Limited Resource.
Proceedings of the IEEE International Conference on Acoustics, 2023

Incorporating Object-Level Visual Context for Multimodal Fine-Grained Entity Typing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

From Alignment to Entailment: A Unified Textual Entailment Framework for Entity Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Generating Visual Spatial Description via Holistic 3D Scene Understanding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Fake news detection via knowledgeable prompt learning.
Inf. Process. Manag., 2022

Medical Dialogue Response Generation with Pivotal Information Recalling.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

MoSE: Modality Split and Ensemble for Multimodal Knowledge Graph Completion.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

An Efficient Memory-Augmented Transformer for Knowledge-Intensive NLP Tasks.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Overcoming Language Priors in Visual Question Answering via Distinguishing Superficially Similar Instances.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

DEAR: Dual-Level Self-attention GRU for Online Early Prediction of Sepsis.
Proceedings of the Web Information Systems and Applications, 2022

2021
GlyphCRM: Bidirectional Encoder Representation for Chinese Character with its Glyph.
CoRR, 2021

MSDF: A General Open-Domain Multi-skill Dialog Framework.
Proceedings of the Natural Language Processing and Chinese Computing, 2021

2020
Dependency Parsing with Noisy Multi-annotation Data.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

2017

