Yihong Liu

Orcid: 0000-0003-1073-0958

Affiliations:
  • Ludwig Maximilian University of Munich (LMU), Center for Information and Language Processing, Munich Center for Machine Learning (MCML), Munich, Germany
  • Southwest University, College of Computer and Information Science, Chongqing, China (former)


According to our database1, Yihong Liu authored at least 26 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Your Pretrained Model Tells the Difficulty Itself: A Self-Adaptive Curriculum Learning Paradigm for Natural Language Understanding.
CoRR, July, 2025

Refusal Direction is Universal Across Safety-Aligned Languages.
CoRR, May, 2025

Tracing Multilingual Factual Knowledge Acquisition in Pretraining.
CoRR, May, 2025

HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization.
CoRR, April, 2025

On Relation-Specific Neurons in Large Language Models.
CoRR, February, 2025

M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis.
CoRR, February, 2025

How Transliterations Improve Crosslingual Alignment.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

How Programming Concepts and Neurons Are Shared in Code Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Lost in Multilinguality: Dissecting Cross-lingual Factual Inconsistency in Transformer Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

LangSAMP: Language-Script Aware Multilingual Pretraining.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists.
CoRR, 2024

Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts.
CoRR, 2024

MoSECroT: Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer.
CoRR, 2024

OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

SynthEval: Hybrid Behavioral Testing of NLP Models with Synthetic Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
A study of conceptual language similarity: comparison and evaluation.
CoRR, 2023

Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs.
CoRR, 2023

On the Copying Problem of Unsupervised NMT: A Training Schedule with a Language Discriminator Loss.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

A Crosslingual Investigation of Conceptualization in 1335 Languages.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Flow-Adapter Architecture for Unsupervised Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
A label-oriented loss function for learning sentence representations.
Comput. Speech Lang., 2021


  Loading...