We stand with Ukraine

We stand with Ukraine

Lijun Wu

Orcid: 0000-0002-3530-590X

Affiliations:

Microsoft Research, Beijing, China
Sun Yat-sen University, School of Data and Computer Science, Guangzhou, China (PhD 2020)

According to our database¹, Lijun Wu authored at least 104 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

On csauthors.net:

Bibliography

2025

Sequential Diffusion Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, September, 2025

Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, August, 2025

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, August, 2025

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, July, 2025

CovDocker: Benchmarking Covalent Drug Design with Tasks, Datasets, and Solutions.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, June, 2025

GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, March, 2025

NatureLM: Deciphering the Language of Nature for Scientific Discovery.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Krzysztof Maziarz

,

Marwin H. S. Segler

,

,

,

,

,

,

,

,

,

,

CoRR, February, 2025

GRAIT: Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, February, 2025

SE3Set: Harnessing Equivariant Hypergraph Neural Networks for Molecular Representation Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Trans. Mach. Learn. Res., 2025

GRAIT: Gradient-Driven Refusal-Aware Instruction Tuning for Effective Hallucination Mitigation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.1, 2025

3D-MolT5: Leveraging Discrete Structural Information for Molecule-Text Modeling.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

OpenHuEval: Evaluating Large Language Model on Hungarian Specifics.

[BibT_eX]

[DOI]

,

,

,

Noémi Ligeti-Nagy

,

,

,

Zijian Gyozo Yang

,

,

,

,

,

,

,

,

,

,

,

,

,

Gábor Prószéky

,

Proceedings of the Findings of the Association for Computational Linguistics, 2025

MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenge.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Randomness Regularization With Simple Consistency Training for Neural Networks.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

Enhancing Low-Resource NLP by Consistency Training With Data and Model Perturbations.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Tokenizing 3D Molecule Structure with Quantized Spherical Coordinates.

[BibT_eX]

[DOI]

,

,

,

,

,

John E. Hopcroft

,

,

CoRR, 2024

SFM-Protein: Integrative Co-evolutionary Pre-training for Advanced Protein Sequence Representation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Xiaozhuan Liang

,

,

,

CoRR, 2024

3D-MolT5: Towards Unified 3D Molecule-Text Modeling with 3D Molecular Tokenization.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

FABind+: Enhancing Molecular Docking through Improved Pocket Prediction and Pose Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

Infusing Self-Consistency into Density Functional Theory Hamiltonian Prediction via Deep Equilibrium Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CReSE: Benchmark Data and Automatic Evaluation Framework for Recommending Eligibility Criteria from Clinical Trial Information.

[BibT_eX]

[DOI]

,

,

David Seung U. Lee

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Exploiting Pre-trained Models for Drug Target Affinity Prediction with Nearest Neighbors.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning.

[BibT_eX]

[DOI]

,

,

,

Xiaozhuan Liang

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Breaking the barriers of data scarcity in drug-target affinity prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Briefings Bioinform., September, 2023

Are the BERT family zero-shot learners? A study on their potential and limitations.

[BibT_eX]

[DOI]

,

,

,

,

Artif. Intell., September, 2023

Incorporating NODE with pre-trained neural differential operator for learning dynamics.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Neurocomputing, April, 2023

Masked Contrastive Representation Learning for Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

R2-DDI: relation-aware feature refinement for drug-drug interaction prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Briefings Bioinform., January, 2023

AMOM: Adaptive Masking over Masking for Conditional Masked Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

FABind: Fast and Accurate Protein-Ligand Binding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dual-view Molecular Pre-training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Pre-training Antibody Language Models for Antigen-Specific Computational Antibody Design.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

CT4Rec: Simple yet Effective Consistency Training for Sequential Recommendation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

𝒪-GNN: incorporating ring priors into molecular modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Making Better Decision by Directly Planning in Continuous Control.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

De Novo Molecular Generation via Connection-aware Motif Mining.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Revisiting Machine-Learning based Drug Repurposing: Drug Indications Are Not a Right Prediction Target.

[BibT_eX]

[DOI]

,

,

David Seung U. Lee

,

,

,

,

,

Proceedings of the Conference on Health, Inference, and Learning, 2023

MolXPT: Wrapping Molecules with Text for Generative Pre-training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Dynamic and Efficient Inference for Text Generation via BERT Family.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

AMOM: Adaptive Masking over Masking for Conditional Masked Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Retrosynthesis Prediction with Local Template Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Discovering drug-target interaction knowledge from biomedical literature.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Bioinform., November, 2022

Direct Molecular Conformation Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Trans. Mach. Learn. Res., 2022

Multimodal Sentiment Analysis With Two-Phase Multi-Task Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Multi-Teacher Distillation With Single Model for Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2022

End-to-end entity-aware neural machine translation.

[BibT_eX]

[DOI]

,

,

,

,

,

Mach. Learn., 2022

A study of BERT for context-aware neural machine translation.

[BibT_eX]

[DOI]

,

,

,

,

,

Mach. Learn., 2022

Incorporating Pre-training Paradigm for Antibody Sequence-Structure Co-design.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2022

Tailoring Molecules for Protein Pockets: a Transformer-based Generative Solution for Structured-based Drug Design.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2022

SMT-DTA: Improving Drug-Target Affinity Prediction with Semi-supervised Multi-task Training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2022

Dynamic Relation Discovery and Utilization in Multi-Entity Time Series Forecasting.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

AF<sub>2</sub>: Adaptive Focus Framework for Aerial Imagery Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2022

Direct Molecular Conformation Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2022

Back translation for molecule generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Bioinform., 2022

SPRoBERTa: protein embedding learning with local fragment modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Briefings Bioinform., 2022

Unified 2D and 3D Pre-Training of Molecular Representations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

RetroGraph: Retrosynthetic Planning with Graph Search.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Target-Side Input Augmentation for Sequence to Sequence Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

JANUS: Joint Autoregressive and Non-autoregressive Training with Auxiliary Loss for Sequence Generation.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

C<sup>2</sup>-Rec: An Effective Consistency Constraint for Sequential Recommendation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2021

Pre-training Co-evolutionary Protein Representation via A Pairwise Masked Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2021

Discovering Drug-Target Interaction Knowledge from Biomedical Literature.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2021

R-Drop: Regularized Dropout for Neural Networks.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

UniDrop: A Simple yet Effective Technique to Improve Transformer without Extra Cost.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

mixSeq: A Simple Data Augmentation Methodfor Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Temporally Correlated Task Scheduling for Sequence Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

IOT: Instance-wise Layer Reordering for Transformer Structures.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Learning to Reweight with Deep Interactions.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

ADD: Augmented Disentanglement Distillation Framework for Improving Stock Trend Forecasting.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2020

Masked Contrastive Representation Learning for Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2020

Learn to Use Future Information in Simultaneous Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2020

Learning to Teach with Deep Interactions.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2020

Multi-branch Attentive Transformer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2020

Sequence Generation with Mixed Representations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 37th International Conference on Machine Learning, 2020

Incorporating BERT into Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 8th International Conference on Learning Representations, 2020

Transductive Ensemble Learning for Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

ChengXiang Zhai

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Beyond Error Propagation: Language Branching Also Affects the Accuracy of Sequence Generation.

[BibT_eX]

[DOI]

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Microsoft Research Asia's Systems for WMT19.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2019

Efficient Bidirectional Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2019

Microsoft Research Asia's Systems for WMT19.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Fourth Conference on Machine Translation, 2019

Machine Translation With Weakly Paired Documents.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Exploiting Monolingual Data at Scale for Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Depth Growing for Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Soft Contextual Data Augmentation for Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Achieving Human Parity on Automatic Chinese to English News Translation.

[BibT_eX]

[DOI]

,

,

,

Vishal Chowdhary

,

,

Christian Federmann

,

,

Marcin Junczys-Dowmunt

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2018

Learning to Teach with Dynamic Loss Functions.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Efficient Sequence Learning with Group Recurrent Networks.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

A Study of Reinforcement Learning for Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Beyond Error Propagation in Neural Machine Translation: Characteristics of Language Also Matter.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Adversarial Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of The 10th Asian Conference on Machine Learning, 2018

Word Attention for Sequence to Sequence Text Understanding.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Deliberation Networks: Sequence Generation Beyond One-Pass Decoding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Sequence Prediction with Unlabeled Data by Reward Function Learning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Loading...