Linli Xu

Orcid: 0000-0003-0227-3793

Affiliations:
  • University of Science and Technology of China (USTC), State Key Laboratory of Cognitive Intelligence, China


According to our database1, Linli Xu authored at least 76 papers between 2012 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Dynamic Token Compression for Efficient Video Understanding through Reinforcement Learning.
CoRR, March, 2026

When Thinking Hurts: Mitigating Visual Forgetting in Video Reasoning via Frame Repetition.
CoRR, March, 2026

Large Reasoning Embedding Models: Towards Next-Generation Dense Retrieval Paradigm.
Proceedings of the ACM Web Conference 2026, 2026

ExpV2S: Zero-Shot Expressive Video-to-Speech Synthesis via Latent Diffusion Model.
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026

Multimodal Table Understanding with Difficulty-aware Reinforcement Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Model.
CoRR, December, 2025

Adaptive Weighting Push-SUM for Decentralized Optimization With Statistical Diversity.
IEEE Trans. Control. Netw. Syst., September, 2025

CROP: Integrating Topological and Spatial Structures via Cross-View Prefixes for Molecular LLMs.
CoRR, August, 2025

ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning.
CoRR, March, 2025

Communication-efficient distributed learning with Local Immediate Error Compensation.
Neural Networks, 2025

<i>ImageScope: </i> Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning.
Proceedings of the ACM on Web Conference 2025, 2025

CROP: Integrating Topological and Spatial Structures via Cross-View Prefixes for Molecular LLMs.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Input Domain Aware MoE: Decoupling Routing Decisions from Task Optimization in Mixture of Experts.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Video In-context Learning: Autoregressive Transformers are Zero-Shot Video Imitators.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Dynamic Prefix as Instructor for Incremental Named Entity Recognition: A Unified Seq2Seq Generation Framework.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

S²MILE: Semantic-and-Structure-Aware Music-Driven Lyric Generation.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Communication-efficient clustered federated learning via model distance.
Mach. Learn., June, 2024

Addressing Representation Collapse in Vector Quantized Models with One Linear Layer.
CoRR, 2024

Video In-context Learning.
CoRR, 2024

Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Empowering Diffusion Models on the Embedding Space for Text Generation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Break the Visual Perception: Adversarial Attacks Targeting Encoded Visual Tokens of Large Vision-Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Bridging Gaps in Content and Knowledge for Multimodal Entity Linking.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks.
Proceedings of the IEEE International Conference on Acoustics, 2024

Summarizing Like Human: Edit-Based Text Summarization with Keywords.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

HRVDA: High-Resolution Visual Document Assistant.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Few-shot Temporal Pruning Accelerates Diffusion Models for Text Generation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Visual Hallucination Elevates Speech Recognition.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
ItrievalKD: An Iterative Retrieval Framework Assisted with Knowledge Distillation for Noisy Text-to-Image Retrieval.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023

Cross and Self Attention Based Graph Convolutional Network for Aspect-Based Sentiment Analysis.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

Multi-Grained Multimodal Interaction Network for Entity Linking.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Span-level Aspect-based Sentiment Analysis via Table Filling.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Difformer: Empowering Diffusion Model on Embedding Space for Text Generation.
CoRR, 2022

Bridging Music and Text with Crowdsourced Music Comments: A Sequence-to-Sequence Framework for Thematic Music Comments Generation.
CoRR, 2022

Semantic-Preserving Abstractive Text Summarization with Siamese Generative Adversarial Net.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

CoCGAN: Contrastive Learning for Adversarial Category Text Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Sequence-to-Action: Grammatical Error Correction with Action Guided Sequence Generation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Adaptive Adapters: An Efficient Way to Incorporate BERT Into Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Towards Variable-Length Textual Adversarial Attacks.
CoRR, 2021

Hierarchical Multi-label Text Classification with Horizontal and Vertical Category Correlations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

STL-SGD: Speeding Up Local SGD with Stagewise Communication Period.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Incorporating BERT into Parallel Sequence Decoding with Adapters.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Label Incorporated Graph Neural Networks for Text Classification.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

IntroVNMT: An Introspective Model for Variational Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Faster Distributed Deep Net Training: Computation and Communication Decoupled Stochastic Gradient Descent.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

SPINE: Structural Identity Preserved Inductive Network Embedding.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Adaptive Proximal Average Based Variance Reducing Stochastic Methods for Optimization with Composite Regularization.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Asynchronous Stochastic Composition Optimization with Variance Reduction.
CoRR, 2018

SPINE: Structural Identity Preserved Inductive Network Embedding.
CoRR, 2018

Tracking and Forecasting Dynamics in Crowdfunding: A Basis-Synthesis Approach.
Proceedings of the IEEE International Conference on Data Mining, 2018

Enhancing Network Embedding with Auxiliary Information: An Explicit Matrix Factorization Perspective.
Proceedings of the Database Systems for Advanced Applications, 2018

2017
Generalized Neural Graph Embedding with Matrix Factorization.
CoRR, 2017

2016
Aligned Matrix Completion: Integrating Consistency and Independency in Multiple Domains.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

2015
Selecting Social Media Responses to News: A Convex Framework Based On Data Reconstruction.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Regularity and Conformity: Location Prediction Using Heterogeneous Mobility Data.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Word Embedding Revisited: A New Representation Learning and Explicit Matrix Factorization Perspective.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Feature Selection with Integrated Relevance and Redundancy Optimization.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

Community Detection Based on Structure and Content: A Content Propagation Perspective.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

A Nonconvex Relaxation Approach for Rank Minimization Problems.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Temporally Adaptive Restricted Boltzmann Machine for Background Modeling.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Exploiting Task-Feature Co-Clusters in Multi-Task Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Learning Low-Rank Label Correlations for Multi-label Classification with Missing Labels.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Robust Dynamic Trajectory Regression on Road Networks: A Multi-task Learning Framework.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

2012
Capturing correlations of multiple labels: A generative probabilistic model for multi-label learning.
Neurocomputing, 2012

Image Denoising and Inpainting with Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Ensemble Pruning via Constrained Eigen-Optimization.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Leveraging tagging for neighborhood-aware probabilistic matrix factorization.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012


  Loading...