Xiang Li

Orcid: 0009-0004-2003-9217

Affiliations:
  • Beihang University, Beijing, China


According to our database1, Xiang Li authored at least 21 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Mixture of disentangled experts with missing modalities for robust multimodal sentiment analysis.
Inf. Fusion, 2026

A multi-scale representation and multi-level decision learning network for multimodal sentiment analysis.
Expert Syst. Appl., 2026

Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Reconstructing KV Caches with Cross-layer Fusion For Enhanced Transformers.
CoRR, December, 2025

Hybrid CNN-Mamba Enhancement Network for Robust Multimodal Sentiment Analysis.
CoRR, July, 2025

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library.
CoRR, June, 2025

TF-Mamba: Text-enhanced Fusion Mamba with Missing Modalities for Robust Multimodal Sentiment Analysis.
CoRR, May, 2025

GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning.
CoRR, April, 2025

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models.
CoRR, February, 2025

ChineseSimpleVQA - "See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models.
CoRR, February, 2025

Learning fine-grained representation with token-level alignment for multimodal sentiment analysis.
Expert Syst. Appl., 2025

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

TF-Mamba: Text-enhanced Fusion Mamba with Missing Modalities for Robust Multimodal Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

ECLIPSE: Efficient Cross-Lingual Log Intelligence Parser with Semantic Entropy-Enhanced LCS Algorithm.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

Reverse Chain-of-Thought and Causal Path Verification: A Modular Plugin for Aligning LLMs with Knowledge Graphs.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

See the World, Discover Knowledge: A Chinese Factuality Evaluation for Large Vision Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
XFormParser: A Simple and Effective Multimodal Multilingual Semi-structured Form Parser.
CoRR, 2024

VIPTR: A Vision Permutable Extractor for Fast and Efficient Scene Text Recognition.
CoRR, 2024

Adaptive Token Selection and Fusion Network for Multimodal Sentiment Analysis.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

SVIPTR: Fast and Efficient Scene Text Recognition with Vision Permutable Extractor.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024


  Loading...