Xin Li

Affiliations:

Tencent YouTu Lab, Shenzhen, China

According to our database¹, Xin Li authored at least 13 papers between 2013 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

DREAM: Document Reconstruction via End-to-end Autoregressive Model.

[BibT_eX]

[DOI]

CoRR, July, 2025

2024

Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

The Devil Is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-training.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training.

[BibT_eX]

[DOI]

CoRR, 2022

Relational Representation Learning in Visually-Rich Documents.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Query-driven Generative Network for Document Information Extraction in the Wild.

[BibT_eX]

[DOI]

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Neural Collaborative Graph Machines for Table Structure Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Show, Read and Reason: Table Structure Recognition with Flexible Context Aggregator.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2016

On application-unbiased benchmarking of web videos from a social network perspective.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

2013

GeSoDeck: a geo-social event detection and tracking system.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia Conference, 2013

Xin Li

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...