Jinxu Zhang
Orcid: 0009-0000-9876-1454
According to our database1,
Jinxu Zhang authored at least 11 papers
between 2023 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
Int. J. Comput. Vis., December, 2025
DocRouter: Prompt guided vision transformer and Mixture of Experts connector for document understanding.
Inf. Fusion, 2025
Predicting trajectories of coastal area vessels with a lightweight Slice-Diff self attention.
Complex Intell. Syst., 2025
DREAM: Integrating Hierarchical Multimodal Retrieval with Multi-page Multimodal Language Model for Documents VQA.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
MP-FIRE: An End-to-End Cross-Modal Framework for Complex Multi-Page Document Question Answering.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025
DocAssistant: Integrating Key-region Reading and Step-wise Reasoning for Robust Document Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
2024
CFRet-DVQA: Coarse-to-Fine Retrieval and Efficient Tuning for Document Visual Question Answering.
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
A Sequence-to-Sequence Based Muti-Semantic Network with Attention for Long-Term Vessel Trajectory Prediction.
Proceedings of the 2nd International Conference on Computer, 2024
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
CoRR, 2023