Hongwei Xue

According to our database¹, Hongwei Xue authored at least 21 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models.

[BibT_eX]

[DOI]

CoRR, April, 2026

MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning.

[BibT_eX]

[DOI]

CoRR, March, 2026

Beyond Scattered Acceptance: Fast and Coherent Inference for DLMs via Longest Stable Prefixes.

[BibT_eX]

[DOI]

CoRR, March, 2026

SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs.

[BibT_eX]

[DOI]

CoRR, February, 2026

2025

You Only Forward Once: An Efficient Compositional Judging Paradigm.

[BibT_eX]

[DOI]

CoRR, November, 2025

CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms.

[BibT_eX]

[DOI]

CoRR, May, 2025

2024

Multi-Modal Generative Embedding Model.

[BibT_eX]

[DOI]

CoRR, 2024

Visual Perception by Large Language Model's Weights.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

2023

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Stare at What You See: Masked Image Modeling without Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

A coarse-to-fine and automatic algorithm for breast diagnosis on multi-series MRI images.

[BibT_eX]

[DOI]

Frontiers Comput. Sci., 2022

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment.

[BibT_eX]

[DOI]

CoRR, 2022

Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Tri-axial Motion Sensing with Mechanomagnetic Effect for Human-Machine Interface.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Robotics and Applications - 15th International Conference, 2022

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training.

[BibT_eX]

[DOI]

CoRR, 2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Fine-Grained Motion Embedding for Landscape Animation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Semantic Tag Augmented XlanV Model for Video Captioning.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Unifying Multimodal Transformer for Bi-directional Image and Text Generation.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020

Sed-Net: Detecting Multi-Type Edits Of Images.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Hongwei Xue

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...