Yicong Hong

Orcid: 0000-0002-5068-1508

According to our database1, Yicong Hong authored at least 30 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Diffusion Transformer-to-Mamba Distillation for High-Resolution Image Generation.
CoRR, June, 2025

Test-Time Training Done Right.
CoRR, May, 2025

VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation.
CoRR, March, 2025

REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder.
CoRR, March, 2025

Pushing the Boundaries of State Space Models for Image and Video Generation.
CoRR, February, 2025

Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Progressive Autoregressive Video Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

2024
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts.
CoRR, 2024

Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats.
CoRR, 2024

Bi-directional Training for Composed Image Retrieval via Text Prompt Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation.
Proceedings of the Robotics: Science and Systems XX, 2024

Instant3D: Fast Text-to-3D with Sparse-view Generation and Large Reconstruction Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

LRM: Large Reconstruction Model for Single Image to 3D.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Augmented Commonsense Knowledge for Remote Object Grounding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
HOP+: History-Enhanced and Order-Aware Pre-Training for Vision-and-Language Navigation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Scaling Data Generation in Vision-and-Language Navigation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Navigational Visual Representations with Semantic Map Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
1st Place Solutions for RxR-Habitat Vision-and-Language Navigation Competition (CVPR 2022).
CoRR, 2022

HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation.
CoRR, 2022

HOP: History-and-Order Aware Pretraining for Vision-and-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation.
CoRR, 2021

Learning structure-aware semantic segmentation with image-level supervision.
Proceedings of the International Joint Conference on Neural Networks, 2021

The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

VLN BERT: A Recurrent Vision-and-Language BERT for Navigation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
A Recurrent Vision-and-Language BERT for Navigation.
CoRR, 2020

Language and Visual Entity Relationship Graph for Agent Navigation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Sub-Instruction Aware Vision-and-Language Navigation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020


  Loading...