Wenyi Hong

According to our database¹, Wenyi Hong authored at least 27 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification.

[BibT_eX]

[DOI]

CoRR, March, 2026

GLM-OCR Technical Report.

[BibT_eX]

[DOI]

CoRR, March, 2026

Glyph: Scaling Context Windows via Visual-Text Compression.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

UI2CodeN: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation.

[BibT_eX]

[DOI]

CoRR, November, 2025

WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation.

[BibT_eX]

[DOI]

CoRR, November, 2025

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.

[BibT_eX]

[DOI]

CoRR, July, 2025

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LVBench: An Extreme Long Video Understanding Benchmark.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large Language Model.

[BibT_eX]

[DOI]

CoRR, 2024

CogVLM2: Visual Language Models for Image and Video Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents.

[BibT_eX]

[DOI]

CoRR, 2024

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer.

[BibT_eX]

[DOI]

CoRR, 2024

LVBench: An Extreme Long Video Understanding Benchmark.

[BibT_eX]

[DOI]

CoRR, 2024

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations.

[BibT_eX]

[DOI]

CoRR, 2024

CogVLM: Visual Expert for Pretrained Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Relay Diffusion: Unifying diffusion process across resolutions for image synthesis.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

CogAgent: A Visual Language Model for GUI Agents.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

CogAgent: A Visual Language Model for GUI Agents.

[BibT_eX]

[DOI]

CoRR, 2023

CogVLM: Visual Expert for Pretrained Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

CogView: Mastering Text-to-Image Generation via Transformers.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2017

Improved Approximation Algorithm for the Combination of Parallel Machine Scheduling and Vertex Cover.

[BibT_eX]

[DOI]

Wenyi Hong

Zhenbo Wang

Int. J. Found. Comput. Sci., 2017

Wenyi Hong

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...