Zichen Wen
ORCID: 0009-0002-6157-5898
According to our database, Zichen Wen authored at least 41 papers between 2024 and 2026.
Bibliography
2026
StreamMeCo: Long-Term Agent Memory Compression for Efficient Streaming Video Understanding.
CoRR, April, 2026
CoRR, April, 2026
Flash-Unified: A Training-Free and Task-Aware Acceleration Framework for Native Unified Models.
CoRR, March, 2026
CoRR, February, 2026
CoRR, January, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, December, 2025
CoRR, December, 2025
CoRR, December, 2025
CoRR, December, 2025
VideoCompressa: Data-Efficient Video Understanding via Joint Temporal Compression and Spatial Reconstruction.
CoRR, November, 2025
OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation.
CoRR, October, 2025
CoRR, October, 2025
CoRR, October, 2025
AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding and Efficiency in Audio LLMs.
CoRR, October, 2025
Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods.
CoRR, October, 2025
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation.
CoRR, October, 2025
Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning.
CoRR, September, 2025
CoRR, September, 2025
Prune2Drive: A Plug-and-Play Framework for Accelerating Vision-Language Models in Autonomous Driving.
CoRR, August, 2025
CoRR, August, 2025
CoRR, July, 2025
EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action Models.
CoRR, June, 2025
CoRR, June, 2025
CoRR, May, 2025
Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation.
CoRR, March, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Stop Looking for "Important Tokens" in Multimodal Language Models: Duplication Matters More.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
AIDBench: A benchmark for evaluating the authorship identification capability of large language models.
CoRR, 2024
CoRR, 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024