Bin Lin
Orcid: 0009-0003-4805-9730
Affiliations:
- Peking University, Shenzhen Graduate School, Rabbitpre Intelligence, China
According to our database, Bin Lin authored at least 34 papers between 2023 and 2026.
Bibliography
2026
CoRR, March, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization.
CoRR, November, 2025
Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward.
CoRR, November, 2025
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback.
CoRR, October, 2025
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025
FlashI2V: Fourier-Guided Latent Shifting Prevents Conditional Image Leakage in Image-to-Video Generation.
CoRR, September, 2025
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation.
CoRR, June, 2025
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation.
CoRR, May, 2025
SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video.
CoRR, March, 2025
CoRR, March, 2025
Sci. China Inf. Sci., 2025
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
CoRR, 2024
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model.
CoRR, 2024
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023
Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models.
CoRR, 2023
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment.
CoRR, 2023