Zhijie Lin

Orcid: 0000-0003-3461-8952

Affiliations:

Zhejiang University, Hangzhou, China

According to our database, Zhijie Lin authored at least 35 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2025

FARMER: Flow AutoRegressive Transformer over Pixels.

CoRR, October, 2025

LoCo: Low-Bit Communication Adaptor for Large-Scale Model Training.

IEEE Trans. Pattern Anal. Mach. Intell., June, 2025

Seedance 1.0: Exploring the Boundaries of Video Generation Models.

CoRR, June, 2025

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training.

CoRR, June, 2025

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation.

CoRR, March, 2025

Long Context Tuning for Video Generation.

CoRR, March, 2025

How Far Is Video Generation from World Model: A Physical Law Perspective.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Parallelized Autoregressive Visual Generation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Loong: Generating Minute-level Long Videos with Autoregressive Language Models.

CoRR, 2024

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning.

CoRR, 2024

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation.

CoRR, 2024

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023

Towards Garment Sewing Pattern Reconstruction from a Single Image.

ACM Trans. Graph., December, 2023

Weakly-Supervised Video Moment Retrieval via Regularized Two-Branch Proposal Networks with Erasing Mechanism.

CoRR, 2023

ChatAnything: Facetime Chat with LLM-Enhanced Personas.

CoRR, 2023

Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models.

CoRR, 2023

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs.

CoRR, 2023

EditAnything: Empowering Unparalleled Flexibility in Image Editing and Generation.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

DATE: Domain Adaptive Product Seeker for E-Commerce.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models.

Zijian Zhang, Zhou Zhao, Zhijie Lin

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Pseudo Numerical Methods for Diffusion Models on Manifolds.

Proceedings of the Tenth International Conference on Learning Representations, 2022

2021

Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks.

IEEE Trans. Multim., 2021

SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory.

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning to Rehearse in Long Sequence Memorization.

Proceedings of the 38th International Conference on Machine Learning, 2021

Cascaded Prediction Network via Segment Tree for Temporal Video Grounding.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Moment Retrieval via Cross-Modal Interaction Networks With Query Reconstruction.

IEEE Trans. Image Process., 2020

Counterfactual Contrastive Learning for Weakly-Supervised Vision-Language Grounding.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Object-Aware Multi-Branch Relation Networks for Spatio-Temporal Video Grounding.

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Weakly-Supervised Video Moment Retrieval via Semantic Completion Network.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos.

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Localizing Unseen Activities in Video via Image Query.

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Location-Based End-to-End Speech Recognition with Multiple Language Models.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
