Zhijie Lin

Orcid: 0000-0003-3461-8952

Affiliations:
  • Zhejiang University, Hangzhou, China


According to our database1, Zhijie Lin authored at least 35 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
LoCo: Low-Bit Communication Adaptor for Large-Scale Model Training.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2025

Seedance 1.0: Exploring the Boundaries of Video Generation Models.
CoRR, June, 2025

SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training.
CoRR, June, 2025

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model.
CoRR, April, 2025

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation.
CoRR, March, 2025

Long Context Tuning for Video Generation.
CoRR, March, 2025

Parallelized Autoregressive Visual Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
How Far is Video Generation from World Model: A Physical Law Perspective.
CoRR, 2024

Loong: Generating Minute-level Long Videos with Autoregressive Language Models.
CoRR, 2024

PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning.
CoRR, 2024

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation.
CoRR, 2024

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Towards Garment Sewing Pattern Reconstruction from a Single Image.
ACM Trans. Graph., December, 2023

Weakly-Supervised Video Moment Retrieval via Regularized Two-Branch Proposal Networks with Erasing Mechanism.
CoRR, 2023

ChatAnything: Facetime Chat with LLM-Enhanced Personas.
CoRR, 2023

Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models.
CoRR, 2023

BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs.
CoRR, 2023

EditAnything: Empowering Unparalleled Flexibility in Image Editing and Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

DATE: Domain Adaptive Product Seeker for E-Commerce.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Pseudo Numerical Methods for Diffusion Models on Manifolds.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks.
IEEE Trans. Multim., 2021

SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning to Rehearse in Long Sequence Memorization.
Proceedings of the 38th International Conference on Machine Learning, 2021

Cascaded Prediction Network via Segment Tree for Temporal Video Grounding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Moment Retrieval via Cross-Modal Interaction Networks With Query Reconstruction.
IEEE Trans. Image Process., 2020

Counterfactual Contrastive Learning for Weakly-Supervised Vision-Language Grounding.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Object-Aware Multi-Branch Relation Networks for Spatio-Temporal Video Grounding.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Weakly-Supervised Video Moment Retrieval via Semantic Completion Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Cross-Modal Interaction Networks for Query-Based Moment Retrieval in Videos.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Localizing Unseen Activities in Video via Image Query.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Location-Based End-to-End Speech Recognition with Multiple Language Models.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019


  Loading...