Anyi Rao

Orcid: 0000-0003-1004-7753

According to our database, Anyi Rao authored at least 27 papers between 2017 and 2023.

Bibliography

2023
A Coarse-to-Fine Framework for Automatic Video Unscreen.
IEEE Trans. Multim., 2023

Cinematic Behavior Transfer via NeRF-based Differentiable Filming.
CoRR, 2023

SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models.
CoRR, 2023

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning.
CoRR, 2023

CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.
CoRR, 2023

Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production.
CoRR, 2023

Automated Conversion of Music Videos into Lyric Videos.
Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, 2023

Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production.
Proceedings of the ACM SIGGRAPH 2023 Posters, 2023

Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

HireVAE: An Online and Adaptive Factor Model Based on Hierarchical and Regime-Switch VAE.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Adding Conditional Control to Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-Supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos.
IEEE Trans. Multim., 2022

Temporal and Contextual Transformer for Multi-Camera Editing of TV Shows.
CoRR, 2022

A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language.
CoRR, 2022

Shoot360: Normal View Video Creation from City Panorama Footage.
Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7, 2022

BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering.
Proceedings of the Computer Vision - ECCV 2022, 2022

AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
CityNeRF: Building NeRF at City Scale.
CoRR, 2021

BlockPlanner: City Block Generation with Vectorized Graph Representation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Online Multi-modal Person Search in Videos.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Unified Framework for Shot Type Classification Based on Subject Centric Lens.
Proceedings of the Computer Vision - ECCV 2020, 2020

MovieNet: A Holistic Dataset for Movie Understanding.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Local-to-Global Approach to Multi-Modal Movie Scene Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2018
Automatic Music Accompanist.
CoRR, 2018

HotFlip: White-Box Adversarial Examples for Text Classification.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
HotFlip: White-Box Adversarial Examples for NLP.
CoRR, 2017

