Rui Qian

Affiliations:
  • Apple Inc.
  • Cornell University, Ithaca, NY, USA (former, PhD)
  • Peking University, Institute of Computer Science and Technology, Beijing, China (former)


According to our database, Rui Qian authored at least 18 papers between 2018 and 2025.

Bibliography

2025
CAR-Flow: Condition-Aware Reparameterization Aligns Source and Target for Better Flow Matching.
CoRR, September, 2025

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer.
CoRR, September, 2025

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation.
CoRR, March, 2025

2024
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models.
Trans. Mach. Learn. Res., 2024

Imagen 3.
CoRR, 2024

VideoPrism: A Foundational Visual Encoder for Video Understanding.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2022
Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models.
CoRR, 2022

Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset.
Proceedings of Computer Vision - ECCV 2022, 2022

Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On Temporal Granularity in Self-Supervised Video Representation Learning.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Exploring Temporal Granularity in Self-Supervised Video Representation Learning.
CoRR, 2021

Revisiting 3D ResNets for Video Recognition.
CoRR, 2021

VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text.
Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Spatiotemporal Contrastive Video Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Weakly Supervised Scene Parsing with Point-Based Distance Metric Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Attentive Generative Adversarial Network for Raindrop Removal From a Single Image.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

