Jian Wang

Orcid: 0000-0001-5266-3808

Affiliations:
  • Snap Inc., Venice, CA, USA
  • Carnegie Mellon University, Department of Electrical and Computer Engineering, Pittsburgh, PA, USA (PhD 2018)


According to our database1, Jian Wang authored at least 90 papers between 2005 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Velocity Disambiguation for Video Frame Interpolation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2026

Bounded-Compute Multimodal Regression for Product-Rating Prediction.
CoRR, May, 2026

Helix4D: Complex 4D Mesh Generation.
CoRR, May, 2026

ScaleMoGen: Autoregressive Next-Scale Prediction for Human Motion Generation.
CoRR, May, 2026

A Survey on Human Interaction Motion Generation.
Int. J. Comput. Vis., March, 2026

HandX: Scaling Bimanual Motion and Interaction Generation.
CoRR, March, 2026

Unleashing Guidance Without Classifiers for Human-Object Interaction Animation.
CoRR, March, 2026

Process-of-Thought Reasoning for Videos.
CoRR, February, 2026

Why Keep Your Doubts to Yourself? Trading Visual Uncertainties in Multi-Agent Bandit Systems.
CoRR, January, 2026

RigMo: Unifying Rig and Motion Learning for Generative Animation.
CoRR, January, 2026

3D-Agent:Tri-Modal Multi-Agent Collaboration for Scalable 3D Object Annotation.
CoRR, January, 2026

Animated 3DGS Avatars in Diverse Scenes with Consistent Lighting and Shadows.
CoRR, January, 2026

Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026

LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Top-Down Semantic Refinement for Image Captioning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

3DAlign-DAER: Dynamic Attention Policy and Efficient Retrieval Strategy for Fine-grained 3D-Text Alignment at Scale.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Cost-Effective Communication: An Auction-based Method for Language Agent Interaction.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models.
CoRR, December, 2025

LLM-CAS: Dynamic Neuron Perturbation for Real-Time Hallucination Correction.
CoRR, December, 2025

HybridToken-VLM: Hybrid Token Compression for Vision-Language Models.
CoRR, December, 2025

MM-CoT:A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models.
CoRR, December, 2025

AHA! Animating Human Avatars in Diverse Scenes with Gaussian Splatting.
CoRR, November, 2025

Boosting Fidelity for Pre-Trained-Diffusion-Based Low-Light Image Enhancement via Condition Refinement.
CoRR, October, 2025

Text2Interact: High-Fidelity and Diverse Text-to-Two-Person Interaction Generation.
CoRR, October, 2025

Towards Redundancy Reduction in Diffusion Models for Efficient Video Super-Resolution.
CoRR, September, 2025

VQualA 2025 Challenge on Engagement Prediction for Short Videos: Methods and Results.
CoRR, September, 2025

CF-VLM:CounterFactual Vision-Language Fine-tuning.
CoRR, June, 2025

From Motion to Behavior: Hierarchical Modeling of Humanoid Generative Behavior Control.
CoRR, June, 2025

DiffVQA: Video Quality Assessment Using Diffusion Feature Extractor.
CoRR, May, 2025

Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion.
CoRR, March, 2025

A Survey on Human Interaction Motion Generation.
CoRR, March, 2025

Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars.
CoRR, March, 2025

Privacy-Preserving Visual Localization With Event Cameras.
IEEE Trans. Image Process., 2025

Personalized Restoration via Dual-Pivot Tuning.
IEEE Trans. Image Process., 2025

Towards 4D human video stylization.
Comput. Vis. Image Underst., 2025

InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

4KAgent: Agentic Any Image to 4K Super-Resolution.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

SnapMoGen: Human Motion Generation from Expressive Texts.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems.
Proceedings of the Forty-second International Conference on Machine Learning, 2025


T2Bs: Text-to-Character Blendshapes via Video Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Ponimator: Unfolding Interactive Pose for Versatile Human-Human Interaction Animation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025


Scenemi: Motion In-Betweening for Modeling Human-Scene Interactions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

DiffBody: Human Body Image Restoration with Generative Diffusion Prior.
Proceedings of the IEEE International Conference on Computational Photography, 2025

Exposure-Limited Image Enhancement with Generative Diffusion Prior.
Proceedings of the IEEE International Conference on Computational Photography, 2025

DrDiff: Dynamic Routing Diffusion with Hierarchical Attention for Breaking the Efficiency-Quality Trade-off.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Ego4o: Egocentric Human Motion Capture and Understanding from Multi-Modal Input.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Perspective-Aligned AR Mirror with Under-Display Camera.
ACM Trans. Graph., December, 2024

DisCO: Portrait Distortion Correction with Perspective-Aware 3D GANs.
Int. J. Comput. Vis., November, 2024

Light Codes for Fast Two-Way Human-Centric Visual Communication.
ACM Trans. Graph., February, 2024

Learning to Recover Spectral Reflectance From RGB Images.
IEEE Trans. Image Process., 2024

Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior.
CoRR, 2024

MIPI 2024 Challenge on Nighttime Flare Removal: Methods and Results.
CoRR, 2024

DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior.
CoRR, 2024

Matting by Generation.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Delving Deep into Engagement Prediction of Short Videos.
Proceedings of the Computer Vision - ECCV 2024, 2024

Holodepth: Programmable Depth-Varying Projection via Computer-Generated Holography.
Proceedings of the Computer Vision - ECCV 2024, 2024

RobustSAM: Segment Anything Robustly on Degraded Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
N-euro Predictor: A Neural Network Approach for Smoothing and Predicting Motion Trajectory.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., September, 2023

Glass Segmentation With RGB-Thermal Image Pairs.
IEEE Trans. Image Process., 2023

A Unified Conditional Framework for Diffusion-based Image Restoration.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

QfaR: Location-Guided Scanning of Visual Codes from Long Distances.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

Be Real in Scale: Swing for True Scale in Dual Camera Mode.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2023

Energy-Efficient Adaptive 3D Sensing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Privacy-Preserving Visual Localization with Event Cameras.
CoRR, 2022

Structured Light with Redundancy Codes.
CoRR, 2022

Seeing Far in the Dark with Patterned Flash.
Proceedings of the Computer Vision - ECCV 2022, 2022

3D Photo Stylization: Learning to Generate Stylized Novel Views from a Single Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Seeing in Extra Darkness Using a Deep-Red Flash.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Group Contextual Encoding for 3D Point Clouds.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Micro-Baseline Structured Light.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Agile Depth Sensing Using Triangulation Light Curtains.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Programmable Triangulation Light Curtains.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Reflectance Capture Using Univariate Sampling of BRDFs.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Compressive spectral anomaly detection.
Proceedings of the 2017 IEEE International Conference on Computational Photography, 2017

2016
Dual Structured Light 3D Using a 1D Sensor.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
Photometric Stereo with Small Angular Variations.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

LiSens- A Scalable Architecture for Video Compressive Sensing.
Proceedings of the 2015 IEEE International Conference on Computational Photography, 2015

Reconstruction-free inference on compressive measurements.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

2011
Human Identification and Gender Recognition from Boxing.
Proceedings of the Biometric Recognition - 6th Chinese Conference, 2011

2009
SRAM parametric failure analysis.
Proceedings of the 46th Design Automation Conference, 2009

2007
Parameterized Macromodeling for Analog System-Level Design Exploration.
Proceedings of the 44th Design Automation Conference, 2007

2005
Performance-centering optimization for system-level analog design exploration.
Proceedings of the 2005 International Conference on Computer-Aided Design, 2005


  Loading...