We stand with Ukraine

We stand with Ukraine

Pengfei Wan

Orcid: 0000-0001-7225-565X

Affiliations:

Kuaishou Technology, Beijing, China
Meitu Inc., Beijing, China (former)
Hong Kong University of Science and Technology, Hong Kong (PhD 2015)

According to our database¹, Pengfei Wan authored at least 149 papers between 2012 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org
on scholar.google.com

On csauthors.net:

Bibliography

2025

MotionCrafter: Plug-and-Play Motion Guidance for Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

IEEE Trans. Vis. Comput. Graph., October, 2025

NeRFFaceShop: Learning a Photo-Realistic 3D-Aware Generative Model of Animatable and Relightable Heads From Large-Scale in-the-Wild Videos.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Vis. Comput. Graph., October, 2025

A-SDM: Accelerating Stable Diffusion Through Model Assembly and Feature Inheritance Strategies.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Neural Networks Learn. Syst., October, 2025

OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, October, 2025

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

Latent Diffusion Model without Variational Autoencoder.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, October, 2025

Terra: Explorable Native 3D World Model with Point Latents.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, October, 2025

Less is More: Improving LLM Reasoning with Minimal Test-Time Intervention.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, October, 2025

Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, October, 2025

AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

UniVideo: Unified Understanding, Generation, and Editing for Videos.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, October, 2025

UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, October, 2025

Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs.

[BibT_eX]

[DOI]

Jia Jun Cheng Xian

,

,

,

,

,

,

CoRR, September, 2025

OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

HunyuanImage 3.0 Technical Report.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

VC-Agent: An Interactive Agent for Customized Video Dataset Collection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, September, 2025

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, August, 2025

Score Augmentation for Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, August, 2025

DVIS++: Improved Decoupled Framework for Universal Video Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., July, 2025

Imbalance in Balance: Online Concept Balancing in Generation Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, July, 2025

VMoBA: Mixture-of-Block Attention for Video Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, June, 2025

SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, June, 2025

VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

UNIC: Unified In-Context Video Editing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, June, 2025

CamCloneMaster: Enabling Reference-based Camera Control for Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, June, 2025

Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, June, 2025

RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, May, 2025

OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, May, 2025

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

Scaling Image and Video Generation via Test-Time Evolutionary Search.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, May, 2025

Training-Free Efficient Video Generation via Dynamic Token Carving.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, May, 2025

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption.

[BibT_eX]

[DOI]

Tianxiong Zhong

,

,

,

,

,

,

CoRR, May, 2025

Flow-GRPO: Training Flow Matching Models via Online RL.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, May, 2025

A Survey of Interactive Generative Video.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, April, 2025

BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, April, 2025

SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, April, 2025

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, March, 2025

SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2025

FullDiT: Multi-Task Video Generative Foundation Model with Full Attention.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, March, 2025

Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, March, 2025

Position: Interactive Generative Video as Next-Generation Game Engine.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2025

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

MTV-Inpaint: Multi-Task Long Video Inpainting.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, March, 2025

ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, March, 2025

FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, February, 2025

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, February, 2025

Improving Video Generation with Human Feedback.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, January, 2025

GameFactory: Creating New Games with Generative Interactive Videos.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, January, 2025

ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, January, 2025

DiffCap: Diffusion-Based Real-Time Human Motion Capture Using Sparse IMUs and a Monocular Camera.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Vis. Comput. Graph., 2025

3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Stable Segment Anything Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Towards Precise Scaling Laws for Video Diffusion Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Victor Shea-Jay Huang

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

StyleMaster: Stylize Your Video with Artistic Generation and Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SketchVideo: Sketch-based Video Generation and Editing.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GPAvatar: High-fidelity Head Avatars by Learning Efficient Gaussian Projections.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Owl-1: Omni World Model for Consistent Long Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing.

[BibT_eX]

[DOI]

,

Tianxiong Zhong

,

,

,

,

,

,

CoRR, 2024

MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

ViMo: Generating Motions from Casual Videos.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

4Dynamic: Text-to-4D Generation with Hybrid Priors.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

VideoTetris: Towards Compositional Text-to-Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

Motion Inversion for Video Customization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

Towards Unified 3D Hair Reconstruction from Single-View Portraits.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

VRMM: A Volumetric Relightable Morphable Head Model.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

VideoTetris: Towards Compositional Text-to-Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

PlacidDreamer: Advancing Harmony in Text-to-3D Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Agent Attention: On the Integration of Softmax and Linear Attention.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Multi-Modal Face Stylization with a Generative Prior.

[BibT_eX]

[DOI]

,

,

,

,

,

Comput. Graph. Forum, October, 2023

Snowflake Point Deconvolution for Point Cloud Completion and Generation With Skip-Transformer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

EM-Gaze: eye context correlation and metric learning for gaze estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

Vis. Comput. Ind. Biomed. Art, 2023

Predicting Personalized Head Movement From Short Video and Speech Signal.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Multim., 2023

PMP-Net++: Point Cloud Completion by Transformer-Enhanced Multi-Step Point Moving Paths.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., 2023

I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Stable Segment Anything Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

Temporal-Aware Refinement for Video-based Human Pose and Shape Recovery.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2023

1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

Towards Practical Capture of High-Fidelity Relightable Avatars.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

Augmentation-Aware Self-Supervision for Data-Efficient GAN Training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Automatic Human Scene Interaction through Contact Estimation and Motion Adaptation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

DVIS: Decoupled Video Instance Segmentation Framework.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FEditNet: Few-Shot Editing of Latent Semantics in GAN Spaces.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Bridging CLIP and StyleGAN through Latent Alignment for Image Editing.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

ITTR: Unpaired Image-to-Image Translation with Transformers.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2022

Debiased Self-Training for Semi-Supervised Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning an Inference-accelerated Network from a Pre-trained Model with Frequency-enhanced Feature Distillation.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Exploring Set Similarity for Dense Self-supervised Representation Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Assessing a Single Image in Reference-Guided Image Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Write-An-Animation: High-level Text-based Animation Editing with Character-Scene Interaction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Comput. Graph. Forum, 2021

BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PMP-Net: Point Cloud Completion by Learning Multi-Step Point Moving Paths.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Cycle4Completion: Unpaired Point Cloud Completion Using Cycle Transformation With Missing Region Coding.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study.

[BibT_eX]

[DOI]

,

,

,

,

Walter J. Scheirer

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

IEEE Trans. Image Process., 2020

2019

High Bit-Depth Image Acquisition Framework Using Embedded Quantization Bias.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Computational Imaging, 2019

GraphPoseGAN: 3D Hand Pose Estimation from a Monocular RGB Image via Adversarial Learning on Graphs.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2019

NTIRE 2019 Challenge on Image Enhancement: Methods and Results.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

NTIRE 2019 Image Dehazing Challenge Report.

[BibT_eX]

[DOI]

Codruta O. Ancuti

,

,

,

,

,

Ming-Hsuan Yang

,

,

,

Venkateswararao Cherukuri

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Pablo Navarrete Michelini

,

,

,

,

Sanchayan Santra

,

,

Bhabatosh Chanda

,

,

Tzofi Klinghoffer

,

,

,

,

,

,

,

Kuldeep Purohit

,

,

A. N. Rajagopalan

,

Raimondo Schettini

,

,

,

,

,

,

,

,

Subrahmanyam Murala

,

,

Harshjeet Singh Aulakh

,

Tianxiang Zheng

,

,

,

,

,

Jean-Philippe Tarel

,

Chuansheng Wang

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018

Range Scaling Global U-Net for Perceptual Image Enhancement on Mobile Devices.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

2016

Image Bit-Depth Enhancement via Maximum A Posteriori Estimation of AC Signal.

[BibT_eX]

[DOI]

,

,

Dinei A. F. Florêncio

,

,

IEEE Trans. Image Process., 2016

2015

Precision Enhancement of 3-D Surfaces from Compressed Multiview Depth Maps.

[BibT_eX]

[DOI]

,

,

,

Dinei A. F. Florêncio

,

,

IEEE Signal Process. Lett., 2015

Motion vector fields based video coding.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

2014

Precision Enhancement of 3D Surfaces from Multiple Compressed Depth Maps.

[BibT_eX]

[DOI]

,

,

,

Dinei A. F. Florêncio

,

,

CoRR, 2014

Solving dense stereo matching via quadratic programming.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

A fast intermode decision algorithm based on analysis of inter prediction residual.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE 16th International Workshop on Multimedia Signal Processing, 2014

Image bit-depth enhancement via maximum-a-posteriori estimation of graph AC component.

[BibT_eX]

[DOI]

,

,

Dinei A. F. Florêncio

,

,

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

High bit-precision image acquisition and reconstruction by planned sensor distortion.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

DCT coefficients generation model for film grain noise and its application in super-resolution.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Palette-based compound image compression in HEVC by exploiting non-local spatial correlation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2014

SSIM-based rate-distortion optimization in H.264.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Improved temporal psychovisual modulation for backward-compatible stereoscopic display.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014

Fast binary motion estimation for screen content video coding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013

3-D Motion Estimation for Visual Saliency Modeling.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Signal Process. Lett., 2013

Precision enhancement of 3D surfaces from multiple quantized depth maps.

[BibT_eX]

[DOI]

,

,

,

Dinei Florêncio

,

,

Proceedings of the 11th IVMSP Workshop: 3D Image/Video Technologies and Applications, 2013

Personal photo album compression and management.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Optimal dependent bit allocation for AVS intra-frame coding via successive convex approximation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Image Processing, 2013

A robust interpolation-free approach for sub-pixel accuracy motion estimation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Image Processing, 2013

3D motion in visual saliency modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

From 2D Extrapolation to 1D Interpolation: Content Adaptive Image Bit-Depth Expansion.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Image de-quantization via spatially varying sparsity prior.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Super resolution for subpixel-based downsampled images.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Loading...