We stand with Ukraine

We stand with Ukraine

Baoyuan Wang

Orcid: 0000-0002-8268-7517

According to our database¹, Baoyuan Wang authored at least 73 papers between 2008 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

PartNerFace: Part-based Neural Radiance Fields for Animatable Facial Avatar Reconstruction.

[DOI]

,

,

,

,

,

,

CoRR, April, 2026

Boosting MLLM Spatial Reasoning with Geometrically Referenced 3D Scene Representations.

[DOI]

,

,

CoRR, March, 2026

2025

GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance.

[DOI]

,

,

,

,

,

,

,

,

IEEE Trans. Vis. Comput. Graph., October, 2025

Subobject-level Image Tokenization.

[DOI]

,

Samuel Cahyawijaya

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

LLM-driven Multimodal and Multi-Identity Listening Head Generation.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations.

[DOI]

,

,

,

,

,

Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024

ViTMatte: Boosting image matting with pre-trained plain vision transformers.

[DOI]

,

,

,

Inf. Fusion, March, 2024

An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds.

[DOI]

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer.

[DOI]

,

,

CoRR, 2024

Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations.

[DOI]

,

,

,

,

CoRR, 2024

From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting.

[DOI]

,

,

,

CoRR, 2024

Controlling Character Motions without Observable Driving Source.

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents.

[DOI]

,

,

,

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Portrait4D-V2: Pseudo Multi-view Data Creates Better 4D Head Synthesizer.

[DOI]

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

AvatarGPT: All-in-One Framework for Motion Understanding, Planning, Generation and Beyond.

[DOI]

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PICTURE: PhotorealistIC Virtual Try-on from UnconstRained dEsigns.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data.

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Towards Objectively Benchmarking Social Intelligence of Language Agents at the Action Level.

[DOI]

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Visual Instruction Tuning with Polite Flamingo.

[DOI]

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data.

[DOI]

,

,

,

,

CoRR, 2023

AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents.

[DOI]

,

,

,

CoRR, 2023

A Unified Framework for Multimodal, Multi-Part Human Motion Synthesis.

[DOI]

,

,

CoRR, 2023

MDSC: Towards Evaluating the Style Consistency Between Music and Dance.

[DOI]

,

CoRR, 2023

ViTMatte: Boosting Image Matting with Pretrained Plain Vision Transformers.

[DOI]

,

,

,

CoRR, 2023

FashionTex: Controllable Virtual Try-on with Text and Texture.

[DOI]

,

,

,

,

,

Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023

Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Reinforced Disentanglement for Face Swapping without Skip Connection.

[DOI]

,

,

,

Heung-Yeung Shum

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Mimic3D: Thriving 3D-Aware GANs via 3D-to-2D Imitation.

[DOI]

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Orca: A Few-shot Benchmark for Chinese Conversational Machine Reading Comprehension.

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Natural Response Generation for Chinese Reading Comprehension.

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models.

[DOI]

,

,

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

UDE: A Unified Driving Engine for Human Motion Generation.

[DOI]

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis.

[DOI]

,

,

,

Heung-Yeung Shum

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis from Monocular Image.

[DOI]

,

,

Heung-Yeung Shum

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hand Avatar: Free-Pose Hand Animation and Rendering from Monocular Video.

[DOI]

,

,

Heung-Yeung Shum

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Triplet-Free Knowledge-Guided Response Generation.

[DOI]

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification.

[DOI]

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming.

[DOI]

,

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Local-Adaptive Face Recognition via Graph-based Meta-Clustering and Regularized Adaptation.

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds.

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Privacy-preserving Online AutoML for Domain-Specific Face Detection.

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

CRFace: Confidence Ranker for Model-Agnostic Face Detection Refinement.

[DOI]

Noranart Vesdapunt

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Real-Time Burst Photo Selection Using a Light-Head Adversarial Network.

[DOI]

,

Noranart Vesdapunt

,

,

IEEE Trans. Image Process., 2020

Animating Face using Disentangled Audio Representations.

[DOI]

,

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

JNR: Joint-Based Neural Rig Representation for Compact 3D Face Modeling.

[DOI]

Noranart Vesdapunt

,

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

Personalized Face Modeling for Improved Face Reconstruction and Motion Retargeting.

[DOI]

Bindita Chaudhuri

,

Noranart Vesdapunt

,

Linda G. Shapiro

,

Proceedings of the Computer Vision - ECCV 2020, 2020

ReDA: Reinforced Differentiable Attribute for 3D Face Reconstruction.

[DOI]

,

,

,

Noranart Vesdapunt

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Personalized Exposure Control Using Adaptive Metering and Reinforcement Learning.

[DOI]

,

,

Noranart Vesdapunt

,

,

IEEE Trans. Vis. Comput. Graph., 2019

Joint Face Detection and Facial Motion Retargeting for Multiple Faces.

[DOI]

Bindita Chaudhuri

,

Noranart Vesdapunt

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Exposure: A White-Box Photo Post-Processing Framework.

[DOI]

,

,

,

,

ACM Trans. Graph., 2018

Personalized Attention-Aware Exposure Control Using Reinforcement Learning.

[DOI]

,

,

Noranart Vesdapunt

,

,

CoRR, 2018

2017

Understanding and Predicting The Attractiveness of Human Action Shot.

[DOI]

,

,

CoRR, 2017

Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning.

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Computer Vision, 2017

FC^4: Fully Convolutional Color Constancy with Confidence-Weighted Pooling.

[DOI]

,

,

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Automatic Photo Adjustment Using Deep Neural Networks.

[DOI]

,

,

,

,

ACM Trans. Graph., 2016

Weakly Supervised Metric Learning for Traffic Sign Recognition in a LIDAR-Equipped Vehicle.

[DOI]

,

,

,

,

IEEE Trans. Intell. Transp. Syst., 2016

Robust object recognition via weakly supervised metric and template learning.

[DOI]

,

,

,

,

Neurocomputing, 2016

Maximal Sparsity with Deep Networks?

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015

Unsupervised Extraction of Video Highlights via Robust Recurrent Auto-Encoders.

[DOI]

,

,

,

,

,

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Harvesting Discriminative Meta Objects with Deep CNN Features for Scene Classification.

[DOI]

,

,

,

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Adaptive pooling over multiple trajectory attributes for action recognition.

[DOI]

,

,

Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

2014

Automatic Photo Adjustment Using Deep Learning.

[DOI]

,

,

,

,

CoRR, 2014

Action-Gons: Action Recognition with a Discriminative Dictionary of Structured Elements with Varying Granularity.

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ACCV 2014, 2014

2013

Human Action Recognition by Mining Discriminative Segment with Novel Skeleton Joint Feature.

[DOI]

,

,

Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Max-Margin Multiple-Instance Dictionary Learning.

[DOI]

,

,

,

,

Proceedings of the 30th International Conference on Machine Learning, 2013

Action Recognition with Actons.

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Computer Vision, 2013

2012

Parallel H-Tree Based Data Cubing on Graphics Processors.

[DOI]

,

Int. J. Softw. Informatics, 2012

2011

Example-based image color and tone style enhancement.

[DOI]

,

,

ACM Trans. Graph., 2011

2010

Data-driven image color theme enhancement.

[DOI]

,

,

,

,

ACM Trans. Graph., 2010

ZoomTree: Unrestricted Zoom Paths in Multiscale Visual Analysis of Relational Databases.

[DOI]

,

,

,

Proceedings of the Computer Vision, Imaging and Computer Graphics. Theory and Applications, 2010

Multiscale Visualization of Relational Databases using Layered Zoom Trees and Partial Data Cubes.

,

,

,

Proceedings of the IMAGAPP 2010 - Proceedings of the International Conference on Imaging Theory and Applications and IVAPP 2010 - Proceedings of the International Conference on Information Visualization Theory and Applications, Angers, France, May 17, 2010

2008

General Subdomain Boundary Mapping Procedure for Structured Grid Implicit CFD Parallel Computation.

[DOI]

,

,

J. Aerosp. Comput. Inf. Commun., 2008

Loading...