Baoyuan Wang

Orcid: 0000-0002-8268-7517

According to our database1, Baoyuan Wang authored at least 65 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
ViTMatte: Boosting image matting with pre-trained plain vision transformers.
Inf. Fusion, March, 2024

An Effective Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer.
CoRR, 2024

Subobject-level Image Tokenization.
CoRR, 2024

Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations.
CoRR, 2024

From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting.
CoRR, 2024

Controlling Character Motions without Observable Driving Source.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Visual Instruction Tuning with Polite Flamingo.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance.
CoRR, 2023

PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns.
CoRR, 2023

Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data.
CoRR, 2023

AgentAvatar: Disentangling Planning, Driving and Rendering for Photorealistic Avatar Agents.
CoRR, 2023

A Unified Framework for Multimodal, Multi-Part Human Motion Synthesis.
CoRR, 2023

AvatarGPT: All-in-One Framework for Motion Understanding, Planning, Generation and Beyond.
CoRR, 2023

HAVE-FUN: Human Avatar Reconstruction from Few-Shot Unconstrained Images.
CoRR, 2023

MDSC: Towards Evaluating the Style Consistency Between Music and Dance.
CoRR, 2023

ViTMatte: Boosting Image Matting with Pretrained Plain Vision Transformers.
CoRR, 2023

FashionTex: Controllable Virtual Try-on with Text and Texture.
Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023

Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Reinforced Disentanglement for Face Swapping without Skip Connection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Mimic3D: Thriving 3D-Aware GANs via 3D-to-2D Imitation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Orca: A Few-shot Benchmark for Chinese Conversational Machine Reading Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Natural Response Generation for Chinese Reading Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

UDE: A Unified Driving Engine for Human Motion Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis from Monocular Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hand Avatar: Free-Pose Hand Animation and Rendering from Monocular Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Triplet-Free Knowledge-Guided Response Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Local-Adaptive Face Recognition via Graph-based Meta-Clustering and Regularized Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Privacy-preserving Online AutoML for Domain-Specific Face Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
CRFace: Confidence Ranker for Model-Agnostic Face Detection Refinement.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Real-Time Burst Photo Selection Using a Light-Head Adversarial Network.
IEEE Trans. Image Process., 2020

Animating Face using Disentangled Audio Representations.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

JNR: Joint-Based Neural Rig Representation for Compact 3D Face Modeling.
Proceedings of the Computer Vision - ECCV 2020, 2020

Personalized Face Modeling for Improved Face Reconstruction and Motion Retargeting.
Proceedings of the Computer Vision - ECCV 2020, 2020

ReDA: Reinforced Differentiable Attribute for 3D Face Reconstruction.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Personalized Exposure Control Using Adaptive Metering and Reinforcement Learning.
IEEE Trans. Vis. Comput. Graph., 2019

Joint Face Detection and Facial Motion Retargeting for Multiple Faces.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Exposure: A White-Box Photo Post-Processing Framework.
ACM Trans. Graph., 2018

Personalized Attention-Aware Exposure Control Using Reinforcement Learning.
CoRR, 2018

2017
Understanding and Predicting The Attractiveness of Human Action Shot.
CoRR, 2017

Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

FC^4: Fully Convolutional Color Constancy with Confidence-Weighted Pooling.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Automatic Photo Adjustment Using Deep Neural Networks.
ACM Trans. Graph., 2016

Weakly Supervised Metric Learning for Traffic Sign Recognition in a LIDAR-Equipped Vehicle.
IEEE Trans. Intell. Transp. Syst., 2016

Robust object recognition via weakly supervised metric and template learning.
Neurocomputing, 2016

Maximal Sparsity with Deep Networks?
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015
Unsupervised Extraction of Video Highlights via Robust Recurrent Auto-Encoders.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Harvesting Discriminative Meta Objects with Deep CNN Features for Scene Classification.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Adaptive pooling over multiple trajectory attributes for action recognition.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

2014
Automatic Photo Adjustment Using Deep Learning.
CoRR, 2014

Action-Gons: Action Recognition with a Discriminative Dictionary of Structured Elements with Varying Granularity.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Human Action Recognition by Mining Discriminative Segment with Novel Skeleton Joint Feature.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Max-Margin Multiple-Instance Dictionary Learning.
Proceedings of the 30th International Conference on Machine Learning, 2013

Action Recognition with Actons.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Parallel H-Tree Based Data Cubing on Graphics Processors.
Int. J. Softw. Informatics, 2012

2011
Example-based image color and tone style enhancement.
ACM Trans. Graph., 2011

2010
Data-driven image color theme enhancement.
ACM Trans. Graph., 2010

ZoomTree: Unrestricted Zoom Paths in Multiscale Visual Analysis of Relational Databases.
Proceedings of the Computer Vision, Imaging and Computer Graphics. Theory and Applications, 2010

Multiscale Visualization of Relational Databases using Layered Zoom Trees and Partial Data Cubes.
Proceedings of the IMAGAPP 2010 - Proceedings of the International Conference on Imaging Theory and Applications and IVAPP 2010 - Proceedings of the International Conference on Information Visualization Theory and Applications, Angers, France, May 17, 2010

2008
General Subdomain Boundary Mapping Procedure for Structured Grid Implicit CFD Parallel Computation.
J. Aerosp. Comput. Inf. Commun., 2008


  Loading...