Chunyu Wang

Orcid: 0000-0002-9400-9107

Affiliations:
  • Microsoft Research Asia, Beijing, China


According to our database, Chunyu Wang authored at least 60 papers between 2011 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
VMarker-Pro: Probabilistic 3D Human Mesh Estimation From Virtual Markers.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation.
CoRR, May, 2025

VolumeDiffusion: Feed-forward text-to-3D generation with efficient volumetric encoder.
Graph. Model., 2025

Shift Equivariant Pose Network.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

2024
Correlation-Embedded Transformer Tracking: A Single-Branch Framework.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling.
CoRR, 2024

Unsupervised Graphic Layout Grouping with Transformers.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

GAIA: Zero-shot Talking Avatar Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Plan, Posture and Go: Towards Open-Vocabulary Text-to-Motion Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Multiple View Geometry Transformers for 3D Human Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
VoxelTrack: Multi-Person 3D Human Pose Estimation and Tracking in the Wild.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Plan, Posture and Go: Towards Open-World Text-to-Motion Generation.
CoRR, 2023

VolumeDiffusion: Flexible Text-to-3D Generation with Efficient Volumetric Encoder.
CoRR, 2023

ART·V: Auto-Regressive Text-to-Video Generation with Diffusion Models.
CoRR, 2023

Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

All in Tokens: Unifying Output Space of Visual Tasks via Soft Token.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

3D Human Mesh Estimation from Virtual Markers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Human Pose as Compositional Tokens.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Neighborhood Geometric Structure-Preserving Variational Autoencoder for Smooth and Bounded Data Sources.
IEEE Trans. Neural Networks Learn. Syst., 2022

Locally Connected Network for Monocular 3D Human Pose Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Robust Multi-object Tracking by Marginal Inference.
Proceedings of the Computer Vision - ECCV 2022, 2022

One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement.
Proceedings of the Computer Vision - ECCV 2022, 2022

Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection.
Proceedings of the Computer Vision - ECCV 2022, 2022

VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data.
Proceedings of the Computer Vision - ECCV 2022, 2022

Correlation-Aware Deep Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Semantically Synchronizing Multiple-Camera Systems with Human Pose Estimation.
Sensors, 2021

FairMOT: On the Fairness of Detection and Re-identification in Multiple Object Tracking.
Int. J. Comput. Vis., 2021

AdaFuse: Adaptive Multiview Fusion for Accurate Human Pose Estimation in the Wild.
Int. J. Comput. Vis., 2021

Learning Tracking Representations via Dual-Branch Fully Transformer Networks.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Context Modeling in 3D Human Pose Estimation: A Unified Perspective.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Semantic Image Segmentation by Scale-Adaptive Networks.
IEEE Trans. Image Process., 2020

Object Detection in Videos by High Quality Object Linking.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Humble Teacher and Eager Student: Dual Network Learning for Semi-supervised 2D Human Pose Estimation.
CoRR, 2020

End-to-End Estimation of Multi-Person 3D Poses from Multiple Cameras.
CoRR, 2020

A Simple Baseline for Multi-Object Tracking.
CoRR, 2020

VoxelPose: Towards Multi-camera 3D Human Pose Estimation in Wild Environment.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fusing Wearable IMUs With Multi-View Images for Human Pose Estimation: A Geometric Approach.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Robust 3D Human Pose Estimation from Single Images or Video Sequences.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Cross View Fusion for 3D Human Pose Estimation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Optimizing Network Structure for 3D Human Pose Estimation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Learning Basis Representation to Refine 3D Human Pose Estimations.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning to Refine 3D Human Pose Sequences.
Proceedings of the 2019 International Conference on 3D Vision, 2019

2018
Object Detection in Videos by Short and Long Range Object Linking.
CoRR, 2018

Online Dictionary Learning for Approximate Archetypal Analysis.
Proceedings of the Computer Vision - ECCV 2018, 2018

Video Object Segmentation by Learning Location-Sensitive Embeddings.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Learning Discriminative Activated Simplices for Action Recognition.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Mining 3D Key-Pose-Motifs for Action Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recognizing Actions in 3D Using Action-Snippets and Activated Simplices.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2014
Representing Data by a Mixture of Activated Simplices.
CoRR, 2014

Robust Estimation of 3D Human Poses from a Single Image.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
An Approach to Pose-Based Action Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
PQ-WGLOH: A bit-rate scalable local feature descriptor.
Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, 2012

2011
Generating vocabulary for global feature representation towards commerce image retrieval.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

