Zongxin Yang

Orcid: 0000-0001-8783-8313

According to our database1, Zongxin Yang authored at least 46 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Show Me a Video: A Large-Scale Narrated Video Dataset for Coherent Story Illustration.
IEEE Trans. Multim., 2024

Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation.
CoRR, 2024

DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models.
CoRR, 2024

GD^2-NeRF: Generative Detail Compensation via GAN and Diffusion for One-shot Generalizable Neural Radiance Fields.
CoRR, 2024

Controllable 3D Face Generation with Conditional Style Code Diffusion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Co-Learning Meets Stitch-Up for Noisy Multi-Label Visual Recognition.
IEEE Trans. Image Process., 2023

Collaborative Content-Dependent Modeling: A Return to the Roots of Salient Object Detection.
IEEE Trans. Image Process., 2023

Human101: Training 100+FPS Human Gaussians in 100s from 1 View.
CoRR, 2023

SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance.
CoRR, 2023

SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction.
CoRR, 2023

Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction.
CoRR, 2023

ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking.
CoRR, 2023

ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation.
CoRR, 2023

Segment and Track Anything.
CoRR, 2023

Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Pyramid Diffusion Models for Low-light Image Enhancement.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Video Object Segmentation in Panoptic Wild Scenes.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Decompose to Generalize: Species-Generalized Animal Pose Estimation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The First Visual Object Tracking Segmentation VOTS2023 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Shuffled Autoregression for Motion Interpolation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FedSeg: Class-Heterogeneous Federated Learning for Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ProD: Prompting-to-disentangle Domain Knowledge for Cross-domain Few-shot Image Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

V<sup>2</sup>L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval.
CoRR, 2022

Associating Objects with Scalable Transformers for Video Object Segmentation.
CoRR, 2022

Decoupling Features in Hierarchical Propagation for Video Object Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

In-N-Out Generative Learning for Dense Unsupervised Video Segmentation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Instance as Identity: A Generic Online Paradigm for Video Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-domain Weakly Supervised Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Sequence Modelling with Deep Learning for Visual Content Generation and Understanding
PhD thesis, 2021

Rethinking Cross-modal Interaction from a Top-down Perspective for Referring Video Object Segmentation.
CoRR, 2021

Associating Objects with Transformers for Video Object Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Collaborative Video Object Segmentation by Foreground-Background Integration.
Proceedings of the Computer Vision - ECCV 2020, 2020

Gated Channel Transformation for Visual Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Going Deeper Into Embedding Learning for Video Object Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Dual Embedding Learning for Video Instance Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Very Long Natural Scenery Image Prediction by Outpainting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019


  Loading...