Wei Yang

Orcid: 0000-0002-1189-1254

Affiliations:
  • Huazhong University of Science and Technology, School of Computer Science and Technology, China
  • DGene Inc., Plex-VR, Santa Clara, CA, USA
  • University of Delaware, Newark, USA (PhD 2017)


According to our database1, Wei Yang authored at least 70 papers between 2014 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
CurEvo: Curriculum-Guided Self-Evolution for Video Understanding.
CoRR, April, 2026

GateMOT: Q-Gated Attention for Dense Object Tracking.
CoRR, April, 2026

Hypergraph-State Collaborative Reasoning for Multi-Object Tracking.
CoRR, April, 2026

Counterfactual Co-Occurring Learning for Bias Mitigation in Weakly-Supervised Object Localization.
IEEE Trans. Multim., 2026

2025
Efficient Cipher-Image Coding via Compressive Sensing and Auxiliary-Information-Guided Mapping for Secure Cloud Storage in Consumer Electronics.
IEEE Trans. Consumer Electron., November, 2025

TimeJudge: empowering video-LLMs as zero-shot judges for temporal consistency in video captions.
Frontiers Inf. Technol. Electron. Eng., November, 2025

BANG: Dividing 3D Assets via Generative Exploded Dynamics.
ACM Trans. Graph., August, 2025

CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image.
ACM Trans. Graph., August, 2025

HyperFusion: Hierarchical Multimodal Ensemble Learning for Social Media Popularity Prediction.
CoRR, July, 2025

LoRA-Mixer: Coordinate Modular LoRA Experts Through Serial Attention Routing.
CoRR, July, 2025

GA-S<sup>3</sup>: Comprehensive Social Network Simulation with Group Agents.
CoRR, June, 2025

STG: Spatiotemporal Graph Neural Network with Fusion and Spatiotemporal Decoupling Learning for Prognostic Prediction of Colorectal Cancer Liver Metastasis.
CoRR, May, 2025

RURANET++: An Unsupervised Learning Method for Diabetic Macular Edema Based on SCSE Attention Mechanisms and Dynamic Multi-Projection Head Clustering.
CoRR, February, 2025

TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints.
CoRR, February, 2025

EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation.
IEEE Multim., 2025

Exploiting Appearance Re-Emergence for Robust Visual Tracking.
Proceedings of the MMAsia '25 Workshops: Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

MVP: Winning Solution to SMP Challenge 2025 Video Track.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

HER2 Expression Prediction with Flexible Multi-Modal Inputs via Dynamic Bidirectional Reconstruction.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Generating 3D Hair Strands from Images with Diverse Styles and Viewpoints.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

FITMM: Adaptive Frequency-Aware Multimodal Recommendation via Information-Theoretic Representation Learning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MCA-RG: Enhancing LLMs with Medical Concept Alignment for Radiology Report Generation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Cross-Modality Masked Learning for Survival Prediction in ICI Treated NSCLC Patients.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Optimized View and Geometry Distillation from Multi-view Diffuser.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

CA-Diff: Collaborative Anatomy Diffusion for Brain Tissue Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Ref-GS: Directional Factorization for 2D Gaussian Splatting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SF2T: Self-supervised Fragment Finetuning of Video-LLMs for Fine-Grained Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GA-S³: Comprehensive Social Network Simulation with Group Agents.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Temporal Coherent Object Flow for Multi-Object Tracking.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets.
ACM Trans. Graph., July, 2024

IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking.
CoRR, 2024

Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model.
CoRR, 2024

TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing.
CoRR, 2024

GaussianHair: Hair Modeling and Rendering with Light-aware Gaussians.
CoRR, 2024

Coupled Mamba: Enhanced Multimodal Fusion with Coupled State Space Model.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Autogenic Language Embedding for Coherent Point Tracking.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Agnostic Feature Compression with Semantic Guided Channel Importance Analysis.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Dynamic Feature Pruning and Consolidation for Occluded Person Re-identification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Progressive Text-to-Image Diffusion with Soft Latent Direction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

DiffusionTrack: Diffusion Model for Multi-Object Tracking.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Attacking Transformers with Feature Diversity Adversarial Perturbation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
HACK: Learning a Parametric Head and Neck Model for High-fidelity Animation.
ACM Trans. Graph., August, 2023

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance.
ACM Trans. Graph., August, 2023

Optimized View and Geometry Distillation from Multi-view Diffuser.
CoRR, 2023

Fine-grained Appearance Transfer with Diffusion Models.
CoRR, 2023

NeMF: Inverse Volume Rendering with Neural Microflake Field.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Rendering.
Proceedings of the IEEE International Conference on Computational Photography, 2023

Dual Memory Units with Uncertainty Regulation for Weakly Supervised Video Anomaly Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Compact Transformer Tracker with Correlative Masked Modeling.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Video-Driven Neural Physically-Based Facial Asset for Production.
ACM Trans. Graph., 2022

Artemis: articulated neural pets with appearance and motion synthesis.
ACM Trans. Graph., 2022

TightCap: 3D Human Shape Capture with Clothing Tightness Field.
ACM Trans. Graph., 2022

Dynamic Feature Pruning and Consolidation for Occluded Person Re-Identification.
CoRR, 2022

NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Implicit Representation.
CoRR, 2022

NeuVV: Neural Volumetric Videos with Immersive Rendering and Editing.
CoRR, 2022

HumanNeRF: Efficiently Generated Human Radiance Field from Sparse Inputs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Transformer Tracking with Cyclic Shifting Window Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Structure From Motion on XSlit Cameras.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

SportsCap: Monocular 3D Human Motion Capture and Fine-Grained Understanding in Challenging Sports Videos.
Int. J. Comput. Vis., 2021

HumanNeRF: Generalizable Neural Human Radiance Field from Sparse Inputs.
CoRR, 2021

2019
Multi-Flash Light Field Photography.
IEEE Access, 2019

Fragmentation Guided Human Shape Reconstruction.
IEEE Access, 2019

2017
Ray Space Features for Plenoptic Structure-from-Motion.
Proceedings of the IEEE International Conference on Computer Vision, 2017

The light field 3D scanner.
Proceedings of the 2017 IEEE International Conference on Computational Photography, 2017

Robust 3D Human Motion Reconstruction via Dynamic Template Construction.
Proceedings of the 2017 International Conference on 3D Vision, 2017

2015
Resolving Scale Ambiguity via XSlit Aspect Ratio Analysis.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Ambient occlusion via compressive visibility estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Depth-of-Field and Coded Aperture Imaging on XSlit Lens.
Proceedings of the Computer Vision - ECCV 2014, 2014

Coplanar Common Points in Non-centric Cameras.
Proceedings of the Computer Vision - ECCV 2014, 2014


  Loading...