Wei Yang

Orcid: 0000-0002-1189-1254

Affiliations:

Huazhong University of Science and Technology, School of Computer Science and Technology, China
DGene Inc., Plex-VR, Santa Clara, CA, USA
University of Delaware, Newark, USA (PhD 2017)

According to our database¹, Wei Yang authored at least 72 papers between 2014 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

CurEvo: Curriculum-Guided Self-Evolution for Video Understanding.

[BibT_eX]

[DOI]

CoRR, April, 2026

GateMOT: Q-Gated Attention for Dense Object Tracking.

[BibT_eX]

[DOI]

CoRR, April, 2026

Hypergraph-State Collaborative Reasoning for Multi-Object Tracking.

[BibT_eX]

[DOI]

CoRR, April, 2026

Counterfactual Co-Occurring Learning for Bias Mitigation in Weakly-Supervised Object Localization.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2026

EvoCap: Enhancing video captioning via self-evolving video-LLMs with knowledge consolidation.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2026

Semantic-Aware Logical Reasoning via a Semiotic Framework.

[BibT_eX]

[DOI]

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

Efficient Cipher-Image Coding via Compressive Sensing and Auxiliary-Information-Guided Mapping for Secure Cloud Storage in Consumer Electronics.

[BibT_eX]

[DOI]

IEEE Trans. Consumer Electron., November, 2025

TimeJudge: empowering video-LLMs as zero-shot judges for temporal consistency in video captions.

[BibT_eX]

[DOI]

Frontiers Inf. Technol. Electron. Eng., November, 2025

BANG: Dividing 3D Assets via Generative Exploded Dynamics.

[BibT_eX]

[DOI]

ACM Trans. Graph., August, 2025

CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image.

[BibT_eX]

[DOI]

ACM Trans. Graph., August, 2025

HyperFusion: Hierarchical Multimodal Ensemble Learning for Social Media Popularity Prediction.

[BibT_eX]

[DOI]

CoRR, July, 2025

LoRA-Mixer: Coordinate Modular LoRA Experts Through Serial Attention Routing.

[BibT_eX]

[DOI]

CoRR, July, 2025

GA-S<sup>3</sup>: Comprehensive Social Network Simulation with Group Agents.

[BibT_eX]

[DOI]

CoRR, June, 2025

STG: Spatiotemporal Graph Neural Network with Fusion and Spatiotemporal Decoupling Learning for Prognostic Prediction of Colorectal Cancer Liver Metastasis.

[BibT_eX]

[DOI]

CoRR, May, 2025

RURANET++: An Unsupervised Learning Method for Diabetic Macular Edema Based on SCSE Attention Mechanisms and Dynamic Multi-Projection Head Clustering.

[BibT_eX]

[DOI]

CoRR, February, 2025

TANGLED: Generating 3D Hair Strands from Images with Arbitrary Styles and Viewpoints.

[BibT_eX]

[DOI]

CoRR, February, 2025

EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation.

[BibT_eX]

[DOI]

IEEE Multim., 2025

Exploiting Appearance Re-Emergence for Robust Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '25 Workshops: Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

MVP: Winning Solution to SMP Challenge 2025 Video Track.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

HER2 Expression Prediction with Flexible Multi-Modal Inputs via Dynamic Bidirectional Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Generating 3D Hair Strands from Images with Diverse Styles and Viewpoints.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

FITMM: Adaptive Frequency-Aware Multimodal Recommendation via Information-Theoretic Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MCA-RG: Enhancing LLMs with Medical Concept Alignment for Radiology Report Generation.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Cross-Modality Masked Learning for Survival Prediction in ICI Treated NSCLC Patients.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Optimized View and Geometry Distillation from Multi-view Diffuser.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

CA-Diff: Collaborative Anatomy Diffusion for Brain Tissue Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Ref-GS: Directional Factorization for 2D Gaussian Splatting.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

SF2T: Self-supervised Fragment Finetuning of Video-LLMs for Fine-Grained Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GA-S³: Comprehensive Social Network Simulation with Group Agents.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Temporal Coherent Object Flow for Multi-Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets.

[BibT_eX]

[DOI]

ACM Trans. Graph., July, 2024

IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking.

[BibT_eX]

[DOI]

CoRR, 2024

Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model.

[BibT_eX]

[DOI]

CoRR, 2024

TIGER: Text-Instructed 3D Gaussian Retrieval and Coherent Editing.

[BibT_eX]

[DOI]

CoRR, 2024

GaussianHair: Hair Modeling and Rendering with Light-aware Gaussians.

[BibT_eX]

[DOI]

CoRR, 2024

Coupled Mamba: Enhanced Multimodal Fusion with Coupled State Space Model.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Autogenic Language Embedding for Coherent Point Tracking.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Agnostic Feature Compression with Semantic Guided Channel Importance Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Dynamic Feature Pruning and Consolidation for Occluded Person Re-identification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Progressive Text-to-Image Diffusion with Soft Latent Direction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

DiffusionTrack: Diffusion Model for Multi-Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

AMD: Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Attacking Transformers with Feature Diversity Adversarial Perturbation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

HACK: Learning a Parametric Head and Neck Model for High-fidelity Animation.

[BibT_eX]

[DOI]

ACM Trans. Graph., August, 2023

DreamFace: Progressive Generation of Animatable 3D Faces under Text Guidance.

[BibT_eX]

[DOI]

ACM Trans. Graph., August, 2023

Optimized View and Geometry Distillation from Multi-view Diffuser.

[BibT_eX]

[DOI]

CoRR, 2023

Fine-grained Appearance Transfer with Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

NeMF: Inverse Volume Rendering with Neural Microflake Field.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Rendering.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computational Photography, 2023

Dual Memory Units with Uncertainty Regulation for Weakly Supervised Video Anomaly Detection.

[BibT_eX]

[DOI]

Hang Zhou

Junqing Yu

Wei Yang

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Compact Transformer Tracker with Correlative Masked Modeling.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Video-Driven Neural Physically-Based Facial Asset for Production.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2022

Artemis: articulated neural pets with appearance and motion synthesis.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2022

TightCap: 3D Human Shape Capture with Clothing Tightness Field.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2022

Dynamic Feature Pruning and Consolidation for Occluded Person Re-Identification.

[BibT_eX]

[DOI]

CoRR, 2022

NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Implicit Representation.

[BibT_eX]

[DOI]

CoRR, 2022

NeuVV: Neural Volumetric Videos with Immersive Rendering and Editing.

[BibT_eX]

[DOI]

CoRR, 2022

HumanNeRF: Efficiently Generated Human Radiance Field from Sparse Inputs.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Transformer Tracking with Cyclic Shifting Window Attention.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Structure From Motion on XSlit Cameras.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

SportsCap: Monocular 3D Human Motion Capture and Fine-Grained Understanding in Challenging Sports Videos.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2021

HumanNeRF: Generalizable Neural Human Radiance Field from Sparse Inputs.

[BibT_eX]

[DOI]

CoRR, 2021

2019

Multi-Flash Light Field Photography.

[BibT_eX]

[DOI]

IEEE Access, 2019

Fragmentation Guided Human Shape Reconstruction.

[BibT_eX]

[DOI]

IEEE Access, 2019

2017

Ray Space Features for Plenoptic Structure-from-Motion.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

The light field 3D scanner.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computational Photography, 2017

Robust 3D Human Motion Reconstruction via Dynamic Template Construction.

[BibT_eX]

[DOI]

Proceedings of the 2017 International Conference on 3D Vision, 2017

2015

Resolving Scale Ambiguity via XSlit Aspect Ratio Analysis.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Ambient occlusion via compressive visibility estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Depth-of-Field and Coded Aperture Imaging on XSlit Lens.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

Coplanar Common Points in Non-centric Cameras.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

Wei Yang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...