Hewei Wang

ORCID: 0000-0002-6952-0886

Affiliations:
  • Carnegie Mellon University, Pittsburgh, PA, USA
  • Beijing University of Technology, Beijing-Dublin International College, China (former)
  • University College Dublin, Ireland (former)


According to our database, Hewei Wang authored at least 36 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Building a Precise Video Language with Human-AI Oversight.
CoRR, April, 2026

Well Begun is Half Done: Training-Free and Model-Agnostic Semantically Guaranteed User Representation Initialization for Multimodal Recommendation.
CoRR, April, 2026

CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation.
CoRR, March, 2026

DGGVAE: Dual-Granularity Graph Variational Auto-Encoder for Group Recommendation.
ACM Trans. Inf. Syst., February, 2026

CLIP-Guided Unsupervised Semantic-Aware Exposure Correction.
CoRR, January, 2026

VI-MMRec: Similarity-Aware Training Cost-free Virtual User-Item Interactions for Multimodal Recommendation.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

Learning and Editing Universal Graph Prompt Tuning via Reinforcement Learning.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

Multi-modal Dynamic Proxy Learning for Personalized Multiple Clustering.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
ToolMem: Enhancing Multimodal Agents with Learnable Tool Capability Memory.
CoRR, October, 2025

Spec-LLaVA: Accelerating Vision-Language Models with Dynamic Tree-Based Speculative Decoding.
CoRR, September, 2025

Consistent Video Editing as Flow-Driven Image-to-Video Generation.
CoRR, June, 2025

MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems.
CoRR, May, 2025

Towards Understanding Camera Motions in Any Video.
CoRR, April, 2025

Segregation and Context Aggregation Network for Real-time Cloud Segmentation.
CoRR, April, 2025

MDTeamGPT: A Self-Evolving LLM-based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation.
CoRR, March, 2025

Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration.
CoRR, March, 2025

RAINER: A Robust Ensemble Learning Grid Search-Tuned Framework for Rainfall Patterns Prediction.
CoRR, January, 2025

CP2M: Clustered-Patch-Mixed Mosaic Augmentation for Aerial Image Segmentation.
CoRR, January, 2025

DDUNet: Dual Dynamic U-Net for Highly-Efficient Cloud Segmentation.
CoRR, January, 2025

NLGCL: Naturally Existing Neighbor Layers Graph Contrastive Learning for Recommendation.
Proceedings of the Nineteenth ACM Conference on Recommender Systems, 2025

Towards Understanding Camera Motions in Any Video.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

MDVT: Enhancing Multimodal Recommendation with Model-Agnostic Multimodal-Driven Virtual Triplets.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Multi-Cali Anything: Dense Feature Multi-Frame Structure-from-Motion for Large-Scale Camera Array Calibration.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

Hypercomplex Prompt-aware Multimodal Recommendation.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

MENTOR: Multi-level Self-supervised Learning for Multimodal Recommendation.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
MFCSNet: A Musician-Follower Complex Social Network for Measuring Musical Influence.
Entertain. Comput., January, 2024

UCloudNet: A Residual U-Net with Deep Supervision for Cloud Image Segmentation.
Proceedings of the IGARSS 2024, 2024

AlignGroup: Learning and Aligning Group Consensus with Member Preferences for Group Recommendation.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement.
Proceedings of the 35th British Machine Vision Conference, 2024

2023
DAANet: Dual Attention Aggregating Network for Salient Object Detection.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2023

VGRISys: A Vision-Guided Robotic Intelligent System for Autonomous Instrument Calibration.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2023

2022
AMDCNet: An attentional multi-directional convolutional network for stereo matching.
Displays, 2022

DMCNet: Diversified Model Combination Network for Understanding Engagement from Video Screengrabs.
CoRR, 2022

A predictive analytics approach for stroke prediction using machine learning and neural networks.
CoRR, 2022

SYGNet: A SVD-YOLO based GhostNet for Real-time Driving Scene Parsing.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021
Stereo Matching Based on Visual Sensitive Information.
CoRR, 2021

