Mengyu Yang

Orcid: 0000-0001-7832-0926

According to our database, Mengyu Yang authored at least 33 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
HyBridge: hybrid decoupling-to-recoupling adaptation of vision-language models for real-time video action recognition.
J. Real Time Image Process., June, 2026

Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning.
CoRR, May, 2026

Large Vision-Language Models Get Lost in Attention.
CoRR, May, 2026

SEC: Enabling MLLMs for Low-Latency IoT Video Analysis via Semantic-Aware Edge-Cloud Collaboration.
IEEE Internet Things J., 2026

CoPHo: Classifier-guided Conditional Topology Generation with Persistent Homology.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

2025
Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration.
CoRR, November, 2025

GEN3D: Generating Domain-Free 3D Scenes from a Single Image.
CoRR, November, 2025

REM: Enabling Real-Time Neural-Enhanced Video Streaming on Mobile Devices Using Macroblock-Aware Lookup Table.
IEEE Trans. Mob. Comput., March, 2025

SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Clink! Chop! Thud! - Learning Object Sounds From Real-World Interactions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Celestial Equilibrium Theory-Based Optimal Deployment of SRv6 for Traffic Engineering.
Proceedings of the IEEE International Conference on Communications, 2025

2024
SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds.
CoRR, 2024

Global Patch-wise Attention is Masterful Facilitator for Masked Image Modeling.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

WaveDN: A Wavelet-based Training-free Zero-shot Enhancement for Vision-Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Leveraging Coarse-to-Fine Grained Representations in Contrastive Learning for Differential Medical Visual Question Answering.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Semantic Fusion Based Graph Network for Video Scene Detection.
Proceedings of the International Joint Conference on Neural Networks, 2024

DTA: Deformable Temporal Attention for Video Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2024

The Un-Kidnappable Robot: Acoustic Localization of Sneaking People.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

AdaViPro: Region-Based Adaptive Visual Prompt For Large-Scale Models Adapting.
Proceedings of the IEEE International Conference on Image Processing, 2024

Near-Lossless Gradient Compression for Data-Parallel Distributed DNN Training.
Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2023
FedCL: Federated contrastive learning for multi-center medical image classification.
Pattern Recognit., November, 2023

A Comparative Measurement Study of Point Cloud-Based Volumetric Video Codecs.
IEEE Trans. Broadcast., September, 2023

An accurate shared bicycle detection network based on faster R-CNN.
IET Image Process., May, 2023

Improving Social Media Popularity Prediction with Multiple Post Dependencies.
CoRR, 2023

A Fuzzy Error Based Fine-Tune Method for Spatio-Temporal Recognition Model.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

View while Moving: Efficient Video Recognition in Long-untrimmed Videos.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Cost-effective Modality Selection for Video Popularity Prediction.
Proceedings of the International Joint Conference on Neural Networks, 2023

Understanding and Improving Perceptual Quality of Volumetric Video Streaming.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Object-Based Multipath Transmission Scheduling Algorithm in Multi-Modal Scenarios.
Proceedings of the IEEE Global Communications Conference, 2023

2021
TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation.
CoRR, 2021

Mask-Guided Discovery of Semantic Manifolds in Generative Models.
CoRR, 2021

TriBERT: Human-centric Audio-visual Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Soloist: Generating Mixed-Initiative Tutorials from Existing Guitar Instructional Videos Through Audio Processing.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

