Mengyu Yang

ORCID: 0000-0001-7832-0926

According to our database, Mengyu Yang authored at least 33 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

HyBridge: hybrid decoupling-to-recoupling adaptation of vision-language models for real-time video action recognition.

J. Real Time Image Process., June, 2026

Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning.

CoRR, May, 2026

Large Vision-Language Models Get Lost in Attention.

CoRR, May, 2026

SEC: Enabling MLLMs for Low-Latency IoT Video Analysis via Semantic-Aware Edge-Cloud Collaboration.

IEEE Internet Things J., 2026

CoPHo: Classifier-guided Conditional Topology Generation with Persistent Homology.

Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

2025

Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration.

CoRR, November, 2025

GEN3D: Generating Domain-Free 3D Scenes from a Single Image.

CoRR, November, 2025

REM: Enabling Real-Time Neural-Enhanced Video Streaming on Mobile Devices Using Macroblock-Aware Lookup Table.

IEEE Trans. Mob. Comput., March, 2025

SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds.

Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Clink! Chop! Thud! - Learning Object Sounds From Real-World Interactions.

Arun Balajee Vasudevan

James Hays

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Celestial Equilibrium Theory-Based Optimal Deployment of SRv6 for Traffic Engineering.

Proceedings of the IEEE International Conference on Communications, 2025

2024

SDP: Spiking Diffusion Policy for Robotic Manipulation with Learnable Channel-Wise Membrane Thresholds.

CoRR, 2024

Global Patch-wise Attention is Masterful Facilitator for Masked Image Modeling.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

WaveDN: A Wavelet-based Training-free Zero-shot Enhancement for Vision-Language Models.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Leveraging Coarse-to-Fine Grained Representations in Contrastive Learning for Differential Medical Visual Question Answering.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Semantic Fusion Based Graph Network for Video Scene Detection.

Proceedings of the International Joint Conference on Neural Networks, 2024

DTA: Deformable Temporal Attention for Video Recognition.

Proceedings of the International Joint Conference on Neural Networks, 2024

The Un-Kidnappable Robot: Acoustic Localization of Sneaking People.

Mengyu Yang

Patrick Grady

Samarth Brahmbhatt

Arun Balajee Vasudevan

Charles C. Kemp

James Hays

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

AdaViPro: Region-Based Adaptive Visual Prompt For Large-Scale Models Adapting.

Proceedings of the IEEE International Conference on Image Processing, 2024

Near-Lossless Gradient Compression for Data-Parallel Distributed DNN Training.

Proceedings of the 2024 ACM Symposium on Cloud Computing, 2024

2023

FedCL: Federated contrastive learning for multi-center medical image classification.

Pattern Recognit., November, 2023

A Comparative Measurement Study of Point Cloud-Based Volumetric Video Codecs.

IEEE Trans. Broadcast., September, 2023

An accurate shared bicycle detection network based on faster R-CNN.

IET Image Process., May, 2023

Improving Social Media Popularity Prediction with Multiple Post Dependencies.

CoRR, 2023

A Fuzzy Error Based Fine-Tune Method for Spatio-Temporal Recognition Model.

Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

View while Moving: Efficient Video Recognition in Long-untrimmed Videos.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Cost-effective Modality Selection for Video Popularity Prediction.

Proceedings of the International Joint Conference on Neural Networks, 2023

Understanding and Improving Perceptual Quality of Volumetric Video Streaming.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Object-Based Multipath Transmission Scheduling Algorithm in Multi-Modal Scenarios.

Proceedings of the IEEE Global Communications Conference, 2023

2021

TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation.

Tanzila Rahman

Mengyu Yang

Leonid Sigal

CoRR, 2021

Mask-Guided Discovery of Semantic Manifolds in Generative Models.

Mengyu Yang

David Rokeby

Xavier Snelgrove

CoRR, 2021

TriBERT: Human-centric Audio-visual Representation Learning.

Tanzila Rahman

Mengyu Yang

Leonid Sigal

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Soloist: Generating Mixed-Initiative Tutorials from Existing Guitar Instructional Videos Through Audio Processing.

Bryan Wang

Mengyu Yang

Tovi Grossman

Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021
