Zheming Yang

Orcid: 0000-0001-9957-5792

According to our database1, Zheming Yang authored at least 41 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Recursive Offloading for LLM Serving in Multi-Tier Networks.
IEEE Trans. Mob. Comput., June, 2026

R2E-VID: Two-Stage Robust Routing via Temporal Gating for Elastic Edge-Cloud Video Inference.
CoRR, April, 2026

When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning.
CoRR, April, 2026

DAT: Dual-Aware Adaptive Transmission for Efficient Multimodal LLM Inference in Edge-Cloud Systems.
CoRR, April, 2026

MSAO: Adaptive Modality Sparsity-Aware Offloading with Edge-Cloud Collaboration for Efficient Multimodal LLM Inference.
CoRR, April, 2026

Not All Negative Samples Are Equal: LLMs Learn Better from Plausible Reasoning.
CoRR, February, 2026

From Atoms to Chains: Divergence-Guided Reasoning Curriculum for Unlabeled LLM Domain Adaptation.
CoRR, January, 2026

Mimic Human Cognition, Master Multi-Image Reasoning: A Meta-Action Framework for Enhanced Visual Understanding.
CoRR, January, 2026

AIVD: Adaptive Edge-Cloud Collaboration for Accurate and Efficient Industrial Visual Detection.
CoRR, January, 2026

ThinkDrive: Chain-of-Thought Guided Progressive Reinforcement Learning Fine-Tuning for Autonomous Driving.
CoRR, January, 2026

SearchAttack: Red-Teaming LLMs against Real-World Threats via Framing Unsafe Web Information-Seeking Tasks.
CoRR, January, 2026

2025
MMLongCite: A Benchmark for Evaluating Fidelity of Long-Context Vision-Language Models.
CoRR, October, 2025

Logo-VGR: Visual Grounded Reasoning for Open-world Logo Recognition.
CoRR, September, 2025

Adaptive Guidance Semantically Enhanced via Multimodal LLM for Edge-Cloud Object Detection.
CoRR, September, 2025

SAEC: Scene-Aware Enhanced Edge-Cloud Collaborative Industrial Vision Inspection with Multimodal LLM.
CoRR, September, 2025

MoA-Off: Adaptive Heterogeneous Modality-Aware Offloading with Edge-Cloud Collaboration for Efficient Multimodal LLM Inference.
CoRR, September, 2025

CURE: Critical-Token-Guided Re-Concatenation for Entropy-Collapse Prevention.
CoRR, August, 2025

EC2MoE: Adaptive End-Cloud Pipeline Collaboration Enabling Scalable Mixture-of-Experts Inference.
CoRR, August, 2025

GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking.
CoRR, June, 2025

Recursive Offloading for LLM Serving in Multi-tier Networks.
CoRR, May, 2025

Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models.
CoRR, April, 2025

CDIO: Cross-Domain Inference Optimization with Resource Preference Prediction for Edge-Cloud Collaboration.
CoRR, February, 2025

A Low-Rank Enhanced Lightweight Multimodal LLM Framework for Efficient Edge Power Visual Detection.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2025

SpaceServe: Spatial Multiplexing of Complementary Encoders and Decoders for Multimodal LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

AODMS: Adaptive Online Edge-Cloud Collaborative Inference with Dynamic Model Switching and Resource Allocation.
Proceedings of the 31th IEEE International Conference on Parallel and Distributed Systems, 2025

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Decoder-Only LLMs can be Masked Auto-Encoders.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

2024
PerLLM: Personalized Inference Scheduling with Edge-Cloud Collaboration for Diverse LLM Services.
CoRR, 2024

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers.
CoRR, 2024

2023
Visual E<sup>2</sup>C: AI-Driven Visual End-Edge-Cloud Architecture for 6G in Low-Carbon Smart Cities.
IEEE Wirel. Commun., June, 2023

JVAP: A Joint Video Acceleration Processing Architecture for Online Edge Systems.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

JAVP: Joint-Aware Video Processing with Edge-Cloud Collaboration for DNN Inference.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2021
An Intelligent End-Edge-Cloud Architecture for Visual IoT-Assisted Healthcare Systems.
IEEE Internet Things J., 2021

An intelligence optimization method based on crowd intelligence for IoT devices.
Int. J. Crowd Sci., 2021

Crowd evolution method based on intelligence level clustering.
Int. J. Crowd Sci., 2021

2020
Crowd V-IoE: Visual Internet of Everything Architecture in AI-Driven Fog Computing.
IEEE Wirel. Commun., 2020

Meta measurement of intelligence with crowd network.
Int. J. Crowd Sci., 2020

Understanding Crowd Intelligence in Large-scale Systems: A Hierarchical Binary Particle Swarm Optimization Approach.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2020

A Quality- Time Model of Heterogeneous Agents Measure for Crowd Intelligence.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2020

Crowd Intelligence Empowered Video Transmission in Ultra-low-bandwidth Constrained Circumstances.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2020

2019
A Universal Intelligence Measurement Method Based on Meta-analysis.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019


  Loading...