Keda Tao

Orcid: 0009-0006-0618-6506

According to our database1, Keda Tao authored at least 21 papers between 2024 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs.
CoRR, March, 2026

A Survey of Token Compression for Efficient Multimodal Large Language Models.
Trans. Mach. Learn. Res., 2026

Revisiting MLLM Token Technology through the Lens of Classical Visual Coding.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2026

2025
OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding.
CoRR, December, 2025

StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding.
CoRR, December, 2025

OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models.
CoRR, November, 2025

StreamingTOM: Streaming Token Compression for Efficient Video Understanding.
CoRR, October, 2025

Which Heads Matter for Reasoning? RL-Guided KV Cache Compression.
CoRR, October, 2025

TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs.
CoRR, July, 2025

When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios.
CoRR, July, 2025

PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents.
CoRR, May, 2025

RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction.
IEEE Trans. Cogn. Commun. Netw., April, 2025

Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models.
CoRR, March, 2025

Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs.
CoRR, January, 2025

HoliTom: Holistic Token Merging for Fast Video Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

RadioDiff-Turbo: Lightweight Generative Large Electromagnetic Model for Wireless Digital Twin Construction.
Proceedings of the IEEE INFOCOM 2025, 2025

Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Transductive zero-shot learning with generative model-driven structure alignment.
Pattern Recognit., 2024

Is Oracle Pruning the True Oracle?
CoRR, 2024

RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction.
CoRR, 2024


  Loading...