Keda Tao

Orcid: 0009-0006-0618-6506

According to our database¹, Keda Tao authored at least 21 papers between 2024 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs.

[BibT_eX]

[DOI]

CoRR, March, 2026

A Survey of Token Compression for Efficient Multimodal Large Language Models.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2026

Revisiting MLLM Token Technology through the Lens of Classical Visual Coding.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Circuits and Systems, 2026

2025

OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding.

[BibT_eX]

[DOI]

CoRR, December, 2025

StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding.

[BibT_eX]

[DOI]

CoRR, December, 2025

OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, November, 2025

StreamingTOM: Streaming Token Compression for Efficient Video Understanding.

[BibT_eX]

[DOI]

CoRR, October, 2025

Which Heads Matter for Reasoning? RL-Guided KV Cache Compression.

[BibT_eX]

[DOI]

CoRR, October, 2025

TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs.

[BibT_eX]

[DOI]

CoRR, July, 2025

When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios.

[BibT_eX]

[DOI]

CoRR, July, 2025

PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents.

[BibT_eX]

[DOI]

CoRR, May, 2025

RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction.

[BibT_eX]

[DOI]

IEEE Trans. Cogn. Commun. Netw., April, 2025

Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models.

[BibT_eX]

[DOI]

CoRR, March, 2025

Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs.

[BibT_eX]

[DOI]

CoRR, January, 2025

HoliTom: Holistic Token Merging for Fast Video Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

RadioDiff-Turbo: Lightweight Generative Large Electromagnetic Model for Wireless Digital Twin Construction.

[BibT_eX]

[DOI]

Proceedings of the IEEE INFOCOM 2025, 2025

Overcoming False Illusions in Real-World Face Restoration with Multi-Modal Guided Diffusion Model.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Transductive zero-shot learning with generative model-driven structure alignment.

[BibT_eX]

[DOI]

Pattern Recognit., 2024

Is Oracle Pruning the True Oracle?

[BibT_eX]

[DOI]

Sicheng Feng

Keda Tao

Huan Wang

CoRR, 2024

RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction.

[BibT_eX]

[DOI]

CoRR, 2024

Keda Tao

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...