Jimin Zhuang

According to our database1, Jimin Zhuang authored at least 7 papers between 2024 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models.
CoRR, June, 2025

ACVUBench: Audio-Centric Video Understanding Benchmark.
CoRR, March, 2025

Improving LLM Video Understanding with 16 Frames Per Second.
CoRR, March, 2025

video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model.
CoRR, February, 2025

Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
Enhancing Multimodal LLM for Detailed and Accurate Video Captioning using Multi-Round Preference Optimization.
CoRR, 2024

Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation.
CoRR, 2024


  Loading...