Jianjun Gao

Orcid: 0009-0004-9137-2869

Affiliations:
  • Nanyang Technological University, Singapore


According to our database1, Jianjun Gao authored at least 21 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep.
CoRR, March, 2026

PromptSR: Cascade Prompting for Lightweight Image Super-Resolution.
IEEE Trans. Multim., 2026

LLMArk: Instance-aware foundation model for flood risk assessment.
Inf. Fusion, 2026

MotionAnimate: Animate human images with pose motion for vivid and temporally consistent video generation.
Inf. Fusion, 2026

2025
OccluTrack: Rethinking Awareness of Occlusion for Enhancing Multiple Pedestrian Tracking.
IEEE Trans. Intell. Transp. Syst., July, 2025

From Semantics, Scene to Instance-awareness: Distilling Foundation Model for Open-vocabulary Situation Recognition.
CoRR, July, 2025

CL-HOI: Cross-level human-object interaction distillation from multimodal large language models.
Knowl. Based Syst., 2025

SSH-Net: A self-supervised and hybrid network for noisy image watermark removal.
J. Vis. Commun. Image Represent., 2025

From Semantics, Scene to Instance-awareness: Distilling Foundation Model for Open-vocabulary Grounded Situation Recognition.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

A Structure-Aware and Motion-Adaptive Framework for 3D Human Pose Estimation with Mamba.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
DEO-Net: Joint Density Estimation and Object Detection for Crowd Counting.
IEEE Trans. Instrum. Meas., 2024

CL-HOI: Cross-Level Human-Object Interaction Distillation from Vision Large Language Models.
CoRR, 2024

CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition.
CoRR, 2024

Video sentence grounding with temporally global textual knowledge.
CoRR, 2024

MultiFuser: Multimodal Fusion Transformer for Enhanced Driver Action Recognition.
Proceedings of the 26th IEEE International Workshop on Multimedia Signal Processing, 2024

Temporal Sentence Grounding with Temporally Global Textual Knowledge.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

CM<sup>2</sup>-Net: Continual Cross-Modal Mapping Network For Driver Action Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2024

Hdplifter: Hierarchical Dynamics Perception For 2D-to-3D Human Pose Lifting.
Proceedings of the IEEE International Conference on Image Processing, 2024

Contextual Human Object Interaction Understanding from Pre-Trained Large Language Model.
Proceedings of the IEEE International Conference on Acoustics, 2024

Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
METFormer: A Motion Enhanced Transformer for Multiple Object Tracking.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023


  Loading...