Rui Yan

Orcid: 0000-0002-0694-9458

Affiliations:
  • Nanjing University, Department of Computer Science and Technology, China


According to our database1, Rui Yan authored at least 54 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Spatio-Temporal Decoupled Knowledge Compensator for Few-Shot Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2026

VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding.
IEEE Trans. Circuits Syst. Video Technol., May, 2026

Locality-Aware Cross-Modal Correspondence Learning for Dense Audio-Visual Events Detection.
IEEE Trans. Circuits Syst. Video Technol., May, 2026

Attack-Augmented Mixing-Contrastive Skeletal Representation Learning.
IEEE Trans. Image Process., 2026

Spatiotemporal-Untrammelled Mixture of Experts for Multi-Person Motion Prediction.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Eliminating Semantic Ambiguity in Human Pose Estimation via Stable Feature Upsampling.
IEEE Trans. Circuits Syst. Video Technol., December, 2025

See the Text: From Tokenization to Visual Reading.
CoRR, October, 2025

IMAGEdit: Let Any Subject Transform.
CoRR, October, 2025

STPM: Spatial-Temporal Token Pruning and Merging for Complex Activity Recognition.
IEEE Trans. Circuits Syst. Video Technol., June, 2025

R-Genie: Reasoning-Guided Generative Image Editing.
CoRR, May, 2025

Leveraging Frame- and Feature-level Progressive Augmentation for Semi-supervised Action Recognition.
ACM Trans. Multim. Comput. Commun. Appl., April, 2025

Coarse-Fine Nested Network for Weakly Supervised Group Activity Recognition.
IEEE Trans. Neural Networks Learn. Syst., April, 2025

Appearance-Agnostic Representation Learning for Compositional Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., April, 2025

ShapeSpeak: Body Shape-Aware Textual Alignment for Visible-Infrared Person Re-Identification.
CoRR, April, 2025

Hierarchical Relation-augmented Representation Generalization for Few-shot Action Recognition.
CoRR, April, 2025

Vision-centric Token Compression in Large Language Model.
CoRR, February, 2025

TEST-V: TEst-time Support-set Tuning for Zero-shot Video Classification.
CoRR, February, 2025

MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition.
IEEE Trans. Multim., 2025

Hierarchical Motion-Enhanced Matching Framework for Few-Shot Action Recognition.
IEEE Trans. Multim., 2025

GPT4Ego: Unleashing the Potential of Pre-Trained Models for Zero-Shot Egocentric Action Recognition.
IEEE Trans. Multim., 2025

Test-Time Tuning for Zero-Shot Spatio-Temporal Action Localization.
Proceedings of the Pattern Recognition and Computer Vision - 8th Chinese Conference, 2025

Time-IC: Empowering MLLM with Interleaved Context for Temporal-Sensitive Video Understanding.
Proceedings of the 7th ACM International Conference on Multimedia in Asia, 2025

TEST-V: TEst-time Support-set Tuning for Zero-shot Video Classification.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Reliable and Diverse Hierarchical Adapter for Zero-shot Video Classification.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

Exploiting Frequency Dynamics for Enhanced Multimodal Event-Based Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Boosting Weakly-Supervised Image Segmentation via Representation, Transform, and Compensator.
IEEE Trans. Circuits Syst. Video Technol., November, 2024

Semantic-Disentangled Transformer With Noun-Verb Embedding for Compositional Action Recognition.
IEEE Trans. Image Process., 2024

Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization.
CoRR, 2024

DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

AdaFPP: Adapt-Focused Bi-Propagating Prototype Learning for Panoramic Activity Recognition.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DTS-TPT: Dual Temporal-Sync Test-time Prompt Tuning for Zero-shot Activity Recognition.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

2023
Com-STAL: Compositional Spatio-Temporal Action Localization.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Progressive Instance-Aware Feature Learning for Compositional Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Magi-Net: Meta Negative Network for Early Activity Prediction.
IEEE Trans. Image Process., 2023

M<sup>3</sup>Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition.
CoRR, 2023

Attack is Good Augmentation: Towards Skeleton-Contrastive Representation Learning.
CoRR, 2023

Slowfast Diversity-aware Prototype Learning for Egocentric Action Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Foreground/Background-Masked Interaction Learning for Spatio-temporal Action Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MUP: Multi-granularity Unified Perception for Panoramic Activity Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

M3Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Video-Text Pre-training with Learned Regions for Retrieval.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Position-Aware Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2022

Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022

Coherence Constrained Graph LSTM for Group Activity Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Skip-attention encoder-decoder framework for human motion prediction.
Multim. Syst., 2022

Spatiotemporal Perturbation Based Dynamic Consistency for Semi-supervised Temporal Action Detection.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Look Less Think More: Rethinking Compositional Action Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Video-Text Pre-training with Learned Regions.
CoRR, 2021

2020
Interactive Fusion of Multi-level Features for Compositional Activity Recognition.
CoRR, 2020

Storyboard relational model for group activity recognition.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Social Adaptive Module for Weakly-Supervised Group Activity Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

2018
A Feature Selection Method for Projection Twin Support Vector Machine.
Neural Process. Lett., 2018

Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018


  Loading...