Rui Yan

Orcid: 0000-0002-0694-9458

Affiliations:
  • Nanjing University, Department of Computer Science and Technology, China


According to our database, Rui Yan authored at least 42 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
STPM: Spatial-Temporal Token Pruning and Merging for Complex Activity Recognition.
IEEE Trans. Circuits Syst. Video Technol., June, 2025

R-Genie: Reasoning-Guided Generative Image Editing.
CoRR, May, 2025

Leveraging Frame- and Feature-level Progressive Augmentation for Semi-supervised Action Recognition.
ACM Trans. Multim. Comput. Commun. Appl., April, 2025

Coarse-Fine Nested Network for Weakly Supervised Group Activity Recognition.
IEEE Trans. Neural Networks Learn. Syst., April, 2025

Appearance-Agnostic Representation Learning for Compositional Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., April, 2025

ShapeSpeak: Body Shape-Aware Textual Alignment for Visible-Infrared Person Re-Identification.
CoRR, April, 2025

Hierarchical Relation-augmented Representation Generalization for Few-shot Action Recognition.
CoRR, April, 2025

Vision-centric Token Compression in Large Language Model.
CoRR, February, 2025

TEST-V: TEst-time Support-set Tuning for Zero-shot Video Classification.
CoRR, February, 2025

Hierarchical Motion-Enhanced Matching Framework for Few-Shot Action Recognition.
IEEE Trans. Multim., 2025

GPT4Ego: Unleashing the Potential of Pre-Trained Models for Zero-Shot Egocentric Action Recognition.
IEEE Trans. Multim., 2025

2024
Boosting Weakly-Supervised Image Segmentation via Representation, Transform, and Compensator.
IEEE Trans. Circuits Syst. Video Technol., November, 2024

Semantic-Disentangled Transformer With Noun-Verb Embedding for Compositional Action Recognition.
IEEE Trans. Image Process., 2024

EventCrab: Harnessing Frame and Point Synergy for Event-based Action Recognition and Beyond.
CoRR, 2024

Locality-aware Cross-modal Correspondence Learning for Dense Audio-Visual Events Localization.
CoRR, 2024

MVP-Shot: Multi-Velocity Progressive-Alignment Framework for Few-Shot Action Recognition.
CoRR, 2024

DVF: Advancing Robust and Accurate Fine-Grained Image Retrieval with Retrieval Guidelines.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

AdaFPP: Adapt-Focused Bi-Propagating Prototype Learning for Panoramic Activity Recognition.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DTS-TPT: Dual Temporal-Sync Test-time Prompt Tuning for Zero-shot Activity Recognition.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

2023
Com-STAL: Compositional Spatio-Temporal Action Localization.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Progressive Instance-Aware Feature Learning for Compositional Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Magi-Net: Meta Negative Network for Early Activity Prediction.
IEEE Trans. Image Process., 2023

M3Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition.
CoRR, 2023

Attack is Good Augmentation: Towards Skeleton-Contrastive Representation Learning.
CoRR, 2023

Slowfast Diversity-aware Prototype Learning for Egocentric Action Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Foreground/Background-Masked Interaction Learning for Spatio-temporal Action Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MUP: Multi-granularity Unified Perception for Panoramic Activity Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

M3Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Video-Text Pre-training with Learned Regions for Retrieval.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Position-Aware Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2022

Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022

Coherence Constrained Graph LSTM for Group Activity Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Skip-attention encoder-decoder framework for human motion prediction.
Multim. Syst., 2022

Spatiotemporal Perturbation Based Dynamic Consistency for Semi-supervised Temporal Action Detection.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Look Less Think More: Rethinking Compositional Action Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Video-Text Pre-training with Learned Regions.
CoRR, 2021

2020
Interactive Fusion of Multi-level Features for Compositional Activity Recognition.
CoRR, 2020

Storyboard relational model for group activity recognition.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Social Adaptive Module for Weakly-Supervised Group Activity Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

2018
A Feature Selection Method for Projection Twin Support Vector Machine.
Neural Process. Lett., 2018

Participation-Contributed Temporal Dynamic Model for Group Activity Recognition.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

