Rui Feng

Orcid: 0000-0002-4747-0574

Affiliations:
  • Fudan University, School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, Shanghai, China
  • Shanghai Jiao Tong University, China (PhD 2003)


According to our database1, Rui Feng authored at least 130 papers between 2005 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
A Medical Multimodal Large Language Model for Pediatric Pneumonia.
IEEE J. Biomed. Health Informatics, September, 2025

QMix: Quality-Aware Learning With Mixed Noise for Robust Retinal Disease Diagnosis.
IEEE Trans. Medical Imaging, August, 2025

Dual Semantic-Aware Network for Noise Suppressed Ultrasound Video Segmentation.
CoRR, July, 2025

Advancing Lung Disease Diagnosis in 3D CT Scans.
CoRR, July, 2025

Multi-Source COVID-19 Detection via Variance Risk Extrapolation.
CoRR, June, 2025

AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation.
CoRR, May, 2025

Text-promptable Propagation for Referring Medical Image Sequence Segmentation.
CoRR, February, 2025

FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training.
CoRR, January, 2025

Toward Causal and Evidential Open-Set Temporal Action Detection.
IEEE Access, 2025

EmoCharacter: Evaluating the Emotional Fidelity of Role-Playing Agents in Dialogues.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

TGSAM-2: Text-Guided Medical Image Segmentation Using Segment Anything Model 2.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Towards Multi-Label Open Set Diagnosis for Medical Images.
Proceedings of the 22nd IEEE International Symposium on Biomedical Imaging, 2025

ConTrack3D: Contrastive Learning Contributes Concise 3D Multi-Object Tracking.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Human Simulacra: Benchmarking the Personification of Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Minimizing Disparities between Real and Pseudo Queries for Unsupervised Visual Grounding.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Uncertainty-Aware Dynamic Fusion for Multimodal Clinical Prediction Tasks.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Sketch-based Point Cloud Generation with Diffusion Model and Pre-training Enhancement.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

RoBGuard: Enhancing LLMs to Assess Risk of Bias in Clinical Trial Documents.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

GLiM: Integrating Graph Transformer and LLM for Document-Level Biomedical Relation Extraction with Incomplete Labeling.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

AS-Det: Active Sampling for Adaptive 3D Object Detection in Point Clouds.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
DRAC 2022: A public benchmark for diabetic retinopathy analysis on ultra-wide optical coherence tomography angiography images.
Patterns, March, 2024

Learning Music-Dance Representations Through Explicit-Implicit Rhythm Synchronization.
IEEE Trans. Multim., 2024

Cross-Fundus Transformer for Multi-modal Diabetic Retinopathy Grading with Cataract.
CoRR, 2024

CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart.
CoRR, 2024

Human Simulacra: A Step toward the Personification of Large Language Models.
CoRR, 2024

Semi-Supervised and Class-Imbalanced Open Set Medical Image Recognition.
IEEE Access, 2024

Mixtures of Experts for Audio-Visual Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CT<sup>2</sup>C-QA: Multimodal Question Answering over Chinese Text, Table and Chart.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DeepPointMap2: Accurate and Robust LiDAR-Visual SLAM with Neural Descriptors.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Anatomical Structure-Guided Medical Vision-Language Pre-training.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

ADSNet: Cross-Domain LTV Prediction with an Adaptive Siamese Network in Advertising.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Memory-Augmented Transformer for Efficient End-to-End Video Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Temporal Feature Aggregation for Efficient 2D Video Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Tag2Text: Guiding Vision-Language Model via Image Tagging.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Cross-Image Distillation for Semi-Supervised Semantic Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Exploring Object-Centered External Knowledge for Fine-Grained Video Paragraph Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2024

Lesion-Aware Open Set Medical Image Recognition with Domain Shift.
Proceedings of the IEEE International Conference on Acoustics, 2024

ControlCap: Controllable Captioning via No-Fuss Lexicon.
Proceedings of the IEEE International Conference on Acoustics, 2024

Fine-Granularity Face Sketch Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024

Domain Adaptation Using Pseudo Labels for COVID-19 Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Retrieval-Augmented Egocentric Video Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Advancing COVID-19 Detection in 3D CT Scans.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Evidential Open Set Recognition for Imbalanced Medical Images via Multi-level Data Augmentation.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024

PRIDE: Pediatric Radiological Image Diagnosis Engine Guided by Medical Domain Knowledge.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024

DeepPointMap: Advancing LiDAR SLAM with Unified Neural Descriptors.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Towards Evidential and Class Separable Open Set Object Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Deep cross-modal hashing with fine-grained similarity.
Appl. Intell., December, 2023

Class-Incremental Generalized Zero-Shot Learning.
Multim. Tools Appl., October, 2023

Rethinking Local and Global Feature Representation for Dense Prediction.
Pattern Recognit., 2023

Inject Semantic Concepts into Image Tagging for Open-Set Recognition.
CoRR, 2023

DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images.
CoRR, 2023

Chest X-Ray Quality Assessment Method With Medical Domain Knowledge Fusion.
IEEE Access, 2023

Enhancing Open-Set Object Detection via Uncertainty-Boxes Identification.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

CAMG: Context-Aware Moment Graph Network for Multimodal Temporal Activity Localization via Language.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

Ensemble Deep Learning Approaches for Myopic Maculopathy Plus Lesions Segmentation.
Proceedings of the Myopic Maculopathy Analysis, 2023

Towards Label-Efficient Deep Learning for Myopic Maculopathy Classification.
Proceedings of the Myopic Maculopathy Analysis, 2023

SPTNET: Span-based Prompt Tuning for Video Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Conditional Video-Text Reconstruction Network with Cauchy Mask for Weakly Supervised Temporal Sentence Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Class-aware Variational Auto-encoder for Open Set Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Video Captioning via Relation-Aware Graph Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

Boosting Fine-Grained Sketch-Based Image Retrieval with Self-Supervised Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

SCSGNet: Spatial-Correlated and Shape-Guided Network for Breast Mass Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Motion-Aware Video Paragraph Captioning via Exploring Object-Centered Internal Knowledge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Diabetic Retinopathy Grading with Weakly-Supervised Lesion Priors.
Proceedings of the IEEE International Conference on Acoustics, 2023

Large Language Models are Complex Table Parsers.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Sparse Frame Grouping Network with Action Centered for Untrimmed Video Paragraph Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Semi-MedSeq: Semi-supervised Semantic Segmentation for Medical Image Sequences.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

Enhanced Knowledge Injection for Radiology Report Generation.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

2022
Stacked Multimodal Attention Network for Context-Aware Video Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Balanced single-shot object detection using cross-context attention-guided network.
Pattern Recognit., 2022

AE-Net: Fine-grained sketch-based image retrieval via attention-enhanced network.
Pattern Recognit., 2022

CMC v2: Towards More Accurate COVID-19 Detection with Discriminative Video Priors.
CoRR, 2022

Self-Supervised Learning of Music-Dance Representation through Explicit-Implicit Rhythm Synchronization.
CoRR, 2022

FDVTS's Solution for 2nd COV19D Competition on COVID-19 Detection and Severity Analysis.
CoRR, 2022

Modality-aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Deep-OCTA: Ensemble Deep Learning Approaches for Diabetic Retinopathy Analysis on OCTA Images.
Proceedings of the Mitosis Domain Generalization and Diabetic Retinopathy Analysis, 2022

TCCNet: Temporally Consistent Context-Free Network for Semi-supervised Video Polyp Segmentation.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Dynamically Connected Graph Representation for Object Detection.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

STDNet: Spatio-Temporal Decomposed Network for Video Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Self-Supervised Video Representation Learning with Motion-Contrastive Perception.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Semantic-Driven Saliency-Context Separation for Video Captioning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

ST<sup>2PE</sup>: Spatial and Temporal Transformer for Pose Estimation.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

Boosting COVID-19 Severity Detection with Infection-Aware Contrastive Mixup Classification.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

CMC_v2: Towards More Accurate COVID-19 Detection with Discriminative Video Priors.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

CREAM: Weakly Supervised Object Localization via Class RE-Activation Mapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

MedSeq: Semantic Segmentation for Medical Image Sequences.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

Multi-contrast High Quality MR Image Super-Resolution with Dual Domain Knowledge Fusion.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

An Interpretable Causal Approach for Bronchopulmonary Dysplasia Prediction.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

Cross-Field Transformer for Diabetic Retinopathy Grading on Two-field Fundus Images.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

Single-Modality Endoscopic Polyp Segmentation via Random Color Reversal Synthesis and Two-Branched Learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

2021
Periphery-aware COVID-19 diagnosis with contrastive representation enhancement.
Pattern Recognit., 2021

Cross-modal retrieval with dual multi-angle self-attention.
J. Assoc. Inf. Sci. Technol., 2021

A Novel Multi-Sensor Fusion Based Object Detection and Recognition Algorithm for Intelligent Assisted Driving.
IEEE Access, 2021

Exploring Logical Reasoning for Referring Expression Comprehension.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Disentangled Feature Network for Fine-Grained Recognition.
Proceedings of the Neural Information Processing - 28th International Conference, 2021

MPN: Multimodal Parallel Network for Audio-Visual Event Localization.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Temporally Coarse to Fine Snippets Relationship Learning with Graph Convolution for Temporal Action Proposal Generation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Exploiting Deep Cross-Slice Features From CT Images For Multi-Class Pneumonia Classification.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

CMC-COV19D: Contrastive Mixup Classification for COVID-19 Diagnosis.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Object-Oriented Relational Distillation for Object Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Multimodal Speech Enhancement by Incorporating Self-Supervised and Curriculum Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

Semi-supervised Medical Image Segmentation with Distribution Calibration and Non-local Semantic Constraint.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

CellDet: Dual-Task Cell Detection Network for IHC-Stained Image Analysis.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020
Deep reinforcement hashing with redundancy elimination for effective image retrieval.
Pattern Recognit., 2020

Deep cascaded cross-modal correlation learning for fine-grained sketch-based image retrieval.
Pattern Recognit., 2020

Automated diabetic retinopathy grading and lesion detection based on the modified R-FCN object-detection algorithm.
IET Comput. Vis., 2020

Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Fuzzy Graph Neural Network for Few-Shot Learning.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

A Dynamic-Attention on Crowd Region with Physical Optical Flow Features for Crowd Counting.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Learning Error-Driven Curriculum for Crowd Counting.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Video Captioning With Temporal And Region Graph Convolution Network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

DG-FPN: Learning Dynamic Feature Fusion Based on Graph Convolution Network For Object Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Representation Reconstruction Head for Object Detection.
Proceedings of the IEEE International Conference on Image Processing, 2020

Learning Class-Based Graph Representation for Object Detection.
Proceedings of the ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020, 2020

OSD: An Occlusion Skeleton Dataset for Action Recognition.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Data-Efficient Histopathology Image Analysis with Deformation Representation Learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020

Zero-Shot Sketch-Based Image Retrieval via Graph Convolution Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2017
Adaptively Weighted Multi-task Deep Network for Person Attribute Classification.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Multi-task Deep Neural Network for Joint Face Recognition and Facial Attribute Prediction.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

2016
Flexible multi-task learning with latent task grouping.
Neurocomputing, 2016

2015
Cross-Modal Image-Tag Relevance Learning for Social Images.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

2013
Beauty is here: evaluating aesthetics in videos using multimodal features and free training data.
Proceedings of the ACM Multimedia Conference, 2013

Understanding and Predicting Interestingness of Videos.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2009
Web image retrieval reranking with multi-view clustering.
Proceedings of the 18th International Conference on World Wide Web, 2009

2005
Study on Optimized Bandwidth Selection Approach of Drifting Learning.
Proceedings of the Fifth International Conference on Computer and Information Technology (CIT 2005), 2005


  Loading...