Fan Ma

Orcid: 0000-0002-4131-1222

According to our database, Fan Ma authored at least 59 papers between 2012 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Analytical framework for UAV-enabled wireless communication with D2D networks.
Discov. Internet Things, December, 2025

Adversarial-Guided Diffusion for Multimodal LLM Attacks.
CoRR, July, 2025

MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2025

From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment.
CoRR, March, 2025

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization.
CoRR, February, 2025

TV-Dialogue: Crafting Theme-Aware Video Dialogues with Immersive Interaction.
CoRR, January, 2025

VLAB: Enhancing Video Language Pretraining by Feature Adapting and Blending.
IEEE Trans. Multim., 2025

Assessing large language models as assistive tools in medical consultations for Kawasaki disease.
Frontiers Artif. Intell., 2025

Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation.
CoRR, 2024

Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy.
CoRR, 2024

AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks.
CoRR, 2024

Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models.
CoRR, 2024

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.
CoRR, 2024

MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis.
CoRR, 2024

A new adversarial malware detection method based on enhanced lightweight neural network.
Comput. Secur., 2024

FedPAM: Federated Personalized Augmentation Model for Text-to-Image Retrieval.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.
Proceedings of the Computer Vision - ECCV 2024, 2024

MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Knowledge-Enhanced Dual-Stream Zero-Shot Composed Image Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Psychometry: An Omnifit Model for Image Reconstruction from Human Brain Activity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Clustering for Protein Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Vista-llama: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CapHuman: Capture Your Moments in Parallel Universes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Stitching Segments and Sentences towards Generalization in Video-Text Pre-training.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens.
CoRR, 2023

VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending.
CoRR, 2023

Temporal Perceiving Video-Language Pre-training.
CoRR, 2023

PD-SegNet: Semantic Segmentation of Small Agricultural Targets in Complex Environments.
IEEE Access, 2023

2022
Learning With Noisy Labels via Self-Reweighting From Class Centroids.
IEEE Trans. Neural Networks Learn. Syst., 2022

TSD-Truncated Structurally Aware Distance for Small Pest Object Detection.
Sensors, 2022

Weakly Supervised Moment Localization with Decoupled Consistent Concept Prediction.
Int. J. Comput. Vis., 2022

Unified Transformer Tracker for Object Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Research on LED multi-sector stable display model in variable rotational speed.
J. Comput. Methods Sci. Eng., 2021

2020
Self-paced Multi-view Co-training.
J. Mach. Learn. Res., 2020

SF-Net: Single-Frame Supervision for Temporal Action Localization.
Proceedings of the Computer Vision - ECCV 2020, 2020

Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Analysis of Single-Phase-to-Ground Faults at the Valve-Side of HB-MMCs in HVDC Systems.
IEEE Trans. Ind. Electron., 2019

Few-Example Object Detection with Model Communication.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

PLAC: Partitioning Based Lazy Classification.
J. Softw., 2019

Online Learning to Rank in a Listwise Approach for Information Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018
An Emergency Control Strategy for Isolated Power System of Three-Phase Inverter and Diesel-Engine Generator Operating in Parallel.
IEEE Access, 2018

Activities in Extended Video.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

2017
Few-shot Object Detection.
CoRR, 2017

A Dual-Network Progressive Approach to Weakly Supervised Object Detection.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

A co-training approach to the classification of local climate zones with multi-source data.
Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium, 2017

Emergency control strategy of hybrid power system under sudden load applying.
Proceedings of the IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society, Beijing, China, October 29, 2017

Directional interlocking overcurrent protection of microgrids powered by inverters injected with characteristic currents.
Proceedings of the IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society, Beijing, China, October 29, 2017

Self-Paced Co-training.
Proceedings of the 34th International Conference on Machine Learning, 2017

2013
The design and research of generator wave filter based on the APF and zig-zag transformers.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2013

2012
Fast image super resolution via local regression.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Background Subtraction Based on Multi-channel SILTP.
Proceedings of the Computer Vision - ACCV 2012 Workshops, 2012

