Feilong Tang

This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2026
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence.
CoRR, May, 2026

DermAgent: A Self-Reflective Agentic System for Dermatological Image Analysis with Multi-Tool Reasoning and Traceable Decision-Making.
CoRR, May, 2026

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development.
CoRR, March, 2026

Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding.
CoRR, March, 2026

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence.
CoRR, February, 2026

Hallucination Begins Where Saliency Drops.
CoRR, January, 2026

PsychEthicsBench: Evaluating Large Language Models Against Australian Mental Health Ethics.
CoRR, January, 2026

SAM2-UNet: segment anything 2 makes strong encoder for natural and medical image segmentation.
Vis. Intell., 2026

Rhythm of Opinion: Interpretable Hawkes-Graph Networks for Hierarchical Opinion Propagation.
Proceedings of the ACM Web Conference 2026, 2026

CNText2Sign and CNSign: Unified Chinese Sign Language Datasets for Bidirectional Accessibility.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

CP-CLIP: Customized Parameter Generation for Open-vocabulary Semantic Segmentation.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

DeepGB-TB: A Risk-Balanced Cross-Attention Gradient-Boosted Convolutional Network for Rapid, Interpretable Tuberculosis Screening.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

DeLo: Dual Decomposed Low-Rank Experts Collaboration for Continual Missing Modality Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process.
CoRR, November, 2025

Phenome-Wide Multi-Omics Integration Uncovers Distinct Archetypes of Human Aging.
CoRR, October, 2025

HeSRN: Representation Learning On Heterogeneous Graphs via Slot-Aware Retentive Network.
CoRR, October, 2025

SAM-DCE: Addressing Token Uniformity and Semantic Over-Smoothing in Medical Segmentation.
CoRR, September, 2025

Neighbor-Guided Unbiased Framework for Generalized Category Discovery in Medical Image Classification.
IEEE J. Biomed. Health Informatics, August, 2025

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers.
CoRR, August, 2025

NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding.
CoRR, August, 2025

StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding.
CoRR, August, 2025

Towards Efficient Medical Reasoning with Minimal Fine-Tuning Data.
CoRR, August, 2025

MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models.
CoRR, July, 2025

Dual-Stage Value-Guided Inference with Margin-Based Reward Adjustment for Fast and Faithful VLM Captioning.
CoRR, June, 2025

RationalVLA: A Rational Vision-Language-Action Model with Dual System.
CoRR, June, 2025

SAM-aware Test-time Adaptation for Universal Medical Image Segmentation.
CoRR, June, 2025

Confidence-Aware Self-Distillation for Multimodal Sentiment Analysis with Incomplete Modalities.
CoRR, June, 2025

TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification.
CoRR, May, 2025

Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery.
CoRR, May, 2025

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding.
CoRR, May, 2025

Rhythm of Opinion: A Hawkes-Graph Framework for Dynamic Propagation Analysis.
CoRR, April, 2025

PG-SAM: Prior-Guided SAM with Medical for Multi-organ Segmentation.
CoRR, March, 2025

ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos.
CoRR, March, 2025

Enforcing Consistency and Fairness in Multi-level Hierarchical Classification with a Mask-based Output Layer.
CoRR, March, 2025

Decoding the Flow: CauseMotion for Emotional Causality Analysis in Long-form Conversations.
CoRR, January, 2025

Beyond Words: AuralLLM and SignMST-C for Precise Sign Language Production and Bidirectional Accessibility.
CoRR, January, 2025

Identification of Maize Diseases Based on Dynamic Convolution and Tri-Attention Mechanism.
IEEE Access, 2025

Game-Theoretic Optimization for Coalition-Based Intrusion Detection with Multi-Source Adaptive Dispatching.
Proceedings of the 44th International Symposium on Reliable Distributed Systems, 2025

Decoding Causal Structure: End-to-End Mediation Pathways Inference.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Genesis: A Large-Scale Benchmark for Multimodal Large Language Model in Emotional Causality Analysis.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025

MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Robust Multimodal Learning for Ophthalmic Disease Grading via Disentangled Representation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Confidence-Aware Self-Distillation for Multimodal Sentiment Analysis with Incomplete Modalities.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Star with Bilinear Mapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DeepChest: Dynamic Gradient-Free Task Weighting for Effective Multi-Task Learning in Chest X-Ray Classification.
Proceedings of the IEEE International Conference on Big Data, 2025

PG-SAM: A Fine-Grained Prior-Guided SAM Framework for Prompt-Free Medical Image Segmentation.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2025

Decoding the Flow: CauseMotion for Emotional Causality Analysis in Long-form Conversations.
Proceedings of the IEEE International Conference on Advanced Visual and Signal-Based Systems, 2025

MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Neighbor Does Matter: Density-Aware Contrastive Learning for Medical Semi-supervised Segmentation.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Towards Realistic Semi-supervised Medical Image Classification.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Meta Curvature-Aware Minimization for Domain Generalization.
CoRR, 2024

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation.
CoRR, 2024

Discriminating retinal microvascular and neuronal differences related to migraines: Deep Learning based Crossectional Study.
CoRR, 2024

Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion Classification.
CoRR, 2024

Polyp-Mamba: Polyp Segmentation with Visual Mamba.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Generalizing to Unseen Domains in Diabetic Retinopathy with Disentangled Representations.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

2022
DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation.
CoRR, 2022

Stepwise Feature Fusion: Local Guides Global.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022


  Loading...