Feilong Tang

This is a disambiguation page: it contains multiple papers by people with the same or a similar name.

Bibliography

2025
Unified Diffusion VLA: Vision-Language-Action Model via Joint Discrete Denoising Diffusion Process.
CoRR, November, 2025

Phenome-Wide Multi-Omics Integration Uncovers Distinct Archetypes of Human Aging.
CoRR, October, 2025

HeSRN: Representation Learning On Heterogeneous Graphs via Slot-Aware Retentive Network.
CoRR, October, 2025

SAM-DCE: Addressing Token Uniformity and Semantic Over-Smoothing in Medical Segmentation.
CoRR, September, 2025

Neighbor-Guided Unbiased Framework for Generalized Category Discovery in Medical Image Classification.
IEEE J. Biomed. Health Informatics, August, 2025

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers.
CoRR, August, 2025

ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.
CoRR, August, 2025

NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding.
CoRR, August, 2025

DeepGB-TB: A Risk-Balanced Cross-Attention Gradient-Boosted Convolutional Network for Rapid, Interpretable Tuberculosis Screening.
CoRR, August, 2025

StreamAgent: Towards Anticipatory Agents for Streaming Video Understanding.
CoRR, August, 2025

Towards Efficient Medical Reasoning with Minimal Fine-Tuning Data.
CoRR, August, 2025

MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models.
CoRR, July, 2025

Dual-Stage Value-Guided Inference with Margin-Based Reward Adjustment for Fast and Faithful VLM Captioning.
CoRR, June, 2025

RationalVLA: A Rational Vision-Language-Action Model with Dual System.
CoRR, June, 2025

SAM-aware Test-time Adaptation for Universal Medical Image Segmentation.
CoRR, June, 2025

Confidence-Aware Self-Distillation for Multimodal Sentiment Analysis with Incomplete Modalities.
CoRR, June, 2025

DeepChest: Dynamic Gradient-Free Task Weighting for Effective Multi-Task Learning in Chest X-ray Classification.
CoRR, May, 2025

TAGS: A Test-Time Generalist-Specialist Framework with Retrieval-Augmented Reasoning and Verification.
CoRR, May, 2025

Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery.
CoRR, May, 2025

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding.
CoRR, May, 2025

Rhythm of Opinion: A Hawkes-Graph Framework for Dynamic Propagation Analysis.
CoRR, April, 2025

PG-SAM: Prior-Guided SAM with Medical for Multi-organ Segmentation.
CoRR, March, 2025

ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos.
CoRR, March, 2025

Enforcing Consistency and Fairness in Multi-level Hierarchical Classification with a Mask-based Output Layer.
CoRR, March, 2025

Decoding the Flow: CauseMotion for Emotional Causality Analysis in Long-form Conversations.
CoRR, January, 2025

Beyond Words: AuralLLM and SignMST-C for Precise Sign Language Production and Bidirectional Accessibility.
CoRR, January, 2025

Identification of Maize Diseases Based on Dynamic Convolution and Tri-Attention Mechanism.
IEEE Access, 2025

MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Robust Multimodal Learning for Ophthalmic Disease Grading via Disentangled Representation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2025, 2025

Confidence-Aware Self-Distillation for Multimodal Sentiment Analysis with Incomplete Modalities.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Exploring CLIP's Dense Knowledge for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Star with Bilinear Mapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Decoding the Flow: CauseMotion for Emotional Causality Analysis in Long-form Conversations.
Proceedings of the IEEE International Conference on Advanced Visual and Signal-Based Systems, 2025

MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Neighbor Does Matter: Density-Aware Contrastive Learning for Medical Semi-supervised Segmentation.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Incomplete Modality Disentangled Representation for Ophthalmic Disease Grading and Diagnosis.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Towards Realistic Semi-supervised Medical Image Classification.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Meta Curvature-Aware Minimization for Domain Generalization.
CoRR, 2024

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation.
CoRR, 2024

OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining.
CoRR, 2024

SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation.
CoRR, 2024

Discriminating retinal microvascular and neuronal differences related to migraines: Deep Learning based Crossectional Study.
CoRR, 2024

Diffusion Model Driven Test-Time Image Adaptation for Robust Skin Lesion Classification.
CoRR, 2024

Polyp-Mamba: Polyp Segmentation with Visual Mamba.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Generalizing to Unseen Domains in Diabetic Retinopathy with Disentangled Representations.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

2022
DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation.
CoRR, 2022

Stepwise Feature Fusion: Local Guides Global.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

