Minghan Li

This is a disambiguation page: it lists multiple papers by different people with the same or a similar name.

Bibliography

2026
Applying a particle swarm optimizer controlled by a fuzzy logic controller in a multi-pivot means clustering algorithm.
Appl. Soft Comput., 2026

2025
SplitFlow: Flow Decomposition for Inversion-Free Text-to-Image Editing.
CoRR, October, 2025

UrbanVLA: A Vision-Language-Action Model for Urban Micromobility.
CoRR, October, 2025

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking.
CoRR, October, 2025

MM-Nav: Multi-View VLA Model for Robust Visual Navigation via Multi-Expert Learning.
CoRR, October, 2025

Embodied Navigation Foundation Model.
CoRR, September, 2025

Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey.
CoRR, September, 2025

A Survey of Long-Document Retrieval in the PLM and LLM Era.
CoRR, September, 2025

Delta Velocity Rectified Flow for Text-to-Image Editing.
CoRR, September, 2025

FairFedMed: Benchmarking Group Fairness in Federated Medical Imaging with FairLoRA.
CoRR, August, 2025

VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding.
CoRR, July, 2025

TrackVLA: Embodied Visual Tracking in the Wild.
CoRR, May, 2025

HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard.
CoRR, March, 2025

FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models.
CoRR, March, 2025

Efficient CORDIC-Based Activation Functions for RNN Acceleration on FPGAs.
IEEE Trans. Artif. Intell., January, 2025

NaVid-4D: Unleashing Spatial Intelligence in Egocentric RGB-D Videos for Vision-and-Language Navigation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

MaSS13K: A Matting-level Semantic Segmentation Benchmark.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.
CoRR, 2024

Experimental Secure Multiparty Computation from Quantum Oblivious Transfer with Bit Commitment.
CoRR, 2024

ReCIDE: robust estimation of cell type proportions by integrating single-reference-based deconvolutions.
Briefings Bioinform., 2024

Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion Recognition.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

2021
Field demonstration of distributed quantum sensing without post-selection.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2021

