Ming Li

Orcid: 0000-0002-9948-4644

Affiliations:
  • Deakin University, VIC, Australia


According to our database, Ming Li authored at least 93 papers between 2002 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Learning from Noisy Preferences: A Semi-Supervised Learning Approach to Direct Preference Optimization.
CoRR, April, 2026

ClawEnvKit: Automatic Environment Generation for Claw-Like Agents.
CoRR, April, 2026

History-Conditioned Spatio-Temporal Visual Token Pruning for Efficient Vision-Language Navigation.
CoRR, March, 2026

Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry.
CoRR, January, 2026

LoL: Longer than Longer, Scaling Video Generation to Hour.
CoRR, January, 2026

DiffusionEngine: Diffusion model is scalable data engine for object detection.
Pattern Recognit., 2026

Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook.
Proceedings of the ACM Conference on AI and Agentic Systems, 2026

AnomalyPainter: Vision-Language-Diffusion Synergy for Realistic and Diverse Unseen Industrial Anomaly Synthesis.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
V-REX: Benchmarking Exploratory Visual Reasoning via Chain-of-Questions.
CoRR, December, 2025

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following.
CoRR, November, 2025

Can World Simulators Reason? Gen-ViRe: A Generative Visual Reasoning Benchmark.
CoRR, November, 2025

Fourier Transform Multiple Instance Learning for Whole Slide Image Classification.
CoRR, October, 2025

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation.
CoRR, October, 2025

RewardDance: Reward Scaling in Visual Generation.
CoRR, September, 2025

VisR-Bench: An Empirical Study on Visual Retrieval-Augmented Generation for Multilingual Long Document Understanding.
CoRR, August, 2025

MDK12-Bench: A Comprehensive Evaluation of Multimodal Large Language Models on Multidisciplinary Exams.
CoRR, August, 2025

CaughtCheating: Is Your MLLM a Good Cheating Detective? Exploring the Boundary of Visual Perception and Reasoning.
CoRR, July, 2025

Sekai: A Video Dataset towards World Exploration.
CoRR, June, 2025

AI Idea Bench 2025: AI Research Idea Generation Benchmark.
CoRR, April, 2025

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients.
CoRR, April, 2025

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
CoRR, April, 2025

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models.
CoRR, April, 2025

Towards Visual Text Grounding of Multimodal Large Language Model.
CoRR, April, 2025

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis.
CoRR, March, 2025

AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis.
CoRR, March, 2025

CPO: Condition Preference Optimization for Controllable Image Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

RuleR: Improving LLM Controllability by Rule-based Data Recycling.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

BenTo: Benchmark Reduction with In-Context Transferability.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

IL-NeRF: Incremental Learning for Neural Radiance Fields with Camera Pose Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era.
Proceedings of the 34th ACM International Conference on Information and Knowledge Management, 2025

Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

ATLAS: Agent Tuning via Learning Critical Steps.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective.
CoRR, 2024

BenTo: Benchmark Task Reduction with In-Context Transferability.
CoRR, 2024

PFID: Privacy First Inference Delegation Framework for LLMs.
CoRR, 2024

Mosaic IT: Enhancing Instruction Tuning with Data Mosaics.
CoRR, 2024

A Survey on Knowledge Distillation of Large Language Models.
CoRR, 2024

From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Frame Interpolation with Consecutive Brownian Bridge Diffusion.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Leveraging Biases in Large Language Models: "bias-kNN" for Effective Few-Shot Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

LucidDreaming: Controllable Object-Centric 3D Generation.
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
Proceedings of the Computer Vision - ECCV 2024, 2024

Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion.
IEEE Trans. Multim., 2023

Character-Aware Sampling and Rectification for Scene Text Recognition.
IEEE Trans. Multim., 2023

Dual Relation Network for Scene Text Recognition.
IEEE Trans. Multim., 2023

Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning.
CoRR, 2023

DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection.
CoRR, 2023

DLIP: Distilling Language-Image Pre-training.
CoRR, 2023

First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment.
CoRR, 2023

Beyond the Label Distribution Prior for Long-Tailed Recognition.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2023

AlignDet: Aligning Pre-training and Fine-tuning in Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Parallel Pre-trained Transformers (PPT) for Synthetic Data-based Instance Segmentation.
CoRR, 2022

Multi-granularity Distillation Scheme Towards Lightweight Semi-supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2013
Multidimensional Routing Protocol in Human-Associated Delay-Tolerant Networks.
IEEE Trans. Mob. Comput., 2013

Privacy Protected Data Forwarding in Human Associated Delay Tolerant Networks.
Proceedings of the 12th IEEE International Conference on Trust, 2013

A Hierarchical Cloud Pricing System.
Proceedings of the IEEE Ninth World Congress on Services, 2013

An Efficient Social Based Data Forwarding Mechanism for Mobile Cloud Computing.
Proceedings of the IEEE 9th International Conference on Mobile Ad-hoc and Sensor Networks, 2013

Anonymous Data Forwarding in Human Associated Delay Tolerant Networks.
Proceedings of the 33rd International Conference on Distributed Computing Systems Workshops (ICDCS 2013 Workshops), 2013

2012
M-Dimension: Multi-characteristics based routing protocol in human associated delay-tolerant networks with improved performance over one dimensional classic models.
J. Netw. Comput. Appl., 2012

AMDD: Exploring Entropy Based Anonymous Multi-dimensional Data Detection for Network Optimization in Human Associated DTNs.
Proceedings of the 11th IEEE International Conference on Trust, 2012

Effects of Social Characters in Viral Propagation Seeding Strategies in Online Social Networks.
Proceedings of the 11th IEEE International Conference on Trust, 2012

Detecting Topic Labels for Tweets by Matching Features from Pseudo-Relevance Feedback.
Proceedings of the Tenth Australasian Data Mining Conference, AusDM 2012, Sydney, 2012

2011
Improving P2P IPTV random peers search through user similarity.
Proceedings of the 5th International Conference on Network and System Security, 2011

MAR: Message-aware routing for opportunistic wireless ad hoc networks.
Proceedings of the Australasian Telecommunication Networks and Applications Conference, 2011

T-OSN: A Trust Evaluation Model in Online Social Networks.
Proceedings of the IEEE/IFIP 9th International Conference on Embedded and Ubiquitous Computing, 2011

Multi-level virtual ring: An architecture for content routing in wireless sensor network.
Proceedings of the IEEE 17th Asia-Pacific Conference on Communications, 2011

2010
Context-aware fusion: A case study on fusion of gait and face for human identification in video.
Pattern Recognit., 2010

S-Kcore: A Social-aware Kcore Decomposition Algorithm in Pocket Switched Networks.
Proceedings of the IEEE/IFIP 8th International Conference on Embedded and Ubiquitous Computing, 2010

OST: A Transaction Based Online Social Trust Model for Social Network and File Sharing Security.
Proceedings of the IEEE/IFIP 8th International Conference on Embedded and Ubiquitous Computing, 2010

2009
Image/video-based pattern analysis and HCI applications.
Pattern Recognit. Lett., 2009

Editorial.
Int. J. Pattern Recognit. Artif. Intell., 2009

Multi-Level Virtual Ring: Cross-Level Name Routing with Embedded Identifier in Sensor Network.
Proceedings of the 2009 International Conference on Wireless Networks, 2009

GeoConnect: Geographic and Connectivity-Aware Routing Protocol for Wireless Sensor Network.
Proceedings of the 2009 International Conference on Wireless Networks, 2009

2008
Adaptive Fusion of Gait and Face for Human Identification in Video.
Proceedings of the 9th IEEE Workshop on Applications of Computer Vision (WACV 2008), 2008

2007
Cross-layer Resource Control to Improve TCP Performance over Wireless Network.
Proceedings of the 6th Annual IEEE/ACIS International Conference on Computer and Information Science (ICIS 2007), 2007

2005
Fair intelligent admission control over resource-feedback DiffServ network.
Comput. Commun., 2005

FIAC: a resource discovery-based two-level admission control for differentiated service networks.
Comput. Commun., 2005

2004
Resource discovery and fair intelligent admission control over scalable Internet.
PhD thesis, 2004

Edge-aware resource discovery and fair intelligent admission control scheme over multi-domain differentiated services networks.
Proceedings of IEEE International Conference on Communications, 2004

2003
Achieving Flow Fairness in DiffServ Class: Per-flow Fair Admission Control over Differentiated Service Network.
Proceedings of the ACIS Fourth International Conference on Software Engineering, 2003

Fair intelligent admission control over DiffServ network.
Proceedings of the 11th IEEE International Conference on Networks, 2003

Class-Based Fair Intelligent Admission Control over an Enhanced Differentiated Service Network.
Proceedings of the Information Networking, 2003

2002
Fair Intelligent Congestion Control Resource Discovery Protocol on TCP based Network.
Proceedings of the Converged Networking: Data and Real-time Communications over IP, 2002

Fair Intelligent Feedback Mechanism on TCP Based Network.
Proceedings of the International Conference on Internet Computing, 2002

