Mingyu Liu

This is a disambiguation page: it lists papers by multiple people with the same or a similar name.

Bibliography

2026
DeepVerifier: Learning to Update Test Sequences for Coverage-Guided Verification.
ACM Trans. Design Autom. Electr. Syst., July, 2026

Decoupling Endpoint and Semantic Transition Learning for Zero-Shot Composed Image Retrieval.
CoRR, May, 2026

GaMMA: Towards Joint Global-Temporal Music Understanding in Large Multimodal Models.
CoRR, May, 2026

Context-guided and multi-level feature fusion DETR for lightweight detection of cacao pod pests and diseases in complex environments.
Int. J. Mach. Learn. Cybern., April, 2026

CCTVBench: Contrastive Consistency Traffic VideoQA Benchmark for Multimodal LLMs.
CoRR, April, 2026

LoViF 2026 Challenge on Real-World All-in-One Image Restoration: Methods and Results.
CoRR, April, 2026

OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering.
CoRR, April, 2026

SGTA: Scene-Graph Based Multi-Modal Traffic Agent for Video Understanding.
CoRR, April, 2026

ILVMamba: Illumination-Aware Lightweight Visual Mamba Framework for Efficient High-Resolution Image Enhancement.
IEEE Trans. Artif. Intell., March, 2026

Datasets, Metrics, Benchmarks and Future Research in Autonomous Driving: A Review.
IEEE CAA J. Autom. Sinica, March, 2026

World Guidance: World Modeling in Condition Space for Action Generation.
CoRR, February, 2026

VideoAfford: Grounding 3D Affordance from Human-Object-Interaction Videos via Multimodal Large Language Model.
CoRR, February, 2026

HiePlace: Efficient Hierarchical PCB Placement.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., January, 2026

FD-TE Diagnosis: Enhancing Microservice Fault Diagnosis With Frequency Domain Features and Centrality-Aware Time Encoding.
IEEE Trans. Reliab., 2026

DRFIR: A dimensionality reduction framework for all-in-one image restoration in spatial and frequency domains.
Expert Syst. Appl., 2026

LLM-Powered Structurer: Normalizing Natural Language to Information Delivery Specification for Industrial Data Exchange.
Proceedings of the Companion Proceedings of the ACM Web Conference 2026, 2026

FalconFS: Distributed File System for Large-Scale Deep Learning Pipeline.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

An Energy-Efficient Multimodal Retrieval Framework for Inference on Heterogeneous Edge Nodes.
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026

Topological Optimization-Based Layer Assignment Method for Fan-Out Wafer-Level Packaging.
Proceedings of the 31st Asia and South Pacific Design Automation Conference, 2026

Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

SDEval: Safety Dynamic Evaluation for Multimodal Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Frequency-Prompted Image Restoration to Enhance Perception in Intelligent Transportation Systems.
IEEE Trans. Intell. Transp. Syst., December, 2025

Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion.
CoRR, October, 2025

StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation.
CoRR, October, 2025

Bridge Thinking and Acting: Unleashing Physical Potential of VLM with Generalizable Action Expert.
CoRR, October, 2025

NoTVLA: Narrowing of Dense Action Trajectories for Generalizable Robot Manipulation.
CoRR, October, 2025

Modumer: Modulating Transformer for Image Restoration.
IEEE Trans. Neural Networks Learn. Syst., September, 2025

Learning by Imagining: Debiased Feature Augmentation for Compositional Zero-Shot Learning.
CoRR, September, 2025

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling.
CoRR, September, 2025

ScaleCache: Scalable and Production-grade Buffer Management for Disk-based Database Systems.
Proc. VLDB Endow., August, 2025

Learning Primitive Embodied World Models: Towards Scalable Robotic Learning.
CoRR, August, 2025

Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Model.
CoRR, August, 2025

SDEval: Safety Dynamic Evaluation for Multimodal Large Language Models.
CoRR, August, 2025

DAG: Unleash the Potential of Diffusion Model for Open-Vocabulary 3D Affordance Grounding.
CoRR, August, 2025

LIEDNet: A Lightweight Network for Low-Light Enhancement and Deblurring.
IEEE Trans. Circuits Syst. Video Technol., July, 2025

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO.
CoRR, May, 2025

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration.
CoRR, May, 2025

CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning.
CoRR, May, 2025

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks.
CoRR, February, 2025

TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes.
CoRR, February, 2025

RECaching: Cost-Effective Edge Caching for Cloud Storage With Differentiated Regional Workloads.
IEEE Trans. Serv. Comput., 2025

Plasticized electrohydraulic robot autopilots in the deep sea.
Sci. Robotics, 2025

An efficient scale-aware model based on the improved RT-DETR for pomegranate growth stage detection.
Neurocomputing, 2025

RFM_Trans: Runoff forecasting model for catchment flood protection using strategies optimized Transformer.
Expert Syst. Appl., 2025

ERL-RTDETR: A Lightweight Transformer-Based Framework for High-Accuracy Apple Disease Detection in Precision Agriculture.
Concurr. Comput. Pract. Exp., 2025

Construction of a Person-Job Temporal Knowledge Graph Using Large Language Models.
Big Data Cogn. Comput., 2025

3D Understanding of Deformable Linear Objects: Datasets and Transferability Benchmark.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Generative Video Matting.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

PointCompress3D: A Point Cloud Compression Framework for Roadside Lidars in Intelligent Transportation Systems.
Proceedings of the 28th IEEE International Conference on Intelligent Transportation Systems, 2025

Surface3D: A Surface-Aware Framework for Refining 3D Object Detection.
Proceedings of the 28th IEEE International Conference on Intelligent Transportation Systems, 2025

Multi-view Joint Online LiDAR-Camera Extrinsic Calibration.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2025

MambaSFLNet: A Mamba-based Model for Low-Light Image Enhancement with Spatial and Frequency Features.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

DepthCNE: Contrastive Neighbor Embedding in Self-Supervised Learning for Point Clouds.
Proceedings of the International Joint Conference on Neural Networks, 2025

Seeing the Unseen: Composing Outliers for Compositional Zero-Shot Learning.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

TUMTraf CrossVision: A Multi-View Multi-Modal Vision Dataset for Arterial Intersection Traffic Surveillance.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2025

TUMTraf VideoQA: Dataset and Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Transformer-Based Spatial-Temporal Counterfactual Outcomes Estimation.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Graph Structure Learning via Transfer Entropy for Multivariate Time Series Anomaly Detection.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Disa: Accurate Learning-based Static Disassembly with Attentions.
Proceedings of the 2025 ACM SIGSAC Conference on Computer and Communications Security, 2025

2024
Collaborative Storage for Tiered Cloud and Edge: A Perspective of Optimizing Cost and Latency.
IEEE Trans. Mob. Comput., December, 2024

A Survey on Autonomous Driving Datasets: Statistics, Annotation Quality, and a Future Outlook.
IEEE Trans. Intell. Veh., November, 2024

Few-shot class incremental learning via prompt transfer and knowledge distillation.
Image Vis. Comput., 2024

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence.
CoRR, 2024

Look Within, Why LLMs Hallucinate: A Causal Perspective.
CoRR, 2024

A Transformer variant for multi-step forecasting of water level and hydrometeorological sensitivity analysis based on explainable artificial intelligence technology.
CoRR, 2024

PointCompress3D - A Point Cloud Compression Framework for Roadside LiDARs in Intelligent Transportation Systems.
CoRR, 2024

Diffusion Models Trained with Large Data Are Transferable Visual Models.
CoRR, 2024

A Survey on Autonomous Driving Datasets: Data Statistic, Annotation, and Outlook.
CoRR, 2024

FSDN: Image frequency and semantic decomposition network for image dehazing.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2024

WARM-3D: A Weakly-Supervised Sim2Real Domain Adaptation Framework for Roadside Monocular 3D Object Detection.
Proceedings of the 27th IEEE International Conference on Intelligent Transportation Systems, 2024

GraphRelate3D: Context-Dependent 3D Object Detection with Inter-Object Relationship Graphs.
Proceedings of the 27th IEEE International Conference on Intelligent Transportation Systems, 2024

X-Cover: Better Music Version Identification System by Integrating Pretrained ASR Model.
Proceedings of the 25th International Society for Music Information Retrieval Conference, 2024

Hybrid Frequency Modulation Network for Image Restoration.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

(Neo4j)^ Browser: Visualizing Variable-Aware Analysis Results.
Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering: Companion Proceedings, 2024

TrafficScene: A Multi-modal Dataset including Light Field for Semantic Segmentation of Traffic Scenes.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Boundary-Driven Active Learning for Anomaly Detection in Time Series Data Streams.
Proceedings of the IEEE International Conference on Acoustics, 2024

ByteHum: Fast and Accurate Query-by-Humming in the Wild.
Proceedings of the IEEE International Conference on Acoustics, 2024

Take a Step Back: Rethinking the Two Stages in Visual Reasoning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Exploring Interactive Color Palettes for Abstraction-Driven Exploratory Image Colorization.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

2023
Stochastic bounded consensus for multi-agent systems with fractional Brownian motions via sliding mode control.
Appl. Math. Comput., June, 2023

RLTiering: A Cost-Driven Auto-Tiering System for Two-Tier Cloud Storage Using Deep Reinforcement Learning.
IEEE Trans. Parallel Distributed Syst., February, 2023

Multi-weight susceptible-infected model for predicting COVID-19 in China.
Neurocomputing, 2023

Cost Optimization for Cloud Storage from User Perspectives: Recent Advances, Taxonomy, and Survey.
ACM Comput. Surv., 2023

Vision Language Models in Autonomous Driving and Intelligent Transportation Systems.
CoRR, 2023

Implementing a new fully stepwise decomposition-based sampling technique for the hybrid water level forecasting model in real-world application.
CoRR, 2023

Looking and Listening: Audio Guided Text Recognition.
CoRR, 2023

Neuro-Causal Factor Analysis.
CoRR, 2023

On the Hidden Mystery of OCR in Large Multimodal Models.
CoRR, 2023

Local-Adaptive Transformer for Multivariate Time Series Anomaly Detection and Diagnosis.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on Reading the Seal Title.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

NeuroEscape: Ordered Escape Routing via Monte-Carlo Tree Search and Neural Network.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

Smoothing Point Adjustment-Based Evaluation of Time Series Anomaly Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

Instance-Wise Adaptive Tuning and Caching for Vision-Language Models.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

FanoutNet: A Neuralized PCB Fanout Automation Method Using Deep Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Resource allocation applied to flexible printed circuit routing based on constrained Delaunay triangulation.
Integr., 2022

Effeclouds: A cost-effective cloud-of-clouds framework for two-tier storage.
Future Gener. Comput. Syst., 2022

3D Object Detection with a Self-supervised Lidar Scene Flow Backbone.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Keep Hot or Go Cold: A Randomized Online Migration Algorithm for Cost Optimization in STaaS Clouds.
IEEE Trans. Netw. Serv. Manag., 2021

Analysis of Glomerular Filtration Rate in Ischemic Cerebrovascular Diseases under the Magnetic Resonance Angiography Image Segmentation Algorithm.
Sci. Program., 2021

Antecedents and consequences of supply chain risk management capabilities: an investigation in the post-coronavirus crisis.
Int. J. Prod. Res., 2021

Behaviour detection in crowded classroom scenes via enhancing features robust to scale and perspective variations.
IET Image Process., 2021

Analysis on the Establishment of the Standards for the Shelves of Paper Books in University Libraries Based on the Book-Reader Association.
Proceedings of the IPEC 2021: 2nd Asia-Pacific Conference on Image Processing, 2021

Fden: Mining Effective Information of Features in Detecting Network Anomalies.
Proceedings of the IEEE International Conference on Acoustics, 2021

Machine Learning Based Acceleration Method for Ordered Escape Routing.
Proceedings of the GLSVLSI '21: Great Lakes Symposium on VLSI 2021, 2021

NH-CIL: A Nested Hierarchy Algorithm for Class Incremental Learning.
Proceedings of the 24th IEEE International Conference on Computer Supported Cooperative Work in Design, 2021

2020
Cyber-physical resilience modelling and assessment of urban roadway system interrupted by rainfall.
Reliab. Eng. Syst. Saf., 2020

High-Resolution Reconstruction of the Maximum Snow Water Equivalent Based on Remote Sensing Data in a Mountainous Area.
Remote. Sens., 2020

Validation of the SNTHERM Model Applied for Snow Depth, Grain Size, and Brightness Temperature Simulation at Meteorological Stations in China.
Remote. Sens., 2020

UAV monitoring and forecasting model in intelligent traffic oriented applications.
Comput. Commun., 2020

A Cooperative Lane Change Model for Connected and Automated Vehicles.
IEEE Access, 2020

Multimodal Deep Learning Framework for Mental Disorder Recognition.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

Automatic Detection of Self-Adaptors for Psychological Distress.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

Efficient Semantic Enrichment Process for Spatiotemporal Trajectories in Geospatial Environment.
Proceedings of the Web and Big Data - 4th International Joint Conference, 2020

2019
Turning responsible purchasing and supply into supply chain responsiveness.
Ind. Manag. Data Syst., 2019

To Transfer or Not: An Online Cost Optimization Algorithm for Using Two-Tier Storage-as-a-Service Clouds.
IEEE Access, 2019

Central-Diffused Instance Generation Method in Class Incremental Learning.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Deep Learning, 2019

2017
A Reconstruction-Registration Integrated Data Fusion Method for Measurement of Multiscaled Complex Surfaces.
IEEE Trans. Instrum. Meas., 2017

Full-differential paralleled high slew rate linear current sink and applications.
Proceedings of the IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society, Beijing, China, October 29, 2017

Exemplar-Based Photo Color Correction by Exploring Visual Aesthetics.
Proceedings of the Internet Multimedia Computing and Service, 2017

2015
Gunslinger: Subtle Arms-down Mid-air Interaction.
Proceedings of the 28th Annual ACM Symposium on User Interface Software & Technology, 2015

Noninvasive breast tumors detection based on saliva protein surface enhanced Raman spectroscopy and regularized multinomial regression.
Proceedings of the 8th International Conference on Biomedical Engineering and Informatics, 2015

Test of label-free Nasopharyngeal carinoma tissue at different stages by Raman spectroscopy.
Proceedings of the 8th International Conference on Biomedical Engineering and Informatics, 2015

2007
Brain-Computer Interfaces Based on Attention and Complex Mental Tasks.
Proceedings of the Digital Human Modeling, 2007

2005
Non-negative Matrix Factorizations Based Spontaneous Electroencephalographic Signals Classification Using Back Propagation Feedback Neural Networks.
Proceedings of the Advances in Neural Networks - ISNN 2005, Second International Symposium on Neural Networks, Chongqing, China, May 30, 2005

2000
Factorable FIR Nyquist filters with least stopband energy under sidelobe level constraints.
IEEE Trans. Signal Process., 2000

1998
Scaling functions and the optimum Nyquist-type signaling waveform in digital communications.
Proceedings of the 9th IEEE International Symposium on Personal, 1998

