Ming Lu

Orcid: 0000-0002-2448-3081

Affiliations:

Intel Lab China, Cognitive Computing Laboratory, Beijing, China
Tsinghua University, Department of Information and Communication Engineering, National Key Laboratory for Multimedia Information Processing, Beijing, China (PhD 2019)

According to our database¹, Ming Lu authored at least 80 papers between 2017 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

Jarvis: Towards Personalized AI Assistant via Personal KV-Cache Retrieval.

CoRR, October, 2025

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks.

CoRR, October, 2025

BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models.

CoRR, September, 2025

ManipDreamer3D: Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D Trajectory.

CoRR, September, 2025

MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs.

CoRR, August, 2025

Small-Large Collaboration: Training-efficient Concept Personalization for Large VLM using a Meta Personalized Small VLM.

CoRR, August, 2025

UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying.

CoRR, August, 2025

FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning.

CoRR, July, 2025

FastInit: Fast Noise Initialization for Temporally Consistent Video Generation.

CoRR, June, 2025

AutoV: Learning to Retrieve Visual Prompt for Large Vision-Language Models.

CoRR, June, 2025

Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.

CoRR, June, 2025

OmniIndoor3D: Comprehensive Indoor 3D Reconstruction.

CoRR, May, 2025

UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens.

CoRR, May, 2025

ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance.

CoRR, April, 2025

EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler.

CoRR, April, 2025

Concept-as-Tree: Synthetic Data is All You Need for VLM Personalization.

CoRR, March, 2025

DreamCar: Leveraging Car-Specific Prior for In-the-Wild 3D Car Reconstruction.

IEEE Robotics Autom. Lett., February, 2025

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation.

CoRR, January, 2025

PLGS: Robust Panoptic Lifting With 3D Gaussian Splatting.

IEEE Trans. Image Process., 2025

GaussianEnhancer++: A General GS-Agnostic Rendering Enhancer.

Symmetry, 2025

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation.

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

GaussianEnhancer: A General Rendering Enhancer for Gaussian Splatting.

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians.

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection.

IEEE Trans. Intell. Veh., January, 2024

ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance.

CoRR, 2024

[CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.

CoRR, 2024

EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting.

CoRR, 2024

GazeGaussian: High-Fidelity Gaze Redirection with 3D Gaussian Splatting.

CoRR, 2024

MC-LLaVA: Multi-Concept Personalized Vision-Language Model.

CoRR, 2024

3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views.

CoRR, 2024

S³Gaussian: Self-Supervised Street Gaussians for Autonomous Driving.

CoRR, 2024

Implicit Neural Image Field for Biological Microscopy Image Compression.

CoRR, 2024

SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera.

CoRR, 2024

Proximity QA: Unleashing the Power of Multi-Modal Large Language Models for Spatial Proximity Analysis.

CoRR, 2024

RustNeRF: Robust Neural Radiance Field with Low-Quality Images.

CoRR, 2024

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

I-MedSAM: Implicit Medical Image Segmentation with Segment Anything.

Proceedings of the Computer Vision - ECCV 2024, 2024

NTO3D: Neural Target Object 3D Reconstruction with Segment Anything.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Semantically Disentangled Variational Autoencoder for Modeling 3D Facial Details.

IEEE Trans. Vis. Comput. Graph., August, 2023

MoEC: Mixture of Experts Implicit Neural Compression.

CoRR, 2023

NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything.

CoRR, 2023

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.

CoRR, 2023

MoWE: Mixture of Weather Experts for Multiple Adverse Weather Removal.

CoRR, 2023

RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery.

Proceedings of the 33rd Workshop on Network and Operating System Support for Digital Audio and Video, 2023

HQRetouch: Learning Professional Face Retouching Via Masked Feature Fusion and Semantic-Aware Modulation.

Proceedings of the IEEE International Conference on Image Processing, 2023

QD-BEV: Quantization-aware View-guided Distillation for Multi-view 3D Object Detection.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Comprehensive Comparison of Projections in Omnidirectional Super-Resolution.

Proceedings of the IEEE International Conference on Acoustics, 2023

CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Emotion-Preserving Blendshape Update With Real-Time Face Tracking.

IEEE Trans. Vis. Comput. Graph., 2022

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection.

CoRR, 2022

Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection.

CoRR, 2022

Uncertainty Guided Depth Fusion for Spike Camera.

CoRR, 2022

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.

CoRR, 2022

Adaptive Patch Exiting for Scalable Single Image Super-Resolution.

Proceedings of the Computer Vision - ECCV 2022, 2022

Structure-Aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation.

Proceedings of the Computer Vision - ECCV 2022, 2022

Efficient Meta-Tuning for Content-Aware Neural Video Delivery.

Proceedings of the Computer Vision - ECCV 2022, 2022

Multi-view 3D Morphable Face Reconstruction via Canonical Volume Fusion.

Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

3D Face Cartoonizer: Generating Personalized 3D Cartoon Faces from 2D Real Photos with a Hybrid Dataset.

Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

2021

Deep Likelihood Network for Image Restoration With Multiple Degradation Levels.

IEEE Trans. Image Process., 2021

Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

In-the-Wild Facial Highlight Removal via Generative Adversarial Networks.

Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-Resolution.

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Single image portrait relighting via explicit multiple reflectance channel modeling.

ACM Trans. Graph., 2020

Learning to Draw Sight Lines.

Int. J. Comput. Vis., 2020

Pointly-supervised scene parsing with uncertainty mixture.

Comput. Vis. Image Underst., 2020

LID 2020: The Learning from Imperfect Data Challenge Results.

CoRR, 2020

Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Explicit Residual Descent for 3D Human Pose Estimation from 2D Joint Locations.

Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019

A Closed-Form Solution to Universal Style Transfer.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

A Direct 3D Object Tracking Method Based on Dynamic Textured Model Rendering and Extended Dense Feature Fields.

Leisheng Zhong, Ming Lu, Li Zhang

IEEE Trans. Circuits Syst. Video Technol., 2018

Exemplar-Based Portrait Style Transfer.

IEEE Access, 2018

2017

Real-time 3D eyelids tracking from semantic edges.

ACM Trans. Graph., 2017

Decoder Network over Lightweight Reconstructed Feature for Fast Semantic Style Transfer.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

RON: Reverse Connection with Objectness Prior Networks for Object Detection.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
