Jiaming Liu

Affiliations:

Peking University, School of Computer Science, Beijing, China

According to our database¹, Jiaming Liu authored at least 92 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

Robobench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain.

[BibT_eX]

[DOI]

CoRR, October, 2025

RepCaM++: Exploring Transparent Visual Prompt With Inference-Time Re-Parameterization for Neural Video Delivery.

[BibT_eX]

[DOI]

IEEE Trans. Mob. Comput., September, 2025

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, September, 2025

dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought.

[BibT_eX]

[DOI]

CoRR, September, 2025

WoW: Towards a World omniscient World model Through Embodied Interaction.

[BibT_eX]

[DOI]

CoRR, September, 2025

RwoR: Generating Robot Demonstrations from Human Hand Collection for Policy Learning without Robot.

[BibT_eX]

[DOI]

CoRR, July, 2025

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation.

[BibT_eX]

[DOI]

CoRR, July, 2025

MinD: Unified Visual Imagination and Control via Hierarchical World Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

SpikePingpong: High-Frequency Spike Vision-based Robot Learning for Precise Striking in Table Tennis Game.

[BibT_eX]

[DOI]

CoRR, June, 2025

Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning.

[BibT_eX]

[DOI]

CoRR, June, 2025

BEVUDA++: Geometric-Aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., May, 2025

SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping.

[BibT_eX]

[DOI]

CoRR, May, 2025

CrayonRobo: Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, May, 2025

EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?

[BibT_eX]

[DOI]

CoRR, March, 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model.

[BibT_eX]

[DOI]

CoRR, March, 2025

Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., February, 2025

CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World.

[BibT_eX]

[DOI]

CoRR, February, 2025

3DWG: 3D Weakly Supervised Visual Grounding via Category and Instance-Level Alignment.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Let's Verify and Reinforce Image Generation Step by Step.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping.

[BibT_eX]

[DOI]

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Veh., January, 2024

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

MAVIS: Mathematical Visual Instruction Tuning.

[BibT_eX]

[DOI]

CoRR, 2024

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception.

[BibT_eX]

[DOI]

CoRR, 2024

AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation.

[BibT_eX]

[DOI]

CoRR, 2024

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation.

[BibT_eX]

[DOI]

CoRR, 2024

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding.

[BibT_eX]

[DOI]

CoRR, 2024

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2024

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge - Multi-Task Robustness Track.

[BibT_eX]

[DOI]

CoRR, 2024

RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Any2Point: Empowering Any-Modality Large Models for Efficient 3D Understanding.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

No Time to Train: Empowering Non-Parametric Networks for Few-Shot 3D Scene Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FreeKD: Knowledge Distillation via Semantic Frequency Prompt.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NTO3D: Neural Target Object 3D Reconstruction with Segment Anything.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cloud-Device Collaborative Learning for Multimodal Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Autonomous Interactive Correction MLLM for Robust Robotic Manipulation.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

Efficient Deweahter Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation.

[BibT_eX]

[DOI]

CoRR, 2023

Cloud-Device Collaborative Learning for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding.

[BibT_eX]

[DOI]

CoRR, 2023

Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.

[BibT_eX]

[DOI]

CoRR, 2023

Split & Merge: Unlocking the Potential of Visual Adapters via Sparse Training.

[BibT_eX]

[DOI]

CoRR, 2023

ImageManip: Image-based Robotic Manipulation with Affordance-guided Next View Selection.

[BibT_eX]

[DOI]

CoRR, 2023

Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything.

[BibT_eX]

[DOI]

CoRR, 2023

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision.

[BibT_eX]

[DOI]

CoRR, 2023

Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks.

[BibT_eX]

[DOI]

CoRR, 2023

PM-DETR: Domain Adaptive Prompt Memory for Object Detection with Transformers.

[BibT_eX]

[DOI]

CoRR, 2023

DiffuseIR: Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images.

[BibT_eX]

[DOI]

CoRR, 2023

UniOcc: Unifying Vision-Centric 3D Occupancy Prediction with Geometric and Semantic Rendering.

[BibT_eX]

[DOI]

CoRR, 2023

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring Sparse Visual Prompt for Cross-domain Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery.

[BibT_eX]

[DOI]

Proceedings of the 33rd Workshop on Network and Operating System Support for Digital Audio and Video, 2023

DiffuseIR: Diffusion Models for Isotropic Reconstruction of 3D Microscopic Images.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

HQRetouch: Learning Professional Face Retouching Via Masked Feature Fusion and Semantic-Aware Modulation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Image Processing, 2023

A Comprehensive Comparison of Projections in Omnidirectional Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-World.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection.

[BibT_eX]

[DOI]

CoRR, 2022

Uncertainty Guided Depth Fusion for Spike Camera.

[BibT_eX]

[DOI]

CoRR, 2022

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.

[BibT_eX]

[DOI]

CoRR, 2022

Cross-Domain Object Detection with Mean-Teacher Transformer.

[BibT_eX]

[DOI]

CoRR, 2022

MTTrans: Cross-domain Object Detection with Mean Teacher Transformer.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Adaptive Patch Exiting for Scalable Single Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Efficient Meta-Tuning for Content-Aware Neural Video Delivery.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-Resolution.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

FGSD: A Dataset for Fine-Grained Ship Detection in High Resolution Satellite Images.

[BibT_eX]

[DOI]

CoRR, 2020

2019

Towards Accurate High Resolution Satellite Image Semantic Segmentation.

[BibT_eX]

[DOI]

IEEE Access, 2019

2018

Queuing Strategy Optimization with Restricted Service Resources.

[BibT_eX]

[DOI]

Wirel. Pers. Commun., 2018

Jiaming Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...