Jiaming Liu

Affiliations:
  • Peking University, School of Computer Science, Beijing, China


According to our database1, Jiaming Liu authored at least 88 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
RepCaM++: Exploring Transparent Visual Prompt With Inference-Time Re-Parameterization for Neural Video Delivery.
IEEE Trans. Mob. Comput., September, 2025

RwoR: Generating Robot Demonstrations from Human Hand Collection for Policy Learning without Robot.
CoRR, July, 2025

AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation.
CoRR, July, 2025

MinD: Unified Visual Imagination and Control via Hierarchical World Models.
CoRR, June, 2025

SpikePingpong: High-Frequency Spike Vision-based Robot Learning for Precise Striking in Table Tennis Game.
CoRR, June, 2025

Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning.
CoRR, June, 2025

BEVUDA++: Geometric-Aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection.
IEEE Trans. Circuits Syst. Video Technol., May, 2025

SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping.
CoRR, May, 2025

CrayonRobo: Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation.
CoRR, May, 2025

3DWG: 3D Weakly Supervised Visual Grounding via Category and Instance-Level Alignment.
CoRR, May, 2025

EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?
CoRR, March, 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model.
CoRR, March, 2025

Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning.
IEEE Trans. Neural Networks Learn. Syst., February, 2025

CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World.
CoRR, February, 2025

MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Let's Verify and Reinforce Image Generation Step by Step.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection.
IEEE Trans. Intell. Veh., January, 2024

RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation.
CoRR, 2024

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation.
CoRR, 2024

MAVIS: Mathematical Visual Instruction Tuning.
CoRR, 2024

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception.
CoRR, 2024

AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic Manipulation.
CoRR, 2024

RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation.
CoRR, 2024

Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation.
CoRR, 2024

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation.
CoRR, 2024

Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding.
CoRR, 2024

Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection.
CoRR, 2024

A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge - Multi-Task Robustness Track.
CoRR, 2024

RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Any2Point: Empowering Any-Modality Large Models for Efficient 3D Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model.
Proceedings of the Computer Vision - ECCV 2024, 2024

No Time to Train: Empowering Non-Parametric Networks for Few-Shot 3D Scene Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FreeKD: Knowledge Distillation via Semantic Frequency Prompt.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

NTO3D: Neural Target Object 3D Reconstruction with Segment Anything.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cloud-Device Collaborative Learning for Multimodal Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Autonomous Interactive Correction MLLM for Robust Robotic Manipulation.
Proceedings of the Conference on Robot Learning, 6-9 November 2024, Munich, Germany., 2024

Efficient Deweahter Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation.
CoRR, 2023

Cloud-Device Collaborative Learning for Multimodal Large Language Models.
CoRR, 2023

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding.
CoRR, 2023

Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation.
CoRR, 2023

Split & Merge: Unlocking the Potential of Visual Adapters via Sparse Training.
CoRR, 2023

ImageManip: Image-based Robotic Manipulation with Affordance-guided Next View Selection.
CoRR, 2023

Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation.
CoRR, 2023

NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything.
CoRR, 2023

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision.
CoRR, 2023

Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks.
CoRR, 2023

PM-DETR: Domain Adaptive Prompt Memory for Object Detection with Transformers.
CoRR, 2023

DiffuseIR: Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images.
CoRR, 2023

UniOcc: Unifying Vision-Centric 3D Occupancy Prediction with Geometric and Semantic Rendering.
CoRR, 2023

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation.
CoRR, 2023

Exploring Sparse Visual Prompt for Cross-domain Semantic Segmentation.
CoRR, 2023

RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery.
Proceedings of the 33rd Workshop on Network and Operating System Support for Digital Audio and Video, 2023

DiffuseIR: Diffusion Models for Isotropic Reconstruction of 3D Microscopic Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

HQRetouch: Learning Professional Face Retouching Via Masked Feature Fusion and Semantic-Aware Modulation.
Proceedings of the IEEE International Conference on Image Processing, 2023

A Comprehensive Comparison of Projections in Omnidirectional Super-Resolution.
Proceedings of the IEEE International Conference on Acoustics, 2023

CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-World.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection.
CoRR, 2022

Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection.
CoRR, 2022

Uncertainty Guided Depth Fusion for Spike Camera.
CoRR, 2022

Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer.
CoRR, 2022

Cross-Domain Object Detection with Mean-Teacher Transformer.
CoRR, 2022

MTTrans: Cross-domain Object Detection with Mean Teacher Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Adaptive Patch Exiting for Scalable Single Image Super-Resolution.
Proceedings of the Computer Vision - ECCV 2022, 2022

Efficient Meta-Tuning for Content-Aware Neural Video Delivery.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Overfitting the Data: Compact Neural Video Delivery via Content-aware Feature Modulation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

SamplingAug: On the Importance of Patch Sampling Augmentation for Single Image Super-Resolution.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
FGSD: A Dataset for Fine-Grained Ship Detection in High Resolution Satellite Images.
CoRR, 2020

2019
Towards Accurate High Resolution Satellite Image Semantic Segmentation.
IEEE Access, 2019

2018
Queuing Strategy Optimization with Restricted Service Resources.
Wirel. Pers. Commun., 2018


  Loading...