We stand with Ukraine

We stand with Ukraine

Mingyu Ding

Orcid: 0000-0001-6556-8359

According to our database¹, Mingyu Ding authored at least 78 papers between 2014 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Context Autoencoder for Self-supervised Representation Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Int. J. Comput. Vis., January, 2024

Compositional Physical Reasoning of Objects and Events from Videos.

[BibT_eX]

[DOI]

,

,

,

,

,

Antonio Torralba

,

Joshua B. Tenenbaum

,

CoRR, 2024

WOMD-Reasoning: A Large-Scale Language Dataset for Interaction and Driving Intentions Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

Masayoshi Tomizuka

,

,

,

CoRR, 2024

Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Masayoshi Tomizuka

CoRR, 2024

RoadBEV: Road Surface Reconstruction in Bird's Eye View.

[BibT_eX]

[DOI]

,

,

,

,

Masayoshi Tomizuka

,

CoRR, 2024

Q-SLAM: Quadric Representations for Monocular SLAM.

[BibT_eX]

[DOI]

,

,

,

,

,

Masayoshi Tomizuka

,

,

,

CoRR, 2024

DrPlanner: Diagnosis and Repair of Motion Planners Using Large Language Models.

[BibT_eX]

[DOI]

,

,

,

Masayoshi Tomizuka

,

,

Matthias Althoff

CoRR, 2024

PhyGrasp: Generalizing Robotic Grasping with Physics-informed Large Multimodal Models.

[BibT_eX]

[DOI]

,

,

,

,

Masayoshi Tomizuka

,

,

CoRR, 2024

RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Depth-aware Volume Attention for Texture-less Stereo Matching.

[BibT_eX]

[DOI]

,

,

,

Masayoshi Tomizuka

,

CoRR, 2024

Open X-Embodiment: Robotic Learning Datasets and RT-X Models : Open X-Embodiment Collaboration.

[BibT_eX]

[DOI]

,

,

Abhiram Maddukuri

,

,

Abhishek Padalkar

,

,

,

,

,

,

,

,

Alexander Herzog

,

,

Alexander Khazatsky

,

,

,

,

,

,

Aniruddha Kembhavi

,

,

,

,

,

,

,

Ashwin Balakrishna

,

,

Ben Burgess-Limerick

,

,

Bernhard Schölkopf

,

,

,

,

,

,

,

,

,

,

Chenguang Huang

,

,

Christopher Agia

,

,

,

,

,

,

,

,

,

,

Dieter Büchler

,

Dinesh Jayaraman

,

Dmitry Kalashnikov

,

,

,

Ethan Paul Foster

,

,

,

,

,

,

,

Gaurav S. Sukhatme

,

Gautam Salhotra

,

,

,

,

,

,

,

,

,

,

,

,

Henrik I. Christensen

,

,

,

,

,

,

Ilija Radosavovic

,

,

,

Jad Abou-Chakra

,

,

,

,

,

,

,

Jeffrey Bingham

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

João Silvério

,

,

Jonathan Booher

,

Jonathan Tompson

,

,

,

,

,

,

,

,

,

,

Keerthana Gopalakrishnan

,

,

,

,

Kento Kawaharazuka

,

,

,

,

,

,

,

,

Krishnan Srinivasan

,

,

Kunal Pratap Singh

,

,

,

,

,

Lawrence Yunliang Chen

,

,

,

,

,

,

,

,

,

,

,

Masayoshi Tomizuka

,

,

Mateo Guaman Castro

,

,

,

,

,

,

,

,

Mohan Kumar Srirama

,

,

,

Naoaki Kanazawa

,

,

,

Nikhil J. Joshi

,

Niko Sünderhauf

,

,

,

Nur Muhammad (Mahi) Shafiullah

,

,

,

,

Pannag R. Sanketi

,

Patrick Tree Miller

,

,

,

,

Peter David Fagan

,

,

Pierre Sermanet

,

,

Priya Sundaresan

,

,

,

Rafael Rafailov

,

,

,

Roberto Martín-Martín

,

,

Rosario Scalise

,

,

,

,

,

Russell Mendonca

,

,

,

,

Samuel Bustamante

,

,

,

,

,

,

,

Shubham D. Sonawani

,

,

,

Siddhant Haldar

,

Siddharth Karamcheti

,

,

,

Soroush Nasiriany

,

,

,

,

Subramanian Ramamoorthy

,

,

Suneel Belkhale

,

,

,

Suvir Mirchandani

,

,

,

,

Tatsuya Matsushima

,

,

,

,

,

,

,

Travis Armstrong

,

,

,

,

Vincent Vanhoucke

,

,

,

Wolfram Burgard

,

,

,

,

,

,

,

,

,

Yecheng Jason Ma

,

,

Yevgen Chebotar

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Zichen Jeff Cui

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

VDT: General-purpose Video Diffusion Transformers via Mask Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

Masayoshi Tomizuka

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Tree-Planner: Efficient Close-loop Task Planning with Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

Mach. Intell. Res., August, 2023

Understanding Self-Supervised Pretraining with Part-Aware Representation Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Trans. Mach. Learn. Res., 2023

Quadric Representations for LiDAR Odometry, Mapping and Localization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Masayoshi Tomizuka

,

IEEE Robotics Autom. Lett., 2023

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Robotics Autom. Lett., 2023

SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution.

[BibT_eX]

[DOI]

,

,

,

Masayoshi Tomizuka

,

,

CoRR, 2023

A Survey of Reasoning with Foundation Models.

[BibT_eX]

[DOI]

,

Chuanyang Zheng

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2023

Interfacing Foundation Models' Embeddings.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Arul Aravinthan

,

,

CoRR, 2023

EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2023

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Shengbo Eben Li

,

Masayoshi Tomizuka

,

,

CoRR, 2023

Human-oriented Representation Learning for Robotic Manipulation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Masayoshi Tomizuka

,

CoRR, 2023

Generalizable Long-Horizon Manipulations with Large Language Models.

[BibT_eX]

[DOI]

,

,

,

Masayoshi Tomizuka

,

,

CoRR, 2023

RSRD: A Road Surface Reconstruction Dataset and Benchmark for Safe and Comfortable Autonomous Driving.

[BibT_eX]

[DOI]

,

,

,

Masayoshi Tomizuka

,

,

CoRR, 2023

Pre-training on Synthetic Driving Data for Trajectory Prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Masayoshi Tomizuka

,

CoRR, 2023

An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training.

[BibT_eX]

[DOI]

,

,

,

,

Masayoshi Tomizuka

,

Erik G. Learned-Miller

,

CoRR, 2023

VDT: An Empirical Study on Video Diffusion with Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

EC^2: Emergent Communication for Embodied Control.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Doubly-Robust Self-Training.

[BibT_eX]

[DOI]

,

,

Philip L. Jacobson

,

,

,

Michael I. Jordan

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Free Data Selection with General-Purpose Models.

[BibT_eX]

[DOI]

,

,

Masayoshi Tomizuka

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners.

[BibT_eX]

[DOI]

,

,

,

,

Masayoshi Tomizuka

,

Proceedings of the International Conference on Machine Learning, 2023

Planning with Large Language Models for Code Generation.

[BibT_eX]

[DOI]

,

,

,

,

Joshua B. Tenenbaum

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

TextPSG: Panoptic Scene Graph Generation from Textual Descriptions.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

EC<sup>2</sup>: Emergent Communication for Embodied Control.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Joshua B. Tenenbaum

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners.

[BibT_eX]

[DOI]

,

,

,

,

Hengshuang Zhao

,

Erik G. Learned-Miller

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond.

[BibT_eX]

[DOI]

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners.

[BibT_eX]

[DOI]

,

,

,

,

Hengshuang Zhao

,

Erik G. Learned-Miller

,

CoRR, 2022

NeRF-Loc: Transformer-Based Object Localization Within Neural Radiance Fields.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2022

Multimodal foundation models are better simulators of the human brain.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, 2022

LGDN: Language-Guided Denoising Network for Video-Language Modeling.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Learning Versatile Neural Architectures by Propagating Network Codes.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

ComPhy: Compositional Physical Reasoning of Objects and Events from Videos.

[BibT_eX]

[DOI]

,

,

,

,

Antonio Torralba

,

Joshua B. Tenenbaum

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

DaViT: Dual Attention Vision Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision, 2022

Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following.

[BibT_eX]

[DOI]

,

,

,

David Daniel Cox

,

,

Joshua B. Tenenbaum

,

Proceedings of the Conference on Robot Learning, 2022

2021

Affimer-Based Europium Chelates Allow Sensitive Optical Biosensing in a Range of Human Disease Biomarkers.

[BibT_eX]

[DOI]

,

Alexandre Vakurov

,

,

,

,

,

Sensors, 2021

Domain-Adaptive Few-Shot Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Compressed Video Contrastive Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-Supervised Video Representation Learning with Constrained Spatiotemporal Jigsaw.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

IEPT: Instance-Level and Episode-Level Pretext Tasks for Few-Shot Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

L2M-GAN: Learning To Manipulate Latent Space Semantics for Facial Attribute Editing.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

HR-NAS: Searching Efficient High-Resolution Neural Architectures With Lightweight Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

A Global Occlusion-Aware Approach to Self-Supervised Monocular Visual Odometry.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Domain-Adaptive Few-Shot Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2020

SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

Segmenting Transparent Objects in the Wild.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2020, 2020

Lightweight Action Recognition in Compressed Videos.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Learning Depth-Guided Convolutions for Monocular 3D Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Cross-domain mapping learning for transductive zero-shot learning.

[BibT_eX]

[DOI]

,

,

Comput. Vis. Image Underst., 2019

CamNet: Coarse-to-Fine Retrieval for Camera Re-Localization.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Face-Focused Cross-Stream Network for Deception Detection in Videos.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Domain-Invariant Projection Learning for Zero-Shot Recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

InsightGAN: Semi-Supervised Feature Learning with Generative Adversarial Network for Drug Abuse Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Neural Information Processing - 25th International Conference, 2018

Zero-Shot Learning with Superclasses.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Neural Information Processing - 25th International Conference, 2018

DeepInsight: Multi-Task Multi-Scale Deep Learning for Mental Disorder Diagnosis.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the British Machine Vision Conference 2018, 2018

2017

One-Step Facile Synthesis of Aptamer-Modified Graphene Oxide for Highly Specific Enrichment of Human A-Thrombin in Plasma.

[BibT_eX]

[DOI]

,

,

,

Sensors, 2017

2015

Research on the Interaction of Hydrogen-Bond Acidic Polymer Sensitive Sensor Materials with Chemical Warfare Agents Simulants by Inverse Gas Chromatography.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Sensors, 2015

2014

Portable Solid Phase Micro-Extraction Coupled with Ion Mobility Spectrometry System for On-Site Analysis of Chemical Warfare Agents and Simulants in Water Samples.

[BibT_eX]

[DOI]

,

,

,

,

,

Sensors, 2014

Loading...