Ruihang Chu

Orcid: 0000-0001-9057-745X

According to our database, Ruihang Chu authored at least 28 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
A Survey of Reasoning with Foundation Models: Concepts, Methodologies, and Outlook.
ACM Comput. Surv., November, 2025

Exploiting Discriminative Codebook Prior for Autoregressive Image Generation.
CoRR, August, 2025

DreamVE: Unified Instruction-based Image and Video Editing.
CoRR, August, 2025

TTS-VAR: A Test-Time Scaling Framework for Visual Auto-Regressive Generation.
CoRR, July, 2025

AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning.
CoRR, July, 2025

Zero-P-to-3: Zero-Shot Partial-View Images to 3D Object.
CoRR, May, 2025

Wan: Open and Advanced Large-Scale Video Generative Models.
CoRR, March, 2025

IterPref: Focal Preference Learning for Code Generation via Iterative Debugging.
CoRR, March, 2025

DialogGen: Multi-modal Interactive Dialogue System with Multi-turn Text-Image Generation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion.
CoRR, 2024

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models.
CoRR, 2024

DriveCoT: Integrating Chain-of-Thought Reasoning with End-to-End Driving.
CoRR, 2024

DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation.
CoRR, 2024

2023
A Survey of Reasoning with Foundation Models.
CoRR, 2023

DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation.
CoRR, 2023

DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DiffComplete: Diffusion-based Generative 3D Shape Completion.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mask-Attention-Free Transformer for 3D Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TriVol: Point Cloud Rendering via Triple Volumes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Command-driven Articulated Object Understanding and Manipulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
ICM-3D: Instantiated Category Modeling for 3D Instance Segmentation.
IEEE Robotics Autom. Lett., 2022

TWIST: Two-Way Inter-label Self-Training for Semi-supervised 3D Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Simultaneous Semantic and Collision Learning for 6-DoF Grasp Pose Estimation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Scale-Aware Automatic Augmentation for Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Co-Actuation: A Method for Achieving High Stiffness and Low Inertia for Haptic Devices.
IEEE Trans. Haptics, 2020

2019
An Intuitive End-to-End Human-UAV Interaction System for Field Exploration.
Frontiers Neurorobotics, 2019

Vehicle Re-Identification With Viewpoint-Aware Metric Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
