Lingting Zhu

Orcid: 0000-0002-1478-3232

According to our database1, Lingting Zhu authored at least 35 papers between 2020 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Generative Enhancement for 3D Medical Images.
Int. J. Comput. Vis., March, 2026

Cross-Scale Pansharpening via ScaleFormer and the PanScale Benchmark.
CoRR, March, 2026

AssetFormer: Modular 3D Assets Generation with Autoregressive Transformer.
CoRR, February, 2026

MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Albedo Post-Processing.
IEEE Trans. Image Process., 2026

Multi-contrast low-field MRI acceleration with k-space progressive learning and image-space hybrid attention fusion.
Medical Image Anal., 2026

2025
ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning.
CoRR, December, 2025

CaliTex: Geometry-Calibrated Attention for View-Coherent 3D Texture Generation.
CoRR, November, 2025

LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context.
CoRR, November, 2025

V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models.
CoRR, November, 2025

Large Material Gaussian Model for Relightable 3D Generation.
CoRR, September, 2025

Improving Foundation Model for Endoscopy Video Analysis via Representation Learning on Long Sequences.
IEEE J. Biomed. Health Informatics, May, 2025

StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation.
CoRR, May, 2025

DanceGRPO: Unleashing GRPO on Visual Generation.
CoRR, May, 2025

GarmentX: Autoregressive Parametric Representations for High-Fidelity 3D Garment Generation.
CoRR, April, 2025

MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-Processing.
CoRR, March, 2025

Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation.
CoRR, March, 2025

Unleash the Power of State Space Model for Whole Slide Image With Local Aware Scanning and Importance Resampling.
IEEE Trans. Medical Imaging, February, 2025

AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance.
CoRR, February, 2025

Multi-Sensor Learning Enables Information Transfer Across Different Sensory Data and Augments Multi-Modality Imaging.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025

AssetDropper: Asset Extraction via Diffusion Models with Reward-Driven Optimization.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2025

Generalizable Human Gaussians from Single-View Image.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

A Dynamic Agent Framework for Large Language Model Reasoning for Medical and Visual Question Answering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, ICCV 2025, 2025

MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Large Images Are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities.
CoRR, 2024

Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting.
CoRR, 2024

CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects.
CoRR, 2024

Low-to-High Frequency Progressive K-Space Learning for MRI Reconstruction.
Proceedings of the Machine Learning in Medical Imaging - 15th International Workshop, 2024

EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 Workshops, 2024

HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction.
Proceedings of the 35th British Machine Vision Conference, 2024

2023
IDRNet: Intervention-Driven Relation Network for Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Cheap Lunch for Medical Image Segmentation by Fine-Tuning SAM on Few Exemplars.
Proceedings of the Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries, 2023

2020
Machine Learning-Based Resource Optimization for D2D Communication Underlaying Networks.
Proceedings of the 92nd IEEE Vehicular Technology Conference, 2020


  Loading...