Yunze Liu
This page is a disambiguation page, it actually contains multiple papers from persons of the same or a similar name.
Bibliography
2026
OmniRetriever: Any-to-Any Audio-Video-Text Retrieval via Fusion-as-Teacher Distillation.
CoRR, May, 2026
O-MARC: Omni Memory-Augmented Compression Distillation for Efficient Video Understanding.
CoRR, May, 2026
Bridging Modalities, Spanning Time: Structured Memory for Ultra-Long Agentic Video Reasoning.
CoRR, May, 2026
A Neural Network and Genetic Algorithm-Based Model for Evaluating and Enhancing Quality-Oriented Teaching Systems.
J. Circuits Syst. Comput., April, 2026
NTIRE 2026 The 3rd Restore Any Image Model (RAIM) Challenge: Professional Image Quality Assessment (Track 1).
CoRR, April, 2026
CoRR, January, 2026
ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026
PointNet4D: A Lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2026
2025
CoRR, October, 2025
CoRR, September, 2025
CoRR, July, 2025
ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos.
CoRR, March, 2025
CoRR, March, 2025
IEEE Trans. Image Process., 2025
Ring Artifacts Correction Based on Global-Local Feature Interaction Guidance in the Projection Domain.
IEEE Trans. Instrum. Meas., 2025
A 40-μV Offset 130-dB CMRR Analog Front End With Automatic Offset Calibration and Common Mode Cancellation for High-Precision Instrumentation Measurement.
IEEE Trans. Instrum. Meas., 2025
MutualNeRF: Improve the Performance of NeRF under Limited Samples with Mutual Information Theory.
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2025
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
CULTURE3D: A Large-Scale and Diverse Dataset of Cultural Landmarks and Terrains for Gaussian-Based Scene Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
PhysReaction: Physically Plausible Real-Time Humanoid Reaction Synthesis via Forward Dynamics Guided 4D Imitation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
BMC Bioinform., December, 2023
Interactive Humanoid: Online Full-Body Motion Reaction Synthesis with Social Affordance Canonicalization and Forecasting.
CoRR, 2023
CoRR, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
2020
P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding.
CoRR, 2020