Yikang Ding

Orcid: 0009-0007-9256-4805

According to our database1, Yikang Ding authored at least 34 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models.
CoRR, March, 2026

Kling-MotionControl Technical Report.
CoRR, March, 2026

2025
KlingAvatar 2.0 Technical Report.
CoRR, December, 2025

Interpretable Cross-Modal Alignment Network for EEG Visual Decoding With Algorithm Unrolling.
IEEE Trans. Neural Networks Learn. Syst., November, 2025

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis.
CoRR, September, 2025

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching.
CoRR, July, 2025

ORV: 4D Occupancy-centric Robot Video Generation.
CoRR, June, 2025

DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation.
CoRR, March, 2025

MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction.
CoRR, March, 2025

Joint multi-layer network and coupling redundancy minimization for semi-supervised EEG-based emotion recognition.
Knowl. Based Syst., 2025

CLAIM: Camera-LiDAR Alignment with Intensity and Monodepth.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2025

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

UniScene: Unified Occupancy-centric Driving Scene Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
The Devil is in the Edges: Monocular Depth Estimation with Edge-aware Consistency Fusion.
CoRR, 2024

OccTransformer: Improving BEVFormer for 3D camera-only occupancy prediction.
CoRR, 2024

M<sup>2</sup>Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Enhancing Grid-Based 3D Object Detection in Autonomous Driving With Improved Dimensionality Reduction.
IEEE Access, 2023

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models.
Proceedings of the SIGGRAPH Asia 2023 Technical Communications, 2023

Sem-Avatar: Semantic Controlled Neural Field for High-Fidelity Audio Driven Avatar.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Towards Practical Consistent Video Depth Estimation.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

Edge-aware Neural Implicit Surface Reconstruction.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Volumetric 3D Reconstruction with Window-Wise Global Feature Aggregation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Rethinking Feature Context in Learning Image-Guided Depth Completion.
Proceedings of the Artificial Neural Networks and Machine Learning, 2023

Adaptive Assignment for Geometry Aware Local Feature Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Rethinking Dimensionality Reduction in Grid-based 3D Object Detection.
CoRR, 2022

KD-MVS: Knowledge Distillation Based Self-supervised Learning for MVS.
CoRR, 2022

Adaptive Assignment for Geometry Aware Local Feature Matching.
CoRR, 2022

WT-MVSNet: Window-based Transformers for Multi-view Stereo.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Enhancing Multi-View Stereo with Contrastive Matching and Weighted Focal Loss.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives.
Proceedings of the Computer Vision - ECCV 2022, 2022

KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo.
Proceedings of the Computer Vision - ECCV 2022, 2022

TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Adaptive Range Guided Multi-view Depth Estimation with Normal Ranking Loss.
Proceedings of the Computer Vision - ACCV 2022, 2022


  Loading...