Kaicheng Yang

Orcid: 0009-0008-6073-9014

Affiliations:
  • DeepGlint, Beijing, China


According to our database1, Kaicheng Yang authored at least 15 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Region-based Cluster Discrimination for Visual Representation Learning.
CoRR, July, 2025

ForCenNet: Foreground-Centric Network for Document Image Rectification.
CoRR, July, 2025

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs.
CoRR, April, 2025

RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm.
CoRR, February, 2025

The Solution to the WWW25 Text-based Person Anomaly Search Challenge.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2025, 2025

ORID: Organ-Regional Information Driven Framework for Radiology Report Generation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension.
CoRR, 2024

High-Fidelity Facial Albedo Estimation via Texture Quantization.
CoRR, 2024

1st Place Solution to the 1st SkatingVerse Challenge.
CoRR, 2024

RWKV-CLIP: A Robust Vision-Language Representation Learner.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Multi-label Cluster Discrimination for Visual Representation Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024

LaPA: Latent Prompt Assist Model for Medical Visual Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Unicom: Universal and Compact Representation Learning for Image Retrieval.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ALIP: Adaptive Language-Image Pre-training with Synthetic Caption.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023


  Loading...