Rong Xiao
Orcid: 0000-0003-2207-5698Affiliations:
- Intellifusion Inc., Shenzhen, Guangdong, China
- Ping An Property & Casualty Insurance Company
- Microsoft Research Asia, Beijing, China (former)
According to our database1,
Rong Xiao
authored at least 47 papers
between 2006 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on dl.acm.org
On csauthors.net:
Bibliography
2025
HCAttention: Extreme KV Cache Compression via Heterogeneous Attention Computing for LLMs.
CoRR, July, 2025
CoRR, July, 2025
Not All Attention Heads Are What You Need: Refining CLIP's Image Representation with Attention Ablation.
CoRR, July, 2025
Language Embedding Meets Dynamic Graph: A New Exploration for Neural Architecture Representation Learning.
CoRR, June, 2025
From Word to Sentence: A Large-Scale Multi-Instance Dataset for Open-Set Aerial Detection.
CoRR, May, 2025
Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists.
CoRR, February, 2025
Neural Normalized Cut: A differential and generalizable approach for spectral clustering.
Pattern Recognit., 2025
Expert Syst. Appl., 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Graph Cut-guided Maximal Coding Rate Reduction for Learning Image Embedding and Clustering.
CoRR, 2024
OVA-DETR: Open Vocabulary Aerial Object Detection Using Image-Text Alignment and Fusion.
CoRR, 2024
Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Graph Cut-Guided Maximal Coding Rate Reduction for Learning Image Embedding and Clustering.
Proceedings of the Computer Vision - ACCV 2024, 2024
2023
NAR-Former V2: Rethinking Transformer for Universal Neural Network Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
2022
EMU: Effective Multi-Hot Encoding Net for Lightweight Scene Text Recognition With a Large Character Set.
IEEE Trans. Circuits Syst. Video Technol., 2022
2021
Pattern Recognit., 2021
CoRR, 2021
PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML.
CoRR, 2021
PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Table Image Recognition to Latex.
CoRR, 2021
2020
CoRR, 2020
PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
2019
A Novel Joint Character Categorization and Localization Approach for Character-Level Scene Text Recognition.
Proceedings of the Second International Workshop on Machine Learning, 2019
Proceedings of the 30th British Machine Vision Conference 2019, 2019
2018
Revisit Multinomial Logistic Regression in Deep Learning: Data Dependent Model Initialization for Image Recognition.
CoRR, 2018
2014
IEEE Trans. Pattern Anal. Mach. Intell., 2014
2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the Computer Vision - ECCV 2012, 2012
2011
Proceedings of the 20th International Conference on World Wide Web, 2011
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011
2010
Proceedings of the 19th International Conference on World Wide Web, 2010
Proceedings of the 18th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, 2010
2009
Proceedings of the 17th International Conference on Multimedia 2009, 2009
2008
Proceedings of the Computer Vision, 2008
Proceedings of the Computer Vision, 2008
2007
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007
EasyAlbum: an interactive photo annotation system based on face clustering and re-ranking.
Proceedings of the 2007 Conference on Human Factors in Computing Systems, 2007
2006
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006