Cong Ma
Orcid: 0000-0002-9787-6273Affiliations:
- Chinese Academy of Sciences, Institute of Automation, State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS) / formerly, National Laboratory of Pattern Recognition (NLPR), Beijing, China
- University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China
According to our database1,
Cong Ma
authored at least 14 papers
between 2017 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Understand Layout and Translate Text: Unified Feature-Conductive End-to-End Document Image Translation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
[inline-graphic not available: see fulltext] TableRocket: An Efficient and Effective Framework for Table Reconstruction.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Vector Quantization Knowledge Transfer for End-to-End Text Image Machine Translation.
Proceedings of the IEEE International Conference on Acoustics, 2024
Born a BabyNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
2022
Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task.
Proceedings of the 26th International Conference on Pattern Recognition, 2022
2020
Proceedings of the 17th International Conference on Spoken Language Translation, 2020
2019
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video.
IEEE Trans. Knowl. Data Eng., 2019
2017
Multi-modal Summarization for Asynchronous Collection of Text, Image, Audio and Video.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017