Cong Ma

Orcid: 0000-0002-9787-6273

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS) / formerly, National Laboratory of Pattern Recognition (NLPR), Beijing, China
  • University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China


According to our database1, Cong Ma authored at least 14 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Understand Layout and Translate Text: Unified Feature-Conductive End-to-End Document Image Translation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2025

2024
Modal Contrastive Learning Based End-to-End Text Image Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

[inline-graphic not available: see fulltext] TableRocket: An Efficient and Effective Framework for Table Reconstruction.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Vector Quantization Knowledge Transfer for End-to-End Text Image Machine Translation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Born a BabyNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Multi-Teacher Knowledge Distillation For Text Image Machine Translation.
CoRR, 2023

E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Multi-teacher Knowledge Distillation for End-to-End Text Image Machine Translation.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

CCIM: Cross-modal Cross-lingual Interactive Image Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

2020
CASIA's System for IWSLT 2020 Open Domain Translation.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

2019
Read, Watch, Listen, and Summarize: Multi-Modal Summarization for Asynchronous Text, Image, Audio and Video.
IEEE Trans. Knowl. Data Eng., 2019

2017
Multi-modal Summarization for Asynchronous Collection of Text, Image, Audio and Video.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017


  Loading...