Bo Ren

Orcid: 0000-0002-0619-7188

Affiliations:
  • Tencent YouTu Lab, Hefei, China


According to our database, Bo Ren authored at least 48 papers between 2018 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Joint optimization for attention-based generation and recognition of Chinese characters using tree position embedding.
Pattern Recognit., August, 2023

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution.
CoRR, 2023

Visual Information Extraction in the Wild: Practical Dataset and End-to-End Solution.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Turning a CLIP Model into a Scene Text Detector.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OSAN: A One-Stage Alignment Network to Unify Multimodal Alignment and Unsupervised Domain Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DeepContract: Controllable Authorization of Deep Learning Models.
Proceedings of the Annual Computer Security Applications Conference, 2023

FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

TaCo: Textual Attribute Recognition via Contrastive Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

The Devil Is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-training.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Adaptive Hierarchy-Branch Fusion for Online Knowledge Distillation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
VLMAE: Vision-Language Masked Autoencoder.
CoRR, 2022

Open-Vocabulary Multi-Label Classification via Multi-modal Knowledge Transfer.
CoRR, 2022

Contrastive Graph Multimodal Model for Text Classification in Videos.
CoRR, 2022

The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training.
CoRR, 2022

Semantic-Preserving Abstractive Text Summarization with Siamese Generative Adversarial Net.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Extraction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

GMN: Generative Multi-modal Network for Practical Document Information Extraction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Relational Representation Learning in Visually-Rich Documents.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Query-driven Generative Network for Document Information Extraction in the Wild.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Grafting Pre-trained Models for Multimodal Headline Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

See Finer, See More: Implicit Modality Alignment for Text-Based Person Retrieval.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Hyperspherical Learning in Multi-Label Classification.
Proceedings of the Computer Vision - ECCV 2022, 2022

Scene Consistency Representation Learning for Video Scene Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Knowledge Mining with Scene Text for Fine-Grained Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

HybridCR: Weakly-Supervised 3D Point Cloud Semantic Segmentation via Hybrid Contrastive Regularization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Neural Collaborative Graph Machines for Table Structure Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

CoCGAN: Contrastive Learning for Adversarial Category Text Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Perceiving Stroke-Semantic Context: Hierarchical Contrastive Learning for Robust Scene Text Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Sequence-to-Action: Grammatical Error Correction with Action Guided Sequence Generation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Comprehensive Regularization in a Bi-directional Predictive Network for Video Anomaly Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Stroke constrained attention network for online handwritten mathematical expression recognition.
Pattern Recognit., 2021

Head and Body: Unified Detector and Graph Network for Person Search in Media.
CoRR, 2021

Show, Read and Reason: Table Structure Recognition with Flexible Context Aggregator.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

RecycleNet: An Overlapped Text Instance Recovery Approach.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Radical Composition Network for Chinese Character Generation.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

MRD: A Memory Relation Decoder for Online Handwritten Mathematical Expression Recognition.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Hierarchical Multi-label Text Classification with Horizontal and Vertical Category Correlations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Person Attribute Recognition by Sequence Contextual Relation Learning.
IEEE Trans. Circuits Syst. Video Technol., 2020

PuzzleNet: Scene Text Detection by Segment Context Graph Learning.
CoRR, 2020

Accurate Structured-Text Spotting for Arithmetical Exercise Correction.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2018
Sequence-based Person Attribute Recognition with Joint CTC-Attention Model.
CoRR, 2018

