Jinfeng Bai

Orcid: 0000-0001-8940-480X

According to our database, Jinfeng Bai authored at least 47 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
CK12: A Rounded K12 Knowledge Graph Based Benchmark for Chinese Holistic Cognition Evaluation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Leveraging Local Variance for Pseudo-Label Selection in Semi-supervised Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Decoupled Textual Embeddings for Customized Image Generation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Dual Contrastive Prediction for Incomplete Multi-View Representation Learning.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Robust Multi-View Clustering With Incomplete Information.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

GPT Can Solve Mathematical Problems Without a Calculator.
CoRR, 2023

Patch Is Not All You Need.
CoRR, 2023

DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model.
CoRR, 2023

Inferring and Leveraging Parts from Object Shape for Improving Semantic Image Synthesis.
CoRR, 2023

TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

CCLAP: Controllable Chinese Landscape Painting Generation Via Latent Diffusion Model.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Decoupling Visual-Semantic Features Learning with Dual Masked Autoencoder for Self-Supervised Scene Text Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ViSA: Visual and Semantic Alignment for Robust Scene Text Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Semantic Graph Representation Learning for Handwritten Mathematical Expression Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ICDAR 2023 Competition on Recognition of Multi-line Handwritten Mathematical Expressions.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Synthetic Corpus Generation Method for Neural Vocoder Training.
Proceedings of the IEEE International Conference on Acoustics, 2023

DSPGAN: A GAN-Based Universal Vocoder for High-Fidelity TTS by Time-Frequency Domain Supervision from DSP.
Proceedings of the IEEE International Conference on Acoustics, 2023

Unveiling the Implicit Toxicity in Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Black-Box Tuning of Vision-Language Models with Effective Gradient Approximation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Texts as Images in Prompt Tuning for Multi-Label Image Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Inferring and Leveraging Parts from Object Shape for Improving Semantic Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ReCoT: Regularized Co-Training for Facial Action Unit Recognition with Noisy Labels.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Hybrid Syllable and Character Representations for Mandarin ASR.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Unsupervised Neural Rendering for Image Hazing.
IEEE Trans. Image Process., 2022

1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation.
CoRR, 2022

Position-Aware Contrastive Alignment for Referring Image Segmentation.
CoRR, 2022

1st Place Solutions for UG2+ Challenge 2022 ATMOSPHERIC TURBULENCE MITIGATION.
CoRR, 2022

1st Place Solutions for the UVO Challenge 2022.
CoRR, 2022

BERT-LID: Leveraging BERT to Improve Spoken Language Identification.
CoRR, 2022

Towards Diverse and Faithful One-shot Adaption of Generative Adversarial Networks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Improving Speech Separation with Knowledge Distilled from Self-supervised Pre-trained Models.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

BERT-LID: Leveraging BERT to Improve Spoken Language Identification.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Summary On The ISCSLP 2022 Chinese-English Code-Switching ASR Challenge.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

TALCS: An open-source Mandarin-English code-switching corpus and a speech recognition baseline.
Proceedings of the Interspeech 2022, 2022

A Vision Transformer Based Scene Text Recognizer with Multi-grained Encoding and Decoding.
Proceedings of the Frontiers in Handwriting Recognition - 18th International Conference, 2022

Time-Domain Audio-Visual Speech Separation on Low Quality Videos.
Proceedings of the IEEE International Conference on Acoustics, 2022

When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

2014
Anchor Shot Detection with Deep Neural Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

CeleLabel: an interactive system for annotating celebrities in web videos.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Image character recognition using deep convolutional neural network learned from different languages.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Chinese Image Text Recognition on grayscale pixels.
Proceedings of the IEEE International Conference on Acoustics, 2014

Chinese Image Character Recognition Using DNN and Machine Simulated Training Samples.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2014, 2014

2013
Camera based cross devices manipulating with augmented reality.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Binarization of natural scene text based on L1-Norm PCA.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

2012
Multi-modal information fusion for news story segmentation in broadcast video.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

