Xu-Cheng Yin

Orcid: 0000-0003-0023-0220

According to our database, Xu-Cheng Yin authored at least 153 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Open-Set Text Recognition - Concepts, Framework, and Algorithms
Springer Briefs in Computer Science, Springer, ISBN: 978-981-97-0360-9, 2024

Arbitrary Shape Text Detection via Boundary Transformer.
IEEE Trans. Multim., 2024

Sample Weighting with Hierarchical Equalization Loss for Dense Object Detection.
IEEE Trans. Multim., 2024

Inverse-Like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling.
IEEE Trans. Image Process., 2024

Irregular License Plate Recognition via Global Information Integration.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

Improving Small License Plate Detection with Bidirectional Vehicle-Plate Relation.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

Recurrent Partial Kernel Network for Efficient Optical Flow Estimation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
HFENet: Hybrid Feature Enhancement Network for Detecting Texts in Scenes and Traffic Panels.
IEEE Trans. Intell. Transp. Syst., December, 2023

Kernel Proposal Network for Arbitrary Shape Text Detection.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Hypersphere guided embedding for masked face recognition.
Pattern Recognit. Lett., October, 2023

Self-supervised contrastive speaker verification with nearest neighbor positive instances.
Pattern Recognit. Lett., September, 2023

Beyond OCR + VQA: Towards end-to-end reading and reasoning for robust and accurate textvqa.
Pattern Recognit., June, 2023

Arbitrary Shape Text Detection via Segmentation With Probability Maps.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering.
IEEE Trans. Multim., 2023

Towards open-set text recognition via label-to-prototype learning.
Pattern Recognit., 2023

InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition.
CoRR, 2023

Rethinking Speech Recognition with A Multimodal Perspective via Acoustic and Semantic Cooperative Decoding.
CoRR, 2023

Unsupervised Multi-view Pedestrian Detection.
CoRR, 2023

Graph fusion network for multi-oriented object detection.
Appl. Intell., 2023

Feature Implicit Enhancement via Super-Resolution for Small Object Detection.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Feature Enhancement and Reconstruction for Small Object Detection.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

LiteHandNet: A Lightweight Hand Pose Estimation Network via Structural Feature Enhancement.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

Towards Discriminative Semantic Relationship for Fine-grained Crowd Counting.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Complex Glyph Enhancement for License Plate Generation.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Open-Set Text Recognition via Shape-Awareness Visual Reconstruction.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

End-to-End Multi-line License Plate Recognition with Cascaded Perception.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Learning Correction Filter via Degradation-Adaptive Regression for Blind Single Image Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-Convolution for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Depth-Guided Progressive Network for Object Detection.
IEEE Trans. Intell. Transp. Syst., 2022

Occluded Pedestrian Detection via Distribution-Based Mutual-Supervised Feature Learning.
IEEE Trans. Intell. Transp. Syst., 2022

Weakly Correlated Knowledge Integration for Few-shot Image Classification.
Int. J. Autom. Comput., 2022

Boosting Multi-Modal E-commerce Attribute Value Extraction via Unified Learning Scheme and Dynamic Range Minimization.
CoRR, 2022

Towards Open-Set Text Recognition via Label-to-Prototype Learning.
CoRR, 2022

SCDNet: Real-time Semantic Segmentation Network with Split Connection and Flexible Dilated Convolution.
Proceedings of the IEEE Smartworld, 2022

Vertex Adjustment Loss for Multidirectional License Plate Detection and Recognition.
Proceedings of the IEEE Smartworld, 2022

Anchor-Free Location Refinement Network for Small License Plate Detection.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

From Token to Word: OCR Token Evolution via Contrastive Learning and Semantic Matching for Text-VQA.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CAliC: Accurate and Efficient Image-Text Retrieval via Contrastive Alignment and Visual Contexts Modeling.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

DANet: Dynamic Attention to Spoof Patterns for Face Anti-Spoofing.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Semi-Supervised Fine-Grained Classification with Web Data via Noisy Sample Selection.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Adaptive Rounding Compensation for Post-training Quantization.
Proceedings of the Neural Information Processing - 29th International Conference, 2022

Non-Autoregressive Transformer with Unified Bidirectional Decoder for Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Open-Set Text Recognition via Character-Context Decoupling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scene Text Recognition with Single-Point Decoding Network.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Learning Aligned Cross-Modal Representation for Generalized Zero-Shot Classification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Detecting Text in Scene and Traffic Guide Panels With Attention Anchor Mechanism.
IEEE Trans. Intell. Transp. Syst., 2021

Self-Adaptive Aspect Ratio Anchor for Oriented Object Detection in Remote Sensing Images.
Remote. Sens., 2021

GCCNet: Grouped channel composition network for scene text detection.
Neurocomputing, 2021

Multi-orientation scene text detection with scale-guided regression.
Neurocomputing, 2021

End-to-end trainable network for degraded license plate detection via vehicle-plate relation mining.
Neurocomputing, 2021

Hybrid approach for big data localization and semantic annotation.
Concurr. Comput. Pract. Exp., 2021

An Efficient Temporal Model for Small-Footprint Keyword Spotting.
Proceedings of the 7th IEEE International Conference on Network Intelligence and Digital Content, 2021

Real-World Image Super-Resolution Via Spatio-Temporal Correlation Network.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Online Scene Text Tracking with Spatial-Temporal Relation.
Proceedings of the Image and Graphics - 11th International Conference, 2021

Robust Chinese License Plate Generation via Foreground Text and Background Separation.
Proceedings of the Image and Graphics - 11th International Conference, 2021

Dynamic Receptive Field Adaptation for Attention-Based Text Recognition.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Fast Recognition for Multidirectional and Multi-type License Plates with 2D Spatial Attention.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Adaptive Pattern-Parameter Matching for Robust Pedestrian Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Simultaneous End-to-End Vehicle and License Plate Detection With Multi-Branch Attention Neural Network.
IEEE Trans. Intell. Transp. Syst., 2020

HAM: Hidden Anchor Mechanism for Scene Text Detection.
IEEE Trans. Image Process., 2020

Chinese Short Text Classification with Mutual-Attention Convolutional Neural Networks.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

Ranking via partial ordering for answer selection.
Inf. Sci., 2020

SUMAC 2020: The 2nd Workshop on Structuring and Understanding of Multimedia heritAge Contents.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Global Context-Based Network with Transformer for Image2latex.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Semantic Bilinear Pooling for Fine-Grained Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Mutual-Supervised Feature Modulation Network for Occluded Pedestrian Detection.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

A Hybrid Self-Attention Model for Pedestrians Detection.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Detecting Multi-Resolution Pedestrians Using Group Cost-Sensitive Boosting with Channel Features.
Sensors, 2019

Diversity-Based Random Forests with Sample Weight Learning.
Cogn. Comput., 2019

Health assistant: answering your questions anytime from biomedical literature.
Bioinform., 2019

Detecting Advertising Materials via Multi-Scale Instance Segmentation Network.
Aust. J. Intell. Inf. Process. Syst., 2019

Deep Neural Network-Based Severity Prediction of Bug Reports.
IEEE Access, 2019

Joint Rotation-Invariance Face Detection and Alignment with Angle-Sensitivity Cascaded Networks.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Pyramid Memory Block and Timestep Attention for Speech Emotion Recognition.
Proceedings of the Interspeech 2019, 2019

Combined Correlation Filters with Siamese Region Proposal Network for Visual Tracking.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Feature-Matching Speech Denoising GANs via Progressive Training.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

Image Generation Framework for Unbalanced License Plate Data Set.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

Detecting Text in News Images with Similarity Embedded Proposals.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

2018
A Unified Framework for Tracking Based Text Detection and Recognition from Web Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

LSTM²: Multi-Label Ranking for Document Classification.
Neural Process. Lett., 2018

Effective human detection via multi-model classification and adaptive late fusion.
Int. J. Wavelets Multiresolution Inf. Process., 2018

SSDM2: a Two-Stage Semantic Sequential Dependence Model Framework for Biomedical Question Answering.
Cogn. Comput., 2018

Selection of optimal denoising-based regularization hyper-parameters for performance improvement in a sensor validation model.
Artif. Intell. Rev., 2018

Scene Video Text Tracking With Graph Matching.
IEEE Access, 2018

Semantic Sequential Query Expansion for Biomedical Article Search.
IEEE Access, 2018

TED-KISS: A Known-Item Speech Video Search Benchmark.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017
Tracking Based Multi-Orientation Scene Text Detection: A Unified Framework With Dynamic Programming.
IEEE Trans. Image Process., 2017

Fast alignment for sparse representation based face recognition.
Pattern Recognit., 2017

AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition.
CoRR, 2017

Building Your Own Reading List Anytime via Embedding Relevance, Quality, Timeliness and Diversity.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Satellite super-resolution images depending on deep learning methods: A comparative study.
Proceedings of the 2017 IEEE International Conference on Signal Processing, 2017

ICDAR2017 Robust Reading Challenge on Text Extraction from Biomedical Literature Figures (DeTEXT).
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

A calculation method for social network user credibility.
Proceedings of the IEEE International Conference on Communications, 2017

A Multi-strategy Query Processing Approach for Biomedical Question Answering: USTB_PRIR at BioASQ 2017 Task 5B.
Proceedings of the BioNLP 2017, Vancouver, Canada, August 4, 2017, 2017

Applying BinaryWeights in Neural Networks for Sequential Text Recognition.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

2016
Text Detection, Tracking and Recognition in Video: A Comprehensive Survey.
IEEE Trans. Image Process., 2016

A generic pseudo relevance feedback framework with heterogeneous social information.
Inf. Sci., 2016

A Short Survey of Recent Advances in Graph Matching.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Scene Text Detection in Video by Learning Locally and Globally.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Multi-orientation scene text detection with multi-information fusion.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

USTB at Social Book Search 2016 Suggestion Task: Active Books Set and Re-ranking.
Proceedings of the Working Notes of CLEF 2016, 2016

Multi-label Ranking with LSTM² for Document Classification.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

Robust Segmentation for Video Captions with Complex Backgrounds.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

Bloody Image Classification with Global and Local Features.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

2015
Multi-Orientation Scene Text Detection with Adaptive Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Learning Imbalanced Classifiers Locally and Globally with One-Side Probability Machine.
Neural Process. Lett., 2015

DE²: Dynamic ensemble of ensembles for learning nonstationary data.
Neurocomputing, 2015

A VidEo-Based Intelligent Recognition and Decision System for the Phacoemulsification Cataract Surgery.
Comput. Math. Methods Medicine, 2015

Learning Document Semantic Representation with Hybrid Deep Belief Network.
Comput. Intell. Neurosci., 2015

Multi-strategy tracking based text detection in scene videos.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

USTB at Social Book Search 2015 Suggestion Task: Metadata Expansion and Reranking.
Proceedings of the Working Notes of CLEF 2015, 2015

A generic retrieval system for biomedical literatures: USTB at BioASQ2015 Question Answering Task.
Proceedings of the Working Notes of CLEF 2015, 2015

2014
Robust Text Detection in Natural Scene Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Convex ensemble learning with sparsity and diversity.
Inf. Fusion, 2014

A novel classifier ensemble method with sparsity and diversity.
Neurocomputing, 2014

A central tendency-based privacy preserving model for sensitive XML association rules using Bayesian networks.
Intell. Data Anal., 2014

Learning to Diversify via Weighted Kernels for Classifier Ensemble.
CoRR, 2014

Bayesian network scores based text localization in scene images.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Shallow Classification or Deep Learning: An Experimental Study.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Diversity-Based Ensemble with Sample Weight Learning.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Text Categorization with Diversity Random Forests.
Proceedings of the Neural Information Processing - 21st International Conference, 2014

Social Book Search with Pseudo-Relevance Feedback.
Proceedings of the Neural Information Processing - 21st International Conference, 2014

USTB at INEX2014: Social Book Search Track.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

Social Book Search Reranking with Generalized Content-Based Filtering.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

2013
Robust Text Detection in Natural Scene Images.
CoRR, 2013

Accurate and robust text detection: a step-in for text retrieval in natural scene images.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Weakly Supervised Compressive Tracking with Effective Prediction Model.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Transfer learning based compressive tracking.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Classifier comparison for MSER-based text classification in scene images.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Learning Bayesian Network leveled-structure from support based XML frequent itemsets.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Dynamic Ensemble of Ensembles in Nonstationary Environments.
Proceedings of the Neural Information Processing - 20th International Conference, 2013

Sorting-Based Dynamic Classifier Ensemble Selection.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

2012
Automatic segmentation of cervical vertebrae in X-ray images.
Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012

Effective text localization in natural scene images with MSER, geometry-based grouping and AdaBoost.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Classifier Ensemble Using a Heuristic Learning with Sparsity and Diversity.
Proceedings of the Neural Information Processing - 19th International Conference, 2012

Pedestrian Analysis and Counting System with Videos.
Proceedings of the Neural Information Processing - 19th International Conference, 2012

2011
FMI image based rock structure classification using classifier combination.
Neural Comput. Appl., 2011

Exchange rate prediction with non-numerical information.
Neural Comput. Appl., 2011

An improved topic relevance algorithm for focused crawling.
Proceedings of the IEEE International Conference on Systems, 2011

Handwritten Chinese character identification with Bagged One-Class support vector machines.
Proceedings of the 2011 International Joint Conference on Neural Networks, 2011

Learning Based Visibility Measuring with Images.
Proceedings of the Neural Information Processing - 18th International Conference, 2011

Robust Vanishing Point Detection for MobileCam-Based Documents.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

2010
Ellipse Detection with an Improved Randomized Hough Transform for Intellectual Phacoemulsification Surgery Systems.
Proceedings of the International Conference on Image Processing, 2010

2009
A Rock Structure Recognition System Using FMI Images.
Proceedings of the Neural Information Processing, 16th International Conference, 2009

Exchange Rate Forecasting Using Classifier Ensemble.
Proceedings of the Neural Information Processing, 16th International Conference, 2009

Rejection Strategies with Multiple Classifiers for Handwritten Character Recognition.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

2007
A Multi-Stage Strategy to Perspective Rectification for Mobile Phone Camera-Based Document Images.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

2005
Feature combination using boosting.
Pattern Recognit. Lett., 2005

Financial Document Image Coding with Regions of Interest Using JPEG2000.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005

