Yu Zhou

Orcid: 0000-0003-4188-9953

Affiliations:
  • Chinese Academy of Sciences, Institute of Information Engineering, Beijing, China
  • University of Chinese Academy of Sciences, School of Cyber Security, Beijing, China
  • Shanghai Jiaotong University, China (former)
  • Harbin Institute of Technology, Heilongjiang, China (former)


According to our database1, Yu Zhou authored at least 81 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Beyond Instance Discrimination: Relation-Aware Contrastive Self-Supervised Learning.
IEEE Trans. Multim., 2024

Masked and Permuted Implicit Context Learning for Scene Text Recognition.
IEEE Signal Process. Lett., 2024

TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model.
CoRR, 2024

Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing.
CoRR, 2024

2023
Self-Supervised Motion Perception for Spatiotemporal Representation Learning.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

Beyond OCR + VQA: Towards end-to-end reading and reasoning for robust and accurate textvqa.
Pattern Recognit., June, 2023

IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition.
CoRR, 2023

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning.
CoRR, 2023

Masked and Permuted Implicit Context Learning for Scene Text Recognition.
CoRR, 2023

UATVR: Uncertainty-Adaptive Text-Video Retrieval.
CoRR, 2023

Feature Enhancement with Text-Specific Region Contrast for Scene Text Detection.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Filling in the Blank: Rationale-Augmented Prompt Tuning for TextVQA.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Pseudo Object Replay and Mining for Incremental Object Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene Text Detector.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Divide Rows and Conquer Cells: Towards Structure Recognition for Large Tables.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Mask-Guided Stamp Erasure for Real Document Image.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

UATVR: Uncertainty-Adaptive Text-Video Retrieval.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

EI<sup>2</sup>SR: Learning an Enhanced Intra-Instance Semantic Relationship for Arbitrary-Shaped Scene Text Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

One-Shot Replay: Boosting Incremental Object Detection via Retrospecting One Object.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
RD-IOD: Two-Level Residual-Distillation-Based Triple-Network for Incremental Object Detection.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Exploring Relations in Untrimmed Videos for Self-Supervised Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Deep collaborative multi-task network: A human decision process inspired model for hierarchical image classification.
Pattern Recognit., 2022

Multi-View correlation distillation for incremental object detection.
Pattern Recognit., 2022

Beyond Instance Discrimination: Relation-aware Contrastive Self-supervised Learning.
CoRR, 2022

TextBlock: Towards Scene Text Spotting without Fine-grained Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MaMiCo: Macro-to-Micro Semantic Correspondence for Self-supervised Video Representation Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Video Motion Perception for Self-supervised Representation Learning.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

Imagine by Reasoning: A Reasoning-Based Implicit Semantic Data Augmentation for Long-Tailed Classification.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Video 3D Sampling for Self-supervised Representation Learning.
CoRR, 2021

Multi-View Correlation Distillation for Incremental Object Detection.
CoRR, 2021

Exploring Instance Relations for Unsupervised Feature Embedding.
CoRR, 2021

Binary Neural Network Hashing for Image Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

A Cost-Efficient Framework for Scene Text Detection in the Wild.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

Beyond OCR + VQA: Involving OCR into the Flow for Robust and Accurate TextVQA.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Dense Semantic Contrast for Self-Supervised Visual Representation Learning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Density-Net: A Density-Aware Network for 3D Object Detection.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

FC<sup>2</sup>RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

MMF: Multi-task Multi-structure Fusion for Hierarchical Image Classification.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

Which and Where to Focus: A Simple yet Accurate Framework for Arbitrary-Shaped Nearby Text Detection in Scene Images.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

2020
Two-Level Residual Distillation based Triple Network for Incremental Object Detection.
CoRR, 2020

Expert Training: Task Hardness Aware Meta-Learning for Few-Shot Classification.
CoRR, 2020

FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection.
CoRR, 2020

Video Playback Rate Perception for Self-supervisedSpatio-Temporal Representation Learning.
CoRR, 2020

Asymmetric Deep Hashing for Efficient Hash Code Compression.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Deep Unsupervised Hybrid-similarity Hadamard Hashing.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Gaussian Constrained Attention Network for Scene Text Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Self-Training for Domain Adaptive Scene Text Detection.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Progressive Cluster Purification for Unsupervised Feature Learning.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Constrained Relation Network for Character Detection in Scene Images.
Proceedings of the PRICAI 2019: Trends in Artificial Intelligence, 2019

Curved Text Detection in Natural Scene Images with Semi- and Weakly-Supervised Learning.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Feature Hourglass Network for Skeleton Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2016
A Semantics-Aware Approach to the Automated Network Protocol Identification.
IEEE/ACM Trans. Netw., 2016

Matching User Photos to Online Products with Robust Deep Features.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

2015
Unsupervised adaptive sign language recognition based on hypothesis comparison guided cross validation and linguistic prior filtering.
Neurocomputing, 2015

Summarizing surveillance videos with local-patch-learning-based abnormality detection, blob sequence optimization, and type-based synopsis.
Neurocomputing, 2015

Semantics constrained dictionary learning for signer-independent sign language recognition.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Fast sign language recognition benefited from low rank approximation.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Weakly Supervised Metric Learning towards Signer Adaptation for Sign Language Recognition.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Visual Similarity Based Anti-phishing with the Combination of Local and Global Features.
Proceedings of the 13th IEEE International Conference on Trust, 2014

A Segmentation Pattern Based Approach to Automated Protocol Identification.
Proceedings of the 15th International Conference on Parallel and Distributed Computing, 2014

Text Detection in Natural Scene Images with Stroke Width Clustering and Superpixel.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

Representing And Recognizing Motion Trajectories: A Tube And Droplet Approach.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Text localization in natural scene images with stroke width histogram and superpixel.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Improved human head and shoulder detection with local main gradient and tracklets-based feature.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Perspective Scene Text Recognition with Feature Compression and Ranking.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

Curve Matching from the View of Manifold for Sign Language Recognition.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2011
A new global-based video enhancement algorithm by fusing features of multiple region-of-interests.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Priority pyramid based bit allocation for multiview video coding.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Hypothesis comparison guided cross validation for unsupervised signer adaptation.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

2010
Adaptive Sign Language Recognition With Exemplar Extraction and MAP/IVFS.
IEEE Signal Process. Lett., 2010

2008
Mahalanobis distance based Polynomial Segment Model for Chinese Sign Language Recogniton.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007
Signer Adaptation Based on Etyma for Large Vocabulary Chinese Sign Language Recognition.
Proceedings of the Advances in Multimedia Information Processing, 2007


  Loading...