Yongtao Wang

Orcid: 0000-0003-1379-2206

According to our database1, Yongtao Wang authored at least 126 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
FlowNAS: Neural Architecture Search for Optical Flow Estimation.
Int. J. Comput. Vis., April, 2024

A Tightness Detection Method for Railway Fasteners Based on RGB-P Bimodal Data.
IEEE Trans. Instrum. Meas., 2024

RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection.
CoRR, 2024

GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting.
CoRR, 2024

BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
HIGF-Net: Hierarchical information-guided fusion network for polyp segmentation based on transformer and convolution feature learning.
Comput. Biol. Medicine, July, 2023

A cascaded refined rgb-d salient object detection network based on the attention mechanism.
Appl. Intell., June, 2023

TEDT: Transformer-Based Encoding-Decoding Translation Network for Multimodal Sentiment Analysis.
Cogn. Comput., January, 2023

An Intelligent MT Data Inversion Method With Seismic Attribute Enhancement.
IEEE Trans. Geosci. Remote. Sens., 2023

DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes.
CoRR, 2023

DUAW: Data-free Universal Adversarial Watermark against Stable Diffusion Customization.
CoRR, 2023

A Simple and Generic Framework for Feature Distillation via Channel-wise Transformation.
CoRR, 2023

Foreground Guidance and Multi-Layer Feature Fusion for Unsupervised Object Discovery with Transformers.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

A Simple and Generic Framework for Feature Distillation via Channel-wise Transformation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SAMPLING: Scene-adaptive Hierarchical Multiplane Images Representation for Novel View Synthesis from a Single Image.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Fair and Comprehensive Comparisons for Image-Based 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Differentiable Architecture Search with Random Features.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DynamicDet: A Unified Dynamic Architecture for Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

T-SEA: Transfer-Based Self-Ensemble Attack on Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Screen Content Video Quality Assessment Model Using Hybrid Spatiotemporal Features.
IEEE Trans. Image Process., 2022

CBNet: A Composite Backbone Network Architecture for Object Detection.
IEEE Trans. Image Process., 2022

Application of U-Net for the Recognition of Regional Features in Geophysical Inversion Results.
IEEE Trans. Geosci. Remote. Sens., 2022

MEEMD Decomposition-Prediction-Reconstruction Model of Precipitation Time Series.
Sensors, 2022

Application of PSO-BPNN-PID Controller in Nutrient Solution EC Precise Control System: Applied Research.
Sensors, 2022

Cross-modality reinforcement for unaligned sequences sentiment analysis.
J. Intell. Fuzzy Syst., 2022

A tiny charge-scaling in the OPLS-AA + L-OPLS force field delivers the realistic dynamics and structure of liquid primary alcohols.
J. Comput. Chem., 2022

BEV-MAE: Bird's Eye View Masked Autoencoders for Outdoor Point Cloud Pre-training.
CoRR, 2022

FlowNAS: Neural Architecture Search for Optical Flow Estimation.
CoRR, 2022

Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection.
CoRR, 2022

Training Protocol Matters: Towards Accurate Scene Text Recognition via Training Protocol Searching.
CoRR, 2022

SBDF-Net: A versatile dual-branch fusion network for medical image segmentation.
Biomed. Signal Process. Control., 2022

BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

IterVM: Iterative Vision Modeling Module for Scene Text Recognition.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Continual Contrastive Learning for Image Classification.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Cascading Scene and Viewpoint Feature Learning for Pedestrian Gender Recognition.
IEEE Internet Things J., 2021

Continual Contrastive Self-supervised Learning for Image Classification.
CoRR, 2021

CBNetV2: A Composite Backbone Network Architecture for Object Detection.
CoRR, 2021

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes.
CoRR, 2021

Rpattack: Refined Patch Attack on General Object Detectors.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Geometric Object 3D Reconstruction from Single Line Drawing Image Based on a Network for Classification and Sketch Extraction.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Learning a Single Model With a Wide Range of Quality Factors for JPEG Image Artifacts Removal.
IEEE Trans. Image Process., 2020

Learning Local and Global Priors for JPEG Image Artifacts Removal.
IEEE Signal Process. Lett., 2020

A quadrilateral scene text detector with two-stage network architecture.
Pattern Recognit., 2020

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Pixel Labeling.
CoRR, 2020

DADA: Differentiable Automatic Data Augmentation.
CoRR, 2020

MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recogntion.
CoRR, 2020

Towards Accurate Panel Detection in Manga: A Combined Effort of CNN and Heuristics.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

PD-DARTS: Progressive Discretization Differentiable Architecture Search.
Proceedings of the Pattern Recognition and Artificial Intelligence, 2020

GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

MixTConv: Mixed Temporal Convolutional Kernels for Efficient Action Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Dual Loss for Manga Character Recognition with Imbalanced Training Data.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Differentiable Automatic Data Augmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

CBNet: A Novel Composite Backbone Network Architecture for Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A Robust Method for Automatic Panoramic UAV Image Mosaic.
Sensors, 2019

Synthesizing data for text recognition with style transfer.
Multim. Tools Appl., 2019

MFPN: A Novel Mixture Feature Pyramid Network of Multiple Architectures for Object Detection.
CoRR, 2019

SGDNet: An End-to-End Saliency-Guided Deep Neural Network for No-Reference Image Quality Assessment.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

High Performance Gesture Recognition via Effective and Efficient Temporal Modeling.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Drone Image Stitching Guided by Robust Elastic Warping and Locality Preserving Matching.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

Scene Text Recognition via Gated Cascade Attention.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Progressive deep feature learning for manga character recognition via unlabeled training data.
Proceedings of the ACM Turing Celebration Conference - China, 2019

Pyramidal region context module for semantic segmentation.
Proceedings of the ACM Turing Celebration Conference - China, 2019

M2Det: A Single-Shot Object Detector Based on Multi-Level Feature Pyramid Network.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Deep Dual Pyramid Network for Barcode Segmentation using Barcode-30k Database.
CoRR, 2018

CFENet: An Accurate and Efficient Single-Shot Object Detector for Autonomous Driving.
CoRR, 2018

Bottom-up/Top-down Geometric Object Reconstruction with CNN Classification for Mobile Education.
Proceedings of the 26th Pacific Conference on Computer Graphics and Applications, 2018

An End-to-End Quadrilateral Regression Network for Comic Panel Extraction.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018


VisDrone-DET2018: The Vision Meets Drone Object Detection in Image Challenge Results.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Comprehensive Feature Enhancement Module for Single-Shot Object Detector.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
A Faster R-CNN Based Method for Comic Characters Face Detection.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Geometric Object 3D Reconstruction from Single Line Drawing Image with Bottom-Up and Top-Down Classification and Sketch Generation.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Mutual Enhancement for Detection of Multiple Logos in Sports Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

SReN: Shape Regression Network for Comic Storyboard Extraction.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Improving retrieval of plane geometry figure with learning to rank.
Pattern Recognit. Lett., 2016

Recovering solid geometric object from single line drawing image.
Multim. Tools Appl., 2016

Comic storyboard extraction via edge segment analysis.
Multim. Tools Appl., 2016

An R-CNN Based Method to Localize Speech Balloons in Comics.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Context-aware Geometric Object Reconstruction for Mobile Education.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Design of the arm-wrestling robot's force acquisition system based on Qt.
Proceedings of the Ninth International Conference on Machine Vision, 2016

2015
A tree conditional random field model for panel detection in comic images.
Pattern Recognit., 2015

一个高效的属性基密钥协商协议 (Efficient Attribute Based Key Agreement Protocol).
计算机科学, 2015

Mitigating Key Escrow in Attribute-Based Encryption.
Int. J. Netw. Secur., 2015

Aesthetic QR Codes Based on Two-Stage Image Blending.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Comic frame extraction via line segments combination.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

A clump splitting based method to localize speech balloons in comics.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Solid Geometric Object Reconstruction from Single Line Drawing Image.
Proceedings of the GRAPP 2015, 2015

A fast and robust ellipse detector based on top-down least-square fitting.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
SW-MAC: A Low-Latency MAC Protocol with Adaptive Sleeping for Wireless Sensor Networks.
Wirel. Pers. Commun., 2014

Are significant inventions more diversified?
Scientometrics, 2014

Epipolar geometry estimation for wide baseline stereo by Clustering Pairing Consensus.
Pattern Recognit. Lett., 2014

Automatic comic page segmentation based on polygon detection.
Multim. Tools Appl., 2014

Lattice Ciphertext Policy Attribute-based Encryption in the Standard Model.
Int. J. Netw. Secur., 2014

Multiple-access wiretap channel with common channel state information at the encoders.
IET Commun., 2014

Efficient two-stage early SKIP mode termination for depth video coding.
Comput. Electr. Eng., 2014

A robust circle detection algorithm based on top-down least-square fitting analysis.
Comput. Electr. Eng., 2014

A Method of Density Analysis for Chinese Characters.
Proceedings of the Natural Language Processing and Chinese Computing, 2014

Comic2CEBX: A system for automatic comic content adaptation.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014

Automatic comic page image understanding based on edge segment analysis.
Proceedings of the Document Recognition and Retrieval XXI, 2014

Logical Labeling of Fixed Layout PDF Documents Using Multiple Contexts.
Proceedings of the 11th IAPR International Workshop on Document Analysis Systems, 2014

2013
Infrared Patch-Image Model for Small Target Detection in a Single Image.
IEEE Trans. Image Process., 2013

Newspaper article reconstruction using ant colony optimization and bipartite graph.
Appl. Soft Comput., 2013

Degraded multiple-access channel with confidential messages.
Proceedings of the Sixth International Workshop on Signal Design and Its Applications in Communications, 2013

Unsupervised Speech Text Localization in Comic Images.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Comic image understanding based on polygon detection.
Proceedings of the Document Recognition and Retrieval XX, 2013

2012
Identity-Based Key-Insulated Signcryption.
Informatica, 2012

Automatic extraction of foreground objects from Mars images.
Geo spatial Inf. Sci., 2012

Accountable authority key policy attribute-based encryption.
Sci. China Inf. Sci., 2012

A graph-based method of newspaper article reconstruction.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

2011
Large Disparity Motion Layer Extraction via Topological Clustering.
IEEE Trans. Image Process., 2011

Structure extraction from PDF-based book documents.
Proceedings of the 2011 Joint International Conference on Digital Libraries, 2011

Wavelet-domain image shrinkage using variance field diffusion.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

Spatially-adaptive regularized super-resolution image reconstruction using a gradient-based saliency measure.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

2010
Energy-efficient Hardware Architecture and VLSI Implementation of a Polyphase Channelizer with Applications to Subband Adaptive Filtering.
J. Signal Process. Syst., 2010

A Handheld Testing System for Irrigation System Management.
Proceedings of the Information and Automation - International Symposium, 2010

A feature selection method for document clustering based on part-of-speech and word co-occurrence.
Proceedings of the Seventh International Conference on Fuzzy Systems and Knowledge Discovery, 2010

2007
Design of Sigma-Delta Modulators With Arbitrary Transfer Functions.
IEEE Trans. Signal Process., 2007

2005
CSDC: a new complexity reduction technique for multiplierless implementation of digital FIR filters.
IEEE Trans. Circuits Syst. I Regul. Pap., 2005

A new reduced-complexity sphere decoder with true lattice-boundary-awareness for multi-antenna systems.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

A novel low-complexity method for parallel multiplierless implementation of digital FIR filters.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

2004
Computation sharing programmable FIR filter for low-power and high-performance applications.
IEEE J. Solid State Circuits, 2004

Hardware architecture and VLSI implementation of a low-power high-performance polyphase channelizer with applications to subband adaptive filtering.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2002
High performance and low power FIR filter design based on sharing multiplication.
Proceedings of the 2002 International Symposium on Low Power Electronics and Design, 2002


  Loading...