Chunhua Shen

Affiliations:
  • University of Adelaide, School of Computer Science, Adelaide, Australia


According to our database, Chunhua Shen authored at least 567 papers between 2003 and 2024.

Bibliography

2024
SC-DepthV3: Robust Self-Supervised Monocular Depth Estimation for Dynamic Scenes.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model.
CoRR, 2024

2023
Learning From Partially Labeled Data for Multi-Organ and Tumor Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

SPTS v2: Single-Point Scene Text Spotting.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Super Vision Transformer.
Int. J. Comput. Vis., December, 2023

Single-Path Bit Sharing for Automatic Loss-Aware Model Compression.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

SPL-Net: Spatial-Semantic Patch Learning Network for Facial Attribute Recognition with Limited Labeled Data.
Int. J. Comput. Vis., August, 2023

From Open Set to Closed Set: Supervised Spatial Divide-and-Conquer for Object Counting.
Int. J. Comput. Vis., July, 2023

Structured Knowledge Distillation for Dense Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Towards Accurate Reconstruction of 3D Scene Shape From A Single Monocular Image.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Dynamic Convolution for 3D Point Cloud Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

DenseCL: A simple framework for self-supervised dense visual pre-training.
Vis. Informatics, March, 2023

Instance and Panoptic Segmentation Using Conditional Convolutions.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

A Dynamic Feature Interaction Framework for Multi-task Visual Perception.
Int. J. Comput. Vis., 2023

MobileVLM: A Fast, Strong and Open Vision Language Assistant for Mobile Devices.
CoRR, 2023

GenDeF: Learning Generative Deformation Field for Video Generation.
CoRR, 2023

Paragraph-to-Image Generation with Information-Enriched Diffusion Model.
CoRR, 2023

DA-STC: Domain Adaptive Video Semantic Segmentation via Spatio-Temporal Consistency.
CoRR, 2023

AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort.
CoRR, 2023

Object-aware Inversion and Reassembly for Image Editing.
CoRR, 2023

De novo protein design using geometric vector field networks.
CoRR, 2023

RGM: A Robust Generalist Matching Model.
CoRR, 2023

Self-Supervised 3D Scene Flow Estimation and Motion Prediction using Local Rigidity Prior.
CoRR, 2023

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm.
CoRR, 2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.
CoRR, 2023

StableLLaVA: Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data.
CoRR, 2023

Target before Shooting: Accurate Anomaly Detection and Localization under One Millisecond via Cascade Patch Retrieval.
CoRR, 2023

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models.
CoRR, 2023

FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models.
CoRR, 2023

SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers.
CoRR, 2023

A Dynamic Feature Interaction Framework for Multi-task Visual Perception.
CoRR, 2023

Efficient Anomaly Detection with Budget Annotation Using Semi-Supervised Residual Transformer.
CoRR, 2023

A Geometric Perspective on Diffusion Models.
CoRR, 2023

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation.
CoRR, 2023

Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning.
CoRR, 2023

Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching.
CoRR, 2023

SegGPT: Segmenting Everything In Context.
CoRR, 2023

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models.
CoRR, 2023

Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction.
CoRR, 2023

Background Matters: Enhancing Out-of-distribution Detection with Domain Features.
CoRR, 2023

Traffic Scene Parsing through the TSP6K Dataset.
CoRR, 2023

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Deep Weakly-supervised Anomaly Detection.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

A Survey on Efficient Training of Transformers.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Conditional Positional Encodings for Vision Transformers.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

SegPrompt: Boosting Open-world Segmentation via Category-level Prompt Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Generative Prompt Model for Weakly Supervised Object Localization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CTVIS: Consistent Training for Online Video Instance Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SegGPT: Towards Segmenting Everything In Context.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Images Speak in Images: A Generalist Painter for In-Context Visual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Conditional Attributes for Compositional Zero-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Point-Teaching: Weakly Semi-supervised Object Detection with Point Annotations.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Effective Eyebrow Matting with Domain Adaptation.
Comput. Graph. Forum, October, 2022

Improving Monocular Visual Odometry Using Learned Depth.
IEEE Trans. Robotics, 2022

Editorial Deep Learning for Anomaly Detection.
IEEE Trans. Neural Networks Learn. Syst., 2022

Part-Guided Attention Learning for Vehicle Instance Retrieval.
IEEE Trans. Intell. Transp. Syst., 2022

NSSNet: Scale-Aware Object Counting With Non-Scale Suppression.
IEEE Trans. Intell. Transp. Syst., 2022

Intra- and Inter-Pair Consistency for Semi-Supervised Gland Segmentation.
IEEE Trans. Image Process., 2022

TSGB: Target-Selective Gradient Backprop for Probing CNN Visual Saliency.
IEEE Trans. Image Process., 2022

Guest Editorial Introduction to the Special Section on Video and Language.
IEEE Trans. Circuits Syst. Video Technol., 2022

Learning discriminative region representation for person retrieval.
Pattern Recognit., 2022

Arbitrarily shaped scene text detection with dynamic convolution.
Pattern Recognit., 2022

Effective Training of Convolutional Neural Networks With Low-Bitwidth Weights and Activations.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Plenty is Plague: Fine-Grained Learning for Visual Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Virtual Normal: Enforcing Geometric Constraints for Accurate and Robust Depth Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

SOLO: A Simple Framework for Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Towards End-to-End Text Spotting in Natural Scenes.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

FCOS: A Simple and Strong Anchor-Free Object Detector.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Index Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

ABCNet v2: Adaptive Bezier-Curve Network for Real-Time End-to-End Text Spotting.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Improving Generative Adversarial Networks With Local Coordinate Coding.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Auto-Rectify Network for Unsupervised Indoor Depth Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Structured Binary Neural Networks for Image Recognition.
Int. J. Comput. Vis., 2022

Memory-Efficient Hierarchical Neural Architecture Search for Image Restoration.
Int. J. Comput. Vis., 2022

Dual-Attention-Guided Network for Ghost-Free High Dynamic Range Imaging.
Int. J. Comput. Vis., 2022

Joint Classification and Regression for Visual Tracking with Fully Convolutional Siamese Networks.
Int. J. Comput. Vis., 2022

Deep Learning for Anomaly Detection: A Review.
ACM Comput. Surv., 2022

Hierarchical Normalization for Robust Monocular Depth Estimation.
CoRR, 2022

Multi-dataset Training of Transformers for Robust Action Recognition.
CoRR, 2022

Towards Domain-agnostic Depth Completion.
CoRR, 2022

Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization.
CoRR, 2022

Super Vision Transformer.
CoRR, 2022

PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining.
CoRR, 2022

PointInst3D: Segmenting 3D Instances by Points.
CoRR, 2022

End-to-End Video Text Spotting with Transformer.
CoRR, 2022

PointAttN: You Only Need Attention for Point Cloud Completion.
CoRR, 2022

Training Protocol Matters: Towards Accurate Scene Text Recognition via Training Protocol Searching.
CoRR, 2022

Efficient Video Segmentation Models with Per-frame Inference.
CoRR, 2022

The devil is in the labels: Semantic segmentation from sentences.
CoRR, 2022

Hierarchical Normalization for Robust Monocular Depth Estimation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multi-dataset Training of Transformers for Robust Action Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DENSE: Data-Free One-Shot Federated Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SegViT: Semantic Segmentation with Plain Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SPTS: Single-Point Text Spotting.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Poseur: Direct Human Pose Regression with Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

PointInst3D: Segmenting 3D Instances by Points.
Proceedings of the Computer Vision - ECCV 2022, 2022

DisCo: Remedying Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Efficient Decoder-Free Object Detection with Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FreeSOLO: Learning to Segment Objects without Annotations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Retrieval Augmented Classification for Long-Tail Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Catching Both Gray and Black Swans: Open-set Supervised Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Boosting Robustness of Image Matting with Context Assembling and Strong Data Augmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection.
IEEE Trans. Multim., 2021

Viral Pneumonia Screening on Chest X-Rays Using Confidence-Aware Anomaly Detection.
IEEE Trans. Medical Imaging, 2021

SESV: Accurate Medical Image Segmentation by Predicting and Correcting Errors.
IEEE Trans. Medical Imaging, 2021

Multi-Instance Learning With Emerging Novel Class.
IEEE Trans. Knowl. Data Eng., 2021

A Robust Attentional Framework for License Plate Recognition in the Wild.
IEEE Trans. Intell. Transp. Syst., 2021

Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes.
IEEE Trans. Intell. Transp. Syst., 2021

Learning deep part-aware embedding for person retrieval.
Pattern Recognit., 2021

An adversarial human pose estimation network injected with graph structure.
Pattern Recognit., 2021

Ordered or Orderless: A Revisit for Video Based Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation.
Int. J. Comput. Vis., 2021

NAS-FCOS: Efficient Search for Object Detection Architectures.
Int. J. Comput. Vis., 2021

Separating Content from Style Using Adversarial Learning for Recognizing Text in the Wild.
Int. J. Comput. Vis., 2021

Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection.
Int. J. Comput. Vis., 2021

Unsupervised Scale-Consistent Depth Learning from Video.
Int. J. Comput. Vis., 2021

SPTS: Single-Point Text Spotting.
CoRR, 2021

TSG: Target-Selective Gradient Backprop for Probing CNN Visual Saliency.
CoRR, 2021

Explainable Deep Few-shot Anomaly Detection with Deviation Networks.
CoRR, 2021

PAN++: Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text.
CoRR, 2021

Twins: Revisiting Spatial Attention Design in Vision Transformers.
CoRR, 2021

Kernel Agnostic Real-world Image Super-resolution.
CoRR, 2021

A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation.
CoRR, 2021

TFPose: Direct Human Pose Estimation with Transformers.
CoRR, 2021

Object Detection Made Simpler by Eliminating Heuristic NMS.
CoRR, 2021

Multi-intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline.
CoRR, 2021

ABS: Automatic Bit Sharing for Model Compression.
CoRR, 2021

Towards Light-Weight Portrait Matting via Parameter Sharing.
Comput. Graph. Forum, 2021

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Twins: Revisiting the Design of Spatial Attention in Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Deep Reasoning Network for Few-shot Semantic Segmentation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Fully Quantized Image Super-Resolution Networks.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly Data.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

FastFlowNet: A Lightweight Network for Fast Optical Flow Estimation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

BV-Person: A Large-scale Dataset for Bird-view Person Re-identification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Occluded Person Re-Identification with Single-scale Global Representations.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Channel-wise Knowledge Distillation for Dense Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

FATNN: Fast and Accurate Ternary Neural Networks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Meta Navigator: Search for a Good Adaptation Policy for Few-shot Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Dense Contrastive Learning for Self-Supervised Visual Pre-Training.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

End-to-End Video Instance Segmentation With Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

BoxInst: High-Performance Instance Segmentation With Box Annotations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

FCPose: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generic Perceptual Loss for Modeling Structured Output Dependencies.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Graph Attention Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Affinity-Aware Upsampling for Deep Image Matting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

AQD: Towards Accurate Quantized Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning To Recover 3D Scene Shape From a Single Image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Diverse Knowledge Distillation for End-to-End Person Search.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

SA-BNN: State-Aware Binary Neural Network.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Real-time Image Smoothing via Iterative Least Squares.
ACM Trans. Graph., 2020

Accurate Tensor Completion via Adaptive Low-Rank Representation.
IEEE Trans. Neural Networks Learn. Syst., 2020

Deep Clustering With Sample-Assignment Invariance Prior.
IEEE Trans. Neural Networks Learn. Syst., 2020

Learning Deep Gradient Descent Optimization for Image Deconvolution.
IEEE Trans. Neural Networks Learn. Syst., 2020

Joint Deep Learning of Facial Expression Synthesis and Recognition.
IEEE Trans. Multim., 2020

A Mutual Bootstrapping Model for Automated Skin Lesion Segmentation and Classification.
IEEE Trans. Medical Imaging, 2020

Structural Analysis of Attributes for Vehicle Re-Identification and Retrieval.
IEEE Trans. Intell. Transp. Syst., 2020

Towards Effective Deep Embedding for Zero-Shot Learning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Human Detection Aided by Deeply Learned Semantic Masks.
IEEE Trans. Circuits Syst. Video Technol., 2020

Embedding Bilateral Filter in Least Squares for Efficient Edge-Preserving Image Smoothing.
IEEE Trans. Circuits Syst. Video Technol., 2020

Counting Objects by Blockwise Classification.
IEEE Trans. Circuits Syst. Video Technol., 2020

Monocular Depth Estimation With Augmented Ordinal Depth Relationships.
IEEE Trans. Circuits Syst. Video Technol., 2020

MobileFAN: Transferring deep hidden representation for face alignment.
Pattern Recognit., 2020

RefineNet: Multi-Path Refinement Networks for Dense Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Adaptive Importance Learning for Improving Lightweight Image Super-Resolution Network.
Int. J. Comput. Vis., 2020

Channel-wise Distillation for Semantic Segmentation.
CoRR, 2020

PGL: Prior-Guided Local Self-supervised Learning for 3D Medical Image Segmentation.
CoRR, 2020

Robust Watermarking Using Inverse Gradient Attention.
CoRR, 2020

Unifying Instance and Panoptic Segmentation with Dynamic Rank-1 Convolutions.
CoRR, 2020

Deep Reinforcement Learning for Unknown Anomaly Detection.
CoRR, 2020

FATNN: Fast and Accurate Ternary Neural Networks.
CoRR, 2020

Unsupervised Depth Learning in Challenging Indoor Video: Weak Rectification to Rescue.
CoRR, 2020

Scope Head for Accurate Localization in Object Detection.
CoRR, 2020

COVID-19 Screening on Chest X-ray Images Using Deep Learning based Anomaly Detection.
CoRR, 2020

SOLOv2: Dynamic, Faster and Stronger.
CoRR, 2020

DiverseDepth: Affine-invariant Depth Prediction Using Diverse Data.
CoRR, 2020

Memorizing Comprehensively to Learn Adaptively: Unsupervised Cross-Domain Person Re-ID with Multi-level Memory.
CoRR, 2020

Template-Based Automatic Search of Compact Semantic Segmentation Architectures.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Architecture Search of Dynamic Cells for Semantic Video Segmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

SOLOv2: Dynamic and Fast Instance Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Pairwise Relation Learning for Semi-supervised Gland Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Medical Data Inquiry Using a Question Answering Model.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

Unsupervised Representation Learning by Predicting Random Distances.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Representative Graph Neural Network.
Proceedings of the Computer Vision - ECCV 2020, 2020

Segmenting Transparent Objects in the Wild.
Proceedings of the Computer Vision - ECCV 2020, 2020

Scene Text Image Super-Resolution in the Wild.
Proceedings of the Computer Vision - ECCV 2020, 2020

Soft Expert Reward Learning for Vision-and-Language Navigation.
Proceedings of the Computer Vision - ECCV 2020, 2020

SOLO: Segmenting Objects by Locations.
Proceedings of the Computer Vision - ECCV 2020, 2020

AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting.
Proceedings of the Computer Vision - ECCV 2020, 2020

Conditional Convolutions for Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Efficient Semantic Video Segmentation with Per-Frame Inference.
Proceedings of the Computer Vision - ECCV 2020, 2020

Weighing Counts: Sequential Crowd Counting by Reinforcement Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Instance-Aware Embedding for Point Cloud Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Training Quantized Neural Networks With a Full-Precision Auxiliary Module.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Mask Encoding for Single Shot Instance Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Memory-Efficient Hierarchical Neural Architecture Search for Image Denoising.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover's Distance and Structured Classifiers.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Context Prior for Scene Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PolarMask: Single Shot Instance Segmentation With Polar Representation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

NAS-FCOS: Fast Neural Architecture Search for Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Self-Trained Deep Ordinal Regression for End-to-End Video Anomaly Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

AIML at VQA-Med 2020: Knowledge Inference via a Skeleton-based Sentence Mapping Approach for Medical Domain Visual Question Answering.
Proceedings of the Working Notes of CLEF 2020, 2020

Task-Aware Monocular Depth Estimation for 3D Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Decoupled Spatial Neural Attention for Weakly Supervised Semantic Segmentation.
IEEE Trans. Multim., 2019

Attention Residual Learning for Skin Lesion Classification.
IEEE Trans. Medical Imaging, 2019

Toward End-to-End Car License Plate Detection and Recognition With Deep Neural Networks.
IEEE Trans. Intell. Transp. Syst., 2019

Salient Object Detection With Lossless Feature Reflection and Weighted Structural Loss.
IEEE Trans. Image Process., 2019

Piecewise Classifier Mappings: Learning Fine-Grained Learners for Novel Categories With Few Examples.
IEEE Trans. Image Process., 2019

Hyperspectral Classification Based on Lightweight 3-D-CNN With Transfer Learning.
IEEE Trans. Geosci. Remote. Sens., 2019

Unsupervised Domain Adaptation Using Robust Class-Wise Matching.
IEEE Trans. Circuits Syst. Video Technol., 2019

Semantics-Aware Visual Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2019

Heritage image annotation via collective knowledge.
Pattern Recognit., 2019

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition.
Pattern Recognit., 2019

Unsupervised object discovery and co-localization by deep descriptor transformation.
Pattern Recognit., 2019

Order-aware convolutional pooling for video based action recognition.
Pattern Recognit., 2019

Ordinal Constraint Binary Coding for Approximate Nearest Neighbor Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Accurate imagery recovery using a multi-observation patch model.
Inf. Sci., 2019

Exploring the Capacity of Sequential-free Box Discretization Network for Omnidirectional Scene Text Detection.
CoRR, 2019

To Balance or Not to Balance: An Embarrassingly Simple Approach for Learning with Long-Tailed Distributions.
CoRR, 2019

Unified Multifaceted Feature Learning for Person Re-Identification.
CoRR, 2019

DirectPose: Direct End-to-End Multi-Person Pose Estimation.
CoRR, 2019

Weakly-supervised Deep Anomaly Detection with Pairwise Relation Learning.
CoRR, 2019

Structured Binary Neural Networks for Image Recognition.
CoRR, 2019

IR-NAS: Neural Architecture Search for Image Restoration.
CoRR, 2019

TextSR: Content-Aware Text Super-Resolution Guided by Recognition.
CoRR, 2019

Part-Guided Attention Learning for Vehicle Re-Identification.
CoRR, 2019

Training Compact Neural Networks via Auxiliary Overparameterization.
CoRR, 2019

Index Network.
CoRR, 2019

Regularizing Proxies with Multi-Adversarial Training for Unsupervised Domain-Adaptive Semantic Segmentation.
CoRR, 2019

A Generalized Framework for Edge-preserving and Structure-preserving Image Smoothing.
CoRR, 2019

NAS-FCOS: Fast Neural Architecture Search for Object Detection.
CoRR, 2019

RERERE: Remote Embodied Referring Expressions in Real indoor Environments.
CoRR, 2019

A Simple and Robust Convolutional-Attention Network for Irregular Text Recognition.
CoRR, 2019

Training Quantized Network with Auxiliary Gradient Module.
CoRR, 2019

Semi- and Weakly Supervised Directional Bootstrapping Model for Automated Skin Lesion Segmentation.
CoRR, 2019

Multi-marginal Wasserstein GAN.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Deep Hashing by Discriminating Hard Examples.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Deep Segmentation-Emendation Model for Gland Instance Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Deep Anomaly Detection with Deviation Networks.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Light-Weight Hybrid Convolutional Network for Liver Tumor Segmentation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Real-Time Joint Semantic Segmentation and Depth Estimation Using Asymmetric Annotations.
Proceedings of the International Conference on Robotics and Automation, 2019

Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Exploiting Temporal Consistency for Real-Time Video Depth Estimation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Enforcing Geometric Constraints of Virtual Normal for Depth Prediction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

FCOS: Fully Convolutional One-Stage Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Indices Matter: Learning to Index for Deep Image Matting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Mind Your Neighbours: Image Annotation With Metadata Neighbourhood Graph Co-Attention Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Attention-Guided Network for Ghost-Free High Dynamic Range Imaging.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Associatively Segmenting Instances and Semantics in Point Clouds.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Knowledge Adaptation for Efficient Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Visual Question Answering as Reading Comprehension.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Structured Learning of Tree Potentials in CRF for Image Segmentation.
IEEE Trans. Neural Networks Learn. Syst., 2018

Multilabel Image Classification With Regional Latent Semantic Dependencies.
IEEE Trans. Multim., 2018

Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression.
IEEE Trans. Multim., 2018

An Extended Filtered Channel Framework for Pedestrian Detection.
IEEE Trans. Intell. Transp. Syst., 2018

An Embarrassingly Simple Approach to Visual Domain Adaptation.
IEEE Trans. Image Process., 2018

Crowd Counting via Weighted VLAD on a Dense Attribute Feature Map.
IEEE Trans. Circuits Syst. Video Technol., 2018

Pushing the Limits of Deep CNNs for Pedestrian Detection.
IEEE Trans. Circuits Syst. Video Technol., 2018

Estimating Depth From Monocular Images as Classification Using Deep Fully Convolutional Residual Networks.
IEEE Trans. Circuits Syst. Video Technol., 2018

Multi-label learning based deep transfer neural network for facial attribute classification.
Pattern Recognit., 2018

Mask-CNN: Localizing parts and selecting descriptors for fine-grained bird species categorization.
Pattern Recognit., 2018

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

FVQA: Fact-Based Visual Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Exploring Context with Deep Structured Models for Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Reading car license plates using deep neural networks.
Image Vis. Comput., 2018

Cluster Sparsity Field: An Internal Hyperspectral Imagery Prior for Reconstruction.
Int. J. Comput. Vis., 2018

Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks.
CoRR, 2018

RGB-D Based Action Recognition with Light-weight 3D Convolutional Networks.
CoRR, 2018

Correlation Propagation Networks for Scene Text Detection.
CoRR, 2018

Diagnostics in Semantic Segmentation.
CoRR, 2018

Training Compact Neural Networks with Binary Weights and Low Precision Activations.
CoRR, 2018

Monocular Depth Estimation with Augmented Ordinal Depth Relationships.
CoRR, 2018

HyperFusion-Net: Densely Reflective Fusion for Salient Object Detection.
CoRR, 2018

Learning an Optimizer for Image Deconvolution.
CoRR, 2018

Agile Amulet: Real-Time Salient Object Detection with Contextual Attention.
CoRR, 2018

Edge-Preserving Piecewise Linear Image Smoothing Using Piecewise Constant Filters.
CoRR, 2018

Salient Object Detection by Lossless Feature Reflection.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Adversarial Learning with Local Coordinate Coding.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning Deep Representations Using Convolutional Auto-Encoders with Symmetric Skip Connections.
Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

Goal-Oriented Visual Question Generation via Intermediate Rewards.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning to Predict Crisp Boundaries.
Proceedings of the Computer Vision - ECCV 2018, 2018

Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Towards Effective Low-Bitwidth Convolutional Neural Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Monocular Relative Depth Perception With Web Stereo Data Supervision.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Repulsion Loss: Detecting Pedestrians in a Crowd.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

VITAL: VIsual Tracking via Adversarial Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Bootstrapping the Performance of Webly Supervised Semantic Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

An End-to-End TextSpotter With Explicit Alignment and Attention.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Visual Question Answering With Memory-Augmented Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Light-Weight RefineNet for Real-Time Semantic Segmentation.
Proceedings of the British Machine Vision Conference 2018, 2018

A Hybrid Probabilistic Model for Camera Relocalization.
Proceedings of the British Machine Vision Conference 2018, 2018

Coarse-to-Fine: A RNN-Based Hierarchical Attention Model for Vehicle Re-identification.
Proceedings of the Computer Vision - ACCV 2018, 2018

Deep Attention-Based Classification Network for Robust Depth Prediction.
Proceedings of the Computer Vision - ACCV 2018, 2018

HCVRD: A Benchmark for Large-Scale Human-Centered Visual Relationship Detection.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Kill Two Birds With One Stone: Weakly-Supervised Neural Network for Image Annotation and Tag Refinement.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Deep CNNs With Spatially Weighted Pooling for Fine-Grained Car Recognition.
IEEE Trans. Intell. Transp. Syst., 2017

Discriminative Training of Deep Fully Connected Continuous CRFs With Task-Specific Loss.
IEEE Trans. Image Process., 2017

Exploiting Depth From Single Monocular Images for Object Detection and Semantic Segmentation.
IEEE Trans. Image Process., 2017

Part-Based Robust Tracking Using Online Latent Structured Learning.
IEEE Trans. Circuits Syst. Video Technol., 2017

Temporal Pyramid Pooling-Based Convolutional Neural Network for Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2017

Removal of Optically Thick Clouds From High-Resolution Satellite Imagery Using Dictionary Group Learning and Interdictionary Nonlocal Joint Sparse Coding.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2017

Deep linear discriminant analysis on fisher networks: A hybrid architecture for person re-identification.
Pattern Recognit., 2017

Learning discriminative trajectorylet detector sets for accurate skeleton-based action recognition.
Pattern Recognit., 2017

Large-Scale Binary Quadratic Optimization Using Semidefinite Relaxation and Applications.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Compositional Model Based Fisher Vector Coding for Image Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Cross-Convolutional-Layer Pooling for Image Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Structured Learning of Binary Codes with Column Generation for Optimizing Ranking Measures.
Int. J. Comput. Vis., 2017

Mining Mid-level Visual Patterns with Deep CNN Activations.
Int. J. Comput. Vis., 2017

Visual question answering: A survey of methods and datasets.
Comput. Vis. Image Underst., 2017

Structured learning of metric ensembles with application to person re-identification.
Comput. Vis. Image Underst., 2017

Real-time Semantic Image Segmentation via Spatial Sparsity.
CoRR, 2017

Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards.
CoRR, 2017

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization.
CoRR, 2017

Towards End-to-End Car License Plates Detection and Recognition with Deep Neural Networks.
CoRR, 2017

Beyond Low Rank: A Data-Adaptive Tensor Completion Method.
CoRR, 2017

Care about you: towards large-scale human-centric visual relationship detection.
CoRR, 2017

Towards Context-aware Interaction Recognition.
CoRR, 2017

Unsupervised Object Discovery and Co-Localization by Deep Descriptor Transforming.
CoRR, 2017

Adversarial Generation of Training Examples for Vehicle License Plate Recognition.
CoRR, 2017

Visually Aligned Word Embeddings for Improving Zero-shot Learning.
CoRR, 2017

Visual Question Answering with Memory-Augmented Networks.
CoRR, 2017

TasselNet: Counting maize tassels in the wild via local counts regression network.
CoRR, 2017

Relative Depth Order Estimation Using Multi-scale Densely Connected Convolutional Networks.
CoRR, 2017

Robust Guided Image Filtering.
CoRR, 2017

Deep Descriptor Transforming for Image Co-Localization.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Explicit Knowledge-based Reasoning for Visual Question Answering.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Deep learning features at scale for visual place recognition.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Towards Context-Aware Interaction Recognition for Visual Relationship Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2017

When Unsupervised Domain Adaptation Meets Tensor Representations.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Semi-Global Weighted Least Squares in Image Filtering.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning from Web Data.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Multi-attention Network for One Shot Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Sequential Person Recognition in Photo Albums with a Recurrent Network.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Weakly Supervised Semantic Segmentation Based on Co-segmentation.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Scalable Linear Visual Feature Learning via Online Parallel Nonnegative Matrix Factorization.
IEEE Trans. Neural Networks Learn. Syst., 2016

Fast Detection of Multiple Objects in Traffic Scenes With a Common Detection Framework.
IEEE Trans. Intell. Transp. Syst., 2016

Dictionary Learning for Promoting Structured Sparsity in Hyperspectral Compressive Sensing.
IEEE Trans. Geosci. Remote. Sens., 2016

Face image classification by pooling raw features.
Pattern Recognit., 2016

Face recognition using linear representation ensembles.
Pattern Recognit., 2016

Pedestrian Detection with Spatially Pooled Features and Structured Ensemble Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

A Generalized Probabilistic Framework for Compact Codebook Creation.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Online Metric-Weighted Linear Representations for Robust Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Canonical principal angles correlation analysis for two-view data.
J. Vis. Commun. Image Represent., 2016

Online unsupervised feature learning for visual tracking.
Image Vis. Comput., 2016

Unsupervised Feature Learning for Dense Correspondences Across Scenes.
Int. J. Comput. Vis., 2016

Efficient Semidefinite Branch-and-Cut for MAP-MRF Inference.
Int. J. Comput. Vis., 2016

Multi-Label Image Classification with Regional Latent Semantic Dependencies.
CoRR, 2016

Image Captioning and Visual Question Answering Based on Attributes and Their Related External Knowledge.
CoRR, 2016

Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach.
CoRR, 2016

Bridging Category-level and Instance-level Semantic Image Segmentation.
CoRR, 2016

High-performance Semantic Segmentation Using Very Deep Fully Convolutional Networks.
CoRR, 2016

PersonNet: Person Re-identification with Deep Convolutional Neural Networks.
CoRR, 2016

Hi Detector, What's Wrong with that Object? Identifying Irregular Object From Images by Modelling the Detection Score Distribution.
CoRR, 2016

Crowd Counting via Weighted VLAD on Dense Attribute Feature Maps.
CoRR, 2016

Image Restoration Using Convolutional Auto-encoders with Symmetric Skip Connections.
CoRR, 2016

Image Denoising Using Very Deep Fully Convolutional Encoder-Decoder Networks with Symmetric Skip Connections.
CoRR, 2016

Discriminative Training of Deep Fully-connected Continuous CRF with Task-specific Loss.
CoRR, 2016

Structured Learning of Binary Codes with Column Generation.
CoRR, 2016

Reading Car License Plates Using Deep Convolutional Neural Networks and LSTMs.
CoRR, 2016

Unsupervised Feature Learning With Symmetrically Connected Convolutional Denoising Auto-encoders.
CoRR, 2016

Where to Focus: Query Adaptive Matching for Instance Retrieval Using Convolutional Feature Maps.
CoRR, 2016

Image Restoration Using Very Deep Convolutional Encoder-Decoder Networks with Symmetric Skip Connections.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Cluster Sparsity Field for Hyperspectral Imagery Denoising.
Proceedings of the Computer Vision - ECCV 2016, 2016

Image Co-localization by Mimicking a Good Detector's Confidence Score Distribution.
Proceedings of the Computer Vision - ECCV 2016, 2016

SLNSW-UTS: A Historical Image Dataset for Image Multi-Labeling and Retrieval.
Proceedings of the 2016 International Conference on Digital Image Computing: Techniques and Applications, 2016

Fast Training of Triplet-Based Deep Binary Embedding Networks.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge from External Sources.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

What Value Do Explicit High Level Concepts Have in Vision to Language Problems?
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

What's Wrong with That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Less is More: Zero-Shot Learning from Online Textual Documents with Noise Suppression.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Hashing on Nonlinear Manifolds.
IEEE Trans. Image Process., 2015

A Computational Model of the Short-Cut Rule for 2D Shape Decomposition.
IEEE Trans. Image Process., 2015

Worst Case Linear Discriminant Analysis as Scalable Semidefinite Feasibility Problems.
IEEE Trans. Image Process., 2015

CRF learning with CNN features for image segmentation.
Pattern Recognit., 2015

Supervised Hashing Using Graph Cuts and Boosted Decision Trees.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Extrinsic Methods for Coding and Dictionary Learning on Grassmann Manifolds.
Int. J. Comput. Vis., 2015

Unsupervised Feature Learning for Dense Correspondences across Scenes.
CoRR, 2015

Image Captioning with an Intermediate Attributes Layer.
CoRR, 2015

Temporal Pyramid Pooling Based Convolutional Neural Networks for Action Recognition.
CoRR, 2015

Cross-convolutional-layer Pooling for Generic Visual Recognition.
CoRR, 2015

Data Driven Robust Image Guided Depth Map Restoration.
CoRR, 2015

Deeply Learning the Messages in Message Passing Inference.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Efficient SDP inference for fully-connected CRFs based on low-rank decomposition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning graph structure for multi-label image classification via clique generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Supervised Discrete Hashing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning to rank in person re-identification with metric ensembles.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Sequence searching with deep-learnt depth for condition- and viewpoint-invariant route-based place recognition.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Deep convolutional neural fields for depth estimation from a single image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

The treasure beneath convolutional layers: Cross-convolutional-layer pooling for image classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Depth and surface normal estimation from monocular images using regression on deep features and hierarchical CRFs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Mid-level deep pattern mining.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Semidefinite Programming.
Computer Vision, A Reference Guide, 2014

Efficient Dual Approach to Distance Metric Learning.
IEEE Trans. Neural Networks Learn. Syst., 2014

RandomBoost: Simplified Multiclass Boosting Through Randomization.
IEEE Trans. Neural Networks Learn. Syst., 2014

Asymmetric Pruning for Learning Cascade Detectors.
IEEE Trans. Multim., 2014

Context-Aware Hypergraph Construction for Robust Spectral Clustering.
IEEE Trans. Knowl. Data Eng., 2014

Multiple Kernel Learning in the Primal for Multimodal Alzheimer's Disease Classification.
IEEE J. Biomed. Health Informatics, 2014

Efficient Semidefinite Spectral Clustering via Lagrange Duality.
IEEE Trans. Image Process., 2014

Large-Margin Learning of Compact Binary Image Encodings.
IEEE Trans. Image Process., 2014

Characterness: An Indicator of Text in the Wild.
IEEE Trans. Image Process., 2014

Multiple kernel clustering based on centered kernel alignment.
Pattern Recognit., 2014

A Hierarchical Word-Merging Algorithm with Class Separability Measure.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

StructBoost: Boosting Methods for Predicting Structured Output Variables.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Fast approximate L∞ minimization: Speeding up robust regression.
Comput. Stat. Data Anal., 2014

Large-scale Binary Quadratic Optimization Using Semidefinite Relaxation and Applications.
CoRR, 2014

Face Identification with Second-Order Pooling.
CoRR, 2014

Face Image Classification by Pooling Raw Features.
CoRR, 2014

From Kernel Machines to Ensemble Learning.
CoRR, 2014

Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Strengthening the Effectiveness of Pedestrian Detection with Spatially Pooled Features.
Proceedings of the Computer Vision - ECCV 2014, 2014

Optimizing Ranking Measures for Compact Binary Code Learning.
Proceedings of the Computer Vision - ECCV 2014, 2014

Fast Supervised Hashing with Decision Trees for High-Dimensional Data.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Shape Similarity Analysis by Self-Tuning Locally Constrained Mixed-Diffusion.
IEEE Trans. Multim., 2013

A survey of appearance models in visual object tracking.
ACM Trans. Intell. Syst. Technol., 2013

Approximate Least Trimmed Sum of Squares Fitting and Applications in Image Analysis.
IEEE Trans. Image Process., 2013

Visual Tracking With Spatio-Temporal Dempster-Shafer Information Fusion.
IEEE Trans. Image Process., 2013

Incremental Learning of 3D-DCT Compact Representations for Robust Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Fully corrective boosting with arbitrary loss and regularization.
Neural Networks, 2013

Training Effective Node Classifiers for Cascade Classification.
Int. J. Comput. Vis., 2013

Fast Approximate L∞ Minimization: Speeding Up Robust Regression.
CoRR, 2013

An Efficient Dual Approach to Distance Metric Learning.
CoRR, 2013

RandomBoost: Simplified Multi-class Boosting through Randomization.
CoRR, 2013

Constraint Reduction using Marginal Polytope Diagrams for MAP LP Relaxations.
CoRR, 2013

Generic Image Classification Approaches Excel on Face Recognition.
CoRR, 2013

Efficient pedestrian detection by directly optimize the partial area under the ROC curve.
CoRR, 2013

A scalable stage-wise approach to large-margin multi-class loss based boosting.
CoRR, 2013

Multiple Kernel Learning in the Primal for Multi-modal Alzheimer's Disease Classification.
CoRR, 2013

Contextual Hypergraph Modelling for Salient Object Detection.
CoRR, 2013

Learning Hash Functions Using Column Generation.
Proceedings of the 30th International Conference on Machine Learning, 2013

Extended depth-of-field via focus stacking and graph cuts.
Proceedings of the IEEE International Conference on Image Processing, 2013

Approximate constraint generation for efficient structured boosting.
Proceedings of the IEEE International Conference on Image Processing, 2013

Leveraging surrounding context for scene text detection.
Proceedings of the IEEE International Conference on Image Processing, 2013

Efficient Pedestrian Detection by Directly Optimizing the Partial Area under the ROC Curve.
Proceedings of the IEEE International Conference on Computer Vision, 2013

A General Two-Step Approach to Learning-Based Hashing.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Contextual Hypergraph Modeling for Salient Object Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Dictionary Learning and Sparse Coding on Grassmann Manifolds: An Extrinsic Solution.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Part-Based Visual Tracking with Online Latent Structural Learning.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Bilinear Programming for Human Activity Recognition with Unknown MRF Graphs.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

A Fast Semidefinite Approach to Solving Binary Quadratic Problems.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Inductive Hashing on Manifolds.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Learning Compact Binary Codes for Visual Tracking.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Fast and Robust Object Detection Using Asymmetric Totally Corrective Boosting.
IEEE Trans. Neural Networks Learn. Syst., 2012

UBoost: Boosting with the Universum.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Positive Semidefinite Metric Learning Using Boosting-like Algorithms.
J. Mach. Learn. Res., 2012

A Direct Approach to Multi-class Boosting and Extensions.
CoRR, 2012

Is margin preserved after random projection?
Proceedings of the 29th International Conference on Machine Learning, 2012

Robust Tracking with Weighted Online Structured Learning.
Proceedings of the Computer Vision - ECCV 2012, 2012

Sharing features in multi-class boosting via group sparsity.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Non-sparse linear representations for visual tracking with online reservoir metric learning.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Fast Training of Effective Multi-class Boosting Using Coordinate Descent Optimization.
Proceedings of the Computer Vision - ACCV 2012, 2012

2011
Efficiently Learning a Detection Cascade With Sparse Eigenvectors.
IEEE Trans. Image Process., 2011

Incremental Training of a Detector Using Online Sparse Eigendecomposition.
IEEE Trans. Image Process., 2011

Face Recognition using Optimal Representation Ensemble.
CoRR, 2011

Graph mode-based contextual kernels for robust SVM tracking.
Proceedings of the IEEE International Conference on Computer Vision, 2011

On the Optimality of Sequential Forward Feature Selection Using Class Separability Measure.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Laplacian Margin Distribution Boosting for Learning from Sparsely Labeled Data.
Proceedings of the 2011 International Conference on Digital Image Computing: Techniques and Applications (DICTA), 2011

Is face recognition really a Compressive Sensing problem?
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

A scalable dual approach to semidefinite metric learning.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

A direct formulation for totally-corrective multi-class boosting.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Real-time visual tracking using compressive sensing.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Efficiently Learning a Distance Metric for Large Margin Nearest Neighbor Classification.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Feature selection with redundancy-constrained class separability.
IEEE Trans. Neural Networks, 2010

Boosting through optimization of margin distributions.
IEEE Trans. Neural Networks, 2010

Scalable large-margin Mahalanobis distance metric learning.
IEEE Trans. Neural Networks, 2010

Generalized Kernel-Based Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2010

On the Dual Formulation of Boosting Algorithms.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Interactive color image segmentation with linear programming.
Mach. Vis. Appl., 2010

Real-time Visual Tracking Using Sparse Representation.
CoRR, 2010

Totally Corrective Multiclass Boosting with Binary Weak Learners.
CoRR, 2010

Effective Pedestrian Detection Using Center-symmetric Local Binary/Trinary Patterns.
CoRR, 2010

Optimally Training a Cascade Classifier.
CoRR, 2010

Incremental Training of a Detector Using Online Sparse Eigen-decomposition.
CoRR, 2010

Hippocampal Shape Classification Using Redundancy Constrained Feature Selection.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2010

Pedestrian Detection Using Center-Symmetric Local Binary Patterns.
Proceedings of the International Conference on Image Processing, 2010

Improved human detection and classification in thermal images.
Proceedings of the International Conference on Image Processing, 2010

Training a multi-exit cascade with linear asymmetric classification for efficient object detection.
Proceedings of the International Conference on Image Processing, 2010

LACBoost and FisherBoost: Optimally Building Cascade Classifiers.
Proceedings of the Computer Vision, 2010

Robust Face Recognition via Accurate Face Alignment and Sparse Representation.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2010

Rapid face recognition using hashing.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Pyramid Center-Symmetric Local Binary/Trinary Patterns for Effective Pedestrian Detection.
Proceedings of the Computer Vision - ACCV 2010, 2010

Asymmetric Totally-Corrective Boosting for Real-Time Object Detection.
Proceedings of the Computer Vision - ACCV 2010, 2010

Face Detection with Effective Feature Extraction.
Proceedings of the Computer Vision - ACCV 2010, 2010

Totally-Corrective Multi-class Boosting.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
A Duality View of Boosting Algorithms.
CoRR, 2009

Positive Semidefinite Metric Learning with Boosting.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

An overview of fast pedestrian detection: Feature selection and cascade framework of boosted features.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A Two-Layer Night-Time Vehicle Detector.
Proceedings of the DICTA 2009, 2009

Smooth Approximation of L_infinity-Norm for Multi-view Geometry.
Proceedings of the DICTA 2009, 2009

Efficiently training a better visual detector with sparse eigenvectors.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

A Variant of the Trace Quotient Formulation for Dimensionality Reduction.
Proceedings of the Computer Vision, 2009

A Scalable Algorithm for Learning a Mahalanobis Distance Metric.
Proceedings of the Computer Vision, 2009

2008
Fast Pedestrian Detection Using a Cascade of Boosted Covariance Features.
IEEE Trans. Circuits Syst. Video Technol., 2008

Supervised dimensionality reduction via sequential semidefinite programming.
Pattern Recognit., 2008

PSDBoost: Matrix-Generation Linear Programming for Positive Semidefinite Matrices Learning.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

An experimental study on pedestrian classification using local features.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

Face detection from few training examples.
Proceedings of the International Conference on Image Processing, 2008

A Fast Algorithm for Creating a Compact and Discriminative Visual Codebook.
Proceedings of the Computer Vision, 2008

Self-Calibrating Cameras Using Semidefinite Programming.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2008

Multi-view Human Motion Capture with an Improved Deformation Skin Model.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2008

Boosting the Minimum Margin: LPBoost vs. AdaBoost.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2008

Learning Cascaded Reduced-Set SVMs Using Linear Programming.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2008

2007
Fast Global Kernel Density Mode Seeking: Applications to Localization and Tracking.
IEEE Trans. Image Process., 2007

Adaptive Object Tracking Based on an Effective Appearance Filter.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Object-Respecting Color Image Segmentation.
Proceedings of the International Conference on Image Processing, 2007

Feature Extraction Using Sequential Semidefinite Programming.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2007

An Experimental Evaluation of Local Features for Pedestrian Classification.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2007

Color Image Labelling Using Linear Programming.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2007

Kernel-based Tracking from a Probabilistic Viewpoint.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

A Convex Programming Approach to the Trace Quotient Problem.
Proceedings of the Computer Vision, 2007

2006
Classification-Based Likelihood Functions for Bayesian Tracking.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

Enhanced Kernel-Based Tracking for Monochromatic and Thermographic Video.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

An LMI Approach for Reliable PTZ Camera Self-Calibration.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

2005
Adaptive over-relaxed mean shift.
Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005

Visual tracking via efficient kernel discriminant subspace learning.
Proceedings of the 2005 International Conference on Image Processing, 2005

Augmented particle filtering for efficient visual tracking.
Proceedings of the 2005 International Conference on Image Processing, 2005

Fast Global Kernel Density Mode Seeking with Application to Localisation and Tracking.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

2004
Enhanced Importance Sampling: Unscented Auxiliary Particle Filtering for Visual Tracking.
Proceedings of the AI 2004: Advances in Artificial Intelligence, 2004

2D Articulated Tracking with Dynamic Bayesian Networks.
Proceedings of the 2004 International Conference on Computer and Information Technology (CIT 2004), 2004

2003
Probabilistic Multiple Cue Integration for Particle Filter Based Tracking.
Proceedings of the Seventh International Conference on Digital Image Computing: Techniques and Applications, 2003

