Ping Luo

Orcid: 0000-0002-6685-7950

Affiliations:

University of Hong Kong, Department of Computer Science, Hong Kong
Chinese University of Hong Kong, Department of Information Engineering, Hong Kong (PhD 2014)
Sun Yat-Sen University, School of Software, Guangzhou, China (former)
Lotus Hill Institute, China (former)

According to our database, Ping Luo authored at least 251 papers between 2009 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

Deeply Unsupervised Patch Re-Identification for Pre-Training Object Detectors.

IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations.

IEEE Trans. Neural Networks Learn. Syst., February, 2024

Context Autoencoder for Self-supervised Representation Learning.

Int. J. Comput. Vis., January, 2024

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis.

CoRR, 2024

AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks.

CoRR, 2024

PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models.

CoRR, 2024

2023

RestoreFormer++: Towards Real-World Blind Face Restoration From Undegraded Key-Value Pairs.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Sparse R-CNN: An End-to-End Framework for Object Detection.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

CycleMLP: A MLP-Like Architecture for Dense Visual Predictions.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid.

IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

ZoomNAS: Searching for Whole-Body Human Pose Estimation in the Wild.

IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

RelativeNAS: Relative Neural Architecture Search via Slow-Fast Learning.

IEEE Trans. Neural Networks Learn. Syst., 2023

MGL: Mutual Graph Learning for Camouflaged Object Detection.

IEEE Trans. Image Process., 2023

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.

CoRR, 2023

A Survey of Reasoning with Foundation Models.

CoRR, 2023

You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception.

CoRR, 2023

Tree-Planner: Efficient Close-loop Task Planning with Large Language Models.

CoRR, 2023

MeanAP-Guided Reinforced Active Learning for Object Detection.

CoRR, 2023

LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving.

CoRR, 2023

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis.

CoRR, 2023

SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving.

CoRR, 2023

StyleAdapter: A Single-Pass LoRA-Free Model for Stylized Image Generation.

CoRR, 2023

GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition.

CoRR, 2023

OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models.

CoRR, 2023

Exploring Transformers for Open-world Instance Segmentation.

CoRR, 2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.

CoRR, 2023

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest.

CoRR, 2023

Align, Adapt and Inject: Sound-guided Unified Image Generation.

CoRR, 2023

LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models.

CoRR, 2023

DiffRate: Differentiable Compression Rate for Efficient Vision Transformers.

CoRR, 2023

VDT: An Empirical Study on Video Diffusion with Transformers.

CoRR, 2023

VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks.

CoRR, 2023

Going Denser with Open-Vocabulary Part Segmentation.

CoRR, 2023

VideoChat: Chat-Centric Video Understanding.

CoRR, 2023

InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language.

CoRR, 2023

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans.

CoRR, 2023

Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving.

CoRR, 2023

MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation.

CoRR, 2023

Topology Reasoning for Driving Scenes.

CoRR, 2023

EGC: Image Generation and Classification via a Diffusion Energy-Based Model.

CoRR, 2023

Multi-Level Contrastive Learning for Dense Prediction Task.

CoRR, 2023

DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving.

CoRR, 2023

Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction.

CoRR, 2023

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception.

CoRR, 2023

EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners.

Proceedings of the International Conference on Machine Learning, 2023

ChiPFormer: Transferable Chip Placement via Offline Decision Transformer.

Proceedings of the International Conference on Machine Learning, 2023

Learning Object-Language Alignments for Open-Vocabulary Object Detection.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dense Distinct Query for End-to-End Object Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Transferable Spatiotemporal Representations from Natural Script Knowledge.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Policy Adaptation from Foundation Model Feedback.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Universal Instance Perception as Object Discovery and Retrieval.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training.

IEEE Trans. Parallel Distributed Syst., 2022

MetaCloth: Learning Unseen Tasks of Dense Fashion Landmark Detection From a Few Samples.

Yuying Ge

Ruimao Zhang

Ping Luo

IEEE Trans. Image Process., 2022

PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

PVT v2: Improved baselines with Pyramid Vision Transformer.

Comput. Vis. Media, 2022

Self-Play and Self-Describe: Policy Adaptation with Vision-Language Foundation Models.

CoRR, 2022

Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning.

CoRR, 2022

DiffusionDet: Diffusion Model for Object Detection.

CoRR, 2022

Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning.

CoRR, 2022

Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model.

CoRR, 2022

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe.

CoRR, 2022

ZoomNAS: Searching for Whole-body Human Pose Estimation in the Wild.

CoRR, 2022

Pose for Everything: Towards Category-Agnostic Pose Estimation.

CoRR, 2022

Exploiting Context Information for Generic Event Boundary Captioning.

CoRR, 2022

CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving.

CoRR, 2022

MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval.

CoRR, 2022

Semantic-Aware Pretraining for Dense Video Captioning.

CoRR, 2022

M^2BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation.

CoRR, 2022

End-to-End Video Text Spotting with Transformer.

CoRR, 2022

WegFormer: Transformers for Weakly Supervised Semantic Segmentation.

CoRR, 2022

MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning.

CoRR, 2022

BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions.

CoRR, 2022

DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Rethinking Resolution in the Context of Efficient Video Recognition.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning.

Yao Lai

Yao Mu

Ping Luo

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning.

Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix.

Proceedings of the International Conference on Machine Learning, 2022

CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer.

Proceedings of the International Conference on Machine Learning, 2022

Objects in Semantic Topology.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Dynamic Token Normalization improves Vision Transformers.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Versatile Neural Architectures by Propagating Network Codes.

Proceedings of the Tenth International Conference on Learning Representations, 2022

CycleMLP: A MLP-like Architecture for Dense Prediction.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Polygon-Free: Unconstrained Scene Text Detection with Box Annotations.

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

ByteTrack: Multi-object Tracking by Associating Every Detection Box.

Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Grand Unification of Object Tracking.

Proceedings of the Computer Vision - ECCV 2022, 2022

Pose for Everything: Towards Category-Agnostic Pose Estimation.

Proceedings of the Computer Vision - ECCV 2022, 2022

Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space.

Proceedings of the Computer Vision - ECCV 2022, 2022

3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal.

Proceedings of the Computer Vision - ECCV 2022, 2022

PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation.

Proceedings of the Computer Vision - ECCV 2022, 2022

MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval.

Proceedings of the Computer Vision - ECCV 2022, 2022

DaViT: Dual Attention Vision Transformers.

Proceedings of the Computer Vision - ECCV 2022, 2022

Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Language as Queries for Referring Video Object Segmentation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scale-Equivalent Distillation for Semi-Supervised Object Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Bridging Video-text Retrieval with Multiple Choice Questions.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following.

Proceedings of the Conference on Robot Learning, 2022

Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Switchable Normalization for Learning-to-Normalize Deep Representation.

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Dynamic Token Normalization Improves Vision Transformer.

CoRR, 2021

FAST: Searching for a Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation.

CoRR, 2021

ByteTrack: Multi-Object Tracking by Associating Every Detection Box.

CoRR, 2021

Revitalizing CNN Attentions via Transformers in Self-Supervised Visual Representation Learning.

CoRR, 2021

Towards High-Quality Temporal Action Detection with Sparse Proposals.

CoRR, 2021

Panoptic SegFormer.

CoRR, 2021

CycleMLP: A MLP-like Architecture for Dense Prediction.

CoRR, 2021

PVTv2: Improved Baselines with Pyramid Vision Transformer.

CoRR, 2021

BWCP: Probabilistic Learning-to-Prune Channels for ConvNets via Batch Whitening.

CoRR, 2021

Unsupervised Pretraining for Object Detection by Patch Reidentification.

CoRR, 2021

FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation.

CoRR, 2021

DetCo: Unsupervised Contrastive Learning for Object Detection.

CoRR, 2021

Trans2Seg: Transparent Object Segmentation with Transformer.

CoRR, 2021

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Model-Based Reinforcement Learning via Imagination with Derived Memory.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Compressed Video Contrastive Learning.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Rethinking the Pruning Criteria for Convolutional Neural Network.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Multi-frame Collaboration for Effective Endoscopic Video Polyp Detection via Spatial-Temporal Feature Transformation.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Multi-compound Transformer for Accurate Biomedical Image Segmentation.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Segmenting Transparent Objects in the Wild with Transformer.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution.

Proceedings of the 38th International Conference on Machine Learning, 2021

What Makes for End-to-End Object Detection?

Proceedings of the 38th International Conference on Machine Learning, 2021

Do 2D GANs Know 3D Shape? Unsupervised 3D Shape Reconstruction from 2D Image GANs.

Proceedings of the 9th International Conference on Learning Representations, 2021

STAR: A Structure-aware Lightweight Transformer for Real-time Image Enhancement.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

DetCo: Unsupervised Contrastive Learning for Object Detection.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

End-to-End Dense Video Captioning with Parallel Decoding.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Bringing Events into Video Deblurring with Non-consecutively Blurry Frames.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Watch Only Once: An End-to-End Video Action Detection Framework.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Adversarial Robustness for Unsupervised Domain Adaptation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Sparse R-CNN: End-to-End Object Detection With Learnable Proposals.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Parser-Free Virtual Try-On via Distilling Appearance Flows.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Disentangled Cycle Consistency for Highly-Realistic Virtual Try-On.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

HR-NAS: Searching Efficient High-Resolution Neural Architectures With Lightweight Transformers.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

SSN: Learning Sparse Switchable Normalization via SparsestMax.

Int. J. Comput. Vis., 2020

TransTrack: Multiple-Object Tracking with Transformer.

CoRR, 2020

OneNet: Towards End-to-End One-Stage Object Detection.

CoRR, 2020

SelfText Beyond Polygon: Unconstrained Text Detection with Box Supervision and Dynamic Self-Training.

CoRR, 2020

Convolution-Weight-Distribution Assumption: Rethinking the Criteria of Channel Pruning.

Zhongzhan Huang

Xinjiang Wang

Ping Luo

CoRR, 2020

AdaX: Adaptive Gradient Descent with Exponential Long Term Memory.

CoRR, 2020

Domain-Adaptive Few-Shot Learning.

CoRR, 2020

How Does BN Increase Collapsed Neural Network Filters?

CoRR, 2020

UXNet: Searching Multi-level Feature Aggregation for 3D Medical Image Segmentation.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Channel Equilibrium Networks for Learning Deep Representation.

Proceedings of the 37th International Conference on Machine Learning, 2020

Webly Supervised Image Classification with Self-contained Confidence.

Proceedings of the Computer Vision - ECCV 2020, 2020

Segmenting Transparent Objects in the Wild.

Proceedings of the Computer Vision - ECCV 2020, 2020

AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting.

Proceedings of the Computer Vision - ECCV 2020, 2020

Dynamic and Static Context-Aware LSTM for Multi-agent Motion Prediction.

Proceedings of the Computer Vision - ECCV 2020, 2020

Whole-Body Human Pose Estimation in the Wild.

Proceedings of the Computer Vision - ECCV 2020, 2020

Differentiable Hierarchical Graph Grouping for Multi-person Pose Estimation.

Proceedings of the Computer Vision - ECCV 2020, 2020

Exemplar Normalization for Learning Deep Representation.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

3D Human Mesh Regression With Dense Correspondence.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Towards Photo-Realistic Virtual Try-On by Adaptively Generating↔Preserving Image Content.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PolarMask: Single Shot Instance Segmentation With Polar Representation.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning a Reinforced Agent for Flexible Exposure Bracketing Selection.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

MaskGAN: Towards Diverse and Interactive Facial Image Manipulation.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Online Knowledge Distillation via Collaborative Learning.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Depth-Guided Convolutions for Monocular 3D Object Detection.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Human Centric Visual Analysis with Deep Learning

Springer, ISBN: 978-981-13-2386-7, 2020

2019

SCAN: Self-and-Collaborative Attention Network for Video Person Re-Identification.

IEEE Trans. Image Process., 2019

TextSR: Content-Aware Text Super-Resolution Guided by Recognition.

CoRR, 2019

Towards Improving Generalization of Deep Networks via Consistent Normalization.

CoRR, 2019

WIDER Face and Pedestrian Challenge 2018: Methods and Results.

CoRR, 2019

DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images.

CoRR, 2019

Differentiable Dynamic Normalization for Learning Deep Representation.

Proceedings of the 36th International Conference on Machine Learning, 2019

Towards Understanding Regularization in Batch Normalization.

Proceedings of the 7th International Conference on Learning Representations, 2019

Differentiable Learning-to-Normalize via Switchable Normalization.

Proceedings of the 7th International Conference on Learning Representations, 2019

Vision-Infused Deep Audio Inpainting.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Differentiable Learning-to-Group Channels via Groupable Convolutional Neural Networks.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Switchable Whitening for Deep Representation Learning.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Once a MAN: Towards Multi-Target Attack via Learning Multi-Target Adversarial Network Once.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Deep Self-Learning From Noisy Labels.

Jiangfan Han

Ping Luo

Xiaogang Wang

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

CamNet: Coarse-to-Fine Retrieval for Camera Re-Localization.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SSN: Learning Sparse Switchable Normalization via SparsestMax.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Faceness-Net: Face Detection through Deep Facial Part Responses.

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Deep Learning Markov Random Field for Semantic Segmentation.

IEEE Trans. Pattern Anal. Mach. Intell., 2018

From Facial Expression Recognition to Interpersonal Relation Prediction.

Int. J. Comput. Vis., 2018

FaceFeat-GAN: a Two-Stage Approach for Identity-Preserving Face Synthesis.

CoRR, 2018

Do Normalization Layers in a Deep ConvNet Really Need to Be Distinct?

CoRR, 2018

Differentiable Learning-to-Normalize via Switchable Normalization.

Ping Luo

Jiamin Ren

Zhanglin Peng

CoRR, 2018

Batch Kalman Normalization: Towards Training Deep Neural Networks with Micro-Batches.

CoRR, 2018

Kalman Normalization: Normalizing Internal Representations Across Network Layers.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos.

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net.

Proceedings of the Computer Vision - ECCV 2018, 2018

FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

CUImage: A Neverending Learning Platform on a Convolutional Knowledge Graph of Billion Web Images.

Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Scheduling Large-scale Distributed Training via Reinforcement Learning.

Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Mix-and-Match Tuning for Self-Supervised Semantic Segmentation.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Spatial as Deep: Spatial CNN for Traffic Scene Understanding.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

DeepID-Net: Object Detection with Deformable Part Based Convolutional Neural Networks.

IEEE Trans. Pattern Anal. Mach. Intell., 2017

Video Object Segmentation with Re-identification.

CoRR, 2017

Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks.

Proceedings of the 2017 ACM on Multimedia Conference, 2017

EigenNet: Towards Fast and Structural Learning of Deep Neural Networks.

Ping Luo

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Learning Deep Architectures via Generalized Whitened Neural Networks.

Ping Luo

Proceedings of the 34th International Conference on Machine Learning, 2017

Deep Dual Learning for Semantic Image Segmentation.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Object Interactions and Descriptions for Semantic Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Learning Compositional Shape Models of Multiple Distance Metrics by Information Projection.

[BibT_eX]

[DOI]

Ping Luo

Liang Lin

Xiaobai Liu

IEEE Trans. Neural Networks Learn. Syst., 2016

Clothes Co-Parsing Via Joint Image Segmentation and Labeling With Application to Clothing Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2016

Learning Deep Representation for Face Alignment with Auxiliary Attributes.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2016

Joint Face Representation Adaptation and Clustering in Videos.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

Fashion Landmark Detection in the Wild.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

WIDER FACE: A Face Detection Benchmark.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Face Model Compression by Distilling Knowledge from Neurons.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Learning to Recognize Pedestrian Attribute.

[BibT_eX]

[DOI]

CoRR, 2015

Learning Social Relation Traits from Face Images.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

From Facial Parts Responses to Face Detection: A Deep Learning Approach.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Deep Learning Strong Parts for Pedestrian Detection.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Deep Learning Face Attributes in the Wild.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Semantic Image Segmentation via Deep Parsing Network.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

A large-scale car dataset for fine-grained categorization and verification.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Pedestrian detection aided by deep learning semantic tasks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

DeepID-Net: Deformable deep convolutional neural networks for object detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Deep Representation Learning with Target Coding.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Deep learning for attribute inference, parsing, and recognition of face.

[BibT_eX]

[DOI]

Ping Luo

PhD thesis, 2014

Deep Learning Multi-View Representation for Face Recognition.

[BibT_eX]

[DOI]

CoRR, 2014

Recover Canonical-View Faces in the Wild with Deep Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2014

Learning and Transferring Multi-task Deep Representation for Face Alignment.

[BibT_eX]

[DOI]

CoRR, 2014

DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection.

[BibT_eX]

[DOI]

CoRR, 2014

Multi-View Perceptron: a Deep Model for Learning Face Identity and View Representations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Pedestrian Attribute Recognition At Far Distance.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Facial Landmark Detection by Deep Multi-task Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2014, 2014

Clothing Co-parsing by Joint Image Segmentation and Labeling.

[BibT_eX]

[DOI]

Wei Yang

Ping Luo

Liang Lin

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Switchable Deep Network for Pedestrian Detection.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013

Deep Learning Identity-Preserving Face Space.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2013

A Deep Sum-Product Architecture for Robust Facial Attributes Analysis.

[BibT_eX]

[DOI]

Ping Luo

Xiaogang Wang

Xiaoou Tang

Proceedings of the IEEE International Conference on Computer Vision, 2013

Pedestrian Parsing via Deep Decompositional Network.

[BibT_eX]

[DOI]

Ping Luo

Xiaogang Wang

Xiaoou Tang

Proceedings of the IEEE International Conference on Computer Vision, 2013

2012

Representing and recognizing objects with massive local image patches.

[BibT_eX]

[DOI]

Pattern Recognit., 2012

Joint semantic segmentation by searching for compatible-competitive references.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Hierarchical face parsing via deep learning.

[BibT_eX]

[DOI]

Ping Luo

Xiaogang Wang

Xiaoou Tang

Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2010

A Discriminative Model for Object Representation and Detection via Sparse Features.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

Semantics-driven portrait cartoon stylization.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2010

Learning Shape Detector by Quantizing Curve Segments with Multiple Distance Metrics.

[BibT_eX]

[DOI]

Ping Luo

Liang Lin

Hongyang Chao

Proceedings of the Computer Vision, 2010

2009

Hierarchical 3D perception from a single image.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Image Processing, 2009
