Humphrey Shi

ORCID: 0000-0002-2922-5663

Affiliations:
  • Georgia Tech, Atlanta, GA, USA
  • University of Illinois at Urbana-Champaign, Beckman Institute, Urbana, IL, USA
  • University of Oregon, Eugene, OR, USA (former)


According to our database, Humphrey Shi authored at least 141 papers between 2016 and 2024.

Bibliography

2024
FaceCLIP: Facial Image-to-Video Translation via a Brief Text Description.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Understanding and Accelerating Neural Architecture Search With Training-Free and Theory-Grounded Metrics.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment.
CoRR, 2024

Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis.
CoRR, 2024

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts.
CoRR, 2024

UVMap-ID: A Controllable and Personalized UV Map Generative Model.
CoRR, 2024

OpenBias: Open-set Bias Detection in Text-to-Image Generative Models.
CoRR, 2024

Learning Trimaps via Clicks for Image Matting.
CoRR, 2024

Benchmarking Object Detectors with COCO: A New Path Forward.
CoRR, 2024

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text.
CoRR, 2024

Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level.
CoRR, 2024

Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community.
CoRR, 2024

VASE: Object-Centric Appearance and Shape Manipulation of Real Videos.
CoRR, 2024

Towards Better Structured Pruning Saliency by Reorganizing Convolution.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Continuous Adaptation for Interactive Segmentation Using Teacher-Student Architecture.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Video Instance Matting.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

VMFormer: End-to-End Video Matting with Transformer.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Semi-supervised transfer learning with hierarchical self-regularization.
Pattern Recognit., December, 2023

Pyramid Attention Network for Image Restoration.
Int. J. Comput. Vis., December, 2023

Interactive Neural Painting.
Comput. Vis. Image Underst., October, 2023

CCNet: Criss-Cross Attention for Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Broad Spectrum Image Deblurring via an Adaptive Super-Network.
IEEE Trans. Image Process., 2023

Collaborative Content-Dependent Modeling: A Return to the Roots of Salient Object Detection.
IEEE Trans. Image Process., 2023

VCoder: Versatile Vision Encoders for Multimodal Large Language Models.
CoRR, 2023

HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models.
CoRR, 2023

Diffusion for Natural Image Matting.
CoRR, 2023

Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models.
CoRR, 2023

HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models.
CoRR, 2023

Combating Label Noise With A General Surrogate Model For Sample Selection.
CoRR, 2023

Multi-Concept T2I-Zero: Tweaking Only The Text Embeddings and Nothing Else.
CoRR, 2023

Reference-based Painterly Inpainting via Diffusion: Crossing the Wild Reference Domain Gap.
CoRR, 2023

FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude.
CoRR, 2023

Matting Anything.
CoRR, 2023

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models.
CoRR, 2023

Zero-shot Generative Model Adaptation via Image-specific Prompt Learning.
CoRR, 2023

Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models.
CoRR, 2023

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models.
CoRR, 2023

Image Completion with Heterogeneously Filtered Spectral Hints.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Sim2RealVS: A New Benchmark for Video Stabilization with a Strong Baseline.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

More Control for Free! Image Synthesis with Semantic Diffusion Guidance.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Learning Mask-aware CLIP Representations for Zero-Shot Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SeMask: Semantically Masked Transformers for Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models to Learn Any Unseen Style.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

VideoMatt: A Simple Baseline for Accessible Real-Time Video Matting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ConvMLP: Hierarchical Convolutional MLPs for Vision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OneFormer: One Transformer to Rule Universal Image Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Automatic High Resolution Wire Segmentation and Removal.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Graph Transformer GANs for Graph-Constrained House Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neighborhood Attention Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Boosted Dynamic Neural Networks.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
AlignSeg: Feature-Aligned Segmentation Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model.
CoRR, 2022

StyleNAT: Giving Each Head a New Perspective.
CoRR, 2022

Dilated Neighborhood Attention Transformer.
CoRR, 2022

Grasping the Arrow of Time from the Singularity: Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN.
CoRR, 2022

Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Mask Matching Transformer for Few-Shot Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ReCoRo: Region-Controllable Robust Light Enhancement with User-Specified Imprecise Masks.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Rev: A Video Engine for Object Re-identification at the City Scale.
Proceedings of the 7th IEEE/ACM Symposium on Edge Computing, 2022

SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image.
Proceedings of the Computer Vision - ECCV 2022, 2022

AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Point-to-Box Network for Accurate Object Detection via Single Point Supervision.
Proceedings of the Computer Vision - ECCV 2022, 2022

Object Localization under Single Coarse Point Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DiSparse: Disentangled Sparsification for Multitask Model Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Towards Layer-wise Image Vectorization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Numerical Study of Air Flow Induced by Shock Impact on an Array of Perforated Plates.
Entropy, 2021

Global and Local Alignment Networks for Unpaired Image-to-Image Translation.
CoRR, 2021

Feudal Reinforcement Learning by Reading Manuals.
CoRR, 2021

Understanding and Accelerating Neural Architecture Search with Training-Free and Theory-Grounded Metrics.
CoRR, 2021

MSN: Efficient Online Mask Selection Network for Video Instance Segmentation.
CoRR, 2021

Escaping the Big Data Paradigm with Compact Transformers.
CoRR, 2021

UltraSR: Spatial Encoding is a Missing Key for Implicit Image Function-based Arbitrary-Scale Super-Resolution.
CoRR, 2021

MUSE: Textual Attributes Guided Portrait Painting Generation.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Study Group Learning: Improving Retinal Vessel Segmentation Trained with Noisy Labels.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Interpretable Visual Reasoning via Induced Symbolic Space.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Multi-Mode Modulator for Multi-Domain Few-Shot Classification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Is In-Domain Data Really Needed? A Pilot Study on Cross-Domain Calibration for Network Quantization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Rethinking Text Segmentation: A Novel Dataset and a Text-Specific Refinement Approach.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

RSCA: Real-Time Segmentation-Based Context-Aware Scene Text Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Pseudo-IoU: Improving Label Assignment in Anchor-Free Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Learning to Track Instances without Video Annotations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

High-Resolution Deep Image Matting.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Any-Precision Deep Neural Networks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Clique: Spatiotemporal Object Re-identification at the City Scale.
CoRR, 2020

MUSE: Illustrating Textual Attributes by Portrait Generation.
CoRR, 2020

Deep Learning for 3D Point Cloud Understanding: A Survey.
CoRR, 2020

The 1st Tiny Object Detection Challenge: Methods and Results.
CoRR, 2020

Deep Learning-Based Automated Image Segmentation for Concrete Petrographic Analysis.
CoRR, 2020

Pyramid Attention Networks for Image Restoration.
CoRR, 2020

Laplacian Denoising Autoencoder.
CoRR, 2020

Deep Affinity Net: Instance Segmentation via Affinity.
CoRR, 2020

Human-Object Interaction Detection: A Quick Survey and Examination of Methods.
Proceedings of the HuMA'20: Proceedings of the 1st International Workshop on Human-centric Multimedia Analysis, 2020

SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems.
Proceedings of the Third Conference on Machine Learning and Systems, 2020

Motion Pyramid Networks for Accurate and Efficient Cardiac Motion Estimation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

The 1st Tiny Object Detection Challenge: Methods and Results.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

FOAL: Fast Online Adaptive Learning for Cardiac Motion Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Alleviating Semantic-level Shift: A Semi-supervised Domain Adaptation Method for Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Image Super-Resolution With Cross-Scale Non-Local Attention and Exhaustive Self-Exemplars Mining.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020


HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

When AWGN-Based Denoiser Meets Real Noises.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Bottom-up Higher-Resolution Networks for Multi-Person Pose Estimation.
CoRR, 2019

SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection.
CoRR, 2019

A Novel Framework for 3D-2D Vertebra Matching.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Self-Similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SPGNet: Semantic Prediction Guidance for Scene Parsing.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Geometry-Aware Distillation for Indoor Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

SpotTune: Transfer Learning Through Adaptive Fine-Tuning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Improving Object Detection from Scratch via Gated Feature Reuse.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Weakly Supervised Scene Parsing with Point-Based Distance Metric Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Horizontal Pyramid Matching for Person Re-Identification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Self-similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-identification.
CoRR, 2018

A Simple Non-i.i.d. Sampling Approach for Efficient Training and Better Generalization.
CoRR, 2018

Decoupled Classification Refinement: Hard False Positive Suppression for Object Detection.
CoRR, 2018

TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection.
CoRR, 2018

Object-Centric Spatio-Temporal Activity Detection and Recognition.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

VisDrone-DET2018: The Vision Meets Drone Object Detection in Image Challenge Results.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection.
Proceedings of the Computer Vision - ECCV 2018, 2018

Revisiting RCNN: On Awakening the Classification Power of Faster RCNN.
Proceedings of the Computer Vision - ECCV 2018, 2018

Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Geometry-Aware Traffic Flow Analysis by Detection and Tracking.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Deep learning in sequential data analysis.
PhD thesis, 2017

Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids.
CoRR, 2017

Effective object detection from traffic camera videos.
Proceedings of the 2017 IEEE SmartWorld, 2017

Computed tomography super-resolution using convolutional neural networks.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017


Balanced Two-Stage Residual Networks for Image Super-Resolution.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
Seq-NMS for Video Object Detection.
CoRR, 2016

Epitomic Image Super-Resolution.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

