Kevin J. Shih

According to our database1, Kevin J. Shih authored at least 30 papers between 2013 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Partial Convolution for Padding, Inpainting, and Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Multilingual Multiaccented Multispeaker TTS with RADTTS.
CoRR, 2023

P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

High-Acoustic Fidelity Text To Speech Synthesis With Fine-Grained Control Of Speech Attributes.
Proceedings of the IEEE International Conference on Acoustics, 2023

Vani: Very-Lightweight Accent-Controllable TTS for Native And Non-Native Speakers With Identity Preservation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Revisiting Image-Language Networks for Open-Ended Phrase Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows.
CoRR, 2022

One TTS Alignment to Rule Them All.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis.
Proceedings of the 9th International Conference on Learning Representations, 2021

2019
Video Interpolation and Prediction with Unsupervised Landmarks.
CoRR, 2019

Graphical Contrastive Losses for Scene Graph Generation.
CoRR, 2019

Unsupervised Video Interpolation Using Cycle Consistency.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Graphical Contrastive Losses for Scene Graph Parsing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Improving Semantic Segmentation via Video Propagation and Label Relaxation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Partial Convolution based Padding.
CoRR, 2018

An Interpretable Model for Scene Graph Generation.
CoRR, 2018

Open-vocabulary Phrase Detection.
CoRR, 2018

SDCNet: Video Prediction Using Spatially-Displaced Convolution.
CoRR, 2018

Introduction to the 1st Place Winning Model of OpenImages Relationship Detection Challenge.
CoRR, 2018

SDC-Net: Video Prediction Using Spatially-Displaced Convolution.
Proceedings of the Computer Vision - ECCV 2018, 2018

Image Inpainting for Irregular Holes Using Partial Convolutions.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning Interpretable Spatial Operations in a Rich 3D Blocks World.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learning visual tasks with selective attention
PhD thesis, 2017

Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Where to Look: Focus Regions for Visual Question Answering.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Learning Discriminative Collections of Part Detectors for Object Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization.
Proceedings of the British Machine Vision Conference 2015, 2015

2013
Learning Collections of Part Models for Object Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013


  Loading...