Shenghua Gao

Orcid: 0000-0003-1626-2040

According to our database1, Shenghua Gao authored at least 158 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Anatomical Prior and Inter-Slice Consistency for Semi-Supervised Vertebral Structure Detection in 3D Ultrasound Volume.
IEEE J. Biomed. Health Informatics, April, 2024

Feature Re-Representation and Reliable Pseudo Label Retraining for Cross-Domain Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

2D Gaussian Splatting for Geometrically Accurate Radiance Fields.
CoRR, 2024

Bridging 3D Gaussian and Mesh for Freeview Video Rendering.
CoRR, 2024

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning.
CoRR, 2024

TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Towards Scalable 3D Anomaly Detection and Localization: A Benchmark via 3D Anomaly Synthesis and A Self-Supervised Learning Network.
CoRR, 2023

RoomDesigner: Encoding Anchor-latents for Style-consistent and Shape-compatible Indoor Scene Generation.
CoRR, 2023

DebSDF: Delving into the Details and Bias of Neural Indoor Scene Reconstruction.
CoRR, 2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation.
CoRR, 2023

InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions.
CoRR, 2023

Omni-Line-of-Sight Imaging for Holistic Shape Reconstruction.
CoRR, 2023

HFGD: High-level Feature Guided Decoder for Semantic Segmentation.
CoRR, 2023

P<sup>2</sup>SDF for Neural Indoor Scene Reconstruction.
CoRR, 2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revisiting Event-Based Video Frame Interpolation.
IROS, 2023

LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PlaneDepth: Self-Supervised Depth Estimation via Orthogonal Planes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Lifelong Person Re-identification via Knowledge Refreshing and Consolidation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Memorizing Structure-Texture Correspondence for Image Anomaly Detection.
IEEE Trans. Neural Networks Learn. Syst., 2022

Domain Adversarial Reinforcement Learning for Partial Domain Adaptation.
IEEE Trans. Neural Networks Learn. Syst., 2022

Proxy-Bridged Image Reconstruction Network for Anomaly Detection in Medical Images.
IEEE Trans. Medical Imaging, 2022

Chest X-Ray Diagnostic Quality Assessment: How Much Is Pixel-Wise Supervision Needed?
IEEE Trans. Medical Imaging, 2022

Spherical DNNs and Their Applications in 360$^\circ$∘ Images and Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Future Frame Prediction Network for Video Anomaly Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Liquid Warping GAN With Attention: A Unified Framework for Human Image Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Locating and Counting Heads in Crowds With a Depth Prior.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

VertMatch: A Semi-supervised Framework for Vertebral Structure Detection in 3D Ultrasound Volume.
CoRR, 2022

Electrical Tunable Spintronic Neuron with Trainable Activation Function.
CoRR, 2022

PlaneDepth: Plane-Based Self-Supervised Monocular Depth Estimation.
CoRR, 2022

PREF: Phasorial Embedding Fields for Compact Neural Representations.
CoRR, 2022

TaylorImNet for Fast 3D Shape Reconstruction Based on Implicit Surface Function.
CoRR, 2022

AS-MLP: An Axial Shifted MLP Architecture for Vision.
Proceedings of the Tenth International Conference on Learning Representations, 2022

UNIF: United Neural Implicit Functions for Clothed Human Reconstruction and Animation.
Proceedings of the Computer Vision - ECCV 2022, 2022

SVIP: Sequence VerIfication for Procedures in Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Cross-domain Trajectory Prediction with CTP-Net.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

HaViT: Hybrid-Attention Based Vision Transformer for Video Classification.
Proceedings of the Computer Vision - ACCV 2022, 2022

Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces.
Proceedings of the International Conference on 3D Vision, 2022

2021
RGB-D-based gaze point estimation via multi-column CNNs and facial landmarks global optimization.
Vis. Comput., 2021

Video Anomaly Detection with Sparse Coding Inspired Deep Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

SUNNet: A novel framework for simultaneous human parsing and pose estimation.
Neurocomputing, 2021

Normal graph: Spatial temporal graph convolutional networks based prediction network for skeleton based video anomaly detection.
Neurocomputing, 2021

CTP-Net For Cross-Domain Trajectory Prediction.
CoRR, 2021

OH-Former: Omni-Relational High-Order Transformer for Person Re-Identification.
CoRR, 2021

Partially-Supervised Learning for Vessel Segmentation in Ocular Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Memory-Assisted Dual-End Adaptation Network For Choroid Segmentation In Multi-Domain Optical Coherence Tomography.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

Accurate depth estimation from a hybrid event-RGB stereo setup.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Crowd Counting With Partial Annotations in an Image.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Prior Based Human Completion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning To Recommend Frame for Interactive Video Object Segmentation in the Wild.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Layout-Guided Novel View Synthesis From a Single Indoor Panorama.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Look Before You Leap: Learning Landmark Features for One-Stage Visual Grounding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Amodal Segmentation Based on Visible Region Segmentation and Shape Prior.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

KGDet: Keypoint-Guided Fashion Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Appearance-Motion Memory Consistency Network for Video Anomaly Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Graph Convolutional Neural Network for Skeleton-based Video Abnormal Behavior Detection.
Proceedings of the Generalization with Deep Learning: For Improvement on Sensing Capability, 2021

2020
Image saliency detection via multi-scale iterative CNN.
Vis. Comput., 2020

Noise Adaptation Generative Adversarial Network for Medical Image Analysis.
IEEE Trans. Medical Imaging, 2020

Correction to "Noise Adaptation Generative Adversarial Network for Medical Image Analysis".
IEEE Trans. Medical Imaging, 2020

Visual Tracking With Multiview Trajectory Prediction.
IEEE Trans. Image Process., 2020

Multi-level feature fusion based Locality-Constrained Spatial Transformer network for video crowd counting.
Neurocomputing, 2020

SIRI: Spatial Relation Induced Network For Spatial Description Resolution.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Sparse-Gan: Sparsity-Constrained Generative Adversarial Network for Anomaly Detection in Retinal OCT Image.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

Open-Set OCT Image Recognition with Synthetic Learning.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

SUNet: A Lesion Regularized Model for Simultaneous Diabetic Retinopathy and Diabetic Macular Edema Grading.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

Perceptual-Assisted Adversarial Adaptation for Choroid Segmentation in Optical Coherence Tomography.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

Towards Fast Adaptation of Neural Architectures with Meta Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Encoding Structure-Texture Relation with P-Net for Anomaly Detection in Retinal Images.
Proceedings of the Computer Vision - ECCV 2020, 2020

Structured3D: A Large Photo-Realistic Dataset for Structured 3D Modeling.
Proceedings of the Computer Vision - ECCV 2020, 2020

P<sup>2</sup>Net: Patch-Match and Plane-Regularization for Unsupervised Indoor Depth Estimation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fast-MVSNet: Sparse-to-Dense Multi-View Stereo With Learned Propagation and Gauss-Newton Refinement.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Geometric Structure Based and Regularized Depth Estimation From 360 Indoor Imagery.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Multiview Multitask Gaze Estimation With Deep Convolutional Neural Networks.
IEEE Trans. Neural Networks Learn. Syst., 2019

CE-Net: Context Encoder Network for 2D Medical Image Segmentation.
IEEE Trans. Medical Imaging, 2019

Personalized Saliency and Its Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Utilizing Information Bottleneck to Evaluate the Capability of Deep Neural Networks for Image Classification.
Entropy, 2019

BioNet: Infusing Biomarker Prior into Global-to-Local Network for Choroid Segmentation in Optical Coherence Tomography Images.
CoRR, 2019

Generic Multiview Visual Tracking.
CoRR, 2019

Wireframe Parsing With Guidance of Distance Map.
IEEE Access, 2019

Learning Semantics-aware Distance Map with Semantics Layering Network for Amodal Instance Segmentation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

SkrGAN: Sketching-Rendering Unconditional Generative Adversarial Networks for Medical Image Synthesis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Ki-GAN: Knowledge Infusion Generative Adversarial Network for Photoacoustic Image Reconstruction In Vivo.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Margin Learning Embedded Prediction for Video Anomaly Detection with A Few Anomalies.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Locality-Constrained Spatial Transformer Network for Video Crowd Counting.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Hybrid Neural Network for Photoacoustic Imaging Reconstruction.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

PPGNet: Learning Point-Pair Graph for Line Segment Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Single-Image Piece-Wise Planar 3D Reconstruction via Associative Embedding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Density Map Regression Guided Detection Network for RGB-D Crowd Counting and Localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Local to Global Learning: Gradually Adding Classes for Training Deep Neural Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

RGBD Based Gaze Estimation via Multi-Task CNN.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Body Structure Aware Deep Crowd Counting.
IEEE Trans. Image Process., 2018

Deep Surface Light Fields.
Proc. ACM Comput. Graph. Interact. Tech., 2018

Multi-Cell Multi-Task Convolutional Neural Networks for Diabetic Retinopathy Grading Kang.
CoRR, 2018

Fundus Image Quality-Guided Diabetic Retinopathy Grading.
Proceedings of the Computational Pathology and Ophthalmic Medical Image Analysis, 2018

Multi-Cell Multi-Task Convolutional Neural Networks for Diabetic Retinopathy Grading.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Saliency Detection in 360 ^\circ ∘ Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

Evaluating Capability of Deep Neural Networks for Image Classification via Information Plane.
Proceedings of the Computer Vision - ECCV 2018, 2018

Encoding Crowd Interaction With Deep Neural Network for Pedestrian Trajectory Prediction.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Gaze Prediction in Dynamic 360° Immersive Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Face Aging With Identity-Preserved Conditional Generative Adversarial Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Future Frame Prediction for Anomaly Detection - A New Baseline.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning to Parse Wireframes in Images of Man-Made Environments.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Believe It or Not, We Know What You Are Looking At!
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Label Information Guided Graph Construction for Semi-Supervised Learning.
IEEE Trans. Image Process., 2017

Locality Constrained Deep Supervised Hashing for Image Retrieval.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Remembering history with convolutional LSTM for anomaly detection.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

A Revisit of Sparse Coding Based Anomaly Detection in Stacked RNN Framework.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Improving Training of Deep Neural Networks via Singular Value Bounding.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Patch-Set-Based Representation for Alignment-Free Image Set Classification.
IEEE Trans. Circuits Syst. Video Technol., 2016

DEFEATnet - A Deep Conventional Image Representation for Image Classification.
IEEE Trans. Circuits Syst. Video Technol., 2016

ROML: A Robust Feature Correspondence Approach for Matching Objects in A Set of Images.
Int. J. Comput. Vis., 2016

Image compression via multiple learned geometric dictionaries.
Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing, 2016

Bi-Level Multi-column Convolutional Neural Networks for Facial Landmark Point Detection.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Single-Image Crowd Counting via Multi-Column Convolutional Neural Network.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Progressively Parsing Interactional Objects for Fine Grained Action Detection.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Cold-Start Heterogeneous-Device Wireless Localization.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Generalized Multiple Kernel Learning With Data-Dependent Priors.
IEEE Trans. Neural Networks Learn. Syst., 2015

RGB-D Object Recognition via Incorporating Latent Data Structure and Prior Knowledge.
IEEE Trans. Multim., 2015

Constructing a Nonnegative Low-Rank and Sparse Graph With Data-Adaptive Features.
IEEE Trans. Image Process., 2015

PCANet: A Simple Deep Learning Baseline for Image Classification?
IEEE Trans. Image Process., 2015

Single Sample Face Recognition via Learning Deep Supervised Autoencoders.
IEEE Trans. Inf. Forensics Secur., 2015

Laplacian Auto-Encoders: An explicit learning of nonlinear data manifold.
Neurocomputing, 2015

Neither Global Nor Local: Regularized Patch-Based Representation for Single Sample Per Person Face Recognition.
Int. J. Comput. Vis., 2015

Design of global emergency communication system based on geosynchronous TDRSS.
Proceedings of the International Conference on Wireless Communications & Signal Processing, 2015

Partially Common-Semantic Pursuit for RGB-D Object Recognition.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

2014
Concurrent Single-Label Image Classification and Annotation via Efficient Multi-Layer Group Sparse Coding.
IEEE Trans. Multim., 2014

Learning Category-Specific Dictionary and Shared Dictionary for Fine-Grained Image Categorization.
IEEE Trans. Image Process., 2014

Region-Based Saliency Detection and Its Application in Object Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2014

Discovering joint audio-visual codewords for video event detection.
Mach. Vis. Appl., 2014

Constructing a Non-Negative Low Rank and Sparse Graph with Data-Adaptive Features.
CoRR, 2014

Hand-Crafted Features or Machine Learnt Features? Together They Improve RGB-D Object Recognition.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Efficient Sparsity Estimation via Marginal-Lasso Coding.
Proceedings of the Computer Vision - ECCV 2014, 2014

Unsupervised Feature Learning for RGB-D Image Classification.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Regularized Feature Reconstruction for Spatio-Temporal Saliency Detection.
IEEE Trans. Image Process., 2013

Objective-Guided Image Annotation.
IEEE Trans. Image Process., 2013

Sparse Representation With Kernels.
IEEE Trans. Image Process., 2013

Laplacian Sparse Coding, Hypergraph Laplacian Sparse Coding, and Applications.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Background subtraction via coherent trajectory decomposition.
Proceedings of the ACM Multimedia Conference, 2013

Learning by Associating Ambiguously Labeled Images.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Improving sparse coding with graph, kernel, and structure
PhD thesis, 2012

Spatiotemporal Saliency Detection via Sparse Representation.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Learning Class-to-Image Distance via Large Margin and L1-Norm Regularization.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
Multi-layer group sparse coding - For concurrent image classification and annotation.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Web image concept annotation with better understanding of tags and visual features.
J. Vis. Commun. Image Represent., 2010

Discovering Class-Specific Informative Patches and Its Application in Landmark Charaterization.
Proceedings of the Advances in Multimedia Modeling, 2010

Automatic image tagging via category label and web data.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Wikipedia-assisted concept thesaurus for better web media understanding.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Kernel Sparse Representation for Image Classification and Face Recognition.
Proceedings of the Computer Vision, 2010

Local features are not lonely - Laplacian sparse coding for image classification.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Concept model-based unsupervised web image re-ranking.
Proceedings of the International Conference on Image Processing, 2009


  Loading...