Lu Sheng

CoRR, 2023

LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Distortion-aware Transformer in 360° Salient Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Siamese DETR.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

VPU: A Video-Based Point Cloud Upsampling Framework.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Towards Explainable 3D Grounded Visual Question Answering: A New Benchmark and Strong Baseline.

[BibT_eX]

[DOI]

CoRR, 2022

Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Motion Compensated Virtual View Synthesis Using Novel Particle Cell.

[BibT_eX]

[DOI]

Chi Ho Cheung

IEEE Trans. Multim., 2021

PCG-TAL: Progressive Cross-Granularity Cooperation for Temporal Action Localization.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Transformer3D-Det: Improving 3D Object Detection by Vote Refinement.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2021

Sequential Point Cloud Upsampling by Exploiting Multi-Scale Temporal Dependency.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2021

ForgeryNet - Face Forgery Analysis Challenge 2021: Methods and Results.

[BibT_eX]

[DOI]

CoRR, 2021

DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer.

[BibT_eX]

[DOI]

Buyu Li

Yongchi Zhao

CoRR, 2021

IncreACO: Incrementally Learned Automatic Check-out with Photorealistic Exemplar Augmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

VoteHMR: Occlusion-Aware Voting Network for Robust 3D Human Mesh Recovery from Partial Point Clouds.

[BibT_eX]

[DOI]

Guanze Liu

Yu Rong

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

StyleFormer: Real-time Arbitrary Style Transfer via Parametric Style Composition.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

High-Quality Video Generation from Static Structural Annotations.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2020

PV-NAS: Practical Neural Architecture Search for Video Recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Adaptive Gradient Method with Resilience and Momentum.

[BibT_eX]

[DOI]

CoRR, 2020

Unsupervised Domain Expansion from Multiple Sources.

[BibT_eX]

[DOI]

CoRR, 2020

Thinking in Frequency: Face Forgery Detection by Mining Frequency-Aware Clues.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Powering One-Shot Topological NAS with Stabilized Share-Parameter Proxy.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Morphing and Sampling Network for Dense Point Cloud Completion.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Bags of tricks for learning depth and camera motion from monocular videos.

[BibT_eX]

[DOI]

Bowen Dong

Virtual Real. Intell. Hardw., 2019

Cascaded regression using landmark displacement for 3D face reconstruction.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2019

Visibility Constrained Generative Model for Depth-Based 3D Facial Pose Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Unsupervised Bi-directional Flow-based Video Generation from one Snapshot.

[BibT_eX]

[DOI]

CoRR, 2019

CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Unsupervised Collaborative Learning of Keyframe Detection and Visual Odometry Towards Monocular Deep SLAM.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Context and Attribute Grounded Dense Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Semantics Disentangling for Text-To-Image Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Video Generation From Single Semantic Label Map.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Spatio-Temporal Disocclusion Filling Using Novel Sprite Cells.

[BibT_eX]

[DOI]

Chi Ho Cheung

IEEE Trans. Multim., 2018

Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Optical Flow Guided Feature: A Fast and Robust Motion Representation for Video Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Avatar-Net: Multi-Scale Zero-Shot Style Transfer by Feature Decoration.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Exploring Disentangled Feature Representation Beyond Face Identification.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

A Generative Model for Depth-Based Robust 3D Facial Pose Tracking.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Real-Time Head Pose Tracking with Online Face Template Reconstruction.

[BibT_eX]

[DOI]

Raveendran Paramesran

IEEE Trans. Pattern Anal. Mach. Intell., 2016

2015

Online Temporally Consistent Indoor Depth Video Enhancement via Static Structure.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

A disocclusion filling method using multiple sprites with depth for virtual view synthesis.

[BibT_eX]

[DOI]

Chi Ho Cheung

Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

2014

Temporal depth video enhancement based on intrinsic static structure.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Screen-camera calibration using a thread.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Accelerating the Distribution Estimation for the Weighted Median/Mode Filters.

[BibT_eX]

[DOI]

Tak-Wai Hui

Proceedings of the Computer Vision - ACCV 2014, 2014

2013

A Head Pose Tracking System Using RGB-D Camera.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision Systems - 9th International Conference, 2013

Depth enhancement based on hybrid geometric hole filling strategy.

[BibT_eX]

[DOI]