Thomas H. Li

ORCID: 0000-0001-6123-1265

According to our database, Thomas H. Li authored at least 80 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2023
Mitigating Label Noise in GANs via Enhanced Spectral Normalization.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Semantic Point Cloud Upsampling.
IEEE Trans. Multim., 2023

StreamFlow: Streamlined Multi-Frame Optical Flow Estimation for Video Sequences.
CoRR, 2023

Mug-STAN: Adapting Image-Language Pretrained Models for General Video Understanding.
CoRR, 2023

One For All: Video Conversation is Feasible Without Video Instruction Tuning.
CoRR, 2023

A²Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models.
CoRR, 2023

Detecting the open-world objects with the help of the Brain.
CoRR, 2023

LIO-PPF: Fast LiDAR-Inertial Odometry via Incremental Plane Pre-Fitting and Skeleton Tracking.
CoRR, 2023

Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Frequency-Aware Self-Supervised Monocular Depth Estimation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

IPFR: Identity-Preserving Face Reenactment with Enhanced Domain Adversarial Training and Multi-level Identity Priors.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Efficient Test-Time Adaptation for Super-Resolution with Second-Order Degradation and Reconstruction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PDE-based Progressive Prediction Framework for Attribute Compression of 3D Point Clouds.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LIO-PPF: Fast LiDAR-Inertial Odometry via Incremental Plane Pre-Fitting and Skeleton Tracking.
IROS, 2023

Causality Compensated Attention for Contextual Biased Visual Recognition.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Vision-and-Language Navigation from YouTube Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Improving Graph Representation for Point Cloud Segmentation via Attentive Filtering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Motion Encoding for Self-Supervised Video Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hard Sample Matters a Lot in Zero-Shot Quantization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Learning the Global Descriptor for 3-D Object Recognition Based on Multiple Views Decomposition.
IEEE Trans. Multim., 2022

QINet: Decision Surface Learning and Adversarial Enhancement for Quasi-Immune Completion of Diverse Corrupted Point Clouds.
IEEE Trans. Geosci. Remote. Sens., 2022

PointOT: Interpretable Geometry-Inspired Point Cloud Generative Model via Optimal Transport.
IEEE Trans. Circuits Syst. Video Technol., 2022

Learning Disentangled Representation for Multi-View 3D Object Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2022

Rate-Distortion Optimized Graph for Point Cloud Attribute Coding.
IEEE Signal Process. Lett., 2022

M³Video: Masked Motion Modeling for Self-Supervised Video Representation Learning.
CoRR, 2022

Geometric-Aware Calibration Mechanism for Self-Supervised Depth Estimation.
Proceedings of the IEEE Smartworld, 2022

Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Active Camera for Multi-Object Navigation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MOAC: Multi-level Perception Optimizer Based on Dual Augmented Cost for Structure-from-Motion.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

DKNAS: A Practical Deep Keypoint Extraction Framework Based on Neural Architecture Search.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Fine-Grained Correlation Representation for Graph-Based Point Cloud Attribute Compression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Deep Geometry Post-Processing for Decompressed Point Clouds.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Pointivae: Invertible Variational Autoencoder Framework for 3D Point Cloud Generation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Attention Guided Invariance Selection for Local Feature Descriptors.
Proceedings of the IEEE International Conference on Acoustics, 2022

Neural Texture Extraction and Distribution for Controllable Person Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Low Pass Filter for Anti-aliasing in Temporal Action Localization.
CoRR, 2021

Combining Attention with Flow for Person Image Synthesis.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Information-Growth Attention Network for Image Super-Resolution.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Rethinking Training Objective For Self-Supervised Monocular Depth Estimation: Semantic Cues To Rescue.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Structure-transformed Texture-enhanced Network for Person Image Synthesis.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ATVIO: Attention Guided Visual-Inertial Odometry.
Proceedings of the IEEE International Conference on Acoustics, 2021

SSD-GAN: Measuring the Realness in the Spatial and Spectral Domains.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation.
IEEE Trans. Image Process., 2020

Spatial-Temporal Context-Aware Online Action Detection and Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2020

Neural saliency algorithm guide bi-directional visual perception style transfer.
CAAI Trans. Intell. Technol., 2020

Vaccine-style-net: Point Cloud Completion in Implicit Continuous Function Space.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

VONAS: Network Design in Visual Odometry using Neural Architecture Search.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Temporal-Aware SfM-Learner: Unsupervised Learning Monocular Depth and Motion from Stereo Video Clips.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Towards Loss Balance and Consistent Model in Self-supervised Monocular Depth Estimation.
Proceedings of the 32nd IEEE International Conference on Tools with Artificial Intelligence, 2020

Twinvo: Unsupervised Learning of Monocular Visual Odometry Using Bi-Direction Twin Network.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

Pose Refinement: Bridging the Gap Between Unsupervised Learning and Geometric Methods for Visual Odometry.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ROIMIX: Proposal-Fusion Among Multiple Images for Underwater Object Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Regression Before Classification for Temporal Action Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Deep Image Spatial Transformation for Person Image Generation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Over-Exposure Correction via Exposure and Scene Information Disentanglement.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Exploiting the Value of the Center-dark Channel Prior for Salient Object Detection.
ACM Trans. Intell. Syst. Technol., 2019

LECARM: Low-Light Image Enhancement Using the Camera Response Model.
IEEE Trans. Circuits Syst. Video Technol., 2019

Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds.
CoRR, 2019

Multi-mapping Image-to-Image Translation via Learning Disentanglement.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

ARMIN: Towards a More Efficient and Light-weight Recurrent Memory Network.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

PDNet: Prior-Model Guided Depth-Enhanced Network for Salient Object Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Salient Contour-Aware Based Twice Learning Strategy for Saliency Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

StructureFlow: Image Inpainting via Structure-Aware Appearance Flow.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Boundary Information Matters More: Accurate Temporal Action Detection with Temporal Boundary Network.
Proceedings of the IEEE International Conference on Acoustics, 2019

BLP - Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization.
Proceedings of the IEEE International Conference on Acoustics, 2019

Graph Convolutional Label Noise Cleaner: Train a Plug-And-Play Action Classifier for Anomaly Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Real Photographs Denoising With Noise Domain Adaptation and Attentive Generative Adversarial Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Detecting action tubes via spatial action estimation and temporal path inference.
Neurocomputing, 2018

Exploiting the Value of the Center-dark Channel Prior for Salient Object Detection.
CoRR, 2018

Active Temporal Action Detection in Untrimmed Videos Via Deep Reinforcement Learning.
IEEE Access, 2018

Adaptive Integration Skip Compensation Neural Networks for Removing Mixed Noise in Image.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Deep Pedestrian Detection Using Contextual Information and Multi-level Features.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Online Action Tube Detection via Resolving the Spatio-temporal Context Pattern.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

SingleGAN: Image-to-Image Translation by a Single-Generator Network Using Multiple Generative Adversarial Learning.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Towards Automatic Wild Animal Detection in Low Quality Camera-Trap Images Using Two-Channeled Perceiving Residual Pyramid Networks.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

