Viet-Khoa Vo-Ho

ORCID: 0000-0003-0277-7094

Affiliations:
  • Vietnam National University, Ho Chi Minh City, Vietnam
  • University of Arkansas, Fayetteville, USA


According to our database, Viet-Khoa Vo-Ho authored at least 24 papers between 2018 and 2024.

Bibliography

2024
ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation.
CoRR, 2024

ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation.
Int. J. Comput. Vis., 2023

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation.
CoRR, 2023

CLIP-TSA: Clip-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection.
Proceedings of the IEEE International Conference on Image Processing, 2023

DNA: Deformable Neural Articulations Network for Template-free Dynamic 3D Human Reconstruction from Monocular RGB-D Video.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning.
CoRR, 2022

Meta-Learning of NAS for Few-shot Learning in Medical Image Applications.
CoRR, 2022

CapsNet for Medical Image Segmentation.
CoRR, 2022

3DConvCaps: 3DUnet with Convolutional Capsule Encoder for Medical Image Segmentation.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

VLCAP: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

AISFormer: Amodal Instance Segmentation with Transformer.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Contextual Explainable Video Representation: Human Perception-based Understanding.
Proceedings of the 56th Asilomar Conference on Signals, Systems, and Computers, ACSSC 2022, Pacific Grove, CA, USA, October 31, 2022

2021
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation.
IEEE Access, 2021

Agent-Environment Network for Temporal Action Proposal Generation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Offboard 3D Object Detection From Point Cloud Sequences.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
FIRST - Flexible Interactive Retrieval SysTem for Visual Lifelog Exploration at LSC 2020.
Proceedings of the Third ACM Workshop on Lifelog Search Challenge, 2020


2019
Smart Lifelog Retrieval System with Habit-based Concepts and Moment Visualization.
Proceedings of the ACM Workshop on Lifelog Search Challenge, 2019

Vehicle Re-identification with Learned Representation and Spatial Verification and Abnormality Detection with Multi-Adaptive Vehicle Detectors for Traffic Video Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Personal Diary Generation from Wearable Cameras with Concept Augmented Image Captioning and Wide Trail Strategy.
Proceedings of the Ninth International Symposium on Information and Communication Technology, 2018

Lifelog Moment Retrieval with Visual Concept Fusion and Text-based Query Expansion.
Proceedings of the Working Notes of CLEF 2018, 2018

