Quan Kong

Orcid: 0009-0008-4762-0481

According to our database, Quan Kong authored at least 50 papers between 2012 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
The 9th AI City Challenge.
CoRR, August, 2025

Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection.
CoRR, August, 2025

Speculative Decoding via Hybrid Drafting and Rollback-Aware Branch Parallelism.
CoRR, June, 2025

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory.
CoRR, May, 2025

Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras.
CoRR, May, 2025

Evaluation of Mobile Environment for Vehicular Visible Light Communication Using Multiple LEDs and Event Cameras.
CoRR, May, 2025

Just Dance with π! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection.
CoRR, May, 2025

Multi-Level Adaptive Attention Fusion Network for Infrared and Visible Image Fusion.
IEEE Signal Process. Lett., 2025

Guess Future Anomalies from Normalcy: Forecasting Abnormal Behavior in Real-World Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025

Augmented Reality Applications Using Active Markers With An Event Camera.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

E-VLC: A Real-World Dataset for Event-based Visible Light Communication And Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Synthetic Visual Genome.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Just Dance with π! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

GA3CE: Unconstrained 3D Gaze Estimation with Gaze-Aware 3D Context Encoding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Human-Scene Network: A novel baseline with self-rectifying loss for weakly supervised video anomaly detection.
Comput. Vis. Image Underst., 2024

SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance.
CoRR, 2024

WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding.
CoRR, 2024

OE-CTST: Outlier-Embedded Cross Temporal Scale Transformer for Weakly-supervised Video Anomaly Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Reprojection Errors as Prompts for Efficient Scene Coordinate Regression.
Proceedings of the Computer Vision - ECCV 2024, 2024

WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-Grained Spatial-Temporal Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024


2023
Infrared and Visible Image Fusion via Attention-Based Adaptive Feature Fusion.
Entropy, March, 2023

Remote Sensing Image Super-Resolution With Residual Split Attention Mechanism.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023

LAC - Latent Action Composition for Skeleton-based Action Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DeCo: Decomposition and Reconstruction for Compositional Temporal Grounding via Coarse-to-Fine Contrastive Ranking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Self-Supervised Video Representation Learning via Latent Time Navigation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Hierarchical contrastive adaptation for cross-domain object detection.
Mach. Vis. Appl., 2022

NormFuse: Infrared and Visible Image Fusion With Pixel-Adaptive Normalization.
IEEE CAA J. Autom. Sinica, 2022

Mask Atari for Deep Reinforcement Learning as POMDP Benchmarks.
CoRR, 2022

Efficient and Accurate Skeleton-Based Two-Person Interaction Recognition Using Inter- and Intra-Body Graphs.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021
Multi-Stream Adaptive Graph Convolutional Network Using Inter- and Intra-Body Graphs for Two-Person Interaction Recognition.
IEEE Access, 2021

Robust Unsupervised Multi-Object Tracking In Noisy Environments.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

A Multi-Modal Fusion Approach for Audio-Visual Scene Classification Enhanced by CLIP Variants.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
Hitachi at TRECVID DSDI 2020.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Cycle-Contrast for Self-Supervised Video Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Anticipating the Start of User Interaction for Service Robot in the Wild.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

2019
Towards Efficient Instance Segmentation with Hierarchical Distillation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

MMAct: A Large-Scale Dataset for Cross Modal Human Action Understanding.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Active Generative Adversarial Network for Image Classification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Multimodal Deep Neural Networks Based Ensemble Learning for X-Ray Object Recognition.
Proceedings of the Computer Vision - ACCV 2018 Workshops, 2018

Adversarial Zero-shot Learning With Semantic Augmentation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2016
Selecting home appliances with smart glass based on contextual information.
Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2016

Egocentric Video Search via Physical Interactions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2014
Reusing training data with generative/discriminative hybrid model for practical acceleration-based activity recognition.
Computing, 2014

Developing discriminate model and comparative analysis of differentially expressed genes and pathways for bloodstream samples of diabetes mellitus type 2.
BMC Bioinform., 2014

Identifying outlets at which electrical appliances are used by electrical wire sensing to gain positional information about appliance use.
Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2014

2013
Detecting and correcting WiFi positioning errors.
Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2013

Sharing training data among different activity classes.
Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2013

2012
A Run-Time Task Migration Scheme for an Adjustable Issue-Slots Multi-core Processor.
Proceedings of the Reconfigurable Computing: Architectures, Tools and Applications, 2012

