Xiantong Zhen

Orcid: 0000-0001-5213-0462

According to our database1, Xiantong Zhen authored at least 145 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Gaze Estimation by Attention-Induced Hierarchical Variational Auto-Encoder.
IEEE Trans. Cybern., April, 2024

MetaKernel: Learning Variational Random Features With Limited Labels.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Variational Neuron Shifting for Few-Shot Image Classification Across Domains.
IEEE Trans. Multim., 2024

Weakly-Supervised RGBD Video Object Segmentation.
IEEE Trans. Image Process., 2024

2023
Continuous cross-modal hashing.
Pattern Recognit., October, 2023

Attentional prototype inference for few-shot segmentation.
Pattern Recognit., October, 2023

Latent Domain Generation for Unsupervised Domain Adaptation Object Counting.
IEEE Trans. Multim., 2023

Learning to Learn With Variational Inference for Cross-Domain Image Classification.
IEEE Trans. Multim., 2023

Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples.
CoRR, 2023

Knowledge Graph Embeddings for Multi-Lingual Structured Representations of Radiology Reports.
CoRR, 2023

Learning Variational Neighbor Labels for Test-Time Domain Generalization.
CoRR, 2023

Lightweight Improved Residual Network for Efficient Inverse Tone Mapping.
CoRR, 2023

CageViT: Convolutional Activation Guided Efficient Vision Transformer.
CoRR, 2023

Parameter-Free Channel Attention for Image Classification and Super-Resolution.
CoRR, 2023

Probabilistic Integration of Object Level Annotations in Chest X-ray Classification.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Episodic Multi-Task Learning with Heterogeneous Neural Processes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks.
Proceedings of the International Conference on Machine Learning, 2023

Energy-Based Test Sample Adaptation for Domain Generalization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Order-preserving Consistency Regularization for Domain Adaptation and Generalization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Implicit Diffusion Models for Continuous Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SuperDisco: Super-Class Discovery Improves Visual Recognition for the Long-Tail.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

EMO: Episodic Memory Optimization for Few-Shot Meta-Learning.
Proceedings of the Conference on Lifelong Learning Agents, 2023

Coarse-Fine View Attention Alignment-Based GAN for CT Reconstruction from Biplanar X-Rays.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

2022
Memory Attention Networks for Skeleton-Based Action Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2022

Variational Abnormal Behavior Detection With Motion Consistency.
IEEE Trans. Image Process., 2022

Variational Hyperparameter Inference for Few-Shot Learning Across Domains.
IEEE Trans. Circuits Syst. Video Technol., 2022

Cross-Domain Attention Network for Unsupervised Domain Adaptation Crowd Counting.
IEEE Trans. Circuits Syst. Video Technol., 2022

Spherical Zero-Shot Learning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Deep learning DCE-MRI parameter estimation: Application in pancreatic cancer.
Medical Image Anal., 2022

Uncertainty-aware report generation for chest X-rays by variational topic inference.
Medical Image Anal., 2022

Attentive encoder-decoder networks for crowd counting.
Neurocomputing, 2022

Joint Super-Resolution and Inverse Tone-Mapping: A Feature Decomposition Aggregation Network and A New Benchmark.
CoRR, 2022

Fusion-Correction Network for Single-Exposure Correction and Multi-Exposure Fusion.
CoRR, 2022

Association Graph Learning for Multi-Task Classification with Category Shifts.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Variational Model Perturbation for Source-Free Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

LifeLonger: A Benchmark for Continual Disease Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Learning to Generalize across Domains on Single Test Samples.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Hierarchical Variational Memory for Few-shot Learning Across Domains.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Pixel-Level Non-local Image Smoothing With Objective Evaluation.
IEEE Trans. Multim., 2021

Learning to Adapt With Memory for Probabilistic Few-Shot Learning.
IEEE Trans. Circuits Syst. Video Technol., 2021

Attentional Kernel Encoding Networks for Fine-Grained Visual Categorization.
IEEE Trans. Circuits Syst. Video Technol., 2021

Deep 3D human pose estimation: A review.
Comput. Vis. Image Underst., 2021

Generative Kernel Continual learning.
CoRR, 2021

Multi-Task Neural Processes.
CoRR, 2021

Attentional Prototype Inference for Few-Shot Semantic Segmentation.
CoRR, 2021

Variational Prototype Inference for Few-Shot Semantic Segmentation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Learning to Learn Dense Gaussian Processes for Few-Shot Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Variational Multi-Task Learning with Gumbel-Softmax Priors.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Hierarchical Embedding for Video Instance Segmentation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Variational Topic Inference for Chest X-Ray Report Generation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Variational Knowledge Distillation for Disease Classification in Chest X-Rays.
Proceedings of the Information Processing in Medical Imaging, 2021

A Bit More Bayesian: Domain-Invariant Learning with Uncertainty.
Proceedings of the 38th International Conference on Machine Learning, 2021

Kernel Continual Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

MetaNorm: Learning to Normalize Few-Shot Batches Across Domains.
Proceedings of the 9th International Conference on Learning Representations, 2021

Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
The Structure Transfer Machine Theory and Applications.
IEEE Trans. Image Process., 2020

Conditional Variational Image Deraining.
IEEE Trans. Image Process., 2020

Calibrated Multivariate Regression Networks.
IEEE Trans. Circuits Syst. Video Technol., 2020

Heterogenous output regression network for direct face alignment.
Pattern Recognit., 2020

Model-Agnostic Metric for Zero-Shot Learning.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Variational Image Deraining.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Learning to Learn Variational Semantic Memory.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Few-Shot Ensemble Learning for Video Classification with SlowFast Memory Networks.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Transductive Relation-Propagation Network for Few-shot Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Learning to Learn Kernels with Variational Random Features.
Proceedings of the 37th International Conference on Machine Learning, 2020

You Only Need The Image: Unsupervised Few-Shot Semantic Segmentation With Co-Guidance Network.
Proceedings of the IEEE International Conference on Image Processing, 2020

Few-Shot Semantic Segmentation with Democratic Attention Networks.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Learn with Variational Information Bottleneck for Domain Generalization.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Deep Ensemble Machine for Video Classification.
IEEE Trans. Neural Networks Learn. Syst., 2019

Learning Match Kernels on Grassmann Manifolds for Action Recognition.
IEEE Trans. Image Process., 2019

Gaussian Transfer Convolutional Neural Networks.
IEEE Trans. Emerg. Top. Comput. Intell., 2019

Glance and Stare: Trapping Flying Birds in Aerial Videos by Adaptive Deep Spatio-Temporal Features.
IEEE Trans. Circuits Syst. Video Technol., 2019

Long-Short-Term Features for Dynamic Scene Classification.
IEEE Trans. Circuits Syst. Video Technol., 2019

Attentional Information Fusion Networks for Cross-Scene Power Line Detection.
IEEE Geosci. Remote. Sens. Lett., 2019

Graph Neural Based End-to-end Data Association Framework for Online Multiple-Object Tracking.
CoRR, 2019

Crowd Counting and Density Estimation by Trellis Encoder-Decoder Network.
CoRR, 2019

Cosine Activation in Compact Network (CACN): Application to Scene Classification.
IEEE Access, 2019

Starts Better and Ends Better: A Target Adaptive Image Signature Tracker.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Multi-Scale Aggregation Network for Direct Face Alignment.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Model-Free Tracking With Deep Appearance and Motion Features Integration.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Learning the Set Graphs: Image-Set Classification Using Sparse Graph Convolutional Networks.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Two-Stream Multi-Task Network for Fashion Recognition.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Attentional Neural Fields for Crowd Counting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Relational Attention Network for Crowd Counting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Crowd Counting and Density Estimation by Trellis Encoder-Decoder Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Attentive Temporal Pyramid Network for Dynamic Scene Classification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Multitarget Sparse Latent Regression.
IEEE Trans. Neural Networks Learn. Syst., 2018

Multi-Stream Convolutional Neural Network for SAR Automatic Target Recognition.
Remote. Sens., 2018

Multi-Target Regression via Robust Low-Rank Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Cognitive Assessment Prediction in Alzheimer's Disease by Multi-Layer Multi-Target Regression.
Neuroinformatics, 2018

Deep appearance and motion learning for egocentric activity recognition.
Neurocomputing, 2018

The Structure Transfer Machine Theory and Applications.
CoRR, 2018

Spatial Ensemble Kernel Learning for Scene Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Direct Shape Regression Networks for End-to-End Face Alignment.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Attentional Alignment Networks.
Proceedings of the British Machine Vision Conference 2018, 2018

In Defense of Single-column Networks for Crowd Counting.
Proceedings of the British Machine Vision Conference 2018, 2018

Deep Collaborative Tracking Networks.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Descriptor Learning via Supervised Manifold Regularization for Multioutput Regression.
IEEE Trans. Neural Networks Learn. Syst., 2017

Supervised Local Descriptor Learning for Human Action Recognition.
IEEE Trans. Multim., 2017

Real-time visual tracking based on improved perceptual hashing.
Multim. Tools Appl., 2017

Direct and simultaneous estimation of cardiac four chamber volumes by multioutput sparse regression.
Medical Image Anal., 2017

Direct Estimation of Spinal Cobb Angles by Structured Multi-output Regression.
Proceedings of the Information Processing in Medical Imaging, 2017

Learning discriminant grassmann kernels for image-set classification.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Realistic human action recognition: When CNNS meet LDS.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Learning Deep Match Kernels for Image-Set Classification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Local Feature Discriminant Projection.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Multi-scale deep networks and regression forests for direct bi-ventricular volume estimation.
Medical Image Anal., 2016

Handcrafted vs. learned representations for human action recognition.
Image Vis. Comput., 2016

Action recognition via spatio-temporal local features: A comprehensive study.
Image Vis. Comput., 2016

Spatial and temporal scoring for egocentric video summarization.
Neurocomputing, 2016

Multi-task Shape Regression for Medical Image Segmentation.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016, 2016

Towards optimal vlad for human action recognition from still images.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Realistic human action recognition: When deep learning meets VLAD.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Correction to "Regression Segmentation for M<sup>3</sup> Spinal Images".
IEEE Trans. Medical Imaging, 2015

Regression Segmentation for M<sup>3</sup> Spinal Images.
IEEE Trans. Medical Imaging, 2015

Towards Direct Medical Image Analysis without Segmentation.
CoRR, 2015

Direct volume estimation without segmentation.
Proceedings of the Medical Imaging 2015: Image Processing, 2015

Direct and Simultaneous Four-Chamber Volume Estimation by Multi-Output Regression.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, 2015

Dimensionality reduction by supervised locality analysis.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Supervised descriptor learning for multi-output regression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Learning Object-to-Class Kernels for Scene Classification.
IEEE Trans. Image Process., 2014

Spatio-Temporal Laplacian Pyramid Coding for Action Recognition.
IEEE Trans. Cybern., 2014

Robust point pattern matching based on spectral context.
Pattern Recognit., 2014

Action recognition by spatio-temporal oriented energies.
Inf. Sci., 2014

Direct Estimation of Cardiac Bi-ventricular Volumes with Regression Forests.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2014, 2014

A Performance Evaluation on Action Recognition with Local Features.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Learning semantic kernels for scene classification.
Proceedings of the IEEE International Conference on Acoustics, 2014

Discriminative Embedding via Image-to-Class Distances.
Proceedings of the British Machine Vision Conference, 2014

2013
Feature extraction and representation for human action recognition.
PhD thesis, 2013

Learning Discriminative Key Poses for Action Recognition.
IEEE Trans. Cybern., 2013

Embedding Motion and Structure Features for Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2013

A local descriptor based on Laplacian pyramid coding for action recognition.
Pattern Recognit. Lett., 2013

Combining appearance and structural features for human action recognition.
Neurocomputing, 2013

Discriminative high-level representations for scene classification.
Proceedings of the IEEE International Conference on Image Processing, 2013

Recognizing actions via sparse coding on structure projection.
Proceedings of the IEEE International Conference on Image Processing, 2013

Towards optimal object bank for scene classification.
Proceedings of the IEEE International Conference on Acoustics, 2013

Spatio-temporal steerable pyramid for human action recognition.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Human Action Retrieval via efficient feature matching.
Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2013

2012
High order co-occurrence of visualwords for action recognition.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

2011
Human action representation using pyramid correlogram of oriented gradients on motion history images.
Int. J. Comput. Math., 2011


  Loading...