Xinxiao Wu

Orcid: 0000-0002-2056-6947

According to our database1, Xinxiao Wu authored at least 91 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Boosting Entity-Aware Image Captioning With Multi-Modal Knowledge Graph.
IEEE Trans. Multim., 2024

Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning.
CoRR, 2024

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Multi-Modal Prompting for Open-Vocabulary Video Visual Relationship Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Relational Distant Supervision for Image Captioning without Image-Text Pairs.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Topic-aware video summarization using multimodal transformer.
Pattern Recognit., August, 2023

Sentimental Visual Captioning using Multimodal Transformer.
Int. J. Comput. Vis., April, 2023

Adaptive Latent Graph Representation Learning for Image-Text Matching.
IEEE Trans. Image Process., 2023

Probability Distribution Based Frame-supervised Language-driven Action Localization.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Teaching What You Should Teach: A Data-Based Distillation Method.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Counterfactual Inference for Visual Relationship Detection in Videos.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Meta-Causal Learning for Single Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Domain Adversarial Reinforcement Learning for Partial Domain Adaptation.
IEEE Trans. Neural Networks Learn. Syst., 2022

Exploiting Informative Video Segments for Temporal Action Localization.
IEEE Trans. Multim., 2022

Learning Cooperative Neural Modules for Stylized Image Captioning.
Int. J. Comput. Vis., 2022

Learning What You Should Learn.
CoRR, 2022

Knowledge Prompting for Few-shot Action Recognition.
CoRR, 2022

Entity-aware and Motion-aware Transformers for Language-driven Action Localization in Videos.
CoRR, 2022

Entity-aware and Motion-aware Transformers for Language-driven Action Localization.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Adaptive Recursive Circle Framework for Fine-Grained Action Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Bootstrap Generalization Ability from Loss Landscape Perspective.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Exploring Spatial-Temporal Instance Relationships in an Intermediate Domain for Image-to-Video Object Detection.
Proceedings of the Computer Vision - ACCV 2022 Workshops, 2022

Adaptive Image-to-Video Scene Graph Generation via Knowledge Reasoning and Adversarial Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Cross-Domain Image Captioning via Cross-Modal Retrieval and Model Adaptation.
IEEE Trans. Image Process., 2021

Sequential Instance Refinement for Cross-Domain Object Detection in Images.
IEEE Trans. Image Process., 2021

Joint Learning of Multiple Latent Domains and Deep Representations for Domain Adaptation.
IEEE Trans. Cybern., 2021

Boundary discrimination and proposal evaluation for temporal action proposal generation.
Multim. Tools Appl., 2021

Spatial-Temporal Relation Reasoning for Action Prediction in Videos.
Int. J. Comput. Vis., 2021

Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph.
CoRR, 2021

Adaptive Recursive Circle Framework for Fine-grained Action Recognition.
CoRR, 2021

Multi-modal Dependency Tree for Video Captioning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Image Captioning with Inherent Sentiment.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Anticipating Future Relations via Graph Growing for Action Prediction.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Spatial-temporal Causal Inference for Partial Image-to-video Adaptation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Learning Normal Patterns via Adversarial Attention-Based Autoencoder for Abnormal Event Detection in Videos.
IEEE Trans. Multim., 2020

Confidence-Guided Self Refinement for Action Prediction in Untrimmed Videos.
IEEE Trans. Image Process., 2020

Incremental transfer learning for video annotation via grouped heterogeneous sources.
IET Comput. Vis., 2020

Video Captioning Using Weak Annotation.
CoRR, 2020

Hierarchical Matching and Reasoning for Action Localization via Language Query.
Proceedings of the Pattern Recognition and Computer Vision - Third Chinese Conference, 2020

Preserving Global and Local Temporal Consistency for Arbitrary Video Style Transfer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

MemCap: Memorizing Style Knowledge for Image Captioning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Joint Commonsense and Relation Reasoning for Image and Video Captioning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Temporal Action Localization in Untrimmed Videos Using Action Pattern Trees.
IEEE Trans. Multim., 2019

Exploiting Images for Video Recognition: Heterogeneous Feature Augmentation via Symmetric Adversarial Learning.
IEEE Trans. Image Process., 2019

Combining multiple deep cues for action recognition.
Multim. Tools Appl., 2019

Relational Reasoning using Prior Knowledge for Visual Captioning.
CoRR, 2019

Exploiting Human Pose for Weakly-Supervised Temporal Action Localization.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Learning Weighted Video Segments for Temporal Action Localization.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Joint Syntax Representation Learning and Visual Cue Translation for Video Captioning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
Extracting Key Segments of Videos for Event Detection by Learning From Web Sources.
IEEE Trans. Multim., 2018

Content-Attention Representation by Factorized Action-Scene Network for Action Recognition.
IEEE Trans. Multim., 2018

A discriminative structural model for joint segmentation and recognition of human actions.
Multim. Tools Appl., 2018

Action recognition with motion map 3D network.
Neurocomputing, 2018

Exploiting Images for Video Recognition with Hierarchical Generative Adversarial Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Unsupervised Deep Learning of Mid-Level Video Representation for Action Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Recognizing key segments of videos for video annotation by learning from web image sets.
Multim. Tools Appl., 2017

Heterogeneous domain adaptation method for video annotation.
IET Comput. Vis., 2017

Representing Discrimination of Video by a Motion Map.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Heterogeneous Multi-group Adaptation for Event Recognition in Consumer Videos.
Proceedings of the Image and Graphics - 9th International Conference, 2017

2016
Transfer Latent SVM for Joint Recognition and Localization of Actions in Videos.
IEEE Trans. Cybern., 2016

Heterogeneous discriminant analysis for cross-view action recognition.
Neurocomputing, 2016

A Hierarchical Video Description for Complex Activity Understanding.
Int. J. Comput. Vis., 2016

Multi-group-multi-class domain adaptation for event recognition.
IET Comput. Vis., 2016

Multimedia event detection via deep spatial-temporal neural networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

2015
Cross-View Action Recognition Over Heterogeneous Feature Spaces.
IEEE Trans. Image Process., 2015

Cross-domain structural model for video event annotation via web images.
Multim. Tools Appl., 2015

Heterogeneous Discriminant Analysis for Cross-View Action Recognition.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

A Multiple Image Group Adaptation Approach for Event Recognition in Consumer Videos.
Proceedings of the Image and Graphics - 8th International Conference, 2015

Finding Event Videos via Image Search Engine.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

Incremental Discriminant Learning for Heterogeneous Domain Adaptation.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

2014
Video Annotation via Image Groups from the Web.
IEEE Trans. Multim., 2014

Learning a discriminative mid-level feature for action recognition.
Sci. China Inf. Sci., 2014

A system for TRECVID MED by MCIS.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Modeling the Relationship of Action, Object, and Scene.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Multi-group Adaptation for Event Recognition from Videos.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Video Annotation by Incremental Learning from Grouped Heterogeneous Sources.
Proceedings of the Computer Vision - ACCV 2014, 2014

Weakly Supervised Action Recognition and Localization Using Web Images.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Action Recognition Using Multilevel Features and Latent Structural SVM.
IEEE Trans. Circuits Syst. Video Technol., 2013

Scene image retrieval via re-ranking semantic and packed dense interestpoints.
Neurocomputing, 2013

Cross-View Action Recognition over Heterogeneous Feature Spaces.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Transfer Discriminant-Analysis of Canonical Correlations for View-Transfer Action Recognition.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Annotating videos from the web images.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Action recognition with discriminative mid-level features.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

View-Invariant Action Recognition Using Latent Kernelized Structural SVM.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
Action recognition using context and appearance distribution features.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Incremental discriminant-analysis of canonical correlations for action recognition.
Pattern Recognit., 2010

Discriminative human action recognition in the learned hierarchical manifold space.
Image Vis. Comput., 2010

2009
Action recognition feedback-based framework for human pose reconstruction from monocular images.
Pattern Recognit. Lett., 2009

Tracking articulated objects by learning intrinsic structure of motion.
Pattern Recognit. Lett., 2009

Incremental discriminative-analysis of canonical correlations for action recognition.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

2008
Human action recognition using discriminative models in the learned hierarchical manifold space.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008


  Loading...