We stand with Ukraine

We stand with Ukraine

Yanli Ji

Orcid: 0000-0001-9122-6141

According to our database¹, Yanli Ji authored at least 67 papers between 2010 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Vision-Language Collaborative Representation Learning for Action Quality Assessment.

[DOI]

,

,

,

,

,

,

IEEE Trans. Image Process., 2026

2025

Visual-Semantic Alignment Temporal Parsing for Action Quality Assessment.

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., March, 2025

ReMP-AD: Retrieval-Enhanced Multi-Modal Prompt Fusion for Few-Shot Industrial Visual Anomaly Detection.

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024

SV-Learner: Support-Vector Contrastive Learning for Robust Learning With Noisy Labels.

[DOI]

,

,

,

,

IEEE Trans. Knowl. Data Eng., October, 2024

Dominant SIngle-Modal SUpplementary Fusion (SIMSUF) for Multimodal Sentiment Analysis.

[DOI]

,

,

,

,

IEEE Trans. Multim., 2024

Self-Supervised Sub-Action Parsing Network for Semi-Supervised Action Quality Assessment.

[DOI]

,

,

,

,

IEEE Trans. Image Process., 2024

Corrigendum to "Learning with Noisy Labels Using Collaborative Sample Selection and Contrastive Semi-Supervised Learning" [Knowledge-Based Systems 296 (2024) 111860].

[DOI]

,

,

,

,

,

,

Knowl. Based Syst., 2024

Learning with noisy labels using collaborative sample selection and contrastive semi-supervised learning.

[DOI]

,

,

,

,

,

,

Knowl. Based Syst., 2024

Independency Adversarial Learning for Cross-Modal Sound Separation.

[DOI]

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Relation-mining self-attention network for skeleton-based human action recognition.

[DOI]

,

,

,

,

Pattern Recognit., July, 2023

Region Attention Enhanced Unsupervised Cross-Domain Facial Emotion Recognition.

[DOI]

,

,

,

IEEE Trans. Knowl. Data Eng., April, 2023

Layer-fusion for online mutual knowledge distillation.

[DOI]

,

,

,

Multim. Syst., April, 2023

Self-Supervised Fine-Grained Cycle-Separation Network (FSCN) for Visual-Audio Separation.

[DOI]

,

,

,

,

IEEE Trans. Multim., 2023

Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment.

,

,

,

,

IEEE Trans. Image Process., 2023

Localization-assisted Uncertainty Score Disentanglement Network for Action Quality Assessment.

[DOI]

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Cross-modality Representation Interactive Learning for Multimodal Sentiment Analysis.

[DOI]

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unsupervised Sounding Pixel Learning.

[DOI]

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

Cross-Modal Dynamic Networks for Video Moment Retrieval With Text Query.

[DOI]

,

,

,

,

,

IEEE Trans. Multim., 2022

View-Invariant Human Action Recognition Via View Transformation Network (VTN).

[DOI]

,

,

,

,

,

IEEE Trans. Multim., 2022

Answer Again: Improving VQA With Cascaded-Answering Model.

[DOI]

,

,

,

,

,

IEEE Trans. Knowl. Data Eng., 2022

Multi-level Multi-modal Feature Fusion for Action Recognition in Videos.

[DOI]

,

,

Kumie Alemu Gedamu

Proceedings of the HCMA@MM 2022: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis, 2022

Global-Local Cross-View Fisher Discrimination for View-Invariant Action Recognition.

[DOI]

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Selective Hypergraph Convolutional Networks for Skeleton-based Action Recognition.

[DOI]

,

,

,

,

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

2021

Arbitrary-View Human Action Recognition: A Varying-View RGB-D Action Dataset.

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2021

View-invariant action recognition via Unsupervised AttentioN Transfer (UANT).

[DOI]

,

,

,

Pattern Recognit., 2021

Arbitrary-view human action recognition via novel-view action generation.

[DOI]

,

,

,

,

Pattern Recognit., 2021

Fusing functional connectivity with network nodal information for sparse network pattern learning of functional brain networks.

[DOI]

,

,

,

,

,

Inf. Fusion, 2021

Vision-guided Music Source Separation via a Fine-grained Cycle-Separation Network.

[DOI]

,

,

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation.

[DOI]

,

,

,

,

,

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Graph Convolutional Hourglass Networks for Skeleton-Based Action Recognition.

[DOI]

,

,

,

,

,

Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Multi-Stage Aggregated Transformer Network for Temporal Language Localization in Videos.

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Partial Feature Selection and Alignment for Multi-Source Domain Adaptation.

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

A Context Knowledge Map Guided Coarse-to-Fine Action Recognition.

[DOI]

,

,

,

,

,

IEEE Trans. Image Process., 2020

A Survey of Human Action Analysis in HRI Applications.

[DOI]

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2020

Graph-based variational auto-encoder for generalized zero-shot learning.

[DOI]

,

,

,

,

,

Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Universal Weighting Metric Learning for Cross-Modal Matching.

[DOI]

,

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning to Optimize Non-Rigid Tracking.

[DOI]

,

,

,

,

,

Matthias Nießner

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Deep adversarial metric learning for cross-modal retrieval.

[DOI]

,

,

,

,

World Wide Web, 2019

More is Better: Precise and Detailed Image Captioning Using Online Positive Recall and Missing Concepts Mining.

[DOI]

,

,

,

,

,

IEEE Trans. Image Process., 2019

Word-to-region attention network for visual question answering.

[DOI]

,

,

,

,

,

,

Multim. Tools Appl., 2019

Cross-domain facial expression recognition via an intra-category common feature and inter-category Distinction feature fusion network.

[DOI]

,

,

,

,

Neurocomputing, 2019

Learning one-to-many stylised Chinese character transformation and generation by generative adversarial networks.

[DOI]

,

,

,

IET Image Process., 2019

A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition.

[DOI]

,

,

,

,

,

CoRR, 2019

Attention Transfer (ANT) Network for View-invariant Action Recognition.

[DOI]

,

,

,

,

,

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learning to create multi-stylized Chinese character fonts by generative adversarial networks.

[DOI]

,

,

,

Proceedings of the ACM Turing Celebration Conference - China, 2019

2018

Video Captioning by Adversarial LSTM.

[DOI]

,

,

,

,

,

,

IEEE Trans. Image Process., 2018

Recognition and Detection of Two-Person Interactive Actions Using Automatically Selected Skeleton Features.

[DOI]

,

,

,

,

,

IEEE Trans. Hum. Mach. Syst., 2018

Recurrent attention network using spatial-temporal relations for action recognition.

[DOI]

,

,

,

,

Signal Process., 2018

One-shot learning based pattern transition map for action early recognition.

[DOI]

,

,

,

Signal Process., 2018

Semantic binary coding for visual recognition via joint concept-attribute modelling.

[DOI]

,

,

,

,

,

Multim. Tools Appl., 2018

Hierarchical topology based hand pose estimation from a single depth image.

[DOI]

,

,

,

Multim. Tools Appl., 2018

Domain Invariant Subspace Learning for Cross-Modal Retrieval.

[DOI]

,

,

,

,

,

Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

A Large-scale RGB-D Database for Arbitrary-view Human Action Recognition.

[DOI]

,

,

,

,

,

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Domain separation network for cross-modal retrieval.

[DOI]

,

,

,

,

Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

2017

Gazing point dependent eye gaze estimation.

[DOI]

,

,

,

,

,

,

Pattern Recognit., 2017

Exploiting Concept Correlation with Attributes for Semantic Binary Representation Learning.

[DOI]

,

,

,

,

,

Proceedings of the Internet Multimedia Computing and Service, 2017

Deep Semantic Indexing Using Convolutional Localization Network with Region-Based Visual Attention for Image Database.

[DOI]

,

,

,

,

,

Proceedings of the Databases Theory and Applications, 2017

2016

Multi-cue Information Fusion for Two-Layer Activity Recognition.

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015

Learning contrastive feature distribution model for interaction recognition.

[DOI]

,

,

,

J. Vis. Commun. Image Represent., 2015

A Survey on Media Interaction in Social Robotics.

[DOI]

,

,

,

,

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Real-Time Understanding of Abnormal Crowd Behavior on Social Robots.

[DOI]

,

,

,

,

,

Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

2014

Interactive body part contrast mining for human interaction recognition.

[DOI]

,

,

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

3D Medial Axis Distance for hand detection.

[DOI]

,

,

,

,

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

2013

A cooperative spectrum sensing scheme based on Detecting Reliability Statistics in cognitive radio.

[DOI]

,

,

,

,

Proceedings of the 24th IEEE Annual International Symposium on Personal, 2013

2012

Linear Semi-Supervised Dimensionality Reduction with Pairwise Constraint for Multiple Subclasses.

[DOI]

,

,

,

Einoshin Suzuki

IEICE Trans. Inf. Syst., 2012

Cooking gesture recognition using local feature and depth image.

[DOI]

,

,

Atsushi Shimada

,

Hajime Nagahara

,

Rin-Ichiro Taniguchi

Proceedings of the ACM multimedia 2012 workshop on Multimedia for cooking and eating activities, 2012

2010

Human Action Recognition by SOM Considering the Probability of Spatio-temporal Features.

[DOI]

,

Atsushi Shimada

,

Rin-Ichiro Taniguchi

Proceedings of the Neural Information Processing. Models and Applications, 2010

Loading...