Yanli Ji

Orcid: 0000-0001-9122-6141

According to our database1, Yanli Ji authored at least 61 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Independency Adversarial Learning for Cross-Modal Sound Separation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Relation-mining self-attention network for skeleton-based human action recognition.
Pattern Recognit., July, 2023

Region Attention Enhanced Unsupervised Cross-Domain Facial Emotion Recognition.
IEEE Trans. Knowl. Data Eng., April, 2023

Layer-fusion for online mutual knowledge distillation.
Multim. Syst., April, 2023

Self-Supervised Fine-Grained Cycle-Separation Network (FSCN) for Visual-Audio Separation.
IEEE Trans. Multim., 2023

Fine-Grained Spatio-Temporal Parsing Network for Action Quality Assessment.
IEEE Trans. Image Process., 2023

Learning with Noisy Labels Using Collaborative Sample Selection and Contrastive Semi-Supervised Learning.
CoRR, 2023

Localization-assisted Uncertainty Score Disentanglement Network for Action Quality Assessment.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Cross-modality Representation Interactive Learning for Multimodal Sentiment Analysis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unsupervised Sounding Pixel Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Cross-Modal Dynamic Networks for Video Moment Retrieval With Text Query.
IEEE Trans. Multim., 2022

View-Invariant Human Action Recognition Via View Transformation Network (VTN).
IEEE Trans. Multim., 2022

Answer Again: Improving VQA With Cascaded-Answering Model.
IEEE Trans. Knowl. Data Eng., 2022

Multi-level Multi-modal Feature Fusion for Action Recognition in Videos.
Proceedings of the HCMA@MM 2022: Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis, 2022

Global-Local Cross-View Fisher Discrimination for View-Invariant Action Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Selective Hypergraph Convolutional Networks for Skeleton-based Action Recognition.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

2021
Arbitrary-View Human Action Recognition: A Varying-View RGB-D Action Dataset.
IEEE Trans. Circuits Syst. Video Technol., 2021

View-invariant action recognition via Unsupervised AttentioN Transfer (UANT).
Pattern Recognit., 2021

Arbitrary-view human action recognition via novel-view action generation.
Pattern Recognit., 2021

Fusing functional connectivity with network nodal information for sparse network pattern learning of functional brain networks.
Inf. Fusion, 2021

Vision-guided Music Source Separation via a Fine-grained Cycle-Separation Network.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

PoseGTAC: Graph Transformer Encoder-Decoder with Atrous Convolution for 3D Human Pose Estimation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Graph Convolutional Hourglass Networks for Skeleton-Based Action Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Multi-Stage Aggregated Transformer Network for Temporal Language Localization in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Partial Feature Selection and Alignment for Multi-Source Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
A Context Knowledge Map Guided Coarse-to-Fine Action Recognition.
IEEE Trans. Image Process., 2020

A Survey of Human Action Analysis in HRI Applications.
IEEE Trans. Circuits Syst. Video Technol., 2020

Periodic multimedia spectrum sensing method based on high-order anti-jamming mechanism in cognitive wireless networks.
Multim. Tools Appl., 2020

Graph-based variational auto-encoder for generalized zero-shot learning.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Universal Weighting Metric Learning for Cross-Modal Matching.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning to Optimize Non-Rigid Tracking.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Deep adversarial metric learning for cross-modal retrieval.
World Wide Web, 2019

More is Better: Precise and Detailed Image Captioning Using Online Positive Recall and Missing Concepts Mining.
IEEE Trans. Image Process., 2019

Word-to-region attention network for visual question answering.
Multim. Tools Appl., 2019

Cross-domain facial expression recognition via an intra-category common feature and inter-category Distinction feature fusion network.
Neurocomputing, 2019

Learning one-to-many stylised Chinese character transformation and generation by generative adversarial networks.
IET Image Process., 2019

A Large-scale Varying-view RGB-D Action Dataset for Arbitrary-view Human Action Recognition.
CoRR, 2019

Attention Transfer (ANT) Network for View-invariant Action Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learning to create multi-stylized Chinese character fonts by generative adversarial networks.
Proceedings of the ACM Turing Celebration Conference - China, 2019

2018
Video Captioning by Adversarial LSTM.
IEEE Trans. Image Process., 2018

Recognition and Detection of Two-Person Interactive Actions Using Automatically Selected Skeleton Features.
IEEE Trans. Hum. Mach. Syst., 2018

Recurrent attention network using spatial-temporal relations for action recognition.
Signal Process., 2018

One-shot learning based pattern transition map for action early recognition.
Signal Process., 2018

Semantic binary coding for visual recognition via joint concept-attribute modelling.
Multim. Tools Appl., 2018

Hierarchical topology based hand pose estimation from a single depth image.
Multim. Tools Appl., 2018

Domain Invariant Subspace Learning for Cross-Modal Retrieval.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

A Large-scale RGB-D Database for Arbitrary-view Human Action Recognition.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Domain separation network for cross-modal retrieval.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

2017
Gazing point dependent eye gaze estimation.
Pattern Recognit., 2017

Exploiting Concept Correlation with Attributes for Semantic Binary Representation Learning.
Proceedings of the Internet Multimedia Computing and Service, 2017

Deep Semantic Indexing Using Convolutional Localization Network with Region-Based Visual Attention for Image Database.
Proceedings of the Databases Theory and Applications, 2017

2016
Multi-cue Information Fusion for Two-Layer Activity Recognition.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015
Learning contrastive feature distribution model for interaction recognition.
J. Vis. Commun. Image Represent., 2015

A Survey on Media Interaction in Social Robotics.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Real-Time Understanding of Abnormal Crowd Behavior on Social Robots.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

2014
Interactive body part contrast mining for human interaction recognition.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

3D Medial Axis Distance for hand detection.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

2013
A cooperative spectrum sensing scheme based on Detecting Reliability Statistics in cognitive radio.
Proceedings of the 24th IEEE Annual International Symposium on Personal, 2013

2012
Linear Semi-Supervised Dimensionality Reduction with Pairwise Constraint for Multiple Subclasses.
IEICE Trans. Inf. Syst., 2012

Cooking gesture recognition using local feature and depth image.
Proceedings of the ACM multimedia 2012 workshop on Multimedia for cooking and eating activities, 2012

2010
Human Action Recognition by SOM Considering the Probability of Spatio-temporal Features.
Proceedings of the Neural Information Processing. Models and Applications, 2010


  Loading...