Yanxiong Li

Orcid: 0000-0003-4362-1125

According to our database1, Yanxiong Li authored at least 51 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Few-Shot Class-Incremental Audio Classification Using Dynamically Expanded Classifier With Self-Attention Modified Prototypes.
IEEE Trans. Multim., 2024

Lightweight Speaker Verification Using Transformation Module With Feature Partition and Fusion.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2023
Few-shot class-incremental audio classification via discriminative prototype learning.
Expert Syst. Appl., September, 2023

Audiovisual Dependency Attention for Violence Detection in Videos.
IEEE Trans. Multim., 2023

Few-Shot Speaker Identification Using Lightweight Prototypical Network With Feature Grouping and Interaction.
IEEE Trans. Multim., 2023

Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes.
CoRR, 2023

Clean Sample Guided Self-Knowledge Distillation for Image Classification.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Fall event detection with global and temporal local information in real-world videos.
Multim. Tools Appl., 2022

Predicting skeleton trajectories using a Skeleton-Transformer for video anomaly detection.
Multim. Syst., 2022

Deep mutual attention network for acoustic scene classification.
Digit. Signal Process., 2022

Speaker verification using attentive multi-scale convolutional recurrent network.
Appl. Soft Comput., 2022

Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network.
Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

2021
Speaker Clustering by Co-Optimizing Deep Representation Learning and Cluster Estimation.
IEEE Trans. Multim., 2021

Memory-Replay Knowledge Distillation.
Sensors, 2021

Domestic Activities Classification from Audio Recordings Using Multi-scale Dilated Depthwise Separable Convolutional Network.
Proceedings of the 23rd International Workshop on Multimedia Signal Processing, 2021

Acoustic Scene Classification Using Deep CNNs With Time-Frequency Representations.
Proceedings of the 21st International Conference on Communication Technology, 2021

Acoustic Scene Classification Using Aggregation of Two-Scale Deep Embeddings.
Proceedings of the 21st International Conference on Communication Technology, 2021

A Stage Match for Query-by-Example Spoken Term Detection Based On Structure Information of Query.
Proceedings of the IEEE International Conference on Acoustics, 2021

Violence Detection in Videos Based on Fusing Visual and Audio Information.
Proceedings of the IEEE International Conference on Acoustics, 2021

Domestic Activities Clustering From Audio Recordings Using Convolutional Capsule Autoencoder Network.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration.
IEEE Trans. Multim., 2020

Sound Event Detection with Depthwise Separable and Dilated Convolutions.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Acoustic event diarization in TV/movie audios using deep embedding and integer linear programming.
Multim. Tools Appl., 2019

2018
Mobile Phone Clustering From Speech Recordings Using Deep Representation and Spectral Clustering.
IEEE Trans. Inf. Forensics Secur., 2018

Using multi-stream hierarchical deep neural network to extract deep audio feature for acoustic event detection.
Multim. Tools Appl., 2018

Dictionary learning based on M-PCA-N for audio signal sparse representation.
IET Signal Process., 2018

Anomalous Sound Detection Using Deep Audio Representation and a BLSTM Network for Audio Surveillance of Roads.
IEEE Access, 2018

Frontal Face Generation from Multiple Pose-Variant Faces with CGAN in Real-World Surveillance Scene.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Sparse representation-based quasi-clean speech construction for speech quality assessment under complex environments.
IET Signal Process., 2017

Unsupervised detection of acoustic events using information bottleneck principle.
Digit. Signal Process., 2017

Unsupervised classification of speaker roles in multi-participant conversational speech.
Comput. Speech Lang., 2017

Mobile phone clustering from acquired speech recordings using deep Gaussian supervector and spectral clustering.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Home security surveillance based on acoustic scenes analysis.
Proceedings of the 10th International Congress on Image and Signal Processing, 2017

2016
Source cell phone matching from speech recordings by sparse representation and KISS metric.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2014
An Algorithm of Speaker Clustering Based on Model Distance.
J. Multim., 2014

Fast speaker clustering using distance of feature matrix mean and adaptive convergence threshold.
IET Signal Process., 2014

A novel time-frequency feature extraction for movie audio signals classification.
Proceedings of the International Conference on Audio, 2014

2013
Speaker Change Detection based on Mean Shift.
J. Comput., 2013

Liveness detection using time drift between lip movement and voice.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2013

An adaptive premium penalty ant colony optimization algorithm.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2013

System design for monitoring infant speech emotion.
Proceedings of the 10th IEEE International Conference on Control and Automation, 2013

2012
A two -step hybrid approach for voiceprint-biometric template protection.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

Fractal dimension feature for distinguishing between overlapped speech and single-speaker speech.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

2011
Detecting laughter in spontaneous speech by constructing laughter bouts.
Int. J. Speech Technol., 2011

2010
Genetic algorithm based simultaneous optimization of feature subsets and hidden Markov model parameters for discrimination between speech and non-speech events.
Int. J. Speech Technol., 2010

2009
Audio ACP-Based Story Segmentation for TV News.
Proceedings of the Computer and Information Science 2009 [outstanding papers from the 8th ACIS/IEEE International Conference on Computer and Information Science, 2009

Characteristics-based effective applause detection for meeting speech.
Signal Process., 2009

2008
Cancelable Voiceprint Templates Based on Knowledge Signatures.
Proceedings of The International Symposium on Electronic Commerce and Security, 2008

A Novel Detection Method of Filled Pause in Mandarin Spontaneous Speech.
Proceedings of the 7th IEEE/ACIS International Conference on Computer and Information Science, 2008


  Loading...