We stand with Ukraine

We stand with Ukraine

Yanxiong Li

Orcid: 0000-0003-4362-1125

According to our database¹, Yanxiong Li authored at least 64 papers between 2008 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2025

Low-complexity speaker embedding module with feature segmentation, transformation and reconstruction for few-shot speaker identification.

[DOI]

,

,

,

Expert Syst. Appl., 2025

Investigation on the Dynamic Thermal Behavior for High Voltage Cable With Air Gap Considering the Thermal Expansion of Insulation.

[DOI]

,

,

,

,

,

,

,

IEEE Access, 2025

Infant Cry Detection In Noisy Environment Using Blueprint Separable Convolutions and Time-Frequency Recurrent Neural Network.

[DOI]

,

Proceedings of the IEEE International Workshop on Multimedia Signal Processing, 2025

Infant Cry Emotion Recognition Using Improved ECAPA-TDNN with Multi-scale Feature Fusion and Attention Enhancement.

[DOI]

,

,

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Generalizable Audio Deepfake Detection via Hierarchical Structure Learning and Feature Whitening in Poincaré sphere.

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Fully Few-shot Class-incremental Audio Classification Using Multi-level Embedding Extractor and Ridge Regression Classifier.

[DOI]

,

,

,

,

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Cross-Domain Few-Shot Open-Set Keyword Spotting Using Keyword Adaptation and Prototype Reprojection.

[DOI]

,

,

,

,

,

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Deep Enhancement Spotting Network for Low-complexity Keyword Spotting in Noisy Environments.

[DOI]

,

,

,

,

,

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

Few-Shot Class-Incremental Audio Classification Using Dynamically Expanded Classifier With Self-Attention Modified Prototypes.

[DOI]

,

,

,

,

Emmanouil Benetos

IEEE Trans. Multim., 2024

Few-Shot Class-Incremental Audio Classification With Adaptive Mitigation of Forgetting and Overfitting.

[DOI]

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Lightweight Speaker Verification Using Transformation Module With Feature Partition and Fusion.

[DOI]

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Detecting video anomalies by jointly utilizing appearance and skeleton information.

[DOI]

,

,

,

Expert Syst. Appl., 2024

Fully Few-shot Class-incremental Audio Classification Using Expandable Dual-embedding Extractor.

[DOI]

,

,

,

,

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network.

[DOI]

,

,

,

,

,

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

Few-shot class-incremental audio classification via discriminative prototype learning.

[DOI]

,

,

,

Expert Syst. Appl., September, 2023

Audiovisual Dependency Attention for Violence Detection in Videos.

[DOI]

,

,

,

,

IEEE Trans. Multim., 2023

Few-Shot Speaker Identification Using Lightweight Prototypical Network With Feature Grouping and Interaction.

[DOI]

,

,

,

,

IEEE Trans. Multim., 2023

Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes.

[DOI]

,

,

,

,

Tuomas Virtanen

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Few-shot Class-incremental Audio Classification Using Stochastic Classifier.

[DOI]

,

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Clean Sample Guided Self-Knowledge Distillation for Image Classification.

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Fall event detection with global and temporal local information in real-world videos.

[DOI]

,

,

,

Multim. Tools Appl., 2022

Predicting skeleton trajectories using a Skeleton-Transformer for video anomaly detection.

[DOI]

,

,

Multim. Syst., 2022

Deep mutual attention network for acoustic scene classification.

[DOI]

,

,

,

Digit. Signal Process., 2022

Speaker verification using attentive multi-scale convolutional recurrent network.

[DOI]

,

,

,

Appl. Soft Comput., 2022

Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention.

[DOI]

,

,

,

,

,

Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network.

[DOI]

,

,

Konstantinos Drossos

,

Tuomas Virtanen

Proceedings of the 24th IEEE International Workshop on Multimedia Signal Processing, 2022

2021

Speaker Clustering by Co-Optimizing Deep Representation Learning and Cluster Estimation.

[DOI]

,

,

,

,

IEEE Trans. Multim., 2021

Memory-Replay Knowledge Distillation.

[DOI]

,

,

Sensors, 2021

Domestic Activities Classification from Audio Recordings Using Multi-scale Dilated Depthwise Separable Convolutional Network.

[DOI]

,

,

,

,

Proceedings of the 23rd International Workshop on Multimedia Signal Processing, 2021

Acoustic Scene Classification Using Deep CNNs With Time-Frequency Representations.

[DOI]

,

,

,

Proceedings of the 21st International Conference on Communication Technology, 2021

Acoustic Scene Classification Using Aggregation of Two-Scale Deep Embeddings.

[DOI]

,

,

,

,

,

,

Proceedings of the 21st International Conference on Communication Technology, 2021

A Stage Match for Query-by-Example Spoken Term Detection Based On Structure Information of Query.

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Violence Detection in Videos Based on Fusing Visual and Audio Information.

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Domestic Activities Clustering From Audio Recordings Using Convolutional Capsule Autoencoder Network.

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration.

[DOI]

,

,

,

,

IEEE Trans. Multim., 2020

Sound Event Detection with Depthwise Separable and Dilated Convolutions.

[DOI]

Konstantinos Drossos

,

Stylianos I. Mimilakis

,

,

,

Tuomas Virtanen

Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Sound Event Detection Via Dilated Convolutional Recurrent Neural Networks.

[DOI]

,

,

Konstantinos Drossos

,

Tuomas Virtanen

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Acoustic event diarization in TV/movie audios using deep embedding and integer linear programming.

[DOI]

,

,

,

,

,

Multim. Tools Appl., 2019

2018

Mobile Phone Clustering From Speech Recordings Using Deep Representation and Spectral Clustering.

[DOI]

,

,

,

,

,

IEEE Trans. Inf. Forensics Secur., 2018

Using multi-stream hierarchical deep neural network to extract deep audio feature for acoustic event detection.

[DOI]

,

,

,

,

,

,

Multim. Tools Appl., 2018

Dictionary learning based on M-PCA-N for audio signal sparse representation.

[DOI]

,

,

,

,

,

IET Signal Process., 2018

Anomalous Sound Detection Using Deep Audio Representation and a BLSTM Network for Audio Surveillance of Roads.

[DOI]

,

,

,

,

IEEE Access, 2018

Frontal Face Generation from Multiple Pose-Variant Faces with CGAN in Real-World Surveillance Scene.

[DOI]

,

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Sparse representation-based quasi-clean speech construction for speech quality assessment under complex environments.

[DOI]

,

,

,

IET Signal Process., 2017

Unsupervised detection of acoustic events using information bottleneck principle.

[DOI]

,

,

,

,

,

,

,

Digit. Signal Process., 2017

Unsupervised classification of speaker roles in multi-participant conversational speech.

[DOI]

,

,

,

,

,

,

,

,

Comput. Speech Lang., 2017

Mobile phone clustering from acquired speech recordings using deep Gaussian supervector and spectral clustering.

[DOI]

,

,

,

,

,

,

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Home security surveillance based on acoustic scenes analysis.

[DOI]

,

,

,

Proceedings of the 10th International Congress on Image and Signal Processing, 2017

2016

Source cell phone matching from speech recordings by sparse representation and KISS metric.

[DOI]

,

,

,

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2014

An Algorithm of Speaker Clustering Based on Model Distance.

[DOI]

,

,

,

J. Multim., 2014

Fast speaker clustering using distance of feature matrix mean and adaptive convergence threshold.

[DOI]

,

,

,

,

,

IET Signal Process., 2014

A novel time-frequency feature extraction for movie audio signals classification.

[DOI]

,

,

,

Proceedings of the International Conference on Audio, 2014

2013

Speaker Change Detection based on Mean Shift.

[DOI]

,

,

,

J. Comput., 2013

Liveness detection using time drift between lip movement and voice.

[DOI]

,

,

,

,

Proceedings of the International Conference on Machine Learning and Cybernetics, 2013

An adaptive premium penalty ant colony optimization algorithm.

[DOI]

,

,

,

,

Proceedings of the International Conference on Machine Learning and Cybernetics, 2013

System design for monitoring infant speech emotion.

[DOI]

,

,

,

,

,

Proceedings of the 10th IEEE International Conference on Control and Automation, 2013

2012

A two -step hybrid approach for voiceprint-biometric template protection.

[DOI]

,

,

Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

Fractal dimension feature for distinguishing between overlapped speech and single-speaker speech.

[DOI]

,

,

,

,

Proceedings of the International Conference on Machine Learning and Cybernetics, 2012

2011

Detecting laughter in spontaneous speech by constructing laughter bouts.

[DOI]

,

Int. J. Speech Technol., 2011

2010

Genetic algorithm based simultaneous optimization of feature subsets and hidden Markov model parameters for discrimination between speech and non-speech events.

[DOI]

,

,

,

,

Int. J. Speech Technol., 2010

2009

Audio ACP-Based Story Segmentation for TV News.

[DOI]

,

,

,

,

Proceedings of the Computer and Information Science 2009 [outstanding papers from the 8th ACIS/IEEE International Conference on Computer and Information Science, 2009

Characteristics-based effective applause detection for meeting speech.

[DOI]

,

,

,

,

Signal Process., 2009

2008

Cancelable Voiceprint Templates Based on Knowledge Signatures.

[DOI]

,

,

,

Proceedings of The International Symposium on Electronic Commerce and Security, 2008

A Novel Detection Method of Filled Pause in Mandarin Spontaneous Speech.

[DOI]

,

,

Proceedings of the 7th IEEE/ACIS International Conference on Computer and Information Science, 2008

Loading...