Junyi Peng

Orcid: 0000-0002-4103-5416

According to our database1, Junyi Peng authored at least 33 papers between 2004 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
SeamlessFlow: A Trainer Agent Isolation RL Framework Achieving Bubble-Free Pipelines via Tag Scheduling.
CoRR, August, 2025

SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline.
CoRR, May, 2025

Analysis of ABC Frontend Audio Systems for the NIST-SRE24.
CoRR, May, 2025

SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM.
CoRR, April, 2025

ESPnet-SpeechLM: An Open Speech Language Model Toolkit.
CoRR, February, 2025

LMSST-GCN: Longitudinal MRI sub-structural texture guided graph convolution network for improved progression prediction of knee osteoarthritis.
Comput. Methods Programs Biomed., 2025

CA-MHFA: A Context-Aware Multi-Head Factorized Attentive Pooling for SSL-Based Speaker Verification.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
BUT Systems and Analyses for the ASVspoof 5 Challenge.
CoRR, 2024

Multi-Level fusion graph neural network: Application to PET and CT imaging for risk stratification of head and neck cancer.
Biomed. Signal Process. Control., 2024

Investigation of Speaker Representation for Target-Speaker Speech Processing.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Multi-Channel Extension of Pre-trained Models for Speaker Verification.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Target Speech Extraction with Pre-Trained Self-Supervised Learning Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

Probing Self-Supervised Learning Models With Target Speech Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Functional-structural sub-region graph convolutional network (FSGCN): Application to the prognosis of head and neck cancer with PET/CT imaging.
Comput. Methods Programs Biomed., March, 2023

Improving Speaker Verification with Self-Pretrained Transformer Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multi-Channel Speech Separation with Cross-Attention and Beamforming.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Description and Analysis of ABC Submission to NIST LRE 2022.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
An Attention-Based Backend Allowing Efficient Fine-Tuning of Transformer Models for Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Progressive Contrastive Learning for Self-Supervised Text-Independent Speaker Verification.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Learnable Sparse Filterbank for Speaker Verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Effective Phase Encoding for End-To-End Speaker Verification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Deep Speaker Embedding with Long Short Term Centroid Learning for Text-Independent Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Context-adaptive Gaussian Attention for Text-independent Speaker Verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Discriminative Feature Learning for Speech Emotion Recognition.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Text and Time Series, 2019

Syllable-Dependent Discriminative Learning for Small Footprint Text-Dependent Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Logistic Similarity Metric Learning via Affinity Matrix for Text-Independent Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Speaker-discriminative Embedding Learning via Affinity Matrix for Short Utterance Speaker Verification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Alleviate Cross-chunk Permutation through Chunk-level Speaker Embedding for Blind Speech Separation.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2004
A Multi-Object Tracking System for Surveillance Video Analysis.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Semantic-based traffic video retrieval using activity pattern analysis.
Proceedings of the 2004 International Conference on Image Processing, 2004


  Loading...