Jia Qi Yip

Orcid: 0000-0002-9896-9658

According to our database, Jia Qi Yip authored at least 28 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Speechless: Speech Instruction Training Without Speech for Low Resource Languages.
CoRR, May, 2025

Robust Audio Deepfake Detection using Ensemble Confidence Calibration.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Extending Whisper for Emotion Prediction Using Word-level Pseudo Labels.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Speech Enhancement Using Continuous Embeddings of Neural Audio Codec.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music.
CoRR, 2024

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks.
CoRR, 2024

Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions.
CoRR, 2024

Improved Alignment for Score Combination of RNN-T and CTC Decoder for Online Decoding.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

Continual Learning With Embedding Layer Surgery and Task-Wise Beam Search Using Whisper.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs For Audio, Music, and Speech.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Continual Learning Optimizations for Auto-regressive Decoder of Multilingual ASR systems.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Towards Audio Codec-based Speech Separation.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2024

SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance.
Proceedings of the IEEE International Conference on Acoustics, 2024

Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Low Resource Language Adaptation using Two-stage Regularization for Multilingual ASR.
Proceedings of the International Conference on Asian Language Processing, 2024

Low-resource Language Adaptation with Ensemble of PEFT Approaches.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024

Speech Separation using Neural Audio Codecs with Embedding Loss.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024

2023
Codec Data Augmentation for Time-domain Heart Sound Classification.
CoRR, 2023

ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention.
CoRR, 2023

deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition.
CoRR, 2023

ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Contrastive Speech Mixup for Low-Resource Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2023

De'hubert: Disentangling Noise in a Self-Supervised Model for Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization.
CoRR, 2022

