Jee-Weon Jung

Orcid: 0000-0003-0505-2988

According to our database, Jee-Weon Jung authored at least 68 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Timeline

[Publication timeline chart not shown. Legend categories: Book, In proceedings, Article, PhD thesis, Dataset, Other.]

Bibliography

2024
a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification.
CoRR, 2024

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages.
CoRR, 2024

Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
CoRR, 2024

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.
CoRR, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
CoRR, 2024

Improving Design of Input Condition Invariant Speech Enhancement.
CoRR, 2024

AugSumm: towards generalizable speech summarization using synthetic labels from large language model.
CoRR, 2024

2023
Understanding Probe Behaviors through Variational Bounds of Mutual Information.
CoRR, 2023

UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network.
CoRR, 2023

One model to rule them all? Towards End-to-End Joint Speaker Diarization and Speech Recognition.
CoRR, 2023

Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation.
CoRR, 2023

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study.
CoRR, 2023

VoxtLM: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks.
CoRR, 2023

Encoder-decoder multimodal speaker change detection.
CoRR, 2023

Multi-Dataset Co-Training with Sharpness-Aware Optimization for Audio Anti-spoofing.
CoRR, 2023

Towards single integrated spoofing-aware speaker verification embeddings.
CoRR, 2023

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge.
CoRR, 2023

Absolute Decision Corrupts Absolutely: Conservative Online Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Advancing the Dimensionality Reduction of Speaker Embeddings for Speaker Diarisation: Disentangling Noise and Informing Speech Activity.
Proceedings of the IEEE International Conference on Acoustics, 2023

In Search of Strong Embedding Extractors for Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2023

High-Resolution Embedding Extractor for Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Disentangled representation learning for multilingual speaker recognition.
CoRR, 2022

Large-scale learning of generalised representations for speaker recognition.
CoRR, 2022

Selective Kernel Attention for Robust Speaker Verification.
CoRR, 2022

SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan.
CoRR, 2022

Frequency and Multi-Scale Selective Kernel Attention for Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Automatic Speaker Verification Spoofing and Deepfake Detection Using Wav2vec 2.0 and Data Augmentation.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

SASV 2022: The First Spoofing-Aware Speaker Verification Challenge.
Proceedings of the Interspeech 2022, 2022

Pushing the limits of raw waveform speaker recognition.
Proceedings of the Interspeech 2022, 2022

Attentive Max Feature Map and Joint Training for Acoustic Scene Classification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-Scale Speaker Embedding-Based Graph Attention Networks For Speaker Diarisation.
Proceedings of the IEEE International Conference on Acoustics, 2022

AASIST: Audio Anti-Spoofing Using Integrated Spectro-Temporal Graph Attention Networks.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Disentangled dimensionality reduction for noise-robust speaker diarisation.
CoRR, 2021

End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection.
CoRR, 2021

Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes.
CoRR, 2021

Learning Metrics from Mean Teacher: A Supervised Learning Method for Improving the Generalization of Speaker Verification System.
CoRR, 2021

Graph Attention Networks for Anti-Spoofing.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Adapting Speaker Embeddings for Speaker Diarisation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Three-Class Overlapped Speech Detection Using a Convolutional Recurrent Neural Network.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

DCASENET: An Integrated Pretrained Deep Neural Network for Detecting and Classifying Acoustic Scenes and Events.
Proceedings of the IEEE International Conference on Acoustics, 2021

Graph Attention Networks for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Capturing scattered discriminative information using a deep architecture in acoustic scene classification.
CoRR, 2020

Integrated Replay Spoofing-aware Text-independent Speaker Verification.
CoRR, 2020

Improved RawNet with Filter-wise Rescaling for Text-independent Speaker Verification using Raw Waveforms.
CoRR, 2020

A study on the role of subsidiary information in replay attack spoofing detection.
CoRR, 2020

Knowledge Distillation in Acoustic Scene Classification.
IEEE Access, 2020

Selective Deep Speaker Embedding Enhancement for Speaker Verification.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Self-Supervised Pre-Training with Acoustic Configurations for Replay Spoofing Detection.
Proceedings of the Interspeech 2020, 2020

Segment Aggregation for Short Utterances Speaker Verification Using Raw Waveforms.
Proceedings of the Interspeech 2020, 2020

Acoustic Scene Classification Using Audio Tagging.
Proceedings of the Interspeech 2020, 2020

Improved RawNet with Feature Map Scaling for Text-Independent Speaker Verification Using Raw Waveforms.
Proceedings of the Interspeech 2020, 2020

Audio Tag Representation Guided Dual Attention Network for Acoustic Scene Classification.
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019
Cosine similarity-based adversarial process.
CoRR, 2019

Replay Attack Detection with Complementary High-Resolution Information Using End-to-End DNN for the ASVspoof 2019 Challenge.
Proceedings of the Interspeech 2019, 2019

RawNet: Advanced End-to-End Deep Neural Network Using Raw Waveforms for Text-Independent Speaker Verification.
Proceedings of the Interspeech 2019, 2019

End-to-End Losses Based on Speaker Basis Vectors and All-Speaker Hard Negative Mining for Speaker Verification.
Proceedings of the Interspeech 2019, 2019

Acoustic Scene Classification Using Teacher-Student Learning with Soft-Labels.
Proceedings of the Interspeech 2019, 2019

Distilling the Knowledge of Specialist Deep Neural Networks in Acoustic Scene Classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Short Utterance Compensation in Speaker Verification via Cosine-Based Teacher-Student Learning of Speaker Embeddings.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Replay attack spoofing detection system using replay noise by multi-task learning.
CoRR, 2018

Replay Spoofing Detection System for Automatic Speaker Verification Using Multi-Task Learning of Noise Classes.
Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, 2018

Avoiding Speaker Overfitting in End-to-End DNNs Using Raw Waveform for Text-Independent Speaker Verification.
Proceedings of the Interspeech 2018, 2018

A Complete End-to-End Speaker Verification System Using Deep Neural Networks: From Raw Signals to Verification Result.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

DNN based multi-level feature ensemble for acoustic scene classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
Joint Training of Expanded End-to-End DNN for Text-Dependent Speaker Verification.
Proceedings of the Interspeech 2017, 2017

DNN-Based Audio Scene Classification for DCASE2017: Dual Input Features, Balancing Cost, and Stochastic Data Duplication.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2017

