Hakan Erdogan

Felix de Chaumont Quitry

Marco Tagliasacchi

Scott Wisdom

John R. Hershey

Proceedings of the Interspeech 2022, 2022

Adapting Speech Separation to Real-World Meetings using Mixture Invariant Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Continuous Speech Separation Using Speaker Inventory for Long Recording.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

What's all the Fuss about Free Universal Sound Separation Data?

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Sound Event Detection and Separation: A Benchmark on Desed Synthetic Soundscapes.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

End-To-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording.

[BibT_eX]

[DOI]

CoRR, 2020

Unsupervised Sound Separation Using Mixtures of Mixtures.

[BibT_eX]

[DOI]

CoRR, 2020

Unsupervised Sound Separation Using Mixture Invariant Training.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Performance Study of a Convolutional Time-Domain Audio Separation Network for Real-Time Speech Denoising.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving Sound Event Detection in Domestic Environments using Sound Separation.

[BibT_eX]

[DOI]

Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

2019

Using spatial overlap ratio of independent classifiers for likelihood map fusion in mean-shift tracking.

[BibT_eX]

[DOI]

Signal Image Video Process., 2019

Fixed-length asymmetric binary hashing for fingerprint verification through GMM-SVM based representations.

[BibT_eX]

[DOI]

Pattern Recognit., 2019

Universal Sound Separation.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Low-latency Speaker-independent Continuous Speech Separation.

[BibT_eX]

[DOI]

Dimitrios Dimitriadis

Proceedings of the IEEE International Conference on Acoustics, 2019

Single-channel Speech Extraction Using Speaker Inventory and Attention Network.

[BibT_eX]

[DOI]

Dimitrios Dimitriadis

Jasha Droppo

Yifan Gong

Proceedings of the IEEE International Conference on Acoustics, 2019

SDR - Half-baked or Well Done?

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2018, 2018

Investigations on Data Augmentation and Loss Functions for Deep Learning Based Speech-Background Separation.

[BibT_eX]

[DOI]

Takuya Yoshioka

Proceedings of the Interspeech 2018, 2018

Multi-Microphone Neural Speech Separation for Far-Field Multi-Talker Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Exploring Practical Aspects of Neural Mask-Based Beamforming for Far-Field Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Image noise level estimation based on higher-order statistics.

[BibT_eX]

[DOI]

Mostafa Mehdipour-Ghazi

Multim. Tools Appl., 2017

Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2017

PLDA-Based Diarization of Telephone Conversations.

[BibT_eX]

[DOI]

CoRR, 2017

Deep long short-term memory adaptive beamforming networks for multichannel robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Discriminative Beamforming with Phase-Aware Neural Networks for Speech Enhancement and Recognition.

[BibT_eX]

[DOI]

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

Deep Recurrent Networks for Separation and Recognition of Single-Channel Speech in Nonstationary Background Audio.

[BibT_eX]

[DOI]

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016

Tracklet clustering for robust multiple object tracking using distance dependent Chinese restaurant processes.

[BibT_eX]

[DOI]

Fatih Porikli

Signal Image Video Process., 2016

Improving A<sup>⋆</sup> OMP: Theoretical and empirical analyses with a novel dynamic cost model.

[BibT_eX]

[DOI]

Signal Process., 2016

Practical security and privacy attacks against biometric hashing using sparse recovery.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2016

Improved MVDR Beamforming Using Single-Channel Mask Prediction Networks.

[BibT_eX]

[DOI]

Proceedings of the Interspeech 2016, 2016

Deep beamforming networks for multi-channel speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Unpredictability assessment of biometric hashing under naive and advanced threat conditions.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

GMM-SVM Fingerprint Verification Based on Minutiae Only.

[BibT_eX]

[DOI]

Yusuf Ziya Isik

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

2015

THRIVE: threshold homomorphic encryption based secure and privacy preserving biometric verification system.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2015

Comments On "Multipath Matching Pursuit" by Kwon, Wang and Shim.

[BibT_eX]

[DOI]

CoRR, 2015

Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2015, 2015

Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

PLDA-based diarization of telephone conversations.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2015

Recognize and separate approach for speech denoising using nonnegative matrix factorization.

[BibT_eX]

[DOI]

Fahad Sohrab

Proceedings of the 23rd European Signal Processing Conference, 2015

S-vector: A discriminative representation derived from i-vector for speaker verification.

[BibT_eX]

[DOI]

Yusuf Ziya Isik

Ruhi Sarikaya

Proceedings of the 23rd European Signal Processing Conference, 2015

The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Source separation using regularized NMF with MMSE estimates under GMM priors with online learning for the uncertainties.

[BibT_eX]

[DOI]

Digit. Signal Process., 2014

Biohashing with Local Zernike Moments for face verification.

[BibT_eX]

[DOI]

Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Learning word representations for Turkish.

[BibT_eX]

[DOI]

Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

A Latent Dirichlet Allocation Based Front-End for Speaker Verification.

[BibT_eX]

[DOI]

Yusuf Ziya Isik

Ruhi Sarikaya

Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Deep neural networks for single channel source separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Counting people by clustering person detector outputs.

[BibT_eX]

[DOI]

Fatih Murat Porikli

Proceedings of the 11th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2014

2013

Linear classifier combination and selection using group sparse regularization and hinge loss.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2013

Compressed sensing signal recovery via forward-backward pursuit.

[BibT_eX]

[DOI]

Digit. Signal Process., 2013

Regularized nonnegative matrix factorization using Gaussian mixture priors for supervised single channel source separation.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2013

Improving A*OMP: Theoretical and Empirical Analyses With a Novel Dynamic Cost Model.

[BibT_eX]

[DOI]

CoRR, 2013

Optimal forward-backward pursuit for the sparse signal recovery problem.

[BibT_eX]

[DOI]

Proceedings of the 21st Signal Processing and Communications Applications Conference, 2013

Initialization of nonnegative matrix factorization dictionaries for single channel source separation.

[BibT_eX]

[DOI]

Proceedings of the 21st Signal Processing and Communications Applications Conference, 2013

Detecting and Tracking Unknown Number of Objects with Dirichlet Process Mixture Models and Markov Random Fields.

[BibT_eX]

[DOI]

Fatih Porikli

Proceedings of the Advances in Visual Computing - 9th International Symposium, 2013

Spectro-temporal post-enhancement using MMSE estimation in NMF based single-channel source separation.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2013, 2013

Discriminative nonnegative dictionary learning using cross-coherence penalties for single channel source separation.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2013, 2013

A mixed integer linear programming formulation for the sparse recovery problem in compressed sensing.

[BibT_eX]

[DOI]

S. Ilker Birbil

Proceedings of the IEEE International Conference on Acoustics, 2013

BioHashing with Fingerprint Spectral Minutiae.

[BibT_eX]

[DOI]

Proceedings of the 2013 BIOSIG, 2013

2012

Facial feature extraction using a probabilistic approach.

[BibT_eX]

[DOI]

Mustafa Berkay Yilmaz

Mustafa Unel

Signal Process. Image Commun., 2012

Error-Correcting Output Codes Guided Quantization for Biometric Hashing.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Discriminative Projection Selection Based Face Image Hashing.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

A* orthogonal matching pursuit: Best-first search for compressed sensing signal recovery.

[BibT_eX]

[DOI]

Digit. Signal Process., 2012

On the Theoretical Analysis of Orthogonal Matching Pursuit with Termination Based on the Residue

[BibT_eX]

[DOI]

CoRR, 2012

Analysis of accuracy and complexity of A* search for compressed sensing signal recovery.

[BibT_eX]

[DOI]

Proceedings of the 20th Signal Processing and Communications Applications Conference, 2012

Biometric hash: A study on statistical quantization methods.

[BibT_eX]

[DOI]

Proceedings of the 20th Signal Processing and Communications Applications Conference, 2012

Audio-visual speech recognition with background music using single-channel source separation.

[BibT_eX]

[DOI]

Proceedings of the 20th Signal Processing and Communications Applications Conference, 2012

SUTAV: A Turkish Audio-Visual Database.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Hidden Markov Models as Priors for Regularized Nonnegative Matrix Factorization in Single-Channel Source Separation.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2012, 2012

Gaussian Mixture Gain Priors for Regularized Nonnegative Matrix Factorization in Single-Channel Source Separation.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2012, 2012

A comparison of termination criteria for A∗OMP.

[BibT_eX]

[DOI]

Proceedings of the 20th European Signal Processing Conference, 2012

Forward-backward search for compressed sensing signal recovery.

[BibT_eX]

[DOI]

Proceedings of the 20th European Signal Processing Conference, 2012

Spectro-temporal post-smoothing in NMF based single-channel source separation.

[BibT_eX]

[DOI]

Proceedings of the 20th European Signal Processing Conference, 2012

2011

Bayesian Models and Algorithms for Protein β-Sheet Prediction.

[BibT_eX]

[DOI]

Zafer Aydin

Yucel Altunbasak

IEEE ACM Trans. Comput. Biol. Bioinform., 2011

Max-Margin Stacking and Sparse Regularization for Linear Classifier Combination and Selection

[BibT_eX]

[DOI]

CoRR, 2011

Single Channel Speech Music Separation Using Nonnegative Matrix Factorization with Sliding Windows and Spectral Masks.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2011, 2011

Adaptation of Speaker-Specific Bases in Non-Negative Matrix Factorization for Single Channel Speech-Music Separation.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2011, 2011

A face image hashing method based on optimal linear transform under colored Gaussian noise assumption.

[BibT_eX]

[DOI]

Mehmet Kivanç Mihçak

Proceedings of the 17th International Conference on Digital Signal Processing, 2011

Information theoretic capacity analysis for biometric hashing methods.

[BibT_eX]

[DOI]

Mehmet Kivanç Mihçak

Proceedings of the 17th International Conference on Digital Signal Processing, 2011

Single channel speech music separation using nonnegative matrix factorization and spectral masks.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Digital Signal Processing, 2011

Using multiple visual tandem streams in audio-visual speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Compressed sensing signal recovery via A* Orthogonal Matching Pursuit.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Developing a Scoring Function for NMR Structure-based Assignments using Machine Learning.

[BibT_eX]

[DOI]

Mehmet Serkan Apaydin

Proceedings of the Computer and Information Sciences, 2010

Decision Fusion for Patch-Based Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

A Unifying Framework for Learning the Linear Combiners for Classifier Ensembles.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

Semi-blind Speech-Music Separation Using Sparsity and Continuity Priors.

[BibT_eX]

[DOI]

Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009

Probabilistic Facial Feature Extraction Using Joint Distribution of Location and Texture Information.

[BibT_eX]

[DOI]

Mustafa Berkay Yilmaz

Mustafa Unel

Proceedings of the Advances in Visual Computing, 5th International Symposium, 2009

A Cancelable Biometric Hashing for Secure Biometric Verification System.

[BibT_eX]

[DOI]

Proceedings of the Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2009), 2009

2008

Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Using local temporal features of bounding boxes for walking/running classification.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

Evolving Implicit Polynomial Interfaces.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2008, Leeds, UK, September 2008, 2008

Lip segmentation using adaptive color space training.

[BibT_eX]

[DOI]

Erol Ozgur

Mustafa Berkay Yilmaz

Harun Karabalkan

Mustafa Unel

Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007

Bayesian Protein Secondary Structure Prediction With Near-Optimal Segmentations.

[BibT_eX]

[DOI]

Zafer Aydin

Yucel Altunbasak

IEEE Trans. Signal Process., 2007

Protein Fold Recognition using Residue-Based Alignments of Sequence and Secondary Structure.

[BibT_eX]

[DOI]

Zafer Aydin

Yucel Altunbasak

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Multimodal Person Recognition for Human-Vehicle Interaction.

[BibT_eX]

[DOI]

IEEE Multim., 2006

2005

Semantic confidence measurement for spoken dialog systems.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2005

Using semantic analysis to improve speech recognition performance.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2005

Multi-modal Person Recognition for Vehicular Applications.

[BibT_eX]

[DOI]

Proceedings of the Multiple Classifier Systems, 6th International Workshop, 2005

Regularizing linear discriminant analysis for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the INTERSPEECH 2005, 2005

An online handwriting recognition system for Turkish.

[BibT_eX]

[DOI]

Proceedings of the Document Recognition and Retrieval XII, 2005

2004

Filler model based confidence measures for spoken dialogue systems: a case study for Turkish.

[BibT_eX]

[DOI]

Aydin Akyol

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2002

Incremental on-line feature space MLLR adaptation for telephony speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Semantic structured language models.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Turn-Based Language Modeling for spoken dialog systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Recent advances in speech recognition system for IBM DARPA communicator.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Innovative approaches for large vocabulary name recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2001

Rapid adaptation using penalized-likelihood methods.

[BibT_eX]

[DOI]

Yuqing Gao

Michael Picheny

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Exact distribution of edge-preserving MAP estimators for linear signal models with Gaussian measurement noise.

[BibT_eX]

[DOI]

Wei Biao Wu

IEEE Trans. Image Process., 2000

Weighted pairwise scatter to improve linear discriminant analysis.

[BibT_eX]

[DOI]

Yongxin Li

Yuqing Gao

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Algorithms for joint estimation of attenuation and emission images in PET.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Fast Monotonic Algorithms for Transmission Tomography.

[BibT_eX]

[DOI]

IEEE Trans. Medical Imaging, 1999

1998

Accelerated Monotonic Algorithms for Transmission Tomography.

[BibT_eX]

[DOI]