Tatsuya Komatsu

Yusuke Kida

CoRR, 2022

MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition.

Jin Sakuma

Robin Scheibler

CoRR, 2022

Interdecoder: using Attention Decoders as Intermediate Regularization for CTC-Based Speech Recognition.

Yusuke Fujita

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Alternate Intermediate Conditioning with Syllable-Level and Character-Level Targets for Japanese ASR.

Yusuke Fujita

Yusuke Kida

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR.

Proceedings of the Interspeech 2022, 2022

Better Intermediates Improve CTC Inference.

Proceedings of the Interspeech 2022, 2022

Development of Estimation Systems of Calving Time Based on Time-Frequency Analysis for Ventral Tail Base Surface Temperature.

Proceedings of the 11th International Conference on Control, 2022

Self-Supervised Learning Method Using Multiple Sampling Strategies for General-Purpose Audio Representation.

Ibuki Kuroyanagi

Proceedings of the IEEE International Conference on Acoustics, 2022

Non-Autoregressive ASR with Self-Conditioned Folded Encoders.

Proceedings of the IEEE International Conference on Acoustics, 2022

Sound Event Localization and Detection with Pre-Trained Audio Spectrogram Transformer and Multichannel Separation Network.

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021

Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers.

Yusuke Kida

CoRR, 2021

Relaxing the Conditional Independence Assumption of CTC-Based ASR by Conditioning on Intermediate Predictions.

Jumon Nozaki

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Acoustic Event Detection with Classifier Chains.

Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS.

Proceedings of the IEEE International Conference on Acoustics, 2021

Multichannel Separation and Classification of Sound Events.

Robin Scheibler

Proceedings of the 29th European Signal Processing Conference, 2021

Multi-Source Domain Adaptation with Sinkhorn Barycenter.

Tomoko Matsui

Junbin Gao

Proceedings of the 29th European Signal Processing Conference, 2021

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Comparison of Low Complexity Self-Attention Mechanisms for Acoustic Event Detection.

Robin Scheibler

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Differentially Private Variational Autoencoders with Term-wise Gradient Aggregation.

CoRR, 2020

Unsupervised Training for Deep Speech Source Separation with Kullback-Leibler Divergence Based Probabilistic Loss Function.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Weakly-Supervised Sound Event Detection with Self-Attention.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Consistency-Aware Multi-Channel Speech Enhancement Using Deep Neural Networks.

Yoshiki Masuyama

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Scene-Dependent Acoustic Event Detection with Scene Conditioning and Fake-Scene-Conditioned Loss.

Keisuke Imoto

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Robust Acoustic Scene Classification to Multiple Devices Using Maximum Classifier Discrepancy and Knowledge Distillation.

Proceedings of the 28th European Signal Processing Conference, 2020

Sound Event Localization and Detection Using Convolutional Recurrent Neural Networks and Gated Linear Units.

Tsubasa Takahashi

Proceedings of the 28th European Signal Processing Conference, 2020

Conformer-Based Sound Event Detection with Semi-Supervised Learning and Data Augmentation.

Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Overview of Tasks and Investigation of Subjective Evaluation Methods in Environmental Sound Synthesis and Conversion.

CoRR, 2019

Fast Convergence Algorithm for State-Space Model Based Speech Dereverberation by Multi-Channel Non-Negative Matrix Factorization.

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Variational Bayesian Multi-Channel Speech Dereverberation Under Noisy Environments with Probabilistic Convolutive Transfer Function.

Proceedings of the Interspeech 2019, 2019

Multichannel Loss Function for Supervised Speech Source Separation by Mask-Based Beamforming.

Yoshiki Masuyama

Proceedings of the Interspeech 2019, 2019

Bayesian Non-parametric Multi-source Modelling Based Determined Blind Source Separation.

Chaitanya Narisetty

Proceedings of the IEEE International Conference on Acoustics, 2019

Scene-dependent Anomalous Acoustic-event Detection Based on Conditional Wavenet and I-vector.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

A Stereo Wind-Noise Suppressor with Null Beamforming and Frequency-Domain Noise Averaging.

Masanori Kato

Akihiko Sugiyama

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2018

Modelling of Sound Events with Hidden Imbalances Based on Clustering and Separate Sub-Dictionary Learning.

Chaitanya Narisetty

Proceedings of the 26th European Signal Processing Conference, 2018

Anomalous Sound Event Detection Based on WaveNet.

Proceedings of the 26th European Signal Processing Conference, 2018

Weakly Labeled Learning Using BLSTM-CTC for Sound Event Detection.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017

Detection of anomaly acoustic scenes based on a temporal dissimilarity model.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An acoustic monitoring system and its field trials.

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Acoustic event detection based on non-negative matrix factorization with mixtures of local dictionaries and activation aggregation.

Yuzo Senda