Tatsuya Komatsu

According to our database1, Tatsuya Komatsu authored at least 50 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers.
CoRR, 2024

2023
Audio Difference Learning for Audio Captioning.
CoRR, 2023

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions.
CoRR, 2023

Analysis of Biological Data of Cows and Development of Detection Systems for Calving Phase.
Proceedings of the 49th Annual Conference of the IEEE Industrial Electronics Society, 2023

Neural Diarization with Non-Autoregressive Intermediate Attractors.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks.
CoRR, 2022

Multi-sequence Intermediate Conditioning for CTC-based ASR.
CoRR, 2022

MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition.
CoRR, 2022

Interdecoder: using Attention Decoders as Intermediate Regularization for CTC-Based Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Alternate Intermediate Conditioning with Syllable-Level and Character-Level Targets for Japanese ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR.
Proceedings of the Interspeech 2022, 2022

Better Intermediates Improve CTC Inference.
Proceedings of the Interspeech 2022, 2022

Development of Estimation Systems of Calving Time Based on Time-Frequency Analysis for Ventral Tail Base Surface Temperature.
Proceedings of the 11th International Conference on Control, 2022

Self-Supervised Learning Method Using Multiple Sampling Strategies for General-Purpose Audio Representation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Non-Autoregressive ASR with Self-Conditioned Folded Encoders.
Proceedings of the IEEE International Conference on Acoustics, 2022

Sound Event Localization and Detection with Pre-Trained Audio Spectrogram Transformer and Multichannel Seperation Network.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers.
CoRR, 2021

Relaxing the Conditional Independence Assumption of CTC-Based ASR by Conditioning on Intermediate Predictions.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Acoustic Event Detection with Classifier Chains.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multichannel Separation and Classification of Sound Events.
Proceedings of the 29th European Signal Processing Conference, 2021

Multi-Source Domain Adaptation with Sinkhorn Barycenter.
Proceedings of the 29th European Signal Processing Conference, 2021

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Comparison of Low Complexity Self-Attention Mechanisms for Acoustic Event Detection.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Differentially Private Variational Autoencoders with Term-wise Gradient Aggregation.
CoRR, 2020

Unsupervised Training for Deep Speech Source Separation with Kullback-Leibler Divergence Based Probabilistic Loss Function.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Weakly-Supervised Sound Event Detection with Self-Attention.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Consistency-Aware Multi-Channel Speech Enhancement Using Deep Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Scene-Dependent Acoustic Event Detection with Scene Conditioning and Fake-Scene-Conditioned Loss.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Robust Acoustic Scene Classification to Multiple Devices Using Maximum Classifier Discrepancy and Knowledge Distillation.
Proceedings of the 28th European Signal Processing Conference, 2020

Sound Event Localization and Detection Using Convolutional Recurrent Neural Networks and Gated Linear Units.
Proceedings of the 28th European Signal Processing Conference, 2020

Conformer-Based Sound Event Detection with Semi-Supervised Learning and Data Augmentation.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Overview of Tasks and Investigation of Subjective Evaluation Methods in Environmental Sound Synthesis and Conversion.
CoRR, 2019

Fast Convergence Algorithm for State-Space Model Based Speech Dereverberation by Multi-Channel Non-Negative Matrix Factorization.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Variational Bayesian Multi-Channel Speech Dereverberation Under Noisy Environments with Probabilistic Convolutive Transfer Function.
Proceedings of the Interspeech 2019, 2019

Multichannel Loss Function for Supervised Speech Source Separation by Mask-Based Beamforming.
Proceedings of the Interspeech 2019, 2019

Bayesian Non-parametric Multi-source Modelling Based Determined Blind Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Scene-dependent Anomalous Acoustic-event Detection Based on Conditional Wavenet and I-vector.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
A Stereo Wind-Noise Suppressor with Null Beamforming and Frequency-Domain Noise Averaging.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2018

Modelling of Sound Events with Hidden Imbalances Based on Clustering and Separate Sub-Dictionary Learning.
Proceedings of the 26th European Signal Processing Conference, 2018

Anomalous Sound Event Detection Based on WaveNet.
Proceedings of the 26th European Signal Processing Conference, 2018

Weakly Labeled Learning Using BLSTM-CTC for Sound Event Detection.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Detection of anomaly acoustic scenes based on a temporal dissimilarity model.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An acoustic monitoring system and its field trials.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Acoustic event detection based on non-negative matrix factorization with mixtures of local dictionaries and activation aggregation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Acoustic Event Detection Method Using Semi-Supervised Non-Negative Matrix Factorization with Mixtures of Local Dictionaries.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

2013
Computationally efficient single channel dereverberation based on complementary wiener filter.
Proceedings of the IEEE International Conference on Acoustics, 2013

Modeling head-related transfer functions via spatial-temporal Gaussian process.
Proceedings of the IEEE International Conference on Acoustics, 2013

1998
A 240-Mbps, 1-W CMOS EPRML read-channel LSI chip using an interleaved subranging pipeline A/D converter.
IEEE J. Solid State Circuits, 1998


  Loading...