Yoichi Yamashita

Orcid: 0000-0001-5379-9686

According to our database, Yoichi Yamashita authored at least 84 papers between 1990 and 2023.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2023
RISC: A Corpus for Shout Type Classification and Shout Intensity Prediction.
CoRR, 2023

Environmental sound conversion from vocal imitations and sound event labels.
CoRR, 2023

Multi-Instruments Music Generation Based on Chord Input.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Universal Sound Separation Using Replay-based Data Sampling in Incremental Learning.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Investigating the Effectiveness of Speaker Embeddings for Shout Intensity Prediction.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
How Should We Evaluate Synthesized Environmental Sounds.
CoRR, 2022

Speech Emotion Recognition Using Label Smoothing Based on Neutral and Anger Characteristics.
Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Sound Event Detection Guided by Semantic Contexts of Scenes.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Joint Analysis of Sound Events and Acoustic Scenes Using Multitask Learning.
IEICE Trans. Inf. Syst., 2021

Onoma-to-wave: Environmental sound synthesis from onomatopoeic words.
CoRR, 2021

Sound Event Detection Based on Curriculum Learning Considering Learning Difficulty of Events.
Proceedings of the IEEE International Conference on Acoustics, 2021

Speech Emotion Recognition with Fusion of Acoustic- and Linguistic-Feature-Based Decisions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Sound Event Detection Using Duration Robust Loss Function.
CoRR, 2020

Sound Event Detection by Multitask Learning of Sound Events and Scenes with Soft Scene Labels.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Evaluation Metric of Sound Event Detection Considering Severe Misdetection by Scenes.
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis.
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Speech Enhancement for Optical Laser Microphone With Deep Neural Network.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Overview of Tasks and Investigation of Subjective Evaluation Methods in Environmental Sound Synthesis and Conversion.
CoRR, 2019

Joint Analysis of Acoustic Event and Scene Based on Multitask Learning.
CoRR, 2019

Joint Analysis of Acoustic Events and Scenes Based on Multitask Learning.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Characteristics Study of Dance-charts on Rhythm-based Video Games.
Proceedings of the IEEE Conference on Games, 2019

2018
Single-Channel Speech Enhancement With Phase Reconstruction Based on Phase Distortion Averaging.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Active Speech Obscuration with Speaker-dependent Human Speech-like Noise for Speech Privacy.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Phase reconstruction method based on time-frequency domain harmonic structure for speech enhancement.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Online sound structure analysis based on generative model of acoustic feature sequences.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2015
F0 Parameterization of Glottalized Tones in HMM-Based Speech Synthesis for Hanoi Vietnamese.
IEICE Trans. Inf. Syst., 2015

F0 parameterization of glottalized tones for HMM-based Vietnamese TTS.
Proceedings of the INTERSPEECH 2015, 2015

Robust sound image localization for moving listener with curved-type parametric loudspeaker.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Foreword.
IEICE Trans. Inf. Syst., 2014

Gaussian Mixture Model Learning for Desired Speech Discrimination Based on Kurtosis of Linear Prediction Residual Signals.
Proceedings of the Smart Digital Futures 2014, 2014

Close/distant talker discrimination based on kurtosis of linear prediction residual signals.
Proceedings of the IEEE International Conference on Acoustics, 2014

Reverberation steering and listening area expansion on 3-D sound field reproduction with parametric array loudspeaker.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
YLAB@RU at Spoken Term Detection Task in NTCIR-10 SpokenDoc-2.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Overview of the NTCIR-10 SpokenDoc-2 Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Weighted double sideband modulation toward high quality audible sound on parametric loudspeaker.
Proceedings of the IEEE International Conference on Acoustics, 2013

Interactive Acoustic Sound Field Reproduction with Web System for Gion Festival.
Proceedings of the 2013 International Conference on Culture and Computing, 2013

An active unpleasantness control system for indoor noise based on auditory masking.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Suitable spatial resolution at frequency bands based on variances of phase differences for real-time talker localization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Estimation of speech recognition performance in noisy and reverberant environments using PESQ score and acoustic parameters.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Digital Archive for Japanese Intangible Cultural Heritage Based on Reproduction of High-Fidelity Sound Field in Yamahoko Parade of Gion Festival.
Proceedings of the 13th ACIS International Conference on Software Engineering, 2012

Incorporating dynamic features into minimum generation error training for HMM-based speech synthesis.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

2011
YLAB@RU at Spoken Term Detection Task in NTCIR-9.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

2010
Multiple Sound Source Localization Based on Inter-Channel Correlation Using a Distributed Microphone System in a Real Environment.
IEICE Trans. Inf. Syst., 2010

Automatic prosodic labeling of accent information for Japanese spoken sentences.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Robust speaker localization in a disturbance noise environment using a distributed microphone system.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Constructing Japanese test collections for spoken term detection.
Proceedings of the INTERSPEECH 2010, 2010

2009
Construction of a Test Collection for Spoken Document Retrieval from Lecture Audio Data.
J. Inf. Process., 2009

A study on multiple sound source localization with a distributed microphone system.
Proceedings of the INTERSPEECH 2009, 2009

2008
Omnidirectional Audio-Visual Talker Localization Based on Dynamic Fusion of Audio-Visual Features Using Validity and Reliability Criteria.
IEICE Trans. Inf. Syst., 2008

Test Collections for Spoken Document Retrieval from Lecture Audio Data.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Localization of multiple sound sources based on inter-channel correlation using a distributed microphone system.
Proceedings of the INTERSPEECH 2008, 2008

2007
Noise-robust hands-free voice activity detection with adaptive zero crossing detection using talker direction estimation.
Proceedings of the INTERSPEECH 2007, 2007

Omnidirectional audio-visual talker localizer with dynamic feature fusion based on validity and reliability criteria.
Proceedings of the INTERSPEECH 2007, 2007

2006
Robust Talker Direction Estimation Based on Weighted CSP Analysis and Maximum Likelihood Estimation.
IEICE Trans. Inf. Syst., 2006

A design of robust omnidirectional audio-visual talker localizer.
Proceedings of the Tenth IASTED International Conference on Internet and Multimedia Systems and Applications (IMSA 2006), 2006

2005
Speech recognition using interphoneme dependencies based on a speaker space model.
Syst. Comput. Jpn., 2005

Automatic Scoring for Prosodic Proficiency of English Sentences Spoken by Japanese Based on Utterance Comparison.
IEICE Trans. Inf. Syst., 2005

A study of weighted CSP analysis with average speech spectrum for noise robust talker localization.
Proceedings of the INTERSPEECH 2005, 2005

2004
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents.
Proceedings of the Life-like characters - tools, affective functions, and applications, 2004

2003
Prediction of sentence importance for speech summarization using prosodic parameters.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Extraction of important sentences using F0 information for speech summarization.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Keyword spotting using F0 contour information.
Syst. Comput. Jpn., 2001

Stochastic F0 contour model based on the clustering of F0 shapes of a syntactic unit.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
An annotation scheme of spoken dialogues with topic break indexes.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Prediction of keyword spotting accuracy based on simulation.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Standardising annotation schemes for Japanese discourse.
Proceedings of the First International Conference on Language Resources and Evaluation, 1998

Topic recognition for news speech based on keyword spotting.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
Keyword spotting using F0 contour matching.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Prediction of F0 parameter of contextualized utterances in dialogue.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
An Utterance Prediction Method Based on the Topic Transition Model.
IEICE Trans. Inf. Syst., 1995

Modeling the contextual effects on prosody in dialog.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
Speech synthesis from concept representation in general speech output interface.
Syst. Comput. Jpn., 1994

Dialog context dependencies of utterances generated from concept representation.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Automatic generation of prosodic rules for speech synthesis.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Next utterance prediction based on two kinds of dialog models.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
A speech labeling system based on knowledge processing.
Syst. Comput. Jpn., 1992

Dialog management for speech output from concept representation.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

A powerful disambiguating mechanism for speech understanding systems based on ATMs.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

SOCS: A speech output system from concept representation.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1990
A Support System for Constructing Rule Base for Speech Synthesis by Rule. Automatic Extraction of Synthesis Rules.
Syst. Comput. Jpn., 1990

Concept description for synthetic speech output system.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990

A support environment based on rule interpreter for synthesis by rule.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Dialog management system mascots in speech understanding system.
Proceedings of the First International Conference on Spoken Language Processing, 1990

A speech labeling system based on knowledge processing.
Proceedings of the First International Conference on Spoken Language Processing, 1990
