Hiroshi G. Okuno

AI Soc., October, 2025

Locating Survivors' Voices in Disaster Sites Using Quadcopters Based on Modeling Complicated Environments by PyRoomAcoustics and SSL by MUSIC-based Algorithms.

[BibT_eX]

[DOI]

Proceedings of the IEEE/SICE International Symposium on System Integration, 2025

2023

Extracting Bird Vocalizations from a Complex Natural Soundscape in Forests Using Robot Audition Techniques.

[BibT_eX]

[DOI]

Proceedings of the IEEE/SICE International Symposium on System Integration, 2023

2022

Auditory Survey of Endangered Eurasian Bittern Using Microphone Arrays and Robot Audition.

[BibT_eX]

[DOI]

Frontiers Robotics AI, 2022

2021

Visualizing Directional Soundscapes of Bird Vocalizations Using Robot Audition Techniques.

[BibT_eX]

[DOI]

Proceedings of the IEEE/SICE International Symposium on System Integration, 2021

Observing Nocturnal Birds Using Localization Techniques.

[BibT_eX]

[DOI]

Proceedings of the IEEE/SICE International Symposium on System Integration, 2021

Alternating Drive-and-Glide Flight Navigation of a Kiteplane for Sound Source Position Estimation.

[BibT_eX]

[DOI]

Makoto Kumon

Shuichi Tajima

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

2020

Multiple Sound Source Position Estimation by Drone Audition Based on Data Association Between Sound Source Localization and Identification.

[BibT_eX]

[DOI]

Mizuho Wakabayashi

Makoto Kumon

IEEE Robotics Autom. Lett., 2020

Drone audition listening from the sky estimates multiple sound source positions by integrating sound source localization and data association.

[BibT_eX]

[DOI]

Mizuho Wakabayashi

Makoto Kumon

Adv. Robotics, 2020

Multi-hop wireless command and telemetry communication system for remote operation of robots with extending operation area beyond line-of-sight using 920 MHz/169 MHz.

[BibT_eX]

[DOI]

Adv. Robotics, 2020

Robot Audition and Computational Auditory Scene Analysis.

[BibT_eX]

[DOI]

Adv. Intell. Syst., 2020

Design and Implementation of Real-Time Visualization of Sound Source Positions by Drone Audition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/SICE International Symposium on System Integration, 2020

Soundscape Analysis of Bird Songs in Forests Using Microphone Arrays.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/SICE International Symposium on System Integration, 2020

Computational Design of Balanced Open Link Planar Mechanisms with Counterweights from User Sketches.

[BibT_eX]

[DOI]

Bernhard Thomaszewski

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

2019

Recent R&D Technologies and Future Prospective of Flying Robot in Tough Robotics Challenge.

[BibT_eX]

[DOI]

Proceedings of the Disaster Robotics - Results from the ImPACT Tough Robotics Challenge, 2019

Development of Tough Snake Robot Systems.

[BibT_eX]

[DOI]

Proceedings of the Disaster Robotics - Results from the ImPACT Tough Robotics Challenge, 2019

ImPACT-TRC Thin Serpentine Robot Platform for Urban Search and Rescue.

[BibT_eX]

[DOI]

Proceedings of the Disaster Robotics - Results from the ImPACT Tough Robotics Challenge, 2019

Computational Design of Statically Balanced Planar Spring Mechanisms.

[BibT_eX]

[DOI]

Bernhard Thomaszewski

IEEE Robotics Autom. Lett., 2019

An Integrated Framework for Field Recording, Localization, Classification and Annotation of Birdsongs Using Robot Audition Techniques - Harkbird 2.0.

[BibT_eX]

[DOI]

Hiroshi Gitchang Okuno

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Speech Enhancement Based on Bayesian Low-Rank and Sparse Decomposition of Multichannel Magnitude Spectrograms.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Assessment of MUSIC-Based Noise-Robust Sound Source Localization with Active Frequency Range Filtering.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2018

Design and Implementation of Programmable Drawing Automata based on Cam Mechanisms for Representing Spatial Trajectory.

[BibT_eX]

[DOI]

Takuto Takahashi

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Extracting the Relationship between the Spatial Distribution and Types of Bird Vocalizations Using Robot Audition System HARK.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

2017

Design of UAV-Embedded Microphone Array System for Sound Source Localization in Outdoor Environments.

[BibT_eX]

[DOI]

Sensors, 2017

Development of a Robotic Pet Using Sound Source Localization with the HARK Robot Audition System.

[BibT_eX]

[DOI]

Ryo Suzuki

Takuto Takahashi

J. Robotics Mechatronics, 2017

Influence of Different Impulse Response Measurement Signals on MUSIC-Based Sound Source Localization.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

HARKBird: Exploring Acoustic Interactions in Bird Communities Using a Microphone Array.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

Editorial: Robot Audition Technologies.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

Development, Deployment and Applications of Robot Audition Open Source Software HARK.

[BibT_eX]

[DOI]

Takeshi Mizumoto

J. Robotics Mechatronics, 2017

Swarm of Sound-to-Light Conversion Devices to Monitor Acoustic Communication Among Small Nocturnal Animals.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

Acoustic Monitoring of the Great Reed Warbler Using Multiple Microphone Arrays and Robot Audition.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

Low Latency and High Quality Two-Stage Human-Voice-Enhancement System for a Hose-Shaped Rescue Robot.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

Size Effect on Call Properties of Japanese Tree Frogs Revealed by Audio-Processing Technique.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2017

Development of microphone-array-embedded UAV for search and rescue task.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

2016

Robust Recognition of Simultaneous Speech By a Mobile Robot.

[BibT_eX]

[DOI]

CoRR, 2016

Sound-based online localization for an in-pipe snake robot.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Symposium on Safety, 2016

Parallel Speech Corpora of Japanese Dialects.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Localizing Bird Songs Using an Open Source Robot Audition System with a Microphone Array.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Call Alternation Between Specific Pairs of Male Frogs Revealed by a Sound-Imaging Method in Their Natural Habitat.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Variational Bayesian multi-channel robust NMF for human-voice enhancement with a deformable and partially-occluded microphone array.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

2015

Automatic Speech Recognition for Mixed Dialect Utterances by Mixing Dialect Language Models.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2015

HMM-based Attacks on Google's ReCAPTCHA with Continuous Visual and Audio Symbols.

[BibT_eX]

[DOI]

J. Inf. Process., 2015

A Recipe for Empathy - Integrating the Mirror System, Insula, Somatosensory Cortex and Motherese.

[BibT_eX]

[DOI]

Int. J. Soc. Robotics, 2015

Beat Tracking for Interactive Dancing Robots.

[BibT_eX]

[DOI]

Int. J. Humanoid Robotics, 2015

Bayesian Audio-to-Score Alignment Based on Joint Inference of Timbre, Volume, Tempo, and Note Onset Timings.

[BibT_eX]

[DOI]

Akira Maezawa

Comput. Music. J., 2015

Toward a quizmaster robot for speech-based multiparty interaction.

[BibT_eX]

[DOI]

Adv. Robotics, 2015

Preferential training of neurodynamical model based on predictability of target dynamics.

[BibT_eX]

[DOI]

Adv. Robotics, 2015

Posture estimation of hose-shaped robot by using active microphone array.

[BibT_eX]

[DOI]

Adv. Robotics, 2015

Audio-visual speech recognition using deep learning.

[BibT_eX]

[DOI]

Appl. Intell., 2015

Improved sound source localization in horizontal plane for binaural robot audition.

[BibT_eX]

[DOI]

Appl. Intell., 2015

Unified inter- and intra-recording duration model for multiple music audio alignment.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015

Human-voice enhancement based on online RPCA for a hose-shaped rescue robot with a microphone array.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Safety, 2015

Microphone-accelerometer based 3D posture estimation for a hose-shaped rescue robot.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Robot audition: Its rise and perspectives.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Challenges in deploying a microphone array to localize and separate sound sources in real auditory scenes.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Recognition of In-Field Frog Chorusing Using Bayesian Nonparametric Microphone Array Processing.

[BibT_eX]

[DOI]

Hiroshi Gitchang Okuno

Proceedings of the Computational Sustainability, 2015

2014

Multichannel sound source dereverberation and separation for arbitrary number of sources based on Bayesian nonparametrics.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Bayesian Nonparametrics for Microphone Array Processing.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Nonparametric Bayesian dereverberation of power spectrograms based on infinite-order autoregressive processes.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

The MEI Robot: Towards Using Motherese to Develop Multimodal Emotional Intelligence.

[BibT_eX]

[DOI]

IEEE Trans. Auton. Ment. Dev., 2014

The interaction between a robot and multiple people based on spatially mapping of friendliness and motion parameters.

[BibT_eX]

[DOI]

Tsuyoshi Tasaki

Adv. Robotics, 2014

Applying intrinsic motivation for visuomotor learning of robot arm motion.

[BibT_eX]

[DOI]

Proceedings of the 11th International Conference on Ubiquitous Robots and Ambient Intelligence, 2014

A sound-based online method for estimating the time-varying posture of a hose-shaped robot.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Symposium on Safety, 2014

Sound annotation tool for multidirectional sounds based on spatial information extracted by HARK robot audition software.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Systems, Man, and Cybernetics, 2014

Bayesian Audio Alignment based on a Unified Model of Music Composition and Performance.

[BibT_eX]

[DOI]

Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Making a robot dance to diverse musical genre in noisy environments.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Visualization of auditory awareness based on sound source positions estimated by depth sensor and microphone array.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Lipreading using convolutional neural network.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Transferring Vocal Expression of F0 Contour Using Singing Voice Synthesizer.

[BibT_eX]

[DOI]

Yukara Ikemiya

Proceedings of the Modern Advances in Applied Intelligence, 2014

Insertion of pause in drawing from babbling for robot's developmental imitation learning.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Parameter Estimation of Virtual Musical Instrument Synthesizers.

[BibT_eX]

[DOI]

Proceedings of the Music Technology meets Philosophy, 2014

Automatic transcription of guitar tablature from audio signals in accordance with player's proficiency.

[BibT_eX]

[DOI]

Kazuki Yazawa

Proceedings of the IEEE International Conference on Acoustics, 2014

Audio part mixture alignment based on hierarchical nonparametric Bayesian model of musical audio sequence collection.

[BibT_eX]

[DOI]

Akira Maezawa

Proceedings of the IEEE International Conference on Acoustics, 2014

Transcribing vocal expression from polyphonic music.

[BibT_eX]

[DOI]

Yukara Ikemiya

Proceedings of the IEEE International Conference on Acoustics, 2014

A robot quizmaster that can localize, separate, and recognize simultaneous utterances for a fastest-voice-first quiz game.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

2013

Robust Multipitch Analyzer against Initialization based on Latent Harmonic Allocation using Overtone Corpus.

[BibT_eX]

[DOI]

Inf. Media Technol., 2013

Nonparametric Bayesian sparse factor analysis for frequency domain blind source separation without permutation ambiguity.

[BibT_eX]

[DOI]

Kohei Nagira

Takuma Otsuka

EURASIP J. Audio Speech Music. Process., 2013

A real-time super-resolution robot audition system that improves the robustness of simultaneous speech recognition.

[BibT_eX]

[DOI]

Keisuke Nakamura

Adv. Robotics, 2013

Improved binaural sound localization and tracking for unknown time-varying number of speakers.

[BibT_eX]

[DOI]

Adv. Robotics, 2013

Robust localization and tracking of multiple speakers in real environments for binaural robot audition.

[BibT_eX]

[DOI]

Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

Developmental Human-Robot Imitation Learning of Drawing with a Neuro Dynamical System.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Systems, 2013

Learning and association of synaesthesia phenomenon using deep neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE/SICE International Symposium on System Integration, 2013

Integration of behaviors and languages with a hierarchal structure self-organized in a neuro-dynamical model.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE Workshop on Robotic Intelligence In Informationally Structured Space, 2013

Solving Google's Continuous Audio CAPTCHA with HMM-Based Automatic Speech Recognition.

[BibT_eX]

[DOI]

Shotaro Sano

Takuma Otsuka

Proceedings of the Advances in Information and Computer Security, 2013

Noise correlation matrix estimation for improving sound source localization by multirotor UAV.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Posture estimation of hose-shaped robot using microphone array localization.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Automatic estimation of dialect mixing ratio for dialect speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Improved Sound Source Localization and Front-Back Disambiguation for Humanoid Robots with Two Ears.

[BibT_eX]

[DOI]

Proceedings of the Recent Trends in Applied Artificial Intelligence, 2013

Proposal of International Conference Promotion: Destination Branding and Risk Management by a Network of Conference Centres.

[BibT_eX]

[DOI]

Mayumi J. Hikita

Proceedings of the Serviceology for Services, 2013

Hands-free human-robot communication robust to speaker's radial position.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Audio-based guitar tablature transcription using multipitch analysis and playability constraints.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Initialization-robust Bayesian multipitch analyzer based on psychoacoustical and musical criteria.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Multiple index combination for Japanese spoken term detection with optimum index selection based on OOV-region classifier.

[BibT_eX]

[DOI]

Naoyuki Kanda

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Adaptive Pitch Control for Robot Thereminist Using Unscented Kalman Filter.

[BibT_eX]

[DOI]

Proceedings of the Modern Advances in Intelligent Systems and Tools, 2012

Tool-Body Assimilation of Humanoid Robot Using a Neurodynamical System.

[BibT_eX]

[DOI]

IEEE Trans. Auton. Ment. Dev., 2012

Efficient Blind Dereverberation and Echo Cancellation Based on Independent Component Analysis for Actual Acoustic Signals.

[BibT_eX]

[DOI]

Neural Comput., 2012

Automatic Allocation of Training Data for Speech Understanding Based on Multiple Model Combinations.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

Towards expressive musical robots: a cross-modal framework for emotional gesture, voice and music.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2012

A multimodal tempo and beat-tracking system based on audiovisual information from live guitar performances.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2012

Automated Violin Fingering Transcription Through Analysis of an Audio Recording.

[BibT_eX]

[DOI]

Comput. Music. J., 2012

A Musical Robot that Synchronizes with a Coplayer Using Non-Verbal Cues.

[BibT_eX]

[DOI]

Adv. Robotics, 2012

Infinite Sparse Factor Analysis for Blind Source Separation in Reverberant Environments.

[BibT_eX]

[DOI]

Kohei Nagira

Takuma Otsuka

Proceedings of the Structural, Syntactic, and Statistical Pattern Recognition, 2012

An active audition framework for auditory-driven HRI: Application to interactive robot dancing.

[BibT_eX]

[DOI]

Proceedings of the 21st IEEE International Symposium on Robot and Human Interactive Communication, 2012

Bayesian Nonnegative Harmonic-Temporal Factorization and Its Application to Multipitch Analysis.

[BibT_eX]

[DOI]

Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Sound sources selection system by using onomatopoeic querries from multiple sound sources.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Unified auditory functions based on Bayesian topic model.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Live assessment of beat tracking for robot audition.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Who is the leader in a multiperson ensemble? - Multiperson human-robot ensemble model with leaderness -.

[BibT_eX]

[DOI]

Takeshi Mizumoto

Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Body area segmentation from visual scene based on predictability of neuro-dynamical system.

[BibT_eX]

[DOI]

Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012

Self-organization of object features representing motion using Multiple Timescales Recurrent Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012

Automatic Chord Recognition Based on Probabilistic Integration of Acoustic Features, Bass Sounds, and Chord Transition.

[BibT_eX]

[DOI]

Proceedings of the Advanced Research in Applied Artificial Intelligence, 2012

Incremental probabilistic geometry estimation for robot scene understanding.

[BibT_eX]

[DOI]

Louis-Kenzo Cahier

Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Initialization-robust multipitch estimation based on latent harmonic allocation using overtone corpus.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Complex Extension of Infinite Sparse Factor Analysis for Blind Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2012

A GMM Sound Source Model for Blind Speech Separation in Under-determined Conditions.

[BibT_eX]

[DOI]

Proceedings of the Latent Variable Analysis and Signal Separation, 2012

Improvement of audio-visual score following in robot ensemble with human guitarist.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012), Osaka, Japan, November 29, 2012

Using Speech Data to Recognize Emotion in Human Gait.

[BibT_eX]

[DOI]

Proceedings of the Human Behavior Understanding - Third International Workshop, 2012

Statistical Method of Building Dialect Language Models for ASR Systems.

[BibT_eX]

[DOI]

Naoki Hirayama

Shinsuke Mori

Proceedings of the COLING 2012, 2012

Bayesian Unification of Sound Source Localization and Separation with Permutation Resolution.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011

Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Emergence of hierarchical structure mirroring linguistic composition in a recurrent neural network.

[BibT_eX]

[DOI]

Neural Networks, 2011

A multi-expert model for dialogue and behavior control of conversational robots and agents.

[BibT_eX]

[DOI]

Knowl. Based Syst., 2011

LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2011

People Detection Based on Spatial Mapping of Friendliness and Floor Boundary Points for a Mobile Navigation Robot.

[BibT_eX]

[DOI]

J. Robotics, 2011

Real-Time Audio-to-Score Alignment Using Particle Filter for Coplayer Music Robots.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2011

Classification of Known and Unknown Environmental Sounds Based on Self-Organized Space Using a Recurrent Neural Network.

[BibT_eX]

[DOI]

Adv. Robotics, 2011

Towards Written Text Recognition Based on Handwriting Experiences Using a Recurrent Neural Network.

[BibT_eX]

[DOI]

Adv. Robotics, 2011

Handwriting prediction based character recognition using recurrent neural network.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Systems, 2011

A Two-Stage Domain Selection Framework for Extensible Multi-Domain Spoken Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the SIGDIAL 2011 Conference, 2011

A musical mood trajectory estimation method using lyrics and acoustic features.

[BibT_eX]

[DOI]

Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategies, Scottsdale, AZ, USA, November 28, 2011

Evaluation of Spoken Dialogue System that uses Utterance Timing to Interpret User Utterances.

[BibT_eX]

[DOI]

Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Incremental Bayesian Audio-to-Score Alignment with Flexible Harmonic Structure Models.

[BibT_eX]

[DOI]

Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Improvement of speaker localization by considering multipath interference of sound wave for binaural robot audition.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Particle-filter based audio-visual beat-tracking for music robot ensemble with human guitarist.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Bayesian Extension of MUSIC for Sound Source Localization and Tracking.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Fast and Simple Iterative Algorithm of Lp-Norm Minimization for Under-Determined Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Environmental Sound Recognition for Robot Audition Using Matching-Pursuit.

[BibT_eX]

[DOI]

Proceedings of the Modern Approaches in Applied Intelligence, 2011

Robot with Two Ears Listens to More than Two Simultaneous Utterances by Exploiting Harmonic Structures.

[BibT_eX]

[DOI]

Proceedings of the Modern Approaches in Applied Intelligence, 2011

Design and implementation of selectable sound separation on the Texai telepresence system using HARK.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2011

Use of a Sparse Structure to Improve Learning Performance of Recurrent Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 18th International Conference, 2011

I-Divergence-based dereverberation method with auxiliary function approach.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Polyphonic audio-to-score alignment based on Bayesian Latent Harmonic Allocation Hidden Markov Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Simultaneous processing of sound source separation and musical instrument identification using Bayesian spectral modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Cluster Self-organization of Known and Unknown Environmental Sounds Using Recurrent Neural Network.

[BibT_eX]

[DOI]

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2011, 2011

Converting emotional voice to motion for robot telepresence.

[BibT_eX]

[DOI]

Proceedings of the 11th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2011), 2011

2010

A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2010

Inter-modality mapping in robot with recurrent neural network.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2010

Soft missing-feature mask generation for robot audition.

[BibT_eX]

[DOI]

Paladyn J. Behav. Robotics, 2010

Voice-awareness control for a humanoid robot consistent with its body posture and movements.

[BibT_eX]

[DOI]

Paladyn J. Behav. Robotics, 2010

Selecting Help Messages by Using Robust Grammar Verification for Handling Out-of-Grammar Utterances in Spoken Dialogue Systems.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Design and Implementation of Robot Audition System 'HARK' - Open Source Software for Listening to Three Simultaneous Speakers.

[BibT_eX]

[DOI]

Adv. Robotics, 2010

Human-robot cooperation in arrangement of objects using confidence measure of neuro-dynamical system.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Systems, 2010

Online Error Detection of Barge-In Utterances by Using Individual Users' Utterance Histories in Spoken Dialogue System.

[BibT_eX]

[DOI]

Proceedings of the SIGDIAL 2010 Conference, 2010

Query-by-conducting: An Interface to Retrieve Classical-music Interpretations by Real-time Tempo Input.

[BibT_eX]

[DOI]

Akira Maezawa

Proceedings of the 11th International Society for Music Information Retrieval Conference, 2010

Two-layered audio-visual speech recognition for robots in noisy environments.

[BibT_eX]

[DOI]

Takami Yoshida

Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Speedup and performance improvement of ICA-based robot audition by parallel and resampling-based block-wise processing.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

An improvement in automatic speech recognition using soft missing feature masks for robot audition.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Motion generation based on reliable predictability using self-organized object features.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Human-robot ensemble between robot thereminist and human percussionist using coupled oscillator model.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Robot musical accompaniment: integrating audio and visual cues for real-time synchronization with a human flutist.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Exploiting harmonic structures to improve separating simultaneous speech in under-determined conditions.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Effects of modelling within- and between-frame temporal variations in power spectra on non-verbal sound recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Analyzing user utterances in barge-in-able spoken dialogue system for improving identification accuracy.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

An Improvement in Audio-Visual Voice Activity Detection for Automatic Speech Recognition.

[BibT_eX]

[DOI]

Takami Yoshida

Proceedings of the Trends in Applied Intelligent Systems, 2010

System for Supporting Web-based Public Debate Using Transcripts of Face-to-Face Meeting.

[BibT_eX]

[DOI]

Proceedings of the Trends in Applied Intelligent Systems, 2010

Music-Ensemble Robot That Is Capable of Playing the Theremin While Listening to the Accompanied Music.

[BibT_eX]

[DOI]

Proceedings of the Trends in Applied Intelligent Systems, 2010

Improving Identification Accuracy by Extending Acceptable Utterances in Spoken Dialogue System Using Barge-in Timing.

[BibT_eX]

[DOI]

Proceedings of the Trends in Applied Intelligent Systems, 2010

Violin Fingering Estimation Based on Violin Pedagogical Fingering Model Constrained by Bowed Sequence Estimation from Audio Input.

[BibT_eX]

[DOI]

Proceedings of the Trends in Applied Intelligent Systems, 2010

Recognition and Generation of Sentences through Self-organizing Linguistic Hierarchy Using MTRNN.

[BibT_eX]

[DOI]

Proceedings of the Trends in Applied Intelligent Systems, 2010

Upper-limit evaluation of robot audition based on ICA-BSS in multi-source, barge-in and highly reverberant conditions.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Improvement in listening capability for humanoid robot HRP-2.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure.

[BibT_eX]

[DOI]

Takuya Yoshioka

Proceedings of the IEEE International Conference on Acoustics, 2010

Music dereverberation using harmonic structure source model and Wiener filter.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Automatic Allocation of Training Data for Rapid Prototyping of Speech Understanding based on Multiple Model Combination.

[BibT_eX]

[DOI]

Proceedings of the COLING 2010, 2010

Design and Implementation of Two-level Synchronization for Interactive Music Robot.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

A Corpus-Based Analysis of Coreferential Recency Effect in Japanese Discourse for Tracking Dynamic Topic.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE/ACIS International Conference on Computer and Information Science, 2010

2009

Autonomous Motion Generation Based on Reliable Predictability.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2009

Parameter Estimation for Harmonic and Inharmonic Models by Using Timbre Feature Distributions.

[BibT_eX]

[DOI]

Inf. Media Technol., 2009

Self-organization of Dynamic Object Features Based on Bidirectional Training.

[BibT_eX]

[DOI]

Adv. Robotics, 2009

Human Tracking System Integrating Sound and Face Localization Using an Expectation-Maximization Algorithm in Real Environments.

[BibT_eX]

[DOI]

Adv. Robotics, 2009

Target Speech Detection and Separation for Communication with Humanoid Robots in Noisy Home Environments.

[BibT_eX]

[DOI]

Adv. Robotics, 2009

Statistical models for speech dereverberation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

A novel framework for recognizing phonemes of singing voice in polyphonic music.

[BibT_eX]

[DOI]

Hiromasa Fujihara

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

A Model of Temporally Changing User Behaviors in a Deployed Spoken Dialogue System.

[BibT_eX]

[DOI]

Proceedings of the User Modeling, 2009

Ranking Help Message Candidates Based on Robust Grammar Verification Results and Utterance History in Spoken Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the SIGDIAL 2009 Conference, 2009

A Speech Understanding Framework that Uses Multiple Language Models and Multiple Understanding Models.

[BibT_eX]

[DOI]

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Changing timbre and phrase in existing musical performances as you like: manipulations of single part using harmonic and inharmonic models.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Multimedia 2009, 2009

Robot Audition: Missing Feature Theory Approach and Active Audition.

[BibT_eX]

[DOI]

Hyun-Don Kim

Proceedings of the Robotics Research - The 14th International Symposium, 2009

Bowed String Sequence Estimation of a Violin Based on Adaptive Audio Signal Classification and Context-Dependent Error Correction.

[BibT_eX]

[DOI]

Proceedings of the 11th IEEE International Symposium on Multimedia, 2009

Step-size parameter adaptation of multi-channel semi-blind ICA with piecewise linear model for barge-in-able robot audition.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Missing-feature-theory-based robust simultaneous speech recognition system with non-clean speech acoustic model.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Incremental polyphonic audio to score alignment using beat tracking for singer robots.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Modeling tool-body assimilation using second-order Recurrent Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Thereminist robot: Development of a robot theremin player with feedforward and feedback arm control based on a Theremin's pitch model.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Phoneme acquisition model based on vowel imitation using Recurrent Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Emergence of evolutionary interaction with voice and motion between two robots using RNN.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Enabling a user to specify an item at any time during system enumeration - item identification for barge-in-able conversational dialogue systems.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Improving speech understanding accuracy with limited training data using multiple language models and multiple understanding models.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Adjusting Occurrence Probabilities of Automatically-Generated Abbreviated Words in Spoken Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the Next-Generation Applied Intelligence, 2009

Prediction and imitation of other's motions by reusing own forward-inverse model in robots.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

Continuous vocal imitation with self-organized vowel spaces in Recurrent Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

ICA-based efficient blind dereverberation and echo cancellation method for barge-in-able robot audition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Automatic speech recognition improved by two-layered audio-visual integration for robot audition.

[BibT_eX]

[DOI]

Takami Yoshida

Proceedings of the 9th IEEE-RAS International Conference on Humanoid Robots, 2009

Automatic estimation of reverberation time with robot speech to improve ICA-based robot audition.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE-RAS International Conference on Humanoid Robots, 2009

Voice quality manipulation for humanoid robots consistent with their head movements.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE-RAS International Conference on Humanoid Robots, 2009

Development of a Meeting Browser towards Supporting Public Involvement.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009

2008

A game-theoretic model of referential coherence and its empirical verification using large Japanese and English corpora.

[BibT_eX]

[DOI]

ACM Trans. Speech Lang. Process., 2008

An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

Managing out-of-grammar utterances by topic estimation with domain extensibility in multi-domain spoken dialogue systems.

[BibT_eX]

[DOI]

Speech Commun., 2008

Cheek to Chip: Dancing Robots and AI's Future.

[BibT_eX]

[DOI]

Jean-Julien Aucouturier

IEEE Intell. Syst., 2008

SalienceGraph: Visualizing Salience Dynamics of Written Discourse by Using Reference Probability and PLSA.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2008: Trends in Artificial Intelligence, 2008

3D Auditory Scene Visualizer with Face Tracking: Design and Implementation for Auditory Awareness Compensation.

[BibT_eX]

[DOI]

Proceedings of the ISUC 2008, 2008

Automatic Chord Recognition Based on Probabilistic Integration of Chord Transition and Bass Pitch Estimation.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2008, 2008

A Robot Singer with Music Recognition Based on Real-Time Beat Tracking.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2008, 2008

Instrument Equalizer for Query-by-Example Retrieval: Improving Sound Source Separation Based on Integrated Harmonic and Inharmonic Models.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2008, 2008

Design and Implementation of 3D Auditory Scene Visualizer towards Auditory Awareness with Face Tracking.

[BibT_eX]

[DOI]

Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008

Barge-in-able robot audition based on ICA and missing feature theory under semi-blind situation.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Active sensing based dynamical object feature extraction.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

A robot uses its own microphone to synchronize its steps to musical beats while scatting and singing.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

A robot listens to music and counts its beats aloud by separating music from counting voice.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Design and evaluation of two-channel-based sound source localization over entire azimuth range for moving talkers.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Target speech detection and separation for humanoid robots in sparse dialogue with noisy home environments.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Segmenting acoustic signal with articulatory movement using Recurrent Neural Network for phoneme acquisition.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Soft missing-feature mask generation for simultaneous speech recognition system in robots.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Predicting ASR errors by exploiting barge-in rate of individual users for spoken dialogue systems.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Expanding vocabulary for recognizing user's abbreviations of proper nouns without increasing ASR error rates in spoken dialogue systems.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Extensibility verification of robust domain selection against out-of-grammar utterances in multi-domain spoken dialogue system.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Rapid Prototyping of Robust Language Understanding Modules for Spoken Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Integrating Topic Estimation and Dialogue History for Domain Selection in Multi-domain Spoken Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the New Frontiers in Applied Artificial Intelligence, 2008

Object dynamics prediction and motion generation based on reliable predictability.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

A robot referee for rock-paper-scissors sound games.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

Two-channel-based voice activity detection for humanoid robots in noisy home environments.

[BibT_eX]

[DOI]

Proceedings of the 2008 IEEE International Conference on Robotics and Automation, 2008

An open source software system for robot audition HARK and its evaluation.

[BibT_eX]

[DOI]

Proceedings of the 8th IEEE-RAS International Conference on Humanoid Robots, 2008

A beat-tracking robot for human-robot interaction and its evaluation.

[BibT_eX]

[DOI]

Proceedings of the 8th IEEE-RAS International Conference on Humanoid Robots, 2008

2007

Robust Recognition of Simultaneous Speech by a Mobile Robot.

[BibT_eX]

[DOI]

IEEE Trans. Robotics, 2007

Drum Sound Recognition for Polyphonic Audio Signals by Adaptation and Matching of Spectrogram Templates With Harmonic Structure Suppression.

[BibT_eX]

[DOI]

Kazuyoshi Yoshii

IEEE Trans. Speech Audio Process., 2007

Statistical machine translation using hierarchical phrase alignment.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2007

Drumix: An Audio Player with Real-time Drum-part Rearrangement Functions for Active Music Listening.

[BibT_eX]

[DOI]

Inf. Media Technol., 2007

Instrogram: Probabilistic Representation of Instrument Existence for Polyphonic Music.

[BibT_eX]

[DOI]

Inf. Media Technol., 2007

Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2007

Introducing Utterance Verification in Spoken Dialogue System to Improve Dynamic Help Generation for Novice Users.

[BibT_eX]

[DOI]

Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007

Auditory and Visual Integration based Localization and Tracking of Multiple Moving Sounds in Daily-life Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE RO-MAN 2007, 2007

Meaning Games.

[BibT_eX]

[DOI]

Proceedings of the New Frontiers in Artificial Intelligence, 2007

Improving Efficiency and Scalability of Model-Based Music Recommender System Based on Incremental Training.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Music Information Retrieval, 2007

A biped robot that keeps steps in time with musical beats while listening to music with its own ears.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Discovery of other individuals by projecting a self-model through imitation.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Exploiting known sound source signals to improve ICA-based robot audition in speech separation and recognition.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Two-way translation of compound sentences and arm motions by recurrent neural networks.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Auditory and visual integration based localization and tracking of humans in daily-life environments.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Vocal imitation using physical vocal tract model.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Analyzing temporal transition of real user's behaviors in a spoken dialogue system.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Topic estimation with domain extensibility for guiding user's out-of-grammar utterances in multi-domain spoken dialogue systems.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Evaluation of Two Simultaneous Continuous Speech Recognition with ICA BSS and MFT-Based ASR.

[BibT_eX]

[DOI]

Proceedings of the New Trends in Applied Artificial Intelligence, 2007

Real-Time Auditory and Visual Talker Tracking Through Integrating EM Algorithm and Particle Filter.

[BibT_eX]

[DOI]

Proceedings of the New Trends in Applied Artificial Intelligence, 2007

Human-Robot Cooperation using Quasi-symbols Generated by RNNPB Model.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Distance Estimation of Hidden Objects Based on Acoustical Holography by applying Acoustic Diffraction of Audible Sound.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Predicting Object Dynamics from Visual Images through Active Sensing Experiences.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Vowel Imitation Using Vocal Tract Model and Recurrent Neural Network.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing, 14th International Conference, 2007

Integration and Adaptation of Harmonic and Inharmonic Models for Separating Polyphonic Musical Signals.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Using multiple edit distances to automatically grade outputs from Machine translation systems.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2006

A privacy-enhanced access control.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2006

Dynamic Communication of Humanoid Robot with Multiple People Based on Interaction Distance.

[BibT_eX]

[DOI]

Inf. Media Technol., 2006

Common Acoustical Pole Estimation from Multi-Channel Musical Audio Signals.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2006

Multi-Domain Spoken Dialogue System with Extensibility and Robustness against Speech Recognition Errors.

[BibT_eX]

[DOI]

Proceedings of the SIGDIAL 2006 Workshop, 2006

Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2006: Trends in Artificial Intelligence, 2006

Hybrid Collaborative and Content-based Music Recommendation Using Probabilistic Model with Latent User Preferences.

[BibT_eX]

Proceedings of the ISMIR 2006, 2006

Automatic Feature Weighting in Automatic Transcription of Specified Part in Polyphonic Music.

[BibT_eX]

Proceedings of the ISMIR 2006, 2006

Musical Instrument Recognizer "Instrogram" and Its Application to Music Retrieval Based on Instrumentation Similarity.

[BibT_eX]

[DOI]

Proceedings of the Eigth IEEE International Symposium on Multimedia (ISM 2006), 2006

Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals.

[BibT_eX]

[DOI]

Proceedings of the Eigth IEEE International Symposium on Multimedia (ISM 2006), 2006

Experience Based Imitation Using RNNPB.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Real-Time Robot Audition System That Recognizes Simultaneous Speech in The Real World.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Missing-Feature based Speech Recognition for Two Simultaneous Speech Signals Separated by ICA with a pair of Humanoid Ears.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Multiple Acoustical Holography Method for Localization of Objects in Broad Range using Audible Sound.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Real-Time Tracking of Multiple Sound Sources by Integration of In-Room and Robot-Embedded Microphone Arrays.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Leak energy based missing feature mask generation for ICA and GSS and its evaluation with simultaneous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006

Improving speech recognition of two simultaneous speech signals by integrating ICA BSS and automatic missing feature mask generation.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Dynamic help generation by estimating user²s mental model in spoken dialogue systems.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Speaker identification under noisy environments by using harmonic structure extraction and reliable frame weighting.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals.

[BibT_eX]

[DOI]

Proceedings of the Advances in Applied Artificial Intelligence, 2006

An Error Correction Framework Based on Drum Pattern Periodicity for Improving Drum Sound Detection.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Robust Tracking of Multiple Sound Sources by Spatial Integration of Room And Robot Microphone Arrays.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Instrogram: A New Musical Instrument Recognition Technique Without Using Onset Detection NOR F0 Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

F0 Estimation Method for Singing Voice in Polyphonic Audio Signal Based on Statistical Vocal Model and Viterbi Search.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Robust decomposition of inverse filter of channel and prediction error filter of speech signal for dereverberation.

[BibT_eX]

[DOI]

Proceedings of the 14th European Signal Processing Conference, 2006

2005

User Modeling in Spoken Dialogue Systems to Generate Flexible Guidance.

[BibT_eX]

[DOI]

User Model. User Adapt. Interact., 2005

Extracting Multimodal Dynamics of Objects Using RNNPB.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2005

A computational model of monkey cortical grating cells.

[BibT_eX]

[DOI]

Biol. Cybern., 2005

Pitch-Dependent Identification of Musical Instrument Sounds.

[BibT_eX]

[DOI]

Appl. Intell., 2005

Walking with body-sense in virtual space using the nonlinear oscillator.

[BibT_eX]

[DOI]

Kenri Kodaka

Proceedings of the IEEE International Conference on Systems, 2005

Empirical Verification of Meaning-Game-based Generalization of Centering Theory with Large Japanese Corpus.

[BibT_eX]

[DOI]

Proceedings of the 19st Pacific Asia Conference on Language, Information and Computation, 2005

Instrument Identification in Polyphonic Music: Feature Weighting with Mixed Sounds, Pitch-Dependent Timbre Modeling, and Use of Musical Context.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2005, 2005

Singer Identification Based on Accompaniment Sound Reduction and Reliable Frame Selection.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2005, 2005

Making a robot recognize three simultaneous sentences in real-time.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Spatially mapping of friendliness for human-robot interaction.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Extracting multi-modal dynamics of objects using RNNPB.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

A two-layer model for behavior and dialogue planning in conversational service robots.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Implementation of active direction-pass filter on dynamically reconfigurable processor.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Multiple moving speaker tracking by microphone array on mobile robot.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Contextual constraints based on dialogue models in database search task for spoken dialogue systems.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Distance-Based Dynamic Interaction of Humanoid Robot with Multiple People.

[BibT_eX]

[DOI]

Proceedings of the Innovations in Applied Artificial Intelligence, 2005

Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Robotics and Automation, 2005

2004

Effects of increasing modalities in recognizing three simultaneous speeches.

[BibT_eX]

[DOI]

Speech Commun., 2004

Improvement of recognition of simultaneous speech signals using AV integration and scattering theory for humanoid robots.

[BibT_eX]

[DOI]

Speech Commun., 2004

Automatic Sound-Imitation Word Recognition from Environmental Sounds Focusing on Ambiguity Problem in Determining Phonemes.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2004: Trends in Artificial Intelligence, 2004

Incremental Methods to Select Test Sentences for Evaluating Translation Ability.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Bus Information System Based on User Models and Dynamic Generation of VoiceXML Scripts.

[BibT_eX]

[DOI]

Proceedings of the New Frontiers in Artificial Intelligence - JSAI 2003 and JSAI 2004 Conferences and Workshops, Niigata, Japan, June 23-27, 2003 and Kanazawa, Japan, May 31, 2004

Automatic Chord Transcription with Concurrent Recognition of Chord Symbols and Boundaries.

[BibT_eX]

[DOI]

Proceedings of the ISMIR 2004, 2004

Automatic Drum Sound Description for Real-World Music Using Template Adaptation and Matching Methods.

[BibT_eX]

[DOI]

Kazuyoshi Yoshii

Proceedings of the ISMIR 2004, 2004

Assessment of general applicability of robot audition system by recognizing three simultaneous speeches.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Drum sound identification for polyphonic music using template adaptation and matching methods.

[BibT_eX]

[DOI]

Kazuyoshi Yoshii

Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

Robot motion control using listener's back-channels and head gesture information.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Recognition of Emotional States in Spoken Dialogue with a Robot.

[BibT_eX]

[DOI]

Proceedings of the Innovations in Applied Artificial Intelligence, 2004

Improvement of Robot Audition by Interfacing Sound Source Separation and Automatic Speech Recognition with Missing Feature Theory.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Robotics and Automation, 2004

Comparing features for forming music streams in automatic music transcription.

[BibT_eX]

[DOI]

Yohei Sakuraba

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Category-level identification of non-registered musical instrument sounds.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Efficient Confirmation Strategy for Large-scale Text Retrieval Systems with Spoken Dialogue Interface.

[BibT_eX]

[DOI]

Proceedings of the COLING 2004, 2004

Using a Mixture of N-Best Lists from Multiple MT Systems in Rank-Sum-Based Confidence Measure for MT Outputs.

[BibT_eX]

[DOI]

Proceedings of the COLING 2004, 2004

2003

Human-robot non-verbal interaction empowered by real-time auditory and visual multiple-talker tracking.

[BibT_eX]

[DOI]

Adv. Robotics, 2003

Flexible Spoken Dialogue System based on User Models and Dynamic Generation of VoiceXML Scripts.

[BibT_eX]

[DOI]

Proceedings of the SIGDIAL 2003 Workshop, 2003

Experimental comparison of MT evaluation methods: RED vs.BLEU.

[BibT_eX]

[DOI]

Proceedings of Machine Translation Summit IX: Papers, 2003

Real-Time Sound Source Localization and Separation Based on Active Audio-Visual Integration.

[BibT_eX]

[DOI]

Proceedings of the Artificial Neural Nets Problem Solving Methods, 2003

Applying scattering theory to robot audition system: robust sound source localization and extraction.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, Nevada, USA, October 27, 2003

Three simultaneous speech recognition by integration of active audition and face recognition for humanoid.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

User modeling in spoken dialogue systems for flexible guidance generation.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic transformation of environmental sounds into sound-imitation words based on Japanese syllable structure.

[BibT_eX]

[DOI]

Kazushi Ishihara

Yasushi Tsubota

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Design and Implementation of Personality of Humanoids in Human Humanoid Non-verbal Interaction.

[BibT_eX]

[DOI]

Proceedings of the Developments in Applied Artificial Intelligence, 2003

Pitch-Dependent Musical Instrument Identification and Its Application to Musical Sound Ontology.

[BibT_eX]

[DOI]

Proceedings of the Developments in Applied Artificial Intelligence, 2003

Realizing personality in audio-visually triggered non-verbal behaviors.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Robotics and Automation, 2003

Robot recognizes three simultaneous speech by active audition.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Robotics and Automation, 2003

Note Recognition of Polyphonic Music by Using Timbre Similarity and Direction Proximity.

[BibT_eX]

[DOI]

Yohei Sakuraba

Proceedings of the 2003 International Computer Music Conference, 2003

Musical instrument identification based on F0-dependent multivariate normal distribution.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Improvement of three simultaneous speech recognition by using AV integration and scattering theory for humanoid.

[BibT_eX]

[DOI]

Proceedings of the AVSP 2003, 2003

Privacy-Enhanced SPKI Access Control on PKIX and Its Application to Web Server .

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Advanced Information Networking and Applications (AINA'03), 2003

Chunk-Based Statistical Translation.

[BibT_eX]

[DOI]

Taro Watanabe

Eiichiro Sumita

Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

Flexible Guidance Generation Using User Model in Spoken Dialogue Systems.

[BibT_eX]

[DOI]

Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

2002

Real-time Auditory and Visual Multiple-speaker Tracking For Human-robot Interaction.

[BibT_eX]

[DOI]

J. Robotics Mechatronics, 2002

Realizing Audio-Visually Triggered ELIZA-Like Non-verbal Behaviors.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2002: Trends in Artificial Intelligence, 2002

Auditory fovea based speech separation and its application to dialog system.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30, 2002

Belief network based disambiguation of object reference in spoken dialogue system for robot.

[BibT_eX]

[DOI]

Yoko Yamakata

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Auditory fovea based speech enhancement and its application to human-robot dialog system.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Real-time sound source localization and separation for robot audition.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Social Interaction of Humanoid RobotBased on Audio-Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the Developments in Applied Artificial Intelligence, 2002

Real-Time Speaker Localization and Speech Separation by Audio-Visual Integration.

[BibT_eX]

[DOI]

Proceedings of the 2002 IEEE International Conference on Robotics and Automation, 2002

Efficient Dialogue Strategy to Find Users' Intended Items from Information Query Results.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Computational Linguistics, 2002

Exploiting Auditory Fovea in Humanoid-Human Interaction.

[BibT_eX]

[DOI]

Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

2001

Detection of Oriented Repetitive Alternating Patterns in Color Images (A Computational Model of Monkey Grating Cells).

[BibT_eX]

[DOI]

Tino Lourens

Proceedings of the Connectionist Models of Neurons, 2001

Human-robot interaction through real-time auditory and visual multiple-talker tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2001

Epipolar geometry based sound localization and extraction for humanoid audition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2001

An Access Control with handling Private Information.

[BibT_eX]

[DOI]

Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

Separating three simultaneous speeches with two microphones by integrating auditory and visual processing.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Real-time multiple speaker tracking by multi-modal integration for mobile robots.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Real-Time Auditory and Visual Multiple-Object Tracking for Humanoids.

[BibT_eX]

Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Sound and Visual Tracking for Humanoid Robot.

[BibT_eX]

[DOI]

Proceedings of the Engineering of Intelligent Systems, 2001

Automatic Graph Extraction from Color Images.

[BibT_eX]

[DOI]

Tino Lourens

Proceedings of the 11th International Conference on Image Analysis and Processing (ICIAP 2001), 2001

Graph extraction from color images.

[BibT_eX]

[DOI]

Proceedings of the 9th European Symposium on Artificial Neural Networks, 2001

A computational model of monkey grating cells for oriented repetitive alternating patterns.

[BibT_eX]

[DOI]

Proceedings of the 9th European Symposium on Artificial Neural Networks, 2001

2000

Privacy-Enhanced Access Control by SPKI and Its Application to Web Server.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE International Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE 2000), 2000

Bridging Gap between the Simulation and Robotics with a Global Vision System.

[BibT_eX]

[DOI]

Yukiko Nakagawa

Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

And the Fans Are Going Wild! SIG plus MIKE.

[BibT_eX]

[DOI]

Proceedings of the RoboCup 2000: Robot Soccer World Cup IV, 2000

Humanoid Active Audition System Improved by the Cover Acoustics.

[BibT_eX]

[DOI]

Proceedings of the PRICAI 2000, Topics in Artificial Intelligence, 6th Pacific Rim International Conference on Artificial Intelligence, Melbourne, Australia, August 28, 2000

Active audition system and humanoid exterior design.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2000

Design and architecture of SIG the humanoid: an experimental platform for integrated perception in RoboCup humanoid challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2000

A framework for integrating sensory information in a humanoid robot.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2000

Privacy enhanced access control by SPKI.

[BibT_eX]

[DOI]

Proceedings of the Seventh International Conference on Parallel and Distributed Systems Workshops, 2000

Designing a humanoid head for RoboCup challenge.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference on Autonomous Agents, 2000

Active Audition for Humanoid.

[BibT_eX]

[DOI]

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000

1999

Listening to two simultaneous speeches.

[BibT_eX]

[DOI]

Speech Commun., 1999

Harmonic sound stream segregation using localization and its application to speech stream segregation.

[BibT_eX]

[DOI]

Speech Commun., 1999

Using Vision to Improve Sound Source Separation.

[BibT_eX]

[DOI]

Yukiko Nakagawa

Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1998

On the Properties of Combination Set Operations.

[BibT_eX]

[DOI]

Shin-ichi Minato

Hideki Isozaki

Inf. Process. Lett., 1998

Sound Ontology for Computational Auditory Scence Analysis.

[BibT_eX]

[DOI]

Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

1997

Understanding Three Simultaneous Speeches.

[BibT_eX]

[DOI]

Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

1996

A new speech enhancement: speech stream segregation.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Design and Implementation of Multiple-Context Truth Maintenance System with Binary Decision Diagram.

[BibT_eX]

Osamu Shimokuni

Hidehiko Tanaka

Proceedings of the Industrial and Engineering Applications of Artificial Intelligence and Expert Systems, 1996

Localization by harmonic structure and its application to harmonic sound stream segregation.

[BibT_eX]

[DOI]

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Interfacing Sound Stream Segregation to Automatic Speech Recognition - Preliminary Results on Listening to Several Sounds Simultaneously.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth National Conference on Artificial Intelligence and Eighth Innovative Applications of Artificial Intelligence Conference, 1996

1995

Residue-Driven Architecture for Computational Auditory Scene Analysis.

[BibT_eX]

[DOI]

Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

A computational model of sound stream segregation with multi-agent paradigm.

[BibT_eX]

[DOI]