Carlos Toshinori Ishi

ORCID: 0000-0001-8130-1048

According to our database, Carlos Toshinori Ishi authored at least 114 papers between 1999 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
An Adversarial Training Based Speech Emotion Classifier With Isolated Gaussian Regularization.
IEEE Trans. Affect. Comput., 2023

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion.
CoRR, 2023

A Smartphone Pose Auto-calibration Method using Hash-based DOA Estimation.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2023

An attention-based sound selective hearing support system: evaluation by subjects with age-related hearing loss.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2023

Recognizing Real-World Intentions using A Multimodal Deep Learning Approach with Spatial-Temporal Graph Convolutional Networks.
IROS, 2023

HAG: Hierarchical Attention with Graph Network for Dialogue Act Classification in Conversation.
Proceedings of the IEEE International Conference on Acoustics, 2023

I Know Your Feelings Before You Do: Predicting Future Affective Reactions in Human-Computer Dialogue.
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Using Joint Training Speaker Encoder With Consistency Loss to Achieve Cross-Lingual Voice Conversion and Expressive Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

QUICKVC: A Lightweight VITS-Based Any-to-Many Voice Conversion Model using ISTFT for Faster Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
An improved CycleGAN-based emotional voice conversion model by augmenting temporal dependency with a transformer.
Speech Commun., 2022

Expression of Personality by Gaze Movements of an Android Robot in Multi-Party Dialogues.
Proceedings of the 31st IEEE International Conference on Robot and Human Interactive Communication, 2022

Controlling the Impression of Robots via GAN-based Gesture Generation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Butsukusa: A Conversational Mobile Robot Describing Its Own Observations and Internal States.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2022

A Controllable Cross-Gender Voice Conversion for Social Robot.
Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, ACII 2022, 2022

2021
Using an Android Robot to Improve Social Connectedness by Sharing Recent Experiences of Group Members in Human-Robot Conversations.
IEEE Robotics Autom. Lett., October, 2021

Skeleton-Based Emotion Recognition Based on Two-Stream Self-Attention Enhanced Spatial-Temporal Graph Convolutional Network.
Sensors, 2021

Enabling Robots to Distinguish Between Aggressive and Joking Attitudes.
IEEE Robotics Autom. Lett., 2021

Advocating Attitudinal Change Through Android Robot's Intention-Based Expressive Behaviors: Toward WHO COVID-19 Guidelines Adherence.
IEEE Robotics Autom. Lett., 2021

CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer.
CoRR, 2021

Analysis of Eye Gaze Reasons and Gaze Aversions During Three-Party Conversations.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Probabilistic Human-like Gesture Synthesis from Speech using GRU-based WGAN.
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021

MAEC: Multi-Instance Learning with an Adversarial Auto-Encoder-Based Classifier for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Analysis of Role-Based Gaze Behaviors and Gaze Aversions, and Implementation of Robot's Gaze Control for Multi-party Dialogue.
Proceedings of the HAI '21: International Conference on Human-Agent Interaction, Virtual Event, Japan, November 9, 2021

2020
Multi-Modality Emotion Recognition Model with GAT-Based Multi-Head Inter-Modality Attention.
Sensors, 2020

Person-Directed Pointing Gestures and Inter-Personal Relationship: Expression of Politeness to Friendliness by Android Robots.
IEEE Robotics Autom. Lett., 2020

Analysis of body gestures in anger expression and evaluation in android robot.
Adv. Robotics, 2020

Analysis of sound activities and voice activity detection using in-car microphone arrays.
Proceedings of the 2020 IEEE/SICE International Symposium on System Integration, 2020

SeMemNN: A Semantic Matrix-Based Memory Neural Network for Text Classification.
Proceedings of the IEEE 14th International Conference on Semantic Computing, 2020

AAEC: An Adversarial Autoencoder-based Classifier for Audio Emotion Recognition.
Proceedings of the MuSe'20: Proceedings of the 1st International on Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop, 2020

Generation and Evaluation of Audio-Visual Anger Emotional Expression for Android Robot.
Proceedings of the Companion of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, 2020

An End-to-end Multitask Learning Model to Improve Speech Emotion Recognition.
Proceedings of the 28th European Signal Processing Conference, 2020

3D Skeletal Movement enhanced Emotion Recognition Network.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Probabilistic nod generation model based on speech and estimated utterance categories.
Adv. Robotics, 2019

Auditory scene reproduction for tele-operated robot systems.
Adv. Robotics, 2019

Expressing reactive emotion based on multimodal emotion recognition for natural conversation in human-robot interaction.
Adv. Robotics, 2019

Analysis of factors influencing the impression of speaker individuality in android robots.
Proceedings of the 28th IEEE International Conference on Robot and Human Interactive Communication, 2019

A Neural Turn-Taking Model without RNN.
Proceedings of the Interspeech 2019, 2019

Online processing for speech-driven gesture motion generation in android robots.
Proceedings of the 19th IEEE-RAS International Conference on Humanoid Robots, 2019

2018
A Speech-Driven Hand Gesture Generation Method and Evaluation in Android Robots.
IEEE Robotics Autom. Lett., 2018

2017
Probabilistic 3-D Mapping of Sound-Emitting Structures Based on Acoustic Ray Casting.
IEEE Trans. Robotics, 2017

Motion Analysis in Vocalized Surprise Expressions and Motion Generation in Android Robots.
IEEE Robotics Autom. Lett., 2017

Novel Speech Motion Generation by Modeling Dynamics of Human Speech Production.
Frontiers Robotics AI, 2017

Probabilistic nod generation model based on estimated utterance categories.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Turn-Taking Estimation Model Based on Joint Embedding of Lexical and Prosodic Contents.
Proceedings of the Interspeech 2017, 2017

Motion Analysis in Vocalized Surprise Expressions.
Proceedings of the Interspeech 2017, 2017

Prosodic Analysis of Attention-Drawing Speech.
Proceedings of the Interspeech 2017, 2017

Emotion recognition by combining prosody and sentiment analysis for expressing reactive emotion by humanoid robot.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Speech driven trunk motion generating system based on physical constraint.
Proceedings of the 25th IEEE International Symposium on Robot and Human Interactive Communication, 2016

ERICA: The ERATO Intelligent Conversational Android.
Proceedings of the 25th IEEE International Symposium on Robot and Human Interactive Communication, 2016

Hearing support system using environment sensor network.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Motion generation in android robots during laughing speech.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

2015
Online speech-driven head motion generating system and evaluation on a tele-operated robot.
Proceedings of the 24th IEEE International Symposium on Robot and Human Interactive Communication, 2015

Robot-assisted acoustic inspection of infrastructures - cooperative hammer sounding inspection.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Speech activity detection and face orientation estimation using multiple microphone arrays and human position information.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Audio augmented point clouds for applications in robotics.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Bringing the Scene Back to the Tele-operator: Auditory Scene Manipulation for Tele-presence Systems.
Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction, 2015

2014
Analysis of relationship between head motion events and speech in dialogue conversations.
Speech Commun., 2014

Integration of Multiple Microphone Arrays and Use of Sound Reflections for 3D Localization of Sound Sources.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2014

Audio ray tracing for position estimation of entities in blind regions.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Analysis of laughter events in real science classes by using multiple environment sensor data.
Proceedings of the INTERSPEECH 2014, 2014

Mapping sound emitting structures in 3D.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

2013
Generation of Nodding, Head tilting and Gazing for Human-Robot speech Interaction.
Int. J. Humanoid Robotics, 2013

Analysis of the visual Lombard effect and automatic recognition experiments.
Comput. Speech Lang., 2013

Using sound reflections to detect moving entities out of the field of view.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Using multiple microphone arrays and reflections for 3D localization of sound sources.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Creation of radiated sound intensity maps using multi-modal measurements onboard an autonomous mobile platform.
Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Analysis of factors involved in the choice of rising or non-rising intonation in question utterances appearing in conversational speech.
Proceedings of the INTERSPEECH 2013, 2013

Probabilistic approach for building auditory maps with a mobile microphone array.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

2012
Body-conductive acoustic sensors in human-robot communication.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Evaluation of formant-based lip motion generation in tele-operated humanoid robots.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Combining laser range finders and local steered response power for audio monitoring.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Evaluation of a formant-based speech-driven lip motion generation.
Proceedings of the INTERSPEECH 2012, 2012

Fusion of standard and alternative acoustic sensors for robust automatic speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Generation of nodding, head tilting and eye gazing for human-robot dialogue interaction.
Proceedings of the International Conference on Human-Robot Interaction, 2012

The role of the Lombard reflex in parkinson's disease.
Proceedings of the 12th IEEE International Conference on Bioinformatics & Bioengineering, 2012

2011
Field Trial of a Networked Robot at a Train Station.
Int. J. Soc. Robotics, 2011

Studying laughter in combination with two humanoid robots.
AI Soc., 2011

Telenoid: tele-presence android for communication.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2011

The effects of microphone array processing on pitch extraction in real noisy environments.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Multi-modal front-end for speaker activity detection in small meetings.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Analysis of Acoustic-Prosodic Features Related to Paralinguistic Information Carried by Interjections in Dialogue Speech.
Proceedings of the INTERSPEECH 2011, 2011

Improved Acoustic Characterization of Breathy and Whispery Voices.
Proceedings of the INTERSPEECH 2011, 2011

Range Based Multi Microphone Array Fusion for Speaker Activity Detection in Small Meetings.
Proceedings of the INTERSPEECH 2011, 2011

Speech Production in Noisy Environments and the Effect on Automatic Speech Recognition.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

Speech-driven lip motion generation for tele-operated humanoid robots.
Proceedings of the Auditory-Visual Speech Processing, 2011

2010
Analysis of the Roles and the Dynamics of Breathy and Whispery Voice Qualities in Dialogue Speech.
EURASIP J. Audio Speech Music. Process., 2010

Sound interval detection of multiple sources based on sound directivity.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Close speaker cancellation for suppression of non-stationary background noise for hands-free speech interface.
Proceedings of the INTERSPEECH 2010, 2010

Head motions during dialogue speech and nod timing control in humanoid robots.
Proceedings of the 5th ACM/IEEE International Conference on Human Robot Interaction, 2010

Real-time audio-visual voice activity detection for speech recognition in noisy environments.
Proceedings of the Auditory-Visual Speech Processing, 2010

Investigating the role of the Lombard reflex in visual and audiovisual speech recognition.
Proceedings of the Auditory-Visual Speech Processing, 2010

2009
Evaluation of a MUSIC-based real-time sound localization of multiple sound sources in real noisy environments.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

How about laughter? Perceived naturalness of two laughing humanoid robots.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008
A Robust Speech Recognition System for Communication Robots in Noisy Environments.
IEEE Trans. Robotics, 2008

A Method for Automatic Detection of Vocal Fry.
IEEE Trans. Speech Audio Process., 2008

Automatic extraction of paralinguistic information using prosodic features related to F.
Speech Commun., 2008

The meanings carried by interjections in spontaneous speech.
Proceedings of the INTERSPEECH 2008, 2008

A semi-autonomous communication robot: a field trial at a train station.
Proceedings of the 3rd ACM/IEEE international conference on Human robot interaction, 2008

Analysis of inter- and intra-speaker variability of head motions during spoken dialogue.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007
A blendshape model for mapping facial motions to an android.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Analysis of head motions and speech, and head motion control in an android.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Analysis of head motions and speech in spoken dialogue.
Proceedings of the INTERSPEECH 2007, 2007

2006
Evaluation of Prosodic and Voice Quality Features on Automatic Extraction of Paralinguistic Information.
Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

Analysis of prosodic and linguistic cues of phrase finals for turn-taking and dialog acts.
Proceedings of the INTERSPEECH 2006, 2006

Robust Speech Recognition System for Communication Robots in Real Environments.
Proceedings of the 2006 6th IEEE-RAS International Conference on Humanoid Robots, 2006

2005
Perceptually-Related F0 Parameters for Automatic Classification of Phrase Final Tones.
IEICE Trans. Inf. Syst., 2005

Proposal of acoustic measures for automatic detection of vocal fry.
Proceedings of the INTERSPEECH 2005, 2005

2004
A new acoustic measure for aspiration noise detection.
Proceedings of the INTERSPEECH 2004, 2004

2003
Mora F0 representation for accent type identification in continuous speech and considerations on its relation with perceived pitch values.
Speech Commun., 2003

Perceptually-related acoustic-prosodic features of phrase finals in spontaneous speech.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2001
Identification of accent and intonation in sentences for CALL systems.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
The distribution of fillers in lectures in the Japanese language.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Identification of Japanese double-mora phonemes considering speaking rate for the use in CALL systems.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
A system for learning the pronunciation of Japanese pitch accent.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
