Komei Sugiura

Orcid: 0000-0002-0261-0510

According to our database1, Komei Sugiura authored at least 73 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine.
IEEE Robotics Autom. Lett., 2024

Polos: Multimodal Metric Learning from Human Feedback for Image Captioning.
CoRR, 2024

2023
DialMAT: Dialogue-Enabled Transformer with Moment-Based Adversarial Training.
CoRR, 2023

Fully Automated Task Management for Generation, Execution, and Evaluation: A Framework for Fetch-and-Carry Tasks with Natural Language Instructions in Continuous Space.
CoRR, 2023

Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query.
CoRR, 2023

Switching Text-Based Image Encoders for Captioning Images With Text.
IEEE Access, 2023

Affective Image Captioning for Visual Artworks Using Emotion-Based Cross-Attention Mechanisms.
IEEE Access, 2023

Prototypical Contrastive Transfer Learning for Multimodal Language Understanding.
IROS, 2023

Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks.
IROS, 2023

Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions.
IROS, 2023

JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models.
Proceedings of the 27th Conference on Computational Natural Language Learning, 2023

2022
Visual Explanation of Deep Q-Network for Robot Navigation by Fine-tuning Attention Branch.
CoRR, 2022

Moment-based Adversarial Training for Embodied Language Comprehension.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Relational Future Captioning Model for Explaining Likely Collisions in Daily Tasks.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Shared Transformer Encoder with Mask-Based 3d Model Estimation for Container Mass Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Flare Transformer: Solar Flare Prediction Using Magnetograms and Sunspot Physical Features.
Proceedings of the Computer Vision - ACCV 2022, 2022

Visual Explanation Generation Based on Lambda Attention Branch Networks.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation.
IEEE Robotics Autom. Lett., 2021

Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions.
IEEE Robotics Autom. Lett., 2021

Target-Dependent UNITER: A Transformer-Based Multimodal Language Comprehension Model for Domestic Service Robots.
IEEE Robotics Autom. Lett., 2021

Predicting and attending to damaging collisions for placing everyday objects in photo-realistic simulations.
Adv. Robotics, 2021

LatteGAN: Visually Guided Language Attention for Multi-Turn Text-Conditioned Image Manipulation.
IEEE Access, 2021

Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning.
Proceedings of the International Joint Conference on Neural Networks, 2021

Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Alleviating the Burden of Labeling: Sentence Generation by Attention Branch Encoder-Decoder Network.
IEEE Robotics Autom. Lett., 2020

A Multimodal Target-Source Classifier With Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects.
IEEE Robotics Autom. Lett., 2020

Compensation on x-vector for Short Utterance Spoken Language Identification.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

2019
Understanding Natural Language Instructions for Fetching Daily Objects Using GAN-Based Multimodal Target-Source Classification.
IEEE Robotics Autom. Lett., 2019

Latent-Space Data Augmentation for Visually-Grounded Language Understanding.
Proceedings of the Advances in Artificial Intelligence, 2019

Multimodal Attention Branch Network for Perspective-Free Sentence Generation.
Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018
SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Networks by Drones.
IEEE Robotics Autom. Lett., 2018

A Multimodal Classifier Generative Adversarial Network for Carry and Place Tasks From Ambiguous Language Instructions.
IEEE Robotics Autom. Lett., 2018

2017
Sentence Selection Based on Extended Entropy Using Phonetic and Prosodic Contexts for Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Grounded language understanding for manipulation instructions using GAN-based classification.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Dynamically pre-trained deep recurrent neural networks using environmental monitoring data for predicting PM2.5.
Neural Comput. Appl., 2016

Space-time multiple regression model for grid-based population estimation in urban areas.
Int. J. Geogr. Inf. Sci., 2016

Special issue on machine learning and data engineering in robotics.
Adv. Robotics, 2016

Analysis of Long-Term and Large-Scale Experiments on Robot Dialogues Using a Cloud Robotics Platform.
Proceedings of the Eleventh ACM/IEEE International Conference on Human Robot Interation, 2016

2015
A cloud robotics approach towards dialogue-oriented robot speech.
Adv. Robotics, 2015

RoboCup@Home: Analysis and results of evolving competitions for domestic and service robots.
Artif. Intell., 2015

Rospeex: A cloud robotics platform for human-robot spoken dialogues.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts.
Proceedings of the INTERSPEECH 2015, 2015

Geovisualization and correlation analysis between geotagged Twitter and JMA rainfall data: Case of heavy rain disaster in Hiroshima.
Proceedings of the 2nd IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services, 2015

Constrained region selection method based on configuration space for visualization in scientific dataset search.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014
Quality-of-Experience (QoE) in Emerging Mobile Social Networks.
IEICE Trans. Inf. Syst., 2014

On RoboCup@Home - Past, Present and Future of a Scientific Competition for Service Robots.
Proceedings of the RoboCup 2014: Robot World Cup XVIII [papers from the 18th Annual RoboCup International Symposium, 2014

Non-monologue HMM-based speech synthesis for service robots: A cloud robotics approach.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

A new dimension for RoboCup @home: human-robot interaction between virtual and real worlds.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2014

Dynamic pre-training of Deep Recurrent Neural Networks for predicting environmental monitoring data.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

Spatio-temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous Scientific Repositories.
Proceedings of the 2014 IEEE International Congress on Big Data, Anchorage, AK, USA, June 27, 2014

2013
Human-Robot Interaction between Virtual and Real Worlds: Motivation from RoboCup @Home.
Proceedings of the Social Robotics - 5th International Conference, 2013

Development of RoboCup@Home Simulation towards Long-term Large Scale HRI.
Proceedings of the RoboCup 2013: Robot World Cup XVII [papers from the 17th Annual RoboCup International Symposium, 2013

Utterance Classification Using Linguistic and Non-linguistic Information for Network-Based Speech-to-Speech Translation Systems.
Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013, 2013

Complementary Integration of Heterogeneous Crowd-Sourced Datasets for Enhanced Social Analytics.
Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013, 2013

2012
Learning Novel Objects for Extended Mobile Manipulation.
J. Intell. Robotic Syst., 2012

2011
Modeling spoken decision support dialogue and optimization of its dialogue strategy.
ACM Trans. Speech Lang. Process., 2011

Situated Spoken Dialogue with Robots Using Active Learning.
Adv. Robotics, 2011

Learning, Generation and Recognition of Motions by Reference-Point-Dependent Probabilistic Models.
Adv. Robotics, 2011

Motion generation by reference-point-dependent trajectory HMMs.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface.
Proceedings of the Spoken Dialogue Systems Technology and Design, 2011

2010
Dialogue strategy optimization to assist user's decision for spoken consulting dialogue systems.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy.
Proceedings of the SIGDIAL 2010 Conference, 2010

Detecting robot-directed speech by situated understanding in object manipulation tasks.
Proceedings of the 19th IEEE International Conference on Robot and Human Interactive Communication, 2010

Active learning of confidence measure function in robot language acquisition framework.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Learning novel objects using out-of-vocabulary word segmentation and object extraction for home assistant robots.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Robot-directed speech detection using Multimodal Semantic Confidence based on speech, image, and motion.
Proceedings of the IEEE International Conference on Acoustics, 2010

Active Learning for Generating Motion and Utterances in Object Manipulation Dialogue Tasks.
Proceedings of the Dialog with Robots, 2010

Robots that Learn to Communicate: A Developmental Approach to Personally and Physically Situated Human-Robot Conversations.
Proceedings of the Dialog with Robots, 2010

2009
Bayesian learning of confidence measure function for generation of utterances and motions in object manipulation dialogue task.
Proceedings of the INTERSPEECH 2009, 2009

2008
Constructive Approach to Role-Reversal Imitation Through Unsegmented Interactions.
J. Robotics Mechatronics, 2008

Motion recognition and generation by combining reference-point-dependent probabilistic models.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

2005
Exploiting interaction between sensory morphology and learning.
Proceedings of the IEEE International Conference on Systems, 2005

2003
Evolution of Rewriting Rule Sets Using String-Based Tierra.
Proceedings of the Advances in Artificial Life, 7th European Conference, 2003


  Loading...