Komei Sugiura

CoRR, 2023

Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query.

CoRR, 2023

Switching Text-Based Image Encoders for Captioning Images With Text.

Arisa Ueda

Wei Yang

IEEE Access, 2023

Affective Image Captioning for Visual Artworks Using Emotion-Based Cross-Attention Mechanisms.

IEEE Access, 2023

Prototypical Contrastive Transfer Learning for Multimodal Language Understanding.

Seitaro Otsuki

IROS, 2023

Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks.

IROS, 2023

Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions.

IROS, 2023

JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models.

Yuiga Wada

Kanta Kaneda

Proceedings of the 27th Conference on Computational Natural Language Learning, 2023

2022

Visual Explanation of Deep Q-Network for Robot Navigation by Fine-tuning Attention Branch.

CoRR, 2022

Moment-based Adversarial Training for Embodied Language Comprehension.

Proceedings of the 26th International Conference on Pattern Recognition, 2022

Relational Future Captioning Model for Explaining Likely Collisions in Daily Tasks.

Motonari Kambara

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Shared Transformer Encoder with Mask-Based 3D Model Estimation for Container Mass Estimation.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Flare Transformer: Solar Flare Prediction Using Magnetograms and Sunspot Physical Features.

Proceedings of the Computer Vision - ACCV 2022, 2022

Visual Explanation Generation Based on Lambda Attention Branch Networks.

Proceedings of the Computer Vision - ACCV 2022, 2022

2021

CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation.

IEEE Robotics Autom. Lett., 2021

Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions.

Motonari Kambara

IEEE Robotics Autom. Lett., 2021

Target-Dependent UNITER: A Transformer-Based Multimodal Language Comprehension Model for Domestic Service Robots.

IEEE Robotics Autom. Lett., 2021

Predicting and attending to damaging collisions for placing everyday objects in photo-realistic simulations.

Adv. Robotics, 2021

LatteGAN: Visually Guided Language Attention for Multi-Turn Text-Conditioned Image Manipulation.

IEEE Access, 2021

Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning.

Proceedings of the International Joint Conference on Neural Networks, 2021

Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Alleviating the Burden of Labeling: Sentence Generation by Attention Branch Encoder-Decoder Network.

IEEE Robotics Autom. Lett., 2020

A Multimodal Target-Source Classifier With Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects.

IEEE Robotics Autom. Lett., 2020

Compensation on x-vector for Short Utterance Spoken Language Identification.

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

2019

Understanding Natural Language Instructions for Fetching Daily Objects Using GAN-Based Multimodal Target-Source Classification.

IEEE Robotics Autom. Lett., 2019

Latent-Space Data Augmentation for Visually-Grounded Language Understanding.

Proceedings of the Advances in Artificial Intelligence, 2019

Multimodal Attention Branch Network for Perspective-Free Sentence Generation.

Proceedings of the 3rd Annual Conference on Robot Learning, 2019

2018

SuMo-SS: Submodular Optimization Sensor Scattering for Deploying Sensor Networks by Drones.

IEEE Robotics Autom. Lett., 2018

A Multimodal Classifier Generative Adversarial Network for Carry and Place Tasks From Ambiguous Language Instructions.

IEEE Robotics Autom. Lett., 2018

2017

Sentence Selection Based on Extended Entropy Using Phonetic and Prosodic Contexts for Statistical Parametric Speech Synthesis.

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Grounded language understanding for manipulation instructions using GAN-based classification.

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Dynamically pre-trained deep recurrent neural networks using environmental monitoring data for predicting PM2.5.

Bun Theang Ong

Neural Comput. Appl., 2016

Space-time multiple regression model for grid-based population estimation in urban areas.

Ko Ko Lwin

Int. J. Geogr. Inf. Sci., 2016

Special issue on machine learning and data engineering in robotics.

Adv. Robotics, 2016

Analysis of Long-Term and Large-Scale Experiments on Robot Dialogues Using a Cloud Robotics Platform.

Proceedings of the Eleventh ACM/IEEE International Conference on Human Robot Interaction, 2016

2015

A cloud robotics approach towards dialogue-oriented robot speech.

Adv. Robotics, 2015

RoboCup@Home: Analysis and results of evolving competitions for domestic and service robots.

Luca Iocchi

Dirk Holz

Javier Ruiz-del-Solar

Tijn van der Zant

Artif. Intell., 2015

Rospeex: A cloud robotics platform for human-robot spoken dialogues.

Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Entropy-based sentence selection for speech synthesis using phonetic and prosodic contexts.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Geovisualization and correlation analysis between geotagged Twitter and JMA rainfall data: Case of heavy rain disaster in Hiroshima.

Ko Ko Lwin

Proceedings of the 2nd IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services, 2015

Constrained region selection method based on configuration space for visualization in scientific dataset search.

Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

2014

Quality-of-Experience (QoE) in Emerging Mobile Social Networks.

IEICE Trans. Inf. Syst., 2014

On RoboCup@Home - Past, Present and Future of a Scientific Competition for Service Robots.

Dirk Holz

Javier Ruiz-del-Solar

Sven Wachsmuth

Proceedings of the RoboCup 2014: Robot World Cup XVIII [papers from the 18th Annual RoboCup International Symposium], 2014

Non-monologue HMM-based speech synthesis for service robots: A cloud robotics approach.

Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

A new dimension for RoboCup @home: human-robot interaction between virtual and real worlds.

Jeffrey Too Chuan Tan

Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2014

Dynamic pre-training of Deep Recurrent Neural Networks for predicting environmental monitoring data.

Bun Theang Ong

Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

Spatio-temporal Pseudo Relevance Feedback for Large-Scale and Heterogeneous Scientific Repositories.

Proceedings of the 2014 IEEE International Congress on Big Data, Anchorage, AK, USA, June 27, 2014

2013

Human-Robot Interaction between Virtual and Real Worlds: Motivation from RoboCup @Home.

Jeffrey Too Chuan Tan

Proceedings of the Social Robotics - 5th International Conference, 2013

Development of RoboCup@Home Simulation towards Long-term Large Scale HRI.

Tetsunari Inamura

Jeffrey Too Chuan Tan

Takayuki Nagai

Hiroyuki Okada

Proceedings of the RoboCup 2013: Robot World Cup XVII [papers from the 17th Annual RoboCup International Symposium], 2013

Utterance Classification Using Linguistic and Non-linguistic Information for Network-Based Speech-to-Speech Translation Systems.

Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013

Complementary Integration of Heterogeneous Crowd-Sourced Datasets for Enhanced Social Analytics.

Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013

2012

Learning Novel Objects for Extended Mobile Manipulation.

J. Intell. Robotic Syst., 2012

2011

Modeling spoken decision support dialogue and optimization of its dialogue strategy.

ACM Trans. Speech Lang. Process., 2011

Situated Spoken Dialogue with Robots Using Active Learning.

Adv. Robotics, 2011

Learning, Generation and Recognition of Motions by Reference-Point-Dependent Probabilistic Models.

Adv. Robotics, 2011

Motion generation by reference-point-dependent trajectory HMMs.

Naoto Iwahashi

Hideki Kashioka

Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface.

Proceedings of the Spoken Dialogue Systems Technology and Design, 2011

2010

Detecting Robot-Directed Speech by Situated Understanding in Physical Interaction.

Inf. Media Technol., 2010

Dialogue strategy optimization to assist user's decision for spoken consulting dialogue systems.

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy.

Proceedings of the SIGDIAL 2010 Conference, 2010

Detecting robot-directed speech by situated understanding in object manipulation tasks.

Proceedings of the 19th IEEE International Conference on Robot and Human Interactive Communication, 2010

Active learning of confidence measure function in robot language acquisition framework.

Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Learning novel objects using out-of-vocabulary word segmentation and object extraction for home assistant robots.

Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Robot-directed speech detection using Multimodal Semantic Confidence based on speech, image, and motion.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2010

Active Learning for Generating Motion and Utterances in Object Manipulation Dialogue Tasks.

Proceedings of the Dialog with Robots, 2010

Robots that Learn to Communicate: A Developmental Approach to Personally and Physically Situated Human-Robot Conversations.

Proceedings of the Dialog with Robots, 2010

2009

Bayesian learning of confidence measure function for generation of utterances and motions in object manipulation dialogue task.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Constructive Approach to Role-Reversal Imitation Through Unsegmented Interactions.

J. Robotics Mechatronics, 2008

Motion recognition and generation by combining reference-point-dependent probabilistic models.
