Yoshitaka Ushiku

Orcid: 0000-0002-9014-1389

Affiliations:

OMRON SINIC X Corp., Tokyo, Japan
University of Tokyo, Department of Mechano-Informatics, Japan (PhD 2014)

According to our database, Yoshitaka Ushiku authored at least 81 papers between 2010 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Timeline (chart omitted)

Legend: Book, In proceedings, Article, PhD thesis, Dataset, Other

Bibliography

2024

COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark.

,

,

Atsushi Hashimoto

,

,

,

Yusuke Fukasawa

,

Yoshitaka Ushiku

CoRR, 2024

SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific Posters.

,

,

Yoshitaka Ushiku

CoRR, 2024

AdaCoder: Adaptive Prompt Compression for Programmatic Visual Question Answering.

,

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

,

CoRR, 2024

TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification.

,

,

Yoshitaka Ushiku

,

,

CoRR, 2024

Unsupervised LLM Adaptation for Question Answering.

,

,

,

Yoshitaka Ushiku

CoRR, 2024

Toward AI-Mediated Avatar-Based Telecommunication: Investigating Visual Impression of Switching Between User- and AI-Controlled Avatars in Video Chat.

,

,

Yoshitaka Ushiku

IEEE Access, 2024

Unsupervised Domain Adaptation for Human and Animal Chest X-Ray Bone Suppression.

Tomohiro Hashimoto

,

,

Shoichiro Sekiguchi

,

Yoshitaka Ushiku

,

Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Vision-Language Interpreter for Robot Task Planning.

,

Cristian C. Beltran-Hernandez

,

,

Atsushi Hashimoto

,

,

Kento Kawaharazuka

,

Kazutoshi Tanaka

,

Yoshitaka Ushiku

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding.

Tatsunori Taniai

,

,

,

,

,

Yoshitaka Ushiku

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

PolarDB: Formula-Driven Dataset for Pre-Training Trajectory Encoders.

,

,

,

,

Yoshitaka Ushiku

,

Atsushi Hashimoto

,

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023

State-aware video procedural captioning.

Taichi Nishimura

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

,

Hirotaka Kameko

,

Multim. Tools Appl., October, 2023

Analysis of Protein Folding Simulation with Moving Root Mean Square Deviation.

Yutaka Maruyama

,

,

Yoshitaka Ushiku

,

Ayori Mitsutake

J. Chem. Inf. Model., March, 2023

A Transformer Model for Symbolic Regression towards Scientific Discovery.

Florian Lalande

,

Yoshitomo Matsubara

,

,

Tatsunori Taniai

,

,

Yoshitaka Ushiku

CoRR, 2023

Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos.

Takehiko Ohkawa

,

,

Taichi Nishimura

,

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

,

CoRR, 2023

WeaveNet for Approximating Two-sided Matching Problems.

,

,

Atsushi Hashimoto

,

,

Yoshitaka Ushiku

CoRR, 2023

A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task.

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

CoRR, 2023

Noisy Universal Domain Adaptation via Divergence Optimization for Visual Recognition.

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

CoRR, 2023

Reference-based Dense Pose Estimation via Partial 3D Point Cloud Matching.

,

Atsushi Hashimoto

,

,

Yoshitaka Ushiku

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Robotic Powder Grinding with Audio-Visual Feedback for Laboratory Automation in Materials Science.

Yusaku Nakajima

,

,

Kazutoshi Tanaka

,

,

Felix von Drigalski

,

,

Yoshitaka Ushiku

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2023

2022

Self-supervised learning of materials concepts from crystal structures via deep neural networks.

,

Tatsunori Taniai

,

,

Yoshitaka Ushiku

,

Mach. Learn. Sci. Technol., December, 2022

Neural Structure Fields with Application to Crystal Structure Autoencoders.

,

,

Tatsunori Taniai

,

,

Yoshitaka Ushiku

,

,

CoRR, 2022

Recipe Generation from Unsegmented Cooking Videos.

Taichi Nishimura

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

,

Hirotaka Kameko

,

CoRR, 2022

Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery.

Yoshitomo Matsubara

,

,

,

Tatsunori Taniai

,

Yoshitaka Ushiku

CoRR, 2022

Edge-Selective Feature Weaving for Point Cloud Matching.

,

Atsushi Hashimoto

,

,

,

,

Yoshitaka Ushiku

CoRR, 2022

Edge Computing-Assisted DNN Image Recognition System With Progressive Image Retransmission.

Mutsuki Nakahara

,

,

Yoshitaka Ushiku

,

Takayuki Nishio

,

,

,

IEEE Access, 2022

Robotic Powder Grinding with a Soft Jig for Laboratory Automation in Material Science.

Yusaku Nakajima

,

,

,

,

Felix von Drigalski

,

Kazutoshi Tanaka

,

Yoshitaka Ushiku

,

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Cross-modal Representation Learning for Understanding Manufacturing Procedure.

Atsushi Hashimoto

,

Taichi Nishimura

,

Yoshitaka Ushiku

,

Hirotaka Kameko

,

Proceedings of the Cross-Cultural Design. Applications in Learning, Arts, Cultural Heritage, Creative Industries, and Virtual Reality, 2022

The Effect of Improving Annotation Quality on Object Detection Datasets: A Preliminary Study.

,

Yoshitaka Ushiku

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows.

,

Atsushi Hashimoto

,

Taichi Nishimura

,

Hirotaka Kameko

,

,

Yoshitaka Ushiku

,

Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021

Crowd Density Forecasting by Modeling Patch-Based Dynamics.

Hiroaki Minoura

,

,

,

Yoshitaka Ushiku

IEEE Robotics Autom. Lett., 2021

Foreground-Aware Stylization and Consensus Pseudo-Labeling for Domain Adaptation of First-Person Hand Segmentation.

Takehiko Ohkawa

,

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

,

IEEE Access, 2021

Structure-Aware Procedural Text Generation From an Image Sequence.

Taichi Nishimura

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

,

Hirotaka Kameko

,

,

IEEE Access, 2021

Retransmission Edge Computing System Conducting Adaptive Image Compression Based on Image Recognition Accuracy.

Mutsuki Nakahara

,

,

,

Yoshitaka Ushiku

,

,

Proceedings of the 94th IEEE Vehicular Technology Conference, 2021

Egocentric Biochemical Video-and-Language Dataset.

Taichi Nishimura

,

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

,

,

,

Hirotaka Kameko

,

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Removing Word-Level Spurious Alignment between Images and Pseudo-Captions in Unsupervised Image Captioning.

,

Yoshitaka Ushiku

,

Atsushi Hashimoto

,

,

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Divergence Optimization for Noisy Universal Domain Adaptation.

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Visual Grounding Annotation of Recipe Flow Graph.

Taichi Nishimura

,

,

Hayato Hashimoto

,

Atsushi Hashimoto

,

,

,

Yoshitaka Ushiku

,

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019

How narratives move your mind: A corpus of shared-character stories for connecting emotional flow and interestingness.

,

,

Yoshitaka Ushiku

,

Inf. Process. Manag., 2019

Decentralized Learning of Generative Adversarial Networks from Multi-Client Non-iid Data.

,

Tomohiro Takahashi

,

Atsushi Hashimoto

,

Yoshitaka Ushiku

CoRR, 2019

Pose Graph Optimization for Unsupervised Monocular Visual Odometry.

,

Yoshitaka Ushiku

,

Proceedings of the International Conference on Robotics and Automation, 2019

Generating Easy-to-Understand Referring Expressions for Target Identifications.

Mikihiro Tanaka

,

Takayuki Itamochi

,

Kenichi Narioka

,

,

Yoshitaka Ushiku

,

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Strong-Weak Distribution Alignment for Adaptive Object Detection.

,

Yoshitaka Ushiku

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Label-Noise Robust Generative Adversarial Networks.

Takuhiro Kaneko

,

Yoshitaka Ushiku

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Class-Distinct and Class-Mutual Image Generation with GANs.

Takuhiro Kaneko

,

Yoshitaka Ushiku

,

Proceedings of the 30th British Machine Vision Conference 2019, 2019

Estimating the Causal Effect from Partially Observed Time Series.

,

,

Yoshitaka Ushiku

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Conditional Video Generation Using Action-Appearance Captions.

Shohei Yamamoto

,

Antonio Tejero-de-Pablos

,

Yoshitaka Ushiku

,

CoRR, 2018

Towards Human-Friendly Referring Expression Generation.

Mikihiro Tanaka

,

Takayuki Itamochi

,

Kenichi Narioka

,

,

Yoshitaka Ushiku

,

CoRR, 2018

Learning from Between-class Examples for Deep Sound Recognition.

,

Yoshitaka Ushiku

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Adversarial Dropout Regularization.

,

Yoshitaka Ushiku

,

,

Proceedings of the 6th International Conference on Learning Representations, 2018

Multichannel Semantic Segmentation with Unsupervised Domain Adaptation.

,

,

Yoshitaka Ushiku

,

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Visual Question Generation for Class Acquisition of Unknown Objects.

,

Antonio Tejero-de-Pablos

,

Yoshitaka Ushiku

,

Proceedings of the Computer Vision - ECCV 2018, 2018

Open Set Domain Adaptation by Backpropagation.

,

Shohei Yamamoto

,

Yoshitaka Ushiku

,

Proceedings of the Computer Vision - ECCV 2018, 2018

Generalized Bayesian Canonical Correlation Analysis with Missing Modalities.

Toshihiko Matsuura

,

,

Yoshitaka Ushiku

,

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Between-Class Learning for Image Classification.

,

Yoshitaka Ushiku

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Customized Image Narrative Generation via Interactive Visual Question Generation and Answering.

,

Yoshitaka Ushiku

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Maximum Classifier Discrepancy for Unsupervised Domain Adaptation.

,

,

Yoshitaka Ushiku

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Neural 3D Mesh Renderer.

,

Yoshitaka Ushiku

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Viewpoint-Aware Video Summarization.

Atsushi Kanehira

,

,

Yoshitaka Ushiku

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Hierarchical Video Generation From Orthogonal Information: Optical Flow and Texture.

Katsunori Ohnishi

,

Shohei Yamamoto

,

Yoshitaka Ushiku

,

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Alternating Circulant Random Features for Semigroup Kernels.

,

Yoshitaka Ushiku

,

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Melody Generation for Pop Music via Word Representation of Musical Properties.

,

Léopold Crestel

,

,

,

Katsunori Ohnishi

,

Masataka Yamaguchi

,

Masahiro Nakawaki

,

Yoshitaka Ushiku

,

CoRR, 2017

Multispectral Object Detection for Autonomous Vehicles.

Takumi Karasawa

,

,

,

Antonio Tejero-de-Pablos

,

Yoshitaka Ushiku

,

Proceedings of the Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

WebDNN: Fastest DNN Execution Framework on Web Browser.

Masatoshi Hidaka

,

Yuichiro Kikura

,

Yoshitaka Ushiku

,

Proceedings of the 2017 ACM on Multimedia Conference, 2017

MFNet: Towards real-time semantic segmentation for autonomous vehicles with multi-spectral scenes.

,

,

Takumi Karasawa

,

Yoshitaka Ushiku

,

Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Asymmetric Tri-training for Unsupervised Domain Adaptation.

,

Yoshitaka Ushiku

,

Proceedings of the 34th International Conference on Machine Learning, 2017

DualNet: Domain-invariant network for visual question answering.

,

,

Yoshitaka Ushiku

,

Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Spatial-Temporal Weighted Pyramid Using Spatial Orthogonal Pooling.

,

Yoshitaka Ushiku

,

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Deep Modality Invariant Adversarial Network for Shared Representation Learning.

,

,

,

Yoshitaka Ushiku

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Spatio-Temporal Person Retrieval via Natural Language Queries.

Masataka Yamaguchi

,

,

Yoshitaka Ushiku

,

Proceedings of the IEEE International Conference on Computer Vision, 2017

2016

The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA).

,

Yoshitaka Ushiku

,

CoRR, 2016

DeMIAN: Deep Modality Invariant Adversarial Network.

,

,

Yoshitaka Ushiku

,

CoRR, 2016

Image Captioning with Sentiment Terms via Weakly-Supervised Sentiment Dataset.

,

Yoshitaka Ushiku

,

Proceedings of the British Machine Vision Conference 2016, 2016

2015

Common Subspace for Model and Similarity: Phrase Learning for Caption Generation from Images.

Yoshitaka Ushiku

,

Masataka Yamaguchi

,

,

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014

Hard negative classes for multiple object detection.

,

,

Yoshitaka Ushiku

,

,

Hiroshi Muraoka

,

Yasuo Kuniyoshi

,

Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Three Guidelines of Online Learning for Large-Scale Visual Recognition.

Yoshitaka Ushiku

,

Masatoshi Hidaka

,

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2012

Efficient image annotation for automatic sentence generation.

Yoshitaka Ushiku

,

,

Yasuo Kuniyoshi

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

ISI at ImageCLEF 2012: Scalable System for Image Annotation.

Yoshitaka Ushiku

,

Hiroshi Muraoka

,

,

Teppei Fujisawa

,

,

,

Takayuki Higuchi

,

,

,

Yasuo Kuniyoshi

Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

2011

Automatic sentence generation from images.

Yoshitaka Ushiku

,

,

Yasuo Kuniyoshi

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Understanding images with natural sentences.

Yoshitaka Ushiku

,

,

Yasuo Kuniyoshi

Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Discriminative spatial pyramid.

,

Yoshitaka Ushiku

,

,

Yasuo Kuniyoshi

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

Improving image similarity measures for image browsing and retrieval through latent space learning between images and long texts.

Yoshitaka Ushiku

,

,

Yasuo Kuniyoshi

Proceedings of the International Conference on Image Processing, 2010
