Yoshitaka Ushiku

ORCID: 0000-0002-9014-1389

Affiliations:
  • OMRON SINIC X Corp., Tokyo, Japan
  • University of Tokyo, Department of Mechano-Informatics, Japan (PhD 2014)


According to our database, Yoshitaka Ushiku authored at least 75 papers between 2010 and 2024.

Bibliography

2024
Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding.
CoRR, 2024

TNF: Tri-branch Neural Fusion for Multimodal Medical Data Classification.
CoRR, 2024

Unsupervised LLM Adaptation for Question Answering.
CoRR, 2024

2023
State-aware video procedural captioning.
Multim. Tools Appl., October, 2023

Analysis of Protein Folding Simulation with Moving Root Mean Square Deviation.
J. Chem. Inf. Model., March, 2023

A Transformer Model for Symbolic Regression towards Scientific Discovery.
CoRR, 2023

Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos.
CoRR, 2023

Vision-Language Interpreter for Robot Task Planning.
CoRR, 2023

WeaveNet for Approximating Two-sided Matching Problems.
CoRR, 2023

A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task.
CoRR, 2023

Noisy Universal Domain Adaptation via Divergence Optimization for Visual Recognition.
CoRR, 2023

Reference-based Dense Pose Estimation via Partial 3D Point Cloud Matching.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Robotic Powder Grinding with Audio-Visual Feedback for Laboratory Automation in Materials Science.
IROS, 2023

2022
Self-supervised learning of materials concepts from crystal structures via deep neural networks.
Mach. Learn. Sci. Technol., December, 2022

Neural Structure Fields with Application to Crystal Structure Autoencoders.
CoRR, 2022

Recipe Generation from Unsegmented Cooking Videos.
CoRR, 2022

Rethinking Symbolic Regression Datasets and Benchmarks for Scientific Discovery.
CoRR, 2022

Edge-Selective Feature Weaving for Point Cloud Matching.
CoRR, 2022

Edge Computing-Assisted DNN Image Recognition System With Progressive Image Retransmission.
IEEE Access, 2022

Robotic Powder Grinding with a Soft Jig for Laboratory Automation in Material Science.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Cross-modal Representation Learning for Understanding Manufacturing Procedure.
Proceedings of the Cross-Cultural Design. Applications in Learning, Arts, Cultural Heritage, Creative Industries, and Virtual Reality, 2022

The Effect of Improving Annotation Quality on Object Detection Datasets: A Preliminary Study.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Crowd Density Forecasting by Modeling Patch-Based Dynamics.
IEEE Robotics Autom. Lett., 2021

Foreground-Aware Stylization and Consensus Pseudo-Labeling for Domain Adaptation of First-Person Hand Segmentation.
IEEE Access, 2021

Structure-Aware Procedural Text Generation From an Image Sequence.
IEEE Access, 2021

Retransmission Edge Computing System Conducting Adaptive Image Compression Based on Image Recognition Accuracy.
Proceedings of the 94th IEEE Vehicular Technology Conference, 2021

Egocentric Biochemical Video-and-Language Dataset.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Removing Word-Level Spurious Alignment between Images and Pseudo-Captions in Unsupervised Image Captioning.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Divergence Optimization for Noisy Universal Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Visual Grounding Annotation of Recipe Flow Graph.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
How narratives move your mind: A corpus of shared-character stories for connecting emotional flow and interestingness.
Inf. Process. Manag., 2019

Decentralized Learning of Generative Adversarial Networks from Multi-Client Non-iid Data.
CoRR, 2019

Pose Graph Optimization for Unsupervised Monocular Visual Odometry.
Proceedings of the International Conference on Robotics and Automation, 2019

Generating Easy-to-Understand Referring Expressions for Target Identifications.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Strong-Weak Distribution Alignment for Adaptive Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Label-Noise Robust Generative Adversarial Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Class-Distinct and Class-Mutual Image Generation with GANs.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Estimating the Causal Effect from Partially Observed Time Series.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Conditional Video Generation Using Action-Appearance Captions.
CoRR, 2018

Towards Human-Friendly Referring Expression Generation.
CoRR, 2018

Learning from Between-class Examples for Deep Sound Recognition.
Proceedings of the 6th International Conference on Learning Representations, 2018

Adversarial Dropout Regularization.
Proceedings of the 6th International Conference on Learning Representations, 2018

Multichannel Semantic Segmentation with Unsupervised Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Visual Question Generation for Class Acquisition of Unknown Objects.
Proceedings of the Computer Vision - ECCV 2018, 2018

Open Set Domain Adaptation by Backpropagation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Generalized Bayesian Canonical Correlation Analysis with Missing Modalities.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Between-Class Learning for Image Classification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Customized Image Narrative Generation via Interactive Visual Question Generation and Answering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Maximum Classifier Discrepancy for Unsupervised Domain Adaptation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Neural 3D Mesh Renderer.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Viewpoint-Aware Video Summarization.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Hierarchical Video Generation From Orthogonal Information: Optical Flow and Texture.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Alternating Circulant Random Features for Semigroup Kernels.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Melody Generation for Pop Music via Word Representation of Musical Properties.
CoRR, 2017

Multispectral Object Detection for Autonomous Vehicles.
Proceedings of the Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

WebDNN: Fastest DNN Execution Framework on Web Browser.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

MFNet: Towards real-time semantic segmentation for autonomous vehicles with multi-spectral scenes.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Asymmetric Tri-training for Unsupervised Domain Adaptation.
Proceedings of the 34th International Conference on Machine Learning, 2017

DualNet: Domain-invariant network for visual question answering.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Spatial-Temporal Weighted Pyramid Using Spatial Orthogonal Pooling.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Deep Modality Invariant Adversarial Network for Shared Representation Learning.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Spatio-Temporal Person Retrieval via Natural Language Queries.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA).
CoRR, 2016

DeMIAN: Deep Modality Invariant Adversarial Network.
CoRR, 2016

Image Captioning with Sentiment Terms via Weakly-Supervised Sentiment Dataset.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
Common Subspace for Model and Similarity: Phrase Learning for Caption Generation from Images.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Hard negative classes for multiple object detection.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Three Guidelines of Online Learning for Large-Scale Visual Recognition.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2012
Efficient image annotation for automatic sentence generation.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

ISI at ImageCLEF 2012: Scalable System for Image Annotation.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

2011
Automatic sentence generation from images.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Understanding images with natural sentences.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Discriminative spatial pyramid.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Improving image similarity measures for image browsing and retrieval through latent space learning between images and long texts.
Proceedings of the International Conference on Image Processing, 2010

