Yuta Nakashima

Orcid: 0000-0001-8000-3567

According to our database1, Yuta Nakashima authored at least 137 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Instruct Me More! Random Prompting for Visual In-Context Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Revisiting Pixel-Level Contrastive Pre-Training on Scene Images.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Real-time estimation of the remaining surgery duration for cataract surgery using deep convolutional neural networks and long short-term memory.
BMC Medical Informatics Decis. Mak., December, 2023

Development of Cell Micropatterning Technique Using Laser Processing of Alginate Gel.
J. Robotics Mechatronics, October, 2023

Development of a Microfluidic Ion Current Measurement System for Single-Microplastic Detection.
J. Robotics Mechatronics, October, 2023

Special Issue on Bio-MEMS.
J. Robotics Mechatronics, October, 2023

ACT2G: Attention-based Contrastive Learning for Text-to-Gesture Generation.
Proc. ACM Comput. Graph. Interact. Tech., August, 2023

Multi-modal humor segment prediction in video.
Multim. Syst., August, 2023

Match them up: visually explainable few-shot image classification.
Appl. Intell., May, 2023

Stable Diffusion Exposed: Gender Bias from Prompt to Image.
CoRR, 2023

Situating the social issues of image generation models in the model life cycle: a sociotechnical approach.
CoRR, 2023

Improving Facade Parsing with Vision Transformers and Line Integration.
CoRR, 2023

Inference Time Evidences of Adversarial Attacks for Forensic on Transformers.
CoRR, 2023

Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Enhancing Fake News Detection in Social Media via Label Propagation on Cross-modal Tweet Graph.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Not Only Generative Art: Stable Diffusion for Content-Style Disentanglement in Art Analysis.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

ICDAR'23: Intelligent Cross-Data Analysis and Retrieval.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

Learning Bottleneck Concepts in Image Classification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Model-Agnostic Gender Debiased Image Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Uncurated Image-Text Datasets: Shedding Light on Demographic Bias.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Information Extraction from Public Meeting Articles.
SN Comput. Sci., 2022

Corpus Construction for Historical Newspapers: A Case Study on Public Meeting Corpus Construction Using OCR Error Correction.
SN Comput. Sci., 2022

Depthwise Spatio-Temporal STFT Convolutional Neural Networks for Human Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

The semantic typology of visually grounded paraphrases.
Comput. Vis. Image Underst., 2022

Learning More May Not Be Better: Knowledge Transferability in Vision and Language Tasks.
CoRR, 2022

Integration of Gesture Generation System Using Gesture Library with DIY Robot Design Kit.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2022

Tone Classification for Political Advertising Video using Multimodal Cues.
Proceedings of the ICDAR@ICMR 2022: Proceedings of the 3rd ACM Workshop on Intelligent Cross-Data Analysis and Retrieval, Newark, NJ, USA, June 27, 2022

ICDAR'22: Intelligent Cross-Data Analysis and Retrieval.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

A Japanese Dataset for Subjective and Objective Sentiment Polarity Classification in Micro Blog Domain.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Deep Gesture Generation for Social Robots Using Type-Specific Libraries.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Emotional Intensity Estimation based on Writer's Personality.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

Gender and Racial Bias in Visual Question Answering Datasets.
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Optimal Correction Cost for Object Detection Evaluation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Quantifying Societal Bias Amplification in Image Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Multi-label Disengagement and Behavior Prediction in Online Learning.
Proceedings of the Artificial Intelligence in Education - 23rd International Conference, 2022

2021
A comparative study of language transformers for video question answering.
Neurocomputing, 2021

Generation and Detection of Media Clones.
IEICE Trans. Inf. Syst., 2021

Preventing Fake Information Generation Against Media Clone Attacks.
IEICE Trans. Inf. Syst., 2021

A Picture May Be Worth a Hundred Words for Visual Question Answering.
CoRR, 2021

Development of a Vertex Finding Algorithm using Recurrent Neural Network.
CoRR, 2021

Understanding the Role of Scene Graphs in Visual Question Answering.
CoRR, 2021

Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation.
IEEE Access, 2021

Cross-Lingual Visual Grounding.
IEEE Access, 2021

The Laughing Machine: Predicting Humor in Video.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

WRIME: A New Dataset for Emotional Intensity Estimation with Subjective and Objective Annotations.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Built Year Prediction from Buddha Face with Heterogeneous Labels.
Proceedings of the SUMAC'21: Proceedings of the 3rd Workshop on Structuring and Understanding of Multimedia heritAge Contents, 2021

Image Retrieval by Hierarchy-aware Deep Hashing Based on Multi-task Learning.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Museum Experience into a Souvenir: Generating Memorable Postcards from Guide Device Behavior Log.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2021

Learners' Efficiency Prediction Using Facial Behavior Analysis.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

PoseRN: A 2D Pose Refinement Network For Bias-Free Multi-View 3D Human Pose Estimation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Visual Question Answering with Textual Representations for Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MTUNet: Few-Shot Image Classification With Visual Explanations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Transferring Domain-Agnostic Knowledge in Video Question Answering.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021

2020
Visually grounded paraphrase identification via gating and phrase localization.
Neurocomputing, 2020

ContextNet: representation and exploration for painting classification and retrieval in context.
Int. J. Multim. Inf. Retr., 2020

Grading the Severity of Arteriolosclerosis from Retinal Arterio-venous Crossing Patterns.
CoRR, 2020

Constructing a Visual Relationship Authenticity Dataset.
CoRR, 2020

Knowledge-Based Visual Question Answering in Videos.
CoRR, 2020

Improving topic modeling through homophily for legal documents.
Appl. Netw. Sci., 2020

BERT Representations for Video Question Answering.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

IterNet: Retinal Image Segmentation Utilizing Structural Redundancy in Vessel Networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Privacy Sensitive Large-Margin Model for Face De-Identification.
Proceedings of the Neural Computing for Advanced Applications, 2020

Joint Learning of Vessel Segmentation and Artery/Vein Classification with Post-processing.
Proceedings of the International Conference on Medical Imaging with Deep Learning, 2020

Constructing a Public Meeting Corpus.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Demographic Influences on Contemporary Art with Unsupervised Style Embeddings.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

A Dataset and Baselines for Visual Question Answering on Art.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions.
Proceedings of the Computer Vision - ECCV 2020, 2020

Yoga-82: A New Dataset for Fine-grained Classification of Human Poses.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Uncovering Hidden Challenges in Query-Based Video Moment Retrieval.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

IDSOU at WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets.
Proceedings of the Sixth Workshop on Noisy User-generated Text, 2020

KnowIT VQA: Answering Knowledge-Based Questions about Videos.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Historical and Modern Features for Buddha Statue Classification.
CoRR, 2019

Understanding Art through Multi-Modal Retrieval in Paintings.
CoRR, 2019

3D Image Reconstruction from Multi-focus Microscopic Images.
Proceedings of the Image and Video Technology, 2019

Human Shape Reconstruction with Loose Clothes from Partially Observed Data by Pose Specific Deformation.
Proceedings of the Image and Video Technology - 9th Pacific-Rim Symposium, 2019

BUDA.ART: A Multimodal Content Based Analysis and Retrieval System for Buddha Statues.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Historical and Modern Features for Buddha Statue Classification.
Proceedings of the 1st Workshop on Structuring and Understanding of Multimedia heritAge Contents, 2019

Context-Aware Embeddings for Automatic Art Analysis.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Facial Expression Recognition with Skip-Connection to Leverage Low-Level Features.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Rethinking the Evaluation of Video Summaries.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Legal Information as a Complex Network: Improving Topic Modeling Through Homophily.
Proceedings of the Complex Networks and Their Applications VIII, 2019

2018
Summarization of User-Generated Sports Video by Using Deep Action Recognition Features.
IEEE Trans. Multim., 2018

Iterative applications of image completion with CNN-based failure detection.
J. Vis. Commun. Image Represent., 2018

Finding Important People in a Video Using Deep Neural Networks with Conditional Random Fields.
IEICE Trans. Inf. Syst., 2018

Representing a Partially Observed Non-Rigid 3D Human Using Eigen-Texture and Eigen-Deformation.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

iParaphrasing: Extracting Visually Grounded Paraphrases via an Image.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Augmented Reality Marker Hiding with Texture Deformation.
IEEE Trans. Vis. Comput. Graph., 2017

Video summarization using textual descriptions for authoring video blogs.
Multim. Tools Appl., 2017

Increasing pose comprehension through augmented reality reenactment.
Multim. Tools Appl., 2017

ReMagicMirror: Action Learning Using Human Reenactment with the Mirror Metaphor.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Calcium signaling response of osteoblastic cells received with compressive stimuli.
Proceedings of the International Symposium on Micro-NanoMechatronics and Human Science, 2017

Fabrication of three-dimensional deformable microfilter for capturing target cells.
Proceedings of the International Symposium on Micro-NanoMechatronics and Human Science, 2017

Novel view synthesis with light-weight view-dependent texture mapping for a stereoscopic HMD.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Realtime Novel View Synthesis with Eigen-Texture Regression.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Privacy Protection for Social Video via Background Estimation and CRF-Based Videographer's Intention Modeling.
IEICE Trans. Inf. Syst., 2016

Evaluating Protection Capability for Visual Privacy Information.
IEEE Secur. Priv., 2016

Flexible human action recognition in depth video sequences using masked joint trajectories.
EURASIP J. Image Video Process., 2016

Human action recognition-based video summarization for RGB-D personal sports video.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Learning Joint Representations of Videos and Sentences with Web Image Search.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Video Summarization Using Deep Semantic Features.
Proceedings of the Computer Vision - ACCV 2016, 2016

3D shape template generation from RGB-D images capturing a moving and deforming object.
Proceedings of the 3D Image Processing, 2016

2015
AR image generation using view-dependent geometry modification and texture mapping.
Virtual Real., 2015

Protection and Utilization of Privacy Information via Sensing.
IEICE Trans. Inf. Syst., 2015

Measurement of cell mechanical properties by cell compression microdevice.
Proceedings of the 2015 International Symposium on Micro-NanoMechatronics and Human Science, 2015

AR Marker Hiding with Real-Time Texture Deformation.
Proceedings of the 2015 IEEE International Symposium on Mixed and Augmented Reality Workshops, 2015

Textual description-based video summarization for video blogs.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Facial expression preserving privacy protection using image melding.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

2014
Background Estimation for a Single Omnidirectional Image Sequence Captured with a Moving Camera.
IPSJ Trans. Comput. Vis. Appl., 2014

Evaluation of cell-cell or cell-substrate adhesion effect on cellular differentiation using a microwell array having convertible culture surface.
Proceedings of the 2014 International Symposium on Micro-NanoMechatronics and Human Science, 2014

Free-viewpoint AR human-motion reenactment based on a single RGB-D video stream.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

2013
Augmented reality image generation with virtualized real objects using view-dependent texture and geometry.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2013

Real-time privacy protection system for social videos using intentionally-captured persons detection.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Inferring what the videographer wanted to capture.
Proceedings of the IEEE International Conference on Image Processing, 2013

2012
Intended human object detection for automatically protecting privacy in mobile video surveillance.
Multim. Syst., 2012

Development of a dynamic conversion technique of cell culture surface using alginate thin film.
Proceedings of the International Symposium on Micro-NanoMechatronics and Human Science, 2012

Markov random field-based real-time detection of intentionally-captured persons.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

2011
Indoor Positioning System Using Digital Audio Watermarking.
IEICE Trans. Inf. Syst., 2011

Extracting intentionally captured regions using point trajectories.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Fabrication of a dynamic compression stimulus microdevice to cells for evaluating real-time cellular response.
Proceedings of the International Symposium on Micro-NanoMechatronics and Human Science, 2011

Automatic generation of privacy-protected videos using background estimation.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

2010
Automatically protecting privacy in consumer generated videos using intended human object detector.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Digital Diorama: Sensing-Based Real-World Visualization.
Proceedings of the Information Processing and Management of Uncertainty in Knowledge-Based Systems. Applications, 2010

Discriminating Intended Human Objects in Consumer Videos.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Real-Time User Position Estimation in Indoor Environments Using Digital Watermarking for Audio Signals.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Detecting intended human objects in human-captured videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Watermarked Movie Soundtrack Finds the Position of the Camcorder in a Theater.
IEEE Trans. Multim., 2009

A Fifth-Order G<sub>m</sub>-C Continuous-Time ΔΣ Modulator With Process-Insensitive Input Linear Range.
IEEE J. Solid State Circuits, 2009

2007
Maximum-Likelihood Estimation of Recording Position Based on Audio Watermarking.
Proceedings of the 3rd International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007), 2007

Determining Recording Location Based on Synchronization Positions of Audiowatermarking.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Estimation of recording location using audio watermarking.
Proceedings of the 8th workshop on Multimedia & Security, 2006

2005
Fabrication of a Microfluidic Device for Axonal Guidance.
J. Robotics Mechatronics, 2005


  Loading...