Yezhou Yang

Proceedings of the 27th IEEE International Conference on Intelligent Transportation Systems, 2024

TROPE: TRaining-Free Object-Part Enhancement for Seamlessly Improving Fine-Grained Zero-Shot Image Captioning.

Joshua Feinglass

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model.

Sheng Cheng

Maitreya Patel

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model.

Changhoon Kim

Kyle Min

Proceedings of the Computer Vision - ECCV 2024, 2024

Getting it Right: Improving Spatial Consistency in Text-to-Image Models.

Agneet Chatterjee

Gabriela Ben Melech Stan

Proceedings of the Computer Vision - ECCV 2024, 2024

REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

Recent Event Camera Innovations: A Survey.

Shri Ajay Kumar Ravindhiran

Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024

Evaluating Multimodal Large Language Models across Distribution Shifts and Augmentations.

Fenil Denish Bardoliya

Nagaraju Machavarapu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

eTraM: Event-Based Traffic Monitoring Dataset.

Aayush Atul Verma

Arpitsinh Vaghela

Hua Wei

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Grounding Stylistic Domain Generalization with Quantitative Domain Shift Measures and Synthetic Scene Images.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

'Eyes of a Hawk and Ears of a Fox': Part Prototype Network for Generalized Zero-Shot Learning.

Joshua Feinglass

Jayaraman J. Thiagarajan

Rushil Anirudh

T. S. Jayram

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Mole Recruitment: Poisoning of Image Classifiers via Selective Batch Sampling.

CoRR, 2023

Improving Diversity with Adversarially Learned Transformations for Domain Generalization.

Tejas Gokhale

Rushil Anirudh

Jayaraman J. Thiagarajan

Bhavya Kailkhura

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras.

Himanshu Pahadia

Duo Lu

Proceedings of the 26th IEEE International Conference on Intelligent Transportation Systems, 2023

CAROM Air - Vehicle Localization and Traffic Scene Reconstruction from Aerial Videos.

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Attributing Image Generative Models using Latent Fingerprints.

Proceedings of the International Conference on Machine Learning, 2023

Adversarial Bayesian Augmentation for Single-Source Domain Generalization.

Sheng Cheng

Tejas Gokhale

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

End-to-end Knowledge Retrieval with Multi-modal Queries.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

A Coulomb Force Inspired Loss Function for High-Performance Pedestrian Detection.

IEEE Signal Process. Lett., 2022

Benchmarking Spatial Relationships in Text-to-Image Generation.

CoRR, 2022

Learning Action-Effect Dynamics from Pairs of Scene-graphs.

Pratyay Banerjee

CoRR, 2022

Reasoning about Actions over Visual and Linguistic Modalities: A Survey.

CoRR, 2022

Targeted Attack on Deep RL-based Autonomous Driving with Learned Visual Patterns.

Prasanth Buddareddygari

Travis Zhang

Yi Ren

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

CAVAN: Commonsense Knowledge Anchored Video Captioning.

Huiliang Shao

Zhiyuan Fang

Proceedings of the 26th International Conference on Pattern Recognition, 2022

Attributable Watermarking of Speech Generative Models.

Proceedings of the IEEE International Conference on Acoustics, 2022

Learning Action-Effect Dynamics for Hypothetical Vision-Language Reasoning Task.

Pratyay Banerjee

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Injecting Semantic Concepts into End-to-End Image Captioning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SSR-GNNs: Stroke-based Sketch Representation with Graph Neural Networks.

Sheng Cheng

Yi Ren

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos.

Arnav Chakravarthy

Zhiyuan Fang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Good, Better, Best: Textual Distractors Generation for Multiple-Choice Visual Question Answering via Reinforcement Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

To Find Waldo You Need Contextual Cues: Debiasing Who's Waldo.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Semantically Distributed Robust Optimization for Vision-and-Language Inference.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

Efficient Robotic Object Search Via HIEM: Hierarchical Policy Learning With Intrinsic-Extrinsic Modeling.

IEEE Robotics Autom. Lett., 2021

CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images.

Akshay Kumar

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

CAROM - Vehicle Localization and Traffic Scene Reconstruction from Monocular Cameras on Road Infrastructures.

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Decentralized Attribution of Generative Models.

Changhoon Kim

Yi Ren

Proceedings of the 9th International Conference on Learning Representations, 2021

SEED: Self-supervised Distillation For Visual Representation.

Proceedings of the 9th International Conference on Learning Representations, 2021

Compressing Visual-linguistic Model via Knowledge Distillation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Weakly Supervised Relative Spatial Reasoning for Visual Question Answering.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Hierarchical and Partially Observable Goal-Driven Policy Learning With Goals Relational Graph.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

First Workshop on Knowledge Injection in Neural Networks (KINN).

Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

SMURF: SeMantic and linguistic UndeRstanding Fusion for Caption Evaluation via Typicality Analysis.

Joshua Feinglass

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

WeaQA: Weak Supervision via Captions for Visual Question Answering.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations.

Tejas Gokhale

Rushil Anirudh

Bhavya Kailkhura

Jayaraman J. Thiagarajan

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Neural Style Transfer: A Review.

IEEE Trans. Vis. Comput. Graph., 2020

Low to High Dimensional Modality Hallucination Using Aggregated Fields of View.

Kausic Gunasekar

Qiang Qiu

IEEE Robotics Autom. Lett., 2020

Fine-grained visual understanding and reasoning.

Neurocomputing, 2020

Self-Supervised VQA: Answering Visual Questions using Images and Captions.

CoRR, 2020

Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling.

CoRR, 2020

Low to High Dimensional Modality Hallucination using Aggregated Fields of View.

Kausic Gunasekar

Qiang Qiu

CoRR, 2020

Weak Supervision and Referring Attention for Temporal-Textual Association Learning.

CoRR, 2020

Resisting the Distracting-factors in Pedestrian Detection.

Zhe Wang

Jun Wang

CoRR, 2020

Diverse Visuo-Linguistic Question Answering (DVLQA) Challenge.

Shailaja Sampat

Mohammad Farhadi Bajestani

CoRR, 2020

memeBot: Towards Automatic Image Meme Generation.

CoRR, 2020

Enabling Incremental Knowledge Transfer for Object Detection at the Edge.

Mehdi Ghasemi

Sarma B. K. Vrudhula

CoRR, 2020

From Seeing to Moving: A Survey on Learning for Visual Indoor Navigation (VIN).

CoRR, 2020

TKD: Temporal Knowledge Distillation for Active Perception.

Mohammad Farhadi

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Learning hierarchical behavior and motion planning for autonomous driving.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Visuo-Linguistic Question Answering (VLQA) Challenge.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

MUTANT: A Training Paradigm for Out-of-Distribution Generalization in Visual Question Answering.

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning.

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language.

Proceedings of the Computer Vision - ECCV 2020, 2020

VQA-LOL: Visual Question Answering Under the Lens of Logic.

Proceedings of the Computer Vision - ECCV 2020, 2020

Enabling Incremental Knowledge Transfer for Object Detection at the Edge.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Interpretable Partitioned Embedding for Intelligent Multi-item Fashion Outfit Composition.

ACM Trans. Multim. Comput. Commun. Appl., 2019

Robot learning of manipulation activities with overall planning through precedence graph.

Zhe Lin

Robotics Auton. Syst., 2019

A survey on semantic-based methods for the understanding of human movements.

Karinne Ramírez-Amaro

Gordon Cheng

Robotics Auton. Syst., 2019

GAPLE: Generalizable Approaching Policy LEarning for Robotic Object Searching in Indoor Environment.

IEEE Robotics Autom. Lett., 2019

Good, Better, Best: Textual Distractors Generation for Multi-Choice VQA via Policy Gradient.

CoRR, 2019

Blocksworld Revisited: Learning and Reasoning to Generate Event-Sequences from Image Pairs.

CoRR, 2019

Fluorescence Image Histology Pattern Transformation using Image Style Transfer.

Evgenii Belykh

Xiaochun Zhao

Leandro Borba Moreira

CoRR, 2019

Active Adversarial Evader Tracking with a Probabilistic Pursuer under the Pursuit-Evasion Game Framework.

Varun Chandra Jammula

Anshul Rai

CoRR, 2019

Improving Model Robustness with Transformation-Invariant Attacks.

CoRR, 2019

How Shall I Drive? Interaction Modeling and Motion Planning towards Empathetic and Socially-Graceful Driving.

CoRR, 2019

Spatial Knowledge Distillation to Aid Visual Reasoning.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Integrating Knowledge and Reasoning in Image Understanding.

Somak Aditya

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

How Shall I Drive? Interaction Modeling and Motion Planning towards Empathetic and Socially-Graceful Driving.

Proceedings of the International Conference on Robotics and Automation, 2019

Image Decomposition and Classification Through a Generative Model.

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

A Novel Design of Adaptive and Hierarchical Convolutional Neural Networks using Partial Reconfiguration on FPGA.

Mohammad Farhadi

Mehdi Ghasemi

Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

Cooking With Blocks: A Recipe for Visual Reasoning on Image-Pairs.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Modularized Textual Grounding for Counterfactual Resilience.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Convolutional neural networks: Ensemble modeling, fine-tuning and unsupervised semantic localization for neurosurgical CLE images.

J. Vis. Commun. Image Represent., 2018

Prediction of Manipulation Actions.

Cornelia Fermüller

Fang Wang

Konstantinos Zampogiannis

Yi Zhang

Francisco Barranco

Michael Pfeiffer

Int. J. Comput. Vis., 2018

Image Understanding using vision and reasoning through Scene Description Graph.

Comput. Vis. Image Underst., 2018

Interpretable Partitioned Embedding for Customized Fashion Outfit Composition.

CoRR, 2018

Weakly Supervised Attention Learning for Textual Phrases Grounding.

CoRR, 2018

Prospects for Theranostics in Neurosurgical Technology: Empowering Confocal Laser Endomicroscopy Diagnostics via Deep Learning.

CoRR, 2018

Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields.

CoRR, 2018

DeepSIC: Deep Semantic Image Compression.

Sihui Luo

Mingli Song

CoRR, 2018

Combining Knowledge and Reasoning through Probabilistic Soft Logic for Image Puzzle Solving.

Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Interpretable Partitioned Embedding for Customized Multi-item Fashion Outfit Composition.

Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018

Weakly-Supervised Learning-Based Feature Localization for Confocal Laser Endomicroscopy Glioma Images.

Leandro Borba Moreira

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

Active Object Perceiver: Recognition-Guided Policy Learning for Object Searching on Mobile Robots.

Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Extrinsic Dexterity Through Active Slip Control Using Deep Predictive Models.

Simon Stepputtis

Heni Ben Amor

Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

DeepSIC: Deep Semantic Image Compression.

Proceedings of the Neural Information Processing - 25th International Conference, 2018

DeepSSH: Deep Semantic Structured Hashing for Explainable Person Re-Identification.

Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields.

Proceedings of the Computer Vision - ECCV 2018, 2018

Transductive Unbiased Embedding for Zero-Shot Learning.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering.

Somak Aditya

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Unsupervised Linking of Visual Features to Textual Descriptions in Long Manipulation Activities.

Eren Erdal Aksoy

Ekaterina Ovchinnikova

Adil Orhan

Tamim Asfour

IEEE Robotics Autom. Lett., 2017

TripletGAN: Training Generative Model with Triplet Loss.

CoRR, 2017

Convolutional Neural Networks: Ensemble Modeling, Fine-Tuning and Unsupervised Semantic Localization.

CoRR, 2017

On the Importance of Consistency in Training Deep Neural Networks.

CoRR, 2017

Neural Style Transfer: A Review.

CoRR, 2017

Improving utility of brain tumor confocal laser endomicroscopy: objective value assessment and diagnostic frame detection with convolutional neural networks.

Proceedings of the Medical Imaging 2017: Computer-Aided Diagnosis, 2017

What can I do around here? Deep functional scene understanding for cognitive robots.

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Fast task-specific target detection via graph based constraints representation and checking.

Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Collision-free trajectory planning in human-robot interaction through hand movement prediction from vision.

Proceedings of the 17th IEEE-RAS International Conference on Humanoid Robotics, 2017

Hand Movement Prediction Based Collision-Free Human-Robot Interaction.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016

What Can I Do Around Here? Deep Functional Scene Understanding for Cognitive Robots.

CoRR, 2016

Answering Image Riddles using Vision and Reasoning through Probabilistic Soft Logic.

CoRR, 2016

LightNet: A Versatile, Standalone Matlab-based Environment for Deep Learning.

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Co-active learning to adapt humanoid movement for manipulation.

Proceedings of the 16th IEEE-RAS International Conference on Humanoid Robots, 2016

Reliable Attribute-Based Object Recognition Using High Predictive Value Classifiers.

Proceedings of the Computer Vision - ECCV 2016, 2016

2015

Manipulation Action Understanding for Observation and Execution.

Konstantinos Zampogiannis

PhD thesis, 2015

Neural Self Talk: Image Understanding via Continuous Questioning and Answering.

CoRR, 2015

From Images to Sentences through Scene Description Graphs using Commonsense Reasoning and Knowledge.

CoRR, 2015

Learning the spatial semantics of manipulation actions through preposition grounding.

Cornelia Fermüller

Yiannis Aloimonos

Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Grasp type revisited: A modern perspective on a classical feature for vision.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Learning the Semantics of Manipulation Action.

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Visual Commonsense for Scene Understanding Using Perception, Semantic Parsing and Reasoning.

Proceedings of the 2015 AAAI Spring Symposia, 2015

Robot Learning Manipulation Action Plans by "Watching" Unconstrained Videos from the World Wide Web.

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Low-level and high-level prior learning for visual saliency estimation.

Inf. Sci., 2014

Manipulation action tree bank: A knowledge resource for humanoids.

Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

Learning hand movements from markerless demonstrations for humanoid tasks.

Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014

2013

Color-to-gray based on chance of happening preservation.

Neurocomputing, 2013

Minimalist plans for interpreting manipulation actions.

Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2013

Robots with language: Multi-label visual recognition using NLP.

Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Detection of Manipulation Action Consequences (MAC).
