Song-Chun Zhu

According to our database1, Song-Chun Zhu authored at least 338 papers between 1994 and 2020.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2020
Learning to infer human attention in daily activities.
Pattern Recognit., 2020

Cooperative Training of Descriptor and Generator Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities.
CoRR, 2020

Joint Mind Modeling for Explanation Generation in Complex Human-Robot Collaborative Tasks.
CoRR, 2020

Human-Robot Interaction in a Shared Augmented Reality Workspace.
CoRR, 2020

A Competence-aware Curriculum for Visual Concepts Learning via Question Answering.
CoRR, 2020

A Representational Model of Grid Cells Based on Matrix Lie Algebras.
CoRR, 2020

Learning Latent Space Energy-Based Prior Model.
CoRR, 2020

Learning Energy-based Model with Flow-based Backbone by Neural Transport MCMC.
CoRR, 2020

Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning.
CoRR, 2020

Joint Training of Variational Auto-Encoder and Latent Energy-Based Model.
CoRR, 2020

Stochastic Security: Adversarial Defense Using Long-Run Dynamics of Energy-Based Models.
CoRR, 2020

Joint Inference of States, Robot Knowledge, and Human (False-)Beliefs.
CoRR, 2020

Congestion-aware Evacuation Routing using Augmented Reality Devices.
CoRR, 2020

Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense.
CoRR, 2020

Generative PointNet: Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification.
CoRR, 2020

Emergence of Pragmatics from Referential Game between Theory of Mind Agents.
CoRR, 2020

Words Aren't Enough, Their Order Matters: On the Robustness of Grounding Visual Referring Expressions.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Machine Number Sense: A Dataset of Visual Arithmetic Problems for Abstract and Relational Reasoning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Motion-Based Generator Model: Unsupervised Disentanglement of Appearance, Trackable and Intrackable Motions in Dynamic Patterns.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

On the Anatomy of MCMC-Based Maximum Likelihood Learning of Energy-Based Models.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Theory-Based Causal Transfer: Integrating Instance-Level Induction and Abstract-Level Structure Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A tale of two explanations: Enhancing human trust by explaining robot behavior.
Sci. Robotics, 2019

Learning Deep Generative Models with Short Run Inference Dynamics.
CoRR, 2019

Representation Learning: A Statistical Perspective.
CoRR, 2019

Learning Energy-based Spatial-Temporal Generative ConvNets for Dynamic Patterns.
CoRR, 2019

X-ToM: Explaining with Theory-of-Mind for Gaining Justified Human Trust.
CoRR, 2019

Towards Interpretable Image Synthesis by Learning Sparsely Connected AND-OR Networks.
CoRR, 2019

HUGE2: a Highly Untangled Generative-model Engine for Edge-computing.
CoRR, 2019

On Learning Non-Convergent Short-Run MCMC Toward Energy-Based Model.
CoRR, 2019

VRKitchen: an Interactive 3D Virtual Environment for Task-oriented Learning.
CoRR, 2019

Visual Discourse Parsing.
CoRR, 2019

Learning Vector Representation of Content and Matrix Representation of Change: Towards a Representational Model of V1.
CoRR, 2019

Multimodal Conditional Learning with Fast Thinking Policy-like Model and Slow Thinking Planner-like Model.
CoRR, 2019

Inducing Sparse Coding and And-Or Grammar from Generator Network.
CoRR, 2019

Interpretable CNNs.
CoRR, 2019

Explaining AlphaGo: Interpreting Contextual Effects in Neural Networks.
CoRR, 2019

Learning Perceptual Inference by Contrasting.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Non-Convergent Non-Persistent Short-Run MCMC Toward Energy-Based Model.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Virtual Grasp with Failed Demonstrations via Bayesian Inverse Reinforcement Learning.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Self-Supervised Incremental Learning for Sound Source Localization in Complex Indoor Environment.
Proceedings of the International Conference on Robotics and Automation, 2019

High-Fidelity Grasping in Virtual Reality using a Glove-based System.
Proceedings of the International Conference on Robotics and Automation, 2019

Learning Grid Cells as Vector Representation of Self-Position Coupled with Matrix Representation of Self-Motion.
Proceedings of the 7th International Conference on Learning Representations, 2019

DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Understanding Human Gaze Communication by Spatio-Temporal Graph Reasoning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Holistic++ Scene Understanding: Single-View 3D Holistic Scene Parsing and Human Pose Estimation With Human-Object Interaction and Physical Commonsense.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sparse Winograd Convolutional Neural Networks on Small-scale Systolic Arrays.
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

Reasoning Visual Dialogs With Structural and Partial Observations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

RAVEN: A Dataset for Relational and Analogical Visual REasoNing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Unsupervised Disentangling of Appearance and Geometry by Deformable Generator Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Divergence Triangle for Joint Training of Generator Model, Energy-Based Model, and Inferential Model.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Natural Language Interaction with Explainable AI Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Explainable AI as Collaborative Task Solving.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Partitioning the Perception of Physical and Social Events Within a Unified Psychological Space.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

Decomposing Human Causal Learning: Bottom-up Associative Learning and Top-down Schema Reasoning.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

VRGym: a virtual testbed for physical and interactive AI.
Proceedings of the ACM Turing Celebration Conference - China, 2019

MetaStyle: Three-Way Trade-off among Speed, Flexibility, and Quality in Neural Style Transfer.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning Dynamic Generator Model by Alternating Back-Propagation through Time.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Recognizing Unseen Attribute-Object Pair with Generative Model.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Mirroring without Overimitation: Learning Functionally Equivalent Manipulation Actions.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Perception of Human Interaction Based on Motion Trajectories: From Aerial Videos to Decontextualized Animations.
topiCS, 2018

Learning and Inferring "Dark Matter" and Predicting Human Intents and Trajectories in Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Attribute And-Or Grammar for Joint Parsing of Human Pose, Parts and Attributes.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Single-View 3D Scene Reconstruction and Parsing by Attribute Grammar.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Visual interpretability for deep learning: a survey.
Frontiers Inf. Technol. Electron. Eng., 2018

Configurable 3D Scene Synthesis and 2D Image Rendering with Per-pixel Ground Truth Using Stochastic Grammars.
Int. J. Comput. Vis., 2018

Mining deep And-Or object structures via cost-sensitive question-answer-based active annotations.
Comput. Vis. Image Underst., 2018

Divergence Triangle for Joint Training of Generator Model, Energy-based Model, and Inference Model.
CoRR, 2018

Explanatory Graphs for CNNs.
CoRR, 2018

Mining Interpretable AOG Representations from Convolutional Networks via Active Question Answering.
CoRR, 2018

Deeper Interpretability of Deep Networks.
CoRR, 2018

Learning Grid-like Units with Vector Representation of Self-Position and Matrix Representation of Self-Motion.
CoRR, 2018

A Tale of Three Probabilistic Families: Discriminative, Descriptive and Generative Models.
CoRR, 2018

Interactive Agent Modeling by Learning to Probe.
CoRR, 2018

Deformable Generator Network: Unsupervised Disentanglement of Appearance and Geometry.
CoRR, 2018

Unsupervised Learning of Neural Networks to Explain Neural Networks.
CoRR, 2018

Network Transplanting.
CoRR, 2018

Building a Telescope to Look Into High-Dimensional Image Spaces.
CoRR, 2018

Interpreting CNNs via Decision Trees.
CoRR, 2018

Spatially Perturbed Collision Sounds Attenuate Perceived Causality in 3D Launching Events.
Proceedings of the 2018 IEEE Conference on Virtual Reality and 3D User Interfaces, 2018

Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Unsupervised Learning of Hierarchical Models for Hand-Object Interactions.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Intent-Aware Multi-Agent Reinforcement Learning.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Interactive Robot Knowledge Patching Using Augmented Reality.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning Human-Object Interactions by Graph Parsing Neural Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image.
Proceedings of the Computer Vision - ECCV 2018, 2018

Interpretable Convolutional Neural Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

A Causal And-Or Graph Model for Visibility Fluent Reasoning in Tracking Interacting Objects.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Descriptor Networks for 3D Shape Synthesis and Analysis.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Where and Why Are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Human-Centric Indoor Scene Synthesis Using Stochastic Grammar.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Generative ConvNets via Multi-Grid Modeling and Sampling.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Inferring Shared Attention in Social Scene Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Human Causal Transfer: Challenges for Deep Reinforcement Learning.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

Examining CNN Representations With Respect to Dataset Bias.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Interpreting CNN Knowledge via an Explanatory Graph.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Scene-Centric Joint Parsing of Cross-View Videos.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Tracking Occluded Objects and Recovering Incomplete Trajectories by Reasoning About Containment Relations and Human Actions.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Learning Pose Grammar to Encode Human Body Configuration for 3D Pose Estimation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
The Martian: Examining Human Physical Judgments across Virtual Gravity Fields.
IEEE Trans. Vis. Comput. Graph., 2017

Joint Image-Text News Topic Detection and Tracking by Multimodal Topic And-Or Graph.
IEEE Trans. Multimedia, 2017

Online Object Tracking, Learning and Parsing with And-Or Graphs.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Modeling 4D Human-Object Interactions for Joint Event Segmentation, Recognition, and Object Localization.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Learning Knowledge-guided Pose Grammar Machine for 3D Human Pose Estimation.
CoRR, 2017

Learning Multi-grid Generative ConvNets by Minimal Contrastive Divergence.
CoRR, 2017

A Causal And-Or Graph Model for Visibility Fluent Reasoning in Human-Object Interactions.
CoRR, 2017

Joint Parsing of Cross-view Scenes with Spatio-temporal Semantic Parse Graphs.
CoRR, 2017

A Cost-Sensitive Visual Question-Answer Framework for Mining a Deep And-OR Object Semantics from Web Images.
CoRR, 2017

Interactively Transferring CNN Patterns for Part Localization.
CoRR, 2017

Configurable, Photorealistic Image Rendering and Ground Truth Synthesis by Sampling Stochastic Grammars Representing Indoor Scenes.
CoRR, 2017

A glove-based system for studying hand-object manipulation via joint pose and force sensing.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Feeling the force: Integrating force and pose for fluent discovery through imitation learning to open medicine bottles.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Single-Image 3D Scene Parsing Using Geometric Commonsense.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Inferring Human Attention by Learning Latent Intentions.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Learning social affordance grammar from videos: Transferring human interactions to human-robot interactions.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Predicting Human Activities Using Stochastic Grammar.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Monocular 3D Human Pose Estimation by Predicting Depth on Joints.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Jointly Recognizing Object Fluents and Tasks in Egocentric Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Mining Object Parts from CNNs via Active Question-Answering.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Generative Hierarchical Learning of Sparse FRAME Models.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

CERN: Confidence-Energy Recurrent Network for Group Activity Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Inferring Hidden Statuses and Actions in Video by Causal Reasoning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics.
Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

Inferring Human Interaction from Motion Trajectories in Aerial Videos.
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Visuomotor Adaptation and Sensory Recalibration in Reversed Hand Movement Task.
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Consistent Probabilistic Simulation Underlying Human Judgment in Substance Dynamics.
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Inferring Context Through Scene Understanding.
Proceedings of the 2017 AAAI Spring Symposia, 2017

Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Cross-View People Tracking by Scene-Centered Spatio-Temporal Parsing.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Alternating Back-Propagation for Generator Network.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Learning Perceptual Causality from Video.
ACM Trans. Intell. Syst. Technol., 2016

A Reconfigurable Tangram Model for Scene Representation and Categorization.
IEEE Trans. Image Process., 2016

Learning And-Or Model to Represent Context and Occlusion for Car Detection and Viewpoint Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Multi-Shot Mining Semantic Part Concepts in CNNs.
CoRR, 2016

Synthesizing Dynamic Textures and Sounds by Spatial-Temporal Generative ConvNet.
CoRR, 2016

Modeling and Inferring Human Intents and Latent Functional Objects for Trajectory Prediction.
CoRR, 2016

Cooperative Training of Descriptor and Generator Networks.
CoRR, 2016

Attribute And-Or Grammar for Joint Parsing of Human Attributes, Part and Pose.
CoRR, 2016

Learning Generative ConvNet with Continuous Latent Factors by Alternating Back-Propagation.
CoRR, 2016

Evaluating physical quantities and learning human utilities from RGBD videos.
Proceedings of the SIGGRAPH ASIA 2016, Macao, December 5-8, 2016, 2016

A virtual reality platform for dynamic human-scene interaction.
Proceedings of the SIGGRAPH ASIA 2016, Macao, December 5-8, 2016, 2016

Grounded Semantic Role Labeling.
Proceedings of the NAACL HLT 2016, 2016

Inferring human intent from video by sampling hierarchical plans.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Learning Social Affordance for Human-Robot Interaction.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

What Is Where: Inferring Containment Relations from Videos.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Robot learning with a spatial, temporal, and causal and-or graph.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

A Theory of Generative ConvNet.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Jointly Learning Grounded Task Structures from Language Instruction and Visual Demonstration.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Inferring Forces and Learning Human Utilities from Videos.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Multi-view People Tracking via Hierarchical Trajectory Composition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recognizing Car Fluents from Video.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Critical Features of Joint Actions that Signal Human Interaction.
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

Probabilistic Simulation Predicts Human Performance on Viscous Fluid-Pouring Problem.
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

Learning FRAME Models Using CNN Filters.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Task Learning through Visual Demonstration and Situated Dialogue.
Proceedings of the Symbiotic Cognitive Systems, 2016

2015
And-Or Graph Face Model and Its Applications in Artistic Sketching and Aging Simulation.
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Learning Near-Optimal Cost-Sensitive Decision Policy for Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Learning Hierarchical Space Tiling for Scene Modeling, Parsing and Attribute Tagging.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Learning 3D Object Templates by Quantizing Geometry and Appearance Spaces.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Video Primal Sketch: A Unified Middle-Level Representation for Video.
J. Math. Imaging Vis., 2015

Scene Understanding by Reasoning Stability and Safety.
Int. J. Comput. Vis., 2015

Learning Sparse FRAME Models for Natural Image Patterns.
Int. J. Comput. Vis., 2015

Learning And-Or Models to Represent Context and Occlusion for Car Detection and Viewpoint Estimation.
CoRR, 2015

A Restricted Visual Turing Test for Deep Scene and Event Understanding.
CoRR, 2015

Learning FRAME Models Using CNN Filters for Knowledge Visualization.
CoRR, 2015

Joint Image-Text News Topic Detection and Tracking with And-Or Graph Representation.
CoRR, 2015

Mining And-Or Graphs for Graph Matching and Object Discovery.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Attributed Grammars for Joint Estimation of Human Attributes, Part and Pose.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Automated Facial Trait Judgment and Election Outcome Prediction: Social Dimensions of Face.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Understanding tools: Task-oriented object modeling, learning and recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Joint inference of groups, events and human roles in aerial videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Joint action recognition and pose estimation from video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Evaluating Human Cognition of Containing Relations with Physical Simulation.
Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015

Represent and Infer Human Theory of Mind for Human-Robot Interaction.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

A Unified Framework for Human-Robot Knowledge Transfer.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

2014
Animated Pose Templates for Modeling and Detecting Human Actions.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Joint Video and Text Parsing for Understanding Events and Answering Queries.
IEEE Multim., 2014

Mapping Energy Landscapes of Non-Convex Learning Problems.
CoRR, 2014

Detecting potential falling objects by inferring human action and natural disturbance.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Mapping the Energy Landscape of Non-convex Optimization Problems.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2014

Integrating Context and Occlusion for Car Detection by Hierarchical And-Or Model.
Proceedings of the Computer Vision - ECCV 2014, 2014

Learning Inhomogeneous FRAME Models for Object Patterns.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Cross-View Action Modeling, Learning, and Recognition.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Single-View 3D Scene Parsing by Attributed Grammar.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Visual Persuasion: Inferring Communicative Intents of Images.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Unsupervised Learning of Dictionaries of Hierarchical Compositional Models.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Video Stylization: Painterly Rendering and Optimization With Content Extraction.
IEEE Trans. Circuits Syst. Video Techn., 2013

Abstract painting with interactive control of perceptual entropy.
ACM Trans. Appl. Percept., 2013

Learning AND-OR Templates for Object Recognition and Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Learning and parsing video events with goal and intent prediction.
Comput. Vis. Image Underst., 2013

Unsupervised Structure Learning of Stochastic And-Or Grammars.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Inferring "Dark Matter" and "Dark Energy" from Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Modeling 4D Human-Object Interactions for Event and Object Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Concurrent Action Detection with Structural Prediction.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Modeling Occlusion by Discriminative AND-OR Structures.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Human Attribute Recognition by Rich Appearance Dictionary.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Cosegmentation and Cosketch by Unsupervised Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Monte Carlo Tree Search for Scheduling Activity Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Scene Parsing by Integrating Function, Geometry and Appearance Models.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Weakly Supervised Learning for Attribute Localization in Outdoor Scenes.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Discriminatively Trained And-Or Tree Models for Object Detection.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Integrating Grammar and Segmentation for Human Pose Estimation.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Using Causal Induction in Humans to Learn and Infer Causality from Video.
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013

Rates for Inductive Learning of Compositional Models.
Proceedings of the Learning Rich Representations from Low-Level Sensors, 2013

Structure vs. Appearance and 3D vs. 2D? A Numeric Answer.
Proceedings of the Shape Perception in Human and Computer Vision, 2013

Erratum to: Artistic Rendering of Portraits.
Proceedings of the Image and Video-Based Artistic Stylisation, 2013

Artistic Rendering of Portraits.
Proceedings of the Image and Video-Based Artistic Stylisation, 2013

2012
Background modeling by subspace learning on spatio-temporal patches.
Pattern Recognit. Lett., 2012

Learning Hybrid Image Templates (HIT) by Information Projection.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Intrackability: Characterizing Video Statistics and Pursuing Video Representations.
Int. J. Comput. Vis., 2012

Learning reconfigurable scene representation by tangram model.
Proceedings of the IEEE Workshop on Applications of Computer Vision, 2012

Reconfigurable templates for robust vehicle detection and classification.
Proceedings of the IEEE Workshop on Applications of Computer Vision, 2012

Cost-Sensitive Top-Down/Bottom-Up Inference for Multiscale Activity Recognition.
Proceedings of the Computer Vision - ECCV 2012, 2012

Hierarchical Space Tiling for Scene Modeling.
Proceedings of the Computer Vision, 2012

2011
C<sup>4</sup>: Exploring Multiple Solutions in Graphical Models by Cluster Sampling.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

A Numerical Study of the Bottom-Up and Top-Down Inference Processes in And-Or Graphs.
Int. J. Comput. Vis., 2011

Customizing painterly rendering styles using stroke processes.
Proceedings of the 9th International Symposium on Non-Photorealistic Animation and Rendering 2009, 2011

Portrait painting using active templates.
Proceedings of the 9th International Symposium on Non-Photorealistic Animation and Rendering 2009, 2011

Image Parsing with Stochastic Scene Grammar.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Inferring social roles in long timespan video sequence.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Unsupervised learning of stochastic AND-OR templates for object modeling.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Human parsing using stochastic and-or grammars and rich appearances.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Unsupervised learning of event AND-OR grammar and semantics from video.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Parsing video events with goal inference and intent prediction.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Image representation by active curves.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Video Primal Sketch: A generic middle-level representation of video.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2010
Learning explicit and implicit visual manifolds by information projection.
Pattern Recognit. Lett., 2010

I2T: Image Parsing to Text Description.
Proceedings of the IEEE, 2010

A Compositional and Dynamic Model for Face Aging.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Layered Graph Matching with Composite Cluster Sampling.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Learning Active Basis Model for Object Detection and Recognition.
Int. J. Comput. Vis., 2010

A Hierarchical and Contextual Model for Aerial Image Parsing.
Int. J. Comput. Vis., 2010

Sisley the abstract painter.
Proceedings of the 8th International Symposium on Non-Photorealistic Animation and Rendering 2010, 2010

Painterly animation using video semantics and feature correspondence.
Proceedings of the 8th International Symposium on Non-Photorealistic Animation and Rendering 2010, 2010

CO3 for ultra-fast and accurate interactive segmentation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Artistic paper-cut of human portraits.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Learning Artistic Lighting Template from Portrait Photographs.
Proceedings of the Computer Vision, 2010

Learning a probabilistic model mixing 3D and 2D primitives for view invariant object recognition.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Discovering scene categories by information projection and cluster sampling.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
And-Or Graph Model for Faces.
Proceedings of the Encyclopedia of Biometrics, 2009

From image parsing to painterly rendering.
ACM Trans. Graph., 2009

Bottom-Up/Top-Down Image Parsing with Attribute Grammar.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Learning deformable action templates from cluttered videos.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Evaluating information contributions of bottom-up and top-down processes.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Learning mixed templates for object recognition.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Trajectory parsing by cluster sampling in spatio-temporal graph.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Layered graph matching by composite cluster sampling with collaborative and competitive interactions.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Flow mosaicking: Real-time pedestrian counting without scene-specific learning.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
A Hierarchical Compositional Model for Face Representation and Sketching.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Perceptual Scale-Space and Its Applications.
Int. J. Comput. Vis., 2008

Design sparse features for age estimation using hierarchical face model.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Learning a scene contextual model for tracking and abnormality detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

A hierarchical and contextual model for aerial image understanding.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

SAVE: A framework for semantic annotation of visual events.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

An integrated background model for video surveillance based on primal sketch and 3D scene geometry.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Statistical Principles in Image Modeling.
Technometrics, 2007

A Two-Level Generative Model for Cloth Representation and Shape from Shading.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Primal sketch: Integrating structure and texture.
Comput. Vis. Image Underst., 2007

Deformable Template As Active Basis.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

An Empirical Study of Object Category Recognition: Sequential Testing with Generalized Samples.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Introduction to a Large-Scale General Purpose Ground Truth Database: Methodology, Annotation Tool and Benchmarks.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

Object Category Recognition Using Generative Template Boosting.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

An Automatic Portrait System Based on And-Or Graph Representation.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

Dynamic Feature Cascade for Multiple Object Tracking with Trackability Analysis.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

Bayesian Inference for Layer Representation with Mixed Markov Random Field.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

Compositional Boosting for Computing Hierarchical Image Structures.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

A Multi-Resolution Dynamic Model for Face Aging Simulation.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Mapping Natural Image Patches by Explicit and Implicit Manifolds.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Layered Graph Match with Graph Editing.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
A Generative Sketch Model for Human Hair Analysis and Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Parsing Images into Regions, Curves, and Curve Groups.
Int. J. Comput. Vis., 2006

A Stochastic Grammar of Images.
Found. Trends Comput. Graph. Vis., 2006

Composite Templates for Cloth Modeling and Sketching.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Generalizing Swendsen-Wang to Sampling Arbitrary Posterior Probabilities.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

What are Textons?
Int. J. Comput. Vis., 2005

Image Parsing: Unifying Segmentation, Detection, and Recognition.
Int. J. Comput. Vis., 2005

Perceptual Scale Space and its Applications.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Bottom-up/Top-Down Image Parsing by Attribute Graph Grammar.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Incorporating Visual Knowledge Representation in Stereo Reconstruction.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

A High Resolution Grammatical Model for Face Representation and Sketching.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Cloth Representation by Shape from Shading with Shading Primitives.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

A Generative Model of Human Hair for Hair Sketching.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Analysis and Synthesis of Textured Motion: Particles and Waves.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

Range Image Segmentation by an Effective Jump-Diffusion Method.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

On the Relationship Between Image and Motion Segmentation.
Proceedings of the Spatial Coherence for Visual Motion Analysis, 2004

Modeling Complex Motion by Tracking and Editing Hidden Markov Graphs.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Automatic Single View Building Reconstruction by Integrating Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

Information Scaling Laws in Natural Scenes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

Multigrid and Multi-Level Swendsen-Wang Cuts for Hierarchic Graph Partition.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

2003
Statistical Modeling and Conceptualization of Visual Patterns.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Statistical Edge Detection: Learning and Evaluating Edge Cues.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Modeling Visual Patterns by Integrating Descriptive and Generative Methods.
Int. J. Comput. Vis., 2003

Modeling Textured Motion : Particle, Wave and Sketch.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Towards a Mathematical Theory of Primal Sketch and Sketchability.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

A Multi-scale Generative Model for Animate Shapes and Parts.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Graph Partition by Swendsen-Wang Cuts.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Bayesian Reconstruction of 3D Shapes and Scenes From A Single Image.
Proceedings of the 2003 IEEE 1st International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis (HLK 2003), 2003

2002
Learning in Gibbsian Fields: How Accurate and How Fast Can It Be?
IEEE Trans. Pattern Anal. Mach. Intell., 2002

Image Segmentation by Data-Driven Markov Chain Monte Carlo.
IEEE Trans. Pattern Anal. Mach. Intell., 2002

What Are Textons?
Proceedings of the Computer Vision, 2002

Statistical Modeling of Texture Sketch.
Proceedings of the Computer Vision, 2002

A Generative Method for Textured Motion: Analysis and Synthesis.
Proceedings of the Computer Vision, 2002

Parsing Images into Region and Curve Processes.
Proceedings of the Computer Vision, 2002

A Stochastic Algorithm for 3D Scene Segmentation and Reconstruction.
Proceedings of the Computer Vision, 2002

2001
Introduction by Guest Editors.
Int. J. Comput. Vis., 2001

Order Parameters for Detecting Target Curves in Images: When Does High Level Knowledge Help?
Int. J. Comput. Vis., 2001

Image Segmentation by Data Driven Markov Chain Monte Carlo.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Learning Inhomogeneous Gibbs Model of Faces by Minimax Entropy.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Visual Learning by Integrating Descriptive and Generative Methods.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Example-Based Facial Sketch Generation with Non-parametric Sampling.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

2000
Exploring Texture Ensembles by Efficient Markov Chain Monte Carlo-Toward a 'Trichromacy' Theory of Texture.
IEEE Trans. Pattern Anal. Mach. Intell., 2000

Guest Editorial: Statistical and Computational Theories of Vision: Modeling, Learning, Sampling and Computing, Part I.
Int. J. Comput. Vis., 2000

Equivalence of Julesz Ensembles and FRAME Models.
Int. J. Comput. Vis., 2000

Integrating Bottom-Up/Top-Down for Object Recognition by Data Driven Markov Chain Monte Carlo.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

Order Parameters for Minimax Entropy Distributions: When Does High Level Knowledge Help?
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

1999
Embedding Gestalt Laws in Markov Random Fields.
IEEE Trans. Pattern Anal. Mach. Intell., 1999

Stochastic Jump-Diffusion Process for Computing Medial Axes in Markov Random Fields.
IEEE Trans. Pattern Anal. Mach. Intell., 1999

From local features to global perception - A perspective of Gestalt psychology from Markov random field theory.
Neurocomputing, 1999

Equivalence of Julesz and Gibbs Texture Ensembles.
Proceedings of the International Conference on Computer Vision, 1999

Fundamental Bounds on Edge Detection: An Information Theoretic Evaluation of Different Edge Cues.
Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

1998
Filters, Random Fields and Maximum Entropy (FRAME): Towards a Unified Theory for Texture Modeling.
Int. J. Comput. Vis., 1998

GRADE: Gibbs Reaction and Diffusion Equation.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Stochastic Computation of Medial Axis in Markov Random Fields.
Proceedings of the 1998 Conference on Computer Vision and Pattern Recognition (CVPR '98), 1998

1997
Prior Learning and Gibbs Reaction-Diffusion.
IEEE Trans. Pattern Anal. Mach. Intell., 1997

Minimax Entropy Principle and Its Application to Texture Modeling.
Neural Computation, 1997

Modeling images and textures by minimax entropy.
Proceedings of the Human Vision and Electronic Imaging II, 1997

Learning Generic Prior Models for Visual Computation.
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

1996
Region Competition: Unifying Snakes, Region Growing, and Bayes/MDL for Multiband Image Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 1996

FORMS: A flexible object recognition and modelling system.
Int. J. Comput. Vis., 1996

FRAME: Filters, Random fields, and Minimax Entropy - Towards a Unified Theory for Texture Modeling.
Proceedings of the 1996 Conference on Computer Vision and Pattern Recognition (CVPR '96), 1996

1995
Region Competition: Unifying Snakes, Region Growing, Energy/Bayes/MDL for Multi-band Image Segmentation.
Proceedings of the Procedings of the Fifth International Conference on Computer Vision (ICCV 95), 1995

1994
A Framework for Shape Representation and Recognition.
Proceedings of the Proceedings 1994 International Conference on Image Processing, 1994


  Loading...