Song-Chun Zhu

Orcid: 0000-0002-1925-5973

Affiliations:
  • University of California, Department of Statistics, Los Angeles, CA, USA


According to our database1, Song-Chun Zhu authored at least 481 papers between 1994 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes.
CoRR, 2024

Mars: Situated Inductive Reasoning in an Open-World Environment.
CoRR, 2024

Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games.
CoRR, 2024

M3Bench: Benchmarking Whole-body Motion Generation for Mobile Manipulation in 3D Scenes.
CoRR, 2024

PR2: A Physics- and Photo-realistic Testbed for Embodied AI and Humanoid Robots.
CoRR, 2024

FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models.
CoRR, 2024

InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning.
CoRR, 2024

SocialGFs: Learning Social Gradient Fields for Multi-Agent Reinforcement Learning.
CoRR, 2024

Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations.
CoRR, 2024

PhyRecon: Physically Plausible Neural Scene Reconstruction.
CoRR, 2024

LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning.
CoRR, 2024

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents.
CoRR, 2024

MusicAOG: an Energy-Based Model for Learning and Sampling a Hierarchical Representation of Symbolic Music.
CoRR, 2024

On the Emergence of Symmetrical Reality.
Proceedings of the IEEE Conference Virtual Reality and 3D User Interfaces, 2024

MindDial: Enhancing Conversational Agents with Theory-of-Mind for Common Ground Alignment and Negotiation.
Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2024

Fast Peer Adaptation with Context-aware Exploration.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

An Embodied Generalist Agent in 3D World.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Neural-Symbolic Recursive Machine for Systematic Generalization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RulE: Knowledge Graph Reasoning with Rule Embedding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

ProAgent: Building Proactive Cooperative Agents with Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Guest Editorial: Introduction to the Special Section on Graphs in Vision and Pattern Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Aligner: One Global Token is Worth Millions of Parameters When Aligning Large Language Models.
CoRR, 2023

AI Alignment: A Comprehensive Survey.
CoRR, 2023

CORE: Common Random Reconstruction for Distributed Optimization with Provable Low Communication Complexity.
CoRR, 2023

MindAgent: Emergent Gaming Interaction.
CoRR, 2023

ProAgent: Building Proactive Cooperative AI with Large Language Models.
CoRR, 2023

Brain in a Vat: On Missing Pieces Towards Artificial General Intelligence in Large Language Models.
CoRR, 2023

MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation.
CoRR, 2023

Heterogeneous Value Evaluation for Large Language Models.
CoRR, 2023

Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners.
CoRR, 2023

Reconfigurable Data Glove for Reconstructing Physical and Virtual Grasps.
CoRR, 2023

Learning Energy-Based Prior Model with Diffusion-Amortized MCMC.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning non-Markovian Decision-Making from State-only Sequences.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Diplomat: A Dialogue Dataset for Situated PragMATic Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Evaluating and Inducing Personality in Pre-trained Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Part-level Scene Reconstruction Affords Robot Interaction.
IROS, 2023

Learning a Causal Transition Model for Object Cutting.
IROS, 2023

Rearrange Indoor Scenes for Human-Robot Co-Activity.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

On the Complexity of Bayesian Generalization.
Proceedings of the International Conference on Machine Learning, 2023

SQA3D: Situated Question Answering in 3D Scenes.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Sim2Plan: Robot Motion Planning via Message Passing Between Simulation and Reality.
Proceedings of the Future Technologies Conference, 2023

Diffusion-based Generation, Optimization, and Planning in 3D Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Computer Vision - Statistical Models for Marr's Paradigm
Springer, ISBN: 978-3-030-96529-7, 2023

2022
In situ bidirectional human-robot value alignment.
Sci. Robotics, 2022

Understanding Physical Effects for Effective Tool-Use.
IEEE Robotics Autom. Lett., 2022

Synthesizing Diverse and Physically Stable Grasps With Arbitrary Hand Structures Using Differentiable Force Closure Estimator.
IEEE Robotics Autom. Lett., 2022

Show Me What You Can Do: Capability Calibration on Reachable Workspace for Human-Robot Collaboration.
IEEE Robotics Autom. Lett., 2022

Cascaded Parsing of Human-Object Interaction Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Monocular 3D Pose Estimation via Pose Grammar and Data Augmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deformable Generator Networks: Unsupervised Disentanglement of Appearance and Geometry.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Generative VoxelNet: Learning Energy-Based Models for 3D Shape Synthesis and Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Cooperative Training of Fast Thinking Initializer and Slow Thinking Solver for Conditional Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Hierarchical Human Semantic Parsing With Comprehensive Part-Relation Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Scene Reconstruction with Functional Objects for Robot Autonomy.
Int. J. Comput. Vis., 2022

RulE: Neural-Symbolic Knowledge Graph Reasoning with Rule Embedding.
CoRR, 2022

Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering.
CoRR, 2022

VRKitchen2.0-IndoorKit: A Tutorial for Augmented Indoor Scene Building in Omniverse.
CoRR, 2022

EST: Evaluating Scientific Thinking in Artificial Agents.
CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
CoRR, 2022

MPI: Evaluating and Inducing Personality in Pre-trained Language Models.
CoRR, 2022

Latent Diffusion Energy-Based Model for Interpretable Text Modeling.
CoRR, 2022

EBM Life Cycle: MCMC Strategies for Synthesis, Defense, and Density Modeling.
CoRR, 2022

Triangular Character Animation Sampling with Motion, Emotion, and Relation.
CoRR, 2022

PartAfford: Part-level Affordance Discovery from 3D Objects.
CoRR, 2022

Attention cannot be an Explanation.
CoRR, 2022

Discourse Analysis for Evaluating Coherence in Video Paragraph Captions.
CoRR, 2022

Effective Representation to Capture Collaboration Behaviors between Explainer and User.
CoRR, 2022

Towards Socially Intelligent Agents with Mental State Transition and Human Value.
Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022

Emergent Graphical Conventions in a Visual Communication Game.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EgoTaskQA: Understanding Human Tasks in Egocentric Videos.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Probabilistic Models from Generator Latent Spaces with Hat EBM.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Sequential Manipulation Planning on Scene Graph.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Latent Diffusion Energy-Based Model for Interpretable Text Modelling.
Proceedings of the International Conference on Machine Learning, 2022

COAT: Measuring Object Compositionality in Emergent Representations.
Proceedings of the International Conference on Machine Learning, 2022

MCMC Should Mix: Learning Energy-Based Model with Neural Transport Latent Space MCMC.
Proceedings of the Tenth International Conference on Learning Representations, 2022

RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning from the Tangram to Solve Mini Visual Tasks.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

ValueNet: A New Dataset for Human Value Driven Dialogue System.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Learning V1 Simple Cells with Vector Representation of Local Content and Matrix Representation of Local Motion.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Interpretable CNNs for Object Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Extraction of an Explanatory Graph to Interpret a CNN.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Mining Interpretable AOG Representations From Convolutional Networks via Active Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Learning Energy-Based Spatial-Temporal Generative ConvNets for Dynamic Patterns.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

A Generalized Earley Parser for Human Activity Parsing and Prediction.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

GenMotion: Data-driven Motion Generators for Real-time Animation Synthesis.
CoRR, 2021

Emergent Graphical Conventions in a Visual Communication Game.
CoRR, 2021

IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning.
CoRR, 2021

Emergence of Theory of Mind Collaboration in Multiagent Systems.
CoRR, 2021

CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing Human Trust in Image Recognition Models.
CoRR, 2021

Transformer-based Machine Learning for Fast SAT Solvers and Logic Synthesis.
CoRR, 2021

STAR: Sparse Transformer-based Action Recognition.
CoRR, 2021

VersaGNN: a Versatile accelerator for Graph neural networks.
CoRR, 2021

Synthesizing Diverse and Physically Stable Grasps with Arbitrary Hand Structures by Differentiable Force Closure Estimation.
CoRR, 2021

Towards Socially Intelligent Agents with Mental State Transition and Human Utility.
CoRR, 2021

A HINT from Arithmetic: On Systematic Generalization of Perception, Syntax, and Semantics.
CoRR, 2021

HALMA: Humanlike Abstraction Learning Meets Affordance in Rapid Problem Solving.
CoRR, 2021

Iterative Teacher-Aware Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Unsupervised Foreground Extraction via Deep Region Competition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

On Path Integration of Grid Cells: Group Representation and Isotropic Scaling.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Robust Visual Reasoning via Language Guided Neural Module Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Efficient Task Planning for Mobile Manipulation: a Virtual Kinematic Chain Perspective.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Consolidating Kinematic Models to Promote Coordinated Mobile Manipulations.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Reconstructing Interactive 3D Scenes by Panoptic Mapping and CAD Model Alignments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Congestion-aware Multi-agent Trajectory Prediction for Collision Avoidance.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Stochastic Security: Adversarial Defense Using Long-Run Dynamics of Energy-Based Models.
Proceedings of the 9th International Conference on Learning Representations, 2021

Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

VLGrammar: Grounded Grammar Induction of Vision and Language.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

YouRefIt: Embodied Reference Understanding with Language and Gesture.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Mind the Context: The Impact of Contextualization in Neural Module Networks for Grounding Visual Referring Expressions.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Learning Neural Representation of Camera Pose with Matrix Representation of Pose Shift via View Synthesis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generative PointNet: Deep Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning Triadic Belief Dynamics in Nonverbal Communication From Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ACRE: Abstract Causal REasoning Beyond Covariation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

GRICE: A Grammar-based Dataset for Recovering Implicature and Conversational rEasoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

SocAoG: Incremental Graph Parsing for Social Relation Inference in Dialogues.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Learning Cycle-Consistent Cooperative Networks via Alternating MCMC Teaching for Unsupervised Cross-Domain Translation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

SMART: A Situation Model for Algebra Story Problems via Attributed Grammar.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Learning by Fixing: Solving Math Word Problems with Weak Supervision.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
A massively parallel and scalable multi-CPU material point method.
ACM Trans. Graph., 2020

Learning to infer human attention in daily activities.
Pattern Recognit., 2020

Cooperative Training of Descriptor and Generator Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Generalized Inverse Planning: Learning Lifted non-Markovian Utility for Generalizable Task Representation.
CoRR, 2020

Weighted Entropy Modification for Soft Actor-Critic.
CoRR, 2020

Vertical-Horizontal Structured Attention for Generating Music with Chords.
CoRR, 2020

A Representational Model of Grid Cells Based on Matrix Lie Algebras.
CoRR, 2020

Learning Energy-based Model with Flow-based Backbone by Neural Transport MCMC.
CoRR, 2020

Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense.
CoRR, 2020

Generative PointNet: Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification.
CoRR, 2020

Emergence of Pragmatics from Referential Game between Theory of Mind Agents.
CoRR, 2020

Actional-Perceptual Causality: Concepts and Inductive Learning for AI and Robotics.
Proceedings of the 2020 IEEE Symposium Series on Computational Intelligence, 2020

Joint Mind Modeling for Explanation Generation in Complex Human-Robot Collaborative Tasks.
Proceedings of the 29th IEEE International Conference on Robot and Human Interactive Communication, 2020

Learning Latent Space Energy-Based Prior Model.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Human-Robot Interaction in a Shared Augmented Reality Workspace.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Graph-based Hierarchical Knowledge Representation for Robot Task Transfer from Virtual to Physical World.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Congestion-aware Evacuation Routing using Augmented Reality Devices.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Joint Inference of States, Robot Knowledge, and Human (False-)Beliefs.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Structured Attention for Unsupervised Dialogue Structure Induction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Learning Multi-layer Latent Variable Model via Variational Optimization of Short Run MCMC for Approximate Inference.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Competence-Aware Curriculum for Visual Concepts Learning via Question Answering.
Proceedings of the Computer Vision - ECCV 2020, 2020

LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities.
Proceedings of the Computer Vision - ECCV 2020, 2020

Inducing Hierarchical Compositional Model by Sparsifying Generator Network.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Joint Training of Variational Auto-Encoder and Latent Energy-Based Model.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Words Aren't Enough, Their Order Matters: On the Robustness of Grounding Visual Referring Expressions.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Machine Number Sense: A Dataset of Visual Arithmetic Problems for Abstract and Relational Reasoning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Motion-Based Generator Model: Unsupervised Disentanglement of Appearance, Trackable and Intrackable Motions in Dynamic Patterns.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

On the Anatomy of MCMC-Based Maximum Likelihood Learning of Energy-Based Models.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Theory-Based Causal Transfer: Integrating Instance-Level Induction and Abstract-Level Structure Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
A tale of two explanations: Enhancing human trust by explaining robot behavior.
Sci. Robotics, 2019

Learning Deep Generative Models with Short Run Inference Dynamics.
CoRR, 2019

Representation Learning: A Statistical Perspective.
CoRR, 2019

X-ToM: Explaining with Theory-of-Mind for Gaining Justified Human Trust.
CoRR, 2019

Towards Interpretable Image Synthesis by Learning Sparsely Connected AND-OR Networks.
CoRR, 2019

HUGE2: a Highly Untangled Generative-model Engine for Edge-computing.
CoRR, 2019

On Learning Non-Convergent Short-Run MCMC Toward Energy-Based Model.
CoRR, 2019

VRKitchen: an Interactive 3D Virtual Environment for Task-oriented Learning.
CoRR, 2019

Visual Discourse Parsing.
CoRR, 2019

Learning Vector Representation of Content and Matrix Representation of Change: Towards a Representational Model of V1.
CoRR, 2019

Multimodal Conditional Learning with Fast Thinking Policy-like Model and Slow Thinking Planner-like Model.
CoRR, 2019

Inducing Sparse Coding and And-Or Grammar from Generator Network.
CoRR, 2019

Interpretable CNNs.
CoRR, 2019

Explaining AlphaGo: Interpreting Contextual Effects in Neural Networks.
CoRR, 2019

Learning Perceptual Inference by Contrasting.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Non-Convergent Non-Persistent Short-Run MCMC Toward Energy-Based Model.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning Virtual Grasp with Failed Demonstrations via Bayesian Inverse Reinforcement Learning.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

Self-Supervised Incremental Learning for Sound Source Localization in Complex Indoor Environment.
Proceedings of the International Conference on Robotics and Automation, 2019

High-Fidelity Grasping in Virtual Reality using a Glove-based System.
Proceedings of the International Conference on Robotics and Automation, 2019

Learning Grid Cells as Vector Representation of Self-Position Coupled with Matrix Representation of Self-Motion.
Proceedings of the 7th International Conference on Learning Representations, 2019

DenseRaC: Joint 3D Pose and Shape Estimation by Dense Render-and-Compare.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Understanding Human Gaze Communication by Spatio-Temporal Graph Reasoning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Holistic++ Scene Understanding: Single-View 3D Holistic Scene Parsing and Human Pose Estimation With Human-Object Interaction and Physical Commonsense.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sparse Winograd Convolutional Neural Networks on Small-scale Systolic Arrays.
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

Reasoning Visual Dialogs With Structural and Partial Observations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

RAVEN: A Dataset for Relational and Analogical Visual REasoNing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Unsupervised Disentangling of Appearance and Geometry by Deformable Generator Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Divergence Triangle for Joint Training of Generator Model, Energy-Based Model, and Inferential Model.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Natural Language Interaction with Explainable AI Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Explainable AI as Collaborative Task Solving.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Partitioning the Perception of Physical and Social Events Within a Unified Psychological Space.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

Decomposing Human Causal Learning: Bottom-up Associative Learning and Top-down Schema Reasoning.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

VRGym: a virtual testbed for physical and interactive AI.
Proceedings of the ACM Turing Celebration Conference - China, 2019

MetaStyle: Three-Way Trade-off among Speed, Flexibility, and Quality in Neural Style Transfer.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning Dynamic Generator Model by Alternating Back-Propagation through Time.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Recognizing Unseen Attribute-Object Pair with Generative Model.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Mirroring without Overimitation: Learning Functionally Equivalent Manipulation Actions.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Perception of Human Interaction Based on Motion Trajectories: From Aerial Videos to Decontextualized Animations.
Top. Cogn. Sci., 2018

Learning and Inferring "Dark Matter" and Predicting Human Intents and Trajectories in Videos.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Attribute And-Or Grammar for Joint Parsing of Human Pose, Parts and Attributes.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Single-View 3D Scene Reconstruction and Parsing by Attribute Grammar.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Visual interpretability for deep learning: a survey.
Frontiers Inf. Technol. Electron. Eng., 2018

Configurable 3D Scene Synthesis and 2D Image Rendering with Per-pixel Ground Truth Using Stochastic Grammars.
Int. J. Comput. Vis., 2018

Mining deep And-Or object structures via cost-sensitive question-answer-based active annotations.
Comput. Vis. Image Underst., 2018

Divergence Triangle for Joint Training of Generator Model, Energy-based Model, and Inference Model.
CoRR, 2018

Explanatory Graphs for CNNs.
CoRR, 2018

Mining Interpretable AOG Representations from Convolutional Networks via Active Question Answering.
CoRR, 2018

Deeper Interpretability of Deep Networks.
CoRR, 2018

Learning Grid-like Units with Vector Representation of Self-Position and Matrix Representation of Self-Motion.
CoRR, 2018

A Tale of Three Probabilistic Families: Discriminative, Descriptive and Generative Models.
CoRR, 2018

Interactive Agent Modeling by Learning to Probe.
CoRR, 2018

Deformable Generator Network: Unsupervised Disentanglement of Appearance and Geometry.
CoRR, 2018

Unsupervised Learning of Neural Networks to Explain Neural Networks.
CoRR, 2018

Network Transplanting.
CoRR, 2018

Building a Telescope to Look Into High-Dimensional Image Spaces.
CoRR, 2018

Interpreting CNNs via Decision Trees.
CoRR, 2018

Spatially Perturbed Collision Sounds Attenuate Perceived Causality in 3D Launching Events.
Proceedings of the 2018 IEEE Conference on Virtual Reality and 3D User Interfaces, 2018

Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Unsupervised Learning of Hierarchical Models for Hand-Object Interactions.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Intent-Aware Multi-Agent Reinforcement Learning.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Interactive Robot Knowledge Patching Using Augmented Reality.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction.
Proceedings of the 35th International Conference on Machine Learning, 2018

Learning Human-Object Interactions by Graph Parsing Neural Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image.
Proceedings of the Computer Vision - ECCV 2018, 2018

Interpretable Convolutional Neural Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

A Causal And-Or Graph Model for Visibility Fluent Reasoning in Tracking Interacting Objects.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Descriptor Networks for 3D Shape Synthesis and Analysis.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Where and Why Are They Looking? Jointly Inferring Human Attention and Intentions in Complex Tasks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Human-Centric Indoor Scene Synthesis Using Stochastic Grammar.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Generative ConvNets via Multi-Grid Modeling and Sampling.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Inferring Shared Attention in Social Scene Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Human Causal Transfer: Challenges for Deep Reinforcement Learning.
Proceedings of the 40th Annual Meeting of the Cognitive Science Society, 2018

Examining CNN Representations With Respect to Dataset Bias.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Interpreting CNN Knowledge via an Explanatory Graph.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Scene-Centric Joint Parsing of Cross-View Videos.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Tracking Occluded Objects and Recovering Incomplete Trajectories by Reasoning About Containment Relations and Human Actions.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Learning Pose Grammar to Encode Human Body Configuration for 3D Pose Estimation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
The Martian: Examining Human Physical Judgments across Virtual Gravity Fields.
IEEE Trans. Vis. Comput. Graph., 2017

Joint Image-Text News Topic Detection and Tracking by Multimodal Topic And-Or Graph.
IEEE Trans. Multim., 2017

Online Object Tracking, Learning and Parsing with And-Or Graphs.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Modeling 4D Human-Object Interactions for Joint Event Segmentation, Recognition, and Object Localization.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Learning Knowledge-guided Pose Grammar Machine for 3D Human Pose Estimation.
CoRR, 2017

Learning Multi-grid Generative ConvNets by Minimal Contrastive Divergence.
CoRR, 2017

A Causal And-Or Graph Model for Visibility Fluent Reasoning in Human-Object Interactions.
CoRR, 2017

Joint Parsing of Cross-view Scenes with Spatio-temporal Semantic Parse Graphs.
CoRR, 2017

A Cost-Sensitive Visual Question-Answer Framework for Mining a Deep And-OR Object Semantics from Web Images.
CoRR, 2017

Interactively Transferring CNN Patterns for Part Localization.
CoRR, 2017

Configurable, Photorealistic Image Rendering and Ground Truth Synthesis by Sampling Stochastic Grammars Representing Indoor Scenes.
CoRR, 2017

A glove-based system for studying hand-object manipulation via joint pose and force sensing.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Feeling the force: Integrating force and pose for fluent discovery through imitation learning to open medicine bottles.
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017

Single-Image 3D Scene Parsing Using Geometric Commonsense.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Inferring Human Attention by Learning Latent Intentions.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Learning social affordance grammar from videos: Transferring human interactions to human-robot interactions.
Proceedings of the 2017 IEEE International Conference on Robotics and Automation, 2017

Predicting Human Activities Using Stochastic Grammar.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Monocular 3D Human Pose Estimation by Predicting Depth on Joints.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Jointly Recognizing Object Fluents and Tasks in Egocentric Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Mining Object Parts from CNNs via Active Question-Answering.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Generative Hierarchical Learning of Sparse FRAME Models.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

CERN: Confidence-Energy Recurrent Network for Group Activity Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Inferring Hidden Statuses and Actions in Video by Causal Reasoning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Learning Human Utility from Video Demonstrations for Deductive Planning in Robotics.
Proceedings of the 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, 2017

Inferring Human Interaction from Motion Trajectories in Aerial Videos.
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Visuomotor Adaptation and Sensory Recalibration in Reversed Hand Movement Task.
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Consistent Probabilistic Simulation Underlying Human Judgment in Substance Dynamics.
Proceedings of the 39th Annual Meeting of the Cognitive Science Society, 2017

Inferring Context Through Scene Understanding.
Proceedings of the 2017 AAAI Spring Symposia, 2017

Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Cross-View People Tracking by Scene-Centered Spatio-Temporal Parsing.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Alternating Back-Propagation for Generator Network.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Learning Perceptual Causality from Video.
ACM Trans. Intell. Syst. Technol., 2016

A Reconfigurable Tangram Model for Scene Representation and Categorization.
IEEE Trans. Image Process., 2016

Learning And-Or Model to Represent Context and Occlusion for Car Detection and Viewpoint Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Multi-Shot Mining Semantic Part Concepts in CNNs.
CoRR, 2016

Synthesizing Dynamic Textures and Sounds by Spatial-Temporal Generative ConvNet.
CoRR, 2016

Modeling and Inferring Human Intents and Latent Functional Objects for Trajectory Prediction.
CoRR, 2016

Cooperative Training of Descriptor and Generator Networks.
CoRR, 2016

Attribute And-Or Grammar for Joint Parsing of Human Attributes, Part and Pose.
CoRR, 2016

Learning Generative ConvNet with Continuous Latent Factors by Alternating Back-Propagation.
CoRR, 2016

Evaluating physical quantities and learning human utilities from RGBD videos.
Proceedings of the SIGGRAPH ASIA 2016, Macao, December 5-8, 2016, 2016

A virtual reality platform for dynamic human-scene interaction.
Proceedings of the SIGGRAPH ASIA 2016, Macao, December 5-8, 2016, 2016

Grounded Semantic Role Labeling.
Proceedings of the NAACL HLT 2016, 2016

Inferring human intent from video by sampling hierarchical plans.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Learning Social Affordance for Human-Robot Interaction.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

What Is Where: Inferring Containment Relations from Videos.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Robot learning with a spatial, temporal, and causal and-or graph.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

A Theory of Generative ConvNet.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Jointly Learning Grounded Task Structures from Language Instruction and Visual Demonstration.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Inferring Forces and Learning Human Utilities from Videos.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Multi-view People Tracking via Hierarchical Trajectory Composition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recognizing Car Fluents from Video.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Critical Features of Joint Actions that Signal Human Interaction.
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

Probabilistic Simulation Predicts Human Performance on Viscous Fluid-Pouring Problem.
Proceedings of the 38th Annual Meeting of the Cognitive Science Society, 2016

Learning FRAME Models Using CNN Filters.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Task Learning through Visual Demonstration and Situated Dialogue.
Proceedings of the Symbiotic Cognitive Systems, 2016

2015
And-Or Graph Face Model and Its Applications in Artistic Sketching and Aging Simulation.
Proceedings of the Encyclopedia of Biometrics, Second Edition, 2015

Learning Near-Optimal Cost-Sensitive Decision Policy for Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Learning Hierarchical Space Tiling for Scene Modeling, Parsing and Attribute Tagging.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Learning 3D Object Templates by Quantizing Geometry and Appearance Spaces.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Video Primal Sketch: A Unified Middle-Level Representation for Video.
J. Math. Imaging Vis., 2015

Scene Understanding by Reasoning Stability and Safety.
Int. J. Comput. Vis., 2015

Learning Sparse FRAME Models for Natural Image Patterns.
Int. J. Comput. Vis., 2015

Learning And-Or Models to Represent Context and Occlusion for Car Detection and Viewpoint Estimation.
CoRR, 2015

A Restricted Visual Turing Test for Deep Scene and Event Understanding.
CoRR, 2015

Learning FRAME Models Using CNN Filters for Knowledge Visualization.
CoRR, 2015

Joint Image-Text News Topic Detection and Tracking with And-Or Graph Representation.
CoRR, 2015

Mining And-Or Graphs for Graph Matching and Object Discovery.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Attributed Grammars for Joint Estimation of Human Attributes, Part and Pose.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Automated Facial Trait Judgment and Election Outcome Prediction: Social Dimensions of Face.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Understanding tools: Task-oriented object modeling, learning and recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Joint inference of groups, events and human roles in aerial videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Joint action recognition and pose estimation from video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Evaluating Human Cognition of Containing Relations with Physical Simulation.
Proceedings of the 37th Annual Meeting of the Cognitive Science Society, 2015

Represent and Infer Human Theory of Mind for Human-Robot Interaction.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

A Unified Framework for Human-Robot Knowledge Transfer.
Proceedings of the 2015 AAAI Fall Symposia, Arlington, Virginia, USA, November 12-14, 2015, 2015

2014
Animated Pose Templates for Modeling and Detecting Human Actions.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Joint Video and Text Parsing for Understanding Events and Answering Queries.
IEEE Multim., 2014

Mapping Energy Landscapes of Non-Convex Learning Problems.
CoRR, 2014

Detecting potential falling objects by inferring human action and natural disturbance.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Mapping the Energy Landscape of Non-convex Optimization Problems.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2014

Integrating Context and Occlusion for Car Detection by Hierarchical And-Or Model.
Proceedings of the Computer Vision - ECCV 2014, 2014

Learning Inhomogeneous FRAME Models for Object Patterns.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Cross-View Action Modeling, Learning, and Recognition.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Single-View 3D Scene Parsing by Attributed Grammar.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Visual Persuasion: Inferring Communicative Intents of Images.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Unsupervised Learning of Dictionaries of Hierarchical Compositional Models.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Video Stylization: Painterly Rendering and Optimization With Content Extraction.
IEEE Trans. Circuits Syst. Video Technol., 2013

Abstract painting with interactive control of perceptual entropy.
ACM Trans. Appl. Percept., 2013

Learning AND-OR Templates for Object Recognition and Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Learning and parsing video events with goal and intent prediction.
Comput. Vis. Image Underst., 2013

Unsupervised Structure Learning of Stochastic And-Or Grammars.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Inferring "Dark Matter" and "Dark Energy" from Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Modeling 4D Human-Object Interactions for Event and Object Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Concurrent Action Detection with Structural Prediction.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Modeling Occlusion by Discriminative AND-OR Structures.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Human Attribute Recognition by Rich Appearance Dictionary.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Cosegmentation and Cosketch by Unsupervised Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Monte Carlo Tree Search for Scheduling Activity Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Scene Parsing by Integrating Function, Geometry and Appearance Models.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Weakly Supervised Learning for Attribute Localization in Outdoor Scenes.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Discriminatively Trained And-Or Tree Models for Object Detection.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Integrating Grammar and Segmentation for Human Pose Estimation.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Using Causal Induction in Humans to Learn and Infer Causality from Video.
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013

Rates for Inductive Learning of Compositional Models.
Proceedings of the Learning Rich Representations from Low-Level Sensors, 2013

Structure vs. Appearance and 3D vs. 2D? A Numeric Answer.
Proceedings of the Shape Perception in Human and Computer Vision, 2013

Erratum to: Artistic Rendering of Portraits.
Proceedings of the Image and Video-Based Artistic Stylisation, 2013

Artistic Rendering of Portraits.
Proceedings of the Image and Video-Based Artistic Stylisation, 2013

2012
Background modeling by subspace learning on spatio-temporal patches.
Pattern Recognit. Lett., 2012

Learning Hybrid Image Templates (HIT) by Information Projection.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Intrackability: Characterizing Video Statistics and Pursuing Video Representations.
Int. J. Comput. Vis., 2012

Learning reconfigurable scene representation by tangram model.
Proceedings of the IEEE Workshop on Applications of Computer Vision, 2012

Reconfigurable templates for robust vehicle detection and classification.
Proceedings of the IEEE Workshop on Applications of Computer Vision, 2012

Cost-Sensitive Top-Down/Bottom-Up Inference for Multiscale Activity Recognition.
Proceedings of the Computer Vision - ECCV 2012, 2012

Hierarchical Space Tiling for Scene Modeling.
Proceedings of the Computer Vision, 2012

2011
C<sup>4</sup>: Exploring Multiple Solutions in Graphical Models by Cluster Sampling.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

A Numerical Study of the Bottom-Up and Top-Down Inference Processes in And-Or Graphs.
Int. J. Comput. Vis., 2011

Customizing painterly rendering styles using stroke processes.
Proceedings of the 9th International Symposium on Non-Photorealistic Animation and Rendering, 2011

Portrait painting using active templates.
Proceedings of the 9th International Symposium on Non-Photorealistic Animation and Rendering, 2011

Image Parsing with Stochastic Scene Grammar.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Inferring social roles in long timespan video sequence.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Unsupervised learning of stochastic AND-OR templates for object modeling.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Human parsing using stochastic and-or grammars and rich appearances.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Unsupervised learning of event AND-OR grammar and semantics from video.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Parsing video events with goal inference and intent prediction.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Image representation by active curves.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Video Primal Sketch: A generic middle-level representation of video.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2010
Learning explicit and implicit visual manifolds by information projection.
Pattern Recognit. Lett., 2010

I2T: Image Parsing to Text Description.
Proc. IEEE, 2010

A Compositional and Dynamic Model for Face Aging.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Layered Graph Matching with Composite Cluster Sampling.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Learning Active Basis Model for Object Detection and Recognition.
Int. J. Comput. Vis., 2010

A Hierarchical and Contextual Model for Aerial Image Parsing.
Int. J. Comput. Vis., 2010

Sisley the abstract painter.
Proceedings of the 8th International Symposium on Non-Photorealistic Animation and Rendering, 2010

Painterly animation using video semantics and feature correspondence.
Proceedings of the 8th International Symposium on Non-Photorealistic Animation and Rendering, 2010

CO3 for ultra-fast and accurate interactive segmentation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Artistic paper-cut of human portraits.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Learning Artistic Lighting Template from Portrait Photographs.
Proceedings of the Computer Vision, 2010

Learning a probabilistic model mixing 3D and 2D primitives for view invariant object recognition.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Discovering scene categories by information projection and cluster sampling.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
And-Or Graph Model for Faces.
Proceedings of the Encyclopedia of Biometrics, 2009

From image parsing to painterly rendering.
ACM Trans. Graph., 2009

Bottom-Up/Top-Down Image Parsing with Attribute Grammar.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Learning deformable action templates from cluttered videos.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Evaluating information contributions of bottom-up and top-down processes.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Learning mixed templates for object recognition.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Trajectory parsing by cluster sampling in spatio-temporal graph.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Layered graph matching by composite cluster sampling with collaborative and competitive interactions.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Flow mosaicking: Real-time pedestrian counting without scene-specific learning.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
A Hierarchical Compositional Model for Face Representation and Sketching.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Perceptual Scale-Space and Its Applications.
Int. J. Comput. Vis., 2008

Design sparse features for age estimation using hierarchical face model.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Learning a scene contextual model for tracking and abnormality detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

A hierarchical and contextual model for aerial image understanding.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

SAVE: A framework for semantic annotation of visual events.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

An integrated background model for video surveillance based on primal sketch and 3D scene geometry.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Statistical Principles in Image Modeling.
Technometrics, 2007

A Two-Level Generative Model for Cloth Representation and Shape from Shading.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

Primal sketch: Integrating structure and texture.
Comput. Vis. Image Underst., 2007

Deformable Template As Active Basis.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

An Empirical Study of Object Category Recognition: Sequential Testing with Generalized Samples.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Introduction to a Large-Scale General Purpose Ground Truth Database: Methodology, Annotation Tool and Benchmarks.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

Object Category Recognition Using Generative Template Boosting.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

An Automatic Portrait System Based on And-Or Graph Representation.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

Dynamic Feature Cascade for Multiple Object Tracking with Trackability Analysis.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

Bayesian Inference for Layer Representation with Mixed Markov Random Field.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007

Compositional Boosting for Computing Hierarchical Image Structures.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

A Multi-Resolution Dynamic Model for Face Aging Simulation.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Mapping Natural Image Patches by Explicit and Implicit Manifolds.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Layered Graph Match with Graph Editing.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
A Generative Sketch Model for Human Hair Analysis and Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Parsing Images into Regions, Curves, and Curve Groups.
Int. J. Comput. Vis., 2006

A Stochastic Grammar of Images.
Found. Trends Comput. Graph. Vis., 2006

Composite Templates for Cloth Modeling and Sketching.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

2005
Generalizing Swendsen-Wang to Sampling Arbitrary Posterior Probabilities.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

What are Textons?
Int. J. Comput. Vis., 2005

Image Parsing: Unifying Segmentation, Detection, and Recognition.
Int. J. Comput. Vis., 2005

Perceptual Scale Space and its Applications.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Bottom-up/Top-Down Image Parsing by Attribute Graph Grammar.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

Incorporating Visual Knowledge Representation in Stereo Reconstruction.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

A High Resolution Grammatical Model for Face Representation and Sketching.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Cloth Representation by Shape from Shading with Shading Primitives.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

A Generative Model of Human Hair for Hair Sketching.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Analysis and Synthesis of Textured Motion: Particles and Waves.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

Range Image Segmentation by an Effective Jump-Diffusion Method.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

On the Relationship Between Image and Motion Segmentation.
Proceedings of the Spatial Coherence for Visual Motion Analysis, 2004

Modeling Complex Motion by Tracking and Editing Hidden Markov Graphs.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

Automatic Single View Building Reconstruction by Integrating Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

Information Scaling Laws in Natural Scenes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

Multigrid and Multi-Level Swendsen-Wang Cuts for Hierarchic Graph Partition.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

2003
Statistical Modeling and Conceptualization of Visual Patterns.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Statistical Edge Detection: Learning and Evaluating Edge Cues.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Modeling Visual Patterns by Integrating Descriptive and Generative Methods.
Int. J. Comput. Vis., 2003

Modeling Textured Motion : Particle, Wave and Sketch.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Towards a Mathematical Theory of Primal Sketch and Sketchability.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

A Multi-scale Generative Model for Animate Shapes and Parts.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Graph Partition by Swendsen-Wang Cuts.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Bayesian Reconstruction of 3D Shapes and Scenes From A Single Image.
Proceedings of the 2003 IEEE 1st International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis (HLK 2003), 2003

2002
Learning in Gibbsian Fields: How Accurate and How Fast Can It Be?
IEEE Trans. Pattern Anal. Mach. Intell., 2002

Image Segmentation by Data-Driven Markov Chain Monte Carlo.
IEEE Trans. Pattern Anal. Mach. Intell., 2002

What Are Textons?
Proceedings of the Computer Vision, 2002

Statistical Modeling of Texture Sketch.
Proceedings of the Computer Vision, 2002

A Generative Method for Textured Motion: Analysis and Synthesis.
Proceedings of the Computer Vision, 2002

Parsing Images into Region and Curve Processes.
Proceedings of the Computer Vision, 2002

A Stochastic Algorithm for 3D Scene Segmentation and Reconstruction.
Proceedings of the Computer Vision, 2002

2001
Introduction by Guest Editors.
Int. J. Comput. Vis., 2001

Order Parameters for Detecting Target Curves in Images: When Does High Level Knowledge Help?
Int. J. Comput. Vis., 2001

Image Segmentation by Data Driven Markov Chain Monte Carlo.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Learning Inhomogeneous Gibbs Model of Faces by Minimax Entropy.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Visual Learning by Integrating Descriptive and Generative Methods.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Example-Based Facial Sketch Generation with Non-parametric Sampling.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

2000
Exploring Texture Ensembles by Efficient Markov Chain Monte Carlo-Toward a 'Trichromacy' Theory of Texture.
IEEE Trans. Pattern Anal. Mach. Intell., 2000

Guest Editorial: Statistical and Computational Theories of Vision: Modeling, Learning, Sampling and Computing, Part I.
Int. J. Comput. Vis., 2000

Equivalence of Julesz Ensembles and FRAME Models.
Int. J. Comput. Vis., 2000

Integrating Bottom-Up/Top-Down for Object Recognition by Data Driven Markov Chain Monte Carlo.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

Order Parameters for Minimax Entropy Distributions: When Does High Level Knowledge Help?
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

1999
Embedding Gestalt Laws in Markov Random Fields.
IEEE Trans. Pattern Anal. Mach. Intell., 1999

Stochastic Jump-Diffusion Process for Computing Medial Axes in Markov Random Fields.
IEEE Trans. Pattern Anal. Mach. Intell., 1999

From local features to global perception - A perspective of Gestalt psychology from Markov random field theory.
Neurocomputing, 1999

Equivalence of Julesz and Gibbs Texture Ensembles.
Proceedings of the International Conference on Computer Vision, 1999

Fundamental Bounds on Edge Detection: An Information Theoretic Evaluation of Different Edge Cues.
Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

1998
Filters, Random Fields and Maximum Entropy (FRAME): Towards a Unified Theory for Texture Modeling.
Int. J. Comput. Vis., 1998

GRADE: Gibbs Reaction and Diffusion Equation.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Stochastic Computation of Medial Axis in Markov Random Fields.
Proceedings of the 1998 Conference on Computer Vision and Pattern Recognition (CVPR '98), 1998

1997
Prior Learning and Gibbs Reaction-Diffusion.
IEEE Trans. Pattern Anal. Mach. Intell., 1997

Minimax Entropy Principle and Its Application to Texture Modeling.
Neural Comput., 1997

Modeling images and textures by minimax entropy.
Proceedings of the Human Vision and Electronic Imaging II, 1997

Learning Generic Prior Models for Visual Computation.
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

1996
Region Competition: Unifying Snakes, Region Growing, and Bayes/MDL for Multiband Image Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 1996

FORMS: A flexible object recognition and modelling system.
Int. J. Comput. Vis., 1996

FRAME: Filters, Random fields, and Minimax Entropy - Towards a Unified Theory for Texture Modeling.
Proceedings of the 1996 Conference on Computer Vision and Pattern Recognition (CVPR '96), 1996

1995
Region Competition: Unifying Snakes, Region Growing, Energy/Bayes/MDL for Multi-band Image Segmentation.
Proceedings of the Procedings of the Fifth International Conference on Computer Vision (ICCV 95), 1995

1994
A Framework for Shape Representation and Recognition.
Proceedings of the Proceedings 1994 International Conference on Image Processing, 1994


  Loading...