Zhengyou Zhang

Orcid: 0000-0002-6606-2525

According to our database1, Zhengyou Zhang authored at least 287 papers between 1988 and 2024.

Collaborative distances:

Awards

ACM Fellow

ACM Fellow 2013, "For contributions to computer vision and multimedia.".

IEEE Fellow

IEEE Fellow 2005, "For contributions to robust computer vision techniques.".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Max: A Wheeled-Legged Quadruped Robot for Multimodal Agile Locomotion.
IEEE Trans Autom. Sci. Eng., October, 2024

Enabling Versatility and Dexterity of the Dual-Arm Manipulators: A General Framework Toward Universal Cooperative Manipulation.
IEEE Trans. Robotics, 2024

Lifelike agility and play in quadrupedal robots using reinforcement learning and generative pre-trained models.
Nat. Mac. Intell., 2024

RiskAwareBench: Towards Evaluating Physical Risk Awareness for High-level Planning of LLM-based Embodied Agents.
CoRR, 2024

Self-Supervised Bayesian Visual Imitation Learning Applied to Robotic Pouring.
Proceedings of the IEEE International Conference on Industrial Technology, 2024

2023
Digital Twin-Driven Mixed Reality Framework for Immersive Teleoperation With Haptic Rendering.
IEEE Robotics Autom. Lett., December, 2023

Optical Nanofiber Skins for Multifunctional Humanoid Tactility.
Adv. Intell. Syst., February, 2023

TencentLLMEval: A Hierarchical Evaluation of Real-World Capabilities for Human-Aligned LLMs.
CoRR, 2023

Ground-Challenge: A Multi-sensor SLAM Dataset Focusing on Corner Cases for Ground Robots.
CoRR, 2023

Locomotion generation for quadruped robots on challenging terrains via quadratic programming.
Auton. Robots, 2023

Ground-Challenge: A Multi-sensor SLAM Dataset Focusing on Corner Cases for Ground Robots.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2023

Design, Sensing and Control of Service Robotic System for Intelligent Navigation and Operation in Internet Data Centers.
Proceedings of the 19th IEEE International Conference on Automation Science and Engineering, 2023

2022
High-Fidelity 3D Digital Human Head Creation from RGB-D Selfies.
ACM Trans. Graph., 2022

Explainable Hierarchical Imitation Learning for Robotic Drink Pouring.
IEEE Trans Autom. Sci. Eng., 2022

DA$^{2}$ Dataset: Toward Dexterity-Aware Dual-Arm Grasping.
IEEE Robotics Autom. Lett., 2022

Asymptotic Trajectory Tracking of Autonomous Bicycles via Backstepping and Optimal Control.
IEEE Control. Syst. Lett., 2022

Large-Scale Integrated Flexible Tactile Sensor Array for Sensitive Smart Robotic Touch.
CoRR, 2022

DA<sup>2</sup> Dataset: Toward Dexterity-Aware Dual-Arm Grasping.
CoRR, 2022

Relative Policy-Transition Optimization for Fast Policy Transfer.
CoRR, 2022

An Adaptive Approach to Whole-Body Balance Control of Wheel-Bipedal Robot Ollie.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

RECCraft System: Towards Reliable and Efficient Collective Robotic Construction.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Real-time Inertial Parameter Identification of Floating-Base Robots Through Iterative Primitive Shape Division.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

2021
Joint Hand-Object 3D Reconstruction From a Single Image With Cross-Branch Feature Fusion.
IEEE Trans. Image Process., 2021

Learning-Based Balance Control of Wheel-Legged Robots.
IEEE Robotics Autom. Lett., 2021

Digital Human in an Integrated Physical-Digital World (IPhD).
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Run Like a Dog: Learning Based Whole-Body Control Framework for Quadruped Gait Style Transfer.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Balance Control of a Novel Wheel-legged Robot: Design and Experiments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

2020
Jointly Learning Visual Poses and Pose Lexicon for Semantic Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2020

TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game.
CoRR, 2020

TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning.
CoRR, 2020

High-Fidelity 3D Digital Human Creation from RGB-D Selfies.
CoRR, 2020

Zeroth-Order Supervised Policy Improvement.
CoRR, 2020

A Flexible Dual-Core Optical Waveguide Sensor for Simultaneous and Continuous Measurement of Contact Force and Position.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Gain Scheduled Controller Design for Balancing an Autonomous Bicycle.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Nonlinear Balance Control of an Unmanned Bicycle: Design and Experiments.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

2019
Report of 2017 NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps.
CoRR, 2019

Arena: a toolkit for Multi-Agent Reinforcement Learning.
CoRR, 2019

Curriculum-guided Hindsight Experience Replay.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Challenges of Multimodal Interaction in the Era of Human-Robot Coexistence.
Proceedings of the International Conference on Multimodal Interaction, 2019

2018
Depth Super-Resolution on RGB-D Video Sequences With Large Displacement 3D Motion.
IEEE Trans. Image Process., 2018

Incremental Classifier Learning with Generative Adversarial Networks.
CoRR, 2018

End-to-End Convolutional Semantic Embeddings.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Measuring Infant's Length with an Image.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Guest Editorial Introduction to the Special Issue on Group and Crowd Behavior Analysis for Intelligent Multicamera Video Surveillance.
IEEE Trans. Circuits Syst. Video Technol., 2017

Semantic action recognition by learning a pose lexicon.
Pattern Recognit., 2017

Adversarial Ranking for Language Generation.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

2016
Dimension Reduction With Extreme Learning Machine.
IEEE Trans. Image Process., 2016

Handling Occlusion and Large Displacement Through Improved RGB-D Scene Flow Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2016

Camera calibration: a personal retrospective.
Mach. Vis. Appl., 2016

Guest Editorial: Human Activity Understanding from 2D and 3D Data.
Int. J. Comput. Vis., 2016

Training deep networks for facial expression recognition with crowd-sourced label distribution.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

2015
Visual Understanding with RGB-D Sensors: An Introduction to the Special Issue.
ACM Trans. Intell. Syst. Technol., 2015

Auxiliary Training Information Assisted Visual Recognition.
IPSJ Trans. Comput. Vis. Appl., 2015

A survey on face detection in the wild: Past, present and future.
Comput. Vis. Image Underst., 2015

Vision-enhanced Immersive Interaction and Remote Collaboration with Large Touch Displays.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

VTouch: Vision-enhanced interaction for large touch displays.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Maximum a posteriori estimation of room impulse responses.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

ImmerseBoard: Immersive Telepresence Experience using a Digital Whiteboard.
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015

Deeply-Supervised Nets.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014
Weak Perspective Projection.
Computer Vision, A Reference Guide, 2014

Reference Plane.
Computer Vision, A Reference Guide, 2014

Projection Transformation.
Computer Vision, A Reference Guide, 2014

Projection.
Computer Vision, A Reference Guide, 2014

Perspective Transformation.
Computer Vision, A Reference Guide, 2014

Perspective Camera.
Computer Vision, A Reference Guide, 2014

Iterative Closest Point (ICP).
Computer Vision, A Reference Guide, 2014

Intrinsics.
Computer Vision, A Reference Guide, 2014

Geometric Calibration.
Computer Vision, A Reference Guide, 2014

Fundamental Matrix.
Computer Vision, A Reference Guide, 2014

Essential Matrix.
Computer Vision, A Reference Guide, 2014

Epipolar Geometry.
Computer Vision, A Reference Guide, 2014

Epipolar Constraint.
Computer Vision, A Reference Guide, 2014

Eight-Point Algorithm.
Computer Vision, A Reference Guide, 2014

Depth Distortion.
Computer Vision, A Reference Guide, 2014

Camera Pose.
Computer Vision, A Reference Guide, 2014

Camera Parameters (Intrinsic, Extrinsic).
Computer Vision, A Reference Guide, 2014

Camera Model.
Computer Vision, A Reference Guide, 2014

Camera Extrinsic Parameters.
Computer Vision, A Reference Guide, 2014

Camera Calibration.
Computer Vision, A Reference Guide, 2014

Calibration of Projective Cameras.
Computer Vision, A Reference Guide, 2014

Calibration.
Computer Vision, A Reference Guide, 2014

Affine Projection.
Computer Vision, A Reference Guide, 2014

Affine Camera.
Computer Vision, A Reference Guide, 2014

Face Modeling.
Computer Vision, A Reference Guide, 2014

Activity Recognition.
Computer Vision, A Reference Guide, 2014

Editorial TAMD Update.
IEEE Trans. Auton. Ment. Dev., 2014

Improving multiview face detection with multi-task deep convolutional neural networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

WWN: Integration with coarse-to-fine, supervised and reinforcement learning.
Proceedings of the 2014 International Joint Conference on Neural Networks, 2014

Deblur a blurred RGB image with a sharp NIR image through local linear mapping.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Facial expression tracking from head-mounted, partially observing cameras.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Eye gaze tracking using an RGBD camera: a comparison with a RGB solution.
Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2014

Improving cross-ratio-based eye tracking techniques by leveraging the binocular fixation constraint.
Proceedings of the Eye Tracking Research and Applications, 2014

Towards accurate and robust cross-ratio based gaze trackers through learning from simulation.
Proceedings of the Eye Tracking Research and Applications, 2014

Can Visual Recognition Benefit from Auxiliary Information in Training?
Proceedings of the Computer Vision - ACCV 2014, 2014

Completed Dense Scene Flow in RGB-D Space.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013
Model-based hand pose estimation via spatial-temporal hand parsing and 3D fingertip localization.
Vis. Comput., 2013

Robust Part-Based Hand Gesture Recognition Using Kinect Sensor.
IEEE Trans. Multim., 2013

Editorial - TAMD Outstanding Paper Award and Open Access Publication Established.
IEEE Trans. Auton. Ment. Dev., 2013

"Pattern Recognition" special issue: Sparse representation for event recognition in video surveillance.
Pattern Recognit., 2013

Viewport: A Distributed, Immersive Teleconferencing System with Infrared Dot Pattern.
IEEE Multim., 2013

Modeling the effects of neuromodulation on internal brain areas: Serotonin and dopamine.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013

Robust part-based face matching with multiple templates.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Measuring the engagement level of TV viewers.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Real-Time High-Resolution Sparse Voxelization with Application to Image-Based Modeling.
Proceedings of the High-Performance Graphics 2013, 2013

Wide-Baseline Hair Capture Using Strand-Based Refinement.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Tensor-Based Human Body Modeling.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Action Detection by Fusing Hierarchically Filtered Motion with Spatiotemporal Interest Point Features.
Proceedings of the Human Behavior Recognition Technologies, 2013

2012
Hierarchical Filtered Motion for Action Recognition in Crowded Videos.
IEEE Trans. Syst. Man Cybern. Part C, 2012

Editorial: Impact Factor and Outstanding Paper Awards.
IEEE Trans. Auton. Ment. Dev., 2012

Societally connected multimedia across cultures.
J. Zhejiang Univ. Sci. C, 2012

Guest Editors' Introduction: Special Section on Connected Multimedia.
J. Multim., 2012

Introduction to the Special Issue on Mobile Vision.
Int. J. Comput. Vis., 2012

Microsoft Kinect Sensor and Its Effect.
IEEE Multim., 2012

CIS Publication Spotlight [Publication Spotlight].
IEEE Comput. Intell. Mag., 2012

Auditory augmented reality: Object sonification for the visually impaired.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012

Multi-view learning from imperfect tagging.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Immersive Telepresence: Transcending Space and Time.
Proceedings of the 2012 International Symposium on Ubiquitous Virtual Reality, 2012

Virtual View Reconstruction Using Temporal Information.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

3D scene reconstruction by multiple structured-light based commodity depth cameras.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Mining noisy tagging from multi-label space.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Introduction to the ICME2010 Special Issue.
IEEE Trans. Multim., 2011

Editorial Board Update.
IEEE Trans. Auton. Ment. Dev., 2011

Editorial: Healthy and Prosperous Development.
IEEE Trans. Auton. Ment. Dev., 2011

Improving Immersive Experiences in Telecommunication with Motion Parallax [Applications Corner].
IEEE Signal Process. Mag., 2011

[CIS Publication Spotlight].
IEEE Comput. Intell. Mag., 2011

CIS Publication Spotlight.
IEEE Comput. Intell. Mag., 2011

An effecive night video enhancement algorithm.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

ViewMark: An interactive videoconferencing system for mobile devices.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

Low-complexity, near-lossless coding of depth maps from kinect-like depth cameras.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

Interpolation of combined head and room impulse response for audio spatialization.
Proceedings of the IEEE 13th International Workshop on Multimedia Signal Processing (MMSP 2011), 2011

Robust hand gesture recognition based on finger-earth mover's distance with a commodity depth camera.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Robust hand gesture recognition with kinect sensor.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Modeling and representing events in multimedia.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Innovating the multimedia experience.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Mining partially annotated images.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Calibration between depth and color sensors for commodity depth cameras.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

A novel see-through screen based on weave fabrics.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Realistic audio in immersive video conferencing.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Towards ideal window layouts for multi-party, gaze-aware desktop videoconferencing.
Proceedings of the Graphics Interface 2011 Conference, 2011

What did i miss?: in-meeting review using multimodal accelerated instant replay (air) conferencing.
Proceedings of the International Conference on Human Factors in Computing Systems, 2011

2010
Boosting-Based Face Detection and Adaptation
Synthesis Lectures on Computer Vision, Morgan & Claypool Publishers, ISBN: 978-3-031-01809-1, 2010

Group Event Detection With a Varying Number of Group Members for Video Surveillance.
IEEE Trans. Circuits Syst. Video Technol., 2010

Introduction of New Associate Editors.
IEEE Trans. Auton. Ment. Dev., 2010

CIS Publication Spotlight [Publication Spotlight].
IEEE Comput. Intell. Mag., 2010

MobileSurface: interaction in the air for mobile computing.
Proceedings of the Adjunct proceedings of the 23nd annual ACM symposium on User interface software and technology, 2010

Enhancing stereophonic teleconferencing with microphone arrays through sound field warping.
Proceedings of the 2010 IEEE International Workshop on Multimedia Signal Processing, 2010

Overview of ACM international workshop on connected multimedia.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

AIR conferencing: accelerated instant replay for in-meeting multimodal review.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Action detection using multiple spatial-temporal interest point features.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

3D Deformable Face Tracking with a Commodity Depth Camera.
Proceedings of the Computer Vision, 2010

Action recognition based on a bag of 3D points.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

Exploring spatialized audio & video for distributed conversations.
Proceedings of the 2010 ACM Conference on Computer Supported Cooperative Work, 2010

Human Action Recognition with Expandable Graphical Models.
Proceedings of the Machine Learning for Human Motion Analysis - Theory and Practice., 2010

2009
Active Lighting for Video Conferencing.
IEEE Trans. Circuits Syst. Video Technol., 2009

Editorial.
IEEE Trans. Auton. Ment. Dev., 2009

Autonomous Mental Development: A New Interdisciplinary Transactions for Natural and Artificial Intelligence.
IEEE Trans. Auton. Ment. Dev., 2009

Report on the IEEE 8th International Conference on Development and Learning (ICDL-2009).
IEEE Trans. Auton. Ment. Dev., 2009

Face Relighting from a Single Image under Arbitrary Unknown Lighting Conditions.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Highly realistic audio spatialization for multiparty conferencing using headphones.
Proceedings of the 2009 IEEE International Workshop on Multimedia Signal Processing, 2009

Speeding up spatio-temporal sliding-window search for efficient event detection in crowded videos.
Proceedings of the 1st ACM international workshop on Events in multimedia, 2009

Group Event Detection for Video Surveillance.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Multimodal collaboration and human-computer interaction.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Boosted multi-task learning for face verification with applications to web image and video search.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Efficient Scale-Space Spatiotemporal Saliency Tracking for Distortion-Free Video Retargeting.
Proceedings of the Computer Vision, 2009

2008
Boosting-Based Multimodal Speaker Detection for Distributed Meeting Videos.
IEEE Trans. Multim., 2008

Maximum Likelihood Sound Source Localization and Beamforming for Directional Microphone Arrays in Distributed Meetings.
IEEE Trans. Multim., 2008

Camera Calibration With Three Noncollinear Points Under Special Motions.
IEEE Trans. Image Process., 2008

Activity Recognition Using a Combination of Category Components and Local Models for Video Surveillance.
IEEE Trans. Circuits Syst. Video Technol., 2008

Expandable Data-Driven Graphical Modeling of Human Actions Based on Salient Postures.
IEEE Trans. Circuits Syst. Video Technol., 2008

Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling.
Speech Commun., 2008

Robust and Accurate Visual Echo Cancelation in a Full-duplex Projector-Camera System.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Software-based distortion compensation for a scanned beam display.
Proceedings of the 5th ACM/IEEE International Workshop on Projector camera systems, 2008

Semantic saliency driven camera control for personal remote collaboration.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Graphical modeling and decoding of human actions.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

Requirements and recommendations for an enhanced meeting viewing experience.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Human activity recognition for video surveillance.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

Why does PHAT work well in lownoise, reverberative environments?
Proceedings of the IEEE International Conference on Acoustics, 2008

Taylor expansion based classifier adaptation: Application to person detection.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Meta-tag propagation by co-training an ensemble classifier for improving image search relevance.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

2007
Head-Size Equalization for Improved Visual Perception in Video Conferencing.
IEEE Trans. Multim., 2007

Real-Time Whiteboard Capture and Processing Using a Video Camera for Remote Collaboration.
IEEE Trans. Multim., 2007

Whiteboard scanning and image enhancement.
Digit. Signal Process., 2007

Frequency domain correspondence for speaker normalization.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Learning-Based Perceptual Image Quality Improvement for Video Conferencing.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Exploring Discriminative Learning for Text-Independent Speaker Recognition.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Maximum Likelihood Sound Source Localization for Multiple Directional Microphones.
Proceedings of the IEEE International Conference on Acoustics, 2007

A Generative-Discriminative Framework using Ensemble Methods for Text-Dependent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2007

Energy-Based Sound Source Localization and Gain Normalization for Ad Hoc Microphone Arrays.
Proceedings of the IEEE International Conference on Acoustics, 2007

Face Re-Lighting from a Single Image under Harsh Lighting Conditions.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Iterative Local-Global Energy Minimization for Automatic Extraction of Objects of Interest.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

A novel framework of text-independent speaker verification based on utterance transform and iterative cohort modeling.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Robust Local Scoring Function for Text-Independent Speaker Verification.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Speech Modelingwith Magnitude-Normalized Complex Spectra and Its Application to Multisensory Speech Enhancement.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Automatic Real-Time Barcode Localization in Complex Scenes.
Proceedings of the International Conference on Image Processing, 2006

Automatic Business Card Scanning with a Camera.
Proceedings of the International Conference on Image Processing, 2006

Robust and Accurate Visual Echo Cancelation in a Full-Duplex Projector-Camera System.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

2005
A graphical model for multi-sensory speech processing in air-and-bone conductive microphones.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Multi-sensory speech processing: incorporating automatically extracted hidden dynamic information.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Automatic Head-size Equalization in Panorama Images for Video Conferencing.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Leakage Model and Teeth Clack Removal for Air- and Bone-Conductive Integrated Microphones.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Real-time whiteboard capture and processing using a video camera for teleconferencing.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Linear Combination Representation for Outlier Detection in Motion Tracking.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Camera Calibration with One-Dimensional Objects.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

Eye Gaze Correction with Stereovision for Video-Teleconferencing.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

Automatic Eyeglasses Removal from Face Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2004

Robust and Rapid Generation of Animated Faces from Video Images: A Model-Based Modeling Approach.
Int. J. Comput. Vis., 2004

Image-Based Surface Detail Transfer.
IEEE Computer Graphics and Applications, 2004

Remote Collaboration on Physical Whiteboards.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Direct filtering for air- and bone-conductive microphones.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Nonlinear information fusion in multi-sensor processing - extracting and exploiting hidden dynamics of speech captured by a bone-conductive microphone.
Proceedings of the IEEE 6th Workshop on Multimedia Signal Processing, 2004

Model-based fusion of bone and air sensors for speech enhancement and robust speech recognition.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004

Visual echo cancellation in a projector-camera-whiteboard system.
Proceedings of the 2004 International Conference on Image Processing, 2004

Multi-sensory microphones for robust speech detection, enhancement and recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Note-taking with a camera: whiteboard scanning and image enhancement.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Office presence detection using multimodal context information.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Incremental motion estimation through modified bundle adjustment.
Proceedings of the 2003 International Conference on Image Processing, 2003

Why take notes? Use the whiteboard capture system.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
New Measurements and Corner-Guidance for Curve Matching with Probabilistic Relaxation.
Int. J. Comput. Vis., 2002

Distributed meetings: a meeting capture and broadcasting system.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Model-Based Head Pose Tracking With Stereovision.
Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002

2001
Rapid modeling of animated faces from video.
Comput. Animat. Virtual Worlds, 2001

Estimating the Fundamental Matrix by Transforming Image Points in Projective Space.
Comput. Vis. Image Underst., 2001

Expressive expression mapping with ratio images.
Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, 2001

Visual panel: virtual mouse, keyboard and 3D controller with an ordinary piece of paper.
Proceedings of the 2001 workshop on Perceptive user interfaces, 2001

A Robust and Fast Face Modeling System.
Proceedings of the Advances in Multimedia Information Processing, 2001

Cloning Your Own Face with a Desktop Camera.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Model-Based Bundle Adjustment with Application to Face Modeling.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Determining Reflectance Parameters and Illumination Distribution from a Sparse Set of Images for View-dependent Image Synthesis.
Proceedings of the Eighth International Conference On Computer Vision (ICCV-01), Vancouver, British Columbia, Canada, July 7-14, 2001, 2001

Multibody Grouping via Orthogonal Subspace Decomposition.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

2000
A Flexible New Technique for Camera Calibration.
IEEE Trans. Pattern Anal. Mach. Intell., 2000

A Progressive Scheme for Stereo Matching.
Proceedings of the 3D Structure from Images, 2000

Panel Session on Computations and Algorithms.
Proceedings of the 3D Structure from Images, 2000

Visual Screen: Transforming an Ordinary Screen into a Touch Screen.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2000), 2000

Curve Matching with Probabilistic Relaxation.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2000), 2000

Rapid modeling of animated faces from video images.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Robust Head Motion Computation by Taking Advantage of Physical Properties.
Proceedings of the Workshop on Human Motion, 2000

Corner Guided Curve Matching and its Application to Scene Reconstruction.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

1999
Feature-Based Facial Expression Recognition: Sensitivity Analysis and Experiments with A Multilayer Perceptron.
Int. J. Pattern Recognit. Artif. Intell., 1999

New lower bounds of fifteen classical Ramsey numbers.
Australas. J Comb., 1999

What Can Be Determined from a Full and a Weak Perspective Image?
Proceedings of the International Conference on Computer Vision, 1999

Flexible Camera Calibration by Viewing a Plane from Unknown Orientations.
Proceedings of the International Conference on Computer Vision, 1999

Efficient Bundle Adjustment with Virtual Key Frames: A Hierarchical Approach to Multi-Frame Structure from Motion.
Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

Computing Rectifying Homographies for Stereo Vision.
Proceedings of the 1999 Conference on Computer Vision and Pattern Recognition (CVPR '99), 1999

1998
On the Optimization Criteria Used in Two-View Motion Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 1998

A Unified Theory of Uncalibrated Stereo for Both Perspective and Affine Cameras.
J. Math. Imaging Vis., 1998

Determining the Epipolar Geometry and its Uncertainty: A Review.
Int. J. Comput. Vis., 1998

Euclidean Structure from Uncalibrated Images Using Fuzzy Domain Knowledge: Application to Facial Images Synthesis.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Modeling Geometric Structure and Illumination Variation of a Scene from Real Images.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Understanding the Relationship Between the Optimization Criteria in Two-View Motion Analysis.
Proceedings of the Sixth International Conference on Computer Vision (ICCV-98), 1998

Comparison Between Geometry-Based and Gabor-Wavelets-Based Facial Expression Recognition Using Multi-Layer Perceptron.
Proceedings of the 3rd International Conference on Face & Gesture Recognition (FG '98), 1998

Head Pose Determination from One Image Using a Generic Model.
Proceedings of the 3rd International Conference on Face & Gesture Recognition (FG '98), 1998

A New Multistage Approach to Motion and Structure Estimation by Gradually Enforcing Geometric Constraints.
Proceedings of the Computer Vision, 1998

Image-Based Geometrically-Correct Photorealistic Scene/Object Modeling (IBPhM): A Review.
Proceedings of the Computer Vision, 1998

1997
A stereovision system for a planetary rover: calibration, correlation, registration, and fusion.
Mach. Vis. Appl., 1997

Parameter estimation techniques: a tutorial with application to conic fitting.
Image Vis. Comput., 1997

A Tighter Lower Bound on the Spetsakis-Aloimonos Trilinear Constraints.
Comput. Vis. Image Underst., 1997

Characterizing the Uncertainty of the Fundamental Matrix.
Comput. Vis. Image Underst., 1997

A General Expression of the Fundamental Matrix for Both Perspective and Affine Cameras.
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

Self-Maintaining Camera Calibration Over Time.
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

1996
Epipolar Geometry in Stereo, Motion and Object Recognition - A Unified Approach
Computational Imaging and Vision 6, Springer, ISBN: 978-94-015-8668-9, 1996

Motion of an uncalibrated stereo rig: self-calibration and metric reconstruction.
IEEE Trans. Robotics Autom., 1996

On the epipolar geometry between two images with lens distortion.
Proceedings of the 13th International Conference on Pattern Recognition, 1996

1995
Motion and Structure of Four Points from One Motion of a Stereo Rig with Unknown Extrinsic Parameters.
IEEE Trans. Pattern Anal. Mach. Intell., 1995

Estimating Motion and Structure from Correspondences of Line Segments between Two Perspective Images.
IEEE Trans. Pattern Anal. Mach. Intell., 1995

A Robust Technique for Matching two Uncalibrated Images Through the Recovery of the Unknown Epipolar Geometry.
Artif. Intell., 1995

An Automatic and Robust Algorithm for Determining Motion and Structure from Two Perspective Images.
Proceedings of the Computer Analysis of Images and Patterns, 6th International Conference, 1995

Multi-Sensor Multi-Target Tracking - Strategies for Events that become Invisible.
Proceedings of the British Machine Vision Conference, 1995

Motion of a Stereo Rig: Strong Weak and Self Calibration.
Proceedings of the Recent Developments in Computer Vision, 1995

1994
Token tracking in a cluttered scene.
Image Vis. Comput., 1994

Iterative point matching for registration of free-form curves and surfaces.
Int. J. Comput. Vis., 1994

A new and efficient iterative approach to image matching.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994

Robust Recovery of the Epipolar Geometry for an Uncalibrated Stereo Rig.
Proceedings of the Computer Vision, 1994

Self-calibration of an Uncalibrated Stereo Rig from One Unknown Motion.
Proceedings of the British Machine Vision Conference, 1994

A Two-Stage Approach to Multi-Sensor Temporal Data Fusion.
Proceedings of the British Machine Vision Conference, 1994

1993
Point Matching for Registration of Free-Form Surfaces.
Proceedings of the Computer Analysis of Images and Patterns, 5th International Conference, 1993

Strategies for Tracking Tokens in a Cluttered Scene.
Proceedings of the British Machine Vision Conference, 1993

1992
Estimation of Displacements from Two 3-D Frames Obtained From Stereo.
IEEE Trans. Pattern Anal. Mach. Intell., 1992

Three-dimensional motion computation and object segmentation in a long sequence of stereo frames.
Int. J. Comput. Vis., 1992

A Complete Navigation System for a Mobile Robot, Using Real-Time Stereovision and the Delaunay Triangulation.
Proceedings of IAPR Workshop on Machine Vision Applications, 1992

Finding Clusters and Planes from 3D Line Segments with Application to 3D Motion Determination.
Proceedings of the Computer Vision, 1992

On Local Matching of Free-form Curves.
Proceedings of the British Machine Vision Conference, 1992

1991
Determining motion from 3D line segment matches: a comparative study.
Image Vis. Comput., 1991

1990
Building a 3D world model with a mobile robot: 3D line segment representation and integration.
Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990

Tracking and grouping 3D line segments.
Proceedings of the Third International Conference on Computer Vision, 1990

Tracking and Motion Estimation in a Sequence of Stereo Frames.
Proceedings of the 9th European Conference on Artificial Intelligence, 1990

1988
Visual Navigation of a Mobile Robot: Recent Steps.
Proceedings of the IEEE International Workshop on Intelligent Robots and Systems '88, Proceedings. IROS 1988, Tokyo, Japan, October 31, 1988

Analysis Of A Sequence Of Stereo Scenes Containing Multiple Moving Objects Using Rigidity Constraints.
Proceedings of the Second International Conference on Computer Vision, 1988


  Loading...