Rogério Feris

Orcid: 0000-0001-6399-0679

Affiliations:
  • IBM T. J. Watson Research Center, Hawthorne, USA


According to our database1, Rogério Feris authored at least 215 papers between 2000 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models.
CoRR, 2024

Scaling Granite Code Models to 128K Context.
CoRR, 2024

Navigating the Labyrinth: Evaluating and Enhancing LLMs' Ability to Reason About Search Problems.
CoRR, 2024

Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts.
CoRR, 2024

Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation.
CoRR, 2024

Comparison Visual Instruction Tuning.
CoRR, 2024

ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs.
CoRR, 2024

Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning.
CoRR, 2024

CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory.
CoRR, 2024

Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

LangNav: Language as a Perceptual Representation for Navigation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Large Scale Generative AI Text Applied to Sports and Music.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Adaptive Memory Replay for Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

What, When, and Where? Self-Supervised Spatio- Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Self-Specialization: Uncovering Latent Expertise within Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Self-Specialization: Uncovering Latent Expertise within Large Language Models.
CoRR, 2023

TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification.
CoRR, 2023

Mind the Backbone: Minimizing Backbone Distortion for Robust Object Detection.
CoRR, 2023

Select, Label, and Mix: Learning Discriminative Invariant Feature Representations for Partial Domain Adaptation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Addressing Feature Suppression in Unsupervised Visual Representations.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Learning Human Action Recognition Representations Without Real Humans.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning to Grow Pretrained Models for Efficient Transformer Training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CDAC: Cross-domain Attention Consistency in Transformer for Domain Adaptive Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Going Beyond Nouns With Vision & Language Models Using Synthetic Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ConStruct-VL: Data-Free Continual Structured VL Concepts Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Teaching Structured Vision & Language Concepts to Vision & Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Synthetic Pre-Training Tasks for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Baby steps towards few-shot learning with multiple semantics.
Pattern Recognit. Lett., 2022

Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

A Maximal Correlation Framework for Fair Machine Learning.
Entropy, 2022

Exploring Consistency in Cross-Domain Transformer for Domain Adaptive Semantic Segmentation.
CoRR, 2022

Teaching Structured Vision&Language Concepts to Vision&Language Models.
CoRR, 2022

FETA: Towards Specializing Foundation Models for Expert Task Applications.
CoRR, 2022

SimVQA: Exploring Simulated Environments for Visual Question Answering.
CoRR, 2022

How Transferable are Video Representations Based on Synthetic Data?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Procedural Image Programs for Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

FETA: Towards Specializing Foundational Models for Expert Task Applications.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Maximal Correlation Approach to Imposing Fairness in Machine Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Everything at Once - Multi-modal Fusion Transformer for Video Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VALHALLA: Visual Hallucination for Machine Translation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Targeted Supervised Contrastive Learning for Long-Tailed Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unsupervised Domain Generalization by Learning a Bridge Across Domains.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Sim VQA: Exploring Simulated Environments for Visual Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
MetAdapt: Meta-learned task-adaptive architecture for few-shot classification.
Pattern Recognit. Lett., 2021

IA-RED<sup>2</sup>: Interpretability-Aware Redundancy Reduction for Vision Transformers.
CoRR, 2021

All at Once Network Quantization via Collaborative Knowledge Transfer.
CoRR, 2021

VA-RED<sup>2</sup>: Video Adaptive Redundancy Reduction.
CoRR, 2021

IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Cascaded Multilingual Audio-Visual Learning from Videos.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

VA-RED2: Video Adaptive Redundancy Reduction.
Proceedings of the 9th International Conference on Learning Representations, 2021

AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition.
Proceedings of the 9th International Conference on Learning Representations, 2021

Dynamic Network Quantization for Efficient Video Inference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Broad Study on the Transferability of Visual Representations with Contrastive Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Detector-Free Weakly Supervised Grounding by Separation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Separating Skills and Concepts for Novel Visual Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Semi-Supervised Action Recognition With Temporal Contrastive Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Pseudo-IoU: Improving Label Assignment in Anchor-Free Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Deep Analysis of CNN-Based Spatio-Temporal Representations for Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Fine-Grained Angular Contrastive Learning With Coarse Labels.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

StarNet: towards Weakly Supervised Few-Shot Object Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Learning from Lexical Perturbations for Consistent Visual Question Answering.
CoRR, 2020

Large Scale Neural Architecture Search with Polyharmonic Splines.
CoRR, 2020

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos.
CoRR, 2020

StarNet: towards weakly supervised few-shot detection and explainable few-shot classification.
CoRR, 2020

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Mitigating Dataset Imbalance via Joint Generation and Classification.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

AR-Net: Adaptive Frame Resolution for Efficient Action Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

TAFSSL: Task-Adaptive Feature Sub-Space Learning for Few-Shot Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Broader Study of Cross-Domain Few-Shot Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

We Have So Much in Common: Modeling Semantic Relational Set Abstractions in Videos.
Proceedings of the Computer Vision - ECCV 2020, 2020

OnlineAugment: Online Data Augmentation with Less Domain Knowledge.
Proceedings of the Computer Vision - ECCV 2020, 2020

Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Alleviating Semantic-level Shift: A Semi-supervised Domain Adaptation Method for Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Relationship Matters: Relation Guided Knowledge Transfer for Incremental Learning of Object Detectors.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Video Instance Segmentation Tracking With a Modified VAE Architecture.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Automatic Curation of Sports Highlights Using Multimodal Excitement Features.
IEEE Trans. Multim., 2019

A New Benchmark for Evaluation of Cross-Domain Few-Shot Learning.
CoRR, 2019

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning.
CoRR, 2019

The Fashion IQ Dataset: Retrieving Images by Combining Side Information and Relative Natural Language Feedback.
CoRR, 2019

Depthwise Convolution is All You Need for Learning Multiple Visual Domains.
CoRR, 2019

Diversity in Faces.
CoRR, 2019

Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition.
Proceedings of the 7th International Conference on Learning Representations, 2019

Video-Text Compliance: Activity Verification Based on Natural Language Instructions.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Identifying Interpretable Action Concepts in Deep Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

RepMet: Representative-Based Metric Learning for Classification and Few-Shot Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

SpotTune: Transfer Learning Through Adaptive Fine-Tuning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Grounding Spoken Words in Unlabeled Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

LaSO: Label-Set Operations Networks for Multi-Label Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Improving Object Detection from Scratch via Gated Feature Reuse.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
RED-Net: A Recurrent Encoder-Decoder Network for Video-Based Face Alignment.
Int. J. Comput. Vis., 2018

Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection.
CoRR, 2018

Decoupled Classification Refinement: Hard False Positive Suppression for Object Detection.
CoRR, 2018

RepMet: Representative-based metric learning for classification and one-shot object detection.
CoRR, 2018

Dialog-based Interactive Image Retrieval.
CoRR, 2018

Object-Centric Spatio-Temporal Activity Detection and Recognition.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

Delta-encoder: an effective sample synthesis method for few-shot object recognition.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Co-regularized Alignment for Unsupervised Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Dialog-based Interactive Image Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Collaborative Human-AI (CHAI): Evidence-Based Interpretable Melanoma Classification in Dermoscopic Images.
Proceedings of the Understanding and Interpreting Machine Learning in Medical Image Computing Applications, 2018

Segmentation of Both Diseased and Healthy Skin From Clinical Photographs in a Primary Care Setting.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018

Revisiting RCNN: On Awakening the Classification Power of Faster RCNN.
Proceedings of the Computer Vision - ECCV 2018, 2018

BlockDrop: Dynamic Inference Paths in Residual Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

The Excitement of Sports: Automatic Highlights Using Audio/Visual Cues.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Learning to Separate Object Sounds by Watching Unlabeled Video.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Jointly Optimize Data Augmentation and Network Training: Adversarial Data Augmentation in Human Pose Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids.
CoRR, 2017

IBM High-Five: Highlights From Intelligent Video Engine.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

S3Pool: Pooling with Stochastic Spatial Sampling.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Automatic Curation of Golf Highlights Using Multimodal Excitement Features.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Fully-Adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Edge-Guided Single Depth Image Super Resolution.
IEEE Trans. Image Process., 2016

Generative Adversarial Networks as Variational Training of Energy Based Models.
CoRR, 2016

A Recurrent Encoder-Decoder Network for Sequential Face Alignment.
Proceedings of the Computer Vision - ECCV 2016, 2016

A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection.
Proceedings of the Computer Vision - ECCV 2016, 2016

Walk and Learn: Facial Attribute Representation Learning from Egocentric Video and Contextual Data.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Joint Super Resolution and Denoising From a Single Depth Image.
IEEE Trans. Multim., 2015

Fine registration of 3D point clouds fusing structural and photometric information using an RGB-D camera.
J. Vis. Commun. Image Represent., 2015

Fast Neural Networks with Circulant Projections.
CoRR, 2015

Automated Axon Segmentation from Highly Noisy Microscopic Videos.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Cross-Domain Image Retrieval with a Dual Attribute-Aware Ranking Network.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

An Exploration of Parameter Redundancy in Deep Networks with Circulant Projections.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Deep domain adaptation for describing people based on fine-grained clothing attributes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Efficient 24/7 object detection in surveillance videos.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

2014
A spatial-color layout feature for representing galaxy images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

IBM-Northwestern@TRECVID 2014: Surveillance Event Detection.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

A mission-oriented citizen science platform for efficient flower classification based on combination of feature descriptors.
Proceedings of the 1st International Workshop on Environnmental Multimedia Retrieval co-located with ACM International Conference on Multimedia Retrieval, 2014

Flower Classification for a Citizen Science Mobile App.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Attribute-based People Search: Lessons Learnt from a Practical Surveillance System.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Appearance-Based Object Detection Under Varying Environmental Conditions.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Temporal Non-maximum Suppression for Pedestrian Detection Using Self-Calibration.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Single depth image super resolution and denoising via coupled dictionary learning with local constraints and shock filtering.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

RiskWheel: Interactive visual analytics for surveillance event detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Fusing well-crafted feature descriptors for efficient fine-grained classification.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Domain adaptive object detection.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

Boosting object detection performance in crowded surveillance videos.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

IBM Research and Columbia University TRECVID-2013 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), Surveillance Event Detection (SED), and Semantic Indexing (SIN) Systems.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Spatio-temporal fisher vector coding for surveillance event detection.
Proceedings of the ACM Multimedia Conference, 2013

Fine registration of 3D point clouds with iterative closest point using an RGB-D camera.
Proceedings of the 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), 2013

Fast Face Detector Training Using Tailored Views.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Shape Analysis Using the Spectral Graph Wavelet Transform.
Proceedings of the 9th IEEE International Conference on eScience, 2013

Designing Category-Level Attributes for Discriminative Visual Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Hierarchical Feature Pooling with Structure Learning: A New Method for Pedestrian Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

Efficient Maximum Appearance Search for Large-Scale Object Detection.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Large-Scale Vehicle Detection, Indexing, and Search in Urban Surveillance Videos.
IEEE Trans. Multim., 2012


Unsupervised model selection for view-invariant object detection in surveillance environments.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Appearance modeling for person re-identification using Weighted Brightness Transfer Functions.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Learning Detectors from Large Datasets for Object Retrieval in Video Surveillance.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

2011
Automatic Video Activity Recognition.
Proceedings of the Multimedia Analysis, Processing and Communications, 2011

Robust Detection of Abandoned and Removed Objects in Complex Surveillance Videos.
IEEE Trans. Syst. Man Cybern. Part C, 2011

Practical computer vision: Example techniques and challenges.
IBM J. Res. Dev., 2011

Analytics-driven asset management.
IBM J. Res. Dev., 2011

Large-scale vehicle detection in challenging urban surveillance environments.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Attribute-based vehicle search in crowded surveillance videos.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Hierarchical ranking of facial attributes.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

Image ranking and retrieval based on multi-attribute queries.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Benchmarking Datasets for Human Activity Recognition.
Proceedings of the Visual Analysis of Humans - Looking at People., 2011

2010
Unsupervised Action Classification Using Space-Time Link Analysis.
EURASIP J. Image Video Process., 2010

Multi-Scale People Detection and Motion Analysis for Video Surveillance.
Proceedings of the Machine Learning for Human Motion Analysis - Theory and Practice., 2010

2009
Attribute-based people search in surveillance environments.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2009), 2009

A projector-camera setup for geometry-invariant frequency demultiplexing.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Shape classification through structured learning of matching measures.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Video Analytics in Urban Environments.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

Composite Event Detection in Multi-Camera and Multi-Sensor Surveillance Networks.
Proceedings of the Multi-Camera Networks, 2009

2008
Multiflash Stereopsis: Depth-Edge-Preserving Stereo with Small Baseline Illumination.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Facial image analysis using local feature adaptation prior to learning.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Characterizing the shadow space of camera-light pairs.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Efficient partial shape matching using Smith-Waterman algorithm.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

An Integrated System for Moving Object Classification in Surveillance Videos.
Proceedings of the Fifth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2008

2007
Capturing People in Surveillance Video.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Video analytics for retail.
Proceedings of the Fourth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2007

Searching surveillance video.
Proceedings of the Fourth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2007

2006
Specular Highlights Detection and Reduction with Multi-Flash Photography.
J. Braz. Comput. Soc., 2006

Local approach for face verification in polar frequency domain.
Image Vis. Comput., 2006

Manifold based analysis of facial expression.
Image Vis. Comput., 2006

Non-photorealistic camera: depth edge detection and stylized rendering using multi-flash imaging.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2006

Dealing with Multi-Scale Depth Changes and Motion in Depth Edge Detection.
Proceedings of the 19th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2006), 2006

The Isometric Self-Organizing Map for 3D Hand Pose Estimation.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

Multi-view Appearance-based 3D Hand Pose Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

2005
Harnessing Real-World Depth Edges with Multiflash Imaging.
IEEE Computer Graphics and Applications, 2005

Face Verification in Polar Frequency Domain: A Biologically Motivated Approach.
Proceedings of the Advances in Visual Computing, First International Symposium, 2005

Discontinuity Preserving Stereo with Small Baseline Multi-Flash Illumination.
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005

2004
Non-photorealistic camera: depth edge detection and stylized rendering using multi-flash imaging.
ACM Trans. Graph., 2004

A wavelet subspace method for real-time face tracking.
Real Time Imaging, 2004

The non-photorealistic camera: automatic stylization with multi-flash imaging.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2004

Specular reflection reduction using a multi-flash camera.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2004

Specular Reflection Reduction with Multi-Flash Imaging.
Proceedings of the XVII Brazilian Symposium on Computer Graphics and Image Processing, 2004

Shape-Enhanced Surgical Visualizations and Medical Illustrations with Multi-flash Imaging.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention -- MICCAI 2004, 2004

Exploiting Depth Discontinuities for Vision-Based Fingerspelling Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

2003
Active Wavelet Networks for Face Alignment.
Proceedings of the British Machine Vision Conference, 2003

Real-time View-based Face Alignment using Active Wavelet Networks.
Proceedings of the 2003 IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2003), 2003

2002
Hierarchical Wavelet Networks for Facial Feature Localization.
Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2002), 2002

2001
Locating and Tracking Facial Landmarks Using Gabor Wavelet Networks.
Proceedings of the Advances in Pattern Recognition, 2001

Wavelet Subspace Method for Real-Time Face Tracking.
Proceedings of the Pattern Recognition, 2001

2000
Tracking Facial Features using Gabor Wavelet Networks.
Proceedings of the 13th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2000), 2000

Improved Face×Non-Face Discrimination using Fourier Descriptors through Feature Selection.
Proceedings of the 13th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2000), 2000

Detection and Tracking of Facial Features in Video Sequences.
Proceedings of the MICAI 2000: Advances in Artificial Intelligence, 2000

Eigenfaces Versus Eigeneyes: First Steps Toward Performance Assessment of Representations for Face Recognition.
Proceedings of the MICAI 2000: Advances in Artificial Intelligence, 2000


  Loading...