José M. Álvarez

Orcid: 0000-0002-7535-6322

Affiliations:
  • NVIDIA, Santa Clara, CA, USA
  • Toyota Research Institute (TRI), Mountain View, CA, USA (former)
  • CSIRO, Data61, Australia (former)
  • NICTA, Computer Vision Research Laboratory, Canberra, ACT, Australia (former)
  • New York University, Courant Institute of Mathematical Sciences, New York, NY, USA (former)
  • Autonomous University of Barcelona, Computer Vision Center, Barcelona, Spain (former, PhD 2010)


According to our database1, José M. Álvarez authored at least 138 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Uncertainty Estimation for 3D Object Detection via Evidential Learning.
CoRR, 2024

SSE: Multimodal Semantic Data Selection and Enrichment for Industrial-scale Data Assimilation.
CoRR, 2024

Exploring Camera Encoder Designs for Autonomous Driving Perception.
CoRR, 2024

Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint.
CoRR, 2024

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation.
CoRR, 2024

Step Out and Seek Around: On Warm-Start Training with Incremental Data.
CoRR, 2024

Memorize What Matters: Emergent Scene Decomposition from Multitraverse.
CoRR, 2024

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning.
CoRR, 2024

SF3D: SlowFast Temporal 3D Object Detection.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

FasterViT: Fast Vision Transformers with Hierarchical Attention.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Adaptive Sharpness-Aware Pruning for Robust Sparse Networks.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SegIC: Unleashing the Emergent Correspondence for In-Context Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Improving Distant 3D Object Detection Using 2D Box Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

What is Point Supervision Worth in Video Instance Segmentation?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation.
CoRR, 2023

Hardware-Aware Latency Pruning for Real-Time 3D Object Detection.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

Augmenting Legacy Networks for Flexible Inference.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

Fully Attentional Networks with Self-emerging Token Labeling.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's-Eye View.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FB-BEV: BEV Representation from Forward-Backward View Transformations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Viewpoint Robustness in Bird's Eye View Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FocalFormer3D : Focusing on Hard Instance for 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

VoxFormer: Sparse Voxel Transformer for Camera-Based 3D Semantic Scene Completion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Vision Transformers are Good Mask Auto-Labelers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Capitalizing on RGB-FIR Hybrid Imaging for Road Detection.
IEEE Trans. Intell. Transp. Syst., 2022

Training Data Subset Search With Ensemble Active Learning.
IEEE Trans. Intell. Transp. Syst., 2022

Cost Volume Pyramid Based Depth Inference for Multi-View Stereo.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Knowledge Distillation for 6D Pose Estimation by Keypoint Distribution Alignment.
CoRR, 2022

M<sup>2</sup>BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation.
CoRR, 2022

Structural Pruning via Latency-Saliency Knapsack.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Optimizing Data Collection for Machine Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Object-Level Targeted Selection via Deep Template Matching.
Proceedings of the 2022 IEEE Intelligent Vehicles Symposium, 2022

Understanding The Robustness in Vision Transformers.
Proceedings of the International Conference on Machine Learning, 2022

Soft Masking for Cost-Constrained Channel Pruning.
Proceedings of the Computer Vision - ECCV 2022, 2022

A-ViT: Adaptive Tokens for Efficient Vision Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Non-parametric Depth Distribution Modelling based Depth Inference for Multi-view Stereo.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FreeSOLO: Learning to Segment Objects without Annotations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

When to Prune? A Policy towards Early Structural Pruning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

How Much More Data Do I Need? Estimating Requirements for Downstream Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Not All Labels Are Equal: Rationalizing The Labeling Costs for Training Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Privacy Vulnerability of Split Computing to Data-Free Model Inversion Attacks.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
AdaViT: Adaptive Tokens for Efficient Vision Transformer.
CoRR, 2021

HALP: Hardware-Aware Latency Pruning.
CoRR, 2021

Panoptic SegFormer.
CoRR, 2021

Deep Neural Networks are Surprisingly Reversible: A Baseline for Zero-Shot Inversion.
CoRR, 2021

Towards Reducing Labeling Cost in Deep Object Detection.
CoRR, 2021

Data-free Knowledge Distillation for Object Detection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Distilling Image Classifiers in Object Detectors.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Boosting Supervised Learning Performance with Co-training.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection.
Proceedings of the 38th International Conference on Machine Learning, 2021

Personalized Federated Learning with First Order Model Optimization.
Proceedings of the 9th International Conference on Learning Representations, 2021

Contrastive Syn-to-Real Generalization.
Proceedings of the 9th International Conference on Learning Representations, 2021

Active Learning for Deep Object Detection via Probabilistic Modeling.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

See Through Gradients: Image Batch Recovery via GradInversion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Self-Supervised Learning of Depth Inference for Multi-View Stereo.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Optimal Quantization Using Scaled Codebook.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Context Based Emotion Recognition Using EMOTIC Dataset.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Quadtree Generating Networks: Efficient Hierarchical Scene Parsing with Sparse Convolutions.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

ExpandNets: Linear Over-parameterization to Train Compact Convolutional Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Scalable Active Learning for Object Detection.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Integrating Dense LiDAR-Camera Road Detection Maps by a Multi-Modal CRF Model.
IEEE Trans. Veh. Technol., 2019

An Illumination-Invariant Nonparametric Model for Urban Road Detection.
IEEE Trans. Intell. Veh., 2019

VACL: Variance-Aware Cross-Layer Regularization for Pruning Deep Residual Networks.
CoRR, 2019

Less is More: An Exploration of Data Redundancy with Active Dataset Subsampling.
CoRR, 2019

Bridging the Day and Night Domain Gap for Semantic Segmentation.
Proceedings of the 2019 IEEE Intelligent Vehicles Symposium, 2019

Two-View Fusion based Convolutional Neural Network for Urban Road Detection.
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019

VACL: Variance-Aware Cross-Layer Regularization for Pruning Deep Residual Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018
Curb Detection for Road and Sidewalk Detection.
IEEE Trans. Veh. Technol., 2018

3-D LiDAR + Monocular Camera: An Inverse-Depth-Induced Fusion Framework for Urban Road Detection.
IEEE Trans. Intell. Veh., 2018

Guest Editorial Introduction to the Special Issue on Robust and Efficient Vision Techniques for Intelligent Vehicles.
IEEE Trans. Intell. Transp. Syst., 2018

ERFNet: Efficient Residual Factorized ConvNet for Real-Time Semantic Segmentation.
IEEE Trans. Intell. Transp. Syst., 2018

Incorporating Network Built-in Priors in Weakly-Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Expression-Invariant Age Estimation Using Structured Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Frame selection for OCR from video stream of book flipping.
Multim. Tools Appl., 2018

ExpandNets: Exploiting Linear Redundancy to Train Small Networks.
CoRR, 2018

Large-Scale Visual Active Learning with Deep Probabilistic Ensembles.
CoRR, 2018

Deep Probabilistic Ensembles: Approximate Variational Inference through KL Regularization.
CoRR, 2018

Fusion of LiDAR and Camera by Scanning in LiDAR Imagery and Image-Guided Diffusion for Urban Road Detection.
Proceedings of the 2018 IEEE Intelligent Vehicles Symposium, 2018

Train Here, Deploy There: Robust Segmentation in Unseen Domains.
Proceedings of the 2018 IEEE Intelligent Vehicles Symposium, 2018

Effective Use of Synthetic Data for Urban Scene Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
An illumination-invariant nonparametric model for urban road detection using monocular camera and single-line lidar.
Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics, 2017

Compression-aware Training of Deep Networks.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Learning Camera Pose from Optical Colonoscopy Frames Through Deep Convolutional Neural Network (CNN).
Proceedings of the Computer Assisted and Robotic Endoscopy and Clinical Image-Based Procedures, 2017

Efficient ConvNet for real-time semantic segmentation.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Bringing Background into the Foreground: Making All Classes Equal in Weakly-Supervised Video Semantic Segmentation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Domain-Adaptive Deep Network Compression.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Emotion Recognition in Context.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

EMOTIC: Emotions in Context Dataset.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
Motion Estimation via Robust Decomposition With Constrained Rank.
IEEE Trans. Intell. Veh., 2016

Exploiting Large Image Sets for Road Scene Parsing.
IEEE Trans. Intell. Transp. Syst., 2016

Semantic labeling for prosthetic vision.
Comput. Vis. Image Underst., 2016

Invertible Conditional GANs for image editing.
CoRR, 2016

Learning Image Matching by Simply Watching Video.
CoRR, 2016

DecomposeMe: Simplifying ConvNets for End-to-End Learning.
CoRR, 2016

Efficient transductive semantic segmentation.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Learning the Number of Neurons in Deep Networks.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2D-3D semantic segmentation using cardinality as higher-order loss.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Latent structural SVM with marginal probabilities for weakly labeled structured learning.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Built-in Foreground/Background Prior for Weakly-Supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2016, 2016

Learning Image Matching by Simply Watching Video.
Proceedings of the Computer Vision - ECCV 2016, 2016

Efficient Framework for Action Recognition Using Reduced Fisher Vector Encoding.
Proceedings of International Conference on Computer Vision and Image Processing, 2016

2015
Unsupervised image transformation for outdoor semantic labelling.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

Efficient scene parsing by sampling unary potentials in a fully-connected CRF.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

2014
Combining Priors, Appearance, and Context for Road Detection.
IEEE Trans. Intell. Transp. Syst., 2014

Road Detection by One-Class Color Classification: Dataset and Experiments.
CoRR, 2014

Road Detection via On-line Label Transfer.
CoRR, 2014

Data-driven road detection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Large-scale semantic co-labeling of image sets.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Fast road detection and tracking in aerial videos.
Proceedings of the 2014 IEEE Intelligent Vehicles Symposium Proceedings, 2014

Expression-Invariant Age Estimation.
Proceedings of the British Machine Vision Conference, 2014

2013
Road Geometry Classification by Adaptive Shape Models.
IEEE Trans. Intell. Transp. Syst., 2013

Duplicate open page removal from video stream of book flipping.
Proceedings of the Fourth National Conference on Computer Vision, 2013

Learning appearance models for road detection.
Proceedings of the 2013 IEEE Intelligent Vehicles Symposium (IV), 2013

Evaluating Color Representations for On-Line Road Detection.
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

OCR from Video Stream of Book Flipping.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

2012
Understanding Road Scenes Using Visual Cues and GPS Information.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Semantic Road Segmentation via Multi-scale Ensembles of Learned Features.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Road Scene Segmentation from a Single Image.
Proceedings of the Computer Vision - ECCV 2012, 2012

2011
A New Framework for Stereo Sensor Pose Through Road Segmentation and Registration.
IEEE Trans. Intell. Transp. Syst., 2011

Road Detection Based on Illuminant Invariance.
IEEE Trans. Intell. Transp. Syst., 2011

2010
Learning Photometric Invariance for Object Detection.
Int. J. Comput. Vis., 2010

Geographic information for vision-based road detection.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2010

Vision-based road detection via on-line video registration.
Proceedings of the 13th International IEEE Conference on Intelligent Transportation Systems, 2010

3D Scene priors for road detection.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Vision-based road detection using road models.
Proceedings of the International Conference on Image Processing, 2009

Automatic ground-truthing using video registration for on-board detection algorithms.
Proceedings of the International Conference on Image Processing, 2009

Learning photometric invariance from diversified color model ensembles.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Novel Index for Objective Evaluation of Road Detection Algorithms.
Proceedings of the 11th International IEEE Conference on Intelligent Transportation Systems, 2008

2007
Alignment of videos recorded from moving vehicles.
Proceedings of the 14th International Conference on Image Analysis and Processing (ICIAP 2007), 2007

Synchronization of Video Sequences from Free-Moving Cameras.
Proceedings of the Pattern Recognition and Image Analysis, Third Iberian Conference, 2007

Shadow Resistant Road Segmentation from a Mobile Monocular System.
Proceedings of the Pattern Recognition and Image Analysis, Third Iberian Conference, 2007


  Loading...