Brais Martínez

Orcid: 0000-0001-7511-8941

According to our database1, Brais Martínez authored at least 68 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MobileQuant: Mobile-friendly Quantization for On-device Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Efficient Vision-Language pre-training via domain-specific learning for human activities.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs.
Proceedings of the Computer Vision - ECCV 2024, 2024

You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Graph Guided Question Answer Generation for Procedural Question-Answering.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023
SimDETR: Simplifying self-supervised pretraining for DETR.
CoRR, 2023

Effective Self-supervised Pre-training on Low-compute Networks without Distillation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Black Box Few-Shot Adaptation for Vision-Language models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Bayesian Prompt Learning for Image-Language Model Generalization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ReGen: A good Generative zero-shot video classifier should be Rewarded.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Graph2Vid: Flow graph to Video Grounding for Weakly-supervised Multi-Step Localization.
CoRR, 2022

Variational prompt tuning improves generalization of vision-language models.
CoRR, 2022

REST: REtrieve & Self-Train for generative action recognition.
CoRR, 2022

Efficient Attention-free Video Shift Transformers.
CoRR, 2022

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning.
CoRR, 2022

Knowledge Distillation Meets Open-Set Semi-Supervised Learning.
CoRR, 2022

EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

SOS! Self-supervised Learning over Sets of Handled Objects in Egocentric Action Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
SAIC_Cambridge-HuPBA-FBK Submission to the EPIC-Kitchens-100 Action Recognition Challenge 2021.
CoRR, 2021

Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization.
CoRR, 2021

Low-Fidelity Video Encoder Optimization for Temporal Action Localization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Space-time Mixing Attention for Video Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

High-Capacity Expert Binary Networks.
Proceedings of the 9th International Conference on Learning Representations, 2021

Knowledge distillation via softmax regression representation learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Boundary-sensitive Pre-training for Temporal Localization in Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Towards Practical Lipreading with Distilled and Efficient Models.
Proceedings of the IEEE International Conference on Acoustics, 2021

Few-shot Action Recognition with Prototype-centered Attentive Learning.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Knowing What, Where and When to Look: Video Action modelling with Attention.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Egocentric Action Recognition by Video Attention and Temporal Context.
CoRR, 2020

Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention.
CoRR, 2020

Knowledge distillation via adaptive instance normalization.
CoRR, 2020

Training binary neural networks with real-to-binary convolutions.
Proceedings of the 8th International Conference on Learning Representations, 2020

Lipreading Using Temporal Convolutional Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

BATS: Binary ArchitecTure Search.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Automatic Analysis of Facial Actions: A Survey.
IEEE Trans. Affect. Comput., 2019

Action Recognition With Spatial-Temporal Discriminative Filter Banks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
A Functional Regression Approach to Facial Landmark Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

2017
Fusing Deep Learned and Hand-Crafted Features of Appearance, Shape, and Dynamics for Automatic Pain Estimation.
Proceedings of the 12th IEEE International Conference on Automatic Face & Gesture Recognition, 2017

2016
The Automatic Detection of Chronic Pain-Related Expression: Requirements, Challenges and the Multimodal EmoPain Dataset.
IEEE Trans. Affect. Comput., 2016

Cascaded regression with sparsified feature covariance matrix for facial landmark detection.
Pattern Recognit. Lett., 2016

L<sub>2, 1</sub>-based regression and prediction accumulation across views for robust facial landmark detection.
Image Vis. Comput., 2016

Cascaded Continuous Regression for Real-Time Incremental Face Tracking.
Proceedings of the Computer Vision - ECCV 2016, 2016

The Visual Object Tracking VOT2016 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

ChaLearn Looking at People and Faces of the World: Face AnalysisWorkshop and Challenge 2016.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

2015
Empirical analysis of cascade deformable models for multi-view face detection.
Image Vis. Comput., 2015

Facial landmarking for in-the-wild images with local inference based on global appearance.
Image Vis. Comput., 2015

TRIC-track: Tracking by Regression with Incrementally Learned Cascades.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning to Transfer: Transferring Latent Task Structures and Its Application to Person-Specific Facial Action Unit Detection.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Learning to combine local models for facial Action Unit detection.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

2014
A Dynamic Appearance Descriptor Approach to Facial Actions Temporal Modeling.
IEEE Trans. Cybern., 2014

Decision Level Fusion of Domain Specific Regions for Facial Action Recognition.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Parametric temporal alignment for the detection of facial action temporal segments.
Proceedings of the British Machine Vision Conference, 2014

2013
Local Evidence Aggregation for Regression-Based Facial Point Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

The MAHNOB Laughter database.
Image Vis. Comput., 2013

2011
DLIG: Direct Local Indirect Global Alignment for Video Mosaicing.
IEEE Trans. Circuits Syst. Video Technol., 2011

2010
Facial component detection in thermal imagery.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

Facial point detection using boosted regression and graph models.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Compatible Particles for Part-Based Tracking.
Proceedings of the Articulated Motion and Deformable Objects, 6th International Conference, 2010

2009
Multiple Cue Data Fusion using Markov Random Fields for Motion Detection.
Proceedings of the VISAPP 2009 - Proceedings of the Fourth International Conference on Computer Vision Theory and Applications, Lisboa, Portugal, February 5-8, 2009, 2009

Real-Time Motion Detection for a Mobile Observer Using Multiple Kernel Tracking and Belief Propagation.
Proceedings of the Pattern Recognition and Image Analysis, 4th Iberian Conference, 2009

2008
Piecewise affine kernel tracking for non-planar targets.
Pattern Recognit., 2008

2007
Structure Restriction for Tracking Through Multiple Views and Occlusions.
Proceedings of the Pattern Recognition and Image Analysis, Third Iberian Conference, 2007

A Density-Based Data Reduction Algorithm for Robust Estimators.
Proceedings of the Pattern Recognition and Image Analysis, Third Iberian Conference, 2007

Ground Plane Estimation Based on Virtual Camera Rotation.
Proceedings of the Artificial Intelligence Research and Development, 2007

2006
Multiple Kernel Two-Step Tracking.
Proceedings of the International Conference on Image Processing, 2006

Two-Step Tracking by Parts Using Multiple Kernels.
Proceedings of the Artificial Intelligence Research and Development, 2006


  Loading...