Samuel Albanie

Orcid: 0000-0003-1732-9198

According to our database, Samuel Albanie authored at least 80 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
TeachText: CrossModal text-video retrieval through generalized distillation.
Artif. Intell., 2025

2024
Iterate Averaging in the Quest for Best Test Error.
J. Mach. Learn. Res., 2024

A Practitioner's Guide to Continual Multimodal Pretraining.
CoRR, 2024

GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models.
CoRR, 2024

On scalable oversight with weak LLMs judging strong LLMs.
CoRR, 2024

Inverse Constitutional AI: Compressing Preferences into Principles.
CoRR, 2024

HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits.
CoRR, 2024

A Tale of Two Languages: Large-Vocabulary Continuous Sign Language Recognition from Spoken Language Supervision.
CoRR, 2024

SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation.
CoRR, 2024

Foundational Challenges in Assuring Alignment and Safety of Large Language Models.
CoRR, 2024

No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance.
CoRR, 2024

Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress.
CoRR, 2024

Visual Data-Type Understanding does not emerge from scaling Vision-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Sound Approach: Using Large Language Models to Generate Audio Descriptions for Egocentric Text-Audio Retrieval.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

InstructVideo: Instructing Video Diffusion Models with Human Feedback.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Audio Retrieval With Natural Language Queries: A Benchmark Study.
IEEE Trans. Multim., 2023

arXiVeri: Automatic table verification with GPT.
CoRR, 2023

GPT4GEO: How a Language Model Sees the World's Geography.
CoRR, 2023

SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models.
CoRR, 2023

Can GPT-4 Perform Neural Architecture Search?
CoRR, 2023

Large Language Models are Few-shot Publication Scoopers.
CoRR, 2023

DeepMIM: Deep Supervision for Masked Image Modeling.
CoRR, 2023

RLIPv2: Fast Scaling of Relational Language-Image Pre-training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SuS-X: Training-Free Name-Only Transfer of Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Simple Baselines for Interactive Video Retrieval with Questions and Answers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Moment Detection in Long Tutorial Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

NamedMask: Distilling Segmenters from Complementary Foundation Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Zero-shot Unsupervised Transfer Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Crosslingual Generalization through Multitask Finetuning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Scaling Up Sign Spotting Through Sign Language Dictionaries.
Int. J. Comput. Vis., 2022

A 23 MW data centre is all you need.
CoRR, 2022

RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ReCo: Retrieve and Co-segment for Zero-shot Transfer.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Automatic Dense Annotation of Large-Vocabulary Sign Language Videos.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unsupervised Salient Object Detection with Spectral Cluster Voting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Sign Language Video Retrieval with Free-Form Textual Queries.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Cross Modal Retrieval with Querybank Normalisation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Weakly-supervised Fingerspelling Recognition in British Sign Language Videos.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
BBC-Oxford British Sign Language Dataset.
CoRR, 2021

On the Origin of Species of Self-Supervised Learning.
CoRR, 2021

Quantum Self-Supervised Learning.
CoRR, 2021

Preface.
Proceedings of the NeurIPS 2021 Workshop on Pre-Registration in Machine Learning, 2021

Audio Retrieval with Natural Language Queries.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

All you need are a few pixels: semantic segmentation with PixelPick.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

TeachText: CrossModal Generalized Distillation for Text-Video Retrieval.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Aligning Subtitles in Sign Language Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Sign Language Segmentation with Temporal Convolutional Networks.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

QUERYD: A Video Dataset with High-Quality Text and Audio Narrations.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

SeeHear: Signer Diarisation and a New Dataset.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

Read and Attend: Temporal Localisation in Sign Language Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Sign Segmentation With Changepoint-Modulated Pseudo-Labelling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Mind-the-Gap! Unsupervised Domain Adaptation for Text-Video Retrieval.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Squeeze-and-Excitation Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

QuerYD: A video dataset with high-quality textual and audio narrations.
CoRR, 2020

Explaining the Adaptive Generalisation Gap.
CoRR, 2020

The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020).
CoRR, 2020

State-of-Art-Reviewing: A Radical Proposal to Improve Scientific Publication.
CoRR, 2020

Preface.
Proceedings of the NeurIPS 2020 Workshop on Pre-registration in Machine Learning, 2020

Disentangled Speech Embeddings Using Cross-Modal Self-Supervision.
Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

SLRTP 2020: The Sign Language Recognition, Translation & Production Workshop.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

BSL-1K: Scaling Up Co-articulated Sign Language Recognition Using Mouthing Cues.
Proceedings of the Computer Vision - ECCV 2020, 2020

Seeing wake words: Audio-visual Keyword Spotting.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Watch, Read and Lookup: Learning to Spot Signs from Multiple Supervisors.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Deep Industrial Espionage.
CoRR, 2019

Unsupervised Learning of Landmarks by Descriptor Vector Exchange.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Small Steps and Giant Leaps: Minimal Newton Solvers for Deep Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Use What You Have: Video retrieval using representations from collaborative experts.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Substitute Teacher Networks: Learning with Almost No Supervision.
CoRR, 2018

Gather-Excite: Exploiting Feature Context in Convolutional Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Emotion Recognition in Speech using Cross-Modal Transfer in the Wild.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Semi-convolutional Operators for Instance Segmentation.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learnable PINs: Cross-modal Embeddings for Person Identity.
Proceedings of the Computer Vision - ECCV 2018, 2018

Self-Supervised Learning of Geometrically Stable Features Through Probabilistic Introspection.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Unknowable Manipulators: Social Network Curator Algorithms.
CoRR, 2017

Stopping GAN Violence: Generative Unadversarial Networks.
CoRR, 2017

2016
Learning Grimaces by Watching TV.
Proceedings of the British Machine Vision Conference 2016, 2016

