Hervé Jégou

Affiliations:
  • Kyutai, Paris, France
  • Facebook AI Research, Paris, France (former)


According to our database1, Hervé Jégou authored at least 149 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DINOv2: Learning Robust Visual Features without Supervision.
Trans. Mach. Learn. Res., 2024

Neutral residues: revisiting adapters for model extension.
CoRR, 2024

Moshi: a speech-text foundation model for real-time dialogue.
CoRR, 2024

Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach.
CoRR, 2024

The Faiss library.
CoRR, 2024

2023
ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Image Compression with Product Quantized Masked Image Modeling.
Trans. Mach. Learn. Res., 2023

DINOv2: Learning Robust Visual Features without Supervision.
CoRR, 2023

Birth of a Transformer: A Memory Viewpoint.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models.
Proceedings of the International Conference on Machine Learning, 2023

Active Image Indexing.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

The Stable Signature: Rooting Watermarks in Latent Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Variable Rate Allocation for Vector-Quantized Autoencoders.
Proceedings of the IEEE International Conference on Acoustics, 2023

Co-training 2L Submodels for Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Co-training 2<sup>L</sup> Submodels for Visual Recognition.
CoRR, 2022

Nearest Neighbor Search with Compact Codes: A Decoder Perspective.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Watermarking Images in Self-Supervised Latent Spaces.
Proceedings of the IEEE International Conference on Acoustics, 2022

DeiT III: Revenge of the ViT.
Proceedings of the Computer Vision, 2022

Three Things Everyone Should Know About Vision Transformers.
Proceedings of the Computer Vision, 2022

2021
Billion-Scale Similarity Search with GPUs.
IEEE Trans. Big Data, 2021

Augmenting Convolutional networks with attention-based aggregation.
CoRR, 2021

Are Large-scale Datasets Necessary for Self-Supervised Pre-training?
CoRR, 2021

ResNet strikes back: An improved training procedure in timm.
CoRR, 2021

XCiT: Cross-Covariance Image Transformers.
CoRR, 2021

ResMLP: Feedforward networks for image classification with data-efficient training.
CoRR, 2021

Training Vision Transformers for Image Retrieval.
CoRR, 2021

XCiT: Cross-Covariance Image Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Training data-efficient image transformers & distillation through attention.
Proceedings of the 38th International Conference on Machine Learning, 2021

Training with Quantization Noise for Extreme Model Compression.
Proceedings of the 9th International Conference on Learning Representations, 2021

Grafit: Learning fine-grained image representations with coarse labels.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Going deeper with Image Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Emerging Properties in Self-Supervised Vision Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Gradient-based Adversarial Attacks against Text Transformers.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Powers of layers for image-to-image translation.
CoRR, 2020

Fixing the train-test resolution discrepancy: FixEfficientNet.
CoRR, 2020

Radioactive data: tracing through training.
Proceedings of the 37th International Conference on Machine Learning, 2020

And the Bit Goes Down: Revisiting the Quantization of Neural Networks.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Understanding and Improving Kernel Local Descriptors.
Int. J. Comput. Vis., 2019

Augmenting Self-attention with Persistent Memory.
CoRR, 2019

Billion-scale semi-supervised learning for image classification.
CoRR, 2019

MultiGrain: a unified image embedding for classes and instances.
CoRR, 2019

Fixing the train-test resolution discrepancy.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Large Memory Layers with Product Keys.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

White-box vs Black-box: Bayes Optimal Strategies for Membership Inference.
Proceedings of the 36th International Conference on Machine Learning, 2019

Equi-normalization of Neural Networks.
Proceedings of the 7th International Conference on Learning Representations, 2019

Spreading vectors for similarity search.
Proceedings of the 7th International Conference on Learning Representations, 2019

2018
Memory Vectors for Similarity Search in High-Dimensional Spaces.
IEEE Trans. Big Data, 2018

Déjà Vu: an empirical evaluation of the memorization properties of ConvNets.
CoRR, 2018

A neural network catalyzer for multi-dimensional similarity search.
CoRR, 2018

Word translation without parallel data.
Proceedings of the 6th International Conference on Learning Representations, 2018

Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Link and Code: Fast Indexing With Graphs and Compact Regression Codes.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Low-Shot Learning With Large-Scale Diffusion.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

LAMV: Learning to Align and Match Videos With Kernelized Temporal Layers.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Efficient similarity search.
Proceedings of the Frontiers of Multimedia Research, 2018

2017
Guest Editorial: Large-Scale Multimedia Data Retrieval, Classification, and Understanding.
IEEE Trans. Multim., 2017

Interferences in Match Kernels.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Tubelets: Unsupervised Action Proposals from Spatiotemporal Super-Voxels.
Int. J. Comput. Vis., 2017

An Evaluation of Large-scale Methods for Image Instance and Class Discovery.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Efficient softmax approximation for GPUs.
Proceedings of the 34th International Conference on Machine Learning, 2017

How should we evaluate supervised hashing?
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Erratum to: Image Search with Selective Match Kernels: Aggregation Across Single and Multiple Images.
Int. J. Comput. Vis., 2016

Image Search with Selective Match Kernels: Aggregation Across Single and Multiple Images.
Int. J. Comput. Vis., 2016

Circulant Temporal Encoding for Video Retrieval and Temporal Alignment.
Int. J. Comput. Vis., 2016

Particular object retrieval with integral max-pooling of CNN activations.
Proceedings of the 4th International Conference on Learning Representations, 2016

FastText.zip: Compressing text classification models.
CoRR, 2016

Approximate Search with Quantized Sparse Representations.
Proceedings of the Computer Vision - ECCV 2016, 2016

Polysemous Codes.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
A Comparison of Dense Region Detectors for Image Search and Fine-Grained Classification.
IEEE Trans. Image Process., 2015

Explicit Embeddings for Nearest Neighbor Search with Mercer Kernels.
J. Math. Imaging Vis., 2015

Improved Motion Description for Action Classification.
Frontiers ICT, 2015

Rotation and translation covariant match kernels for image retrieval.
Comput. Vis. Image Underst., 2015

Multiple Measurements and Joint Dimensionality Reduction for Large Scale Image Search with Short Vectors - Extended Version.
CoRR, 2015

Temporal Matching Kernel with Explicit Feature Maps.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Memory Vectors for Particular Object Retrieval with Multiple Queries.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Multiple Measurements and Joint Dimensionality Reduction for Large Scale Image Search with Short Vectors.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Kernel Local Descriptors with Implicit Rotation Matching.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Early burst detection for memory-efficient image retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Revisiting the Fisher vector for fine-grained classification.
Pattern Recognit. Lett., 2014

Visual query expansion with or without geometry: Refining local descriptors by feature aggregation.
Pattern Recognit., 2014

Image Retrieval with Reciprocal and Shared Nearest Neighbors.
Proceedings of the VISAPP 2014, 2014

A Group Testing Framework for Similarity Search in High-dimensional Spaces.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

The Yael Library.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Instance classification with prototype selection.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Beyond "project and sign" for cosine estimation with binary codes.
Proceedings of the IEEE International Conference on Acoustics, 2014

Orientation Covariant Aggregation of Local Descriptors with Embeddings.
Proceedings of the Computer Vision - ECCV 2014, 2014

Triangulation Embedding and Democratic Aggregation for Image Search.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Action Localization with Tubelets from Motion.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Fast and secure similarity search in high dimensional space.
Proceedings of the 2013 IEEE International Workshop on Information Forensics and Security, 2013

Sim-min-hash: an efficient matching technique for linking large image collections.
Proceedings of the ACM Multimedia Conference, 2013

Revisiting the VLAD image representation.
Proceedings of the ACM Multimedia Conference, 2013

Retrieving geo-location of videos with a divide & conquer hierarchical multimodal approach.
Proceedings of the International Conference on Multimedia Retrieval, 2013

Query-Adaptive Asymmetrical Dissimilarities for Visual Object Retrieval.
Proceedings of the IEEE International Conference on Computer Vision, 2013

To Aggregate or Not to aggregate: Selective Match Kernels for Image Search.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Stable Hyper-pooling and Query Expansion for Event Detection.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Event Retrieval in Large Video Collections with Circulant Temporal Encoding.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Better Exploiting Motion for Better Action Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Oriented pooling for dense and non-dense rotation-invariant features.
Proceedings of the British Machine Vision Conference, 2013

2012
Aggregating Local Image Descriptors into Compact Codes.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Quaero at TRECVID 2012: Semantic Indexing.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012


Hamming embedding similarity-based image classification.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Anti-sparse coding for approximate nearest neighbor search.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

BABAZ: A large scale audio search system for video copy detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Efficient Mining of Repetitions in Large-Scale TV Streams with Product Quantization Hashing.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Negative Evidences and Co-occurences in Image Retrieval: The Benefit of PCA and Whitening.
Proceedings of the Computer Vision - ECCV 2012, 2012

Large-scale and larger-scale image search.
Proceedings of the British Machine Vision Conference, 2012

2011
Product Quantization for Nearest Neighbor Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Quaero at TRECVID 2011: Semantic Indexing and Multimedia Event Detection.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011


INRIA @TRECVID 2011: Copy Detection & Multimedia Event Detection.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Bag-of-colors for improved image search.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Asymmetric hamming embedding: taking the best of our bits for large scale image search.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Searching in one billion vectors: Re-rank with source coding.
Proceedings of the IEEE International Conference on Acoustics, 2011

Reconstructing an image from its local descriptors.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Balancing clusters to reduce response time variability in large scale image search.
Proceedings of the 9th International Workshop on Content-Based Multimedia Indexing, 2011

2010
An Image-Based Approach to Video Copy Detection With Spatio-Temporal Post-Filtering.
IEEE Trans. Multim., 2010

Locality sensitive hashing: A comparison of hash function types and querying mechanisms.
Pattern Recognit. Lett., 2010

Accurate Image Search Using the Contextual Dissimilarity Measure.
IEEE Trans. Pattern Anal. Mach. Intell., 2010

Improving Bag-of-Features for Large Scale Image Search.
Int. J. Comput. Vis., 2010

INRIA LEAR-TEXMEX: Video Copy Detection Task.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010


Searching with expectations.
Proceedings of the IEEE International Conference on Acoustics, 2010

Compact Video Description for Copy Detection with Precise Temporal Alignment.
Proceedings of the Computer Vision, 2010

Aggregating local descriptors into a compact image representation.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Computation of posterior marginals on aggregated state models for soft source decoding.
IEEE Trans. Commun., 2009


Packing bag-of-features.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

On the burstiness of visual elements.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Evaluation of GIST descriptors for web-scale image search.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

2008
Error Recovery Properties and Soft Decoding of Quasi-Arithmetic Codes.
EURASIP J. Adv. Signal Process., 2008

INRIA-LEAR'S Video Copy Detection System.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Query adaptative locality sensitive hashing.
Proceedings of the IEEE International Conference on Acoustics, 2008

Recent Advances in Large Scale Image Search.
Proceedings of the Emerging Trends in Visual Computing, 2008

Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search.
Proceedings of the Computer Vision, 2008

2007
Synchronization Recovery and State Model Reduction for Soft Decoding of Variable Length Codes.
IEEE Trans. Inf. Theory, 2007

Entropy Coding With Variable-Length Rewriting Systems.
IEEE Trans. Commun., 2007

A contextual dissimilarity measure for accurate and efficient image search.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Error-resilient first-order multiplexed source codes: performance bounds, design and decoding algorithms.
IEEE Trans. Signal Process., 2006

Progressive and Error-Resilient Transmission Strategies for VLC Encoded Signals over Noisy Channels.
EURASIP J. Adv. Signal Process., 2006

Error recovery properties of quasi-arithmetic codes and soft decoding with length constraint.
Proceedings of the Proceedings 2006 IEEE International Symposium on Information Theory, 2006

2005
Robust codes and joint source-channel codes for multimedia transmission over mobile channels. (Codes robustes et codes joints source-canal pour transmission multimédia sur canaux mobiles).
PhD thesis, 2005

Robust multiplexed codes for compression of heterogeneous data.
IEEE Trans. Inf. Theory, 2005

Entropy coding with variable length re-writing systems.
Proceedings of the 2005 IEEE International Symposium on Information Theory, 2005

2004
First-order multiplexed source codes for error-resilient entropy coding.
Proceedings of the 2004 IEEE International Symposium on Information Theory, 2004

Suffix-constrained codes for progressive and robust data compression: Self-multiplexed codes.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
Source multiplexed codes for error-prone channels.
Proceedings of IEEE International Conference on Communications, 2003

Error-resilient binary multiplexed source codes.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003


  Loading...