Leonid Karlinsky

Orcid: 0000-0003-2524-2068

According to our database1, Leonid Karlinsky authored at least 87 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MAEDAY: MAE for few- and zero-shot AnomalY-Detection.
Comput. Vis. Image Underst., 2024

LiveXiv - A Multi-Modal Live Benchmark Based on Arxiv Papers Content.
CoRR, 2024

GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models.
CoRR, 2024

Scaling Granite Code Models to 128K Context.
CoRR, 2024

Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning.
CoRR, 2024

Navigating the Labyrinth: Evaluating and Enhancing LLMs' Ability to Reason About Search Problems.
CoRR, 2024

Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts.
CoRR, 2024

Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation.
CoRR, 2024

Comparison Visual Instruction Tuning.
CoRR, 2024

ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs.
CoRR, 2024

Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning.
CoRR, 2024

Towards Multimodal In-Context Learning for Vision & Language Models.
CoRR, 2024

CAMELoT: Towards Large Language Models with Training-Free Consolidated Associative Memory.
CoRR, 2024

PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Large Scale Generative AI Text Applied to Sports and Music.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Listen, Think, and Understand.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

NumeroLogic: Number Encoding for Enhanced LLMs' Numerical Reasoning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Meta-prompting for Automating Zero-Shot Visual Recognition with LLMs.
Proceedings of the Computer Vision - ECCV 2024, 2024

Adaptive Memory Replay for Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Self-Specialization: Uncovering Latent Expertise within Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Mitigating Confirmation Bias in Semi-supervised Learning via Efficient Bayesian Model Averaging.
Trans. Mach. Learn. Res., 2023

3VL: using Trees to teach Vision & Language models compositional concepts.
CoRR, 2023

GeRA: Label-Efficient Geometrically Regularized Alignment.
CoRR, 2023

Self-Specialization: Uncovering Latent Expertise within Large Language Models.
CoRR, 2023

TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification.
CoRR, 2023

Constructive Assimilation: Boosting Contrastive Learning Performance through View Generation Strategies.
CoRR, 2023

Learning Human Action Recognition Representations Without Real Humans.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning to Grow Pretrained Models for Efficient Transformer Training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Contrastive Audio-Visual Masked Autoencoder.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Going Beyond Nouns With Vision & Language Models Using Synthetic Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2023

Incorporating Structured Representations into Pretrained Vision & Language Models Using Scene Graphs.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ConStruct-VL: Data-Free Continual Structured VL Concepts Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Teaching Structured Vision & Language Concepts to Vision & Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Joint Audio and Speech Understanding.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Baby steps towards few-shot learning with multiple semantics.
Pattern Recognit. Lett., 2022

A Maximal Correlation Framework for Fair Machine Learning.
Entropy, 2022

On the Transferability of Visual Features in Generalized Zero-Shot Learning.
CoRR, 2022

Teaching Structured Vision&Language Concepts to Vision&Language Models.
CoRR, 2022

On the Importance of Calibration in Semi-supervised Learning.
CoRR, 2022

VL-Taboo: An Analysis of Attribute-based Zero-shot Capabilities of Vision-Language Models.
CoRR, 2022

FETA: Towards Specializing Foundation Models for Expert Task Applications.
CoRR, 2022

Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 2022.
CoRR, 2022

How Transferable are Video Representations Based on Synthetic Data?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

FETA: Towards Specializing Foundational Models for Expert Task Applications.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Maximal Correlation Approach to Imposing Fairness in Machine Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Self-Supervised Classification Network.
Proceedings of the Computer Vision - ECCV 2022, 2022

Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unsupervised Domain Generalization by Learning a Bridge Across Domains.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
MetAdapt: Meta-learned task-adaptive architecture for few-shot classification.
Pattern Recognit. Lett., 2021

CHARTER: heatmap-based multi-type chart data extraction.
CoRR, 2021

Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition.
Proceedings of the 9th International Conference on Learning Representations, 2021

A Broad Study on the Transferability of Visual Representations with Contrastive Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Detector-Free Weakly Supervised Grounding by Separation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Fine-Grained Angular Contrastive Learning With Coarse Labels.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

StarNet: towards Weakly Supervised Few-Shot Object Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
StarNet: towards weakly supervised few-shot detection and explainable few-shot classification.
CoRR, 2020

AR-Net: Adaptive Frame Resolution for Efficient Action Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

TAFSSL: Task-Adaptive Feature Sub-Space Learning for Few-Shot Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Broader Study of Cross-Domain Few-Shot Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

OnlineAugment: Online Data Augmentation with Less Domain Knowledge.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
A New Benchmark for Evaluation of Cross-Domain Few-Shot Learning.
CoRR, 2019

A CNN based method for automatic mass detection and classification in mammograms.
Comput. methods Biomech. Biomed. Eng. Imaging Vis., 2019

RepMet: Representative-Based Metric Learning for Classification and Few-Shot Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

LaSO: Label-Set Operations Networks for Multi-Label Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
RepMet: Representative-based metric learning for classification and one-shot object detection.
CoRR, 2018

Delta-encoder: an effective sample synthesis method for few-shot object recognition.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Co-regularized Alignment for Unsupervised Domain Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Deep Learning for Automatic Detection of Abnormal Findings in Breast Mammography.
Proceedings of the Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, 2017

Domain specific convolutional neural nets for detection of architectural distortion in mammograms.
Proceedings of the 14th IEEE International Symposium on Biomedical Imaging, 2017

Fine-Grained Recognition of Thousands of Object Categories with Single-Example Training.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Hybrid Remote Expert - an Emerging Pattern of Industrial Remote Support.
Proceedings of the Forum and Doctoral Consortium Papers Presented at the 29th International Conference on Advanced Information Systems Engineering, 2017

2016
A Region Based Convolutional Network for Tumor Detection and Classification in Breast Mammography.
Proceedings of the Deep Learning and Data Labeling for Medical Applications, 2016

2012
Using Linking Features in Learning Non-parametric Part Models.
Proceedings of the Computer Vision - ECCV 2012, 2012

2010
Using body-anchored priors for identifying actions in single images.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

The chains model for detecting parts by their context.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
Unsupervised feature optimization (UFO): Simultaneous selection of multiple features with their detection parameters.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
Unsupervised Classification and Part Localization by Consistency Amplification.
Proceedings of the Computer Vision, 2008


  Loading...