Yonatan Bitton

Orcid: 0000-0002-1185-6838

According to our database1, Yonatan Bitton authored at least 30 papers between 2020 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
NL-Eye: Abductive NLI for Images.
CoRR, 2024

Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models.
CoRR, 2024

Contrastive Sequential-Diffusion Learning: An approach to Multi-Scene Instructional Video Synthesis.
CoRR, 2024

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision.
CoRR, 2024

Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation.
CoRR, 2024

DataComp-LM: In search of the next generation of training sets for language models.
CoRR, 2024

VideoPhy: Evaluating Physical Commonsense for Video Generation.
CoRR, 2024

TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation.
CoRR, 2024

DOCCI: Descriptions of Connected and Contrasting Images.
CoRR, 2024

ParallelPARC: A Scalable Pipeline for Generating Natural-Language Analogies.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

ImageInWords: Unlocking Hyper-Detailed Image Descriptions.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment.
Proceedings of the Computer Vision - ECCV 2024, 2024

VideoCon: Robust Video-Language Alignment via Contrast Captions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use.
CoRR, 2023

OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models.
CoRR, 2023

Read, Look or Listen? What's Needed for Solving a Multimodal Dataset.
CoRR, 2023

Transferring Visual Attributes from Natural Language to Verified Image Generation.
CoRR, 2023

What You See is What You Read? Improving Text-Image Alignment Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


VisIT-Bench: A Dynamic Benchmark for Evaluating Instruction-Following Vision-and-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

IRFL: Image Recognition of Figurative Language.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

q2d: Turning Questions into Dialogs to Teach Models How to Search.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

VASR: Visual Analogies of Situation Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Data Efficient Masked Language Modeling for Vision and Language.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
Cross-lingual Unified Medical Language System entity linking in online health communities.
J. Am. Medical Informatics Assoc., 2020


  Loading...