Idan Schwartz

According to our database1, Idan Schwartz authored at least 24 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Iterative Object Count Optimization for Text-to-image Diffusion Models.
CoRR, 2024

Improving Visual Commonsense in Language Models via Multiple Image Generation.
CoRR, 2024

Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation.
CoRR, 2023

Discriminative Class Tokens for Text-to-Image Diffusion Models.
CoRR, 2023

Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Discriminative Class Tokens for Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Zero-Shot Video Captioning by Evolving Pseudo-tokens.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Cognitive Models in Deep Learning.
PhD thesis, 2022

Zero-Shot Video Captioning with Evolving Pseudo-Tokens.
CoRR, 2022

Video and Text Matching with Conditioned Embeddings.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Optimizing Relevance Maps of Vision Transformers Improves Robustness.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Ordered Attention for Coherent Visual Storytelling.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Describing Sets of Images with Textual-PCA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Latent Space Explanation by Intervention.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic.
CoRR, 2021

Towards Coherent Visual Storytelling with Ordered Image Attention.
CoRR, 2021

Perceptual Score: What Data Modalities Does Your Model Perceive?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Ensemble of MRR and NDCG models for Visual Dialog.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

2020
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Factor Graph Attention.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

A Simple Baseline for Audio-Visual Scene-Aware Dialog.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2017
High-Order Attention Models for Visual Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017


  Loading...