Erkut Erdem

Giambattista Parascandolo

Proceedings of the IGARSS 2024, 2024

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

An End-to-End Generative System for Smart Travel Assistant.

[BibT_eX]

[DOI]

Proceedings of the 16th International Joint Conference on Knowledge Discovery, 2024

2023

CLIP-guided StyleGAN Inversion for Text-driven Real Image Editing.

[BibT_eX]

[DOI]

ACM Trans. Graph., October, 2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

Bartlomiej Bojanowski

Christopher D. Manning

Daniel Moseguí González

Eunice Engefu Manyasi

Evgenii Zheltonozhskii

Fanyue Xia

Fatemeh Siar

Fernando Martínez-Plumed

Giorgio Mariani

Gloria Wang

Gonzalo Jaimovitch-López

Jaime Fernández Fisac

Jascha Sohl-Dickstein

José Hernández-Orallo

Karthik Gopalakrishnan

Lidia Contreras Ochando

Louis-Philippe Morency

María José Ramírez-Quintana

Michael I. Ivanitskiy

Neta Gur-Ari Krakover

Nitish Shirish Keskar

Pablo Antonio Moreno Casares

Pegah Alipoormolabashi

Shyamolima (Shammie) Debnath

Sneha Priscilla Makini

Yadollah Yaghoobzadeh

Trans. Mach. Learn. Res., 2023

Spherical Vision Transformer for 360-degree Video Saliency Prediction.

[BibT_eX]

[DOI]

CoRR, 2023

Inst-Inpaint: Instructing to Remove Objects with Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2023

VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ST360IQ: No-Reference Omnidirectional Image Quality Assessment With Spherical Vision Transformers.

[BibT_eX]

[DOI]

Nafiseh Jabbari Tofighi

Proceedings of the IEEE International Conference on Acoustics, 2023

Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers.

[BibT_eX]

[DOI]

Osman Batur Ince

Tanin Zeraati

Semih Yagcioglu

Yadollah Yaghoobzadeh

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Spherical Vision Transformer for 360° Video Saliency Prediction.

[BibT_eX]

[DOI]

Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022

A Gated Fusion Network for Dynamic Saliency Prediction.

[BibT_eX]

[DOI]

Aysun Kocak

IEEE Trans. Cogn. Dev. Syst., 2022

Leveraging semantic saliency maps for query-specific video summarization.

[BibT_eX]

[DOI]

Kemal Cizmeciler

Multim. Tools Appl., 2022

Neural Natural Language Generation: A Survey on Multilinguality, Multimodality, Controllability and Learning.

[BibT_eX]

[DOI]

Ciprian-Octavian Truica

Branislava Sandrih

Sanda Martincic-Ipsic

Gábor Berend

Albert Gatt

Grazina Korvel

J. Artif. Intell. Res., 2022

Detecting Euphemisms with Literal Descriptions and Visual Imagery.

[BibT_eX]

[DOI]

CoRR, 2022

Stochastic Video Prediction with Structure and Motion.

[BibT_eX]

[DOI]

CoRR, 2022

Multi-Contrast MRI Synthesis with Channel-Exchanging-Network.

[BibT_eX]

[DOI]

Proceedings of the 30th Signal Processing and Communications Applications Conference, 2022

Perception-Distortion Trade-Off in the SR Space Spanned by Flow Models.

[BibT_eX]

[DOI]

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

How scene attributes and sound influence visual exploration of omnidirectional panoramic scenes.

[BibT_eX]

[DOI]

Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Disentangling Content and Motion for Text-Based Neural Video Manipulation.

[BibT_eX]

[DOI]

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions.

[BibT_eX]

[DOI]

Tayfun Ates

Muhammed Samil Atesoglu

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

Burst Photography for Learning to Enhance Extremely Dark Images.

[BibT_eX]

[DOI]

Ahmet Serdar Karadeniz

IEEE Trans. Image Process., 2021

Synthetic18K: Learning better representations for person re-ID and attribute recognition from 1.4 million synthetic images.

[BibT_eX]

[DOI]

Signal Process. Image Commun., 2021

Generating visual story graphs with application to photo album summarization.

[BibT_eX]

[DOI]

Signal Process. Image Commun., 2021

Leveraging auxiliary image descriptions for dense video captioning.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2021

MSVD-Turkish: a comprehensive multimodal video dataset for integrated vision and language research in Turkish.

[BibT_eX]

[DOI]

Mach. Transl., 2021

mustGAN: multi-stream Generative Adversarial Networks for MR Image Synthesis.

[BibT_eX]

[DOI]

Medical Image Anal., 2021

Using synthetic data for person tracking under adverse weather conditions.

[BibT_eX]

[DOI]

Image Vis. Comput., 2021

From Noon to Sunset: Interactive Rendering, Relighting, and Recolouring of Landscape Photographs by Modifying Solar Position.

[BibT_eX]

[DOI]

Murat Türe

Mustafa Ege Çiklabakkal

Comput. Graph. Forum, 2021

NOVA: Rendering Virtual Worlds with Humans for Computer Vision Tasks.

[BibT_eX]

[DOI]

Comput. Graph. Forum, 2021

Leveraging Frequency Based Salient Spatial Sound Localization to Improve 360° Video Saliency Prediction.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Machine Vision and Applications, 2021

SLAMP: Stochastic Latent Appearance and Motion Prediction.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Cross-lingual Visual Pre-training for Multimodal Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020

Manipulating Attributes of Natural Scenes via Hallucination.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2020

Hedging static saliency models to predict dynamic saliency.

[BibT_eX]

[DOI]

Yasin Kavak

Signal Process. Image Commun., 2020

MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish.

[BibT_eX]

[DOI]

CoRR, 2020

Belief Regulated Dual Propagation Nets for Learning Action Effects on Groups of Articulated Objects.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

2019

mustGAN: Multi-Stream Generative Adversarial Networks for MR Image Synthesis.

[BibT_eX]

[DOI]

CoRR, 2019

Belief Regulated Dual Propagation Nets for Learning Action Effects on Articulated Multi-Part Objects.

[BibT_eX]

[DOI]

CoRR, 2019

MSVD-Turkish: A Large-Scale Dataset for Video Captioning in Turkish.

[BibT_eX]

[DOI]

Proceedings of the 27th Signal Processing and Communications Applications Conference, 2019

A Comparative Analysis of Practices in Training Deep Models for Fashion Attribute Detection.

[BibT_eX]

[DOI]

Mustafa Sercan Amac

Proceedings of the 27th Signal Processing and Communications Applications Conference, 2019

Diverse Neural Photo Album Summarization.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Image Processing Theory, 2019

Procedural Reasoning Networks for Understanding Multimodal Procedures.

[BibT_eX]

[DOI]

Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

2018

Spatio-Temporal Saliency Networks for Dynamic Saliency Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Language Guided Fashion Image Manipulation with Feature-wise Transformations.

[BibT_eX]

[DOI]

Mehmet Günel

CoRR, 2018

Image Synthesis in Multi-Contrast MRI with Conditional Generative Adversarial Networks.

[BibT_eX]

[DOI]

CoRR, 2018

Image captioning in Turkish with subword units.

[BibT_eX]

[DOI]

Menekse Kuyu

Proceedings of the 26th Signal Processing and Communications Applications Conference, 2018

Generating person images based on attributes.

[BibT_eX]

[DOI]

Mehmet Gunel

Proceedings of the 26th Signal Processing and Communications Applications Conference, 2018

RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Joint Exploitation of Features and Optical Flow for Real-Time Moving Object Detection on Drones.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

2017

Alpha Matting With KL-Divergence-Based Sparse Sampling.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

A comparative study for feature integration strategies in dynamic saliency estimation.

[BibT_eX]

[DOI]

Yasin Kavak

Signal Process. Image Commun., 2017

Data-driven image captioning via salient region discovery.

[BibT_eX]

[DOI]

IET Comput. Vis., 2017

Adjusting transient attributes of outdoor images using generative adversarial networks.

[BibT_eX]

[DOI]

Proceedings of the 25th Signal Processing and Communications Applications Conference, 2017

Turkish cuisine: A benchmark dataset with Turkish meals for food recognition.

[BibT_eX]

[DOI]

Proceedings of the 25th Signal Processing and Communications Applications Conference, 2017

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended Abstract).

[BibT_eX]

[DOI]

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Feature-Based Efficient Moving Object Detection for Low-Altitude Aerial Platforms.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Re-evaluating Automatic Metrics for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

2016

Deformable part-based tracking by coupled global and local correlation filters.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2016

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures.

[BibT_eX]

[DOI]

J. Artif. Intell. Res., 2016

Learning to Generate Images of Outdoor Scenes from Attributes and Semantic Layouts.

[BibT_eX]

[DOI]

CoRR, 2016

Two-Stream Convolutional Networks for Dynamic Saliency Prediction.

[BibT_eX]

[DOI]

Çagdas Bak

CoRR, 2016

An Objective Deghosting Quality Metric for HDR Images.

[BibT_eX]

[DOI]

Comput. Graph. Forum, 2016

TasvirEt: A benchmark dataset for automatic Turkish description generation from images.

[BibT_eX]

[DOI]

Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

Summarizing personal image collections with intrinsic properties.

[BibT_eX]

[DOI]

Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

Clustering motion trajectories via dominant sets.

[BibT_eX]

[DOI]

Çagdas Bak

Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

Dominant sets based analysis of human crowds.

[BibT_eX]

[DOI]

Burcak Asal

Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

HUCVL at MediaEval 2016: Predicting Interesting Key Frames with Deep Models.

[BibT_eX]

[DOI]

Goksu Erdogan

Gorthi R. K. Sai Subrahmanyam

Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

The Visual Object Tracking VOT2016 Challenge Results.

[BibT_eX]

[DOI]

Alireza Memarmoghadam

Guilherme Sousa Bastos

Kannappan Palaniappan

Mario Edoardo Maresca

Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

The Thermal Infrared Visual Object Tracking VOT-TIR2016 Challenge Results.

[BibT_eX]

[DOI]

Abdelrahman Eldesokey

Kannappan Palaniappan

Mario Edoardo Maresca

Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Leveraging Captions in the Wild to Improve Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 5th Workshop on Vision and Language, 2016

2015

Predicting memorability of images using attention-driven spatial pooling and image semantics.

[BibT_eX]

[DOI]

Bora Celikkale

Image Vis. Comput., 2015

The State of the Art in HDR Deghosting: A Survey and Evaluation.

[BibT_eX]

[DOI]

Comput. Graph. Forum, 2015

City Scale Image Geolocalization via Dense Scene Alignment.

[BibT_eX]

[DOI]

Semih Yagcioglu

Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Image Matting with KL-Divergence Based Sparse Sampling.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

A Distributed Representation Based Query Expansion Approach for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014

Evaluating deghosting algorithms for HDR images.

[BibT_eX]

[DOI]

Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Visual saliency guided exposure fusion.

[BibT_eX]

[DOI]

Kubra Mammadova

Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Data-driven image captioning with meta-class based retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Image colorization via dense correspondences.

[BibT_eX]

[DOI]

Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Top down saliency estimation via superpixel-based discriminative dictionaries.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference, 2014

2013

Structure-preserving image smoothing via region covariances.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2013

Visual saliency estimation by integrating features using multiple kernel learning.

[BibT_eX]

[DOI]

Yasin Kavak

CoRR, 2013

Group sparsity based sparse coding for region covariances.

[BibT_eX]

[DOI]

Hasan Tugrul Erdogan

Proceedings of the 21st Signal Processing and Communications Applications Conference, 2013

Visual Attention-Driven Spatial Pooling for Image Memorability.

[BibT_eX]

[DOI]

Bora Celikkale

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012

Visual tracking by fusing multiple cues with context-sensitive reliabilities.

[BibT_eX]

[DOI]

Séverine Dubuisson

Isabelle Bloch

Pattern Recognit., 2012

Fragments based tracking with adaptive cue integration.

[BibT_eX]

[DOI]

Séverine Dubuisson

Isabelle Bloch

Comput. Vis. Image Underst., 2012

Revisiting milis multiple instance learning algorithm with a different instance selection mechanism.

[BibT_eX]

[DOI]

Osman Akin

Proceedings of the 20th Signal Processing and Communications Applications Conference, 2012

2011

Multiple-Instance Learning with Instance Selection via Dominant Sets.

[BibT_eX]

[DOI]

Proceedings of the Similarity-Based Pattern Recognition - First International Workshop, 2011

2009

Mumford-Shah Regularizer with Contextual Feedback.

[BibT_eX]

[DOI]

J. Math. Imaging Vis., 2009

Segmentation using the edge strength function as a shape prior within a local deformation model.

[BibT_eX]

[DOI]

Luminita A. Vese

Proceedings of the International Conference on Image Processing, 2009

2008

Simultaneous bottom-up/top-down processing in early and mid level vision (Erken ve orta düzey görmede alttan üste/yukarıdan aşağı eşzamanlı işleme)

[BibT_eX]

[DOI]

PhD thesis, 2008

Disconnected Skeleton: Shape at Its Absolute Scale.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2008

2007

Mumford-Shah Regularizer with Spatial Coherence.

[BibT_eX]

[DOI]

Aysun Sancar-Yilmaz

Proceedings of the Scale Space and Variational Methods in Computer Vision, 2007

2005

Edge Strength Functions as Shape Priors in Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2005

2003

Computer vision based unistroke keyboard system and mouse for the handicapped.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Image-based Extraction of Material Reflectance Properties of a 3D Rigid Object.

[BibT_eX]

[DOI]