Erkut Erdem

ORCID: 0000-0002-6744-8614

According to our database, Erkut Erdem authored at least 111 papers between 2002 and 2024.

Bibliography

2024
Hyperspectral image denoising via self-modulating convolutional neural networks.
Signal Process., January, 2024

HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks.
IEEE Trans. Image Process., 2024

Object and relation centric representations for push effect prediction.
Robotics Auton. Syst., 2024

Omnidirectional image quality assessment with local-global vision transformers.
Image Vis. Comput., 2024

HUE Dataset: High-Resolution Event and Frame Sequences for Low-Light Vision.
CoRR, 2024

Evaluating Linguistic Capabilities of Multimodal LLMs in the Lens of Few-Shot Learning.
CoRR, 2024

CLIPAway: Harmonizing Focused Embeddings for Removing Objects via Diffusion Models.
CoRR, 2024

SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models.
CoRR, 2024

Hippocrates: An Open-Source Framework for Advancing Large Language Models in Healthcare.
CoRR, 2024

HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

Sequential Compositional Generalization in Multimodal Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Self-Supervised Calibration of the Denoising Networks for HSI.
Proceedings of the IGARSS 2024, 2024

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

An End-to-End Generative System for Smart Travel Assistant.
Proceedings of the 16th International Joint Conference on Knowledge Discovery, 2024

2023
CLIP-guided StyleGAN Inversion for Text-driven Real Image Editing.
ACM Trans. Graph., October, 2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
Trans. Mach. Learn. Res., 2023

Spherical Vision Transformer for 360-degree Video Saliency Prediction.
CoRR, 2023

Inst-Inpaint: Instructing to Remove Objects with Diffusion Models.
CoRR, 2023

VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ST360IQ: No-Reference Omnidirectional Image Quality Assessment With Spherical Vision Transformers.
Proceedings of the IEEE International Conference on Acoustics, 2023

Harnessing Dataset Cartography for Improved Compositional Generalization in Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Spherical Vision Transformer for 360° Video Saliency Prediction.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
A Gated Fusion Network for Dynamic Saliency Prediction.
IEEE Trans. Cogn. Dev. Syst., 2022

Leveraging semantic saliency maps for query-specific video summarization.
Multim. Tools Appl., 2022

Neural Natural Language Generation: A Survey on Multilinguality, Multimodality, Controllability and Learning.
J. Artif. Intell. Res., 2022

Detecting Euphemisms with Literal Descriptions and Visual Imagery.
CoRR, 2022

Stochastic Video Prediction with Structure and Motion.
CoRR, 2022

Multi-Contrast MRI Synthesis with Channel-Exchanging-Network.
Proceedings of the 30th Signal Processing and Communications Applications Conference, 2022

Perception-Distortion Trade-Off in the SR Space Spanned by Flow Models.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Modulating Bottom-Up and Top-Down Visual Processing via Language-Conditional Filters.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

How scene attributes and sound influence visual exploration of omnidirectional panoramic scenes.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022

Disentangling Content and Motion for Text-Based Neural Video Manipulation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Burst Photography for Learning to Enhance Extremely Dark Images.
IEEE Trans. Image Process., 2021

Synthetic18K: Learning better representations for person re-ID and attribute recognition from 1.4 million synthetic images.
Signal Process. Image Commun., 2021

Generating visual story graphs with application to photo album summarization.
Signal Process. Image Commun., 2021

Leveraging auxiliary image descriptions for dense video captioning.
Pattern Recognit. Lett., 2021

MSVD-Turkish: a comprehensive multimodal video dataset for integrated vision and language research in Turkish.
Mach. Transl., 2021

mustGAN: multi-stream Generative Adversarial Networks for MR Image Synthesis.
Medical Image Anal., 2021

Using synthetic data for person tracking under adverse weather conditions.
Image Vis. Comput., 2021

From Noon to Sunset: Interactive Rendering, Relighting, and Recolouring of Landscape Photographs by Modifying Solar Position.
Comput. Graph. Forum, 2021

NOVA: Rendering Virtual Worlds with Humans for Computer Vision Tasks.
Comput. Graph. Forum, 2021

Leveraging Frequency Based Salient Spatial Sound Localization to Improve 360° Video Saliency Prediction.
Proceedings of the 17th International Conference on Machine Vision and Applications, 2021

SLAMP: Stochastic Latent Appearance and Motion Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Cross-lingual Visual Pre-training for Multimodal Machine Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020
Manipulating Attributes of Natural Scenes via Hallucination.
ACM Trans. Graph., 2020

Hedging static saliency models to predict dynamic saliency.
Signal Process. Image Commun., 2020

MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish.
CoRR, 2020

Belief Regulated Dual Propagation Nets for Learning Action Effects on Groups of Articulated Objects.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

2019
mustGAN: Multi-Stream Generative Adversarial Networks for MR Image Synthesis.
CoRR, 2019

Belief Regulated Dual Propagation Nets for Learning Action Effects on Articulated Multi-Part Objects.
CoRR, 2019

MSVD-Turkish: A Large-Scale Dataset for Video Captioning in Turkish.
Proceedings of the 27th Signal Processing and Communications Applications Conference, 2019

A Comparative Analysis of Practices in Training Deep Models for Fashion Attribute Detection.
Proceedings of the 27th Signal Processing and Communications Applications Conference, 2019

Diverse Neural Photo Album Summarization.
Proceedings of the Ninth International Conference on Image Processing Theory, 2019

Procedural Reasoning Networks for Understanding Multimodal Procedures.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

2018
Spatio-Temporal Saliency Networks for Dynamic Saliency Prediction.
IEEE Trans. Multim., 2018

Language Guided Fashion Image Manipulation with Feature-wise Transformations.
CoRR, 2018

Image Synthesis in Multi-Contrast MRI with Conditional Generative Adversarial Networks.
CoRR, 2018

Image captioning in Turkish with subword units.
Proceedings of the 26th Signal Processing and Communications Applications Conference, 2018

Generating person images based on attributes.
Proceedings of the 26th Signal Processing and Communications Applications Conference, 2018

RecipeQA: A Challenge Dataset for Multimodal Comprehension of Cooking Recipes.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Joint Exploitation of Features and Optical Flow for Real-Time Moving Object Detection on Drones.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

2017
Alpha Matting With KL-Divergence-Based Sparse Sampling.
IEEE Trans. Image Process., 2017

A comparative study for feature integration strategies in dynamic saliency estimation.
Signal Process. Image Commun., 2017

Data-driven image captioning via salient region discovery.
IET Comput. Vis., 2017

Adjusting transient attributes of outdoor images using generative adversarial networks.
Proceedings of the 25th Signal Processing and Communications Applications Conference, 2017

Turkish cuisine: A benchmark dataset with Turkish meals for food recognition.
Proceedings of the 25th Signal Processing and Communications Applications Conference, 2017

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended Abstract).
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Feature-Based Efficient Moving Object Detection for Low-Altitude Aerial Platforms.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Re-evaluating Automatic Metrics for Image Captioning.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

2016
Deformable part-based tracking by coupled global and local correlation filters.
J. Vis. Commun. Image Represent., 2016

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures.
J. Artif. Intell. Res., 2016

Learning to Generate Images of Outdoor Scenes from Attributes and Semantic Layouts.
CoRR, 2016

Two-Stream Convolutional Networks for Dynamic Saliency Prediction.
CoRR, 2016

An Objective Deghosting Quality Metric for HDR Images.
Comput. Graph. Forum, 2016

TasvirEt: A benchmark dataset for automatic Turkish description generation from images.
Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

Summarizing personal image collections with intrinsic properties.
Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

Clustering motion trajectories via dominant sets.
Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

Dominant sets based analysis of human crowds.
Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

HUCVL at MediaEval 2016: Predicting Interesting Key Frames with Deep Models.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

The Visual Object Tracking VOT2016 Challenge Results.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016


Leveraging Captions in the Wild to Improve Object Detection.
Proceedings of the 5th Workshop on Vision and Language, 2016

2015
Predicting memorability of images using attention-driven spatial pooling and image semantics.
Image Vis. Comput., 2015

The State of the Art in HDR Deghosting: A Survey and Evaluation.
Comput. Graph. Forum, 2015

City Scale Image Geolocalization via Dense Scene Alignment.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Image Matting with KL-Divergence Based Sparse Sampling.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

A Distributed Representation Based Query Expansion Approach for Image Captioning.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Evaluating deghosting algorithms for HDR images.
Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Visual saliency guided exposure fusion.
Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Data-driven image captioning with meta-class based retrieval.
Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Image colorization via dense correspondences.
Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Top down saliency estimation via superpixel-based discriminative dictionaries.
Proceedings of the British Machine Vision Conference, 2014

2013
Structure-preserving image smoothing via region covariances.
ACM Trans. Graph., 2013

Visual saliency estimation by integrating features using multiple kernel learning.
CoRR, 2013

Group sparsity based sparse coding for region covariances.
Proceedings of the 21st Signal Processing and Communications Applications Conference, 2013

Visual Attention-Driven Spatial Pooling for Image Memorability.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Visual tracking by fusing multiple cues with context-sensitive reliabilities.
Pattern Recognit., 2012

Fragments based tracking with adaptive cue integration.
Comput. Vis. Image Underst., 2012

Revisiting milis multiple instance learning algorithm with a different instance selection mechanism.
Proceedings of the 20th Signal Processing and Communications Applications Conference, 2012

2011
Multiple-Instance Learning with Instance Selection via Dominant Sets.
Proceedings of the Similarity-Based Pattern Recognition - First International Workshop, 2011

2009
Mumford-Shah Regularizer with Contextual Feedback.
J. Math. Imaging Vis., 2009

Segmentation using the edge strength function as a shape prior within a local deformation model.
Proceedings of the International Conference on Image Processing, 2009

2008
Simultaneous bottom-up/top-down processing in early and mid level vision (Erken ve orta düzey görmede alttan üste/yukarıdan aşağı eşzamanlı işleme).
PhD thesis, 2008

Disconnected Skeleton: Shape at Its Absolute Scale.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

2007
Mumford-Shah Regularizer with Spatial Coherence.
Proceedings of the Scale Space and Variational Methods in Computer Vision, 2007

2005
Edge Strength Functions as Shape Priors in Image Segmentation.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2005

2003
Computer vision based unistroke keyboard system and mouse for the handicapped.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Image-based Extraction of Material Reflectance Properties of a 3D Rigid Object.
Proceedings of the 24th Annual Conference of the European Association for Computer Graphics, 2003

2002
Computer vision based mouse.
Proceedings of the IEEE International Conference on Acoustics, 2002

