Jordi Pont-Tuset

Orcid: 0000-0001-7133-3724

According to our database1, Jordi Pont-Tuset authored at least 60 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Imagen 3.
CoRR, 2024

Evaluating Numerical Reasoning in Text-to-Image Models.
CoRR, 2024

Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models.
CoRR, 2024

DOCCI: Descriptions of Connected and Contrasting Images.
CoRR, 2024

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings.
CoRR, 2024

Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Rich Human Feedback for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
PiGLET: Pixel-Level Grounding of Language Expressions With Transformers.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

The Liver Tumor Segmentation Benchmark (LiTS).
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Medical Image Anal., 2023

EgoCOL: Egocentric Camera pose estimation for Open-world 3D object Localization @Ego4D challenge 2023.
CoRR, 2023

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Connecting Vision and Language with Video Localized Narratives.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Two-Level Temporal Relation Model for Online Video Instance Segmentation.
CoRR, 2022

Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022.
CoRR, 2022

Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Adversarially Robust Panoptic Segmentation (ARPaS) Benchmark.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

2021
PanGEA: The Panoramic Graph Environment Annotation Toolkit.
CoRR, 2021

Telling the What while Pointing the Where: Fine-grained Mouse Trace and Language Supervision for Improved Image Retrieval.
CoRR, 2021

Panoptic Narrative Grounding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Telling the What while Pointing to the Where: Multimodal Queries for Image Retrieval.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
The Open Images Dataset V4.
Int. J. Comput. Vis., 2020

Connecting Vision and Language with Localized Narratives.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Video Object Segmentation without Temporal Information.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Natural Vocabulary Emerges from Free-Form Annotations.
CoRR, 2019

The 2019 DAVIS Challenge on VOS: Unsupervised Multi-Object Segmentation.
CoRR, 2019

2018
Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale.
CoRR, 2018

The 2018 DAVIS Challenge on Video Object Segmentation.
CoRR, 2018

Iterative Deep Retinal Topology Extraction.
Proceedings of the Patch-Based Techniques in Medical Imaging, 2018

Deep Extreme Cut: From Extreme Points to Object Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Blazingly Fast Video Object Segmentation With Pixel-Wise Metric Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Iterative Deep Learning for Road Topology Extraction.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Iterative Deep Learning for Network Topology Extraction.
CoRR, 2017

Detection-aided liver lesion segmentation using deep learning.
CoRR, 2017

The 2017 DAVIS Challenge on Video Object Segmentation.
CoRR, 2017

Semantically-Guided Video Object Segmentation.
CoRR, 2017

One-Shot Video Object Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Supervised Evaluation of Image Segmentation and Object Proposal Techniques.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Deep Retinal Image Understanding.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2016, 2016

Convolutional Oriented Boundaries.
Proceedings of the Computer Vision - ECCV 2016, 2016

A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Scale-Aware Alignment of Hierarchical Image Segmentation.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Oracle MCG: A first peek into COCO Detection Challenges.
CoRR, 2015

Video content and structure description based on keyframes, clusters and storyboards.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Boosting Object Proposals: From Pascal to COCO.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Semi-automatic video object segmentation by advanced manipulation of segmentation hierarchies.
Proceedings of the 13th International Workshop on Content-Based Multimedia Indexing, 2015

2014
Image segmentation evaluation and its application to object detection.
PhD thesis, 2014

From global image annotation to interactive object segmentation.
Multim. Tools Appl., 2014

Multiscale Combinatorial Grouping.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Measures and Meta-Measures for the Supervised Evaluation of Image Segmentation.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Upper-bound assessment of the spatial accuracy of hierarchical region-based image representations.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Supervised Assessment of Segmentation Hierarchies.
Proceedings of the Computer Vision - ECCV 2012, 2012

2010
Contour detection using Binary Partition Trees.
Proceedings of the International Conference on Image Processing, 2010

System architecture of a web service for Content-Based Image Retrieval.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

2008
ONN the Use of Neural Networks for Data Privacy.
Proceedings of the SOFSEM 2008: Theory and Practice of Computer Science, 2008

Improving Microaggregation for Complex Record Anonymization.
Proceedings of the Modeling Decisions for Artificial Intelligence, 2008

2007
Ordered Data Set Vectorization for Linear Regression on Data Privacy.
Proceedings of the Modeling Decisions for Artificial Intelligence, 2007

Increasing Polynomial Regression Complexity for Data Anonymization.
Proceedings of the 2007 International Conference on Intelligent Pervasive Computing, 2007


  Loading...