Xavier Giró-i-Nieto

Orcid: 0000-0002-9935-5332

Affiliations:
  • Polytechnic University of Catalonia, Barcelona, Spain


According to our database1, Xavier Giró-i-Nieto authored at least 133 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
HyperFast: Instant Classification for Tabular Data.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Neural ADMIXTURE for rapid genomic clustering.
Nat. Comput. Sci., 2023

A closer look at referring expressions for video object segmentation.
Multim. Tools Appl., 2023

The Liver Tumor Segmentation Benchmark (LiTS).
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Medical Image Anal., 2023

Towards Robust Image-in-Audio Deep Steganography.
CoRR, 2023

SIRA: Relightable Avatars from a Single Image.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Adversarial Learning for Feature Shift Detection and Correction.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Sign Language Translation from Instructional Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Genomic Databases Homogenization with Machine Learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

2022
Topic Detection in Continuous Sign Language Videos.
CoRR, 2022

Hyper-Representations for Pre-Training and Transfer Learning.
CoRR, 2022

Tackling Low-Resourced Sign Language Translation: UPC at WMT-SLT 22.
Proceedings of the Seventh Conference on Machine Translation, 2022

Model Zoos: A Dataset of Diverse Populations of Neural Network Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Pixinwav: Residual Steganography for Hiding Pixels in Audio.
Proceedings of the IEEE International Conference on Acoustics, 2022

Generative Moment Matching Networks for Genotype Simulation.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022

Predicting Dog Phenotypes from Genotypes.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022

Sign Language Video Retrieval with Free-Form Textual Queries.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Unsupervised Skill-Discovery and Skill-Learning in Minecraft.
CoRR, 2021

Multiple Object Tracking with Mixture Density Networks for Trajectory Estimation.
CoRR, 2021

SynthRef: Generation of Synthetic Referring Expressions for Object Segmentation.
CoRR, 2021

H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Seasonal Contrast: Unsupervised Pre-Training from Uncurated Remote Sensing Data.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

How2Sign: A Large-Scale Multimodal Dataset for Continuous American Sign Language.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Mask-guided sample selection for semi-supervised instance segmentation.
Multim. Tools Appl., 2020

Can Everybody Sign Now? Exploring Sign Language Video Generation from 2D Poses.
CoRR, 2020

RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation.
CoRR, 2020

How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language.
CoRR, 2020

Curriculum Learning for Recurrent Video Object Segmentation.
CoRR, 2020

Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and Videos.
CoRR, 2020

Enhancing Online Knowledge Graph Population with Semantic Knowledge.
Proceedings of the Semantic Web - ISWC 2020, 2020

One Perceptron to Rule Them All: Language, Vision, Audio and Speech.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Automatic Reminiscence Therapy for Dementia.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Multiresolution co-clustering for uncalibrated multiview segmentation.
Signal Process. Image Commun., 2019

Recurrent Instance Segmentation using Sequences of Referring Expressions.
CoRR, 2019

Hate Speech in Pixels: Detection of Offensive Memes towards Automatic Moderation.
CoRR, 2019

VLX-Stories: Building an Online Event Knowledge Base with Emerging Entity Detection.
Proceedings of the Semantic Web - ISWC 2019, 2019

VLX-Stories: A Semantically Linked Event Platform for Media Publishers.
Proceedings of the ISWC 2019 Satellite Tracks (Posters & Demonstrations, 2019

Recurrent Instance Segmentation using Sequences of Referring Expressions.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

Assessing Knee OA Severity with CNN attention-based end-to-end architectures.
Proceedings of the International Conference on Medical Imaging with Deep Learning, 2019

Hyperparameter-Free Losses for Model-Based Monocular Reconstruction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Multi-View 3D Face Reconstruction in the Wild Using Siamese Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Wav2Pix: Speech-conditioned Face Generation Using Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

RVOS: End-To-End Recurrent Network for Video Object Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Inverse Cooking: Recipe Generation From Food Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Budget-aware Semi-Supervised Semantic and Instance Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Simple vs complex temporal recurrences for video saliency prediction.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Scanpath and saliency prediction on 360 degree images.
Signal Process. Image Commun., 2018

Introduction to the special issue: Egocentric Vision and Lifelogging.
J. Vis. Commun. Image Represent., 2018

Importance Weighted Evolution Strategies.
CoRR, 2018

Temporal Saliency Adaptation in Egocentric Videos.
CoRR, 2018

Online Action Detection in Untrimmed, Streaming Videos - Modeling and Evaluation.
CoRR, 2018

Linking Media: adopting Semantic Technologies for Multimodal Media Connection.
Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, USA, October 8th - to, 2018

Demonstration of an Open Source Framework for Qualitative Evaluation of CBIR Systems.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

An Interactive Lifelog Search Engine for LSC2018.
Proceedings of the 2018 ACM Workshop on The Lifelog Search Challenge, 2018

What Is Going on in the World? A Display Platform for Media Understanding.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Comparing Fixed and Adaptive Computation Time for Recurrent Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Cross-modal Embeddings for Video and Audio Retrieval.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Online Detection of Action Start in Untrimmed, Streaming Videos.
Proceedings of the Computer Vision - ECCV 2018, 2018

PathGAN: Visual Scanpath Prediction with Generative Adversarial Networks.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Saliency Weighted Convolutional Features for Instance Search.
Proceedings of the 2018 International Conference on Content-Based Multimedia Indexing, 2018

2017
From pixels to sentiment: Fine-tuning CNNs for visual sentiment prediction.
Image Vis. Comput., 2017

Recurrent Neural Networks for Semantic Instance Segmentation.
CoRR, 2017

Detection-aided liver lesion segmentation using deep learning.
CoRR, 2017

Cost-Effective Active Learning for Melanoma Segmentation.
CoRR, 2017

SalGAN: Visual Saliency Prediction with Generative Adversarial Networks.
CoRR, 2017

Disentangling Motion, Foreground and Background Features in Videos.
CoRR, 2017

Semantic Summarization of Egocentric Photo Stream Events.
Proceedings of the 2nd Workshop on Lifelogging Tools and Applications, 2017

LTA 2017: The Second Workshop on Lifelogging Tools and Applications.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

More Cat than Cute?: Interpretable Prediction of Adjective-Noun Pairs.
Proceedings of the Workshop on Multimodal Understanding of Social, 2017

ViTS: Video Tagging System from Massive Web Multimedia Collections.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

SaltiNet: Scan-Path Prediction on 360 Degree Images Using Saliency Volumes.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Distributed training strategies for a computer vision deep learning algorithm on a distributed GPU cluster.
Proceedings of the International Conference on Computational Science, 2017

Scaling a Convolutional Neural Network for classification of Adjective Noun Pairs with TensorFlow on GPU Clusters.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017

Class Weighted Convolutional Features for Visual Instance Search.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Assessment of crowdsourcing and gamification loss in user-assisted object segmentation.
Multim. Tools Appl., 2016

Temporal Activity Detection in Untrimmed Videos with Recurrent Neural Networks.
CoRR, 2016

Open-Ended Visual Question-Answering.
CoRR, 2016

Hierarchical Object Detection with Deep Reinforcement Learning.
CoRR, 2016

Dublin City University and Partners' Participation in the INS and VTT Tracks at TRECVid 2016.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

LEMoRe: A Lifelog Engine for Moments Retrieval at the NTCIR-Lifelog LSAT Task.
Proceedings of the 12th NTCIR Conference on Evaluation of Information Access Technologies, 2016

Where is my Phone?: Personal Object Retrieval from Egocentric Images.
Proceedings of the first Workshop on Lifelogging Tools and Applications, 2016

LTA 2016: The First Workshop on Lifelogging Tools and Applications.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Bags of Local Convolutional Features for Scalable Instance Search.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Faster R-CNN Features for Instance Search.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

Shallow and Deep Convolutional Networks for Saliency Prediction.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Large scale content-based video retrieval with LIvRE.
Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016

2015
Improving object segmentation by using EEG signals and rapid serial visual presentation.
Multim. Tools Appl., 2015

End-to-end Convolutional Network for Saliency Prediction.
CoRR, 2015


Diving Deep into Sentiment: Understanding Fine-tuned CNNs for Visual Sentiment Prediction.
Proceedings of the 1st International Workshop on Affect & Sentiment in Multimedia, 2015

Exploring EEG for Object Detection and Retrieval.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

UPC-UB-STP @ MediaEval 2015 Diversity Task: Iterative Reranking of Relevant Images.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

Visual summary of egocentric photostreams by representative keyframes.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Improving spatial codification in semantic segmentation.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Quality control in crowdsourced object segmentation.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Cultural Event recognition with visual ConvNets and temporal models.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Event video retrieval using global and local descriptors in visual domain.
Proceedings of the 13th International Workshop on Content-Based Multimedia Indexing, 2015

Visual information retrieval in endoscopic video archives.
Proceedings of the 13th International Workshop on Content-Based Multimedia Indexing, 2015

2014
Improving retrieval accuracy of Hierarchical Cellular Trees for generic metric spaces.
Multim. Tools Appl., 2014

From global image annotation to interactive object segmentation.
Multim. Tools Appl., 2014

Insight Centre for Data Analytics (DCU) at TRECVid 2014: Instance Search and Semantic Indexing Tasks.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Object Segmentation in Images using EEG Signals.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Click'n'Cut: Crowdsourced Interactive Segmentation with Object Candidates.
Proceedings of the 2014 International ACM Workshop on Crowdsourcing for Multimedia, 2014

UPC at MediaEval 2014 Social Event Detection Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

2013
Crowdsourced object segmentation with a game.
Proceedings of the 2nd ACM international workshop on Crowdsourcing for multimedia, 2013

UPC at MediaEval 2013 Hyperlinking Task.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

UPC at MediaEval 2013 Social Event Detection Task.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Automatic keyframe selection based on mutual reinforcement algorithm.
Proceedings of the 11th International Workshop on Content-Based Multimedia Indexing, 2013

2012
Part-based object retrieval with binary partition trees.
PhD thesis, 2012

Interactive segmentation and tracking of video objects.
Proceedings of the 13th International Workshop on Image Analysis for Multimedia Interactive Services, 2012

Multiscale annotation of still images with GAT.
Proceedings of the 1st International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications, 2012

Hierarchical Navigation and Visual Search for Video Keyframe Retrieval.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

2011
Acoustic Event Detection Based on Feature-Level Fusion of Audio and Video Modalities.
EURASIP J. Adv. Signal Process., 2011

Rich Internet Application for Semi-automatic Annotation of Semantic Shots on Keyframes.
Proceedings of the Computational Intelligence for Multimedia Understanding, 2011

Diversity ranking for video retrieval from a broadcaster archive.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

2010
GAT: a Graphical Annotation Tool for semantic regions.
Multim. Tools Appl., 2010

TRECVID 2010 Experiments at Dublin City University.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Digimatge, a rich internet application for video retrieval from a multimedia asset management system.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

System architecture of a web service for Content-Based Image Retrieval.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

2009
Improving detection of acoustic events using audiovisual data and feature level fusion.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Audiovisual event detection towards scene understanding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

2007
The socio-economic dimensions of ICT-driven educational change.
Comput. Educ., 2007

Composite Object Detection in Video Sequences: Application to Controlled Environments.
Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007

Region-based Annotation Tool using Partition Trees.
Proceedings of the Poster and Demo Proceedings of the 2nd International Conference on Semantic and Digital Media Technologies, 2007

2006
BPT Enhancement Based on Syntactic and Semantic Criteria.
Proceedings of the Semantic Multimedia, 2006

From Partition Trees to Semantic Trees.
Proceedings of the Multimedia Content Representation, 2006

2005
Detection of semantic objects using description graphs.
Proceedings of the 2005 International Conference on Image Processing, 2005

Automatic Extraction and Analysis of Visual Objects Information.
Proceedings of the Multimedia Content and the Semantic Web, 2005

2003
Wavelet Coding of Volumetric Medical Datasets.
IEEE Trans. Medical Imaging, 2003

Unified Access to Heterogeneous Audiovisual Archives.
J. Univers. Comput. Sci., 2003


  Loading...