Alan Hanjalic

Orcid: 0000-0002-5771-2549

  • Delft University of Technology, Netherlands

According to our database, Alan Hanjalic authored at least 229 papers between 1996 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


IEEE Fellow

IEEE Fellow 2016, "For contributions to multimedia information retrieval".








AGALE: A Graph-Aware Continual Learning Evaluation Framework.
Trans. Mach. Learn. Res., 2024

A data-centric approach for assessing progress of Graph Neural Networks.
CoRR, 2024

Few-Shot Learning for Fine-Grained Emotion Recognition Using Physiological Signals.
IEEE Trans. Multim., 2023

Multi-label Node Classification On Graph-Structured Data.
Trans. Mach. Learn. Res., 2023

Weakly-Supervised Learning for Fine-Grained Emotion Recognition Using Physiological Signals.
IEEE Trans. Affect. Comput., 2023

Depth-Aware Sparse Transformer for Video-Language Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Mitigating Mainstream Bias in Recommendation via Cost-sensitive Learning.
Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval, 2023

Task-Aware Connectivity Learning for Incoming Nodes Over Growing Graphs.
IEEE Trans. Signal Inf. Process. over Networks, 2022

Temporal Network Prediction and Interpretation.
IEEE Trans. Netw. Sci. Eng., 2022

Guest Editorial: Learning From Noisy Multimedia Data.
IEEE Trans. Multim., 2022

Joint Feature Synthesis and Embedding: Adversarial Cross-Modal Retrieval Revisited.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Topological-temporal properties of evolving networks.
J. Complex Networks, 2022

Influence of clustering coefficient on network embedding in link prediction.
Appl. Netw. Sci., 2022

Subjective QoE Evaluation of User-Centered Adaptive Streaming of Dynamic Point Clouds.
Proceedings of the 14th International Conference on Quality of Multimedia Experience, 2022

Evaluating the Impact of Tiled User-Adaptive Real-Time Point Cloud Streaming on VR Remote Communication.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Graph-Time Convolutional Autoencoders.
Proceedings of the Learning on Graphs Conference, 2022

Cross-Modal Hybrid Feature Fusion for Image-Sentence Matching.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Radial Graph Convolutional Network for Visual Question Generation.
IEEE Trans. Neural Networks Learn. Syst., 2021

Generating Images From Spoken Descriptions.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

CorrNet: Fine-Grained Emotion Recognition for Video Watching Using Wearable Physiological Sensors.
Sensors, 2021

Towards user-oriented privacy for recommender system data: A personalization-based approach to gender obfuscation for user profiles.
Inf. Process. Manag., 2021

Accuracy-diversity trade-off in recommender systems via graph convolutions.
Inf. Process. Manag., 2021

Leave No User Behind: Towards Improving the Utility of Recommender Systems for Non-mainstream Users.
Proceedings of the WSDM '21, 2021

New Insights into Metric Optimization for Ranking-based Recommendation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

How do Metric Score Distributions affect the Type I Error Rate of Statistical Significance Tests in Information Retrieval?
Proceedings of the ICTIR '21: The 2021 ACM SIGIR International Conference on the Theory of Information Retrieval, 2021

One deep music representation to rule them all? A comparative analysis of different representation learning strategies.
Neural Comput. Appl., 2020

Unified Binary Generative Adversarial Network for Image Retrieval and Compression.
Int. J. Comput. Vis., 2020

Partially Synthetic Data for Recommender Systems: Prediction Performance and Preference Hiding.
CoRR, 2020

User Centered Adaptive Streaming of Dynamic Point Clouds with Low Complexity Tiling.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

S2IGAN: Speech-to-Image Generation via Adversarial Learning.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

RCEA: Real-time, Continuous Emotion Annotation for Collecting Precise Mobile Video Ground Truth Labels.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020

Top-N Recommendation with Multi-Channel Positive Feedback using Factorization Machines.
ACM Trans. Inf. Syst., 2019

From Deterministic to Generative: Multimodal Stochastic RNNs for Video Captioning.
IEEE Trans. Neural Networks Learn. Syst., 2019

Make Some Noise. Unleashing the Power of Convolutional Neural Networks for Profiled Side-channel Analysis.
IACR Trans. Cryptogr. Hardw. Embed. Syst., 2019

Are Nearby Neighbors Relatives? Testing Deep Music Embeddings.
Frontiers Appl. Math. Stat., 2019

Are Nearby Neighbors Relatives?: Diagnosing Deep Music Embedding Spaces.
CoRR, 2019

A New Perspective on Score Standardization.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Statistical Significance Testing in Information Retrieval: An Empirical Analysis of Type I, Type II and Type III Errors.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Data Masking for Recommender Systems: Prediction Performance and Rating Hiding.
Proceedings of ACM RecSys 2019 Late-Breaking Results co-located with the 13th ACM Conference on Recommender Systems, 2019

The influence of personal values on music taste: towards value-based music recommendations.
Proceedings of the 13th ACM Conference on Recommender Systems, 2019

Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Suppressing Information Diffusion via Link Blocking in Temporal Networks.
Proceedings of the Complex Networks and Their Applications VIII, 2019

From Intra-Modal to Inter-Modal Space: Multi-task Learning of Shared Representations for Cross-Modal Retrieval.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

Audio Segmentation.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Audio Representation.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Audio Content Analysis.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Audio Classification.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Detecting Socially Significant Music Events Using Temporally Noisy Labels.
IEEE Trans. Multim., 2018

Geo-Distinctive Visual Element Matching for Location Estimation of Images.
IEEE Trans. Multim., 2018

Video Captioning by Adversarial LSTM.
IEEE Trans. Image Process., 2018

Semantic-aware blind image quality assessment.
Signal Process. Image Commun., 2018

Factorization Machines for Data with Implicit Feedback.
CoRR, 2018

Information diffusion backbones in temporal networks.
CoRR, 2018

Towards Seed-Free Music Playlist Generation: Enhancing Collaborative Filtering with Playlist Title Information.
Proceedings of the ACM Recommender Systems Challenge, 2018

Binary Generative Adversarial Networks for Image Retrieval.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Exploiting scene maps and spatial relationships in quasi-static scenes for video face clustering.
Image Vis. Comput., 2017

Multimedia Research: What Is the Right Approach?
IEEE Multim., 2017

From Deterministic to Generative: Multi-Modal Stochastic RNNs for Video Captioning.
CoRR, 2017

Modeling of Information Diffusion on Social Networks with Applications to WeChat.
CoRR, 2017

Vision-based Detection of Acoustic Timed Events: a Case Study on Clarinet Note Onsets.
CoRR, 2017

Adversarial Cross-Modal Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

On the Automatic Identification of Music for Common Activities.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

The Geo-Privacy Bonus of Popular Photo Enhancements.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

A Reliable Methodology to Collect Ground Truth Data of Image Aesthetic Appeal.
IEEE Trans. Multim., 2016

On detecting the playing/non-playing activity of musicians in symphonic music videos.
Comput. Vis. Image Underst., 2016

User Intent in Multimedia Search: A Survey of the State of the Art and Future Challenges.
ACM Comput. Surv., 2016

Learning Subclass Representations for Visually-varied Image Classification.
CoRR, 2016

Where to be wary: The impact of widespread photo-taking and image enhancement practices on users' geo-privacy.
CoRR, 2016

Bayesian Personalized Ranking with Multi-Channel User Feedback.
Proceedings of the 10th ACM Conference on Recommender Systems, 2016

QoE Prediction for Enriched Assessment of Individual Video Viewing Experience.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Augmenting Blind Image Quality Assessment Using Image Semantics.
Proceedings of the IEEE International Symposium on Multimedia, 2016

Does visual quality depend on semantics? A study on the relationship between impairment annoyance and image semantics at early attentive stages.
Proceedings of the Human Vision and Electronic Imaging, 2016

Simple tag-based subclass representations for visually-varied image classes.
Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016

Exploiting the Deep-Link Commentsphere to Support Non-Linear Video Access.
IEEE Trans. Multim., 2015

Global-Scale Location Prediction for Social Images Using Geo-Visual Ranking.
IEEE Trans. Multim., 2015

Uploader Intent for Online Video: Typology, Inference, and Applications.
IEEE Trans. Multim., 2015

Guest Editorial: Challenges and Perspectives for Affective Analysis in Multimedia.
IEEE Trans. Affect. Comput., 2015

Multimedia Search: From Relevance to Usefulness.
IEEE Multim., 2015

Scientific Conferences.
IEEE Multim., 2015

Recommendation with the Right Slice: Speeding Up Collaborative Filtering with Factorization Machines.
Poster Proceedings of the 9th ACM Conference on Recommender Systems, 2015

Evento 360: Social Event Discovery from Web-scale Multimedia Collection.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Comparative Analysis of Orchestral Performance Recordings: An Image-Based Approach.
Proceedings of the 16th International Society for Music Information Retrieval Conference, 2015

Towards a comprehensive model for predicting the quality of individual visual experience.
Proceedings of the Human Vision and Electronic Imaging XX, 2015

Pairwise geometric matching for large-scale object retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Corpus Development for Affective Video Indexing.
IEEE Trans. Multim., 2014

Predicting Failing Queries in Video Search.
IEEE Trans. Multim., 2014

Intent-Aware Video Search Result Optimization.
IEEE Trans. Multim., 2014

Multimedia Data Management in Mobile Computing.
IEEE Multim., 2014

Collaborative Filtering beyond the User-Item Matrix: A Survey of the State of the Art and Future Challenges.
ACM Comput. Surv., 2014

'Free lunch' enhancement for collaborative filtering with factorization machines.
Proceedings of the Eighth ACM Conference on Recommender Systems, 2014

Beauty is in the scale of the beholder: Comparison of methodologies for the subjective assessment of image aesthetic appeal.
Proceedings of the Sixth International Workshop on Quality of Multimedia Experience, 2014

Heterogeneous recovery rates against SIS epidemics in directed networks.
Proceedings of the 7th International Conference on NETwork Games, COntrol and OPtimization, 2014

Detecting Drops in Electronic Dance Music: Content based approaches to a socially significant music event.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Exploiting Instrument-wise Playing/Non-Playing Labels for Score Synchronization of Symphonic Music.
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014

Cross-Domain Collaborative Filtering with Factorization Machines.
Proceedings of the Advances in Information Retrieval, 2014

CARS2: Learning Context-aware Representations for Context-aware Recommendations.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

Multimedia retrieval that matters.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Learning Crowdsourced User Preferences for Visual Summarization of Image Collections.
IEEE Trans. Multim., 2013

Generating Visual Summaries of Geographic Areas Using Community-Contributed Images.
IEEE Trans. Multim., 2013

Nontrivial landmark recommendation using geotagged photos.
ACM Trans. Intell. Syst. Technol., 2013

Mining contextual movie similarity with matrix factorization for context-aware recommendation.
ACM Trans. Intell. Syst. Technol., 2013

Unifying rating-oriented and ranking-oriented collaborative filtering for improved recommendation.
Inf. Sci., 2013

Searching for images by video.
Int. J. Multim. Inf. Retr., 2013

When music makes a scene.
Int. J. Multim. Inf. Retr., 2013

Learning to Rerank Web Images.
IEEE Multim., 2013

Generalized Tag-induced Cross-Domain Collaborative Filtering.
CoRR, 2013

Looking beyond sound: Unsupervised analysis of musician videos.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

xCLiMF: optimizing expected reciprocal rank for data with multiple levels of relevance.
Proceedings of the Seventh ACM Conference on Recommender Systems, 2013

How do we deep-link?: leveraging user-contributed time-links for non-linear video access.
Proceedings of the ACM Multimedia Conference, 2013

Geo-visual ranking for location prediction of social images.
Proceedings of the International Conference on Multimedia Retrieval, 2013

Exploration of Feature Combination in Geo-visual Ranking for Visual Content-based Location Prediction.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

CLiMF: Collaborative Less-Is-More Filtering.
Proceedings of the IJCAI 2013, 2013

GAPfm: optimal top-n recommendations for graded relevance domains.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

A unified context model for web image retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Prototype-Based Image Search Reranking.
IEEE Trans. Multim., 2012

Special Section on Object and Event Classification in Large-Scale Video Collections.
IEEE Trans. Multim., 2012

Leveraging visual concepts and query performance prediction for semantic-theme-based video retrieval.
Int. J. Multim. Inf. Retr., 2012

New grand challenge for multimedia information retrieval: bridging the utility gap.
Int. J. Multim. Inf. Retr., 2012

Adaptive diversification of recommendation results via latent factor portfolio.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

TFMAP: optimizing MAP for top-n context-aware recommendation.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

CLiMF: learning to maximize reciprocal rank with collaborative less-is-more filtering.
Proceedings of the Sixth ACM Conference on Recommender Systems, 2012

A New Gap to Bridge: Where to Go Next in Social Media Retrieval? - (Extended Abstract).
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

LikeLines: collecting timecode-level feedback for web videos through user interactions.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

MuseSync: standing on the shoulders of Hollywood.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

When video search goes wrong: predicting query failure using search engine logs and visual search results.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Intent and its discontents: the user at the wheel of the online video search engine.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Preliminary Exploration of the Use of Geographical Information for Content-based Geo-tagging of Social Video.
Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

A structure-based video representation for web video categorization.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines.
Proceedings of the Multimodal Music Processing, 2012

Music Information Technology and Professional Stakeholder Audiences: Mind the Adoption Gap.
Proceedings of the Multimodal Music Processing, 2012

Object Retrieval Using Visual Query Context.
IEEE Trans. Multim., 2011

A framework for unsupervised training of object detectors from unlabeled surveillance video.
J. Ambient Intell. Smart Environ., 2011

Tags as Bridges between Domains: Improving Recommendation with Tag-Induced Cross-Domain Collaborative Filtering.
Proceedings of the User Modeling, Adaption and Personalization, 2011

Expressivity in Musical Timing in Relation to Musical Structure and Interpretation: A Cross-Performance, Audio-Based Approach.
Proceedings of the AES International Conference Semantic Audio 2011, 2011

Learning from search engine and human supervision for web image search.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Video-based image retrieval.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Finding representative and diverse community contributed images to create visual summaries of geographic areas.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

The need for music information retrieval with user-centered and multimodal strategies.
Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategies, Scottsdale, AZ, USA, November 28, 2011

Reading between the tags to predict real-world size-class for visually depicted objects in images.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Alice's worlds of wonder: exploiting tags to understand images in terms of size and scale.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Frontiers in multimedia search.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

TUD-MM at MediaEval 2011 Genre Tagging Task: Video search reranking for genre tagging.
Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

TUD-MIR at MediaEval 2011 Genre Tagging Task: Query expansion from a limited number of labeled videos.
Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

Expressive Timing from Cross-Performance and Audio-based Alignment Patterns: An Extended Case Study.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

Personalized Landmark Recommendation Based on Geotags from Photo Sharing Sites.
Proceedings of the Fifth International Conference on Weblogs and Social Media, 2011

How Far Are We in Trust-Aware Recommendation?
Proceedings of the Advances in Information Retrieval, 2011

Reranking Collaborative Filtering with Multiple Self-contained Modalities.
Proceedings of the Advances in Information Retrieval, 2011

To Seek, Perchance to Fail: Expressions of User Needs in Internet Video Search.
Proceedings of the Advances in Information Retrieval, 2011

A System Concept for Socially Enriched Access to Soccer Video Collections.
IEEE Multim., 2010

Visual concept-based selection of query expansions for spoken content retrieval.
Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

List-wise learning to rank with matrix factorization for collaborative filtering.
Proceedings of the 2010 ACM Conference on Recommender Systems, 2010

Supervised reranking for web image search.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Exploiting noisy visual concept detection to improve spoken content based video retrieval.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Advances in multimedia retrieval, part i: frontiers in multimedia search.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

The influence of internet hypes on multimedia information retrieval research.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Exploiting Result Consistency to Select Query Expansions for Spoken Content Retrieval.
Proceedings of the Advances in Information Retrieval, 2010

Contextual image retrieval model.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Audio Segmentation.
Proceedings of the Encyclopedia of Database Systems, 2009

Audio Representation.
Proceedings of the Encyclopedia of Database Systems, 2009

Audio Content Analysis.
Proceedings of the Encyclopedia of Database Systems, 2009

Audio Classification.
Proceedings of the Encyclopedia of Database Systems, 2009

Proceedings of the Encyclopedia of Database Systems, 2009

Integration of Context and Content for Multimedia Management: An Introduction to the Special Issue.
IEEE Trans. Multim., 2009

Text-Like Segmentation of General Audio for Content-Based Retrieval.
IEEE Trans. Multim., 2009

On emerging techniques for multimedia content sharing, search and understanding.
J. Vis. Commun. Image Represent., 2009

Eye localization in low and standard definition content with application to face matching.
Comput. Vis. Image Underst., 2009

Unsupervised and simultaneous training of multiple object detectors from unlabeled surveillance video.
Comput. Vis. Image Underst., 2009

Exploiting visual reranking to improve pseudo-relevance feedback for spoken-content-based video retrieval.
Proceedings of the 10th Workshop on Image Analysis for Multimedia Interactive Services, 2009

Exploiting user similarity based on rated-item pools for improved user-based collaborative filtering.
Proceedings of the 2009 ACM Conference on Recommender Systems, 2009

Visual Resampling for Pseudo-Relevance Feedback during Speech-based Video Retrieval.
Proceedings of the LWA 2009: Workshop-Woche: Lernen, 2009

Cover Song Retrieval: A Comparative Study of System Component Choices.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Towards unsupervised learning for automatic multi-class object detection in surveillance videos.
Proceedings of the IEEE International Conference on Acoustics, 2009

Special section from the ACM multimedia conference 2007.
ACM Trans. Multim. Comput. Commun. Appl., 2008

Audio Keywords Discovery for Text-Like Audio Content Analysis and Retrieval.
IEEE Trans. Multim., 2008

Co-clustering for Auditory Scene Categorization.
IEEE Trans. Multim., 2008

Autonomous and Adaptive Learning of Shadows for Surveillance.
Proceedings of the Ninth International Workshop on Image Analysis for Multimedia Interactive Services, 2008

Unsupervised anchor space generation for similarity measurement of general audio.
Proceedings of the IEEE International Conference on Acoustics, 2008

Accurate eye localization in low and standard definition content.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Online training of object detectors from unlabeled surveillance video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

Eye localization for face matching: is it always useful and under what conditions?
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Towards Theoretical Performance Limits of Video Parsing.
IEEE Trans. Circuits Syst. Video Technol., 2007

Intelligent browsing of concert videos.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Person-based search in videos.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Comparison of face matching techniques under pose variation.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

A Combined RANSAC-Hough Transform Algorithm for Fundamental Matrix Estimation.
Proceedings of the British Machine Vision Conference 2007, 2007

On the development of an autonomous and self-adaptable moving object detector.
Proceedings of the Fourth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2007

A flexible framework for key audio effects detection and auditory context inference.
IEEE Trans. Speech Audio Process., 2006

Extracting moods from pictures and sounds: towards truly personalized TV.
IEEE Signal Process. Mag., 2006

Low Level Analysis of Video Using Spatiotemporal Pixel Blocks.
Proceedings of the Multimedia Content Representation, 2006

Towards optimal audio "keywords" detection for audio content analysis and discovery.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Towards a Robust Solution to People Counting.
Proceedings of the International Conference on Image Processing, 2006

Audio Elements Based Auditory Scene Segmentation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Affective video content representation and modeling.
IEEE Trans. Multim., 2005

Adaptive extraction of highlights from a sport video based on excitement modeling.
IEEE Trans. Multim., 2005

TU Delft at TRECVID 2005: Shot Boundary Detection.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

Unsupervised content discovery in composite audio.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Real-Time and Distributed AV Content Analysis System for Consumer Electronics Networks.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

The Multimedian Concert-Video Browser.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Towards a unified framework for content-based audio analysis.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Outlier Identification in Stereo Correspondences using Quadrics.
Proceedings of the British Machine Vision Conference 2005, Oxford, UK, September 2005, 2005

Logo recognition in video by line profile classification.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

Content-based analysis of digital video.
Kluwer, ISBN: 978-1-4020-8114-9, 2004

Multimodal approach to measuring excitement in video.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Logo recognition in video stills by string matching.
Proceedings of the 2003 International Conference on Image Processing, 2003

Generic approach to highlights extraction from a sport video.
Proceedings of the 2003 International Conference on Image Processing, 2003

Shot-boundary detection: unraveled and resolved?
IEEE Trans. Circuits Syst. Video Technol., 2002

Logo detection and classification in a sport video: video indexing for sponsorship revenue control.
Proceedings of the Storage and Retrieval for Media Databases 2002, 2002

Indexing and retrieval of TV broadcast news using DANCERS.
J. Electronic Imaging, 2001

Recent Advances in Video Content Analysis: From Visual Features to Semantic Video Segments.
Int. J. Image Graph., 2001

DANCERS: Delft advanced news retrieval system.
Proceedings of the Storage and Retrieval for Media Databases 2001, 2001

Video and image retrieval beyond the cognitive level: the needs and possibilities.
Proceedings of the Storage and Retrieval for Media Databases 2001, 2001

Broadcast News Indexing Using Dancers.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Toward a robust solution for image coding with easy content access.
Proceedings of the Visual Communications and Image Processing 2000, 2000

An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis.
IEEE Trans. Circuits Syst. Video Technol., 1999

Automated high-level movie segmentation for advanced video-retrieval systems.
IEEE Trans. Circuits Syst. Video Technol., 1999

Automatically Segmenting Movies into Logical Story Units.
Proceedings of the Visual Information and Information Systems, 1999

Semiautomatic news analysis, indexing, and classification system based on topic preselection.
Proceedings of the Storage and Retrieval for Image and Video Databases VII, 1999

Optimal Shot Boundary Detection Based on Robust Statistical Models.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Efficient Image Codec with Reduced Content Access Work.
Proceedings of the 1999 International Conference on Image Processing, 1999

A New Method for Key Frame Based Video Content Representation.
Proceedings of the Image Databases and Multi-Media Search, 1998

Template-based Detection of Anchorperson Shots in News Programs.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Rate distortion optimal contour compression using cubic B-splines.
Proceedings of the 9th European Signal Processing Conference, 1998

Automation of Systems Enabling Search on Stored Video Data.
Proceedings of the Storage and Retrieval for Image and Video Databases V, 1997

3D motion and scene structure estimation with motion dependent distortion of measurement windows.
Proceedings of the 1996 International Conference on Image Processing, 1996

Visual search in a SMASH system.
Proceedings of the 1996 International Conference on Image Processing, 1996
