Mohan S. Kankanhalli

Orcid: 0000-0002-4846-2015

Affiliations:
  • National University of Singapore, Department of Computer Science, Singapore


According to our database1, Mohan S. Kankanhalli authored at least 489 papers between 1990 and 2024.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2014, "For contributions to multimedia content processing and security".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Fast Yet Effective Machine Unlearning.
IEEE Trans. Neural Networks Learn. Syst., September, 2024

Recurrent Appearance Flow for Occlusion-Free Virtual Try-On.
ACM Trans. Multim. Comput. Commun. Appl., August, 2024

PAINT: Photo-realistic Fashion Design Synthesis.
ACM Trans. Multim. Comput. Commun. Appl., February, 2024

Unsupervised Domain Adaptation by Causal Learning for Biometric Signal-based HCI.
ACM Trans. Multim. Comput. Commun. Appl., February, 2024

Balanced Class-Incremental 3D Object Classification and Retrieval.
IEEE Trans. Knowl. Data Eng., January, 2024

A Comprehensive Picture of Factors Affecting User Willingness to Use Mobile Health Applications.
ACM Trans. Comput. Heal., January, 2024

Learning to Agree on Vision Attention for Visual Commonsense Reasoning.
IEEE Trans. Multim., 2024

Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering.
IEEE Trans. Multim., 2024

Multi2Human: Controllable human image generation with multimodal controls.
Neurocomputing, 2024

Multi-Modal Meta-Transfer Fusion Network for Few-Shot 3D Model Classification.
Int. J. Comput. Vis., 2024

Strong Preferences Affect the Robustness of Value Alignment.
CoRR, 2024

STAR: Skeleton-aware Text-based 4D Avatar Generation with In-Network Motion Retargeting.
CoRR, 2024

Multi-Modal Recommendation Unlearning.
CoRR, 2024

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment.
CoRR, 2024

Bridging the Intent Gap: Knowledge-Enhanced Visual Generation.
CoRR, 2024

DPTraj-PM: Differentially Private Trajectory Synthesis Using Prefix Tree and Markov Process.
CoRR, 2024

Cluster-based Graph Collaborative Filtering.
CoRR, 2024

S3Editor: A Sparse Semantic-Disentangled Self-Training Framework for Face Video Editing.
CoRR, 2024

How to Understand Named Entities: Using Common Sense for News Captioning.
CoRR, 2024

Hallucination is Inevitable: An Innate Limitation of Large Language Models.
CoRR, 2024

Privacy-Enhancing Person Re-identification Framework - A Dual-Stage Approach.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Attribute-driven Disentangled Representation Learning for Multimodal Recommendation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Diffusion Facial Forgery Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

EcoVal: An Efficient Data Valuation Framework for Machine Learning.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

MCM: Multi-condition Motion Synthesis Framework.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

AutoLoRa: An Automated Robust Fine-Tuning Framework.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

An LLM can Fool Itself: A Prompt-Based Adversarial Attack.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Finetuning Text-to-Image Diffusion Models for Fairness.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Bilateral Adaptation for Human-Object Interaction Detection with Occlusion-Robustness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PELA: Learning Parameter-Efficient Models with Low-Rank Approximation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
On Modality Bias Recognition and Reduction.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Semantic-Aware Triplet Loss for Image Classification.
IEEE Trans. Multim., 2023

Learning to Minimize the Remainder in Supervised Learning.
IEEE Trans. Multim., 2023

Disentangled Multimodal Representation Learning for Recommendation.
IEEE Trans. Multim., 2023

Joint Answering and Explanation for Visual Commonsense Reasoning.
IEEE Trans. Image Process., 2023

Zero-Shot Machine Unlearning.
IEEE Trans. Inf. Forensics Secur., 2023

When and Why Static Images Are More Effective Than Videos.
IEEE Trans. Affect. Comput., 2023

Fair Representation: Guaranteeing Approximate Multiple Group Fairness for Unknown Tasks.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Point Spatio-Temporal Transformer Networks for Point Cloud Video Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Emotional Attention: From Eye Tracking to Computational Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Understanding Before Recommendation: Semantic Aspect-Aware Review Exploitation via Large Language Models.
CoRR, 2023

Generating Human-Centric Visual Cues for Human-Object Interaction Detection via Large Vision-Language Models.
CoRR, 2023

UNK-VQA: A Dataset and A Probe into Multi-modal Large Models' Abstention Ability.
CoRR, 2023

Prior-Free Continual Learning with Unlabeled Data in the Wild.
CoRR, 2023

AutoLoRa: A Parameter-Free Automated Robust Fine-Tuning Framework.
CoRR, 2023

ELIP: Efficient Language-Image Pre-training with Fewer Vision Tokens.
CoRR, 2023

Distill to Delete: Unlearning in Graph Networks with Knowledge Distillation.
CoRR, 2023

MCM: Multi-condition Motion Synthesis Framework for Multi-scenario.
CoRR, 2023

DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation.
CoRR, 2023

Towards Generalizable Deepfake Detection by Primary Region Regularization.
CoRR, 2023

A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023.
CoRR, 2023

Continual Learning with Strong Experience Replay.
CoRR, 2023

What Makes for Good Visual Tokenizers for Large Language Models?
CoRR, 2023

Efficient Adversarial Contrastive Learning via Robustness-Aware Coreset Selection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Sequential Action Retrieval for Generating Narratives from Long Videos.
Proceedings of the 2nd Workshop on User-centric Narrative Summarization of Long Videos, 2023

Narrative Graph for Narrative Generation from Long Videos.
Proceedings of the 2nd Workshop on User-centric Narrative Summarization of Long Videos, 2023

Combating Misinformation in the Era of Generative AI Models.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Semantic-Guided Feature Distillation for Multimodal Recommendation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Panel: Multimodal Large Foundation Models.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

NarSUM '23: The 2nd Workshop on User-Centric Narrative Summarization of Long Videos.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Deep Regression Unlearning.
Proceedings of the International Conference on Machine Learning, 2023

Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

PointListNet: Deep Learning on 3D Point Lists.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Text to Point Cloud Localization with Relation-Enhanced Transformer.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Can Bad Teaching Induce Forgetting? Unlearning in Deep Networks Using an Incompetent Teacher.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Enhanced 3D Shape Reconstruction With Knowledge Graph of Category Concept.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Toward Region-Aware Attention Learning for Scene Graph Generation.
IEEE Trans. Neural Networks Learn. Syst., 2022

Relation-Aware Compositional Zero-Shot Learning for Attribute-Object Pair Recognition.
IEEE Trans. Multim., 2022

Unsupervised Spatial-Spectral Network Learning for Hyperspectral Compressive Snapshot Reconstruction.
IEEE Trans. Geosci. Remote. Sens., 2022

Monocular Image-Based 3-D Model Retrieval: A Benchmark.
IEEE Trans. Cybern., 2022

Video Snapshot Compressive Imaging Using Residual Ensemble Network.
IEEE Trans. Circuits Syst. Video Technol., 2022

Understanding Atomic Hand-Object Interaction With Human Intention.
IEEE Trans. Circuits Syst. Video Technol., 2022

Recognition of Advertisement Emotions With Application to Computational Advertising.
IEEE Trans. Affect. Comput., 2022

Entropy guided attention network for weakly-supervised action localization.
Pattern Recognit., 2022

Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Editorial.
Multim. Syst., 2022

One-shot Video Graph Generation for Explainable Action Reasoning.
Neurocomputing, 2022

My Health Sensor, My Classifier - Adapting a Trained Classifier to Unlabeled End-User Data.
ACM Trans. Comput. Heal., 2022

Superclass-aware network for few-shot learning.
Comput. Vis. Image Underst., 2022

Adversarial Attacks and Defense for Non-Parametric Two-Sample Tests.
CoRR, 2022

Learning to Predict Gradients for Semi-Supervised Continual Learning.
CoRR, 2022

Privacy-Preserving Synthetic Data Generation for Recommendation Systems.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Panel Discussion: Emerging Topics on Video Summarization.
Proceedings of the NarSUM '22: Proceedings of the 1st Workshop on User-centric Narrative Summarization of Long Videos, 2022

Compute to Tell the Tale: Goal-Driven Narrative Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Distance Matters in Human-Object Interaction Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

NarSUM '22: 1st Workshop on User-centric Narrative Summarization of Long Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Learning Realistic Patterns from Visually Unrealistic Stimuli: Generalization and Data Anonymization (Extended Abstract).
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Adversarial Attack and Defense for Non-Parametric Two-Sample Tests.
Proceedings of the International Conference on Machine Learning, 2022

Chairs Can Be Stood On: Overcoming Object Bias in Human-Object Interaction Detection.
Proceedings of the Computer Vision, 2022

DIOT: Detecting Implicit Obstacles from Trajectories.
Proceedings of the Database Systems for Advanced Applications, 2022

Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Message from the General Chairs: IEEE BigMM 2022.
Proceedings of the Eighth IEEE International Conference on Multimedia Big Data, 2022

2021
A New Foreground-Background based Method for Behavior-Oriented Social Media Image Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2021

A Matrix Factorization Based Framework for Fusion of Physical and Social Sensors.
IEEE Trans. Multim., 2021

DeepDance: Music-to-Dance Motion Choreography With Adversarial Learning.
IEEE Trans. Multim., 2021

Adversarial Learning for Personalized Tag Recommendation.
IEEE Trans. Multim., 2021

Toward Multi-Modal Conditioned Fashion Image Translation.
IEEE Trans. Multim., 2021

Unsupervised Abstract Reasoning for Raven's Problem Matrices.
IEEE Trans. Image Process., 2021

Scene Graph Inference via Multi-Scale Context Modeling.
IEEE Trans. Circuits Syst. Video Technol., 2021

A new DCT-PCM method for license plate number detection in drone images.
Pattern Recognit. Lett., 2021

Direction Concentration Learning: Enhancing Congruency in Machine Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Learning Realistic Patterns from Visually Unrealistic Stimuli: Generalization and Data Anonymization.
J. Artif. Intell. Res., 2021

Understanding the Interaction of Adversarial Training with Noisy Labels.
CoRR, 2021

Using Under-Trained Deep Ensembles to Learn Under Extreme Label Noise: A Case Study for Sleep Apnea Detection.
IEEE Access, 2021

Unsupervised Motion Representation Learning with Capsule Autoencoders.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning to Predict Trustworthiness with Steep Slope Loss.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Motion = Video - Content: Towards Unsupervised Learning of Motion Representation from Videos.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Human Attributes Prediction under Privacy-preserving Conditions.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Effective Abstract Reasoning with Dual-Contrast Network.
Proceedings of the 9th International Conference on Learning Representations, 2021

Geometry-aware Instance-reweighted Adversarial Training.
Proceedings of the 9th International Conference on Learning Representations, 2021

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences.
Proceedings of the 9th International Conference on Learning Representations, 2021

Learning Causal Representation for Training Cross-Domain Pose Estimator via Generative Interventions.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
G-Softmax: Improving Intraclass Compactness and Interclass Separability of Features.
IEEE Trans. Neural Networks Learn. Syst., 2020

Interact as You Intend: Intention-Driven Human-Object Interaction Detection.
IEEE Trans. Multim., 2020

Video Storytelling: Textual Summaries for Events.
IEEE Trans. Multim., 2020

Unsupervised Online Video Object Segmentation With Motion Property Understanding.
IEEE Trans. Image Process., 2020

Photography and Exploration of Tourist Locations Based on Optimal Foraging Theory.
IEEE Trans. Circuits Syst. Video Technol., 2020

Egocentric Analysis of Dash-Cam Videos for Vehicle Forensics.
IEEE Trans. Circuits Syst. Video Technol., 2020

A Deeper Look at Human Visual Perception of Images.
SN Comput. Sci., 2020

Visual Social Relationship Recognition.
Int. J. Comput. Vis., 2020

Evaluating salient object detection in natural images with multiple objects having multi-level saliency.
IET Image Process., 2020

Multimedia Data Privacy Against Machines.
IEEE Multim., 2020

Using Under-trained Deep Ensembles to Learn Under Extreme Label Noise.
CoRR, 2020

Learning Realistic Patterns from Unrealistic Stimuli: Generalization and Data Anonymization.
CoRR, 2020

Gender and Emotion Recognition from Implicit User Behavior Signals.
CoRR, 2020

Robust Federated Recommendation System.
CoRR, 2020

Hierarchically Fair Federated Learning.
CoRR, 2020

Solving Raven's Progressive Matrices with Neural Networks.
CoRR, 2020

Protecting sensitive place visits in privacy-preserving trajectory publishing.
Comput. Secur., 2020

GradMix: Multi-source Transfer across Domains and Tasks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Weakly-Supervised Multi-Person Action Recognition in 360° Videos.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Locality-Sensitive Hashing Scheme based on Longest Circular Co-Substring.
Proceedings of the 2020 International Conference on Management of Data, 2020

Who You Are Decides How You Tell.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Helping Users Tackle Algorithmic Threats on Social Media: A Multimedia Research Agenda.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

The World has Changed - The World Needs to Change. What Multimedia has to Offer for Our Common Digital Future.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

MediaEval 2020: Maintaining Human-Imperceptibility of Image Adversarial Attack by Using Human-Aware Sensitivity Map.
Proceedings of the Working Notes Proceedings of the MediaEval 2020 Workshop, 2020

Attacks Which Do Not Kill Training Make Adversarial Learning Stronger.
Proceedings of the 37th International Conference on Machine Learning, 2020

Inferring DQN structure for high-dimensional continuous control.
Proceedings of the 37th International Conference on Machine Learning, 2020

Nudging Users to Slow Down the Spread of Fake News in Social Media.
Proceedings of the 2020 IEEE International Conference on Multimedia & Expo Workshops, 2020

n-Reference Transfer Learning for Saliency Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

COGAM: Measuring and Moderating Cognitive Load in Machine Learning Model Explanations.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020

2019
A Multi-sensor Framework for Personal Presentation Analytics.
ACM Trans. Multim. Comput. Commun. Appl., 2019

CloseUp - A Community-Driven Live Online Search Engine.
ACM Trans. Internet Techn., 2019

Attentive Long Short-Term Preference Modeling for Personalized Product Search.
ACM Trans. Inf. Syst., 2019

MMALFM: Explainable Recommendation by Leveraging Reviews and Images.
ACM Trans. Inf. Syst., 2019

Multi-Modal and Multi-Domain Embedding Learning for Fashion Retrieval and Analysis.
IEEE Trans. Multim., 2019

Pricing Average Price Advertising Options When Underlying Spot Market Prices Are Discontinuous.
IEEE Trans. Knowl. Data Eng., 2019

Dual-Stream Recurrent Neural Network for Video Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2019

Surface-Electromyography-Based Gesture Recognition by Multi-View Deep Learning.
IEEE Trans. Biomed. Eng., 2019

Deep Reinforcement Learning in Soft Viscoelastic Actuator of Dielectric Elastomer.
IEEE Robotics Autom. Lett., 2019

A multi-stream convolutional neural network for sEMG-based gesture recognition in muscle-computer interface.
Pattern Recognit. Lett., 2019

Music auto-tagging based on the unified latent semantic modeling.
Multim. Tools Appl., 2019

LSTM-based multi-label video event detection.
Multim. Tools Appl., 2019

CARF-Net: CNN attention and RNN fusion network for video-based person reidentification.
J. Electronic Imaging, 2019

Overview of currency recognition using deep learning.
J. Bank. Financial Technol., 2019

Pushing the Boundary of Multimedia Big Data: An Overview of IEEE MIPR.
IEEE Multim., 2019

Semantically-Regularized Logic Graph Embeddings.
CoRR, 2019

Fast Video Object Segmentation via Mask Transfer Network.
CoRR, 2019

G-softmax: Improving Intra-class Compactness and Inter-class Separability of Features.
CoRR, 2019

sEMG-Based Gesture Recognition With Embedded Virtual Hand Poses and Adversarial Learning.
IEEE Access, 2019

Quantifying and Alleviating the Language Prior Problem in Visual Question Answering.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Augmenting Physiological Time Series Data: A Case Study for Sleep Apnea Detection.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Embedding Symbolic Knowledge into Deep Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Explainable Video Action Reasoning via Prior Knowledge and State Transitions.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Unsupervised Domain Adaptation for 3D Human Pose Estimation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Human-imperceptible Privacy Protection Against Machines.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

User Diverse Preference Modeling by Multimodal Attentive Metric Learning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Self-supervised Representation Learning Using 360° Data.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

LiveSense: Contextual Advertising in Live Streaming Videos.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Towards Robust ResNet: A Small Step but a Giant Leap.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Sublinear Time Nearest Neighbor Search over Generalized Weighted Space.
Proceedings of the 36th International Conference on Machine Learning, 2019

CRNN Based Jersey-Bib Number/Text Recognition in Sports and Marathon Images.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Learning to Detect Human-Object Interactions With Knowledge.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning to Learn From Noisy Labeled Data.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Emotion-Aware Human Attention Prediction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
2D Vector Map Fragile Watermarking with Region Location.
ACM Trans. Spatial Algorithms Syst., 2018

Multimodal Multiplatform Social Media Event Summarization.
ACM Trans. Multim. Comput. Commun. Appl., 2018

A Spring-Electric Graph Model for Socialized Group Photography.
IEEE Trans. Multim., 2018

Saliency flow based video segmentation via motion guided contour refinement.
Signal Process., 2018

Robust tracking based on H-CNN with low-resource sampling and scaling by frame-wise motion localization.
Multim. Tools Appl., 2018

Early biological vision inspired system for salience computation in images.
Multidimens. Syst. Signal Process., 2018

Video Storytelling.
CoRR, 2018

A Fine-Grained Spatial-Temporal Attention Model for Video Captioning.
IEEE Access, 2018

Aspect-Aware Latent Factor Model: Rating Prediction with Ratings and Reviews.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

EPICURE - Aspect-based Multimodal Review Summarization.
Proceedings of the 10th ACM Conference on Web Science, 2018

Unsupervised Learning of View-invariant Action Representations.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Multi-modal Preference Modeling for Product Search.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

AI + Multimedia Make Better Life?
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A^3NCF: An Adaptive Aspect Attention Model for Rating Prediction.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Looking Beyond a Clever Narrative: Visual Context and Attention are Primary Drivers of Affect in Video Advertisements.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

EEG-based Evaluation of Cognitive Workload Induced by Acoustic Parameters for Data Sonification.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Emotional Attention: A Study of Image Sentiment and Visual Attention.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Trends and Trajectories for Explainable, Accountable and Intelligible Systems: An HCI Research Agenda.
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018

2017
Cyber-Physical Social Networks.
ACM Trans. Internet Techn., 2017

Benchmarking a Multimodal and Multiview and Interactive Dataset for Human Action Recognition.
IEEE Trans. Cybern., 2017

ClickSmart: A Context-Aware Viewpoint Recommendation System for Mobile Photography.
IEEE Trans. Circuits Syst. Video Technol., 2017

Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Content based authentication of visual cryptography.
Multim. Tools Appl., 2017

As-similar-as-possible saliency fusion.
Multim. Tools Appl., 2017

Online object tracking based on CNN with spatial-temporal saliency guided sampling.
Neurocomputing, 2017

Hierarchical & multimodal video captioning: Discovering and transferring multimodal knowledge for vision to language.
Comput. Vis. Image Underst., 2017

Pricing average price advertisement options when underlying spot market prices are discontinuous.
CoRR, 2017

Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Tianjin University and National University of Singapore at TRECVID 2017: Video to Text Description.
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

Exploring User-Specific Information in Music Retrieval.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Optimizing Trade-offs Among Stakeholders in Real-Time Bidding by Incorporating Multimedia Metrics.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Affect Recognition in Ads with Application to Computational Advertising.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Attention Transfer from Web Images for Video Recognition.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Understanding Fashion Trends from Street Photos via Neighbor-Constrained Embedding Learning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

The Role of Visual Attention in Sentiment Prediction.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

MM2RTB: Bringing Multimedia Metrics to Real-Time Bidding.
Proceedings of the ADKDD'17, Halifax, NS, Canada, August 13 - 17, 2017, 2017

Semi-Supervised Learning for Surface EMG-based Gesture Recognition.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Exploiting Music Play Sequence for Music Recommendation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Evaluating content-centric vs. user-centric ad affect recognition.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Multimedia signatures for vehicle forensics.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Dual-Glance Model for Deciphering Social Relationships.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Visible watermarking based on importance and just noticeable distortion of image regions.
Multim. Tools Appl., 2016

Introduction to the Issue on Person-Centered Signal Processing for Assistive, Rehabilitative, and Wearable Health Technologies.
IEEE J. Sel. Top. Signal Process., 2016

Decoupled Multicamera Sensing for Flexible View Generation.
J. Sensors, 2016

Multi-Camera Action Dataset (MCAD): A Dataset for Studying Non-overlapped Cross-Camera Action Recognition.
CoRR, 2016

ConTagNet: Exploiting User Context for Image Tag Recommendation.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Concept Based Hybrid Fusion of Multimodal Event Signals.
Proceedings of the IEEE International Symposium on Multimedia, 2016

Demo Paper: PreSense - An Assistive Presentation Self-Quantification System.
Proceedings of the IEEE International Symposium on Multimedia, 2016

Multi-stream Deep Learning Framework for Automated Presentation Assessment.
Proceedings of the IEEE International Symposium on Multimedia, 2016

Tweeting Camera: A New Paradigm of Event-based Smart Sensing Device: Demo.
Proceedings of the 10th International Conference on Distributed Smart Camera, 2016

Marker-Less 3D Human Motion Capture with Monocular Image Sequence and Height-Maps.
Proceedings of the Computer Vision - ECCV 2016, 2016

Analysis of Comparators for Binary Watermarks.
Proceedings of International Conference on Computer Vision and Image Processing, 2016

Near-Optimal Active Learning of Multi-Output Gaussian Processes.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Context-Aware Photography Learning for Smart Mobile Devices.
ACM Trans. Multim. Comput. Commun. Appl., 2015

Multi-Camera Coordination and Control in Surveillance Systems: A Survey.
ACM Trans. Multim. Comput. Commun. Appl., 2015

Competence-Based Song Recommendation: Matching Songs to One's Singing Skill.
IEEE Trans. Multim., 2015

Multi-Keyword Multi-Click Advertisement Option Contracts for Sponsored Search.
ACM Trans. Intell. Syst. Technol., 2015

Salience computation in images based on perceptual distinctness.
Signal Process. Image Commun., 2015

ICMR 2014: 4th ACM International Conference on Multimedia Retrieval.
SIGIR Forum, 2015

Currency security and forensics: a survey.
Multim. Tools Appl., 2015

A bio-inspired center-surround model for salience computation in images.
J. Vis. Commun. Image Represent., 2015

Introduction to the Issue on Signal Processing for Situational Awareness From Networked Sensors and Social Media.
IEEE J. Sel. Top. Signal Process., 2015

Group $K$-Means.
CoRR, 2015

Tweeting Cameras for Event Detection.
Proceedings of the 24th International Conference on World Wide Web, 2015

Face Search in Encrypted Domain.
Proceedings of the Image and Video Technology - 7th Pacific-Rim Symposium, 2015

Multi-sensor Self-Quantification of Presentations.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

SalAd: A Multimodal Approach for Contextual Video Advertising.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

A multimodal approach for image de-fencing and depth inpainting.
Proceedings of the Eighth International Conference on Advances in Pattern Recognition, 2015

2014
Up-Fusion: An Evolving Multimedia Fusion Method.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Online Estimation of Evolving Human Visual Interest.
ACM Trans. Multim. Comput. Commun. Appl., 2014

CAVVA: Computational Affective Video-in-Video Advertising.
IEEE Trans. Multim., 2014

Audio Matters in Visual Attention.
IEEE Trans. Circuits Syst. Video Technol., 2014

W3-privacy: understanding what, when, and where inference channels in multi-camera surveillance video.
Multim. Tools Appl., 2014

Real-life events in multimedia: detection, representation, retrieval, and applications.
Multim. Tools Appl., 2014

Guest editorial: Advances in multimedia surveillance.
Multim. Tools Appl., 2014

Active Learning Is Planning: Nonmyopic ε-Bayes-Optimal Active Learning of Gaussian Processes.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Multi-view action recognition by cross-domain learning.
Proceedings of the IEEE 16th International Workshop on Multimedia Signal Processing, 2014

View-invariant feature discovering for multi-camera human action recognition.
Proceedings of the IEEE 16th International Workshop on Multimedia Signal Processing, 2014

Context-Based Photography Learning using Crowdsourced Images and Social Media.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Song Recommendation for Social Singing Community.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Recovering Social Interaction Spatial Structure from Multiple First-Person Views.
Proceedings of the 3rd International Workshop on Socially-Aware Multimedia, 2014

Nonmyopic \(\epsilon\)-Bayes-Optimal Active Learning of Gaussian Processes.
Proceedings of the 31th International Conference on Machine Learning, 2014

Super-resolution de-fencing: Simultaneous fence removal and high-resolution image recovery using videos.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Scalable Decision-Theoretic Coordination and Control for Real-time Active Multi-Camera Surveillance.
Proceedings of the International Conference on Distributed Smart Cameras, 2014

No One is Left "Unwatched": Fairness in Observation of Crowds of Mobile Targets in Active Camera Surveillance.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Action and Interaction Recognition in First-Person Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014

Mode of teaching based segmentation and annotation of video lectures.
Proceedings of the 12th International Workshop on Content-Based Multimedia Indexing, 2014

Decision-theoretic approach to maximizing fairness in multi-target observation in multi-camera surveillance.
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2014

Discovering Person Identity via Large-Scale Observations.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013
A reward-and-punishment-based approach for concept detection using adaptive ontology rules.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Multimedia Fusion With Mean-Covariance Analysis.
IEEE Trans. Multim., 2013

Image Re-Attentionizing.
IEEE Trans. Multim., 2013

Privacy aware publication of surveillance video.
Int. J. Trust. Manag. Comput. Commun., 2013

Multi-Keyword Multi-Click Option Contracts for Sponsored Search Advertising.
CoRR, 2013

Interactive Video Advertising: A Multimodal Affective Approach.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Static saliency vs. dynamic saliency: a comparative study.
Proceedings of the ACM Multimedia Conference, 2013

An evaluation of wearable activity monitoring devices.
Proceedings of the 1st ACM international workshop on Personal data meets distributed multimedia, 2013

Temporal encoded F-formation system for social interaction detection.
Proceedings of the ACM Multimedia Conference, 2013

Hazy image enhancement based on the full-saturation assumption.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Seeing through the fence: Image de-fencing using a video sequence.
Proceedings of the IEEE International Conference on Image Processing, 2013

VIP: A Unifying Framework for Computational Eye-Gaze Research.
Proceedings of the Human Behavior Understanding - 4th International Workshop, 2013

2012
Image hatching for visual cryptography.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Aggregate licenses validation for digital rights violation detection.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Introduction to special issue on multimedia security.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Adaptive Workload Equalization in Multi-Camera Surveillance Systems.
IEEE Trans. Multim., 2012

Robust Watermarking of Compressed and Encrypted JPEG2000 Images.
IEEE Trans. Multim., 2012

Introduction to the special issue of the multimedia tools and applications journal on events in multimedia.
Multim. Tools Appl., 2012

Concept-based near-duplicate video clip detection for novelty re-ranking of web video search results.
Multim. Syst., 2012

Guest editorial: Privacy-aware multimedia surveillance systems.
Multim. Syst., 2012

Adaptive Transformation for Robust Privacy Protection in Video Surveillance.
Adv. Multim., 2012

A Multimodal Approach for Online Estimation of Subtle Facial Expression.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

A Synaesthetic Approach for Image Slideshow Generation.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Decision-theoretic coordination and control for active multi-camera surveillance in uncertain, partially observable environments.
Proceedings of the Sixth International Conference on Distributed Smart Cameras, 2012

Depth Matters: Influence of Depth Cues on Visual Saliency.
Proceedings of the Computer Vision - ECCV 2012, 2012

Decision-theoretic approach to maximizing observation of multiple targets in multi-camera surveillance.
Proceedings of the International Conference on Autonomous Agents and Multiagent Systems, 2012

2011
Probabilistic temporal multimedia data mining.
ACM Trans. Intell. Syst. Technol., 2011

Multimedia data mining: state of the art and challenges.
Multim. Tools Appl., 2011

Effective multimedia surveillance using a human-centric approach.
Multim. Tools Appl., 2011

Guest Editorial.
J. Multim., 2011

Pedestrian Tracking Based on <i>Hidden-Latent</i> Temporal Markov Chain.
Proceedings of the Advances in Multimedia Modeling, 2011

Affect-based adaptive presentation of home videos.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Up-fusion: an evolving multimedia decision fusion method.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Modeling and representing events in multimedia.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Eye-tracking methodology and applications to images and video.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Affective Video Summarization and Story Board Generation Using Pupillary Dilation and Eye Gaze.
Proceedings of the 2011 IEEE International Symposium on Multimedia, 2011

Dynamic workload assignment in video surveillance systems.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Anonymous surveillance.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Mechanism Design for Incentivizing Social Media Contributions.
Proceedings of the Social Media Modeling and Computing., 2011

2010
MultiFusion: A boosting approach for multimedia fusion.
ACM Trans. Multim. Comput. Commun. Appl., 2010

Multimodal fusion for multimedia analysis: a survey.
Multim. Syst., 2010

A Geometric Approach for Efficient Licenses Validation in DRM.
Proceedings of the Secure Data Management, 7th VLDB Workshop, SDM 2010, Singapore, 2010

Video retargeting for aesthetic enhancement.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Automated aesthetic enhancement of videos.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Portfolio theory of multimedia fusion.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Modeling, detecting, and processing events in multimedia.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

An efficient access control method for multimedia social networks.
Proceedings of second ACM SIGMM workshop on Social media, 2010

Making computers look the way we look: exploiting visual attention for image understanding.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Privacy preserving video surveillance using pedestrian tracking mechanism.
Proceedings of the 2nd ACM workshop on Multimedia in forensics, security and intelligence, 2010

EMD and psychoacoustic model based watermarking for audio.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Compressed-encrypted domain JPEG2000 image watermarking.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Privacy modeling for video data publication.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

An Eye Fixation Database for Saliency Detection in Images.
Proceedings of the Computer Vision, 2010

Efficient Aggregate Licenses Validation in DRM.
Proceedings of the Database Systems for Advanced Applications, 2010

An Authentication Mechanism Using Chinese Remainder Theorem for Efficient Surveillance Video Transmission.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

Functionality Delegation in Distributed Surveillance Systems.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

2009
Design of multimedia surveillance systems.
ACM Trans. Multim. Comput. Commun. Appl., 2009

Joint watermarking scheme for multiparty multilevel DRM architecture.
IEEE Trans. Inf. Forensics Secur., 2009

Adversary aware surveillance systems.
IEEE Trans. Inf. Forensics Secur., 2009

Robust and Reversible Numerical Set Watermarking.
Proceedings of the SIGMAP 2009, 2009

On the Security of an MPEG-Video Encryption Scheme Based on Secret Huffman Tables.
Proceedings of the Advances in Image and Video Technology, Third Pacific Rim Symposium, 2009

Secure Domain Architecture for Interoperable Content Distribution.
Proceedings of the Advances in Multimedia Information Processing, 2009

Robust Alignment of Presentation Videos with Slides.
Proceedings of the Advances in Multimedia Information Processing, 2009

Secure multimedia content delivery with multiparty multilevel DRM architecture.
Proceedings of the Network and Operating System Support for Digital Audio and Video, 2009

Motivating contributors in social media networks.
Proceedings of the first SIGMM workshop on Social media, 2009

Events in multimedia.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Automated localization of affective objects and actions in images via caption text-cum-eye gaze analysis.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Performance Modeling of Multimedia Surveillance Systems.
Proceedings of the 11th IEEE International Symposium on Multimedia, 2009

A CRT based watermark for multiparty multilevel DRM architecture.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A robust framework for aligning lecture slides with video.
Proceedings of the International Conference on Image Processing, 2009

Robust Numeric Set Watermarking: Numbers Don't Lie.
Proceedings of the e-Business and Telecommunications - 6th International Joint Conference, 2009

Analyzing Abnormal Events from Spatio-temporal Trajectories.
Proceedings of the ICDM Workshops 2009, 2009

Spatiotemporal latent semantic cues for moving people tracking.
Proceedings of the IEEE International Conference on Acoustics, 2009

Efficient license validation in MPML DRM architecture.
Proceedings of the 9th ACM Workshop on Digital Rights Management, 2009

Privacy Preserving Multiparty Multilevel DRM Architecture.
Proceedings of the 6th IEEE Consumer Communications and Networking Conference, 2009

A Flexible Surveillance System Architecture.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

Context-Based Multimedia Sensor Selection Method.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

Cross-Modal Approach for Karaoke Artifacts Correction.
Proceedings of the Handbook of Multimedia for Digital Entertainment and Arts, 2009

2008
Progressive Audio Scrambling in Compressed Domain.
IEEE Trans. Multim., 2008

Application Potential of Multimedia Information Retrieval.
Proc. IEEE, 2008

Coopetitive multi-camera surveillance using model predictive control.
Mach. Vis. Appl., 2008

A cross-modal approach for karaoke artifacts correction.
Multim. Tools Appl., 2008

Finding Interesting Images in Albums using Attention.
J. Multim., 2008

Effectiveness of Signal Segmentation for Music Content Representation.
Proceedings of the Advances in Multimedia Modeling, 2008

Multimodal observation systems.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Quality-aware GSM speech watermarking.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008

Integrated Detect-Track Framework for Multi-view Face Detection in Video.
Proceedings of the Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 2008

Pre-attentive discrimination of interestingness in images.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

08251 Summary - Contextual and Social Media Understanding and Usage.
Proceedings of the Contextual and Social Media Understanding and Usage, 15.06., 2008

08251 Abstracts Collection - Contextual and Social Media Understanding and Usage.
Proceedings of the Contextual and Social Media Understanding and Usage, 15.06., 2008

2007
Multimedia simplification for optimized MMS synthesis.
ACM Trans. Multim. Comput. Commun. Appl., 2007

Goal-oriented optimal subset selection of correlated multimedia streams.
ACM Trans. Multim. Comput. Commun. Appl., 2007

Render Sequence Encoding for Document Protection.
IEEE Trans. Multim., 2007

A scalable signature scheme for video authentication.
Multim. Tools Appl., 2007

Using Camera Settings Templates ("Scene Modes") for Image Scene Classification of Photographs Taken on Manual/Expert Settings.
Proceedings of the Advances in Multimedia Information Processing, 2007

Coopetitive Multimedia Surveillance.
Proceedings of the Advances in Multimedia Modeling, 2007

Metadata Management, Reuse, Inference and Propagation in a Collection-Oriented Metadata Framework for Digital Images.
Proceedings of the Advances in Multimedia Modeling, 2007

Confidence Building Among Correlated Streams in Multimedia Surveillance Systems.
Proceedings of the Advances in Multimedia Modeling, 2007

Identifying Source Cell Phone using Chromatic Aberration.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Towards Adversary Aware Surveillance Systems.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

A Survey on Digital Camera Image Forensic Methods.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
Automatic summarization of music videos.
ACM Trans. Multim. Comput. Commun. Appl., 2006

Metadata handling: A video perspective.
ACM Trans. Multim. Comput. Commun. Appl., 2006

Precise pitch profile feature extraction from musical audio for key detection.
IEEE Trans. Multim., 2006

Experiential Sampling on Multiple Data Streams.
IEEE Trans. Multim., 2006

Experiential Sampling in Multimedia Systems.
IEEE Trans. Multim., 2006

Mask-based fingerprinting scheme for digital video broadcasting.
Multim. Tools Appl., 2006

Information assimilation framework for event detection in multimedia surveillance systems.
Multim. Syst., 2006

Building trust in peer-to-peer systems: a review.
Int. J. Secur. Networks, 2006

Modeling Intent for Home Video Repurposing.
IEEE Multim., 2006

Music structure based vector space retrieval.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

An Anonymous Routing Protocol with The Local-repair Mechanism for Mobile Ad Hoc Networks.
Proceedings of the Third Annual IEEE Communications Society on Sensor and Ad Hoc Communications and Networks, 2006

A design methodology for selection and placement of sensors in multimedia surveillance systems.
Proceedings of the 4th ACM International Workshop on Video Surveillance and Sensor Networks, 2006

Predominant Vocal Pitch Detection in Polyphonic Music.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Hierarchical Approach for Music Chord Modeling Based on the Analysis of Tonal Characteristics.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

A Collection-Oriented Metadata Framework for Digital Images.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Experiential Sampling based Foreground/Background Segmentation for Video Surveillance.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Audio Based Event Detection for Multimedia Surveillance.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Multimedia Surveillance and Monitoring.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

2005
Automatic video logo detection and removal.
Multim. Syst., 2005

Analogies based video editing.
Multim. Syst., 2005

Progressive color visual cryptography.
J. Electronic Imaging, 2005

Reversible watermarking using a perceptual model.
J. Electronic Imaging, 2005

Efficient and robust key management for large mobile ad hoc networks.
Comput. Networks, 2005

Automatic music video summarization based on audio-visual-text analysis and alignment.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Music Key Detection for Musical Audio.
Proceedings of the 11th International Conference on Multi Media Modeling (MMM 2005), 2005

What is the state of our community?
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Timeline-based information assimilation in multimedia surveillance and monitoring systems.
Proceedings of the Third ACM International Workshop on Video Surveillance & Sensor Networks, 2005

Progressive scrambling for MP3 audio.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Providing efficient certification services against active attacks in ad hoc networks.
Proceedings of the 24th IEEE International Performance Computing and Communications Conference, 2005

Goal based optimal selection of media streams.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Automatic music summarization based on music structure analysis.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
A Hierarchical Signature Scheme for Robust Video Authentication using Secret Sharing.
Proceedings of the 10th International Multimedia Modeling Conference (MMM 2004), 2004

Content-based music structure analysis with applications to music semantics understanding.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Probability fusion for correlated multimedia streams.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Anonymous Secure Routing in Mobile Ad-Hoc Networks.
Proceedings of the 29th Annual IEEE Conference on Local Computer Networks (LCN 2004), 2004

Visual cryptography for print and scan applications.
Proceedings of the 2004 International Symposium on Circuits and Systems, 2004

Key-Based Melody Segmentation for Popular Songs.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Visual Keywords Labeling in Soccer Video.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

A method for solmization of melody.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Automatically summarize musical audio using adaptive clustering.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Video content representation on tiny devices.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Mosaic based view enlargement for moving objects in moving pictures.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Unsupervised classification of music genre using hidden Markov model.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Content based editing of semantic video metadata.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

A new approch to automatic music video summarization.
Proceedings of the 2004 International Conference on Image Processing, 2004

Goal detection in soccer video using audio/visual keywords.
Proceedings of the 2004 International Conference on Image Processing, 2004

Harmonicity and dynamics-based features for audio.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Automatic music summarization in compressed domain.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Trust Establishment in Large Scale Grid Settings.
Proceedings of the Grid and Cooperative Computing, 2004

2003
A digital rights management scheme for broadcast video.
Multim. Syst., 2003

Robust image authentication using content based compression.
Multim. Syst., 2003

Pivot Vector Space Approach for Audio-Video Mixing.
IEEE Multim., 2003

Melody Alignment and Similarity Metric for Content-Based Music Retrieval.
Proceedings of the Storage and Retrieval for Media Databases 2003, 2003

Semantic Video Annotation and Vague Query.
Proceedings of the 9th International Conference on Multi-Media Modeling, 2003

Music scale modeling for melody matching.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Experience based sampling technique for multimedia analysis.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Experiential sampling for monitoring.
Proceedings of the 2003 ACM SIGMM Workshop on Experiential Telepresence, 2003

Lossless Watermarking Considering the Human Visual System.
Proceedings of the Digital Watermarking, Second International Workshop, 2003

Motion trajectory based video authentication.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

Semantic video summarization in compressed domain MPEG video.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Scrambling of engineering drawings.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Colorizing infrared home videos.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Creating audio keywords for event detection in soccer video.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Wide baseline spectral matching.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Fractional scaling of image and video in DCT domain.
Proceedings of the 2003 International Conference on Image Processing, 2003

Automatically generating summaries for musical video.
Proceedings of the 2003 International Conference on Image Processing, 2003

A hierarchical framework for face tracking using state vector fusion for compressed video.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Harmonicity and dynamics based audio separation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Feature extraction of volume data based on multi-scale representation.
Proceedings of the 1st International Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia 2003, 2003

A 3D shape matching framework.
Proceedings of the 1st International Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia 2003, 2003

Print signatures for document authentication.
Proceedings of the 10th ACM Conference on Computer and Communications Security, 2003

Multimedia Analysis and Synthesis.
Proceedings of the AI 2003: Advances in Artificial Intelligence, 2003

2002
Detection of human faces in a compressed domain for video stratification.
Vis. Comput., 2002

Compressed-domain scrambler/descrambler for digital video.
IEEE Trans. Consumer Electron., 2002

Watermarking of Electronic Text Documents.
Electron. Commer. Res., 2002

Statistical Analysis of Musical Instruments.
Proceedings of the Advances in Multimedia Information Processing, 2002

Detection and removal of lighting & shaking artifacts in home videos.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

SmartAlbum: a multi-modal photo annotation system.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Erasing video logos based on image inpainting.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Compressed domain object tracking for automatic indexing of objects in MPEG home video.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Similarity matching of continuous melody contours for humming querying of melody databases.
Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002

2001
Pitch Tracking and Melody Slope Matching for Song Retrieval.
Proceedings of the Advances in Multimedia Information Processing, 2001

Compressed Domain Summarization of Digital Video.
Proceedings of the Advances in Multimedia Information Processing, 2001

Melody Curve Processing For Music Retrieval.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Copyright Protection For Mpeg-2 Compressed Broadcast Video.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Authentication of Volume Data Using Wavelet-Based Foveation.
Proceedings of the Eurographics Multimedia Workshop 2001, 2001

Robust Invisible Watermarking of Volume Data Using the 3D DCT.
Proceedings of the Computer Graphics International 2001 (CGI'01), 2001

2000
Perspectives on Content-Based Multimedia Systems
The Kluwer International Series on Information Retrieval 9, Kluwer, ISBN: 978-0-306-47033-2, 2000

Video Summarization Using R-Sequences.
Real Time Imaging, 2000

Video Modeling Using Strata-Based Annotation.
IEEE Multim., 2000

Temporal multiresolution analysis for video segmentation.
Proceedings of the Storage and Retrieval for Media Databases 2000, 2000

A caching and streaming framework for mulitmedia.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

A DCT Domain Visible Watermarking Technique for Images.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

1999
Color and spatial feature for content-based image retrieval.
Pattern Recognit. Lett., 1999

A dual watermarking technique for images.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Adaptive Visible Watermarking of Images.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Relevance Feedback Techniques for Image Retrieval Using Multiple Attributes.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

1998
Content-Based Image Retrieval Using a Composite Color-Shape Approach.
Inf. Process. Manag., 1998

Content Based Watermarking of Images.
Proceedings of the 6th ACM International Conference on Multimedia '98, 1998

Content-Based Representative Frame Extraction for Digital Video.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1998

1997
Benchmarking Multimedia Databases.
Multim. Tools Appl., 1997

Shape Measures for Content Based Image Retrieval: A Comparison.
Inf. Process. Manag., 1997

1996
Cluster-based color matching for image retrieval.
Pattern Recognit., 1996

1995
Adaptive marching cubes.
Vis. Comput., 1995

Color matching for image retrieval.
Pattern Recognit. Lett., 1995

Color Indexing for Efficient Image Retrieval.
Multim. Tools Appl., 1995

Area and Perimeter Computation of the Union of a Set of Iso-Rectangles in Parallel.
J. Parallel Distributed Comput., 1995

Selectively meshed surface representation.
Comput. Graph., 1995

1994
Efficient linear octree generation from voxels.
Image Vis. Comput., 1994

Handling small features in isosurface generation using Marching Cubes.
Comput. Graph., 1994

1993
An adaptive dominant point detection algorithm for digital curves.
Pattern Recognit. Lett., 1993

Multidimensional On-Line Bin-Packing: An Algorithm and its Average-Case Analysis.
Inf. Process. Lett., 1993

Volumes From Overlaying 3-D Triangulations in Parallel.
Proceedings of the Advances in Spatial Databases, 1993

Volume Modeling for Orthopedic Surgery.
Proceedings of the Graphics, Design and Visualization, Proceedings of the IFIP TC5/WG5.2/WG5.10 CSI International Conference on Computer Graphics, 1993

1992
Worst and Average Case Evaluation of Heuristics for Multi-Processor Scheduling.
Int. J. High Speed Comput., 1992

1990
Parallel object-space hidden surface removal.
Proceedings of the 17th Annual Conference on Computer Graphics and Interactive Techniques, 1990


  Loading...