Jiebo Luo

ORCID: 0000-0002-4516-9729

Affiliations:
  • University of Rochester, Department of Computer Science, NY, USA
  • Eastman Kodak Company, R&D Labs, Rochester, NY, USA


According to our database, Jiebo Luo authored at least 784 papers between 1993 and 2024.

Awards

ACM Fellow

ACM Fellow 2018, "For contributions to multimedia content analysis and social multimedia informatics".

IEEE Fellow

IEEE Fellow 2009, "For contributions to semantic image understanding and intelligent image processing".

Bibliography

2024
Context-Aware Proposal-Boundary Network With Structural Consistency for Audiovisual Event Localization.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

CoSeg: Cognitively Inspired Unsupervised Generic Event Segmentation.
IEEE Trans. Neural Networks Learn. Syst., September, 2024

Introduction to the Special Issue on AI-Generated Content for Multimedia.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

MtArtGPT: A Multi-Task Art Generation System With Pre-Trained Transformer.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

TLDW: Extreme Multimodal Summarization of News Videos.
IEEE Trans. Circuits Syst. Video Technol., March, 2024

Editorial: Learning With Fewer Labels in Computer Vision.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Recent advances in artificial intelligence generated content.
Frontiers Inf. Technol. Electron. Eng., January, 2024

Dynamic Pathway for Query-Aware Feature Learning in Language-Driven Action Localization.
IEEE Trans. Multim., 2024

Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA.
IEEE Trans. Multim., 2024

Commonsense Knowledge Prompting for Few-Shot Action Recognition in Videos.
IEEE Trans. Multim., 2024

VideoXum: Cross-Modal Visual and Textural Summarization of Videos.
IEEE Trans. Multim., 2024

Cross-Modality Spatial-Temporal Transformer for Video-Based Visible-Infrared Person Re-Identification.
IEEE Trans. Multim., 2024

Learning 3D Shape Latent for Point Cloud Completion.
IEEE Trans. Multim., 2024

Prototype-Augmented Self-Supervised Generative Network for Generalized Zero-Shot Learning.
IEEE Trans. Image Process., 2024

A Closer Look at the Reflection Formulation in Single Image Reflection Removal.
IEEE Trans. Image Process., 2024

Excitements and concerns in the post-ChatGPT era: Deciphering public perception of AI through social media analysis.
Telematics Informatics, 2024

Human behavior in the time of COVID-19: Learning from big data.
Frontiers Big Data, 2024

End-to-end Open-vocabulary Video Visual Relationship Detection using Multi-modal Prompting.
CoRR, 2024

Semantics Preserving Emoji Recommendation with Large Language Models.
CoRR, 2024

Learning Brain Tumor Representation in 3D High-Resolution MR Images via Interpretable State Space Models.
CoRR, 2024

X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation.
CoRR, 2024

Retrieval Augmentation via User Interest Clustering.
CoRR, 2024

Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection.
CoRR, 2024

3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities.
CoRR, 2024

Downstream-Pretext Domain Knowledge Traceback for Active Learning.
CoRR, 2024

Representation Bias in Political Sample Simulations with Large Language Models.
CoRR, 2024

ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation.
CoRR, 2024

INS-MMBench: A Comprehensive Benchmark for Evaluating LVLMs' Performance in Insurance.
CoRR, 2024

Dual Attribute-Spatial Relation Alignment for 3D Visual Grounding.
CoRR, 2024

PromptFix: You Prompt and We Fix the Photo.
CoRR, 2024

BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis.
CoRR, 2024

CT-GLIP: 3D Grounded Language-Image Pretraining with CT Scans and Radiology Reports for Full-Body Scenarios.
CoRR, 2024

V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning.
CoRR, 2024

Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration.
CoRR, 2024

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators.
CoRR, 2024

In the Eyes of the Bystander: Are the Stances on Different Conflicts Correlated?
CoRR, 2024

CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs.
CoRR, 2024

Unifying Local and Global Knowledge: Empowering Large Language Models as Political Experts with Knowledge Graphs.
Proceedings of the ACM on Web Conference 2024, 2024

LLM-Rec: Personalized Recommendation via Prompting Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

SMP Challenge Summary: Social Media Prediction Challenge.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Large Multimodal Models as Social Multimedia Analysis Engines.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

OpenLEAF: A Novel Benchmark for Open-Domain Interleaved Image-Text Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Plastic Surgery Image Classification and Generation.
Proceedings of the 7th IEEE International Conference on Multimedia Information Processing and Retrieval, 2024

Holistic Visual-Textual Sentiment Analysis with Prior Models.
Proceedings of the 7th IEEE International Conference on Multimedia Information Processing and Retrieval, 2024

Bring Metric Functions into Diffusion Models.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Human vs. LMMs: Exploring the Discrepancy in Emoji Interpretation and Usage in Digital Communication.
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media, 2024

Computational Assessment of Hyperpartisanship in News Titles.
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media, 2024

Dual Attribute-Spatial Relation Alignment for 3D Visual Grounding.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Chain-of-Thought Prompting for Demographic Inference with Large Multimodal Models.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

3D Point Cloud Pre-Training with Knowledge Distilled from 2D Images.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Mixture of Weak and Strong Experts on Graphs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Deceptive Fairness Attacks on Graphs via Meta Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

The Adversarial AI-Art: Understanding, Generation, Detection, and Benchmarking.
Proceedings of the Computer Security - ESORICS 2024, 2024

FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction.
Proceedings of the Computer Vision - ECCV 2024, 2024

DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SoMeLVLM: A Large Vision Language Model for Social Media Processing.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
LibFewShot: A Comprehensive Library for Few-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Introduction to the Special Issue on Trustworthy Multimedia Computing and Applications in Urban Scenes.
ACM Trans. Multim. Comput. Commun. Appl., November, 2023

Unsupervised anomaly detection by densely contrastive learning for time series data.
Neural Networks, November, 2023

A Multilayer Framework for Online Metric Learning.
IEEE Trans. Neural Networks Learn. Syst., October, 2023

Cascade Multi-Level Transformer Network for Surgical Workflow Analysis.
IEEE Trans. Medical Imaging, October, 2023

Few-Shot Partial Multi-View Learning.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Domain-Scalable Unpaired Image Translation via Latent Space Anchoring.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Infant death prediction using machine learning: A population-based retrospective study.
Comput. Biol. Medicine, October, 2023

Multi-modal graph contrastive encoding for neural machine translation.
Artif. Intell., October, 2023

Adaptive Siamese Tracking With a Compact Latent Network.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Bi-calibration Networks for Weakly-Supervised Video Representation Learning.
Int. J. Comput. Vis., July, 2023

Semantic and Relation Modulation for Audio-Visual Event Localization.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Defensive Few-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Open Set Domain Adaptation With Soft Unknown-Class Rejection.
IEEE Trans. Neural Networks Learn. Syst., March, 2023

Semantic Layout Manipulation With High-Resolution Sparse Attention.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Understanding Public Opinion Toward the #StopAsianHate Movement and the Relation With Racially Motivated Hate Crimes in the US.
IEEE Trans. Comput. Soc. Syst., February, 2023

Occluded Visible-Infrared Person Re-Identification.
IEEE Trans. Multim., 2023

Trip-ROMA: Self-Supervised Learning with Triplets and Random Mappings.
Trans. Mach. Learn. Res., 2023

Video Understanding with Large Language Models: A Survey.
CoRR, 2023

Part to Whole: Collaborative Prompting for Surgical Instrument Segmentation.
CoRR, 2023

GPT-4V(ision) as A Social Media Analysis Engine.
CoRR, 2023

Mixture of Weak & Strong Experts on Graphs.
CoRR, 2023

OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation.
CoRR, 2023

LLM-Rec: Personalized Recommendation via Prompting Large Language Models.
CoRR, 2023

Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA.
CoRR, 2023

Learning to Evaluate the Artness of AI-generated Images.
CoRR, 2023

Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation.
CoRR, 2023

Bias or Diversity? Unraveling Semantic Discrepancy in U.S. News Headlines.
CoRR, 2023

SegPrompt: Using Segmentation Map as a Better Prompt to Finetune Deep Models for Kidney Stone Classification.
CoRR, 2023

Is Bigger Always Better? An Empirical Study on Efficient Architectures for Style Transfer and Beyond.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Dismantling Hate: Understanding Hate Speech Trends Against NBA Athletes.
Proceedings of the Social, Cultural, and Behavioral Modeling, 2023

Wyze Rule: Federated Rule Dataset for Rule Recommendation Benchmarking.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

TopicCAT: Unsupervised Topic-Guided Co-Attention Transformer for Extreme Multimodal Summarisation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

SegPrompt: Using Segmentation Map as a Better Prompt to Finetune Deep Models for Kidney Stone Classification.
Proceedings of the Medical Imaging with Deep Learning, 2023

How Art-like are AI-generated Images? An Exploratory Study.
Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, 2023

FeDXL: Provable Federated Learning for Deep X-Risk Optimization.
Proceedings of the International Conference on Machine Learning, 2023

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Point Cloud Denoising Via Momentum Ascent in Gradient Fields.
Proceedings of the IEEE International Conference on Image Processing, 2023

Improving Video Colorization by Test-Time Tuning.
Proceedings of the IEEE International Conference on Image Processing, 2023

Applying Machine Learning to Predict Esophageal Cancer Recurrence after Esophagectomy.
Proceedings of the IEEE International Conference on Digital Health, 2023

Predicting Adverse Neonatal Outcomes for Preterm Neonates with Multi-Task Learning.
Proceedings of the IEEE International Conference on Digital Health, 2023

Remote Medication Status Prediction for Individuals with Parkinson's Disease using Time-series Data from Smartphones.
Proceedings of the IEEE International Conference on Digital Health, 2023

sDREAMER: Self-distilled Mixture-of-Modality-Experts Transformer for Automatic Sleep Staging.
Proceedings of the IEEE International Conference on Digital Health, 2023

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Grounding 3D Object Affordance from 2D Interactions in Images.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Spatial-Aware Token for Weakly Supervised Object Localization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

User-Controllable Recommendation via Counterfactual Retrospective and Prospective Explanations.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Stare at What You See: Masked Image Modeling without Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

AnchorFormer: Point Cloud Completion from Discriminative Nodes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Event-Guided Person Re-Identification via Sparse-Dense Complementary Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Meta-Causal Learning for Single Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A Fine-Grained Analysis of Public Opinion toward Chinese Technology Companies on Reddit.
Proceedings of the IEEE International Conference on Big Data, 2023

Investigating the Effectiveness of Deep Learning and CFA Interpolation Based Classifiers on Identifying AIGC.
Proceedings of the IEEE International Conference on Big Data, 2023

Understanding Divergent Framing of the Supreme Court Controversies: Social Media vs. News Outlets.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
Exploiting Informative Video Segments for Temporal Action Localization.
IEEE Trans. Multim., 2022

CAST: Learning Both Geometric and Texture Style Transfers for Effective Caricature Generation.
IEEE Trans. Image Process., 2022

Guest Editorial Introduction to the Special Section on Video and Language.
IEEE Trans. Circuits Syst. Video Technol., 2022

Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition.
IEEE Trans. Circuits Syst. Video Technol., 2022

Federated learning of molecular properties with graph neural networks in a heterogeneous setting.
Patterns, 2022

Multi-Scale 2D Temporal Adjacency Networks for Moment Localization With Natural Language.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Zero-Shot Video Object Segmentation With Co-Attention Siamese Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Distribution-Aware Margin Calibration for Semantic Segmentation in Images.
Int. J. Comput. Vis., 2022

Learning Cooperative Neural Modules for Stylized Image Captioning.
Int. J. Comput. Vis., 2022

How COVID-19 Has Changed Crowdfunding: Evidence From GoFundMe.
Frontiers Comput. Sci., 2022

3D Point Cloud Pre-training with Knowledge Distillation from 2D Images.
CoRR, 2022

Structure-Guided Image Completion with Image-level and Object-level Semantic Discriminators.
CoRR, 2022

A Unified Framework for Contrastive Learning from a Perspective of Affinity Matrix.
CoRR, 2022

Improving Visual-textual Sentiment Analysis by Fusing Expert Features.
CoRR, 2022

PromptCap: Prompt-Guided Task-Aware Image Captioning.
CoRR, 2022

FedX: Federated Learning for Compositional Pairwise Risk Optimization.
CoRR, 2022

TLDW: Extreme Multimodal Summarisation of News Videos.
CoRR, 2022

Contextual Modeling for 3D Dense Captioning on Point Clouds.
CoRR, 2022

Causal Inference via Nonlinear Variable Decorrelation for Healthcare Applications.
CoRR, 2022

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment.
CoRR, 2022

Fine-tuning Pre-trained Language Models with Noise Stability Regularization.
CoRR, 2022

CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training.
CoRR, 2022

Breast Cancer Induced Bone Osteolysis Prediction Using Temporal Variational Auto-Encoders.
CoRR, 2022

Understanding Political Polarization on Social Platforms by Jointly Modeling Users, Connections and Multi-modal Post Contents in Heterogeneous Graphs.
CoRR, 2022

"Ban the Chinese spyware!": A Fine-Grained Analysis of Public Opinion toward Chinese Technology Companies on Reddit.
CoRR, 2022

RawlsGCN: Towards Rawlsian Difference Principle on Graph Convolutional Network.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Facial Attribute Transformers for Precise and Robust Makeup Transfer.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Explainable Fairness in Recommendation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Cloud2Sketch: Augmenting Clouds with Imaginary Sketches.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Understanding Political Polarization via Jointly Modeling Users, Connections and Multimodal Contents on Heterogeneous Graphs.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Federated Medical Image Analysis with Virtual Sample Synthesis.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Look behind the Censorship: Reposting-User Characterization and Muted-Topic Restoration.
Proceedings of the Workshop Proceedings of the 16th International AAAI Conference on Web and Social Media, 2022

Taking sides: Public Opinion over the Israel-Palestine Conflict in 2021.
Proceedings of the Workshop Proceedings of the 16th International AAAI Conference on Web and Social Media, 2022

Cross-modal Contrastive Distillation for Instructional Activity Anticipation.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Learning to Aggregate and Refine Noisy Labels for Visual Sentiment Analysis.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Unsupervised Low-light Image Enhancement with Decoupled Networks.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

When Few-Shot Learning Meets Video Object Detection.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Manet: Improving Video Denoising with a Multi-Alignment Network.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Learning a Grammar Inducer from Massive Uncurated Instructional Videos.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Image Inpainting with Cascaded Modulation GAN and Object-Aware Training.
Proceedings of the Computer Vision - ECCV 2022, 2022

Stand-Alone Inter-Frame Attention in Video Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Automatic Relation-aware Graph Network Proliferation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Localized Adversarial Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Color Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Deep Federated Anomaly Detection for Multivariate Time Series Data.
Proceedings of the IEEE International Conference on Big Data, 2022

How to Prepare for the Next Pandemic - Investigation of Correlation Between Food Prices and COVID-19 From Global and Local Perspectives.
Proceedings of the IEEE International Conference on Big Data, 2022

iFiG: Individually Fair Multi-view Graph Clustering.
Proceedings of the IEEE International Conference on Big Data, 2022

Doctors vs. Nurses: Understanding the Great Divide in Vaccine Hesitancy among Healthcare Workers.
Proceedings of the IEEE International Conference on Big Data, 2022

2021
Pose Flow Learning From Person Images for Pose Guided Synthesis.
IEEE Trans. Image Process., 2021

Cross-Domain Image Captioning via Cross-Modal Retrieval and Model Adaptation.
IEEE Trans. Image Process., 2021

EnAET: A Self-Trained Framework for Semi-Supervised and Supervised Learning With Ensemble Transformations.
IEEE Trans. Image Process., 2021

Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video Retrieval.
IEEE Trans. Image Process., 2021

UniFaceGAN: A Unified Framework for Temporally Consistent Facial Video Editing.
IEEE Trans. Image Process., 2021

Joint Learning of Multiple Latent Domains and Deep Representations for Domain Adaptation.
IEEE Trans. Cybern., 2021

Grounding-Tracking-Integration.
IEEE Trans. Circuits Syst. Video Technol., 2021

Sense and Sensibility: Characterizing Social Media Users Regarding the Use of Controversial Terms for COVID-19.
IEEE Trans. Big Data, 2021

What Contributes to a Crowdfunding Campaign's Success? Evidence and Analyses from GoFundMe Data.
J. Soc. Comput., 2021

Temperature network for few-shot learning with distribution-aware large-margin metric.
Pattern Recognit., 2021

Unsupervised text-to-image synthesis.
Pattern Recognit., 2021

Novelty Detection and Online Learning for Chunk Data Streams.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Exploring Explicit Domain Supervision for Latent Space Disentanglement in Unpaired Image-to-Image Translation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Spatial-Temporal Relation Reasoning for Action Prediction in Videos.
Int. J. Comput. Vis., 2021

SpaceEdit: Learning a Unified Editing Space for Open-Domain Image Editing.
CoRR, 2021

Music Sentiment Transfer.
CoRR, 2021

CoSeg: Cognitively Inspired Unsupervised Generic Event Segmentation.
CoRR, 2021

Social Disparities in Oral Health in America amid the COVID-19 Pandemic.
CoRR, 2021

Federated Learning of Molecular Properties in a Heterogeneous Setting.
CoRR, 2021

Multi-Modulation Network for Audio-Visual Event Localization.
CoRR, 2021

Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph.
CoRR, 2021

Adaptive Recursive Circle Framework for Fine-grained Action Recognition.
CoRR, 2021

Triplet is All You Need with Random Mappings for Unsupervised Visual Representation Learning.
CoRR, 2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training.
CoRR, 2021

How COVID-19 Have Changed Crowdfunding: Evidence From GoFundMe.
CoRR, 2021

Both Rates of Fake News and Fact-based News on Twitter Negatively Correlate with the State-level COVID-19 Vaccine Uptake.
CoRR, 2021

State-level Racially Motivated Hate Crimes Contrast Public Opinion on the #StopAsianHate and #StopAAPIHate Movement.
CoRR, 2021

What Kind of Person Wins the Turing Award?
CoRR, 2021

Facial Attribute Transformers for Precise and Robust Makeup Transfer.
CoRR, 2021

Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval.
CoRR, 2021

Few-Shot Learning for Video Object Detection in a Transfer-Learning Scheme.
CoRR, 2021

Are Top School Students More Critical of Their Professors? Mining Comments on RateMyProfessor.com.
CoRR, 2021

Understanding Patterns of Users Who Repost Censored Posts on Weibo.
CoRR, 2021

From Gen Z, Millennials, to Babyboomers: Portraits of Working from Home during the COVID-19 Pandemic.
CoRR, 2021

Characterizing Discourse about COVID-19 Vaccines: A Reddit Version of the Pandemic Story.
CoRR, 2021

Enhanced aspect-based sentiment analysis models with progressive self-supervised attention learning.
Artif. Intell., 2021

What Makes a Turing Award Winner?
Proceedings of the Social, Cultural, and Behavioral Modeling, 2021

How Political is the Spread of COVID-19 in the United States? - An Analysis Using Transportation and Weather Data.
Proceedings of the Social, Cultural, and Behavioral Modeling, 2021

Fine-Grained Analysis of the Use of Neutral and Controversial Terms for COVID-19 on Social Media.
Proceedings of the Social, Cultural, and Behavioral Modeling, 2021

Multi-modal Dependency Tree for Video Captioning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Video-aided Unsupervised Grammar Induction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Noise Stability Regularization for Improving BERT Fine-tuning.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Learning Fine-Grained Motion Embedding for Landscape Animation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Latent Memory-augmented Graph Transformer for Visual Storytelling.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021


Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

SAT: 2D Semantics Assisted Training for 3D Visual Grounding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning Bias-Invariant Representation by Cross-Sample Mutual Information Minimization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Group-aware Label Transfer for Domain Adaptive Person Re-identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

TAP: Text-Aware Pre-Training for Text-VQA and Text-Caption.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Structured Multi-Level Interaction Network for Video Moment Localization via Language Query.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Personalized Fashion Recommendation from Personal Social Media Data: An Item-to-Set Metric Learning Approach.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Understanding the Hoarding Behaviors during the COVID-19 Pandemic using Large Scale Social Media Data.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Mi YouTube es Su YouTube? Analyzing the Cultures using YouTube Thumbnails of Popular Videos.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

From Static to Dynamic Prediction: Wildfire Risk Assessment Based on Multiple Environmental Factors.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

XraySyn: Realistic View Synthesis From a Single Radiograph Through CT Priors.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Spatial-temporal Causal Inference for Partial Image-to-video Adaptation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Editorial.
IEEE Trans. Multim., 2020

Coarse-to-Fine Localization of Temporal Action Proposals.
IEEE Trans. Multim., 2020

Sentiment Recognition for Short Annotated GIFs Using Visual-Textual Fusion.
IEEE Trans. Multim., 2020

ADN: Artifact Disentanglement Network for Unsupervised Metal Artifact Reduction.
IEEE Trans. Medical Imaging, 2020

Learning Rich Part Hierarchies With Progressive Attention Networks for Fine-Grained Image Recognition.
IEEE Trans. Image Process., 2020

STC-GAN: Spatio-Temporally Coupled Generative Adversarial Networks for Predictive Scene Parsing.
IEEE Trans. Image Process., 2020

Jointly Learning Commonality and Specificity Dictionaries for Person Re-Identification.
IEEE Trans. Image Process., 2020

Confidence-Guided Self Refinement for Action Prediction in Untrimmed Videos.
IEEE Trans. Image Process., 2020

Measuring Female Representation and Impact in Films over Time.
Trans. Data Sci., 2020

stagNet: An Attentive Semantic RNN for Group Activity and Individual Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., 2020

Sports Video Captioning via Attentive Motion Representation and Group Relationship Modeling.
IEEE Trans. Circuits Syst. Video Technol., 2020

Double-layer conditional random fields model for human action recognition.
Signal Process. Image Commun., 2020

Structure alignment of attributes and visual features for cross-dataset person re-identification.
Pattern Recognit., 2020

Semi-Supervised Adversarial Monocular Depth Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

CariGAN: Caricature generation through weakly paired adversarial learning.
Neural Networks, 2020

Constructing biomedical domain-specific knowledge graph with minimum supervision.
Knowl. Inf. Syst., 2020

Noise-robust image fusion with low-rank sparse decomposition guided by external patch prior.
Inf. Sci., 2020

Multi-Scale 2D Temporal Adjacent Networks for Moment Localization with Natural Language.
CoRR, 2020

Social Media Study of Public Opinions on Potential COVID-19 Vaccines: Informing Dissent, Disparities, and Dissemination.
CoRR, 2020

Slender Object Detection: Diagnoses and Improvements.
CoRR, 2020

Region Comparison Network for Interpretable Few-shot Image Classification.
CoRR, 2020

Universal Model for Multi-Domain Medical Image Retrieval.
CoRR, 2020

Task-agnostic Temporally Consistent Facial Video Editing.
CoRR, 2020

Monitoring Depression Trend on Twitter during the COVID-19 Pandemic.
CoRR, 2020

Real-time Universal Style Transfer on High-resolution Images via Zero-channel Pruning.
CoRR, 2020

Tracking Public Opinion in China through Various Stages of the COVID-19 Pandemic.
CoRR, 2020

Machine Identification of High Impact Research through Text and Image Analysis.
CoRR, 2020

Unsupervised Real-world Low-light Image Enhancement with Decoupled Networks.
CoRR, 2020

In the Eyes of the Beholder: Sentiment and Topic Analyses on Social Media Use of Neutral and Controversial Terms for COVID-19.
CoRR, 2020

Example-Guided Image Synthesis across Arbitrary Scenes using Masked Spatial-Channel Attention and Self-Supervision.
CoRR, 2020

Unsupervised Learning of Landmarks based on Inter-Intra Subject Consistencies.
CoRR, 2020

Video-based Person Re-Identification using Gated Convolutional Recurrent Neural Networks.
CoRR, 2020

Unifying Specialist Image Embedding into Universal Image Embedding.
CoRR, 2020

Anatomy-aware 3D Human Pose Estimation in Videos.
CoRR, 2020

Seeing through the smoke: a world-wide comparative study of e-cigarette flavors, brands and markets using data from Reddit and Twitter.
CoRR, 2020

#MeToo on Campus: Studying College Sexual Assault at Scale Using Data Reported on Social Media.
CoRR, 2020

Measuring Women Representation and Impact in Films over Time.
CoRR, 2020

Learning Semantic-aware Normalization for Generative Adversarial Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Structured Graph Attention Network for Vehicle Re-Identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multimodal Attention with Image Text Spatial Relationship for OCR-Based Image Captioning.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Few-Shot Ensemble Learning for Video Classification with SlowFast Memory Networks.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dynamic Context-guided Capsule Network for Multimodal Machine Translation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Image Sentiment Transfer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dual Path Interaction Network for Video Moment Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Alleviating the Incompatibility Between Cross Entropy Loss and Episode Training for Few-Shot Skin Disease Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

A Smartphone-Based System for Real-Time Early Childhood Caries Diagnosis.
Proceedings of the Medical Ultrasound, and Preterm, Perinatal and Paediatric Image Analysis, 2020

Modeling Heterogeneity in Feature Selection for MCI Classification.
Proceedings of the 17th IEEE International Symposium on Biomedical Imaging, 2020

An Iterative Multi-Source Mutual Knowledge Transfer Framework for Machine Reading Comprehension.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Asymmetric Distribution Measure for Few-shot Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Cost-Effective Adversarial Attacks against Scene Text Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Weakly Supervised Body Part Segmentation with Pose based Part Priors.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Pose-based Body Language Recognition for Emotion and Psychiatric Symptom Interpretation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

DAIL: Dataset-Aware and Invariant Learning for Face Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Arithmetic Evaluation System Based on MixNet-YOLOv3 and CRNN Neural Networks.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Unsupervised Learning of Facial Landmarks based on Inter-Intra Subject Consistencies.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Global Image Sentiment Transfer.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Learning with Unpaired Data.
Proceedings of the 19th IEEE International Conference on Machine Learning and Applications, 2020

Sed-Net: Detecting Multi-Type Edits Of Images.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Predicting Parkinson's Disease with Multimodal Irregularly Collected Longitudinal Smartphone Data.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Open-Ended Visual Question Answering by Multi-Modal Domain Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Example-Guided Image Synthesis Using Masked Spatial-Channel Attention and Self-supervision.
Proceedings of the Computer Vision - ECCV 2020, 2020

Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-identification.
Proceedings of the Computer Vision - ECCV 2020, 2020

Improving One-Stage Visual Grounding by Recursive Sub-query Construction.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Localize Actions from Moments.
Proceedings of the Computer Vision - ECCV 2020, 2020

TuiGAN: Learning Versatile Image-to-Image Translation with Two Unpaired Images.
Proceedings of the Computer Vision - ECCV 2020, 2020

Structured Landmark Detection via Topology-Adapting Deep Graph Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Adaptive Offline Quintuplet Loss for Image-Text Matching.
Proceedings of the Computer Vision - ECCV 2020, 2020

Self-Supervised Domain-Aware Generative Network for Generalized Zero-Shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

On Vocabulary Reliance in Scene Text Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning a Weakly-Supervised Video Actor-Action Segmentation Model With a Wise Selection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Fine-Grained Image-to-Image Transformation Towards Visual Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

TransMatch: A Transfer-Learning Scheme for Semi-Supervised Few-Shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Face Off: Polarized Public Opinions on Personal Face Mask Usage during the COVID-19 Pandemic.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Content-based Analysis of the Cultural Differences between TikTok and Douyin.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Stock Price Prediction Under Anomalous Circumstances.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Do Sports and Politics Mix? Cross-Analysis of Fan Bases of Major League Sports and Presidential Candidates.
Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2020

The Ivory Tower Lost: How College Students Respond Differently than the General Public to the COVID-19 Pandemic.
Proceedings of the IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2020

A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Neural Simile Recognition with Cyclic Multitask Learning and Local Attention.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Enhancing Pointer Network for Sentence Ordering with Pairwise Ordering Predictions.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Joint Commonsense and Relation Reasoning for Image and Video Captioning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Expressing Objects Just Like Words: Recurrent Visual Embedding for Image-Text Matching.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Ultrafast Photorealistic Style Transfer via Neural Architecture Search.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Action Recognition With Spatio-Temporal Visual Attention on Skeleton Image Sequences.
IEEE Trans. Circuits Syst. Video Technol., 2019

Future-Aware Knowledge Distillation for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Novel event analysis for human-machine collaborative underwater exploration.
Pattern Recognit., 2019

Holistic CNN Compression via Low-Rank Decomposition with Knowledge Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Grounding-Tracking-Integration.
CoRR, 2019

Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization.
CoRR, 2019

Example-Guided Scene Image Synthesis using Masked Spatial-Channel Attention and Patch-Based Self-Supervision.
CoRR, 2019

EnAET: Self-Trained Ensemble AutoEncoding Transformations for Semi-Supervised Learning.
CoRR, 2019

Defensive Few-shot Adversarial Learning.
CoRR, 2019

SMP Challenge: An Overview of Social Media Prediction Challenge 2019.
CoRR, 2019

Unsupervised Pose Flow Learning for Pose Guided Synthesis.
CoRR, 2019

AI for Earth: Rainforest Conservation by Acoustic Surveillance.
CoRR, 2019

Weakly Supervised Body Part Parsing with Pose based Part Priors.
CoRR, 2019

Fast Universal Style Transfer for Artistic and Photorealistic Rendering.
CoRR, 2019

StyleNAS: An Empirical Study of Neural Architecture Search to Uncover Surprisingly Fast End-to-End Universal Style Transfer Networks.
CoRR, 2019

Relational Reasoning using Prior Knowledge for Visual Captioning.
CoRR, 2019

To Return or to Explore: Modelling Human Mobility and Dynamics in Cyberspace.
Proceedings of the World Wide Web Conference, 2019

Learning Deep Bilinear Transformation for Fine-grained Image Representation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Exploiting Temporal Relationships in Video Moment Localization with Natural Language.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

SMP Challenge: An Overview of Social Media Prediction Challenge 2019.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

MSAFusionNet: Multiple Subspace Attention Based Deep Multi-modal Fusion Network.
Proceedings of the Machine Learning in Medical Imaging - 10th International Workshop, 2019

DCCL: A Benchmark for Cervical Cytology Analysis.
Proceedings of the Machine Learning in Medical Imaging - 10th International Workshop, 2019

Automatic Radiology Report Generation Based on Multi-view Image Fusion and Medical Concept Enrichment.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Artifact Disentanglement Network for Unsupervised Metal Artifact Reduction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Generative Mask Pyramid Network for CT/CBCT Metal Artifact Reduction with Joint Projection-Sinogram Correction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Patch Transformer for Multi-tagging Whole Slide Histopathology Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Graph-based Neural Sentence Ordering.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Human-Centered Emotion Recognition in Animated GIFs.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Meta-Learning Perspective for Personalized Image Aesthetics Assessment.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

A Fast and Accurate One-Stage Approach to Visual Grounding.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Joint Syntax Representation Learning and Visual Cue Translation for Video Captioning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Large-Scale Tag-Based Font Retrieval With Generative Feature Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Iterative Dual Domain Adaptation for Neural Machine Translation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-Grained Image Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations Rather Than Data.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Foreground-Aware Image Inpainting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Attentive Relational Networks for Mapping Images to Scene Graphs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Gaussian Temporal Awareness Networks for Action Localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

DuDoNet: Dual Domain Network for CT Metal Artifact Reduction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Multiview 2D/3D Rigid Registration via a Point-Of-Interest Network for Tracking and Triangulation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Revisiting Local Descriptor Based Image-To-Class Measure for Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Unsupervised Image Captioning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Spatio-Temporal Video Re-Localization by Warp LSTM.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Help Oneself in Helping the Others: the Ecology of Online Support Groups.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Uncovering download fraud activities in mobile app markets.
Proceedings of the ASONAM '19: International Conference on Advances in Social Networks Analysis and Mining, 2019

Progressive Self-Supervised Attention Learning for Aspect-Level Sentiment Analysis.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Distribution Consistency Based Covariance Metric Networks for Few-Shot Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Localizing Natural Language in Videos.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Joint Vertebrae Identification and Localization in Spinal CT Images by Combining Short- and Long-Range Contextual Information.
IEEE Trans. Medical Imaging, 2018

Learning Multi-Instance Deep Ranking and Regression Network for Visual House Appraisal.
IEEE Trans. Knowl. Data Eng., 2018

The Effect of Pets on Happiness: A Large-Scale Multi-Factor Analysis Using Social Multimedia.
ACM Trans. Intell. Syst. Technol., 2018

Online Similarity Learning for Big Data with Overfitting.
IEEE Trans. Big Data, 2018

User attribute discovery with missing labels.
Pattern Recognit., 2018

Real-Time Referring Expression Comprehension by Single-Stage Grounding Network.
CoRR, 2018

Online Deep Metric Learning.
CoRR, 2018

Sleep-deprived Fatigue Pattern Analysis using Large-Scale Selfies from Social Media.
CoRR, 2018

The Great Division.
CoRR, 2018

Image Captioning at Will: A Versatile Scheme for Effectively Injecting Sentiments into Image Descriptions.
CoRR, 2018

End-to-end Multi-Modal Multi-Task Vehicle Control for Self-Driving Cars with Visual Perception.
CoRR, 2018

Integrating Scene Text and Visual Appearance for Fine-Grained Image Classification.
IEEE Access, 2018

Analyzing and Predicting Emoji Usages in Social Media.
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

Cognitive Computing Track Chairs' Welcome & Organization.
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

Fine-grained Video Attractiveness Prediction Using Multimodal Deep Learning on a Large Real-world Dataset.
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

When E-commerce Meets Social Media: Identifying Business on WeChat Moment Using Bilateral-Attention LSTM.
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

Improving Text-Based Person Search by Spatial Matching and Adaptive Threshold.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Deep Domain Adaptation Hashing with Adversarial Learning.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Sports Video Captioning by Attentive Motion Representation based Hierarchical Recurrent Neural Networks.
Proceedings of the 1st International Workshop on Multimedia Content Analysis in Sports, 2018

Session details: Keynote 3.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Social and Political Event Analysis based on Rich Media.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

EE-USAD: ACM MM 2018 Workshop on Understanding Subjective Attributes of Data focus on Evoked Emotions.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Exploring Facial Differences in European Countries Boundary by Fine-Tuned Neural Networks.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Do They All Look the Same? Deciphering Chinese, Japanese and Koreans by Fine-Grained Deep Learning.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

More Knowledge Is Better: Cross-Modality Volume Completion and 3D+2D Segmentation for Intracardiac Echocardiography Contouring.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

Adversarial Sparse-View CBCT Artifact Reduction.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

Multi-Task Clustering with Model Relation Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Fast Factorization-free Kernel Learning for Unlabeled Chunk Data Streams.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Mining the Relationship between Emoji Usage Patterns and Personality.
Proceedings of the Twelfth International Conference on Web and Social Media, 2018

Life in the "Matrix": Human Mobility Patterns in the Cyber Space.
Proceedings of the Twelfth International Conference on Web and Social Media, 2018

Boundary-based Image Forgery Detection by Fast Shallow CNN.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

End-to-end Multi-Modal Multi-Task Vehicle Control for Self-Driving Cars with Visual Perceptions.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Action Recognition with Visual Attention on Skeleton Images.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Are French Really That Different? Recognizing Europeans from Faces Using Data-Driven Learning.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

ICPR2018 Contest on Object Detection in Aerial Images (ODAI-18).
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

stagNet: An Attentive Semantic RNN for Group Activity Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions.
Proceedings of the Computer Vision - ECCV 2018, 2018

Video Re-localization.
Proceedings of the Computer Vision - ECCV 2018, 2018

"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention.
Proceedings of the Computer Vision - ECCV 2018, 2018

End-to-End Convolutional Semantic Embeddings.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning to Generate Time-Lapse Videos Using Multi-Stage Dynamic Generative Adversarial Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

DOTA: A Large-Scale Dataset for Object Detection in Aerial Images.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

VizWiz Grand Challenge: Answering Visual Questions From Blind People.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Touch Your Heart: A Tone-aware Chatbot for Customer Care on Social Media.
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018

How to Become Instagram Famous: Post Popularity Prediction with Dual-Attention.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

A unified scheme of text localization and structured data extraction for joint OCR and data mining.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Do the Communities We Choose Shape our Political Beliefs? A Study of the Politicization of Topics in Online Social Groups.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

You Type a Few Words and We Do the Rest: Image Recommendation for Social Multimedia Posts.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

Determining Code Words in Euphemistic Hate Speech Using Word Embedding Networks.
Proceedings of the 2nd Workshop on Abusive Language Online, 2018

Face Completion with Semantic Knowledge and Collaborative Adversarial Learning.
Proceedings of the Computer Vision - ACCV 2018, 2018

A Joint Local and Global Deep Metric Learning Method for Caricature Recognition.
Proceedings of the Computer Vision - ACCV 2018, 2018

Towards Perceptual Image Dehazing by Physics-Based Disentanglement and Adversarial Training.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Unsupervised Deep Learning of Mid-Level Video Representation for Action Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Image-Based Appraisal of Real Estate Properties.
IEEE Trans. Multim., 2017

Mining Fashion Outfit Composition Using an End-to-End Deep Learning Approach on Set Data.
IEEE Trans. Multim., 2017

Human Facial Age Estimation by Cost-Sensitive Label Ranking and Trace Norm Regularization.
IEEE Trans. Multim., 2017

Tracking Illicit Drug Dealing and Abuse on Instagram Using Multimodal Analysis.
ACM Trans. Intell. Syst. Technol., 2017

Mobile Social Multimedia Analytics in the Big Data Era: An Introduction to the Special Issue.
ACM Trans. Intell. Syst. Technol., 2017

Adaptive Greedy Dictionary Selection for Web Media Summarization.
IEEE Trans. Image Process., 2017

Exploring Coherent Motion Patterns via Structured Trajectory Learning for Crowd Mood Modeling.
IEEE Trans. Circuits Syst. Video Technol., 2017

Regularized Deep Belief Network for Image Attribute Detection.
IEEE Trans. Circuits Syst. Video Technol., 2017

Tales of Two Cities: Using Social Media to Understand Idiosyncratic Lifestyles in Distinctive Metropolitan Areas.
IEEE Trans. Big Data, 2017

Multi-modal deep feature learning for RGB-D object detection.
Pattern Recognit., 2017

Editorial for special section of video analytics with deep learning.
Pattern Recognit., 2017

Guest editorial: mobile visual tagging with mobile context.
Multim. Syst., 2017

Learning hierarchical video representation for action recognition.
Int. J. Multim. Inf. Retr., 2017

Editorial for the ICMR 2016 special issue.
Int. J. Multim. Inf. Retr., 2017

Autism spectrum disorder detection from semi-structured and unstructured medical data.
EURASIP J. Bioinform. Syst. Biol., 2017

When Celebrities Endorse Politicians: Analyzing the Behavior of Celebrity Followers in the 2016 U.S. Presidential Election.
CoRR, 2017

Tactics and Tallies: A Study of the 2016 U.S. Presidential Campaign Using Twitter 'Likes'.
CoRR, 2017

Learning from Noisy Labels with Distillation.
CoRR, 2017

Rumor Detection on Twitter Pertaining to the 2016 U.S. Presidential Election.
CoRR, 2017

Spice up Your Chat: The Intentions and Sentiment Effects of Using Emoji.
CoRR, 2017

Election Bias: Comparing Polls and Twitter in the 2016 U.S. Election.
CoRR, 2017

When Fashion Meets Big Data: Discriminative Mining of Best Selling Clothing Features.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

A Selfie is Worth a Thousand Words: Mining Personal Patterns behind User Selfie-posting Behaviours.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

Predicting Multiple Risky Behaviors via Multimedia Content.
Proceedings of the Social Informatics, 2017

How Polarized Have We Become? A Multimodal Classification of Trump Followers and Clinton Followers.
Proceedings of the Social Informatics, 2017

When Follow is Just One Click Away: Understanding Twitter Follow Behavior in the 2016 U.S. Presidential Election.
Proceedings of the Social Informatics, 2017

Inferring Follower Preferences in the 2016 U.S. Presidential Primaries with Sparse Learning.
Proceedings of the Social, Cultural, and Behavioral Modeling, 2017

Gender Politics in the 2016 U.S. Presidential Election: A Computer Vision Approach.
Proceedings of the Social, Cultural, and Behavioral Modeling, 2017

Large-Scale Sleep Condition Analysis Using Selfies from Social Media.
Proceedings of the Social, Cultural, and Behavioral Modeling, 2017

Detection and Analysis of 2016 US Presidential Election Related Rumors on Twitter.
Proceedings of the Social, Cultural, and Behavioral Modeling, 2017

Social Multimedia Sentiment Analysis.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

VSCC'2017: Visual Analysis for Smart and Connected Communities.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Multimodal Fusion with Recurrent Neural Networks for Rumor Detection on Microblogs.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Mixture Factorized Ornstein-Uhlenbeck Processes for Time-Series Forecasting.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Cultural Diffusion and Trends in Facebook Photographs.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

Detecting the Hate Code on Social Media.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

A World of Difference: Divergent Word Interpretations Among People.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

Spice Up Your Chat: The Intentions and Sentiment Effects of Using Emojis.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

When saliency meets sentiment: Understanding how image content invokes emotion and sentiment.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Personalized pose estimation for body language understanding.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Reducing noisy labels in weakly labeled data for visual sentiment analysis.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Aesthetic Quality Assessment of Photos with Faces.
Proceedings of the Image and Graphics - 9th International Conference, 2017

Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning from Noisy Labels with Distillation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Deep Multimodal Representation Learning from Temporal Data.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Improving Pairwise Ranking for Multi-label Image Classification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Machine Identification of High Impact Research through Text and Image Analysis.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

One-shot learning for fine-grained relation extraction via convolutional siamese neural network.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Predicting high taxi demand regions using social media check-ins.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Sleep-deprived fatigue pattern analysis using large-scale selfies from social media.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Understanding what affects career progression using linkedin and twitter data.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

Understanding and Predicting Multiple Risky Behaviors from Social Media.
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

Visual Sentiment Analysis by Attending on Local Image Regions.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Fast Online Incremental Learning on Mixture Streaming Data.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

A Deep Multi-Task Learning Approach to Skin Lesion Classification.
Proceedings of the Workshops of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Constrained Clustering With Nonnegative Matrix Factorization.
IEEE Trans. Neural Networks Learn. Syst., 2016

Effective Active Skeleton Representation for Low Latency Human Action Recognition.
IEEE Trans. Multim., 2016

A picture tells a thousand words - About you! User interest profiling from user generated visual content.
Signal Process., 2016

Home location inference from sparse and noisy data: models and applications.
Frontiers Inf. Technol. Electron. Eng., 2016

A Deep Structured Model with Radius-Margin Bound for 3D Human Activity Recognition.
Int. J. Comput. Vis., 2016

Detecting Visually Observable Disease Symptoms from Faces.
EURASIP J. Bioinform. Syst. Biol., 2016

Understanding Illicit Drug Use Behaviors by Mining Social Media.
CoRR, 2016

When Saliency Meets Sentiment: Understanding How Image Content Invokes Emotion and Sentiment.
CoRR, 2016

Image Based Appraisal of Real Estate Properties.
CoRR, 2016

Voting with Feet: Who are Leaving Hillary Clinton and Donald Trump?
CoRR, 2016

Will Sanders Supporters Jump Ship for Trump? Fine-grained Analysis of Twitter Followers.
CoRR, 2016

Tactics and Tallies: Inferring Voter Preferences in the 2016 U.S. Presidential Primaries Using Sparse Learning.
CoRR, 2016

Gender Politics in the 2016 U.S. Presidential Election: A Computer Vision Approach.
CoRR, 2016

Pricing the Woman Card: Gender Politics between Hillary Clinton and Donald Trump.
CoRR, 2016

Image Credibility Analysis with Effective Domain Transferred Deep Networks.
CoRR, 2016

Inferring Fine-grained Details on User Activities and Home Location from Social Media: Detecting Drinking-While-Tweeting Patterns in Communities.
CoRR, 2016

Cross-modality Consistent Regression for Joint Visual-Textual Sentiment Analysis of Social Multimedia.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

Deep recursive and hierarchical conditional random fields for human action recognition.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Using Social Media to Promote STEM Education: Matching College Students with Role Models.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016

Robust Multi-view Manifold Ranking for Image Retrieval.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2016

Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Collective Sensemaking via Social Sensors: Extracting, Profiling, Analyzing, and Predicting Real-world Events.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Voting with Feet: Who are Leaving Hillary Clinton and Donald Trump.
Proceedings of the IEEE International Symposium on Multimedia, 2016

Unsupervised Alignment of Actions in Video with Text Descriptions.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Catching Fire via "Likes": Inferring Topic Preferences of Trump Followers on Twitter.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

To Follow or Not to Follow: Analyzing the Growth Patterns of the Trumpists on Twitter.
Proceedings of the News and Public Opinion, 2016

Deciphering the 2016 U.S. Presidential Campaign in the Twitter Sphere: A Comparison of the Trumpists and Clintonists.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

What the Language You Tweet Says About Your Occupation.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

Precise Localization of Homes and Activities: Detecting Drinking-While-Tweeting Patterns in Communities.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

Aligning movies with scripts by exploiting temporal ordering constraints.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Skin disease classification versus skin lesion characterization: Achieving robust diagnosis using multi-label deep neural networks.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Learning effective Gait features using LSTM.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Multi-type Co-clustering of General Heterogeneous Information Networks via Nonnegative Matrix Tri-Factorization.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

Image Captioning with Semantic Attention.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

TGIF: A New Dataset and Benchmark on Animated GIF Description.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Mining Shopping Patterns for Divergent Urban Regions by Incorporating Mobility Data.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Fine-grained mining of illicit drug use patterns using social multimedia data from Instagram.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Solving cold-start problem in large-scale recommendation engines: A deep learning approach.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

The effect of pets on happiness: A data-driven approach via large-scale social media.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Pricing the woman card: Gender politics between Hillary Clinton and Donald Trump.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Inferring restaurant styles by mining crowd sourced photos from user-review websites.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

"What makes a pro eating disorder hashtag": Using hashtags to identify pro eating disorder Tumblr posts and Twitter users.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

When do luxury cars hit the road? Findings by a big data approach.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

News Verification by Exploiting Conflicting Social Viewpoints in Microblogs.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
A Multifaceted Approach to Social Multimedia-Based Prediction of Elections.
IEEE Trans. Multim., 2015

Guest Editorial: Deep Learning for Multimedia Computing.
IEEE Trans. Multim., 2015

Randomized Spatial Context for Object Search.
IEEE Trans. Image Process., 2015

Speeded Up Low-Rank Online Metric Learning for Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2015

Weakly Semi-Supervised Deep Learning for Multi-Label Image Annotation.
IEEE Trans. Big Data, 2015

Deep sparse feature selection for computer aided endoscopy diagnosis.
Pattern Recognit., 2015

A computer vision-based approach to grade simulated cataract surgeries.
Mach. Vis. Appl., 2015

Real-time one-dimensional motion estimation and its application in computer vision.
Mach. Vis. Appl., 2015

Accurate sensing of scene geo-context via mobile visual localization.
Multim. Syst., 2015

Snap n' shop: Visual search-based mobile shopping made a breeze by machine and crowd intelligence.
Proceedings of the 9th IEEE International Conference on Semantic Computing, 2015

Discriminative Unsupervised Alignment of Natural Language Instructions with Corresponding Video Segments.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Joint Visual-Textual Sentiment Analysis with Deep Neural Networks.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Pinterest Board Recommendation for Twitter Users.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Who are the Devils Wearing Prada in New York City?
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Towards Lifestyle Understanding: Predicting Home and Vacation Locations from User's Online Photo Collections.
Proceedings of the Ninth International Conference on Web and Social Media, 2015

Using user generated online photos to estimate and monitor air pollution in major cities.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

To Love or to Loathe: How is the World Reacting to China's Rise?
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

Semantic Video Entity Linking Based on Visual Content and Metadata.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Multi-task deep visual-semantic embedding for video thumbnail selection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

America Tweets China: A fine-grained analysis of the state and individual characteristics regarding attitudes towards China.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

Monitoring adolescent alcohol use via multimodal analysis in social multimedia.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

User-curated image collections: Modeling and recommendation.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

Tackling Mental Health by Integrating Unobtrusive Multimodal Sensing.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Robust Image Sentiment Analysis Using Progressively Trained and Domain Transferred Deep Networks.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Sentiment Analysis Using Social Multimedia.
Proceedings of the Multimedia Data Mining and Analytics - Disruptive Innovation, 2015

Vision-Based Fine-Grained Location Estimation.
Proceedings of the Multimodal Location Estimation of Videos and Images, 2015

2014
Guest Editorial Special Section on Socio-Mobile Media Analysis and Retrieval.
IEEE Trans. Multim., 2014

Estimating the camera direction of a geotagged image using reference images.
Pattern Recognit., 2014

Retrieval-Based Face Annotation by Weak Label Regularized Local Coordinate Coding.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Jointly Image Topic and Emotion Detection using Multi-Modal Hierarchical Latent Dirichlet Allocation.
J. Multim. Inf. Syst., 2014

Operationally optimal vertex-based shape coding with arbitrary direction edge encoding structures.
J. Electronic Imaging, 2014

Guest Editorial: Geometry, Lighting, Motion, and Learning.
Int. J. Comput. Vis., 2014

Discriminative coupled dictionary hashing for fast cross-media retrieval.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Inferring Home Location from User's Photo Collections based on Visual Content and Mobility Patterns.
Proceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia, 2014

Cross-media hashing with kernel regression.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Personalized image recommendation for web search engine users.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Attribute prediction with long-range interactions via path coding.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

The Eyes of the Beholder: Gender Prediction Using Images Posted in Online Social Networks.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

What Makes an Open Source Code Popular on Git Hub?
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Moneyball for Academia: Toward Measuring and Maximizing Faculty Performance and Impact.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Adaptive Edge Encoding Schemes for the Rate-Distortion Optimal Polygon-Based Shape Coding.
Proceedings of the Data Compression Conference, 2014

Unsupervised Alignment of Natural Language Instructions with Video Segments.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Low-Rank Online Metric Learning.
Proceedings of the Low-Rank and Sparse Modeling for Visual Analysis, 2014

Recognizing People in Social Context.
Proceedings of the Human-Centered Social Media Analytics, 2014

2013
Robust and accurate mobile visual localization and its applications.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval.
IEEE Trans. Multim., 2013

Learning to Produce 3D Media From a Captured 2D Video.
IEEE Trans. Multim., 2013

Image Re-Attentionizing.
IEEE Trans. Multim., 2013

Reinforced Similarity Integration in Image-Rich Information Networks.
IEEE Trans. Knowl. Data Eng., 2013

Self-Supervised Online Metric Learning With Low Rank Constraint for Scene Categorization.
IEEE Trans. Image Process., 2013

Action Recognition Using Multilevel Features and Latent Structural SVM.
IEEE Trans. Circuits Syst. Video Technol., 2013

Improving Bottom-up Saliency Detection by Looking into Neighbors.
IEEE Trans. Circuits Syst. Video Technol., 2013

Self-taught dimensionality reduction on the high-dimensional small-sized data.
Pattern Recognit., 2013

Local image tagging via graph regularized joint group sparsity.
Pattern Recognit., 2013

Regularized Semi-Supervised Latent Dirichlet Allocation for visual concept learning.
Neurocomputing, 2013

Are there cultural differences in event driven information propagation over social media?
Proceedings of the 2nd international workshop on Socially-aware multimedia, 2013

The third ACM international workshop on interactive multimedia on mobile and portable devices (IMMPD'13).
Proceedings of the ACM Multimedia Conference, 2013

Vision with a billion eyes.
Proceedings of the 2nd ACM international workshop on Geotagging and its applications in multimedia, 2013

Sentribute: image sentiment analysis from a mid-level perspective.
Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining, 2013

Towards social imagematics: sentiment analysis in social multimedia.
Proceedings of the Thirteenth International Workshop on Multimedia Data Mining, 2013

Segment Based Depth Extraction Approach for Monocular Image with Linear Perspective.
Proceedings of the Intelligence Science and Big Data Engineering, 2013

A Markov logic framework for recognizing complex events from multimodal data.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

A practical method for counting arbitrary target objects in arbitrary scenes.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Is a picture worth 1000 votes? Analyzing the sentiment of election related social photos.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Task-relevant object detection and tracking.
Proceedings of the IEEE International Conference on Image Processing, 2013

Towards Understanding the Effectiveness of Election Related Images in Social Media.
Proceedings of the 13th IEEE International Conference on Data Mining Workshops, 2013

A General Framework for Recognizing Complex Events in Markov Logic.
Proceedings of the Statistical Relational Artificial Intelligence, 2013

2012
Guest editorial: content, concept and context mining in social media.
World Wide Web, 2012

Photo Stream Alignment and Summarization for Collaborative Photo Collection and Sharing.
IEEE Trans. Multim., 2012

Understanding Kin Relationships in a Photo.
IEEE Trans. Multim., 2012

Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection.
IEEE Trans. Multim., 2012

Tag-Based Image Retrieval Improved by Augmented Features and Group-Based Refinement.
IEEE Trans. Multim., 2012

Probabilistic Exposure Fusion.
IEEE Trans. Image Process., 2012

A Multimedia Retrieval Framework Based on Semi-Supervised Ranking and Relevance Feedback.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Visual Event Recognition in Videos by Learning from Web Data.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Inferring photographic location using geotagged web images.
Multim. Tools Appl., 2012

Toward Assessing and Improving the Quality of Stereo Images.
IEEE J. Sel. Top. Signal Process., 2012

RankCompete: Simultaneous ranking and clustering of information networks.
Neurocomputing, 2012

Geo-location inference on news articles via multimodal pLSA.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Finding perfect rendezvous on the go: accurate mobile visual localization and its applications to routing.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Correlated attribute transfer with multi-task graph-guided fusion.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Simultaneous Image Annotation and Geo-Tag Prediction via Correlation Guided Multi-task Learning.
Proceedings of the 2012 IEEE International Symposium on Multimedia, 2012

AMIGO: accurate mobile image geotagging.
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service, 2012

Online Web-Data-Driven Segmentation of Selected Moving Objects in Videos.
Proceedings of the Computer Vision, 2012

2011
Interactive Co-segmentation of Objects in Image Collections.
Springer Briefs in Computer Science, Springer, ISBN: 978-1-4614-1915-0, 2011

A Distortion-Sensitive Seam Carving Algorithm for Content-Aware Image Resizing.
J. Signal Process. Syst., 2011

Introduction to ACM multimedia 2010 best paper candidates.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Introduction to special issue on social media.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Event-Based Semantic Image Adaptation for User-Centric Mobile Display Devices.
IEEE Trans. Multim., 2011

Collection-based sparse label propagation and its application on social group suggestion from photos.
ACM Trans. Intell. Syst. Technol., 2011

Aesthetics and Emotions in Images.
IEEE Signal Process. Mag., 2011

SocialSpamGuard: A Data Mining-Based Spam Detection System for Social Media Networks.
Proc. VLDB Endow., 2011

Textual Query of Personal Photos Facilitated by Large-Scale Web Data.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Geotagging in multimedia and computer vision - a survey.
Multim. Tools Appl., 2011

Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance.
Int. J. Comput. Vis., 2011

Efficient manifold ranking for image retrieval.
Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Diversified Trajectory Pattern Ranking in Geo-tagged Social Media.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Regularized Semi-supervised Latent Dirichlet Allocation for Visual Concept Learning.
Proceedings of the Advances in Multimedia Modeling, 2011

Extracting key frames from consumer videos using bi-layer group sparsity.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Reliving on demand: a total viewer experience.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Dynamic media show drivable by semantics.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Finding geographically representative music via social media.
Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategies, Scottsdale, AZ, USA, November 28, 2011

ACM international workshop on interactive multimedia on mobile and portable devices (IMMPD'11).
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

WSM2011: third ACM workshop on social media.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

LikeMiner: a system for mining the power of 'like' in social media networks.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Action recognition using context and appearance distribution features.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Noise resistant graph ranking for improved web image search.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Using Geotags to Derive Rich Tag-Clouds for Image Annotation.
Proceedings of the Social Media Modeling and Computing, 2011

2010
Mining Compositional Features From GPS and Visual Cues for Event Recognition in Photo Collections.
IEEE Trans. Multim., 2010

User-Friendly Interactive Image Segmentation Through Unified Combinatorial User Inputs.
IEEE Trans. Image Process., 2010

Recognizing Cartoon Image Gestures for Retrieval and Interactive Cartoon Clip Synthesis.
IEEE Trans. Circuits Syst. Video Technol., 2010

Knowledge Discovery from Community-Contributed Multimedia.
IEEE Multim., 2010

Fast and Robust Methods for Multiple-View Vision.
EURASIP J. Image Video Process., 2010

Social group suggestion from user image collections.
Proceedings of the 19th International Conference on World Wide Web, 2010

iRIN: image retrieval in image-rich information networks.
Proceedings of the 19th International Conference on World Wide Web, 2010

RankCompete: simultaneous ranking and clustering of web photos.
Proceedings of the 19th International Conference on World Wide Web, 2010

Understanding multimedia content using web scale social media data.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Beyond GPS: determining the camera viewing direction of a geotagged image.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

The wisdom of social multimedia: using Flickr for prediction and forecast.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

WSM'10: 2nd ACM workshop on social media.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Semantic adaptation of consumer photo for mobile device access.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010

Suggesting Songs for Media Creation Using Semantics.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

User guided semantic image adaptation for mobile display devices.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Exploring user image tags for geo-location inference.
Proceedings of the IEEE International Conference on Acoustics, 2010

A worldwide tourism recommendation system based on geotagged web photos.
Proceedings of the IEEE International Conference on Acoustics, 2010

Seeing People in Social Context: Recognizing People and Social Relationships.
Proceedings of the Computer Vision - ECCV 2010, 2010

Nonparametric Label-to-Region by search.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Tag-based web photo retrieval improved by batch mode re-tagging.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

iCoseg: Interactive co-segmentation with intelligent scribble guidance.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Visual cube and on-line analytical processing of images.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
Integration of Context and Content for Multimedia Management: An Introduction to the Special Issue.
IEEE Trans. Multim., 2009

Image Annotation Within the Context of Personal Photo Collections Using Hierarchical Event and Scene Models.
IEEE Trans. Multim., 2009

Towards Extracting Semantically Meaningful Key Frames From Personal Video Clips: From Humans to Computers.
IEEE Trans. Circuits Syst. Video Technol., 2009

Guest Editors' Introduction to the Special Section on Probabilistic Graphical Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Search strategies for shape regularized active contour.
Comput. Vis. Image Underst., 2009

Ranking with local regression and global alignment for cross media retrieval.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

1st ACM international workshop on interactive multimedia for consumer electronics (IMCE'09).
Proceedings of the 17th International Conference on Multimedia 2009, 2009

T-IRS: textual query based image retrieval system for consumer photos.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Using large-scale web data to facilitate textual query based retrieval of consumer photos.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Event recognition from photo collections via PageRank.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Mobile media search: has media search finally found its perfect platform? part II.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Enhancing semantic and geographic annotation of web images via logistic canonical correlation regression.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

First ACM SIGMM international workshop on social media (WSM'09).
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Connecting people in photo-sharing sites by photo content and user annotations.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Seed Image Selection in interactive cosegmentation.
Proceedings of the International Conference on Image Processing, 2009

Mining Personal Image Collection for Social Group Suggestion.
Proceedings of the ICDM Workshops 2009, 2009

Heterogeneous feature machines for visual recognition.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Action recognition in unconstrained amateur videos.
Proceedings of the IEEE International Conference on Acoustics, 2009

Mobile media search.
Proceedings of the IEEE International Conference on Acoustics, 2009

Recognizing realistic actions from videos.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Geo-location inference from image content and user tags.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

2008
Face Recognition Using Spatially Constrained Earth Mover's Distance.
IEEE Trans. Image Process., 2008

A Subspace Model-Based Approach to Face Relighting Under Unknown Lighting and Poses.
IEEE Trans. Image Process., 2008

Mining Recurring Events Through Forest Growing.
IEEE Trans. Circuits Syst. Video Technol., 2008

An Introduction to the Special Issue on Event Analysis in Videos.
IEEE Trans. Circuits Syst. Video Technol., 2008

A Hierarchical Compositional Model for Face Representation and Sketching.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Real-World Image Annotation and Retrieval: An Introduction to the Special Section.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

Event recognition: viewing the world with a third eye.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Image annotation using personal calendars as context.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Annotating photo collections by label propagation according to multiple similarity cues.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Mining GPS traces and visual words for event classification.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Recognizing picture-taking environment from satellite images: A feasibility study.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Mining compositional features for boosting.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Discovery of social relationships in consumer photo collections using Markov Logic.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

Selective hidden random fields: Exploiting domain-specific saliency for event classification.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Annotating collections of photos using hierarchical event and scene models.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Utilizing semantic word similarity measures for video retrieval.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Leveraging probabilistic season and location context models for scene understanding.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Inferring generic activities and events from image content and bags of geo-tags.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

2007
Scene Parsing Using Region-Based Generative Models.
IEEE Trans. Multim., 2007

Key frame extraction from unstructured consumer video clips.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Highly automated image recomposition: the picture you wish you had taken.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Automatic target segmentation in color dental images.
Proceedings of the Visual Communications and Image Processing 2007, 2007

Kodak's consumer video benchmark data set: concept definition and annotation.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

New challenges in multimedia research for the increasingly connected and fast growing digital society.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Large-scale multimodal semantic concept detection for consumer video.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Subject Content-Based Intelligent Cropping of Digital Photos.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

First- and third-party ground truth for key frame extraction from consumer video clips.
Proceedings of the Human Vision and Electronic Imaging XII, San Jose, CA, USA, January 29, 2007

Accurate Dynamic Sketching of Faces from Video.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Robust online orientation correction for radiographs in PACS environments.
IEEE Trans. Medical Imaging, 2006

Color object detection using spatial-color joint probability functions.
IEEE Trans. Image Process., 2006

Pictures are not taken in a vacuum - an overview of exploiting context for semantic scene content understanding.
IEEE Signal Process. Mag., 2006

Data Mining: Multimedia, Soft Computing, and Bioinformatics.
J. Electronic Imaging, 2006

Dictionary of Computer Vision and Image Processing.
J. Electronic Imaging, 2006

Face Recognition by Expression-Driven Sketch Graph Matching.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Using Semantic Features for Scene Classification: How Good Do They Need to Be?
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Body Localization in Still Images Using Hierarchical Models and Hybrid Search.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Generalized Multiclass AdaBoost and Its Applications to Multimedia Classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

Factor Graphs for Region-based Whole-scene Classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

2005
Image transform bootstrapping and its applications to semantic scene classification.
IEEE Trans. Syst. Man Cybern. Part B, 2005

A Bayesian network-based framework for semantic image understanding.
Pattern Recognit., 2005

Special Issue on Image Understanding for Digital Photographs.
Pattern Recognit., 2005

Natural scene classification using overcomplete ICA.
Pattern Recognit., 2005

Beyond pixels: Exploiting camera metadata for photo classification.
Pattern Recognit., 2005

Automatic Image Orientation Detection via Confidence-Based Integration of Low-Level and Semantic Cues.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

A generalized temporal context model for classifying image collections.
Multim. Syst., 2005

Shape regularized active contour based on dynamic programming for anatomical structure segmentation.
Proceedings of the Medical Imaging 2005: Image Processing, 2005

Inducing node specification in active shape models for accurate lung-field segmentation.
Proceedings of the Medical Imaging 2005: Image Processing, 2005

Improved semantic region labeling based on scene context.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Overcomplete ICA-based Manmade Scene Classification.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Shape Regularized Active Contour Using Iterative Global Search and Local Optimization.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Improved scene classification using efficient low-level features and semantic cues.
Pattern Recognit., 2004

Learning multi-label scene classification.
Pattern Recognit., 2004

A computational approach to determination of main subject regions in photographic images.
Image Vis. Comput., 2004

Media Reviews.
IEEE Multim., 2004

Multilabel machine learning and its application to semantic scene classification.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

Using image-transform-based bootstrapping to improve scene classification.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

Photo Classification by Integrating Image Content and Camera Metadata.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Incorporating Temporal Context with Content for Classifying Image Collections.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Improved blue sky detection using polynomial model fit.
Proceedings of the 2004 International Conference on Image Processing, 2004

A Probabilistic Approach to Image Orientation Detection via Confidence-Based Integration of Low-Level and Semantic Cues.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

Robust Color Object Detection Using Spatial-Color Joint Probability Functions.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

A Generalized Temporal Context Model for Semantic Scene Classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2004

Bayesian Fusion of Camera Metadata Cues in Semantic Scene Classification.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004

2003
Perceptual grouping of segmented regions in color images.
Pattern Recognit., 2003

Modeling of subband coefficients for clustering-based adaptive quantization with spatial constraints.
J. Vis. Commun. Image Represent., 2003

Spatially Adaptive Rendering of Images for Display on Mobile Devices.
Proceedings of the PICS 2003: The PICS Conference, 2003

Efficient Mobile Imaging Using Emphasis Image Selection.
Proceedings of the PICS 2003: The PICS Conference, 2003

Natural object detection in outdoor scenes based on probabilistic spatial context models.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Sunset scene classification using simulated image recomposition.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Psychophysical study of image orientation perception.
Proceedings of the Human Vision and Electronic Imaging VIII, 2003

Probabilistic Spatial Context Models for Scene Content Understanding.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Novel color palettization scheme for preserving important colors.
Proceedings of the Color Imaging VIII: Processing, Hardcopy, and Applications, Santa Clara, 2003

2002
Displaying images on mobile devices: capabilities, issues, and solutions.
Wirel. Commun. Mob. Comput., 2002

Special issue: Multimedia over mobile IP.
Wirel. Commun. Mob. Comput., 2002

A physical model-based approach to detecting sky in photographic images.
IEEE Trans. Image Process., 2002

Normalized Kemeny and Snell Distance: A Novel Metric for Quantitative Evaluation of Rank-Order Similarity of Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2002

Multiresolution Block Sampling-Based Method for Texture Synthesis.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Clothed People Detection in Still Images.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

A Computationally Efficient Approach to Indoor/Outdoor Scene Classification.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

A Physics-Motivated Approach to Detecting Sky in Photographs.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Displaying images on mobile devices: capabilities, issues, and solutions.
Proceedings of the 2002 International Conference on Image Processing, 2002

A triage method of determining the extent of JPEG compression artifacts.
Proceedings of the 2002 International Conference on Image Processing, 2002

Non-purposive perceptual region grouping.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
On measuring low-level self and relative saliency in photographic images.
Pattern Recognit. Lett., 2001

Self-supervised texture segmentation using complementary types of features.
Pattern Recognit., 2001

Synthesis of directional texture based on multiresolution block sampling and constrained block movement.
Proceedings of the 2001 International Conference on Image Processing, 2001

Indoor vs outdoor classification of consumer photographs using low-level and semantic features.
Proceedings of the 2001 International Conference on Image Processing, 2001

Performance-scalable computational approach to main-subject detection in photographs.
Proceedings of the Human Vision and Electronic Imaging VI, 2001

2000
On the Application of Bayes Networks to Semantic Understanding of Consumer Photographs.
Proceedings of the 2000 International Conference on Image Processing, 2000

Two-Stage Texture Segmentation Using Complementary Features.
Proceedings of the 2000 International Conference on Image Processing, 2000

Quantitative Evaluation of Rank-Order Similarity of Images.
Proceedings of the 2000 International Conference on Image Processing, 2000

Ground truth for training and evaluation of automatic main subject detection.
Proceedings of the Human Vision and Electronic Imaging V, 2000

On Measuring Low-Level Saliency in Photographic Images.
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000

1999
Automatic Detection of Radiation Fields in Digital Radiographic Images.
Int. J. Pattern Recognit. Artif. Intell., 1999

Method for recognizing multiple radiation fields in computed radiography.
Proceedings of the Medical Imaging 1999: Image Processing, 1999

1998
A robust technique for image descreening based on the wavelet transform.
IEEE Trans. Signal Process., 1998

Image segmentation via adaptive K-mean clustering and knowledge-based morphological operations with biomedical applications.
IEEE Trans. Image Process., 1998

Coherently three-dimensional wavelet-based approach to volumetric image compression.
J. Electronic Imaging, 1998

Incorporation of Derivative Priors in Adaptive Bayesian Color Image Segmentation.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

1997
Joint Scene and Signal Modeling for Wavelet-Based Video Coding with Cellular Neural Network Architecture.
J. VLSI Signal Process., 1997

A scene adaptive and signal adaptive quantization for subband image and video compression using wavelets.
IEEE Trans. Circuits Syst. Video Technol., 1997

Ultrasound Image Compression Based on Subband Decomposition and Speckle Synthesis.
Proceedings of the 1997 International Conference on Image Processing, 1997

Towards physics-based segmentation of photographic color images.
Proceedings of the 1997 International Conference on Image Processing, 1997

Universal descreening technique via wavelet analysis.
Proceedings of the Color Imaging: Device-Independent Color, 1997

1996
Artifact reduction in low bit rate DCT-based image compression.
IEEE Trans. Image Process., 1996

Face location in wavelet-based video compression for high perceptual quality videoconferencing.
IEEE Trans. Circuits Syst. Video Technol., 1996

A cellular neural network for clustering-based adaptive quantization in subband video compression.
IEEE Trans. Circuits Syst. Video Technol., 1996

1995
On the application of Gibbs random field in image processing: from segmentation to enhancement.
J. Electronic Imaging, 1995

Adaptive quantization with spatial constraints in subband video compression using wavelets.
Proceedings of the 1995 International Conference on Image Processing, 1995

1994
Three Dimensional Subband Video Analysis and Synthesis with Adaptive Clustering in High Frequency Subbands.
Proceedings of the 1994 International Conference on Image Processing, 1994

A Knowledge-Based Approach to Volumetric Medical Image Segmentation.
Proceedings of the 1994 International Conference on Image Processing, 1994

A new method for block effect removal in low bit-rate image compression.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Left ventricle global motion and shape from CT volumetric data.
Proceedings of the IEEE International Conference on Acoustics, 1993

