Wen-Huang Cheng

Orcid: 0000-0002-4662-7875

Affiliations:
  • National Chiao Tung University, Taiwan


According to our database1, Wen-Huang Cheng authored at least 230 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Human-Object Interaction Detection: An Overview.
IEEE Consumer Electron. Mag., November, 2024

Lightweight Deep Learning for Resource-Constrained Environments: A Survey.
ACM Comput. Surv., October, 2024

Lightweight Deep Learning: An Overview.
IEEE Consumer Electron. Mag., July, 2024

Language-guided Residual Graph Attention Network and Data Augmentation for Visual Grounding.
ACM Trans. Multim. Comput. Commun. Appl., January, 2024

A DeNoising FPN With Transformer R-CNN for Tiny Object Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language.
CoRR, 2024

An Investigation of Incorporating Mamba for Speech Enhancement.
CoRR, 2024

MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection.
CoRR, 2024

Natural Light Can Also be Dangerous: Traffic Sign Misinterpretation Under Adversarial Natural Light Attacks.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

SMP Challenge Summary: Social Media Prediction Challenge.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MEGC2024: ACM Multimedia 2024 Facial Micro-Expression Grand Challenge.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ReCorD: Reasoning and Correcting Diffusion for HOI Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Learning Efficient Interaction Anchor for HOI Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Refining Valence-Arousal Estimation with Dual-Stream Label Density Smoothing.
Proceedings of the IEEE International Conference on Consumer Electronics, 2024

Representation and Boundary Enhancement for Action Segmentation Using Transformer.
Proceedings of the IEEE International Conference on Acoustics, 2024

Language-Guided Negative Sample Mining for Open-Vocabulary Object Detection.
Proceedings of the International Conference on Electronics, Information, and Communication, 2024

The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation.
Proceedings of the Computer Vision - ECCV 2024, 2024

TrajPrompt: Aligning Color Trajectory with Vision-Language Representations.
Proceedings of the Computer Vision - ECCV 2024, 2024

DQ-DETR: DETR with Dynamic Query for Tiny Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

EmoVIT: Revolutionizing Emotion Insights with Visual Instruction Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TrajFine: Predicted Trajectory Refinement for Pedestrian Trajectory Forecasting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Distraction is All You Need: Memory-Efficient Image Immunization against Diffusion-Based Image Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
A Survey of Artificial Intelligence in Fashion.
IEEE Signal Process. Mag., May, 2023

Specific Expert Learning: Enriching Ensemble Diversity via Knowledge Distillation.
IEEE Trans. Cybern., April, 2023

Editorial for pattern recognition letters special issue on face-based emotion understanding.
Pattern Recognit. Lett., April, 2023

Referring Expression Comprehension Via Enhanced Cross-modal Graph Attention Networks.
ACM Trans. Multim. Comput. Commun. Appl., 2023

An Overview of Facial Micro-Expression Analysis: Data, Methodology and Challenge.
IEEE Trans. Affect. Comput., 2023

Seeing the unseen: Wifi-based 2D human pose estimation via an evolving attentive spatial-Frequency network.
Pattern Recognit. Lett., 2023

Optimizing 3D Object Detection with Data Importance-Based Loss Reweighting.
Proceedings of the Technologies and Applications of Artificial Intelligence, 2023

MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

FME '23: 3rd Facial Micro-Expression Workshop.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MEGC2023: ACM Multimedia 2023 ME Grand Challenge.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Fast Vehicle Detection and Tracking on Fisheye Traffic Monitoring Video using Motion Trail.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

DiffAds: An Interactive Platform for Personalized Visual Advertisement Generation.
Proceedings of the IEEE International Conference on Multimedia and Expo Workshops, 2023

Most Important Person-guided Dual-branch Cross-Patch Attention for Group Affect Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Size Does Matter: Size-aware Virtual Try-on via Clothing-oriented Transformation Try-on Network.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Dynamic Feature Fusion for Visual Object Detection and Segmentation.
Proceedings of the IEEE International Conference on Consumer Electronics, 2023

Anchor-Based Detection for Natural Language Localization in Ego-Centric Videos.
Proceedings of the IEEE International Conference on Consumer Electronics, 2023

Task-Specific Pruning: Efficient Parameter Reduction in Multi-task Object Detection Models.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Learning to Prompt for Vision-Language Emotion Recognition.
Proceedings of the 11th International Conference on Affective Computing and Intelligent Interaction, ACII 2023, 2023

Zero-Shot Face-Based Voice Conversion: Bottleneck-Free Speech Disentanglement in the Real-World Scenario.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Mask or Non-Mask? Robust Face Mask Detector via Triplet-Consistency Representation Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Improving Crowd Density Estimation by Fusing Aerial Images and Radio Signals.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Template-Free Try-On Image Synthesis via Semantic-Guided Optimization.
IEEE Trans. Neural Networks Learn. Syst., 2022

Spatiotemporal Dilated Convolution With Uncertain Matching for Video-Based Crowd Estimation.
IEEE Trans. Multim., 2022

Facial Chirality: From Visual Self-Reflection to Robust Facial Feature Learning.
IEEE Trans. Multim., 2022

Correction to: HoloTube: a low-cost portable 360-degree interactive autostereoscopic display.
Multim. Tools Appl., 2022

Practical 3D human skeleton tracking based on multi-view and multi-Kinect fusion.
Multim. Syst., 2022

Code generation from a graphical user interface via attention-based encoder-decoder model.
Multim. Syst., 2022

Fashion Meets Computer Vision: A Survey.
ACM Comput. Surv., 2022

Dual-branch Cross-Patch Attention Learning for Group Affect Recognition.
CoRR, 2022

Vision Transformers: State of the Art and Research Challenges.
CoRR, 2022

Mimicking the Annotation Process for Recognizing the Micro Expressions.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MEGC2022: ACM Multimedia 2022 Micro-Expression Grand Challenge.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

FME '22: 2nd Workshop on Facial Micro-Expression: Advanced Techniques for Multi-Modal Facial Expression Analysis.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Fashion Meets Computer Vision.
Proceedings of the MCFR@MM 2022: Proceedings of the 1st Workshop on Multimedia Computing towards Fashion Recommendation, 2022

The Hierarchical Ensemble Model for Network Intrusion Detection in the Real-world Dataset.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2022

Residual Graph Attention Network and Expression-Respect Data Augmentation Aided Visual Grounding.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Fast Vehicle Detection and Tracking on Fisheye Traffic Monitoring Video Using CNN and Bounding Box Propagation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Social-SSL: Self-supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction.
Proceedings of the Computer Vision - ECCV 2022, 2022


2021
Introduction to the Special Issue on Explainable AI on Multimedia Computing.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Enabling Artistic Control Over Pattern Density and Stroke Strength.
IEEE Trans. Multim., 2021

Dress With Style: Learning Style From Joint Deep Embedding of Clothing Styles and Body Shapes.
IEEE Trans. Multim., 2021

Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media.
IEEE Trans. Cogn. Dev. Syst., 2021

ROSNet: Robust one-stage network for CT lesion detection.
Pattern Recognit. Lett., 2021

Technical Report for Valence-Arousal Estimation in ABAW2 Challenge.
CoRR, 2021

DAF: re: A Challenging, Crowd-Sourced, Large-Scale, Long-Tailed Dataset For Anime Character Recognition.
CoRR, 2021

DensER: Density-imbalance-Eased Representation for LiDAR-based Whole Scene Upsampling.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Face-based Voice Conversion: Learning the Voice behind a Face.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

FME'21: 1st Workshop on Facial Micro-Expression: Advanced Techniques for Facial Expressions Generation and Spotting.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Re-Attention Is All You Need: Memory-Efficient Scene Text Detection via Re-Attention on Uncertain Regions.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Heterogeneous Federated Learning Through Multi-Branch Network.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Facial Chirality: Using Self-Face Reflection to Learn Discriminative Features for Facial Expression Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Single Patch Based 3D High-Fidelity Mask Face Anti-Spoofing.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

FashionMirror: Co-attention Feature-remapping Virtual Try-on with Sequential Template Poses.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ZYELL-NCTU NetTraffic-1.0: A Large-Scale Dataset for Real-World Network Anomaly Detection.
Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2021

2020
Unlocking Author Power: On the Exploitation of Auxiliary Author-Retweeter Relations for Predicting Key Retweeters.
IEEE Trans. Knowl. Data Eng., 2020

LR3M: Robust Low-Light Enhancement via Low-Rank Regularized Retinex Model.
IEEE Trans. Image Process., 2020

Photobomb Defusal Expert: Automatically Remove Distracting People From Photos.
IEEE Trans. Emerg. Top. Comput. Intell., 2020

Hybrid context enriched deep learning model for fine-grained sentiment analysis in textual and visual semiotic modality social data.
Inf. Process. Manag., 2020

Urban Multimedia Computing: Emerging Methods in Multimedia Computing for Urban Data Analysis and Applications.
IEEE Multim., 2020

MER-GCN: Micro Expression Recognition Based on Relation Modeling with Graph Convolutional Network.
CoRR, 2020

AU-assisted Graph Attention Convolutional Network for Micro-Expression Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

S2SiamFC: Self-supervised Fully Convolutional Siamese Network for Visual Tracking.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Trajectory Prediction in Heterogeneous Environment via Attended Ecology Embedding.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Coping with Pandemics: Opportunities and Challenges for AI Multimedia in the "New Normal".
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Emotion Recognition from Galvanic Skin Response Signal Based on Deep Hybrid Neural Networks.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

MER-GCN: Micro-Expression Recognition Based on Relation Modeling with Graph Convolutional Networks.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

2019
Furniture style compatibility recommendation with cross-class triplet loss.
Multim. Tools Appl., 2019

SMP Challenge: An Overview of Social Media Prediction Challenge 2019.
CoRR, 2019

Reversible AMBTC-Based Data Hiding With Security Improvement by Chaotic Encryption.
IEEE Access, 2019

3D Object Completion via Class-Conditional Generative Adversarial Network.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Multiple Fisheye Camera Tracking via Real-Time Feature Clustering.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Session details: Best Paper Session.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Stop Hiding Behind Windshield: A Windshield Image Enhancer Based on a Two-way Generative Adversarial Network.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

FashionOn: Semantic-guided Image-based Virtual Try-on with Detailed Human and Clothing Information.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

SMP Challenge: An Overview of Social Media Prediction Challenge 2019.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Garment Detectives: Discovering Clothes and Its Genre in Consumer Photos.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Switch Mode Based Deep Fractional Interpolation in Video Coding.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2019

Adapting Semantic Segmentation of Urban Scenes via Mask-Aware Gated Discriminator.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Dressing for Attention: Outfit Based Fashion Popularity Prediction.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Segmenting Hepatic Lesions Using Residual Attention U-Net with an Adaptive Weighted Dice Loss.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Spatially-Aware Domain Adaptation for Semantic Segmentation of Urban Scenes.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Fit-me: Image-Based Virtual Try-on With Arbitrary Poses.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Fuzzy Personalized Scoring Model for Recommendation System.
Proceedings of the IEEE International Conference on Acoustics, 2019

BeautyGlow: On-Demand Makeup Transfer Framework With Reversible Generative Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
NHAD: Neuro-Fuzzy Based Horizontal Anomaly Detection in Online Social Networks.
IEEE Trans. Knowl. Data Eng., 2018

Background Extraction Using Random Walk Image Fusion.
IEEE Trans. Cybern., 2018

Learning and Recognition of Clothing Genres From Full-Body Images.
IEEE Trans. Cybern., 2018

Background Extraction Based on Joint Gaussian Conditional Random Fields.
IEEE Trans. Circuits Syst. Video Technol., 2018

Robust RGB-D Hand Tracking Using Deep Learning Priors.
IEEE Trans. Circuits Syst. Video Technol., 2018

A Cloud-based Intelligent Skin and Scalp Analysis System.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Vehicle Detection in Thermal Images Using Deep Neural Network.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

ZipNet: ZFNet-level Accuracy with 48× Fewer Parameters.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

What Dress Fits Me Best?: Fashion Recommendation on the Clothing Style for Personal Body Shape.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Session details: FF-4.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

AI + Multimedia Make Better Life?
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Session details: Panel-2.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Joint Enhancement and Denoising Method via Sequential Decomposition.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Exploiting Category-Specific Information for Image Popularity Prediction in Social Media.
Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

Pedestrian Detection from Lidar Data via Cooperative Deep and Hand-Crafted Features.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Enhanced Intra Prediction with Recurrent Neural Network in Video Coding.
Proceedings of the 2018 Data Compression Conference, 2018

2017
HoloTube: a low-cost portable 360-degree interactive autostereoscopic display.
Multim. Tools Appl., 2017

CrossbowCam: a handheld adjustable multi-camera system.
Multim. Tools Appl., 2017

HoloTabletop: an anamorphic illusion interactive holographic-like tabletop system.
Multim. Tools Appl., 2017

Intelligent deployment of UAVs in 5G heterogeneous communication environment for improved coverage.
J. Netw. Comput. Appl., 2017

O-Displaying: an orientation-based augmented reality display on a smart glass with a user tracking from a depth camera.
Proceedings of the SIGGRAPH Asia 2017 Posters, Bangkok, Thailand, November 27 - 30, 2017, 2017

i-Stylist: Finding the Right Dress Through Your Social Networks.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Fashion World Map: Understanding Cities Through Streetwear Fashion.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Sequential Prediction of Social Media Popularity with Deep Temporal Context Networks.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Multi-cue pedestrian detection from 3D point cloud data.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

DeepSheet: A sheet music generator based on deep learning.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Medical image denoising using sparse representations.
Proceedings of the IEEE 8th International Conference on Awareness Science and Technology, 2017

2016
Animating Still Landscape Photographs Through Cloud Motion Creation.
IEEE Trans. Multim., 2016

A comparative study of data fusion for RGB-D based visual recognition.
Pattern Recognit. Lett., 2016

SocialCRC: Enabling socially-consensual rendezvous coordination by mobile phones.
Pervasive Mob. Comput., 2016

UbiShop: Commercial item recommendation using visual part-based object representation.
Multim. Tools Appl., 2016

Photo sundial: Estimating the time of capture in consumer photos.
Neurocomputing, 2016

Sensor-Web Systems, Applications, and Services.
Int. J. Distributed Sens. Networks, 2016

A novel comparative deep learning framework for facial age estimation.
EURASIP J. Image Video Process., 2016

What Catches Your Eyes as You Move Around? On the Discovery of Interesting Regions in the Street.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Locality Constrained Sparse Representation for Cat Recognition.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Time Matters: Multi-scale Temporalization of Social Media Popularity.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

A feature fusion framework for hashing.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Machine learning-based behavior recognition system for a basketball player using multiple Kinect cameras.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

A Spatial-Pyramid Scene Categorization Algorithm based on Locality-aware Sparse Coding.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

A Framework of Enlarging Face Datasets Used for Makeup Face Analysis.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Unfolding Temporal Dynamics: Predicting Social Media Popularity Using Multi-scale Temporal Decomposition.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Gestalt Rule Feature Points.
IEEE Trans. Multim., 2015

Real-Time Human Movement Retrieval and Assessment With Kinect Sensor.
IEEE Trans. Cybern., 2015

Efficient human detection in crowded environment.
Multim. Syst., 2015

An efficient pitch-by-pitch extraction algorithm through multimodal information.
Inf. Sci., 2015

An interactive 3D social media browsing system in a tech-art gallery.
Proceedings of the SIGGRAPH Asia 2015 Posters, Kobe, Japan, November 2-6, 2015, 2015

G-spacing: a gyro sensor based relative 3D space positioning scheme.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2015

Poster: Exploring the Need for Sensor Learning and Collaboration in IoT-based Parking Systems.
Proceedings of the 13th ACM Conference on Embedded Networked Sensor Systems, 2015

VRank: Voting system on Ranking model for human age estimation.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

eMosic: Mobile Media Pushing through Social Emotion Sensing.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Supervised Multi-scale Locality Sensitive Hashing.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

An efficient algorithm for periodic halftone identification.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

A social media based real scene navigation system with a holographic projection on a HUD.
Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2015 ACM International Symposium on Wearable Computers, 2015

Workshop I: International workshop on Learning Semantics for Multimedia Big Data (LSMBD).
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

2014
Learning and Recognition of On-Premise Signs From Weakly Labeled Street View Images.
IEEE Trans. Image Process., 2014

AttachedShock: Design of a crossing-based target selection technique on augmented reality devices and its implications.
Int. J. Hum. Comput. Stud., 2014

A Robust Learning-Based Detection and Tracking Algorithm.
Proceedings of the Technologies and Applications of Artificial Intelligence, 2014

LaRED: a large RGB-D extensible hand gesture dataset.
Proceedings of the Multimedia Systems Conference 2014, 2014

Semantic Based Background Music Recommendation for Home Videos.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

Who's the Best Charades Player? Mining Iconic Movement of Semantic Concepts.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

MOSRO: Enabling Mobile Sensing for Real-Scene Objects with Grid Based Structured Output Learning.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

What are the Fashion Trends in New York?
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

A robust tracking algorithm for 3D hand gesture with rapid hand motion through deep learning.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Attaching-music: An interactive music delivery system for private listening as wherever you go.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

A real-time human identification system through multimodal information.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Intraframe Coding with Massive Dictionaries of Tree-Structured Representations.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2014

2013
FingerPad: private and subtle interaction using fingertips.
Proceedings of the 26th Annual ACM Symposium on User Interface Software and Technology, 2013

Painting photolization.
Proceedings of the SIGGRAPH Asia 2013, 2013

A mixed-reality showcase for multiple users from unconstrained viewing angles.
Proceedings of the SIGGRAPH Asia 2013, 2013

Compass fusion: high precision indoor people localization and identification.
Proceedings of the 11th Annual International Conference on Mobile Systems, 2013

Artistic eye: recognizing key viewing points of popular sites.
Proceedings of the 11th Annual International Conference on Mobile Systems, 2013

Human Action Search Based on Dynamic Shape Volumes.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Physiognomy master: a novel personality analysis system based on facial features.
Proceedings of the ACM Multimedia Conference, 2013

CINDY: A cylindrical interactive display and its applications.
Proceedings of the IEEE International Symposium on Consumer Electronics, 2013

Whac-a-mole: A head detection scheme by estimating the 3D envelope from depth image.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Demo paper: A depth-based crowded heads detection system through a freely-located camera.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Efficient human detection in crowded environment based on motion and appearance information.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Boundary Delineation of Breast Lesions in Series of 2D Sonography by Modeling the Spatial-Temporal Prior and Cell-Based MAP Approach.
Proceedings of the Seventh International Conference on Image and Graphics, 2013

Rectangling Stereographic Projection for Wide-Angle Image Visualization.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Perspective-aware warping for seamless stereoscopic image cloning.
ACM Trans. Graph., 2012

Unsupervised Semantic Feature Discovery for Image Object Retrieval and Tag Refinement.
IEEE Trans. Multim., 2012

Interactive human action search using body language.
Proceedings of the 21st Annual Wireless and Optical Communications Conference, 2012

Texturing and deforming models with casual images.
Proceedings of the SIGGRAPH Asia 2012 Poster Proceedings, Singapore, Singapore, November 28, 2012

Traveling through space-time: an interactive photo browsing system.
Proceedings of the SIGGRAPH Asia 2012 Poster Proceedings, Singapore, Singapore, November 28, 2012

Omni-Tube: a low-cost portable omnidirectional interactive 3D display.
Proceedings of the SIGGRAPH Asia 2012 Poster Proceedings, Singapore, Singapore, November 28, 2012

User-Assisted Disparity Maps.
Proceedings of the 20th Pacific Conference on Computer Graphics and Applications, 2012

U-Drumwave: An Interactive Performance System for Drumming.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

AttachedShock: facilitating moving targets acquisition on augmented reality devices using goal-crossing actions.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Actions speak louder than words: searching human action video based on body movement.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Human action recognition and retrieval using sole depth information.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Clothing genre classification by exploiting the style elements.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Action tutor: real-time exemplar-based sequential movement assessment with kinect sensor.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Who's Who in a Sports Video? An Individual Level Sports Video Indexing System.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Single image depth estimation from image descriptors.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

MobileQueue: an image-based queue card management system through augmented reality phones.
Proceedings of the 2012 ACM Conference on Ubiquitous Computing, 2012

2011
MobiUP: An Upsampling-Based System Architecture for High-Quality Video Streaming on Mobile Devices.
IEEE Trans. Multim., 2011

Interactive digital scrapbook generation for travel photos based on design principles of typography.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Dynamic social network for narrative video analysis.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Augmenting mobile city-view image retrieval with context-rich user-contributed photos.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Unsupervised auxiliary visual words discovery for large-scale image object retrieval.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
SocialCRC: a social- and context-aware rendezvous coordination system.
Proceedings of the 28th International Conference on Human Factors in Computing Systems, 2010

2009
Context-based page unit recommendation for web-based sensemaking tasks.
Proceedings of the 14th International Conference on Intelligent User Interfaces, 2009

2008
Semantic Analysis for Automatic Event Recognition and Segmentation of Wedding Ceremony Videos.
IEEE Trans. Circuits Syst. Video Technol., 2008

Context-based page unit recommendation for web-basedsensemaking tasks.
Proceedings of the 17th International Conference on World Wide Web, 2008

Photo navigator.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

2007
Video Adaptation for Small Display Based on Content Recomposition.
IEEE Trans. Circuits Syst. Video Technol., 2007

Film Narrative Exploration Through the Analysis of Aesthetic Elements.
Proceedings of the Advances in Multimedia Modeling, 2007

Semantic-event based analysis and segmentation of wedding ceremony videos.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

2006
Semantic Context Detection Using Audio Event Fusion.
EURASIP J. Adv. Signal Process., 2006

2005
A practical foveation-based rate-shaping mechanism for MPEG videos.
IEEE Trans. Circuits Syst. Video Technol., 2005

Toward semantic indexing and retrieval using hierarchical audio models.
Multim. Syst., 2005

A Visual Attention Based Region-of-Interest Determination Framework for Video Sequences.
IEICE Trans. Inf. Syst., 2005

Generative and Discriminative Modeling toward Semantic Context Detection in Audio Tracks.
Proceedings of the 11th International Conference on Multi Media Modeling (MMM 2005), 2005

Baseball event detection using game-specific feature sets and rules.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Automatic video region-of-interest determination based on user attention model.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

2004
A Unified Framework Using Spatial Color Descriptor and Motion-Based Post Refinement for Shot Boundary Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

A study of semantic context detection by using SVM and GMM approaches.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

2003
Encoding strategies for realizing MPEG-4 universal scalable video coding.
Proceedings of the Visual Communications and Image Processing 2003, 2003

Semantic context detection based on hierarchical audio models.
Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2003


  Loading...