Wei-Ta Chu

Orcid: 0000-0001-5722-7239

According to our database1, Wei-Ta Chu authored at least 128 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Frequency disentangled residual network.
Multim. Syst., February, 2024

Overall positive prototype for few-shot open-set recognition.
Pattern Recognit., 2024

Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval.
CoRR, 2024

Unsupervised Anomaly Detection on Histopathology Images Using Adversarial Learning and Simulated Anomaly.
Proceedings of the Medical Image Understanding and Analysis - 28th Annual Conference, 2024

Multiple Player Tracking With 3D Projection and Spatio-Temporal Information In Multi-View Sports Videos.
Proceedings of the IEEE International Conference on Acoustics, 2024

Chart Question Answering based on Modality Conversion and Large Language Models.
Proceedings of the 1st ACM Workshop on AI-Powered Q&A Systems for Multimedia, 2024

2023
SSSD: Self-Supervised Self Distillation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Weakly-Supervised Deep Image Hashing based on Cross-Modal Transformer.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Manga Text Detection with Manga-Specific Data Augmentation and Its Applications on Emotion Analysis.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

The NCKU-VTF Dataset and a Multi-scale Thermal-to-Visible Face Synthesis System.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

Occlusion­-Aware Manga Character Re­-identification with Self-Paced Contrastive Learning.
Proceedings of the ACM Multimedia Asia 2023, 2023

A Trajectory-based Statistics and Tactics Analysis System for Table Tennis.
Proceedings of the ACM Multimedia Asia 2023, 2023

2022
Instant Basketball Defensive Trajectory Generation.
ACM Trans. Intell. Syst. Technol., 2022

Enhancing Fan Engagement in a 5G Stadium With AI-Based Technologies and Live Streaming.
IEEE Syst. J., 2022

An imitation learning framework for generating multi-modal trajectories from unstructured demonstrations.
Neurocomputing, 2022

Indie Games Popularity Prediction by Considering Multimodal Features.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Multimodal Fusion with Cross-Modal Attention for Action Recognition in Still Images.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

Vision Transformer Hashing for Image Retrieval.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

2021
How it Flies and Why it Flies? Volleyball Trajectory Segmentation and Classification.
IEEE Trans. Circuits Syst. II Express Briefs, 2021

Multi-label image recognition by using semantics consistency, object correlation, and multiple samples.
J. Vis. Commun. Image Represent., 2021

A Real-Time Sculpting and Terrain Generation System for Interactive Content Creation.
IEEE Access, 2021

Thermal Face Recognition Based on Multi-scale Image Synthesis.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Automatic Baseball Pitch Overlay.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

PONAS: Progressive One-shot Neural Architecture Search for Very Efficient Deployment.
Proceedings of the International Joint Conference on Neural Networks, 2021

Searching by Generating: Flexible and Efficient One-Shot NAS With Architecture Generator.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Multi-Class Novelty Detection with Generated Hard Novel Features.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Semi-Supervised 3D Human Pose Estimation by Jointly Considering Temporal and Multiview Information.
IEEE Access, 2020

Thermal Face Recognition Based on Transformation by Residual U-Net and Pixel Shuffle Upsampling.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

An autoregressive generation model for producing instant basketball defensive trajectory.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

BatikGAN: A Generative Adversarial Network for Batik Creation.
Proceedings of the 2020 Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia, 2020

MMArt-ACM'20: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2020.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Session details: Attractiveness Computing in Multimedia.
Proceedings of the 2020 Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia, 2020

A Study of Self Distillation for Mango Image Classification.
Proceedings of the International Computer Symposium, 2020

2019
Manga face detection based on deep neural networks fusing global and local information.
Pattern Recognit., 2019

Spatiotemporal Modeling and Label Distribution Learning for Video Summarization.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019

Thermal Facial Landmark Detection by Deep Multi-Task Learning.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019

Photo Filter Classification and Filter Recommendation without Much Manual Labeling.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019

A Genetic Programming Approach to Integrate Multilayer CNN Features for Image Classification.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

3D Foot Model Construction from Photos, Model Segmentation, and Model Alignment.
Proceedings of the IEEE 8th Global Conference on Consumer Electronics, 2019

2018
Image Style Classification Based on Learnt Deep Correlation Features.
IEEE Trans. Multim., 2018

Visual Weather Temperature Prediction.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Text Detection in Manga by Deep Region Proposal, Classification, and Regression.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

A Parametric Study of Deep Perceptual Model on Visible to Thermal Face Recognition.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

2017
A hybrid recommendation system considering visual information for predicting favorite restaurants.
World Wide Web, 2017

Cultural difference and visual information on hotel rating prediction.
World Wide Web, 2017

On broadcasted game video analysis: event detection, highlight detection, and highlight forecast.
Multim. Tools Appl., 2017

Camera as weather sensor: Estimating weather information from single images.
J. Vis. Commun. Image Represent., 2017

Movie Genre Classification based on Poster Images with Deep Neural Networks.
Proceedings of the Workshop on Multimodal Understanding of Social, 2017

Badminton Video Analysis based on Spatiotemporal and Stroke Features.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Manga FaceNet: Face Detection in Manga based on Deep Neural Network.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Blog Article Summarization with Image-Text Alignment Techniques.
Proceedings of the 19th IEEE International Symposium on Multimedia, 2017

Food image description based on deep-based joint food category, ingredient, and cooking method recognition.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

2016
Predicting Occupation from Images by Combining Face and Body Context Information.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Deep Correlation Features for Image Style Classification.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

News story clustering with fisher embedding.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Manga-specific features and latent style model for manga style analysis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Image2Weather: A Large-Scale Image Dataset for Weather Property Estimation.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

A Study of Combining Re-coloring and Adding Patterns to Images for Dichromats.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015
Optimized Comics-Based Storytelling for Temporal Image Sequences.
IEEE Trans. Multim., 2015

Street sweeper: detecting and removing cars in street view images.
Multim. Tools Appl., 2015

Weather-Adaptive Distance Metric for Landmark Image Classification.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Event Detection and Highlight Detection of Broadcasted Game Videos.
Proceedings of the 2nd Workshop on Computational Models of Social Interactions: Human-Computer-Media Communication, 2015

A Privacy-Preserving Bipartite Graph Matching Framework for Multimedia Analysis and Retrieval.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

2014
Color CENTRIST: Embedding color information in scene categorization.
J. Vis. Commun. Image Represent., 2014

Line-Based Drawing Style Description for Manga Classification.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Fast Object Detection Using Multistage Particle Window Deformable Part Model.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Predicting Occupation from Single Facial Images.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

2013
Mathematical Formula Detection in Heterogeneous Document Images.
Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, 2013

Evaluation of Product Quantization for Image Search.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Optimized speech balloon placement for automatic comics generation.
Proceedings of the 3rd ACM international workshop on Interactive multimedia on mobile & portable devices, 2013

Size does matter: how image size affects aesthetic perception?
Proceedings of the ACM Multimedia Conference, 2013

ACM multimedia 2013 workshop on crowdsourcing for multimedia.
Proceedings of the ACM Multimedia Conference, 2013

Tag suggestion and localization for images by bipartite graph matching.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Rhythm of Motion Extraction and Rhythm-Based Cross-Media Alignment for Dance Videos.
IEEE Trans. Multim., 2012

Somebody helps me: Travel video scene detection using web-based context.
Neurocomputing, 2012

Enabling portable animation browsing by transforming animations into comics.
Proceedings of the 2nd ACM international workshop on Interactive multimedia on mobile and portable devices, 2012

ACM multimedia 2012 workshop on crowdsourcing for multimedia.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Visual pattern discovery for architecture image classification and product image search.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Color CENTRIST: a color descriptor for scene categorization.
Proceedings of the International Conference on Multimedia Retrieval, 2012

GPU-accelerated scene categorization under multiscale category-specific visual word strategy.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Logo recognition and localization in real-world images by using visual patterns.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
A Comprehensive Study of Sports Video Analysis.
Proceedings of the Multimedia Analysis, Processing and Communications, 2011

Editing by Viewing: Automatic Home Video Summarization by Viewing Behavior Analysis.
IEEE Trans. Multim., 2011

Travelmedia: An intelligent management system for media captured in travel.
J. Vis. Commun. Image Represent., 2011

Score Following and Retrieval Based on Chroma and Octave Representation.
Proceedings of the Advances in Multimedia Modeling, 2011

News story clustering from both what and how aspects: using bag of word model and affinity propagation.
Proceedings of the 2011 ACM international workshop on Automated media analysis and production for novel TV services, 2011

2010
Modeling spatiotemporal relationships between moving objects for event tactics analysis in tennis videos.
Multim. Tools Appl., 2010

Consumer photo management and browsing facilitated by near-duplicate detection with feature filtering.
J. Vis. Commun. Image Represent., 2010

Travel Photo and Video Summarization with Cross-Media Correlation and Mutual Influence.
Proceedings of the Advances in Multimedia Modeling, 2010

Age classification for pose variant and occluded faces.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

A real-time user Interest Meter and its applications in home video summarizing.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009
RoleNet: Movie Analysis from the Perspective of Social Networks.
IEEE Trans. Multim., 2009

A User Experience Model for Home Video Summarization.
Proceedings of the Advances in Multimedia Modeling, 2009

Visual language model for face clustering in consumer photos.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Feature classification for representative photo selection.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Automatic summarization of travel photos using near-duplication detection and feature filtering.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Using context information and local feature points in face clustering for consumer photos.
Proceedings of the IEEE International Conference on Acoustics, 2009

Using cross-media correlation for scene detection in travel videos.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

2008
Explicit semantic events detection and development of realistic applications for broadcasting baseball videos.
Multim. Tools Appl., 2008

Aesthetics-Based Automatic Home Video Skimming System.
Proceedings of the Advances in Multimedia Modeling, 2008

Automatic selection of representative photo and smart thumbnailing using near-duplicate detection.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Event detection in tennis matches based on video data mining.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

2007
Tiling Slideshow: An Audiovisual Presentation Method for Consumer Photos.
IEEE Multim., 2007

RoleNet: treat a movie as a small society.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

ITEMS: intelligent travel experience management system.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Semantic-event based analysis and segmentation of wedding ceremony videos.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Movie Analysis Based on Roles' Social Network.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Exploring Broadcasting Baseball Videos Based on Multimodal and Multidisciplinary Study.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
Semantic Context Detection Using Audio Event Fusion.
EURASIP J. Adv. Signal Process., 2006

Development of realistic applications based on explicit event detection in broadcasting baseball videos.
Proceedings of the 12th International Conference on Multi Media Modeling (MMM 2006), 2006

Audiovisual slideshow: present your journey by photos.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Tiling slideshow.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Extraction of Baseball Trajectory and Physics-Based Validation for Single-View Baseball Video Sequences.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

2005
Toward semantic indexing and retrieval using hierarchical audio models.
Multim. Syst., 2005

Toward better retrieval and presentation by exploring cross-media correlations.
Multim. Syst., 2005

A Visual Attention Based Region-of-Interest Determination Framework for Video Sequences.
IEICE Trans. Inf. Syst., 2005

Improvement of Commercial Boundary Detection Using Audiovisual Features.
Proceedings of the Advances in Multimedia Information Processing, 2005

Generative and Discriminative Modeling toward Semantic Context Detection in Audio Tracks.
Proceedings of the 11th International Conference on Multi Media Modeling (MMM 2005), 2005

Baseball event detection using game-specific feature sets and rules.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Automatic video region-of-interest determination based on user attention model.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Integration of rule-based and model-based decision methods for baseball event detection.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004
A Unified Framework Using Spatial Color Descriptor and Motion-Based Post Refinement for Shot Boundary Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Action movies segmentation and summarization based on tempo analysis.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

A study of semantic context detection by using SVM and GMM approaches.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

2003
Semantic context detection based on hierarchical audio models.
Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2003

2002
Multiple Granularity Access to Navigated Hypermedia Documents Using Temporal Meta-information.
Proceedings of the Advances in Multimedia Information Processing, 2002

The WSML system: web-based synchronization multimedia lecture system.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Cross-media correlation: a case study of navigated hypermedia documents.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002


  Loading...