Winston H. Hsu
Orcid: 0000-0002-3330-0638Affiliations:
- National Taiwan University, Taipei, Taiwan
According to our database1,
Winston H. Hsu
authored at least 223 papers
between 2003 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses.
CoRR, 2024
Revisiting Semi-supervised Adversarial Robustness via Noise-aware Online Robust Distillation.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Bridging Episodes and Semantics: A Novel Framework for Long-Form Video Understanding.
CoRR, 2024
Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies.
CoRR, 2024
Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning.
CoRR, 2024
VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation.
CoRR, 2024
Tel2Veh: Fusion of Telecom Data and Vehicle Flow to Predict Camera-Free Traffic via a Spatio-Temporal Framework.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024
Enhancing Sustainable Urban Mobility Prediction with Telecom Data: A Spatio-Temporal Framework Approach.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
WLST: Weak Labels Guided Self-training for Weakly-supervised Domain Adaptation on 3D Object Detection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Unsupervised Image Prior via Prompt Learning and CLIP Semantic Guidance for Low-Light Image Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
TelTrans: Applying Multi-Type Telecom Data to Transportation Evaluation and Prediction via Multifaceted Graph Modeling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
CoRR, 2023
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023
Proceedings of the 2nd International Workshop on Spatio-Temporal Reasoning and Learning (STRL 2023) co-located with the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Revisiting Depth-guided Methods for Monocular 3D Object Detection by Hierarchical Balanced Depth.
Proceedings of the Conference on Robot Learning, 2023
CTCam: Enhancing Transportation Evaluation through Fusion of Cellular Traffic and Camera-Based Vehicle Flows.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023
STAMINA (Spatial-Temporal Aligned Meteorological INformation Attention) and FPL (Focal Precip Loss): Advancements in Precipitation Nowcasting for Heavy Rainfall Events.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
CoRR, 2022
CoRR, 2022
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
$\mathrm {D^2ADA}$: Dynamic Density-Aware Active Domain Adaptation for Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022
Proceedings of the 33rd British Machine Vision Conference 2022, 2022
Stage Conscious Attention Network (SCAN): A Demonstration-Conditioned Policy for Few-Shot Imitation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
ACM Trans. Multim. Comput. Commun. Appl., 2021
IEEE Trans. Circuits Syst. Video Technol., 2021
J. Vis. Commun. Image Represent., 2021
CoRR, 2021
CoRR, 2021
Should I Look at the Head or the Tail? Dual-awareness Attention for Few-Shot Object Detection.
CoRR, 2021
Proceedings of the WWW '21: The Web Conference 2021, 2021
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021
ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021
Multivariate and Propagation Graph Attention Network for Spatial-Temporal Prediction with Outdoor Cellular Traffic.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021
NOD: Taking a Closer Look at Detection under Extreme Low-Light Conditions with Night Object Detection Dataset.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021
Multi-Stream Attention Learning for Monocular Vehicle Velocity and Inter-Vehicle Distance Estimation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021
2020
Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos.
IEEE Trans. Multim., 2020
CoRR, 2020
CoRR, 2020
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020
Video Question Generation via Semantic Rich Cross-Modal Self-Attention Networks Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 4th Conference on Robot Learning, 2020
2019
CoRR, 2019
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
Proceedings of the 30th British Machine Vision Conference 2019, 2019
Proceedings of the 2019 International Conference on 3D Vision, 2019
2018
IEEE Trans. Multim., 2018
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
Drone-View Building Identification by Cross-View Visual Learning and Relative Spatial Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
Proceedings of the Computer Vision - ACCV 2018, 2018
Proceedings of the 2018 International Conference on 3D Vision, 2018
2017
IEEE Trans. Multim., 2017
Dehashing: Server-Side Context-Aware Feature Reconstruction for Mobile Visual Search.
IEEE Trans. Circuits Syst. Video Technol., 2017
Scalable Face Track Retrieval in Video Archives Using Bag-of-Faces Sparse Representation.
IEEE Trans. Circuits Syst. Video Technol., 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Proceedings of the 10th Eurographics Workshop on 3D Object Retrieval, 2017
2016
City-view image location identification by multiple geo-social media and graph-based image cluster refinement.
J. Vis. Commun. Image Represent., 2016
De-Hashing: Server-Side Context-Aware Feature Reconstruction for Mobile Visual Search.
CoRR, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Face Recognition and Retrieval Using Cross-Age Reference Coding With Cross-Age Celebrity Dataset.
IEEE Trans. Multim., 2015
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015
Exploiting Word and Visual Word Co-occurrence for Sketch-based Clipart Image Retrieval.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Real-Time Instant Event Detection in Egocentric Videos by Leveraging Sensor-Based Motion Context.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Trending pool: Visual analytics for trending event compositions for time-series categorical log data.
Proceedings of the 10th IEEE Conference on Visual Analytics Science and Technology, 2015
Summarizing While Recording: Context-Based Highlight Detection for Egocentric Videos.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Enhancing sparse voice annotation for semantic retrieval of personal photos by continuous space word representations.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Approximating Weighted Hamming Distance by Probabilistic Selection for Multiple Hash Tables.
Proceedings of the Advances in Information Retrieval, 2015
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
Visually Interpreting Names as Demographic Attributes by Exploiting Click-Through Data.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
2014
Scalable Mobile Visual Classification by Kernel Preserving Projection Over High-Dimensional Features.
IEEE Trans. Multim., 2014
Online image search result grouping with MapReduce-based image clustering and graph construction for large-scale photos.
J. Vis. Commun. Image Represent., 2014
Me-link: link me to the media - fusing audio and visual cues for robust and efficient mobile media interaction.
Proceedings of the 23rd International World Wide Web Conference, 2014
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Efficient Cross-Domain Image Retrieval by Multi-Level Matching and Spatial Verification for Structural Similarity.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Automatic Facial Image Annotation and Retrieval by Integrating Voice Label and Visual Appearance.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the International Conference on Multimedia Retrieval, 2014
Proceedings of the International Conference on Multimedia Retrieval, 2014
Proceedings of the International Conference on Multimedia Retrieval, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Investigating and predicting social and visual image interestingness on social media by crowdsourcing.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the Computer Vision - ECCV 2014, 2014
Proceedings of the Computer Vision - ECCV 2014, 2014
2013
Automatic Training Image Acquisition and Effective Feature Selection From Community-Contributed Photos for Facial Attribute Detection.
IEEE Trans. Multim., 2013
IEEE Trans. Multim., 2013
Travel Recommendation by Mining People Attributes and Travel Group Types From Community-Contributed Photos.
IEEE Trans. Multim., 2013
Investigating 3-D Model and Part Information for Improving Content-Based Vehicle Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2013
Graph-based semi-supervised learning with multi-modality propagation for large-scale image datasets.
J. Vis. Commun. Image Represent., 2013
Scalable Mobile Video Retrieval with Sparse Projection Learning and Pseudo Label Mining.
IEEE Multim., 2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the ACM Multimedia Conference, 2013
Enabling low bitrate mobile visual recognition: a performance versus bandwidth evaluation.
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the 2nd ACM international workshop on Geotagging and its applications in multimedia, 2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
Full body human attribute detection in indoor surveillance environment using color-depth information.
Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2013
2012
Preference-Aware View Recommendation System for Scenic Photos Based on Bag-of-Aesthetics-Preserving Features.
IEEE Trans. Multim., 2012
Unsupervised Semantic Feature Discovery for Image Object Retrieval and Tag Refinement.
IEEE Trans. Multim., 2012
Learning by expansion: Exploiting social media for image classification with few training examples.
Neurocomputing, 2012
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012
Sharing the trees among random forests for effective and efficient concept detection.
Proceedings of the 14th IEEE International Workshop on Multimedia Signal Processing, 2012
Large-scale simultaneous multi-object recognition and localization via bottom up search-based approach.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Detecting the directions of viewing landmarks for recommendation by large-scale user-contributed photos.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the 2nd ACM international workshop on Interactive multimedia on mobile and portable devices, 2012
Discovering informative social subgraphs and predicting pairwise relationships from group photos.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
Proceedings of the 20th International Conference on World Wide Web, 2011
Multi-layer graph-based semi-supervised learning for large-scale image datasets using mapreduce.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011
Proceedings of the Advances in Multimedia Modeling, 2011
Scalable mobile video question-answering system with locally aggregated descriptors and random projection.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Multiple object localization by context-aware adaptive window search and search-based object recognition.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Augmenting mobile city-view image retrieval with context-rich user-contributed photos.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the 2011 international ACM workshop on Interactive multimedia on mobile and portable devices, 2011
Personalized travel recommendation by mining people attributes from community-contributed photos.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011
Unsupervised auxiliary visual words discovery for large-scale image object retrieval.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011
2010
Boosting image object retrieval and indexing by automatically discovered pseudo-objects.
J. Vis. Commun. Image Represent., 2010
Interactive inquiry for object of interest in video playback by motion-augmented graph cut.
Proceedings of the 18th International Conference on Multimedia 2010, 2010
A technical demonstration of large-scale image object retrieval by efficient query evaluation and effective auxiliary visual feature discovery.
Proceedings of the 18th International Conference on Multimedia 2010, 2010
GPS, compass, or camera?: investigating effective mobile sensors for automatic search-based image annotation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010
Search-Based Automatic Image Annotation via Flickr Photos Using Tag Expansion.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Online Reranking via Ordinal Informative Concepts for Context Fusion in Concept Detection and Video Search.
IEEE Trans. Circuits Syst. Video Technol., 2009
Proceedings of the Advances in Multimedia Information Processing, 2009
Proceedings of the 17th International Conference on Multimedia 2009, 2009
Proceedings of the 17th International Conference on Multimedia 2009, 2009
Canonical image selection and efficient image graph construction for large-scale flickr photos.
Proceedings of the 17th International Conference on Multimedia 2009, 2009
A latent semantic retrieval and clustering system for personal photos with sparse speech annotation.
Proceedings of the third workshop on Searching spontaneous conversational speech, 2009
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Proceedings of the International Conference on Image Processing, 2009
2008
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008
ContextSeer: context search and recommendation at query time for shared consumer photos.
Proceedings of the 16th International Conference on Multimedia 2008, 2008
Proceedings of the 16th International Conference on Multimedia 2008, 2008
Proceedings of the 16th International Conference on Multimedia 2008, 2008
SheepDog: group and tag recommendation for flickr photos by automatic search-based learning.
Proceedings of the 16th International Conference on Multimedia 2008, 2008
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
2007
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007
Proceedings of the 1st ACM Workshop on Video Summarization, 2007
Proceedings of the 15th International Conference on Multimedia 2007, 2007
2006
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006
Proceedings of the 14th ACM International Conference on Multimedia, 2006
Topic Tracking Across Broadcast News Videos with Visual Duplicates and Semantic Concepts.
Proceedings of the International Conference on Image Processing, 2006
2005
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005
Visual Cue Cluster Construction via Information Bottleneck Principle and Kernel Density Estimation.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005
2004
Proceedings of the 2004 TREC Video Retrieval Evaluation, 2004
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004
Story boundary detection in large broadcast news video archives: techniques, experience and trends.
Proceedings of the 12th ACM International Conference on Multimedia, 2004
Generative, discriminative, and ensemble learning on multi-modal perceptual fusion toward news video story segmentation.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Discovery and Fusion of Salient Multi-modal Features Towards News Story Segmentation.
Proceedings of the 2003 TREC Video Retrieval Evaluation, 2003
Proceedings of the 2003 TREC Video Retrieval Evaluation, 2003
A statistical framework for fusing mid-level perceptual features in news story segmentation.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003