Benjamin Z. Yao

Orcid: 0009-0005-8622-3540

According to our database1, Benjamin Z. Yao authored at least 31 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A Self-Learning Framework for Large-Scale Conversational AI Systems.
IEEE Comput. Intell. Mag., May, 2024

Bringing Multimodality to Amazon Visual Search System.
CoRR, 2024

Bringing Multimodality to Amazon Visual Search System.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Diffusion Models for Multi-Task Generative Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs.
Proceedings of the Computer Vision - ECCV 2024, 2024

Open Vocabulary Multi-label Video Classification.
Proceedings of the Computer Vision - ECCV 2024, 2024

VidLA: Video-Language Alignment at Scale.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
KEPLET: Knowledge-Enhanced Pretrained Language Model with Topic Entity Awareness.
CoRR, 2023

PersonaPKT: Building Personalized Dialogue Agents via Parameter-efficient Knowledge Transfer.
Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing, 2023

KEPLET: Knowledge-Enhanced Pretrained Language Model with Topic Entity Awareness.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Joint Goal Segmentation and Goal Success Prediction on Multi-Domain Conversations.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Feedback-Based Self-Learning in Large-Scale Conversational AI Agents.
AI Mag., 2021

2020
Large-scale Hybrid Approach for Predicting User Satisfaction with Conversational Agents.
CoRR, 2020

IQ-Net: A DNN Model for Estimating Interaction-level Dialogue Quality with Conversational Agents.
Proceedings of the KDD 2020 Workshop on Conversational Systems Towards Mainstream Adoption co-located with the 26TH ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD 2020), 2020

Knowledge Distillation from Internal Representations.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Knowledge Distillation from Internal Representations.
CoRR, 2019

2016
Compositional models and Structured learning for visual recognition.
Pattern Recognit., 2016

2014
Animated Pose Templates for Modeling and Detecting Human Actions.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

2013
Auto learning temporal atomic actions for activity classification.
Pattern Recognit., 2013

Learning and parsing video events with goal and intent prediction.
Comput. Vis. Image Underst., 2013

2012
Reconfigurable templates for robust vehicle detection and classification.
Proceedings of the IEEE Workshop on Applications of Computer Vision, 2012

Modelling Atomic Actions for Activity Classification.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

2011
Inferring social roles in long timespan video sequence.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Unsupervised learning of event AND-OR grammar and semantics from video.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2010
I2T: Image Parsing to Text Description.
Proc. IEEE, 2010

Action detection using multiple spatial-temporal interest point features.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

2009
Learning deformable action templates from cluttered videos.
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009

Image parsing with stochastic grammar: The Lotus Hill dataset and inference scheme.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

2008
Learning a scene contextual model for tracking and abnormality detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008

A hierarchical and contextual model for aerial image understanding.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Introduction to a Large-Scale General Purpose Ground Truth Database: Methodology, Annotation Tool and Benchmarks.
Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition, 2007


  Loading...