Hassan Mansoor

According to our database, Hassan Mansoor authored at least 10 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
VQA Training Sets are Self-play Environments for Generating Few-shot Pools.
CoRR, 2024

PERL: Parameter Efficient Reinforcement Learning from Human Feedback.
CoRR, 2024

Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

ScreenAI: A Vision-Language Model for UI and Infographics Understanding.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

LLMs cannot find reasoning errors, but can correct them given the error location.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
LLMs cannot find reasoning errors, but can correct them!
CoRR, 2023

The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization.
CoRR, 2023

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback.
CoRR, 2023

2016
Towards Semi-Automatic Size Measurement of User Interfaces in Web Applications with IFPUG SNAP.
Proceedings of the 2016 Joint Conference of the International Workshop on Software Measurement and the International Conference on Software Process and Product Measurement, 2016

