Sertan Girgin

According to our database, Sertan Girgin authored at least 43 papers between 2005 and 2024.

Bibliography

2024
Diversity-Rewarded CFG Distillation.
CoRR, 2024

Gemma 2: Improving Open Language Models at a Practical Size.
CoRR, 2024

BOND: Aligning LLMs with Best-of-N Distillation.
CoRR, 2024

WARP: On the Benefits of Weight Averaged Rewarded Policies.
CoRR, 2024

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.
CoRR, 2024


MusicRL: Aligning Music Generation to Human Preferences.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision.
Trans. Assoc. Comput. Linguistics, 2023

Nash Learning from Human Feedback.
CoRR, 2023

Get Back Here: Robust Imitation by Return-to-Distribution Planning.
CoRR, 2023

Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
vec2text with Round-Trip Translations.
CoRR, 2022

Scalable Deep Reinforcement Learning Algorithms for Mean Field Games.
Proceedings of the International Conference on Machine Learning, 2022

Continuous Control with Action Quantization from Demonstrations.
Proceedings of the International Conference on Machine Learning, 2022

Decoding a Neural Retriever's Latent Space for Query Suggestion.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Solving N-Player Dynamic Routing Games with Congestion: A Mean-Field Approach.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

2021
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning.
CoRR, 2021

What Matters for Adversarial Imitation Learning?
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Brax - A Differentiable Physics Engine for Large Scale Rigid Body Simulation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Hyperparameter Selection for Imitation Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study.
Proceedings of the 9th International Conference on Learning Representations, 2021

2020
What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study.
CoRR, 2020

2017
Text based user comments as a signal for automatic language identification of online videos.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

2013
A novel report generation approach for medical applications: The SISDS methodology and its applications.
Int. J. Medical Informatics, 2013

From assets to stories via the Google Cultural Institute Platform.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

2012
Managing advertising campaigns - an approximate planning approach.
Frontiers Comput. Sci., 2012

2011
A Bilinear Interpolation Based Approach for Optimizing Hematoxylin and Eosin Stained Microscopical Images.
Proceedings of the Pattern Recognition in Bioinformatics, 2011

2010
Improving reinforcement learning by using sequence trees.
Mach. Learn., 2010

Advertising Campaigns Management: Should We Be Greedy?
Proceedings of the ICDM 2010, 2010

2009
Developing Diagnostic DSSs Based on a Novel Data Collection Methodology.
Proceedings of the Knowledge Science, 2009

A Novel Multilingual Report Generation System for Medical Applications.
Proceedings of the Artificial Intelligence in Medicine, 2009

Feature discovery in approximate dynamic programming.
Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009

2008
Swarm Robotics.
Proceedings of the Swarm Intelligence: Introduction and Applications, 2008

Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture.
Proceedings of the Seventh International Conference on Machine Learning and Applications, 2008

Basis Expansion in Natural Actor Critic Methods.
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008

Feature Discovery in Reinforcement Learning Using Genetic Programming.
Proceedings of the Genetic Programming, 11th European Conference, 2008

2007
Abstraction in reinforcement learning (Pekiştirmeli öğrenmede soyutlama).
PhD thesis, 2007

Positive Impact of State Similarity on Reinforcement Learning Performance.
IEEE Trans. Syst. Man Cybern. Part B, 2007

State Similarity Based Approach for Improving Performance in RL.
Proceedings of the IJCAI 2007, 2007

2006
Area measurement of large closed regions with a mobile robot.
Auton. Robots, 2006

Effectiveness of Considering State Similarity for Reinforcement Learning.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2006

Learning by Automatic Option Discovery from Conditionally Terminating Sequences.
Proceedings of the ECAI 2006, 17th European Conference on Artificial Intelligence, August 29, 2006

2005
Option Discovery in Reinforcement Learning using Frequent Common Subsequences of Actions.
Proceedings of the 2005 International Conference on Computational Intelligence for Modelling Control and Automation (CIMCA 2005), 2005

