Alborz Geramifard

According to our database1, Alborz Geramifard authored at least 48 papers between 2006 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Overview of the Tenth Dialog System Technology Challenge: DSTC10.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Overview of the Ninth Dialog System Technology Challenge: DSTC9.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Sequential Decision-Making for Inline Text Autocomplete.
RLJ, 2024

Score Models for Offline Goal-Conditioned Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

When should we prefer Decision Transformers for Offline Reinforcement Learning?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Robustness through Data Augmentation Loss Consistency.
Trans. Mach. Learn. Res., 2023

Sequence Modeling is a Robust Contender for Offline Reinforcement Learning.
CoRR, 2023

Curriculum Script Distillation for Multilingual Visual Question Answering.
CoRR, 2023

2022
Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation.
CoRR, 2022

Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities.
CoRR, 2022

Reinforcement Learning of Multi-Domain Dialog Policies Via Action Embeddings.
CoRR, 2022

Database Search Results Disambiguation for Task-Oriented Dialog Systems.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Memformer: A Memory-Augmented Transformer for Sequence Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022

Navigating Connected Memories with a Task-oriented Dialog System.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
Guest editorial: special issue on reinforcement learning for real life.
Mach. Learn., 2021

DAIR: Data Augmented Invariant Regularization.
CoRR, 2021

Annotation Inconsistency and Entity Bias in MultiWOZ.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

DialogStitch: Synthetic Deeper and Multi-Context Task-Oriented Dialogs.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

An Analysis of State-of-the-Art Models for Situated Interactive MultiModal Conversations (SIMMC).
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Conversational AI Efforts within Facebook AI Applied Research.
Proceedings of the MuCAI'21: Proceedings of the 2nd ACM Multimedia Workshop on Multimodal Conversational AI, 2021

SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Resource Constrained Dialog Policy Learning Via Differentiable Inductive Logic Programming.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Situated and Interactive Multimodal Conversations.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
SIMMC: Situated Interactive Multi-Modal Conversational Data Collection And Evaluation Platform.
CoRR, 2019

Domain-Independent turn-level Dialogue Quality Evaluation via User Satisfaction Estimation.
CoRR, 2019

2017
Learning Robust Dialog Policies in Noisy Environments.
CoRR, 2017

The Future of Artificially Intelligent Assistants.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

2015
RLPy: a value-function-based reinforcement learning framework for education and research.
J. Mach. Learn. Res., 2015

2013
Intelligent Cooperative Control Architecture: A Framework for Performance Improvement Using Safe Learning.
J. Intell. Robotic Syst., 2013

A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning.
Found. Trends Mach. Learn., 2013

Batch-iFDD for Representation Expansion in Large MDPs.
Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

Reinforcement learning with misspecified model classes.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Decentralized control of partially observable Markov decision processes.
Proceedings of the 52nd IEEE Conference on Decision and Control, 2013

2012
Adaptive Planning for Markov Decision Processes with Uncertain Transition Models via Incremental Feature Dependency Discovery.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Model estimation within planning and learning.
Proceedings of the American Control Conference, 2012

2011
Online Discovery of Feature Dependencies.
Proceedings of the 28th International Conference on Machine Learning, 2011

UAV cooperative control with stochastic risk models.
Proceedings of the American Control Conference, 2011

2010
On the Design and Use of a Micro Air Vehicle to Track and Avoid Adversaries.
Int. J. Robotics Res., 2010

An intelligent Cooperative Control Architecture.
Proceedings of the American Control Conference, 2010

Actor-Critic Policy Learning in Cooperative Planning.
Proceedings of the Embedded Reasoning, 2010

2008
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping.
Proceedings of the UAI 2008, 2008

Co-ordinated Tracking and Planning Using Air and Ground Vehicles.
Proceedings of the Experimental Robotics, The Eleventh International Symposium, 2008

Sigma point policy iteration.
Proceedings of the 7th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2008), 2008

2006
A Hybrid Three Layer Architecture for Fire Agent Management in Rescue Simulation Environment
CoRR, 2006

iLSTD: Eligibility Traces and Convergence Analysis.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Biased Cost Pathfinding.
Proceedings of the Second Artificial Intelligence and Interactive Digital Entertainment Conference, 2006

Incremental Least-Squares Temporal Difference Learning.
Proceedings of the Proceedings, 2006


  Loading...