Dale Schuurmans

Affiliations:
  • University of Alberta


According to our database1, Dale Schuurmans authored at least 269 papers between 1989 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Toward Understanding In-context vs. In-weight Learning.
CoRR, 2024

Faster WIND: Accelerating Iterative Best-of-<i>N</i> Distillation for LLM Alignment.
CoRR, 2024

Plastic Learning with Deep Fourier Features.
CoRR, 2024

Autoregressive Large Language Models are Computationally Universal.
CoRR, 2024

Generative Hierarchical Materials Search.
CoRR, 2024

UQE: A Query Engine for Unstructured Databases.
CoRR, 2024

Exploring and Benchmarking the Planning Capabilities of Large Language Models.
CoRR, 2024

Learning Continually by Spectral Regularization.
CoRR, 2024

Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF.
CoRR, 2024

Soft Preference Optimization: Aligning Language Models to Expert Distributions.
CoRR, 2024

Video as the New Language for Real-World Decision Making.
CoRR, 2024

Beyond Expectations: Learning with Stochastic Dominance Made Practical.
CoRR, 2024

Mitigating the Curse of Horizon in Monte-Carlo Returns.
RLJ, 2024

Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Position: Video as the New Language for Real-World Decision Making.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Learning Interactive Real-World Simulators.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Probabilistic Adaptation of Black-Box Text-to-Video Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Scalable Diffusion for Materials Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Reinforcement Teaching.
Trans. Mach. Learn. Res., 2023

Curvature Explains Loss of Plasticity.
CoRR, 2023

Large Language Models can Learn Rules.
CoRR, 2023

Learning Interactive Real-World Simulators.
CoRR, 2023

Probabilistic Adaptation of Text-to-Video Models.
CoRR, 2023

Foundation Models for Decision Making: Problems, Methods, and Opportunities.
CoRR, 2023

Memory Augmented Large Language Models are Computationally Universal.
CoRR, 2023

Energy-based Predictive Representations for Partially Observed Reinforcement Learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Ordering-based Conditions for Global Convergence of Policy Gradient Methods.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DISCS: A Benchmark for Discrete Sampling.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Universal Policies via Text-Guided Video Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revisiting Sampling for Combinatorial Optimization.
Proceedings of the International Conference on Machine Learning, 2023

Gradient-Free Structured Pruning with Unlabeled Data.
Proceedings of the International Conference on Machine Learning, 2023

Stochastic Gradient Succeeds for Bandits.
Proceedings of the International Conference on Machine Learning, 2023

Least-to-Most Prompting Enables Complex Reasoning in Large Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

TEMPERA: Test-Time Prompt Editing via Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Dichotomy of Control: Separating What You Can Control from What You Cannot.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Score-based Continuous-time Discrete Diffusion Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Any-scale Balanced Samplers for Discrete Space.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Spectral Decomposition Representation for Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Latent Variable Representation for Reinforcement Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

What learning algorithm is in-context learning? Investigations with linear models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Self-Consistency Improves Chain of Thought Reasoning in Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Discrete Langevin Samplers via Wasserstein Gradient Flow.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Learning to Optimize with Stochastic Dominance Constraints.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
TEMPERA: Test-Time Prompting via Reinforcement Learning.
CoRR, 2022

Learning to Optimize with Stochastic Dominance Constraints.
CoRR, 2022

Rationale-Augmented Ensembles in Language Models.
CoRR, 2022

Discrete Langevin Sampler via Wasserstein Gradient Flow.
CoRR, 2022

Least-to-Most Prompting Enables Complex Reasoning in Large Language Models.
CoRR, 2022

Self-Consistency Improves Chain of Thought Reasoning in Language Models.
CoRR, 2022

On the Effect of Log-Barrier Regularization in Decentralized Softmax Gradient Play in Multiagent Systems.
CoRR, 2022

Chain of Thought Prompting Elicits Reasoning in Large Language Models.
CoRR, 2022

On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Chain of Thought Imitation with Procedure Cloning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Optimal Scaling for Locally Balanced Proposals in Discrete Spaces.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Role of Baselines in Policy Gradient Optimization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

A Simple Decentralized Cross-Entropy Method.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Making Linear MDPs Practical via Contrastive Representation Learning.
Proceedings of the International Conference on Machine Learning, 2022

A Parametric Class of Approximate Gradient Updates for Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2022

Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization.
Proceedings of the International Conference on Machine Learning, 2022

Understanding and Leveraging Overparameterization in Recursive Value Estimation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Neural Stochastic Dual Dynamic Programming.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Offline Policy Selection under Uncertainty.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

The Curse of Passive Data Collection in Batch Reinforcement Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

2021
On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data.
CoRR, 2021

Joint Attention for Multi-Agent Coordination and Social Learning.
CoRR, 2021

Optimization Issues in KL-Constrained Approximate Policy Iteration.
CoRR, 2021

Combiner: Full Attention Transformer with Sparse Computation Cost.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Understanding the Effect of Stochasticity in Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

On the Optimality of Batch Policy Optimization Algorithms.
Proceedings of the 38th International Conference on Machine Learning, 2021

Characterizing the Gap Between Actor-Critic and Policy Gradient.
Proceedings of the 38th International Conference on Machine Learning, 2021

LEGO: Latent Execution-Guided Reasoning for Multi-Hop Question Answering on Knowledge Graphs.
Proceedings of the 38th International Conference on Machine Learning, 2021

Leveraging Non-uniformity in First-order Non-convex Optimization.
Proceedings of the 38th International Conference on Machine Learning, 2021

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL.
Proceedings of the 38th International Conference on Machine Learning, 2021

Deep Probabilistic Canonical Correlation Analysis.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Attention that does not Explain Away.
CoRR, 2020

Variational Inference for Deep Probabilistic Canonical Correlation Analysis.
CoRR, 2020

ConQUR: Mitigating Delusional Bias in Deep Q-learning.
CoRR, 2020

Off-Policy Evaluation via the Regularized Lagrangian.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Escaping the Gravitational Pull of Softmax.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Maximum-Entropy Approach to Off-Policy Evaluation in Average-Reward MDPs.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

CoinDICE: Off-Policy Confidence Interval Estimation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Go Wide, Then Narrow: Efficient Training of Deep Thin Networks.
Proceedings of the 37th International Conference on Machine Learning, 2020

Energy-Based Processes for Exchangeable Data.
Proceedings of the 37th International Conference on Machine Learning, 2020

Domain Aggregation Networks for Multi-Source Domain Adaptation.
Proceedings of the 37th International Conference on Machine Learning, 2020

Batch Stationary Distribution Estimation.
Proceedings of the 37th International Conference on Machine Learning, 2020

ConQUR: Mitigating Delusional Bias in Deep Q-Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

On the Global Convergence Rates of Softmax Policy Gradient Methods.
Proceedings of the 37th International Conference on Machine Learning, 2020

Scalable Deep Generative Modeling for Sparse Graphs.
Proceedings of the 37th International Conference on Machine Learning, 2020

An Optimistic Perspective on Offline Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

GenDICE: Generalized Offline Estimation of Stationary Values.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Learning to Combat Compounding-Error in Model-Based Reinforcement Learning.
CoRR, 2019

AlgaeDICE: Policy Gradient from Arbitrary Experience.
CoRR, 2019

Striving for Simplicity in Off-policy Deep Reinforcement Learning.
CoRR, 2019

Maximum Entropy Monte-Carlo Planning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Invertible Convolutional Flow.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Exponential Family Estimation via Adversarial Dynamics Embedding.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Surrogate Objectives for Batch Policy Optimization in One-step Decision Making.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Geometric Perspective on Optimal Representations for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Advantage Amplification in Slowly Evolving Latent-State Environments.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

On Principled Entropy Exploration in Policy Optimization.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

The Value Function Polytope in Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Understanding the Impact of Entropy on Policy Optimization.
Proceedings of the 36th International Conference on Machine Learning, 2019

Learning to Generalize from Sparse and Underspecified Rewards.
Proceedings of the 36th International Conference on Machine Learning, 2019

Kernel Exponential Family Estimation via Doubly Dual Embedding.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Planning and Learning with Stochastic Action Sets.
CoRR, 2018

Non-delusional Q-learning and value-iteration.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Planning and Learning with Stochastic Action Sets.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Smoothed Action Value Functions for Learning Gaussian Policies.
Proceedings of the 35th International Conference on Machine Learning, 2018

Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.
Proceedings of the 6th International Conference on Learning Representations, 2018

Variational Rejection Sampling.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2018

2017
Generalized Conditional Gradient for Sparse Estimation.
J. Mach. Learn. Res., 2017

Safe Exploration for Identifying Linear Systems via Robust Optimization.
CoRR, 2017

Holographic Feature Representations of Deep Networks.
Proceedings of the Thirty-Third Conference on Uncertainty in Artificial Intelligence, 2017

Bridging the Gap Between Value and Policy Based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Multi-view Matrix Factorization for Linear Dynamical System Estimation.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Logistic Markov Decision Processes.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Improving Policy Gradient by Exploring Under-appreciated Rewards.
Proceedings of the 5th International Conference on Learning Representations, 2017

Formalizing Anthropomorphism Through Games: A Study in Deep Neural Networks.
Proceedings of the Workshops of the The Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Deep Learning Games.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Reward Augmented Maximum Likelihood for Neural Structured Prediction.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Stochastic Neural Networks with Monotonic Activation Functions.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

Scalable and Sound Low-Rank Tensor Learning.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

2015
On Time Domain Analysis of Photoplethysmogram Signals for Monitoring Heat Stress.
Sensors, 2015

Learning with a Strong Adversary.
CoRR, 2015

Frequency analysis of photoplethysmogram and its derivatives.
Comput. Methods Programs Biomed., 2015

Scalable Metric Learning for Co-Embedding.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Generalization in Unsupervised Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Embedding Inference for Structured Multilabel Prediction.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Correcting Covariate Shift with the Frank-Wolfe Algorithm.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Semi-Supervised Zero-Shot Classification with Label Representation Learning.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Variance Reduction via Antithetic Markov Chains.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Optimal Estimation of Multivariate ARMA Models.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Convex Deep Learning via Normalized Kernels.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Adaptive Monte Carlo via Bandit Allocation.
Proceedings of the 31th International Conference on Machine Learning, 2014

Convex Co-embedding.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Reinforcement Ranking
CoRR, 2013

Exploiting Syntactic, Semantic, and Lexical Regularities in Language Modeling via Directed Markov Random Fields.
Comput. Intell., 2013

Convex Relaxations of Bregman Divergence Clustering.
Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence, 2013

Protein-chemical Interaction Prediction via Kernelized Sparse Learning SVM.
Proceedings of the Biocomputing 2013: Proceedings of the Pacific Symposium, 2013

Multi-label Classification with Output Kernels.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Polar Operators for Structured Sparse Estimation.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Convex Two-Layer Modeling.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Characterizing the Representer Theorem.
Proceedings of the 30th International Conference on Machine Learning, 2013

Divergence based graph estimation for manifold learning.
Proceedings of the IEEE Global Conference on Signal and Information Processing, 2013

Learning a Metric Space for Neighbourhood Topology Estimation: Application to Manifold Learning.
Proceedings of the Asian Conference on Machine Learning, 2013

2012
The Latent Maximum Entropy Principle.
ACM Trans. Knowl. Discov. Data, 2012

Generalized Optimal Reverse Prediction.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

An efficient algorithm for maximal margin clustering.
J. Glob. Optim., 2012

An experimental methodology for response surface optimization methods.
J. Glob. Optim., 2012

Linear Coherent Bi-Clustering via Beam Searching and Sample Set Clustering.
Discret. Math. Algorithms Appl., 2012

Sparse Learning Based Linear Coherent Bi-clustering.
Proceedings of the Algorithms in Bioinformatics - 12th International Workshop, 2012

Semi-supervised Multi-label Classification - A Simultaneous Large-Margin, Subspace Learning Approach.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Accelerated Training for Matrix-norm Regularization: A Boosting Approach.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

A Polynomial-time Form of Robust Regression.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Convex Multi-view Subspace Learning.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Regularizers versus Losses for Nonlinear Dimensionality Reduction: A Factored View with New Convex Relaxations.
Proceedings of the 29th International Conference on Machine Learning, 2012

2011
Real-Time Discriminative Background Subtraction.
IEEE Trans. Image Process., 2011

Rank/Norm Regularization with Closed-Form Solutions: Application to Subspace Clustering.
Proceedings of the UAI 2011, 2011

Modular Community Detection in Networks.
Proceedings of the IJCAI 2011, 2011

MapReduce for Parallel Reinforcement Learning.
Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Convex Sparse Coding, Subspace Learning, and Semi-Supervised Extensions.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Adaptive Large Margin Training for Multilabel Classification.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Relaxed Clipping: A Global Training Method for Robust Regression and Classification.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Distributed Flow Algorithms for Scalable Similarity Visualization.
Proceedings of the ICDMW 2010, 2010

Improved Natural Language Learning via Variance-Regularization Support Vector Machines.
Proceedings of the Fourteenth Conference on Computational Natural Language Learning, 2010

Linear Coherent Bi-cluster Discovery via Beam Detection and Sample Set Clustering.
Proceedings of the Combinatorial Optimization and Applications, 2010

Strictly Lexicalised Dependency Parsing.
Proceedings of the Trends in Parsing Technology, 2010

2009
Dual Temporal Difference Learning.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

Learning Exercise Policies for American Options.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

A General Projection Property for Distribution Families.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Convex Relaxation of Mixture Regression with Efficient Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Optimal reverse prediction: a unified perspective on supervised, unsupervised and semi-supervised learning.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Discriminative Maximum Margin Image Object Categorization with Exact Inference.
Proceedings of the Fifth International Conference on Image and Graphics, 2009

Fast normalized cut with linear constraints.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Linear Coherent Bi-cluster Discovery via Line Detection and Sample Majority Voting.
Proceedings of the Combinatorial Optimization and Applications, 2009

Inference of the structural credit risk model using MLE.
Proceedings of the 2009 IEEE Symposium on Computational Intelligence for Financial Engineering, 2009

A Reformulation of Support Vector Machines for General Confidence Functions.
Proceedings of the Advances in Machine Learning, 2009

2008
Policy Iteration for Learning an Exercise Policy for American Options.
Proceedings of the Recent Advances in Reinforcement Learning, 8th European Workshop, 2008

Efficient global optimization for exponential family PCA and low-rank matrix factorization.
Proceedings of the 46th Annual Allerton Conference on Communication, 2008

Semi-Supervised Convex Training for Dependency Parsing.
Proceedings of the ACL 2008, 2008

2007
Learning Gene Regulatory Networks via Globally Regularized Risk Minimization.
Proceedings of the Comparative Genomics, RECOMB 2007 International Workshop, 2007

Stable Dual Dynamic Programming.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Convex Relaxations of Latent Variable Training.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Discriminative Batch Mode Active Learning.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

Simple Training of Dependency Parsers via Structured Boosting.
Proceedings of the IJCAI 2007, 2007

Automatic Gait Optimization with Gaussian Process Regression.
Proceedings of the IJCAI 2007, 2007

2006
Graphical Models and Point Pattern Matching.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Constraint-based optimization and utility elicitation using the minimax decision criterion.
Artif. Intell., 2006

Convex Structure Learning for Bayesian Networks: Polynomial Feature Selection and Approximate Ordering.
Proceedings of the UAI '06, 2006

Web Communities Identification from Random Walks.
Proceedings of the Knowledge Discovery in Databases: PKDD 2006, 2006

Information Marginalization on Subgraphs.
Proceedings of the Knowledge Discovery in Databases: PKDD 2006, 2006

Learning to Model Spatial Dependency: Semi-Supervised Discriminative Random Fields.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

implicit Online Learning with Kernels.
Proceedings of the Advances in Neural Information Processing Systems 19, 2006

Discriminative unsupervised learning of structured predictors.
Proceedings of the Machine Learning, 2006

Stochastic Analysis of Lexical and Semantic Enhanced Structural Language Model.
Proceedings of the Grammatical Inference: Algorithms and Applications, 2006

Improved Large Margin Dependency Parsing via Local Constraints and Laplacian Regularization.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006

An Online Discriminative Approach to Background Subtraction.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

Semi-Supervised Conditional Random Fields for Improved Sequence Segmentation and Labeling.
Proceedings of the ACL 2006, 2006

Robust Support Vector Machine Training via Convex Outlier Ablation.
Proceedings of the Proceedings, 2006

Compact, Convex Upper Bound Iteration for Approximate POMDP Planning.
Proceedings of the Proceedings, 2006

Metric-Based Approaches for Semi-Supervised Regression and Classification.
Proceedings of the Semi-Supervised Learning, 2006

2005
Combining Statistical Language Models via the Latent Maximum Entropy Principle.
Mach. Learn., 2005

Maximum Margin Bayesian Networks.
Proceedings of the UAI '05, 2005

Strictly Lexical Dependency Parsing.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

Learning Coordination Classifiers.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Regret-based Utility Elicitation in Constraint-based Decision Problems.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Bayesian sparse sampling for on-line reward optimization.
Proceedings of the Machine Learning, 2005

Variational Bayesian image modelling.
Proceedings of the Machine Learning, 2005

Tangent-Corrected Embedding.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

Unsupervised and Semi-Supervised Multi-Class Support Vector Machines.
Proceedings of the Proceedings, 2005

2004
Learning mixture models with the regularized latent maximum entropy principle.
IEEE Trans. Neural Networks, 2004

Dynamic Web log session identification with statistical language models.
J. Assoc. Inf. Sci. Technol., 2004

Augmenting Naive Bayes Classifiers with Statistical Language Models.
Inf. Retr., 2004

Maximum Margin Clustering.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Transformation-Invariant Embedding for Image Analysis.
Proceedings of the Computer Vision, 2004

2003
Automatic basis selection techniques for RBF networks.
Neural Networks, 2003

Applying Machine Learning to Text Segmentation for Information Retrieval.
Inf. Retr., 2003

Boltzmann Machine Learning with the Latent Maximum Entropy Principle.
Proceedings of the UAI '03, 2003

Monte Carlo Matrix Inversion Policy Evaluation.
Proceedings of the UAI '03, 2003

Language and Task Independent Text Categorization with Simple Language Models.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Text classification in Asian languages without word segmentation.
Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages, 2003

Learning Mixture Models with the Latent Maximum Entropy Principle.
Proceedings of the Machine Learning, 2003

Semantic n-gram language modeling with the latent maximum entropy principle.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Combining Naive Bayes and n-Gram Language Models for Text Classification.
Proceedings of the Advances in Information Retrieval, 2003

Language Independent Authorship Attribution with Character Level N-Grams.
Proceedings of the EACL 2003, 2003

Face Alignment Using Statistical Models and Wavelet Features.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

Constraint-Based Optimization with the Minimax Decision Criterion.
Proceedings of the Principles and Practice of Constraint Programming, 2003

Learning Continuous Latent Variable Models with Bregman Divergences.
Proceedings of the Algorithmic Learning Theory, 14th International Conference, 2003

Latent Maximum Entropy Approach for Semantic N-gram Language Modeling.
Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, 2003

Model-Based Least-Squares Policy Evaluation.
Proceedings of the Advances in Artificial Intelligence, 2003

Session Boundary Detection for Association Rule Learning Using n-Gram Language Models.
Proceedings of the Advances in Artificial Intelligence, 2003

2002
Metric-Based Methods for Adaptive Model Selection and Regularization.
Mach. Learn., 2002

Guest Introduction: Special Issue on New Methods for Model Selection and Model Combination.
Mach. Learn., 2002

Using self-supervised word segmentation in Chinese information retrieval.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Waterloo at NTCIR-3: Using Self-supervised Word Segmentation.
Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, 2002

Regularized Greedy Importance Sampling.
Proceedings of the Advances in Neural Information Processing Systems 15 [Neural Information Processing Systems, 2002

Investigating the Maximum Likelihood Alternative to TD(lambda).
Proceedings of the Machine Learning, 2002

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs.
Proceedings of the Machine Learning, 2002

Investigating the Relationship between Word Segmentation Performance and Retrieval Performance in Chinese IR.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

Piecewise Linear Value Function Approximation for Factored MDPs.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

Greedy Linear Value-Approximation for Factored Markov Decision Processes.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

Data Perturbation for Escaping Local Maxima in Learning.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

2001
General Convergence Results for Linear Discriminant Updates.
Mach. Learn., 2001

Local search characteristics of incomplete SAT procedures.
Artif. Intell., 2001

A Hierarchical EM Approach to Word Segmentation.
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

A Simple Closed-Class/Open-Class Factorization for Improved Language Modeling.
Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

Direct value-approximation for factored MDPs.
Proceedings of the Advances in Neural Information Processing Systems 14 [Neural Information Processing Systems: Natural and Synthetic, 2001

The Exponentiated Subgradient Algorithm for Heuristic Boolean Programming.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

Self-Supervised Chinese Word Segmentation.
Proceedings of the Advances in Intelligent Data Analysis, 4th International Conference, 2001

2000
Monte Carlo inference via greedy importance sampling.
Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

An Adaptive Regularization Criterion for Supervised Learning.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

1999
Greedy Importance Sampling.
Proceedings of the Advances in Neural Information Processing Systems 12, [NIPS Conference, Denver, Colorado, USA, November 29, 1999

Efficient exploration for optimizing immediate reward.
Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1998
Boosting in the Limit: Maximizing the Margin of Learned Ensembles.
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

1997
Characterizing Rational Versus Exponential learning Curves.
J. Comput. Syst. Sci., 1997

Learning Bayesian Nets that Perform Well.
Proceedings of the UAI '97: Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, 1997

Characterizing the generalization performance of model selection strategies.
Proceedings of the Fourteenth International Conference on Machine Learning (ICML 1997), 1997

A New Metric-Based Approach to Model Selection.
Proceedings of the Fourteenth National Conference on Artificial Intelligence and Ninth Innovative Applications of Artificial Intelligence Conference, 1997

1996
Effective classification learning.
PhD thesis, 1996

1995
Practical PAC Learning.
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

Sequential PAC Learning.
Proceedings of the Eigth Annual Conference on Computational Learning Theory, 1995

1992
Learning Useful Horn Approximations.
Proceedings of the 3rd International Conference on Principles of Knowledge Representation and Reasoning (KR'92). Cambridge, 1992

Learning an Optimally Accurate Representation System.
Proceedings of the Foundation of Knowledge Representation and Reasoning [the book grew out of an ECAI-92 workshop], 1992

1989
Representational Difficulties with Classifier Systems.
Proceedings of the 3rd International Conference on Genetic Algorithms, 1989


  Loading...