Oriol Vinyals

Orcid: 0000-0001-7848-7283

According to our database1, Oriol Vinyals authored at least 160 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A Practitioner's Guide to Continual Multimodal Pretraining.
CoRR, 2024

Capabilities of Gemini Models in Medicine.
CoRR, 2024

Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Faster sorting algorithms discovered using deep reinforcement learning.
Nat., 2023

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning.
CoRR, 2023

Optimizing Memory Mapping Using Deep Reinforcement Learning.
CoRR, 2023

Waffling around for Performance: Visual Classification with Random Words and Broad Concepts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Emergent Abilities of Large Language Models.
Trans. Mach. Learn. Res., 2022

A Generalist Agent.
Trans. Mach. Learn. Res., 2022

Guest Editorial: Non-Euclidean Machine Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deep Audio-Visual Speech Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

GraphCast: Learning skillful medium-range global weather forecasting.
CoRR, 2022

Flamingo: a Visual Language Model for Few-Shot Learning.
CoRR, 2022

Training Compute-Optimal Large Language Models.
CoRR, 2022

Competition-Level Code Generation with AlphaCode.
CoRR, 2022

Hierarchical Perceiver.
CoRR, 2022

MuZero with Self-competition for Rate Control in VP9 Video Compression.
CoRR, 2022

Unified Scaling Laws for Routed Language Models.
CoRR, 2022

An empirical analysis of compute-optimal large language model training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022


General-purpose, long-context autoregressive modeling with Perceiver AR.
Proceedings of the International Conference on Machine Learning, 2022



Perceiver IO: A General Architecture for Structured Inputs & Outputs.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Integrating Language Guidance into Vision-based Deep Metric Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Non-isotropy Regularization for Proxy-based Deep Metric Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Scaling Language Models: Methods, Analysis & Insights from Training Gopher.
CoRR, 2021

WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset.
CoRR, 2021

The Benchmark Lottery.
CoRR, 2021

The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors.
CoRR, 2021

Understanding deep learning (still) requires rethinking generalization.
Commun. ACM, 2021

Multimodal Few-Shot Learning with Frozen Language Models.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Vector Quantized Models for Planning.
Proceedings of the 38th International Conference on Machine Learning, 2021

Perceiver: General Perception with Iterative Attention.
Proceedings of the 38th International Conference on Machine Learning, 2021

Efficient Visual Pretraining with Contrastive Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Machine Translation Decoding beyond Beam Search.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Solving Mixed Integer Programs Using Neural Networks.
CoRR, 2020

AlignNet: Unsupervised Entity Alignment.
CoRR, 2020

Strong Generalization and Efficiency in Neural Programs.
CoRR, 2020

Pointer Graph Networks.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML.
Proceedings of the 8th International Conference on Learning Representations, 2020

Reinforced Genetic Algorithm Learning for Optimizing Computation Graphs.
Proceedings of the 8th International Conference on Learning Representations, 2020

2019
Grandmaster level in StarCraft II using multi-agent reinforcement learning.
Nat., 2019

Unsupervised Doodling and Painting with Improved SPIRAL.
CoRR, 2019

REGAL: Transfer Learning For Fast Optimization of Computation Graphs.
CoRR, 2019

Generating Diverse High-Fidelity Images with VQ-VAE-2.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Classification Accuracy Score for Conditional Generative Models.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning.
Proceedings of the NeurIPS 2019 Competition and Demonstration Track, 2019

Graph Matching Networks for Learning the Similarity of Graph Structured Objects.
Proceedings of the 36th International Conference on Machine Learning, 2019

Deep reinforcement learning with relational inductive biases.
Proceedings of the 7th International Conference on Learning Representations, 2019

Meta-Learning with Latent Embedding Optimization.
Proceedings of the 7th International Conference on Learning Representations, 2019

Generating Diverse High-Resolution Images with VQ-VAE.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Preventing Posterior Collapse with delta-VAEs.
Proceedings of the 7th International Conference on Learning Representations, 2019

Attentive Neural Processes.
Proceedings of the 7th International Conference on Learning Representations, 2019

Universal Transformers.
Proceedings of the 7th International Conference on Learning Representations, 2019

Sample Efficient Adaptive Text-to-Speech.
Proceedings of the 7th International Conference on Learning Representations, 2019

Low Bit-rate Speech Coding with VQ-VAE and a WaveNet Decoder.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning.
CoRR, 2018

Representation Learning with Contrastive Predictive Coding.
CoRR, 2018

Relational Deep Reinforcement Learning.
CoRR, 2018

Relational inductive biases, deep learning, and graph networks.
CoRR, 2018

A Study on Overfitting in Deep Reinforcement Learning.
CoRR, 2018

Learning Deep Generative Models of Graphs.
CoRR, 2018

Learning Fast Optimizers for Contextual Stochastic Integer Programs.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Relational recurrent neural networks.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Learning Implicit Generative Models with the Method of Learned Moments.
Proceedings of the 35th International Conference on Machine Learning, 2018


Learning to Search with MCTSnets.
Proceedings of the 35th International Conference on Machine Learning, 2018

Synthesizing Programs for Images using Reinforced Adversarial Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Memory-based Parameter Adaptation.
Proceedings of the 6th International Conference on Learning Representations, 2018

Few-shot Autoregressive Density Estimation: Towards Learning to Learn Distributions.
Proceedings of the 6th International Conference on Learning Representations, 2018

Hierarchical Representations for Efficient Architecture Search.
Proceedings of the 6th International Conference on Learning Representations, 2018

Temporal Modeling Using Dilated Convolution and Gating for Voice-Activity-Detection.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Population Based Training of Neural Networks.
CoRR, 2017

StarCraft II: A New Challenge for Reinforcement Learning.
CoRR, 2017

Imagination-Augmented Agents for Deep Reinforcement Learning.
CoRR, 2017

Learning model-based planning from scratch.
CoRR, 2017

Adversarial Evaluation of Dialogue Models.
CoRR, 2017

Bayesian Recurrent Neural Networks.
CoRR, 2017

Imagination-Augmented Agents for Deep Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Neural Discrete Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Neural Episodic Control.
Proceedings of the 34th International Conference on Machine Learning, 2017

Video Pixel Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

Decoupled Neural Interfaces using Synthetic Gradients.
Proceedings of the 34th International Conference on Machine Learning, 2017

Neural Message Passing for Quantum Chemistry.
Proceedings of the 34th International Conference on Machine Learning, 2017

Understanding Synthetic Gradients and Decoupled Neural Interfaces.
Proceedings of the 34th International Conference on Machine Learning, 2017

Understanding deep learning requires rethinking generalization.
Proceedings of the 5th International Conference on Learning Representations, 2017

Metacontrol for Adaptive Imagination-Based Optimization.
Proceedings of the 5th International Conference on Learning Representations, 2017

Lip Reading Sentences in the Wild.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation.
CoRR, 2016

Order Matters: Sequence to sequence for sets.
Proceedings of the 4th International Conference on Learning Representations, 2016

Connecting Generative Adversarial Networks and Actor-Critic Methods.
CoRR, 2016

Multi-task Sequence to Sequence Learning.
Proceedings of the 4th International Conference on Learning Representations, 2016

Exploring the Limits of Language Modeling.
CoRR, 2016

Decoupled Neural Interfaces using Synthetic Gradients.
CoRR, 2016

Contextual LSTM (CLSTM) models for Large scale NLP tasks.
CoRR, 2016

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems.
CoRR, 2016

WaveNet: A Generative Model for Raw Audio.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Matching Networks for One Shot Learning.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Strategic Attentive Writer for Learning Macro-Actions.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Conditional Image Generation with PixelCNN Decoders.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

An Online Sequence-to-Sequence Model Using Partial Conditioning.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Multilingual Language Processing From Bytes.
Proceedings of the NAACL HLT 2016, 2016

Listen, attend and spell: A neural network for large vocabulary conversational speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Generating Sentences from a Continuous Space.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

2015
A Neural Conversational Model.
CoRR, 2015

Towards Principled Unsupervised Learning.
CoRR, 2015

An Online Sequence-to-Sequence Model Using Partial Conditioning.
CoRR, 2015

Distilling the Knowledge in a Neural Network.
CoRR, 2015

Qualitatively characterizing neural network optimization problems.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Listen, Attend and Spell.
CoRR, 2015

Grammar as a Foreign Language.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Pointer Networks.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Learning the speech front-end with raw waveform CLDNNs.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Sentence Compression by Deletion with LSTMs.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Show and tell: A neural image caption generator.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Beyond short snippets: Deep networks for video classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Addressing the Rare Word Problem in Neural Machine Translation.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Recurrent Neural Network Regularization.
CoRR, 2014

Sequence to Sequence Learning with Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Sequence discriminative distributed training of long short-term memory recurrent neural networks.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition.
Proceedings of the 31th International Conference on Machine Learning, 2014

Chasing the metric: Smoothing learning algorithms for keyword detection.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Beyond Deep Learning: Scalable Methods and Models for Learning.
PhD thesis, 2013

Pooling-Invariant Image Feature Learning
CoRR, 2013

Why Size Matters: Feature Coding as Nystrom Sampling
Proceedings of the 1st International Conference on Learning Representations, 2013

Deep vs. wide: depth on a budget for robust speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

On Compact Codes for Spatially Pooled Features.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
Speaker Diarization: A Review of Recent Research.
IEEE Trans. Speech Audio Process., 2012

The ICSI RT-09 Speaker Diarization System.
IEEE Trans. Speech Audio Process., 2012

Krylov Subspace Descent for Deep Learning.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Learning with Recursive Perceptual Representations.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Are Sparse Representations Rich Enough for Acoustic Modeling?
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Recurrent Neural Networks for Noise Reduction in Robust ASR.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Feature learning using Generalized Extreme Value distribution based K-means clustering.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Learning speaker, addressee and overlap detection models from multimodal streams.
Proceedings of the International Conference on Multimodal Interaction, 2012

Revisiting Recurrent Neural Networks for robust ASR.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Improved Overlapped Speech Handling for Speaker Diarization.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Comparing multilayer perceptron to Deep Belief Network Tandem features for robust ASR.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Multimodal Indoor Localization: An Audio-Wireless-Based Approach.
Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), 2010

Precise indoor localization using smart phones.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Multimodal location estimation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Discriminative training for hierarchical clustering in speaker diarization.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A hybrid approach to online speaker diarization.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

System output combination for improved speaker diarization.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Prosodic and other Long-Term Features for Speaker Diarization.
IEEE Trans. Speech Audio Process., 2009

Discriminative pronounciation learning using phonetic decoder and minimum-classification-error criterion.
Proceedings of the IEEE International Conference on Acoustics, 2009

Fusing short term and long term features for improved speaker diarization.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Towards Semantic Analysis of Conversations: A System for the Live Identification of Speakers in Meetings.
Proceedings of the 2th IEEE International Conference on Semantic Computing (ICSC 2008), 2008

Live speaker identification in conversations.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

A Hardware-Independent Fast Logarithm Approximation with Adjustable Accuracy.
Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008

Modulation spectrogram features for improved speaker diarization.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Overlapped speech detection for improved speaker diarization in multiparty meetings.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Parameterized kernels for support vector machine classification.
Proceedings of the VISAPP 2007: Proceedings of the Second International Conference on Computer Vision Theory and Applications, Barcelona, Spain, March 8-11, 2007, 2007

Learning Kernel Expansions for Image Classification.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

A fast-match approach for robust, faster than real-time speaker diarization.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007


  Loading...