Richard Socher

Affiliations:
  • You.com
  • Salesforce Research, USA


According to our database1, Richard Socher authored at least 160 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Large Language Models: A Survey.
CoRR, 2024

2023
Author Correction: Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials.
npj Digit. Medicine, 2023

2022
Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials.
npj Digit. Medicine, 2022

Converse: A Tree-Based Modular Task-Oriented Dialogue System.
CoRR, 2022

2021
SummEval: Re-evaluating Summarization Evaluation.
Trans. Assoc. Comput. Linguistics, 2021

Biological data annotation via a human-augmenting AI-based labeling system.
npj Digit. Medicine, 2021

COVID-19 information retrieval with deep-learning based semantic search, question answering, and abstractive summarization.
npj Digit. Medicine, 2021

Deep learning-enabled medical computer vision.
npj Digit. Medicine, 2021

The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning.
CoRR, 2021

Evaluating State-of-the-Art Classification Models Against Bayes Optimality.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

DART: Open-Domain Structured Data Record to Text Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization.
Proceedings of the 38th International Conference on Machine Learning, 2021

BERTology Meets Biology: Interpreting Attention in Protein Language Models.
Proceedings of the 9th International Conference on Learning Representations, 2021

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing.
Proceedings of the 9th International Conference on Learning Representations, 2021

GeDi: Generative Discriminator Guided Sequence Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Explaining and Improving Model Behavior with k Nearest Neighbor Representations.
CoRR, 2020

Explaining Creative Artifacts.
CoRR, 2020

Central Yup'ik and Machine Translation of Low-Resource Polysynthetic Languages.
CoRR, 2020

DART: Open-Domain Structured Data Record to Text Generation.
CoRR, 2020

CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization.
CoRR, 2020

EMT: Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading.
CoRR, 2020

Prototypical Contrastive Learning of Unsupervised Representations.
CoRR, 2020

The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies.
CoRR, 2020

ToD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogues.
CoRR, 2020

ProGen: Language Modeling for Protein Generation.
CoRR, 2020

Improving out-of-distribution generalization via multi-task self-supervised pretraining.
CoRR, 2020

Towards Noise-resistant Object Detection with Noisy Annotations.
CoRR, 2020

Neural Bayes: A Generic Parameterization Method for Unsupervised Representation Learning.
CoRR, 2020

Taylorized Training: Towards Better Approximation of Neural Network Training at Finite Width.
CoRR, 2020

Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking.
Proceedings of the Ninth Joint Conference on Lexical and Computational Semantics, 2020

Theory-Inspired Path-Regularized Differential Network Architecture Search.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Online Structured Meta-learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Simple Language Model for Task-Oriented Dialogue.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Towards Understanding Hierarchical Learning: Benefits of Neural Representations.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Limits of Detecting Text Generated by Large-Scale Language Models.
Proceedings of the Information Theory and Applications Workshop, 2020

An Investigation of Phone-Based Subword Units for End-to-End Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills.
Proceedings of the 37th International Conference on Machine Learning, 2020

Tree-Structured Attention with Hierarchical Accumulation.
Proceedings of the 8th International Conference on Learning Representations, 2020

DivideMix: Learning with Noisy Labels as Semi-supervised Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Non-Autoregressive Dialog State Tracking.
Proceedings of the 8th International Conference on Learning Representations, 2020

Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering.
Proceedings of the 8th International Conference on Learning Representations, 2020

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Simple Data Augmentation with the Mask Token Improves Domain Adaptation for Dialog Act Tagging.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Composed Variational Natural Language Generation for Few-shot Intents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Evaluating the Factual Consistency of Abstractive Text Summarization.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Thieves on Sesame Street are Polyglots - Extracting Multilingual Models from Monolingual APIs.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Learning From Noisy Anchors for One-Stage Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Assessing Local Generalization Capability in Deep Models.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Photon: A Robust Cross-Domain Text-to-SQL System.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

ESPRIT: Explaining Solutions to Physical Reasoning Tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

ERASER: A Benchmark to Evaluate Rationalized NLP Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Attentive Student Meets Multi-Task Teacher: Improved Knowledge Distillation for Pretrained Models.
CoRR, 2019

Sketch-Fill-A-R: A Persona-Grounded Chit-Chat Generation Framework.
CoRR, 2019

Global Capacity Measures for Deep ReLU Networks via Path Sampling.
CoRR, 2019

Entropy Penalty: Towards Generalization Beyond the IID Assumption.
CoRR, 2019

CTRL: A Conditional Transformer Language Model for Controllable Generation.
CoRR, 2019

Pretrained AI Models: Performativity, Mobility, and Change.
CoRR, 2019

Deleter: Leveraging BERT to Perform Unsupervised Successive Text Compression.
CoRR, 2019

Learning World Graphs to Accelerate Hierarchical Reinforcement Learning.
CoRR, 2019

XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering.
CoRR, 2019

Unifying Question Answering and Text Classification via Span Extraction.
CoRR, 2019

A High-Quality Multilingual Dataset for Structured Documentation Translation.
Proceedings of the Fourth Conference on Machine Translation, 2019

Genie: a generator of natural language semantic parsers for virtual assistant commands.
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2019

Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

On the Generalization Gap in Reparameterizable Reinforcement Learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Taming MAML: Efficient unbiased meta-reinforcement learning.
Proceedings of the 36th International Conference on Machine Learning, 2019

Learn to Grow: A Continual Structure Learning Framework for Overcoming Catastrophic Forgetting.
Proceedings of the 36th International Conference on Machine Learning, 2019

Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering.
Proceedings of the 7th International Conference on Learning Representations, 2019

Global-to-local Memory Pointer Networks for Task-Oriented Dialogue.
Proceedings of the 7th International Conference on Learning Representations, 2019

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation.
Proceedings of the 7th International Conference on Learning Representations, 2019

Competitive experience replay.
Proceedings of the 7th International Conference on Learning Representations, 2019

Augmented Cyclic Adversarial Learning for Low Resource Domain Adaptation.
Proceedings of the 7th International Conference on Learning Representations, 2019

A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation.
Proceedings of the 7th International Conference on Learning Representations, 2019

StartNet: Online Detection of Action Start in Untrimmed Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Neural Text Summarization: A Critical Evaluation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

WSLLN: Weakly Supervised Natural Language Localization Networks.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

AdaFrame: Adaptive Frame Selection for Fast Video Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

SParC: Cross-Domain Semantic Parsing in Context.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Explain Yourself! Leveraging Language Models for Commonsense Reasoning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

BERT is Not an Interlingua and the Bias of Tokenization.
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

2018
Identifying Generalization Properties in Neural Networks.
CoRR, 2018

Augmented Cyclic Adversarial Learning for Domain Adaptation.
CoRR, 2018

The Natural Language Decathlon: Multitask Learning as Question Answering.
CoRR, 2018

Using Mode Connectivity for Loss Landscape Analysis.
CoRR, 2018

Global-Locally Self-Attentive Dialogue State Tracker.
CoRR, 2018

An Analysis of Neural Language Modeling at Multiple Scales.
CoRR, 2018

A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

DCN+: Mixed Objective And Deep Residual Coattention for Question Answering.
Proceedings of the 6th International Conference on Learning Representations, 2018

Interpretable Counting for Visual Question Answering.
Proceedings of the 6th International Conference on Learning Representations, 2018

Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

A Flexible Approach to Automated RNN Architecture Generation.
Proceedings of the 6th International Conference on Learning Representations, 2018

A Deep Reinforced Model for Abstractive Summarization.
Proceedings of the 6th International Conference on Learning Representations, 2018

Regularizing and Optimizing LSTM Language Models.
Proceedings of the 6th International Conference on Learning Representations, 2018

Non-Autoregressive Neural Machine Translation.
Proceedings of the 6th International Conference on Learning Representations, 2018

Improving End-to-End Speech Recognition with Policy Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Hop Knowledge Graph Reasoning with Reward Shaping.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Improving Abstraction in Text Summarization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

End-to-End Dense Video Captioning With Masked Transformer.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Efficient and Robust Question Answering from Minimal Context over Documents.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Global-Locally Self-Attentive Encoder for Dialogue State Tracking.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Improving Generalization Performance by Switching from Adam to SGD.
CoRR, 2017

Block-diagonal Hessian-free Optimization for Training Neural Networks.
CoRR, 2017

Improved Regularization Techniques for End-to-End Speech Recognition.
CoRR, 2017

Weighted Transformer Network for Machine Translation.
CoRR, 2017

Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning.
CoRR, 2017

Revisiting Activation Regularization for Language RNNs.
CoRR, 2017

Learning when to skim and when to read.
Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

Learned in Translation: Contextualized Word Vectors.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Dynamic Coattention Networks For Question Answering.
Proceedings of the 5th International Conference on Learning Representations, 2017

Pointer Sentinel Mixture Models.
Proceedings of the 5th International Conference on Learning Representations, 2017

Tying Word Vectors and Word Classifiers: A Loss Framework for Language Modeling.
Proceedings of the 5th International Conference on Learning Representations, 2017

Quasi-Recurrent Neural Networks.
Proceedings of the 5th International Conference on Learning Representations, 2017

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Towards Neural Machine Translation with Latent Tree Attention.
Proceedings of the 2nd Workshop on Structured Prediction for Natural Language Processing, 2017

Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
A Way out of the Odyssey: Analyzing and Combining Recent Insights for LSTMs.
CoRR, 2016

MetaMind Neural Machine Translation System for WMT 2016.
Proceedings of the First Conference on Machine Translation, 2016

Deep Learning for Sentiment Analysis - Invited Talk.
Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, 2016

Dynamic Memory Networks for Visual and Textual Question Answering.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Ask Me Anything: Dynamic Memory Networks for Natural Language Processing.
Proceedings of the 33nd International Conference on Machine Learning, 2016

2015
Ask Me Anything: Dynamic Memory Networks for Natural Language Processing.
CoRR, 2015

Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Recursive deep learning for natural language processing and computer vision.
PhD thesis, 2014

Grounded Compositional Semantics for Finding and Describing Images with Sentences.
Trans. Assoc. Comput. Linguistics, 2014

Global Belief Recursive Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Scaling short-answer grading by combining peer assessment with algorithmic scoring.
Proceedings of the First (2014) ACM Conference on Learning @ Scale, 2014

Glove: Global Vectors for Word Representation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

A Neural Network for Factoid Question Answering over Paragraphs.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
Zero-Shot Learning Through Cross-Modal Transfer
Proceedings of the 1st International Conference on Learning Representations, 2013

Learning New Facts From Knowledge Bases With Neural Tensor Networks and Semantic Word Vectors
Proceedings of the 1st International Conference on Learning Representations, 2013

Zero-Shot Learning Through Cross-Modal Transfer.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Reasoning With Neural Tensor Networks for Knowledge Base Completion.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Deep Learning for NLP (without Magic).
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Bilingual Word Embeddings for Phrase-Based Machine Translation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Better Word Representations with Recursive Neural Networks for Morphology.
Proceedings of the Seventeenth Conference on Computational Natural Language Learning, 2013

Parsing with Compositional Vector Grammars.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Convolutional-Recursive Deep Learning for 3D Object Classification.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Semantic Compositionality through Recursive Matrix-Vector Spaces.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Deep Learning for NLP (without Magic).
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, 2012

Improving Word Representations via Global Context and Multiple Word Prototypes.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Spectral Chinese Restaurant Processes: Nonparametric Clustering Based on Similarities.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Parsing Natural Scenes and Natural Language with Recursive Neural Networks.
Proceedings of the 28th International Conference on Machine Learning, 2011

Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2010
Connecting modalities: Semi-supervised segmentation and annotation of images using unaligned text corpora.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
A Bayesian Analysis of Dynamics in Free Recall.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Towards total scene understanding: Classification, annotation and segmentation in an automatic framework.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

ImageNet: A large-scale hierarchical image database.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

2008
A learning based hierarchical model for vessel segmentation.
Proceedings of the 2008 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 2008

2007
Combining Contexts in Lexicon Learning for Semantic Parsing.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007


  Loading...