Mrinmaya Sachan

Orcid: 0000-0001-8787-8681

According to our database1, Mrinmaya Sachan authored at least 148 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Variational Classification: A Probabilistic Generalization of the Softmax Classifier.
Trans. Mach. Learn. Res., 2024

SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning.
CoRR, 2024

SMART: Self-learning Meta-strategy Agent for Reasoning Tasks.
CoRR, 2024

MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs.
CoRR, 2024

LLM-based Cognitive Models of Students with Misconceptions.
CoRR, 2024

Towards the Pedagogical Steering of Large Language Models for Tutoring: A Case Study with Modeling Productive Failure.
CoRR, 2024

Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge Tracing.
CoRR, 2024

Do Vision-Language Models Really Understand Visual Language?
CoRR, 2024

GPT-4 as a Homework Tutor can Improve Student Engagement and Learning Outcomes.
CoRR, 2024

Multilingual Trolley Problems for Language Models.
CoRR, 2024

Confidence Regulation Neurons in Language Models.
CoRR, 2024

DIRAS: Efficient LLM-Assisted Annotation of Document Relevance in Retrieval Augmented Generation.
CoRR, 2024

AI-Assisted Human Evaluation of Machine Translation.
CoRR, 2024

On Affine Homotopy between Language Encoders.
CoRR, 2024

CausalQuest: Collecting Natural Causal Questions for AI Agents.
CoRR, 2024

NL2FOL: Translating Natural Language to First-Order Logic for Logical Fallacy Detection.
CoRR, 2024

Cooperate or Collapse: Emergence of Sustainability Behaviors in a Society of LLM Agents.
CoRR, 2024

On the Causal Nature of Sentiment Analysis.
CoRR, 2024

Calibrating Large Language Models with Sample Consistency.
CoRR, 2024

Scaling the Authoring of AutoTutors with Large Language Models.
CoRR, 2024

Error Span Annotation: A Balanced Approach for Human Evaluation of Machine Translation.
Proceedings of the Ninth Conference on Machine Translation, 2024

The ART of LLM Refinement: Ask, Refine, and Trust.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

A Transformer with Stack Attention.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Elastic Weight Removal for Faithful and Abstractive Dialogue Generation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails.
Proceedings of the Eleventh ACM Conference on Learning @ Scale, 2024

Slicing, Chatting, and Refining: A Concept-Based Approach for Machine Learning Model Validation with ConceptSlicer.
Proceedings of the 29th International Conference on Intelligent User Interfaces, 2024

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Can Large Language Models Infer Causation from Correlation?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Do LLMs Think Fast and Slow? A Causal Study on Sentiment Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Towards Aligning Language Models with Textual Feedback.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Efficiently Computing Susceptibility to Context in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Implicit Personalization in Language Models: A Systematic Study.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

RETRO-LI: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

PWESuite: Phonetic Word Embeddings and Tasks They Facilitate.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

RELIC: Investigating Large Language Model Responses using Self-Consistency.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

Book2Dial: Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

What Do Language Models Learn in Context? The Structured Task Hypothesis.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

How to Engage your Readers? Generating Guiding Questions to Promote Active Reading.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CausalCite: A Causal Formulation of Paper Citations.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Navigating the Ocean of Biases: Political Bias Attribution in Language Models via Causal Structures.
CoRR, 2023

CausalCite: A Causal Formulation of Paper Citations.
CoRR, 2023

Agents: An Open-source Framework for Autonomous Language Agents.
CoRR, 2023

Learning the String Partial Order.
CoRR, 2023

Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis.
CoRR, 2023

All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations.
CoRR, 2023

RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text.
CoRR, 2023

Re-visiting Automated Topic Model Evaluation with Large Language Models.
CoRR, 2023

Efficient Prompting via Dynamic In-Context Learning.
CoRR, 2023

Discourse Centric Evaluation of Machine Translation with a Densely Annotated Parallel Corpus.
CoRR, 2023

Variational Classification.
CoRR, 2023

Beyond Good Intentions: Reporting the Research Landscape of NLP for Social Good.
CoRR, 2023

Psychologically-Inspired Causal Prompts.
CoRR, 2023

PWESuite: Phonetic Word Embeddings and Tasks They Facilitate.
CoRR, 2023

CLadder: A Benchmark to Assess Causal Reasoning Capabilities of Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Controlled Text Generation with Natural Language Instructions.
Proceedings of the International Conference on Machine Learning, 2023

Infusing Lattice Symmetry Priors in Attention Mechanisms for Sample-Efficient Abstract Geometric Reasoning.
Proceedings of the International Conference on Machine Learning, 2023

Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Revisiting Automated Topic Model Evaluation with Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Enhancing Textbooks with Visuals from the Web for Improved Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Diachronic Perspective on User Trust in AI under Uncertainty.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Beyond Good Intentions: Reporting the Research Landscape of NLP for Social Good.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Strategize Before Teaching: A Conversational Tutoring System with Pedagogy Self-Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Longtonotes: OntoNotes with Longer Coreference Chains.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Opportunities and Challenges in Neural Dialog Tutoring.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Automatic Educational Question Generation with Difficulty Level Controls.
Proceedings of the Artificial Intelligence in Education - 24th International Conference, 2023

A Formal Perspective on Byte-Pair Encoding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Tokenization and the Noiseless Channel.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Distilling Reasoning Capabilities into Smaller Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

World Models for Math Story Problems.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

When Does Aggregating Multiple Skills with Multi-Task Learning Work? A Case Study in Financial NLP.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Membership Inference Attacks against Language Models via Neighbourhood Comparison.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

XDailyDialog: A Multilingual Parallel Dialogue Corpus.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Discourse-Centric Evaluation of Document-level Machine Translation with a New Densely Annotated Parallel Corpus of Novels.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Adaptive and Personalized Exercise Generation for Online Language Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Learning the Transformer Kernel.
Trans. Mach. Learn. Res., 2022

Understanding Stereotypes in Language Models: Towards Robust Measurement and Zero-Shot Debiasing.
CoRR, 2022

Distilling Multi-Step Reasoning Capabilities of Large Language Models into Smaller Models via Semantic Decompositions.
CoRR, 2022

Autoregressive Structured Prediction with Language Models.
CoRR, 2022

Investigating the Role of Centering Theory in the Context of Neural Coreference Resolution Systems.
CoRR, 2022

A Bilingual Parallel Corpus with Discourse Annotations.
CoRR, 2022

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment.
CoRR, 2022

Understanding Knowledge Integration in Language Models with Graph Convolutions.
CoRR, 2022

A Simple Unsupervised Approach for Coreference Resolution using Rule-based Weak Supervision.
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, 2022

When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Original or Translated? A Causal Analysis of the Impact of Translationese on Machine Translation Performance.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Self-Supervised Contrastive Learning with Adversarial Perturbations for Defending Word Substitution-based Attacks.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

A Structured Span Selector.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Probing via Prompting.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Case-based reasoning for better generalization in textual reinforcement learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Automatic Generation of Socratic Subquestions for Teaching Math Word Problems.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Differentially Private Language Models for Secure Data Sharing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Autoregressive Structured Prediction with Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Logical Fallacy Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Adapters for Enhanced Modeling of Multilingual Knowledge and Text.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

What Has Been Enhanced in my Knowledge-Enhanced Language Model?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Beyond prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Representations.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Slangvolution: A Causal Analysis of Semantic Change and Frequency Dynamics in Slang.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Calibration of Machine Reading Systems at Scale.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Deep Clustering of Text Representations for Supervision-Free Probing of Syntax.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Differentiable Subset Pruning of Transformer Heads.
Trans. Assoc. Comput. Linguistics, 2021

Case-based Reasoning for Better Generalization in Text-Adventure Games.
CoRR, 2021

Self-Supervised Contrastive Learning with Adversarial Perturbations for Robust Pretrained Language Models.
CoRR, 2021

Causal Direction of Data Collection Matters: Implications of Causal and Anticausal Learning for NLP.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

"Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Scaling Within Document Coreference to Long Texts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Efficient Text-based Reinforcement Learning by Jointly Leveraging State and Commonsense Graph Representations.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

How Good Is NLP? A Sober Look at NLP Tasks through the Lens of Social Impact.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Bird's Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Towards Literate Artificial Intelligence.
PhD thesis, 2020

Clustering Contextualized Representations of Text for Unsupervised Syntax Induction.
CoRR, 2020

Stronger Transformers for Neural Multi-Hop Question Generation.
CoRR, 2020

Enhancing Text-based Reinforcement Learning Agents with Commonsense Knowledge.
CoRR, 2020

Knowledge Graph Embedding Compression.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

2019
Discourse in Multimedia: A Case Study in Extracting Geometry Knowledge from Textbooks.
Comput. Linguistics, 2019

2018
Discourse in Multimedia: A Case Study in Information Extraction.
CoRR, 2018

Learning Pipelines with Limited Data and Domain Knowledge: A Study in Parsing Physics Problems.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Self-Training for Jointly Learning to Ask and Answer Questions.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Effective Use of Bidirectional Language Modeling for Transfer Learning in Biomedical Named Entity Recognition.
Proceedings of the Machine Learning for Healthcare Conference, 2018

Parsing to Programs: A Framework for Situated QA.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Contextual Parameter Generation for Universal Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Learning to Solve Geometry Problems from Natural Language Demonstrations in Textbooks.
Proceedings of the 6th Joint Conference on Lexical and Computational Semantics, 2017

From Textbooks to Knowledge: A Case Study in Harvesting Axiomatic Knowledge from Textbooks to Solve Geometry Problems.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Grounding Topic Models with Knowledge Bases.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning Concept Taxonomies from Multi-modal Data.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Machine Comprehension using Rich Semantic Representations.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Easy Questions First? A Case Study on Curriculum Learning for Question Answering.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Science Question Answering using Instructional Materials.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
An Active Learning Approach to Coreference Resolution.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Learning Answer-Entailing Structures for Machine Comprehension.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Spatial compactness meets topical consistency: jointly modeling links and content for community detection.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

2013
Solving electrical networks to incorporate supervision in random walks.
Proceedings of the 22nd International World Wide Web Conference, 2013

Collective matrix factorization for co-clustering.
Proceedings of the 22nd International World Wide Web Conference, 2013

A Structured Distributional Semantic Model : Integrating Structure with Semantics.
Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality, 2013

A Structured Distributional Semantic Model for Event Co-reference.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Using content and interactions for discovering communities in social networks.
Proceedings of the 21st World Wide Web Conference 2012, 2012

2011
Using Text Reviews for Product Entity Completion.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Probabilistic model for discovering topic based communities in social networks.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011


  Loading...