Soumen Chakrabarti

Orcid: 0000-0002-9419-7486

Affiliations:
  • IIT Bombay
  • Google (2014-2016)
  • Carnegie-Mellon University (2004)
  • IBM Almaden Research Center (1996-1999)
  • University of California, Berkeley, USA (1991-1996)


According to our database1, Soumen Chakrabarti authored at least 172 papers between 1993 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning.
Trans. Mach. Learn. Res., 2024

Graph Regularized Encoder Training for Extreme Classification.
CoRR, 2024

Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Incomplete Gamma Integrals for Deep Cascade Prediction Using Content, Network, and Exogenous Signals.
IEEE Trans. Knowl. Data Eng., June, 2023

Transfer Learning for Low-Resource Multilingual Relation Classification.
ACM Trans. Asian Low Resour. Lang. Inf. Process., January, 2023

CoRE-CoG: Conversational Recommendation of Entities using Constrained Generation.
CoRR, 2023

Locality Sensitive Hashing in Fourier Frequency Domain For Soft Set Containment Search.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

TwiRGCN: Temporally Weighted Graph Convolution for Question Answering over Temporal Knowledge Graphs.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Entropy-guided Vocabulary Augmentation of Multilingual Language Models for Low-resource Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

mOKB6: A Multilingual Open Knowledge Base Completion Benchmark.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Multi-Row, Multi-Span Distant Supervision For Table+Text Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Structured Case-Based Reasoning for Inference-Time Adaptation of Text-to-SQL Parsers.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Semi-supervised Stance Detection of Tweets Via Distant Network Supervision.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Deep Knowledge Graph Representation Learning for Completion, Alignment, and Question Answering.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Maximum Common Subgraph Guided Graph Retrieval: Late and Early Interaction Networks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Neural Estimation of Submodular Functions with Applications to Differentiable Subset Selection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AIT-QA: Question Answering Dataset over Complex Tables in the Airline Industry.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2022

VarScene: A Deep Generative Model for Realistic Scene Graph Synthesis.
Proceedings of the International Conference on Machine Learning, 2022

Joint Completion and Alignment of Multilingual Knowledge Graphs.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Alignment-Augmented Consistent Translation for Multilingual Open Information Extraction.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Interpretable Neural Subgraph Matching for Graph Retrieval.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Multi-Instance Training for Question Answering Across Table and Linked Text.
CoRR, 2021

Multilingual Knowledge Graph Completion with Joint Relation and Entity Alignment.
CoRR, 2021

Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Active Assessment of Prediction Services as Accuracy Surface Over Attribute Combinations.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Redesigning the Transformer Architecture with Insights from Multi-particle Dynamical Systems.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Topic Transferable Table Question Answering.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Joint Autoregressive and Graph Models for Software and Developer Social Networks.
Proceedings of the Advances in Information Retrieval, 2021

A Data Bootstrapping Recipe for Low-Resource Multilingual Relation Classification.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Integrating Transductive and Inductive Embeddings Improves Link Prediction Accuracy.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Multilingual Knowledge Graph Completion With Joint Relation and Entity Alignment.
Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

Question Answering Over Temporal Knowledge Graphs.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Adversarial Permutation Guided Node Representations for Link Prediction.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Differentially Private Link Prediction with Protected Connections.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
On Computing Entity Relatedness in Wikipedia, with Applications.
Knowl. Based Syst., 2020

Analysis of reference and citation copying in evolving bibliographic networks.
J. Informetrics, 2020

Knowledge Base Completion: Baseline strikes back (Again).
CoRR, 2020

Interpretable Complex Question Answering.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Deep Neural Matching Models for Graph Retrieval.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Deep Exogenous and Endogenous Influence Combination for Social Chatter Intensity Prediction.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

NLP Service APIs and Models for Efficient Registration of New Clients.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

OpenIE6: Iterative Grid Labeling and Coordination Analysis for Open Information Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Temporal Knowledge Base Completion: New Algorithms and Evaluation Protocols.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Kauwa-Kaate Fake News Detection System: Demo.
Proceedings of the CoDS-COMAD 2020: 7th ACM IKDD CoDS and 25th COMAD, 2020

IMoJIE: Iterative Memory-Based Joint Open Information Extraction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Learning Linear Influence Models in Social Networks from Transient Opinion Dynamics.
ACM Trans. Web, 2019

Complex Program Induction for Querying Knowledge Bases in the Absence of Gold Programs.
Trans. Assoc. Comput. Linguistics, 2019

Neural architecture for question answering using a knowledge graph and web corpus.
Inf. Retr. J., 2019

Scene Graph based Image Retrieval - A case study on the CLEVR Dataset.
CoRR, 2019

Privacy Preserving Link Prediction with Latent Geometric Network Models.
CoRR, 2019

Multi-task Learning for Target-Dependent Sentiment Classification.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2019

A Deep Generative Model for Code Switched Text.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Neural Program Induction for KBQA Without Gold Programs or Query Annotations.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Automated Early Leaderboard Generation from Comparative Tables.
Proceedings of the Advances in Information Retrieval, 2019

Improved Sentiment Detection via Label Transfer from Monolingual to Synthetic Code-Switched Text.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Topic Sensitive Attention on Generic Corpora Corrects Sense Bias in Pretrained Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

GIRNet: Interleaved Multi-Task Recurrent State Sequence Models.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Focused Web Crawling.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

"Open-domain question answering using a knowledge graph and web corpus" by Uma Sawant, Soumen Chakrabarti and Ganesh Ramakrishnan with Martin Vesely as coordinator.
SIGWEB Newsl., 2018

Ranking State-of-the-art Papers via Incomplete Tournaments Induced by Citations from Performance Tables.
CoRR, 2018

New Embedded Representations and Evaluation Protocols for Inferring Transitive Relations.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Knowledge Extraction and Inference from Text: Shallow, Deep, and Everything in Between.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Mitigating the Effect of Out-of-Vocabulary Entity Pairs in Matrix Factorization for KB Inference.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Generalizing Across Domains via Cross-Gradient Training.
Proceedings of the 6th International Conference on Learning Representations, 2018

Type-Sensitive Knowledge Base Inference Without Explicit Type Supervision.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Task-Specific Representation Learning for Web-Scale Entity Disambiguation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
More Accurate Entity Ranking Using Knowledge Graph and Web Corpus.
CoRR, 2017

Joint Matrix-Tensor Factorization for Knowledge Base Inference.
CoRR, 2017

Relay-Linking Models for Prominence and Obsolescence in Evolving Networks.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Earth Mover's Distance Pooling over Siamese LSTMs for Automatic Short Answer Grading.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

A Two-Stage Framework for Computing Entity Relatedness in Wikipedia.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
Discriminative Link Prediction using Local, Community, and Global Signals.
IEEE Trans. Knowl. Data Eng., 2016

Collective Entity Resolution with Multi-Focal Attention.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Sic Transit Gloria Manuscriptum: Two Views of the Aggregate Fate of Ancient Papers.
CoRR, 2015

2014
Open-domain quantity queries on web tables: annotation, response, and consensus models.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Knowledge Graph and Corpus Driven Segmentation and Answer Inference for Telegraphic Entity-seeking Queries.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Learning a Linear Influence Model from Transient Opinion Dynamics.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

2013
Data-based research at IIT Bombay.
SIGMOD Rec., 2013

Features and Aggregators for Web-scale Entity Search
CoRR, 2013

Learning joint query interpretation and response ranking.
Proceedings of the 22nd International World Wide Web Conference, 2013

Discriminative Link Prediction Using Local Links, Node Features and Community Structure.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Web-scale entity annotation using MapReduce.
Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

Joint Bootstrapping of Corpus Annotations and Entity Types.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012
Compressed data structures for annotated web search.
Proceedings of the 21st World Wide Web Conference 2012, 2012

2011
Index design and query processing for graph conductance search.
VLDB J., 2011

Web-scale entity-relation search architecture.
Proceedings of the 20th International Conference on World Wide Web, 2011

SCAD: collective discovery of attribute values.
Proceedings of the 20th International Conference on World Wide Web, 2011

Social media: source of information or bunch of noise.
Proceedings of the 20th International Conference on World Wide Web, 2011

Diversity in ranking via resistive graph centers.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

2010
Annotating and Searching Web Tables Using Entities, Types and Relationships.
Proc. VLDB Endow., 2010

Enhancing Search with Structure.
IEEE Data Eng. Bull., 2010

Making Web-scale Entity-Relationship Search a Reality.
Proceedings of the 16th International Conference on Management of Data, 2010

2009
Focused Web Crawling.
Proceedings of the Encyclopedia of Database Systems, 2009

Learning to rank for quantity consensus queries.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Collective annotation of Wikipedia entities in web text.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Conditional Models for Non-smooth Ranking Loss Functions.
Proceedings of the ICDM 2009, 2009

2008
New closed-form bounds on the partition function.
Mach. Learn., 2008

Fast algorithms for topk personalized pagerank queries.
Proceedings of the 17th International Conference on World Wide Web, 2008

Structured learning for non-smooth ranking losses.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Index Design for Dynamic Personalized PageRank.
Proceedings of the 24th International Conference on Data Engineering, 2008


2007
The Influence of Search Engines on Preferential Attachment.
Internet Math., 2007

Learning to Rank in Vector Spaces and Social Networks.
Internet Math., 2007

Dynamic personalized pagerank in entity-relation graphs.
Proceedings of the 16th International Conference on World Wide Web, 2007

Learning random walks to rank nodes in graphs.
Proceedings of the Machine Learning, 2007

2006
Optimizing scoring functions and indexes for proximity search in type-annotated corpora.
Proceedings of the 15th international conference on World Wide Web, 2006

Learning Parameters in Entity Relationship Graphs from Ranking Preferences.
Proceedings of the Knowledge Discovery in Databases: PKDD 2006, 2006

Learning to rank networked entities.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Accelerating Newton Optimization for Log-Linear Models through Feature Redundancy.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

2005
Exploiting the dynamic networking effects of the web.
Proceedings of the 14th international conference on World Wide Web, 2005

Shuffling a Stacked Deck: The Case for Partially Randomized Ranking of Search Engine Results.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Bidirectional Expansion For Keyword Search on Graph Databases.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

SPIN: searching personal information networks.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Enhanced Answer Type Inference from Questions using Sequential Models.
Proceedings of the HLT/EMNLP 2005, 2005

2004
Guest Editors' Introduction: Special Section on Mining and Searching the Web.
IEEE Trans. Knowl. Data Eng., 2004

Is question answering an acquired skill?
Proceedings of the 13th international conference on World Wide Web, 2004

Web-CAM: Monitoring the dynamic Web to respond to Continual Queries.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

HIClass: Hyper-interactive Text Classification by Interactive Supervision of Document and Term Labels.
Proceedings of the Knowledge Discovery in Databases: PKDD 2004, 2004

Document Classification Through Interactive Supervision of Document and Term Labels.
Proceedings of the Knowledge Discovery in Databases: PKDD 2004, 2004

Breaking Through the Syntax Barrier: Searching with Entities and Relations.
Proceedings of the Knowledge Discovery in Databases: PKDD 2004, 2004

Discovering Links Between Lexical and Surface Features in Questions and Answers.
Proceedings of the Advances in Web Mining and Web Usage Analysis, 2004

2003
Fast and accurate text classification via multiple linear discriminant projections pp.
VLDB J., 2003

Monitoring the dynamic web to respond to continuous queries.
Proceedings of the Twelfth International World Wide Web Conference, 2003

Cross-training: learning probabilistic mappings between topics.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

User Interaction in the BANKS System.
Proceedings of the 19th International Conference on Data Engineering, 2003

Mining the web - discovering knowledge from hypertext data.
Morgan Kaufmann, ISBN: 978-1-55860-754-5, 2003

2002
Analyzing Fine-grained Hypertext Features for Enhanced Crawling and Topic Distillation.
IEEE Data Eng. Bull., 2002

Accelerated focused crawling through online relevance feedback.
Proceedings of the Eleventh International World Wide Web Conference, 2002

The structure of broad topics on the web.
Proceedings of the Eleventh International World Wide Web Conference, 2002

Fast and Accurate Text Classification via Multiple Linear Discriminant Projections.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

Searching and Mining Fine-Grained Semi-Structured Data.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

BANKS: Browsing and Keyword Searching in Relational Databases.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

Scaling multi-class support vector machines using inter-class confusion.
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002

Keyword Searching and Browsing in Databases using BANKS.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

2001
Keeyword Search in Databases.
IEEE Data Eng. Bull., 2001

Integrating the document object model with hyperlinks for enhanced topic distillation and information extraction.
Proceedings of the Tenth International World Wide Web Conference, 2001

Enhanced Topic Distillation Using Text, Markup Tags, and Hyperlinks.
Proceedings of the SIGIR 2001: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2001

Hubs and Authorities: Spreading Out and Zooming In.
Proceedings of the First International Workshop on Web Dynamics, 2001

2000
Data mining for hypertext: A tutorial survey.
SIGKDD Explor., 2000

Using Memex to archive and mine community Web browsing experience.
Comput. Networks, 2000

Memex: A Browsing Assistant for Collaborative Archiving and Mining of Surf Trails.
Proceedings of the VLDB 2000, 2000

Hypertext data mining (tutorial AM-1).
Proceedings of the Tutorial notes of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

1999
Recent results in automatic Web resource discovery.
ACM Comput. Surv., 1999

Mining the Web's Link Structure.
Computer, 1999

Surfing the Web Backwards.
Comput. Networks, 1999

Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery.
Comput. Networks, 1999

Topic Distillation and Spectral Filtering.
Artif. Intell. Rev., 1999

Distributed Hypertext Resource Discovery Through Examples.
Proceedings of the VLDB'99, 1999

Hypertext Databases and Data Mining.
Proceedings of the SIGMOD 1999, 1999

1998
Scalable Feature Selection, Classification and Signature Generation for Organizing Large Text Databases into Hierarchical Topic Taxonomies.
VLDB J., 1998

Parallel randomized load balancing.
Random Struct. Algorithms, 1998

Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text.
Comput. Networks, 1998

Mining Surprising Patterns Using Temporal Description Length.
Proceedings of the VLDB'98, 1998

Flow and Stretch Metrics for Scheduling Continuous Job Streams.
Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, 1998

Enhanced Hypertext Categorization Using Hyperlinks.
Proceedings of the SIGMOD 1998, 1998

1997
Models and Scheduling Algorithms for Mixed Data and Task Parallel Programs.
J. Parallel Distributed Comput., 1997

Using Taxonomy, Discriminants, and Signatures for Navigating in Text Databases.
Proceedings of the VLDB'97, 1997

1996
Random Allocation of Jobs with Weights and Precedence.
Theor. Comput. Sci., 1996

Resource Scheduling for Parallel Database and Scientific Applications.
Proceedings of the 8th Annual ACM Symposium on Parallel Algorithms and Architectures, 1996

Global Communication Analysis and Optimization.
Proceedings of the ACM SIGPLAN'96 Conference on Programming Language Design and Implementation (PLDI), 1996

Improved Scheduling Algorithms for Minsum Criteria.
Proceedings of the Automata, Languages and Programming, 23rd International Colloquium, 1996

1995
Parallel randomized load balancing (Preliminary Version).
Proceedings of the Twenty-Seventh Annual ACM Symposium on Theory of Computing, 1995

Modeling the Benefits of Mixed Data and Task Parallelism.
Proceedings of the 7th Annual ACM Symposium on Parallel Algorithms and Architectures, 1995

Portable Parallel Irregular Applications.
Proceedings of the Parallel Symbolic Languages and Systems, 1995

Runtime Support for Portable Distributed Data Structures.
Proceedings of the Languages, 1995

1994
Distributed Data Structures and Algorithms for Gröbner Basis Computation.
LISP Symb. Comput., 1994

Adaptive Control for Packet Video.
Proceedings of the International Conference on Multimedia Computing and Systems, 1994

1993
On the Correctness of a Distributed Memory Gröbner basis Algorithm.
Proceedings of the Rewriting Techniques and Applications, 5th International Conference, 1993

Implementing an Irregular Application on a Distributed Memory Multiprocessor.
Proceedings of the Fourth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1993


  Loading...