David A. Smith

ORCID: 0000-0002-6636-6940

According to our database, David A. Smith authored at least 98 papers between 1983 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of two.

Bibliography

2024
Mind the Gap: Analyzing Lacunae with Transformer-Based Transcription.
CoRR, 2024

Detecting Manuscript Annotations in Historical Print: Negative Evidence and Evaluation Metrics.
Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods, 2024

Retrieving and Analyzing Translations of American Newspaper Comics with Visual Evidence.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 Workshops, 2024

Self-training and Active Learning with Pseudo-relevance Feedback for Handwriting Detection in Historical Print.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

MONSTERMASH: Multidirectional, Overlapping, Nested, Spiral Text Extraction for Recognition Models of Arabic-Script Handwriting.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 Workshops, 2024

Mind the Gap: Analyzing Lacunae with Transformer-Based Transcription.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 Workshops, 2024

2023
Composition and Deformance: Measuring Imageability with a Text-to-Image Model.
CoRR, 2023

Adapting Transformer Language Models for Predictive Typing in Brain-Computer Interfaces.
CoRR, 2023

Automatic Collation for Diversifying Corpora: Commonly Copied Texts as Distant Supervision for Handwritten Text Recognition.
Proceedings of the Computational Humanities Research Conference 2023, 2023

Testing the Limits of Neural Sentence Alignment Models on Classical Greek and Latin Texts and Translations.
Proceedings of the Computational Humanities Research Conference 2023, 2023

2022
An Experiment in Live Collaborative Programming on the Croquet Shared Experience Platform.
Proceedings of the 6th International Conference on the Art, 2022

2021
Text mining Mill: Computationally detecting influence in the writings of John Stuart Mill from library records.
Digit. Scholarsh. Humanit., 2021

Recovering Lexically and Semantically Reused Texts.
Proceedings of *SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics, 2021

Digital Editions as Distant Supervision for Layout Analysis of Printed Books.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Content-based Models of Quotation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020
The Augmented Conversation and the Amplified World.
Proceedings of the UIST '20: The 33rd Annual ACM Symposium on User Interface Software and Technology, 2020

Source Attribution: Recovering the Press Releases Behind Health Science News.
Proceedings of the Fourteenth International AAAI Conference on Web and Social Media, 2020

Detecting de minimis Code-Switching in Historical German Books.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2018
Predicting News Coverage of Scientific Articles.
Proceedings of the Twelfth International Conference on Web and Social Media, 2018

Contrastive Training for Models of Information Cascades.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Multilingual Topic Models.
CoRR, 2017

A perspective from the long view: 35 Years in VR (Keynote).
Proceedings of the 2017 IEEE Virtual Reality, 2017

Retrieving and Combining Repeated Passages to Improve OCR.
Proceedings of the 2017 ACM/IEEE Joint Conference on Digital Libraries, 2017

What News is New?: Ads, Extras, and Viral Texts on the Nineteenth-Century Newspaper Page.
Proceedings of the 12th Annual International Conference of the Alliance of Digital Humanities Organizations, 2017

2016
First-fit coloring on interval graphs has performance ratio at least 5.
Eur. J. Comb., 2016

Online Multilingual Topic Models with Multi-Level Hyperpriors.
Proceedings of the NAACL HLT 2016, 2016

Bootstrapping Translation Detection and Sentence Extraction from Comparable Corpora.
Proceedings of the NAACL HLT 2016, 2016

Automatic Construction of Evaluation Sets and Evaluation of Document Similarity Models in Large Scholarly Retrieval Systems.
Proceedings of the Scholarly Big Data: AI Perspectives, 2016

2015
Evaluating Retrieval Models through Histogram Analysis.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

2014
Automatic suggestion of phrasal-concept queries for literature search.
Inf. Process. Manag., 2014

The Virtual World Framework: Collaborative virtual environments on the web.
Proceedings of the 2014 IEEE Virtual Reality, 2014

Detecting and modeling local text reuse.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014

Social Network Signatures of Effective Online Communication.
Proceedings of the Intelligent Tutoring Systems - 12th International Conference, 2014

Identifying Social Deliberative Behavior from Online Communication - A Cross-Domain Study.
Proceedings of the Twenty-Seventh International Florida Artificial Intelligence Research Society Conference, 2014

2013
Online Polylingual Topic Models for Fast Document Translation Detection.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Efficient Nearest-Neighbor Search in the Probability Simplex.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Using a Probabilistic Syllable Model to Improve Scene Text Recognition.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

Mining Social Deliberation in Online Communication - If You Were Me and I Were You.
Proceedings of the 6th International Conference on Educational Data Mining, 2013

Uncovering Reprinting Networks in Nineteenth-Century American Newspapers.
Proceedings of the 8th Annual International Conference of the Alliance of Digital Humanities Organizations, 2013

Infectious texts: Modeling text reuse in nineteenth-century newspapers.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

Signal classification of satellite-based recordings of radiofrequency (RF) transients using data-adaptive dictionaries.
Proceedings of the 2013 Asilomar Conference on Signals, 2013

2012
Extracting two thousand years of Latin from a million book library.
ACM Journal on Computing and Cultural Heritage, 2012

A framework for manipulating and searching multiple retrieval types.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Parse, Price and Cut--Delayed Column and Row Generation for Graph Based Parsers.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Improving NLP through Marginalization of Hidden Syntactic Structure.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Grammarless Parsing for Joint Inference.
Proceedings of the COLING 2012, 2012

A Dictionary of Wisdom and Wit: Learning to Extract Quotable Phrases.
Proceedings of the Workshop on Computational Linguistics for Literature, 2012

Discovering Factions in the Computational Linguistics Community.
Proceedings of the Special Workshop on Rediscovering 50 Years of Discoveries@ACL 2012, 2012

2011
Online community search using conversational structures.
Inf. Retr., 2011

Learning on the fly: a font-free approach toward multilingual OCR.
Int. J. Document Anal. Recognit., 2011

A Minimally Supervised Approach for Detecting and Ranking Document Translation Pairs.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Passage Reranking for Question Answering Using Syntactic Structures and Answer Types.
Proceedings of the Advances in Information Retrieval, 2011

Mining relational structure from millions of books: position paper.
Proceedings of the 4th ACM Workshop on Online books, 2011

A quasi-synchronous dependence model for information retrieval.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Evaluating an associative browsing model for personal information.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Passage retrieval for incorporating global evidence in sequence labeling.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

A Discriminative Model for Joint Morphological Disambiguation and Dependency Parsing.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Joint Annotation of Search Queries.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
The First-Fit Algorithm Uses Many Colors on Some Interval Graphs.
PhD thesis, 2010

Computing continuous core/periphery structures for social relations data with MINRES/SVD.
Soc. Networks, 2010

Inference by Minimizing Size, Divergence, or their Sum.
Proceedings of the UAI 2010, 2010

Relaxed Marginal Inference and its Application to Dependency Parsing.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Modeling reformulation using passage analysis.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Building a semantic representation for personal information.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Structural annotation of search queries using pseudo-relevance feedback.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
Classics in the Million Book Library.
Digit. Humanit. Q., 2009

Two-stage query segmentation for information retrieval.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Parser Adaptation and Projection with Quasi-Synchronous Grammar Features.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Polylingual Topic Models.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Online community search using thread structure.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Dependency Parsing by Belief Propagation.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

2007
Probabilistic Models of Nonprojective Dependency Trees.
Proceedings of the EMNLP-CoNLL 2007, 2007

Bootstrapping Feature-Rich Dependency Parsers with Entropic Priors.
Proceedings of the EMNLP-CoNLL 2007, 2007

Log-Linear Models of Non-Projective Trees, $k$-best MST Parsing and Tree-Ranking.
Proceedings of the EMNLP-CoNLL 2007, 2007

2006
Debabelizing Libraries: Machine Translation by and for Digital Collections.
D Lib Mag., 2006

Quasi-Synchronous Grammars: Alignment by Soft Projection of Syntactic Dependencies.
Proceedings on the Workshop on Statistical Machine Translation, 2006

Vine Parsing and Minimum Risk Reranking for Speed and Precision.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006

Scalability of Collaborative Environments.
Proceedings of the Fourth International Conference on Creating, 2006

Minimum Risk Annealing for Training Log-Linear Models.
Proceedings of the ACL 2006, 2006

2005
Context-Based Morphological Disambiguation with Random Fields.
Proceedings of the HLT/EMNLP 2005, 2005

Filters and Tasks in Croquet.
Proceedings of the 3rd Conference on Creating, 2005

2004
Bilingual Parsing with Factored Estimation: Using English to Parse Korean.
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, 2004

Croquet: A Menagerie of New User Interfaces.
Proceedings of the 2004 Conference on Creating, 2004

2003
Croquet - A Collaboration System Architecture.
Proceedings of the 2003 Conference on Creating, 2003

2002
Detecting and Browsing Events in Unstructured text.
SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Integrating harvesting into digital library content.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2002

Detecting events with date and place information in unstructured text.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2002

2001
Drudgery and deep thought.
Commun. ACM, 2001

Building a hypertextual digital library in the humanities: a case study on London.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2001

Disambiguating Geographic Names in a Historical Digital Library.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2001

2000
Managing XML Documents in an Integrated Digital Library.
Markup Lang., 2000

1999
Benchmarking the Use of IT to Support Supplier Management in Construction.
J. Inf. Technol. Constr., 1999

1997
Current status of computed radiography in emergency departments.
J. Digit. Imaging, 1997

1992
Project CALC: Calculus as a Laboratory Course.
Proceedings of the Computer Assisted Learning, 4th International Conference, 1992

1990
Integrated-Optic Acoustically-Tunable Filters for WDM Networks.
IEEE J. Sel. Areas Commun., 1990

1987
Mechanisms for abstraction in Ada.
Proceedings of the Joint Ada conference Fifth National Conference on Ada Technology and Fourth Washington Ada Symposium, 1987

1983
Algorithm 602: HURRY: An Acceleration Algorithm for Scalar Sequences and Series.
ACM Trans. Math. Softw., 1983

HURRY: An Acceleration Algorithm for Scalar Sequences and Series.
ACM Trans. Math. Softw., 1983

