Proceedings of the 2022 Web Archiving & Digital Libraries Workshop (WADL 2022), 2022
ABCDEF: the 6 key features behind scalable, multi-tenant web archive processing with ARCH: archive, big data, concurrent, distributed, efficient, flexible.
Proceedings of the JCDL '22: The ACM/IEEE Joint Conference on Digital Libraries in 2022, Cologne, Germany, June 20, 2022
Fostering Community Engagement through Datathon Events: The Archives Unleashed Experience.
Digit. Humanit. Q., 2021
Building a Local Digital Preservation Infrastructure: Experiences in Selecting and Implementing Digital Preservation Systems.
Proceedings of the 17th International Conference on Digital Preservation, 2021
Building community at distance: a datathon during COVID-19.
Digit. Libr. Perspect., 2020
The Archives Unleashed Project: Technology, Process, and Community to Improve Scholarly Access to Web Archives.
Proceedings of the JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020
Content-Based Exploration of Archival Images Using Neural Networks.
Proceedings of the JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020
We Could, but Should We?: Ethical Considerations for Providing Access to GeoCities and Other Historical Digital Collections.
Proceedings of the CHIIR '20: Conference on Human Information Interaction and Retrieval, 2020
Solr Integration in the Anserini Information Retrieval Toolkit.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019
Scalable Content-Based Analysis of Images in Web Archives with TensorFlow and the Archives Unleashed Toolkit.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019
Warclight: A Rails Engine for Web Archive Discovery.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019
Building Community and Tools for Analyzing Web Archives Through Datathons.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019
The Archives Unleashed Notebook: Madlibs for Jumpstarting Scholarly Exploration of Web Archives.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019
The Cost of a WARC: Analyzing Web Archives in the Cloud.
Proceedings of the 19th ACM/IEEE Joint Conference on Digital Libraries, 2019
Strategies for Collecting, Processing, and Analyzing Tweets from Large Newsworthy Events.
Proceedings of the 2017 Web Archiving & Digital Libraries Workshop (WADL 2017), 2017
Building a National Web Archiving Collaborative Platform: The Web Archives for Longitudinal Knowledge Project.
Proceedings of the 2017 Web Archiving & Digital Libraries Workshop (WADL 2017), 2017
Content Selection and Curation for Web Archiving: The Gatekeepers vs. the Masses.
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016
Desiderata for Exploratory Search Interfaces to Web Archives in Support of Scholarly Activities.
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016