2005
Automating Content Extraction of HTML Documents.
World Wide Web, 2005

2003
DOM-based Content Extraction of HTML Documents.
Proceedings of the Twelfth International World Wide Web Conference, 2003