Luis Gravano

Affiliations:
  • Columbia University, New York City, USA


According to our database1, Luis Gravano authored at least 107 papers between 1991 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Interactive Machine Teaching by Labeling Rules and Instances.
CoRR, 2024

Seasonality Patterns in 311-Reported Foodborne Illness Cases and Machine Learning-Identified Indications of Foodborne Illnesses from Yelp Reviews, New York City, 2022-2023.
CoRR, 2024

2023
What is a good question? Task-oriented asking with fact-level masking.
CoRR, 2023

2021
Quantifying the Effects of COVID-19 on Restaurant Reviews.
Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media, 2021

2020
Cross-Lingual Text Classification with Minimal Resources by Transferring a Sparse Teacher.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Detecting Foodborne Illness Complaints in Multiple Languages Using English Annotations Only.
Proceedings of the 11th International Workshop on Health Text Mining and Information Analysis, 2020

2019
Leveraging Just a Few Keywords for Fine-Grained Aspect Detection Through Weakly Supervised Co-Training.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Weakly Supervised Attention Networks for Fine-Grained Opinion Mining and Public Health.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

2018
Discovering foodborne illness in online restaurant reviews.
J. Am. Medical Informatics Assoc., 2018

2017
Fast and Accurate Time-Series Clustering.
ACM Trans. Database Syst., 2017

Sampling strategies for information extraction over the deep web.
Inf. Process. Manag., 2017

2016
k-Shape: Efficient and Accurate Clustering of Time Series.
SIGMOD Rec., 2016

Predicting the impact of scientific concepts using full-text features.
J. Assoc. Inf. Sci. Technol., 2016

2015
Learning to Rank Adaptively for Scalable Information Extraction.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

Ranking Deep Web Text Collections for Scalable Information Extraction.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
REEL: A Relation Extraction Learning framework.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014

2013
When Speed Has a Price: Fast Information Extraction Using Approximate Algorithms.
Proc. VLDB Endow., 2013

Effective Event Identification in Social Media.
IEEE Data Eng. Bull., 2013

2012
Answering General Time-Sensitive Queries.
IEEE Trans. Knowl. Data Eng., 2012

Identifying content for planned events across social media sites.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

2011
Hip and trendy: Characterizing emerging trends on Twitter.
J. Assoc. Inf. Sci. Technol., 2011

Selecting Quality Twitter Content for Events.
Proceedings of the Fifth International Conference on Weblogs and Social Media, 2011

Beyond Trending Topics: Real-World Event Identification on Twitter.
Proceedings of the Fifth International Conference on Weblogs and Social Media, 2011

Automatic Identification and Presentation of Twitter Content for Planned Events.
Proceedings of the Fifth International Conference on Weblogs and Social Media, 2011

2010
Learning similarity metrics for event identification in social media.
Proceedings of the Third International Conference on Web Search and Web Data Mining, 2010

Popularity-Guided Top-k Extraction of Entity Attributes.
Proceedings of the 13th International Workshop on the Web and Databases 2010, 2010

2009
Event Identification in Social Media.
Proceedings of the 12th International Workshop on the Web and Databases, 2009

Querying text databases and the web: beyond traditional keyword search.
Proceedings of the First International Workshop on Keyword Search on Structured Data, 2009

Join Optimization of Information Extraction Output: Quality Matters!
Proceedings of the 25th International Conference on Data Engineering, 2009

2008
Classification-aware hidden-web text database selection.
ACM Trans. Inf. Syst., 2008

Building query optimizers for information extraction: the SQoUT project.
SIGMOD Rec., 2008

Optimizing SQL Queries over Text Databases.
Proceedings of the 24th International Conference on Data Engineering, 2008

2007
Modeling and managing changes in text databases.
ACM Trans. Database Syst., 2007

Towards a query optimizer for text-centric tasks.
ACM Trans. Database Syst., 2007

Efficient summarization-aware search for online news articles.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2007

Efficient Keyword Search Across Heterogeneous Relational Databases.
Proceedings of the 23rd International Conference on Data Engineering, 2007

SQL Queries Over Unstructured Text Databases.
Proceedings of the 23rd International Conference on Data Engineering, 2007

2006
To search or to crawl?: towards a query optimizer for text-centric tasks.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

2005
XML & Data Streams.
Proceedings of the Stream Data Management, 2005

Modeling and Managing Content Changes in Text Databases.
Proceedings of the 21st International Conference on Data Engineering, 2005

2004
Learning to find answers to questions on the Web.
ACM Trans. Internet Techn., 2004

Evaluating top-<i>k</i> queries over web-accessible databases.
ACM Trans. Database Syst., 2004

Optimizing Top-k Selection Queries over Multimedia Repositories.
IEEE Trans. Knowl. Data Eng., 2004

When one Sample is not Enough: Improving Text Database Selection Using Shrinkage.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

Selectivity Estimation for String Predicates: Overcoming the Underestimation Problem.
Proceedings of the 20th International Conference on Data Engineering, 2004

2003
QProber: A system for automatic classification of hidden-Web databases.
ACM Trans. Inf. Syst., 2003

Text joins in an RDBMS for web data integration.
Proceedings of the Twelfth International World Wide Web Conference, 2003

Modeling Query-Based Access to Text Databases.
Proceedings of the International Workshop on Web and Databases, 2003

Efficient IR-Style Keyword Search over Relational Databases.
Proceedings of 29th International Conference on Very Large Data Bases, 2003

QXtract: A Building Block for Efficient Information Extraction from Plain-Text Databases.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

Text Joins for Data Cleansing and Integration in an RDBMS.
Proceedings of the 19th International Conference on Data Engineering, 2003

Navigation- vs. Index-Based XML Multi-Query Processing.
Proceedings of the 19th International Conference on Data Engineering, 2003

Querying Text Databases for Efficient Information Extraction.
Proceedings of the 19th International Conference on Data Engineering, 2003

Categorizing web queries according to geographical locality.
Proceedings of the 2003 ACM CIKM International Conference on Information and Knowledge Management, 2003

2002
Data Integration and Access - The Digital Government Research Center's Energy Data Collection (EDC) Project.
Proceedings of the Advances in Digital Government - Technology, Human Factors, and Policy, 2002

Top-k selection queries over relational databases: Mapping strategies and performance evaluation.
ACM Trans. Database Syst., 2002

Query- vs. Crawling-based Classification of Searchable Web Databases.
IEEE Data Eng. Bull., 2002

Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

Extending SDARTS: extracting metadata from web databases and interfacing with the open archives initiative.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2002

Evaluating Top-k Queries over Web-Accessible Databases.
Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, USA, February 26, 2002

Web Mining Meets Web Search.
Proceedings of the 2002 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2002

2001
Reminiscences on Influential Papers.
SIGMOD Rec., 2001

Using q-grams in a DBMS for Approximate String Processing.
IEEE Data Eng. Bull., 2001

Letter from the Special Issue Editor.
IEEE Data Eng. Bull., 2001

Simplifying Data Access: The Energy Data Collection Project.
Computer, 2001

Learning search engine specific query transformations for question answering.
Proceedings of the Tenth International World Wide Web Conference, 2001

Approximate String Joins in a Database (Almost) for Free.
Proceedings of the VLDB 2001, 2001

Probe, Count, and Classify: Categorizing Hidden Web Databases.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

STHoles: A Multidimensional Workload-Aware Histogram.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Snowball: A Prototype System for Extracting Relations from Large Text Collections.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

PERSIVAL, a system for personalized search and summarization over multimedia healthcare information.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2001

PERSIVAL demo: categorizing hidden-web resources.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2001

SDLIP + STARTS = SDARTS a protocol and toolkit for metasearching.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2001

2000
Automatic Classification of Text Databases Through Query Probing.
Proceedings of the Third International Workshop on the Web and Databases, 2000

Computing Geographical Scopes of Web Resources.
Proceedings of the VLDB 2000, 2000

An investigation of linguistic features and clustering algorithms for topical document clustering.
Proceedings of the SIGIR 2000: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2000

Combining Strategies for Extracting Relations from Text Collections.
Proceedings of the 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2000

<i>Snowball</i>: extracting relations from large plain-text collections.
Proceedings of the Fifth ACM Conference on Digital Libraries, 2000

Simplifying data access: the energy data collection (EDC) project.
Proceedings of the 2000 National Conference on Digital Government Research, 2000

Characterizing Web Resources for Improved Search.
Proceedings of the First DELOS Network of Excellence Workshop on Information Seeking, 2000

1999
GlOSS: Text-Source Discovery over the Internet.
ACM Trans. Database Syst., 1999

Exploiting Geographical Location Information of Web Pages.
Proceedings of the ACM SIGMOD Workshop on The Web and Databases, 1999

Evaluating Top-<i>k</i> Selection Queries.
Proceedings of the VLDB'99, 1999

1998
Database Research at Columbia University.
SIGMOD Rec., 1998

Mediating and Metasearching on the Internet.
IEEE Data Eng. Bull., 1998

The Stanford InfoBus and Its Service Layers: Augmenting the Internet with High-Level Information Management Protocols.
Proceedings of the Digital Libraries in Computer Science: The MeDoc Approach, 1998

1997
Querying multiple document collections across the internet.
PhD thesis, 1997

Data Structures for Efficient Broker Implementation.
ACM Trans. Inf. Syst., 1997

The Stanford Digital Library Metadata Architecture.
Int. J. Digit. Libr., 1997

Merging Ranks from Heterogeneous Internet Sources.
Proceedings of the VLDB'97, 1997

STARTS: Stanford Proposal for Internet Meta-Searching (Experience Paper).
Proceedings of the SIGMOD 1997, 1997

Metadata for Digital Libraries: Architecture and Design Rationale.
Proceedings of the 2nd ACM International Conference on Digital Libraries, 1997

1996
Optimizing Queries over Multimedia Repositories.
IEEE Data Eng. Bull., 1996

dSCAM: Finding Document Copies Across Multiple Databases.
Proceedings of the Fourth International Conference on Parallel and Distributed Information Systems, 1996

1995
Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies.
Proceedings of the VLDB'95, 1995

1994
Fully Adaptive Minimal Deadlock-Free Packet Routing in Hypercubes, Meshes, and other Networks: Algorithms and Simulations.
IEEE Trans. Parallel Distributed Syst., 1994

Adaptive Deadlock- and Livelock-Free Routing in the Hypercube Network.
IEEE Trans. Parallel Distributed Syst., 1994

Adaptive Deadlock- and Livelock-Free Routing with All Minimal Paths in Torus Networks.
IEEE Trans. Parallel Distributed Syst., 1994

Storage-Efficient, Deadlock-Free Packet Routing Algorithms for Torus Networks.
IEEE Trans. Computers, 1994

Requirements for Deadlock-Free, Adaptive Packet Routing.
SIAM J. Comput., 1994

The Effectiveness of GlOSS for the Text Database Discovery Problem.
Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, 1994

Precision and Recall of GlOSS Estimators for Database Discovery.
Proceedings of the Third International Conference on Parallel and Distributed Information Systems (PDIS 94), 1994

1992
Adaptive Deadlock-Free Worrnhole Routing in Hypercubes.
Proceedings of the 6th International Parallel Processing Symposium, 1992

Adaptive, Deadlock-Free Packet Routing in Torus Networks with Minimal Storage.
Proceedings of the 1992 International Conference on Parallel Processing, 1992

1991
Routing techniques for massively parallel communication.
Proc. IEEE, 1991

Fully-Adaptive Minimal Deadlock-Free Packet Routing in Hypercubes, Meshes, and Other Networks.
Proceedings of the 3rd Annual ACM Symposium on Parallel Algorithms and Architectures, 1991

Fully-adaptive routing: packet switching performance and wormhole algorithms.
Proceedings of the Proceedings Supercomputing '91, 1991


  Loading...