Ben Kao

Orcid: 0000-0002-0501-9435

Affiliations:
  • University of Hong Kong


According to our database1, Ben Kao authored at least 151 papers between 1991 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Bringing legal knowledge to the public by constructing a legal question bank using large-scale pre-trained language model.
Artif. Intell. Law, September, 2024

ZeroEA: A Zero-Training Entity Alignment Framework via Pre-Trained Language Model.
Proc. VLDB Endow., March, 2024

Debiasing Recommendation with Personal Popularity.
Proceedings of the ACM on Web Conference 2024, 2024

OpenGraph: Towards Open Graph Foundation Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Spatio-temporal flow patterns.
CoRR, 2023

Automated Self-Supervised Learning for Recommendation.
Proceedings of the ACM Web Conference 2023, 2023

Judgment Retrieval Made Easier Through Query Analysis.
Proceedings of the Legal Knowledge and Information Systems, 2023

Multi-Task Processing in Vertex-Centric Graph Systems: Evaluations and Insights.
Proceedings of the Proceedings 26th International Conference on Extending Database Technology, 2023

Annotating Educational Dialog Act with Data Augmentation in Online One-on-One Tutoring.
Proceedings of the Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium and Blue Sky, 2023

CEMA - Cost-Efficient Machine-Assisted Document Annotations.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
SCHAIN-IRAM: An Efficient and Effective Semi-Supervised Clustering Algorithm for Attributed Heterogeneous Information Networks.
IEEE Trans. Knowl. Data Eng., 2022

The Social Technology and Research (STAR) Lab in the University of Hong Kong.
SIGMOD Rec., 2022

Distributed PageRank computation with improved round complexities.
Inf. Sci., 2022

Everest: A Top-K Deep Video Analytics System.
Proceedings of the SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12, 2022

Judgment Tagging and Recommendation Using Pre-Trained Language Models and Legal Taxonomy.
Proceedings of the Legal Knowledge and Information Systems, 2022

Modeling Long-Range Travelling Times with Big Railway Data.
Proceedings of the Database Systems for Advanced Applications, 2022

Automatic Meta-Path Discovery for Effective Graph-Based Recommendation.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Lexical Knowledge Internalization for Neural Dialog Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Reinforced Meta-path Selection for Recommendation on Heterogeneous Information Networks.
CoRR, 2021

Top-K Deep Video Analytics: A Probabilistic Approach.
Proceedings of the SIGMOD '21: International Conference on Management of Data, 2021

Characterizing Amateur Tutoring Behavior on a Large Online Learning Platform.
Proceedings of the L@S'21: Eighth ACM Conference on Learning @ Scale, 2021

Semantic Search and Summarization of Judgments Using Topic Modeling.
Proceedings of the Legal Knowledge and Information Systems, 2021

Leveraging Meta-path Contexts for Classification in Heterogeneous Information Networks.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
BATON: Batch One-Hop Personalized PageRanks with Efficiency and Accuracy.
IEEE Trans. Knowl. Data Eng., 2020

ROAM: A Fundamental Routing Query on Road Networks with Efficiency.
IEEE Trans. Knowl. Data Eng., 2020

Finding Interesting Frames in Deep Video Analytics: a Top-K Approach.
CoRR, 2020

PERQ: Predicting, Explaining, and Rectifying Failed Questions in KB-QA Systems.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

MULCE: Multi-level Canonicalization with Embeddings of Open Knowledge Bases.
Proceedings of the Web Information Systems Engineering - WISE 2020, 2020

CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Integrating Domain Knowledge in AI-Assisted Criminal Sentencing of Drug Trafficking Cases.
Proceedings of the Legal Knowledge and Information Systems, 2020

Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Efficient Batch One-Hop Personalized PageRanks.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

MPR - A Partitioning-Replication Framework for Multi-Processing kNN Search on Road Networks.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Spectral Clustering in Heterogeneous Information Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Uncertain Data Mining.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

TOAIN: A Throughput Optimizing Adaptive Index for Answering Dynamic kNN Queries on Road Networks.
Proc. VLDB Endow., 2018

Decentralized Search on Decentralized Web.
CoRR, 2018

ROSC: Robust Spectral Clustering on Multi-scale Data.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Towards Practical Open Knowledge Base Canonicalization.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017
Efficient Pattern-Based Aggregation on Sequence Data.
IEEE Trans. Knowl. Data Eng., 2017

Finding k most influential edges on flow graphs.
Inf. Syst., 2017

Efficient processing of shortest path queries in evolving graph sequences.
Inf. Syst., 2017

Semi-supervised Clustering in Attributed Heterogeneous Information Networks.
Proceedings of the 26th International Conference on World Wide Web, 2017

SEQ: Example-based Query for Spatial Objects.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

2016
Neural enquirer: learning to query tables in natural language.
IEEE Data Eng. Bull., 2016

Neural Enquirer: Learning to Query Tables in Natural Language.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

On Transductive Classification in Heterogeneous Information Networks.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
SDB: A Secure Query Processing System with Data Interoperability.
Proc. VLDB Endow., 2015

Neural Enquirer: Learning to Query Tables.
CoRR, 2015

ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

TLB misses: The Missing Issue of Adaptive Radix Tree?
Proceedings of the 11th International Workshop on Data Management on New Hardware, 2015

Answering Questions with Complex Semantic Constraints on Open Knowledge Bases.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

Classification with Active Learning and Meta-Paths in Heterogeneous Information Networks.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

2014
Secure query processing with data interoperability in a cloud database environment.
Proceedings of the International Conference on Management of Data, 2014

Who are experts specializing in landscape photography?: analyzing topic-specific authority on content sharing services.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Evaluating multi-way joins over discounted hitting time.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

CLUDE: An Efficient Algorithm for LU Decomposition Over a Sequence of Evolving Graphs.
Proceedings of the 17th International Conference on Extending Database Technology, 2014

2013
Mining Order-Preserving Submatrices from Data with Repeated Measurements.
IEEE Trans. Knowl. Data Eng., 2013

On the Use of Bug and Predicate Signatures for Statistical Debugging.
Proceedings of the 25th International Conference on Software Engineering and Knowledge Engineering, 2013

On incentive-based tagging.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Optimizing plurality for human intelligence tasks.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Fast evaluation of iceberg pattern-based aggregate queries.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
Location-sensitive resources recommendation in social tagging systems.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

DQR: a probabilistic approach to diversified query recommendation.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

A filter-based protocol for continuous queries over imprecise location data.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Evaluation Metric for Multiple-Bug Localization with Simple and Complex Predicates.
Proceedings of the 19th Asia-Pacific Software Engineering Conference, 2012

2011
Decision Trees for Uncertain Data.
IEEE Trans. Knowl. Data Eng., 2011

On Querying Historical Evolving Graph Sequences.
Proc. VLDB Endow., 2011

Metric and trigonometric pruning for clustering of uncertain data in 2D geometric space.
Inf. Syst., 2011

CubeLSI: An effective and efficient method for searching resources in social tagging systems.
Proceedings of the 27th International Conference on Data Engineering, 2011

I/O-efficient algorithms for answering pattern-based aggregate queries in a sequence OLAP system.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
Clustering Uncertain Data Using Voronoi Diagrams and R-Tree Index.
IEEE Trans. Knowl. Data Eng., 2010

Filtering Data Streams for Entity-Based Continuous Queries.
IEEE Trans. Knowl. Data Eng., 2010

Efficient Skyline Evaluation over Partially Ordered Domains.
Proc. VLDB Endow., 2010

S-OLAP: an OLAP system for analyzing sequence data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

2009
An Audit Environment for Outsourcing of Frequent Itemset Mining.
Proc. VLDB Endow., 2009

Secure kNN computation on encrypted databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

Naive Bayes Classification of Uncertain Data.
Proceedings of the ICDM 2009, 2009

XML Data Integration Using Fragment Join.
Proceedings of the Database Systems for Advanced Applications, 2009

Supporting ranking pattern-based aggregate queries in sequence data cubes.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Collaborative resource discovery in social tagging systems.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
OLAP on sequence data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

A Decremental Approach for Mining Frequent Itemsets from Uncertain Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2008

Clustering Uncertain Data Using Voronoi Diagrams.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Mining Order-Preserving Submatrices from Data with Repeated Measurements.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

2007
Mining periodic patterns with gap requirement from sequences.
ACM Trans. Knowl. Discov. Data, 2007

On mining micro-array data by Order-Preserving Submatrix.
Int. J. Bioinform. Res. Appl., 2007

Security in Outsourcing of Association Rule Mining.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Mining Frequent Itemsets from Uncertain Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

Reducing UK-Means to K-Means.
Proceedings of the Workshops Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Mining Time-Delayed Associations from Discrete Event Datasets.
Proceedings of the Advances in Databases: Concepts, 2007

Adaptive Frequency Counting over Bursty Data Streams.
Proceedings of the IEEE Symposium on Computational Intelligence and Data Mining, 2007

2006
Uncertain Data Mining: An Example in Clustering Location Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2006

Efficient Clustering of Uncertain Data.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

2005
Subsampled Model Aggregation.
Int. J. Artif. Intell. Tools, 2005

Efficient Algorithms for Mining and Incremental Update of Maximal Frequent Sequences.
Data Min. Knowl. Discov., 2005

Indexing Multi-Dimensional Uncertain Data with Arbitrary Probability Density Functions.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Adaptive Stream Filters for Entity-based Queries with Non-Value Tolerance.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Online Algorithms for Mining Inter-stream Associations from Large Sensor Networks.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2005

2004
Optimization in Data Cube System Design.
J. Intell. Inf. Syst., 2004

SF-Tree: An Efficient and Flexible Structure for Estimating Selectivity of Simple Path Expressions with Statistical Accuracy Guarantee.
Proceedings of the Database Systems for Advances Applications, 2004

2003
Maintaining Temporal Consistency of Discrete Objects in Soft Real-Time Database Systems.
IEEE Trans. Computers, 2003

Maintenance of Partial-Sum-Based Histograms.
Proceedings of the Eighth International Conference on Database Systems for Advanced Applications (DASFAA '03), 2003

Mining Emerging Substrings.
Proceedings of the Eighth International Conference on Database Systems for Advanced Applications (DASFAA '03), 2003

2002
A lattice-based approach for I/O efficient association rule mining.
Inf. Syst., 2002

Evaluation of concurrency control strategies for mixed soft real-time database systems.
Inf. Syst., 2002

Efficient Algorithms for Incremental Update of Frequent Sequences.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2002

An Efficient Algorithm for Incremental Update of Concept Spaces.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2002

A Comparison Study on Algorithms for Incremental Update of Frequent Sequences.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

2001
Towards the building of a dense-region-based OLAP system.
Data Knowl. Eng., 2001

Updates and View Maintenance.
Proceedings of the Real-Time Database Systems: Architecture and Techniques, 2001

Disk Scheduling.
Proceedings of the Real-Time Database Systems: Architecture and Techniques, 2001

Prototypes: Programmed Stock Trading.
Proceedings of the Real-Time Database Systems: Architecture and Techniques, 2001

FFS - An I/O-Efficient Algorithm for Mining Frequent Sequences.
Proceedings of the Knowledge Discovery and Data Mining, 2001

Efficient Algorithms for Concept Space Construction.
Proceedings of the Knowledge Discovery and Data Mining, 2001

2000
Anchor point indexing in Web document retrieval.
IEEE Trans. Syst. Man Cybern. Part C, 2000

Modeling splice sites with Bayes networks.
Bioinform., 2000

Text-Source Discovery and <i>GlOSS</i> Update in a Dynamic Web.
Proceedings of the Knowledge Discovery and Data Mining, 2000

An Optimization Problem in Data Cube System Design.
Proceedings of the Knowledge Discovery and Data Mining, 2000

A Study on N-Gram Indexing of Musical Features.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Exploiting the Duality of Maximal Frequent Itemsets and Minimal Infrequent Itemsets for I/O Efficient Association Rule Mining.
Proceedings of the Database and Expert Systems Applications, 11th International Conference, 2000

Selection of Melody Lines for Music Databases.
Proceedings of the 24th International Computer Software and Applications Conference (COMPSAC 2000), 2000

1999
Priority Scheduling of Transactions in Distributed Real-Time Databases.
Real Time Syst., 1999

A Fast Algorithm for Density-Based Clustering in Large Database.
Proceedings of the Methodologies for Knowledge Discovery and Data Mining, 1999

LGen - A Lattice-Based Candidate Set Generation Algorithm for I/O Efficient Association Rule Mining.
Proceedings of the Methodologies for Knowledge Discovery and Data Mining, 1999

A Study of Musical Features for Melody Databases.
Proceedings of the Database and Expert Systems Applications, 10th International Conference, 1999

DROLAP - A Dense-Region Based Approach to On-Line Analytical Processing.
Proceedings of the Database and Expert Systems Applications, 10th International Conference, 1999

Updates and View Maintenance in Soft Real-Time Database Systems.
Proceedings of the 1999 ACM CIKM International Conference on Information and Knowledge Management, 1999

Requirement-Based Data Cube Schema Design.
Proceedings of the 1999 ACM CIKM International Conference on Information and Knowledge Management, 1999

1998
Discovering user access patterns on the World Wide Web.
Knowl. Based Syst., 1998

An analysis of lock-based and optimistic concurrency control protocols in multiprocessor real-time databases.
J. Syst. Softw., 1998

Is Sampling Useful in Data Mining? A Case in the Maintenance of Discovered Association Rules.
Data Min. Knowl. Discov., 1998

Indexing Multilingual Information on the Web.
Proceedings of the COMPSAC '98, 1998

Recommending Anchor Points in Structure-Preserving Hypertext Document Retrieval.
Proceedings of the COMPSAC '98, 1998

1997
Deadline Assignment in a Distributed Soft Real-Time System.
IEEE Trans. Parallel Distributed Syst., 1997

Intelligent Agents for Matching Information Providers and Consumers on the World-Wide-Web.
Proceedings of the 30th Annual Hawaii International Conference on System Sciences (HICSS-30), 1997

A General Incremental Technique for Maintaining Discovered Association Rules.
Proceedings of the Database Systems for Advanced Applications '97, 1997

1996
Scheduling Soft Real-Time Jobs Over Dual Non-Real-Time Servers.
IEEE Trans. Parallel Distributed Syst., 1996

Overview of the STanford Real-time Information Processor (STRIP).
SIGMOD Rec., 1996

Priority Assignment for Sub-transaction in Distributed Real-time Databases.
Proceedings of the Online-Proceedings of the First International Workshop on Real-Time Databases: Issues and Applications, 1996

Impact of priority assignment on optimistic concurrency control in distributed real-time databases.
Proceedings of the Third International Workshop on Real-Time Computing Systems Application (RTCSA '96), October 30, 1996

Database Support for Efficiently Maintaining Derived Data.
Proceedings of the Advances in Database Technology, 1996

1995
Applying Update Streams in a Soft Real-Time Database System.
Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data, 1995

1994
Aggressive Transmissions of Short Messages Over Redundant Paths.
IEEE Trans. Parallel Distributed Syst., 1994

Emulating Soft Real-Time Scheduling Using Traditional Operating System Schedulers.
Proceedings of the 15th IEEE Real-Time Systems Symposium (RTSS '94), 1994

Subtask Deadline Assignment for Complex Distributed Soft Real-Time Tasks.
Proceedings of the 14th International Conference on Distributed Computing Systems, 1994

1993
Third Generation TP Monitors: A Database Challenge.
Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, 1993

The Gold Mailer.
Proceedings of the Ninth International Conference on Data Engineering, 1993

1992
An Overview of Real-Time Database Systems.
Proceedings of the Real Time Computing, 1992

1991
Aggressive transmissions over redundant paths.
Proceedings of the 10th International Conference on Distributed Computing Systems (ICDCS 1991), 1991


  Loading...