Masafumi Oyamada

Orcid: 0000-0002-4045-7350

According to our database, Masafumi Oyamada authored at least 29 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Can Large Language Models Invent Algorithms to Improve Themselves?
CoRR, 2024

Optimizing Low-Resource Language Model Training: Comprehensive Analysis of Multi-Epoch, Multi-Lingual, and Two-Stage Approaches.
CoRR, 2024

LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization.
CoRR, 2024

Large Language Models as Data Preprocessors.
Proceedings of Workshops at the 50th International Conference on Very Large Data Bases, 2024

Relevance, Diversity, and Exclusivity: Designing Keyword-augmentation Strategy for Zero-shot Classifiers.
Proceedings of the 13th Joint Conference on Lexical and Computational Semantics, 2024

Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

On the Use of Large Language Models for Table Tasks.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
DeepJoin: Joinable Table Discovery with Pre-trained Language Models.
Proc. VLDB Endow., 2023

Jellyfish: A Large Language Model for Data Preprocessing.
CoRR, 2023

QA-Matcher: Unsupervised Entity Matching Using a Question Answering Model.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023

Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Cross-Domain User Similarity without Overlapping Attributes via Optimal Transport Theory.
Proceedings of the 2023 SIGIR Workshop on eCommerce co-located with the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023), 2023

Towards Large Language Model Organization: A Case Study on Abstractive Summarization.
Proceedings of the IEEE International Conference on Big Data, 2023

2022
Table Enrichment System for Machine Learning.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

2021
Continuous top-k spatial-keyword search on dynamic objects.
VLDB J., 2021

Quality Control for Hierarchical Classification with Incomplete Annotations.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2021

User Identity Linkage for Different Behavioral Patterns across Domains.
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media, 2021

Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Low-resource Taxonomy Enrichment with Pretrained Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Learning with Unsure Responses.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Extracting Feature Engineering Knowledge from Data Science Notebooks.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Meimei: An Efficient Probabilistic Approach for Semantically Annotating Tables.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Compressed Vector Set: A Fast and Space-Efficient Data Mining Framework.
J. Inf. Process., 2018

Accelerating Feature Engineering with Adaptive Partial Aggregation Tree.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
Link Prediction for Isolated Nodes in Heterogeneous Network by Topic-Based Co-clustering.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2017

Relational Mixture of Experts: Explainable Demographics Prediction with Behavioral Data.
Proceedings of the 2017 IEEE International Conference on Data Mining, 2017

2014
MOARLE: Matrix Operation Accelerator Based on Run-Length Encoding.
Proceedings of the Web Technologies and Applications - 16th Asia-Pacific Web Conference, 2014

2013
Continuous query processing with concurrency control: reading updatable resources consistently.
Proceedings of the 28th Annual ACM Symposium on Applied Computing, 2013

2011
Efficient Invocation of Transaction Sequences Triggered by Data Streams.
Proceedings of the 2011 International Conference on P2P, 2011

