Youngja Park

Orcid: 0000-0002-4579-9261

According to our database1, Youngja Park authored at least 62 papers between 2001 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Towards Generating Informative Textual Description for Neurons in Language Models.
CoRR, 2024

2023
Looking Beyond IoCs: Automatically Extracting Attack Patterns from External CTI.
Proceedings of the 26th International Symposium on Research in Attacks, 2023

A Pretrained Language Model for Cyber Threat Intelligence.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Split-NER: Named Entity Recognition via Two Question-Answering-based Classifications.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
CyNER: A Python Library for Cybersecurity Named Entity Recognition.
CoRR, 2022

Backdoor smoothing: Demystifying backdoor attacks on deep neural networks.
Comput. Secur., 2022

Full-Stack Information Extraction System for Cybersecurity Intelligence.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

2021
Ontology-driven Knowledge Graph for Android Malware.
CoRR, 2021

An Ontology-driven Knowledge Graph for Android Malware.
Proceedings of the CCS '21: 2021 ACM SIGSAC Conference on Computer and Communications Security, Virtual Event, Republic of Korea, November 15, 2021

2020
A new measure for overfitting and its implications for backdooring of deep learning.
CoRR, 2020

Utilizing Multimodal Feature Consistency to Detect Adversarial Examples on Clinical Summaries.
Proceedings of the 3rd Clinical Natural Language Processing Workshop, 2020

2019
A biplot correlation range for group-wise metabolite selection in mass spectrometry.
BioData Min., 2019

Unsupervised Sentence Embedding Using Document Structure-Based Context.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Predicting Malware Attributes from Cybersecurity Texts.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Incorporating Domain Knowledge in Learning Word Embedding.
Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

Supervising Unsupervised Open Information Extraction Models.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2018
UMBC at SemEval-2018 Task 8: Understanding Text about Malware.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

2017
Learning Domain-Specific Word Embeddings from Sparse Cybersecurity Texts.
CoRR, 2017

2016
Data classification and sensitivity estimation for critical asset discovery.
IBM J. Res. Dev., 2016

A platform and analytics for usage and entitlement analytics.
IBM J. Res. Dev., 2016

Enumeration of generalized lattice paths by string types, peaks, and ascents.
Discret. Math., 2016

Distributed Flexible Nonlinear Tensor Factorization.
CoRR, 2016

Chung-Feller Property of Schröder Objects.
Electron. J. Comb., 2016

Tri-Modularization of Firewall Policies.
Proceedings of the 21st ACM on Symposium on Access Control Models and Technologies, 2016

Graph Analytics for Real-Time Scoring of Cross-Channel Transactional Fraud.
Proceedings of the Financial Cryptography and Data Security, 2016

Comparing Password Ranking Algorithms on Real-World Password Datasets.
Proceedings of the Computer Security - ESORICS 2016, 2016

DinTucker: Scaling Up Gaussian Process Models on Large Multidimensional Arrays.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Learning from Others: User Anomaly Detection Using Anomalous Samples from Other Users.
Proceedings of the Computer Security - ESORICS 2015, 2015

Scalable Nonparametric Multiway Data Analysis.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014
PAKDD'12 best paper: generating balanced classifier-independent training samples from unlabeled data.
Knowl. Inf. Syst., 2014

Hetero-Labeled LDA: A Partially Supervised Topic Model with Heterogeneous Labels.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Detecting Insider Information Theft Using Features from File Access Logs.
Proceedings of the Computer Security - ESORICS 2014, 2014

2013
Predicting Network Activity from High Throughput Metabolomics.
PLoS Comput. Biol., 2013

DinTucker: Scaling up Gaussian process models on multidimensional arrays with billions of elements.
CoRR, 2013

Ensuring continuous compliance through reconciling policy with usage.
Proceedings of the 18th ACM Symposium on Access Control Models and Technologies, 2013

A bigData platform for analytics on access control policies and logs.
Proceedings of the 18th ACM Symposium on Access Control Models and Technologies, 2013

Estimating Asset Sensitivity by Profiling Users.
Proceedings of the Computer Security - ESORICS 2013, 2013

2012
Generative models for access control policies: applications to role mining over logs with attribution.
Proceedings of the 17th ACM Symposium on Access Control Models and Technologies, 2012

Generating Balanced Classifier-Independent Training Samples from Unlabeled Data.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2012

Business email classification using incremental subspace learning.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

2011
Clustering based on periodicity in high-throughput time course data.
Stat. Anal. Data Min., 2011

System for automatic estimation of data sensitivity with applications to access control and other applications.
Proceedings of the 16th ACM Symposium on Access Control Models and Technologies, 2011

Automatic Call Quality Monitoring Using Cost-Sensitive Classification.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An experimental study on the measurement of data sensitivity.
Proceedings of the First Workshop on Building Analysis Datasets and Gathering Experience Returns for Security, 2011

2009
apLCMS - adaptive processing of high-resolution LC/MS data.
Bioinform., 2009

Low-cost call type classification for contact center calls using partial transcripts.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Towards real-time measurement of customer satisfaction using automatically generated call transcripts.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Controlling the False Discovery Rate for Feature Selection in High-resolution NMR Spectra.
Stat. Anal. Data Min., 2008

Discovery of metabolite features for the modelling and analysis of high-resolution NMR spectra.
Int. J. Data Min. Bioinform., 2008

Genetic algorithm-based feature selection in high-resolution NMR spectra.
Expert Syst. Appl., 2008

An empirical analysis of word error rate and keyword error rate.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Automatically constructing blue pages for characters in instructional videos.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Semi-automated logging of contact center telephone calls.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

2007
Semantic Analysis for Topical Segmentation of Videos.
Proceedings of the First IEEE International Conference on Semantic Computing (ICSC 2007), 2007

Automatic call section segmentation for contact-center calls.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

2006
Extracting Salient Keywords from Instructional Videos Using Joint Text, Audio and Visual Cues.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Atomic topical segments detection for instructional videos.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

MAGICAL demonstration: system for automated metadata generation for instructional content.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

2004
Glossary extraction and utilization in the information search and delivery system for IBM Technical Support.
IBM Syst. J., 2004

GlossOnt: A Concept-focused Ontology Building Tool.
Proceedings of the Principles of Knowledge Representation and Reasoning: Proceedings of the Ninth International Conference (KR2004), 2004

2002
Automatic Glossary Extraction: Beyond Terminology Identification.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

2001
Hybrid Text Mining for Finding Abbreviations and their Definitions.
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2001


  Loading...