Steven Skiena

ORCID: 0000-0003-0397-7514

Affiliations:
  • Stony Brook University, Department of Computer Science, NY, USA


According to our database, Steven Skiena authored at least 205 papers between 1985 and 2024.

Bibliography

2024
The Shape of Word Embeddings: Recognizing Language Phylogenies through Topological Data Analysis.
CoRR, 2024

HINENI: Human Identity across the Nations of the Earth Ngram Investigator.
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media, 2024

The Evolution of Occupational Identity in Twitter Biographies.
Proceedings of the Eighteenth International AAAI Conference on Web and Social Media, 2024

The Shape of Word Embeddings: Quantifying Non-Isometry with Topological Data Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Word Definitions from Large Language Models.
CoRR, 2023

Analyzing Film Adaptation through Narrative Alignment.
CoRR, 2023

STONYBOOK: A System and Resource for Large-Scale Analysis of Novels.
CoRR, 2023

Prosody Analysis of Audiobooks.
CoRR, 2023

Accelerating Personalized PageRank Vector Computation.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Inferring Age from Linguistic and Verbal Cues in Celebrity Interviews.
Proceedings of the International Conference on Frontiers of Artificial Intelligence and Machine Learning, 2023

Provable Fairness for Neural Network Models Using Formal Verification.
Proceedings of the 2nd European Workshop on Algorithmic Fairness, 2023

GNAT: A General Narrative Alignment Tool.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Analyzing Film Adaptation through Narrative Alignment.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Improving the Sensitivity of MinHash Through Hash-Value Analysis.
Proceedings of the 34th Annual Symposium on Combinatorial Pattern Matching, 2023

Does It Pay to Optimize AUC?
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Fast spatial autocorrelation.
Knowl. Inf. Syst., 2022

Hierarchies over Vector Space: Orienting Word and Graph Embeddings.
CoRR, 2022

Time Window Frechet and Metric-Based Edit Distance for Passively Collected Trajectories.
CoRR, 2022

Verba Volant, Scripta Volant: Understanding Post-publication Title Changes in News Outlets.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Subset Node Anomaly Tracking over Large Dynamic Graphs.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Chapter Ordering in Novels.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Statistical methodology for ribosomal frameshift detection.
Proceedings of the BCB '22: 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, Northbrook, Illinois, USA, August 7, 2022

Low-dimensional genotype embeddings for predictive models.
Proceedings of the BCB '22: 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, Northbrook, Illinois, USA, August 7, 2022

Learning and Evaluating Character Representations in Novels.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Maximizing the Expected Value of a Lottery Ticket: How to Sell and When to Buy.
CoRR, 2021

Subset Node Representation Learning over Large Dynamic Graphs.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Cleaning Dirty Books: Post-OCR Processing for Previously Scanned Texts.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

NeuroPredictome: A Data-Driven Predictome Linking Neuroimaging to Phenotype.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020
The Algorithm Design Manual, Third Edition
Texts in Computer Science, Springer, ISBN: 978-3-030-54255-9, 2020

Improved MapReduce Load Balancing through Distribution-Dependent Hash Function Optimization.
Proceedings of the 26th IEEE International Conference on Parallel and Distributed Systems, 2020

Online AUC Optimization for Sparse High-Dimensional Datasets.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Chapter Captor: Text Segmentation in Novels.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

What time is it? Temporal Analysis of Novels.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
Data Races and the Discrete Resource-time Tradeoff Problem with Resource Reuse over Paths.
Proceedings of the 31st ACM on Symposium on Parallelism in Algorithms and Architectures, 2019

The Secret Lives of Names?: Name Embeddings from Social Media.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

MediaRank: Computational Ranking of Online News Sources.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

The Trumpiest Trump? Identifying a Subject's Most Characteristic Tweets.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Social Relation Inference via Label Propagation.
Proceedings of the Advances in Information Retrieval, 2019

Learning to Represent Bilingual Dictionaries.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Fast and Accurate Network Embeddings via Very Sparse Random Projection.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Pre-Phaser: Precise Cell-Cycle Phase Detector for scRNA-seq.
Proceedings of the 10th ACM International Conference on Bioinformatics, 2019

2018
A Tutorial on Network Embeddings.
CoRR, 2018

Co-training Embeddings of Knowledge Graphs and Entity Descriptions for Cross-lingual Entity Alignment.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Syntax-Directed Variational Autoencoder for Structured Data.
Proceedings of the 6th International Conference on Learning Representations, 2018

Multi-view Models for Political Ideology Detection of News Articles.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Simple Neologism Based Domain Independent Models to Predict Year of Authorship.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Enhanced Network Embeddings via Exploiting Edge Labels.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

DeepAnnotator: Genome Annotation with Deep Learning.
Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

HARP: Hierarchical Representation Learning for Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
The Data Science Design Manual
Texts in Computer Science, Springer, ISBN: 978-3-319-55443-3, 2017

Vector-based similarity measurements for historical figures.
Inf. Syst., 2017

Latent Human Traits in the Language of Social Media: An Open-Vocabulary Approach.
CoRR, 2017

Citation histories of papers: sometimes the rich get richer, sometimes they don't.
CoRR, 2017

Recognizing Descriptive Wikipedia Categories for Historical Figures.
CoRR, 2017

DeepBrowse: Similarity-Based Browsing Through Large Lists (Extended Abstract).
Proceedings of the Similarity Search and Applications - 10th International Conference, 2017

Nationality Classification Using Name Embeddings.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Generating Look-alike Names For Security Challenges.
Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security, 2017

Optimal codon pair bias design (extended abstract).
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017

Don't Walk, Skip!: Online Learning of Multi-scale Network Embeddings.
Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, Sydney, Australia, July 31, 2017

2016
On the Convergent Properties of Word Embedding Methods.
CoRR, 2016

Walklets: Multiscale Graph Embeddings for Interpretable Network Classification.
CoRR, 2016

False-Friend Detection and Entity Matching via Unsupervised Transliteration.
CoRR, 2016

Freshman or Fresher? Quantifying the Geographic Variation of Language in Online Social Media.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

NanoBLASTer: Fast alignment and characterization of Oxford Nanopore single molecule sequencing reads.
Proceedings of the 6th IEEE International Conference on Computational Advances in Bio and Medical Sciences, 2016

2015
Freshman or Fresher? Quantifying the Geographic Variation of Internet Language.
CoRR, 2015

Exact Age Prediction in Social Networks.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Statistically Significant Detection of Linguistic Change.
Proceedings of the 24th International Conference on World Wide Web, 2015

Optimizing Read Reversals for Sequence Compression - (Extended Abstract).
Proceedings of the Algorithms in Bioinformatics - 15th International Workshop, 2015

POLYGLOT-NER: Massive Multilingual Named Entity Recognition.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

2014
News-Based Group Modeling and Forecasting.
CoRR, 2014

Exploring the power of GPU's for training Polyglot language models.
CoRR, 2014

DeepWalk: online learning of social representations.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Inducing Language Networks from Continuous Space Word Representations.
Proceedings of the Complex Networks V, 2014

Building Sentiment Lexicons for All Major Languages.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
The Expressive Power of Word Embeddings
CoRR, 2013

Synthetic Sequence Design for Signal Location Search.
Algorithmica, 2013

Polyglot: Distributed Word Representations for Multilingual NLP.
Proceedings of the Seventeenth Conference on Computational Natural Language Learning, 2013

Designing Autocorrelated Genes.
Proceedings of the ACM Conference on Bioinformatics, 2013

2012
Watch the Story Unfold with TextWheel: Visualization of Large-Scale News Streams.
ACM Trans. Intell. Syst. Technol., 2012

Optimizing restriction site placement for synthetic genomes.
Inf. Comput., 2012

Redesigning Viral Genomes.
Computer, 2012

Designing RNA Secondary Structures in Coding Regions.
Proceedings of the Bioinformatics Research and Applications - 8th International Symposium, 2012

SpeedRead: A Fast Named Entity Recognition Pipeline.
Proceedings of the COLING 2012, 2012

2011
Constructing Orthogonal de Bruijn Sequences.
Proceedings of the Algorithms and Data Structures - 12th International Symposium, 2011

2010
Access: news and blog analysis for the social sciences.
Proceedings of the 19th International Conference on World Wide Web, 2010

Trading Strategies to Exploit Blog and News Sentiment.
Proceedings of the Fourth International Conference on Weblogs and Social Media, 2010

The Wisdom of Bookies? Sentiment Analysis Versus the NFL Point Spread.
Proceedings of the Fourth International Conference on Weblogs and Social Media, 2010

2009
Concordance-based entity-oriented search.
Web Intell. Agent Syst., 2009

Expanding network communities from representative examples.
ACM Trans. Knowl. Discov. Data, 2009

Algorithms for Deterministic Call Admission Control of Pre-stored VBR Video Streams.
J. Multim., 2009

Pattern matching with address errors: Rearrangement distances.
J. Comput. Syst. Sci., 2009

Analysis of Airplane Boarding Times.
Oper. Res., 2009

Crystallizing short-read assemblies around seeds.
BMC Bioinform., 2009

Improving Movie Gross Prediction through News Analysis.
Proceedings of the 2009 IEEE/WIC/ACM International Conference on Web Intelligence, 2009

Name-ethnicity classification from open sources.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Identifying Differences in News Coverage between Cultural/Ethnic Groups.
Proceedings of the 2009 IEEE/WIC/ACM International Conference on Web Intelligence and International Conference on Intelligent Agent Technology, 2009

2008
Combinatorial dominance guarantees for problems with infeasible solutions.
ACM Trans. Algorithms, 2008

Improved bounds on sorting by length-weighted reversals.
J. Comput. Syst. Sci., 2008

Call Admission Control Algorithm for pre-stored VBR video streams
CoRR, 2008

International Sentiment Analysis for News and Blogs.
Proceedings of the Second International Conference on Weblogs and Social Media, 2008

The Embroidery Problem.
Proceedings of the 20th Annual Canadian Conference on Computational Geometry, 2008

The Algorithm Design Manual, Second Edition.
Springer, 2008

2007
Two proteins for the price of one: the design of maximally compressed coding sequences.
Nat. Comput., 2007

Restricting SBH ambiguity via restriction enzymes.
Discret. Appl. Math., 2007

Large-Scale Sentiment Analysis for News and Blogs (system demonstration).
Proceedings of the First International Conference on Weblogs and Social Media, 2007

Large-Scale Sentiment Analysis for News and Blogs.
Proceedings of the First International Conference on Weblogs and Social Media, 2007

2006
Spatial Analysis of News Sources.
IEEE Trans. Vis. Comput. Graph., 2006

Some Lower Bounds on Geometric Separability Problems.
Int. J. Comput. Geom. Appl., 2006

Meta-analysis based on control of false discovery rate: combining yeast ChIP-chip datasets.
Bioinform., 2006

Improving Usability Through Password-Corrective Hashing.
Proceedings of the String Processing and Information Retrieval, 2006

Identifying Co-referential Names Across Large Corpora.
Proceedings of the Combinatorial Pattern Matching, 17th Annual Symposium, 2006

Newspapers vs. Blogs: Who Gets the Scoop?
Proceedings of the Computational Approaches to Analyzing Weblogs, 2006

2005
Lowest common ancestors in trees and directed acyclic graphs.
J. Algorithms, 2005

Question Answering with Lydia (TREC 2005 QA Track).
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Lydia: A System for Large-Scale News Analysis.
Proceedings of the String Processing and Information Retrieval, 2005

Attention and Communication: Decision Scenarios for Teleoperating Robots.
Proceedings of the 38th Hawaii International Conference on System Sciences (HICSS-38 2005), 2005

Bacterial population assay via k-mer analysis.
Proceedings of 3rd Asia-Pacific Bioinformatics Conference, 17-21 January 2005, Singapore, 2005

Airplane Boarding, Disk Scheduling and Space-Time Geometry.
Proceedings of the Algorithmic Applications in Management, First International Conference, 2005

2004
Geometric Reconstruction Problems.
Proceedings of the Handbook of Discrete and Computational Geometry, Second Edition., 2004

Data structures for maintaining set partitions.
Random Struct. Algorithms, 2004

Shift error detection in standardized exams.
J. Discrete Algorithms, 2004

Integrating Microarray Data By Consensus Clustering.
Int. J. Artif. Intell. Tools, 2004

When can you fold a map?
Comput. Geom., 2004

Visualizing Objects with Mirrors.
Comput. Graph. Forum, 2004

An Improved Time-Sensitive Metaheuristic Framework for Combinatorial Optimization.
Proceedings of the Experimental and Efficient Algorithms, Third International Workshop, 2004

Alphabet Permutation for Differentially Encoding Text.
Proceedings of the String Processing and Information Retrieval, 2004

Improved bounds on sorting with length-weighted reversals.
Proceedings of the Fifteenth Annual ACM-SIAM Symposium on Discrete Algorithms, 2004

Heterogeneous Data Integration with the Consensus Clustering Formalism.
Proceedings of the Data Integration in the Life Sciences, First International Workshop, 2004

2003
Programming Challenges - The Programming Contest Training Manual.
Texts in Computer Science, Springer, ISBN: 978-0-387-22081-9, 2003

Programming challenges: the programming contest training manual.
SIGACT News, 2003

Algorithms for testing that sets of DNA words concatenate without secondary structure.
Nat. Comput., 2003

Deconvolving Sequence Variation in Mixed DNA Populations.
J. Comput. Biol., 2003

Natural Selection and Algorithmic Design of mRNA.
J. Comput. Biol., 2003

The Lazy Bureaucrat scheduling problem.
Inf. Comput., 2003

A Model for Analyzing Black-Box Optimization.
Proceedings of the Algorithms and Data Structures, 8th International Workshop, 2003

Parsing Without a Grammar: Making Sense of Unknown File Formats.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

2002
Analysis Techniques for Microarray Time-Series Data.
J. Comput. Biol., 2002

Designing RNA structures: natural and artificial selection.
Proceedings of the Sixth Annual International Conference on Computational Biology, 2002

Microarray synthesis through multiple-use PCR primer design.
Proceedings of the Tenth International Conference on Intelligent Systems for Molecular Biology, 2002

A Time-Sensitive System for Black-Box Combinatorial Optimization.
Proceedings of the Algorithm Engineering and Experiments, 4th International Workshop, 2002

2001
Identifying gene regulatory networks from experimental data.
Parallel Comput., 2001

Dealing with errors in interactive sequencing by hybridization.
Bioinform., 2001

Finding least common ancestors in directed acyclic graphs.
Proceedings of the Twelfth Annual Symposium on Discrete Algorithms, 2001

Designing better phages.
Proceedings of the Ninth International Conference on Intelligent Systems for Molecular Biology, 2001

2000
LINK: a system for graph computation.
Softw. Pract. Exp., 2000

Efficiently computing and updating triangle strips for real-time rendering.
Comput. Aided Des., 2000

A case study in genome-level fragment assembly.
Bioinform., 2000

Some Separability Problems in the Plane.
EuroCG, 2000

1999
Who is interested in algorithms and why?: lessons from the Stony Brook algorithms repository.
SIGACT News, 1999

On the Maximum Scatter Traveling Salesperson Problem.
SIAM J. Comput., 1999

Matching for Run-Length Encoded Strings.
J. Complex., 1999

Optimizing combinatorial library construction via split synthesis.
Proceedings of the Third Annual International Conference on Research in Computational Molecular Biology, 1999

1998
Decision trees for geometric models.
Int. J. Comput. Geom. Appl., 1998

Recognizing polygonal parts from width measurements.
Comput. Geom., 1998

On Minimum-Area Hulls.
Algorithmica, 1998

1997
Local Rules for Protein Folding on a Triangular Lattice and Generalized Hydrophobicity in the HP Model.
J. Comput. Biol., 1997

Guest Editors' Foreword.
Int. J. Comput. Geom. Appl., 1997

On the Maximum Scatter TSP (Extended Abstract).
Proceedings of the Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, 1997

Local Rules for Protein Folding on a Triangular Lattice and Generalized Hydrophobicity in the HP Model.
Proceedings of the Eighth Annual ACM-SIAM Symposium on Discrete Algorithms, 1997

Fabricating arrays of strings.
Proceedings of the First Annual International Conference on Research in Computational Molecular Biology, 1997

Efficient Array Partitioning.
Proceedings of the Automata, Languages and Programming, 24th International Colloquium, 1997

Graph Drawing and Manipulation with LINK.
Proceedings of the Graph Drawing, 5th International Symposium, 1997

Trie-Based Data Structures for Sequence Assembly.
Proceedings of the Combinatorial Pattern Matching, 8th Annual Symposium, 1997

Geometric Decision Trees for Optical Character Recognition (Extended Abstract).
Proceedings of the Thirteenth Annual Symposium on Computational Geometry, 1997

The algorithm design manual.
Springer, ISBN: 978-0-387-94860-7, 1997

1996
Dialing for Documents: An Experiment in Information Theory.
J. Vis. Lang. Comput., 1996

Hamiltonian triangulations for fast rendering.
Vis. Comput., 1996

Principles and Practice of Unification Factoring.
ACM Trans. Program. Lang. Syst., 1996

Sorting with Fixed-length Reversals.
Discret. Appl. Math., 1996

Positional sequencing by hybridization.
Comput. Appl. Biosci., 1996

Optimizing Triangle Strips for Fast Rendering.
Proceedings of the 7th IEEE Visualization Conference, 1996

Stripe: a software tool for efficient triangle strips.
Proceedings of the ACM SIGGRAPH 96 Visual Proceedings: The art and interdisciplinary programs of SIGGRAPH 1996, 1996

On Minimum-Area Hulls (Extended Abstract).
Proceedings of the Algorithms, 1996

1995
Algorithms for Square Roots of Graphs.
SIAM J. Discret. Math., 1995

Recognizing small subgraphs.
Networks, 1995

Reconstructing Strings from Substrings.
J. Comput. Biol., 1995

Complexity aspects of visibility graphs.
Int. J. Comput. Geom. Appl., 1995

Unification Factoring for Efficient Execution of Logic Programs.
Proceedings of the Conference Record of POPL'95: 22nd ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 1995

Reconstructing Strings from Substrings in Rounds.
Proceedings of the 36th Annual Symposium on Foundations of Computer Science, 1995

1994
Hamilton Triangulations for Fast Rendering.
Proceedings of the Algorithms, 1994

1993
Reconstructing Strings from Substrings (Extended Abstract).
Proceedings of the Algorithms and Data Structures, Third Workshop, 1993

Point Probe Decision Trees for Geometric Concept Classes.
Proceedings of the Algorithms and Data Structures, Third Workshop, 1993

A Partial Digest Approach to Restriction Site Mapping.
Proceedings of the 1st International Conference on Intelligent Systems for Molecular Biology, 1993

Ranger: A Tool for Nearest Neighbor Search in High Dimensions.
Proceedings of the Ninth Annual Symposium on Computational Geometry, San Diego, 1993

Reconstructing Polygons From X-Rays.
Proceedings of the 5th Canadian Conference on Computational Geometry, 1993

1992
Interactive reconstruction via geometric probing.
Proc. IEEE, 1992

Model-based Probing Strategies for Convex Polygons.
Comput. Geom., 1992

Analyzing Integer Sequences.
Proceedings of the Computational Support for Discrete Mathematics, 1992

1991
Probing Convex Polygons with Half-Planes.
J. Algorithms, 1991

Tight bounds on a problem of lines and intersections.
Discret. Math., 1991

Inducing Codes from Examples.
Proceedings of the IEEE Data Compression Conference, 1991

1990
Searching on a Tape.
IEEE Trans. Computers, 1990

Counting k-projections of a point set.
J. Comb. Theory A, 1990

Reconstructing Sets from Interpoint Distances (Extended Abstract).
Proceedings of the Sixth Annual Symposium on Computational Geometry, 1990

Implementing discrete mathematics - combinatorics and graph theory with Mathematica.
Addison-Wesley, ISBN: 978-0-201-50943-4, 1990

1989
Reconstructing graphs from cut-set sizes.
Inf. Process. Lett., 1989

Eight Pieces Cannot Cover a Chess Board.
Comput. J., 1989

Problems in Geometric Probing.
Algorithmica, 1989

1988
Geometric Probing
PhD thesis, 1988

Probing Convex Polygons with X-Rays.
SIAM J. Comput., 1988

Tablet: Personal Computer in the Year 2000.
Commun. ACM, 1988

Encroaching Lists as a Measure of Presortedness.
BIT, 1988

1987
Further Evidence for Randomness in π.
Complex Syst., 1987

1986
An Overview of Machine Learning in Computer Chess.
J. Int. Comput. Games Assoc., 1986

1985
Compiler optimization by detecting recursive subprograms.
Proceedings of the 1985 ACM annual conference on The range of computing: mid-80's perspective, 1985

