Steffen Herbold

Orcid: 0000-0001-9765-2803

According to our database1, Steffen Herbold authored at least 92 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Studying the explanations for the automated prediction of bug and non-bug issues using LIME and SHAP.
Empir. Softw. Eng., July, 2024

A new perspective on the competent programmer hypothesis through the reproduction of real faults with repeated mutations.
Softw. Test. Verification Reliab., May, 2024

Semantic similarity prediction is better than other semantic similarity measures.
Trans. Mach. Learn. Res., 2024

Large Language Models can impersonate politicians and other public figures.
CoRR, 2024

Legal Aspects for Software Developers Interested in Generative AI Applications.
CoRR, 2024

Question Type Prediction in Natural Debate.
Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2024

2023
Galba: genome annotation with miniprot and AUGUSTUS.
BMC Bioinform., December, 2023

Are automated static analysis tools worth it? An investigation into relative warning density and external software quality on the example of Apache open source projects.
Empir. Softw. Eng., June, 2023

On the Validity of Pre-Trained Transformers for Natural Language Processing in the Software Engineering Domain.
IEEE Trans. Software Eng., April, 2023

What really changes when developers intend to improve their source code: a commit-level study of static metric value and static analysis warning changes.
Empir. Softw. Eng., March, 2023

Differential testing for machine learning: an analysis for classification algorithms beyond deep learning.
Empir. Softw. Eng., March, 2023

AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays.
CoRR, 2023

Understanding issues related to personal data and data protection in open source projects on GitHub.
CoRR, 2023

An exploratory study of bug-introducing changes: what happens when bugs are introduced in open source software?
CoRR, 2023

What are the Machine Learning best practices reported by practitioners on Stack Exchange?
CoRR, 2023

On Using Information Retrieval to Recommend Machine Learning Good Practices for Software Engineers.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

Problems with with SZZ and Features: An empirical assessment of the state of practice of defect prediction data collection.
Proceedings of the Software Engineering 2023, 2023

On the Impact of Reconstruction and Context for Argument Prediction in Natural Debate.
Proceedings of the 10th Workshop on Argument Mining, 2023

2022
SmartSHARK 2.2 Small.
Dataset, July, 2022

SmartSHARK 2.2 Full.
Dataset, July, 2022

SmartSHARK 1.3.
Dataset, July, 2022

SmartSHARK 2.1 Full.
Dataset, July, 2022

Exploring the relationship between performance metrics and cost saving potential of defect prediction models.
Empir. Softw. Eng., 2022

Problems with SZZ and features: An empirical study of the state of practice of defect prediction data collection.
Empir. Softw. Eng., 2022

A fine-grained data set and analysis of tangling in bug fixing commits.
Empir. Softw. Eng., 2022

Smoke testing for machine learning: simple tests to discover severe bugs.
Empir. Softw. Eng., 2022

Studying the explanations for the automated prediction of bug and non-bug issues using LIME and SHAP.
CoRR, 2022

Predicting Issue Types with seBERT.
Proceedings of the 2022 IEEE/ACM 1st International Workshop on Natural Language-Based Software Engineering (NLBSE 2022), 2022

2021
SmartSHARK 2.2 Small.
Dataset, December, 2021

SmartSHARK 2.2 Full.
Dataset, December, 2021

SmartSHARK 1.3.
Dataset, December, 2021

SmartSHARK 2.1 Full.
Dataset, November, 2021

SmartSHARK 2.1 Full.
Dataset, September, 2021

On the Costs and Profit of Software Defect Prediction.
IEEE Trans. Software Eng., 2021

A systematic mapping study of developer social network research.
J. Syst. Softw., 2021

Are automated static analysis tools worth it? An investigation into relative warning density and external software quality.
CoRR, 2021

Broccoli: Bug localization with the help of text search engines.
CoRR, 2021

On the differences between quality increasing and other changes in open source Java projects.
CoRR, 2021

A new perspective on the competent programmer hypothesis through the reproduction of bugs with repeated mutations.
CoRR, 2021

Exploring the relationship between performance metrics and cost saving potential of defect prediction models.
CoRR, 2021

Expert decision support system for aeroacoustic classification.
CoRR, 2021

The SmartSHARK Repository Mining Data.
CoRR, 2021

On the Cost and Profit of Software Defect Prediction.
Proceedings of the Software Engineering 2021, 2021

2020
Are unit and integration test definitions still valid for modern Java projects? An empirical study on open-source projects.
J. Syst. Softw., 2020

Autorank: A Python package for automated ranking of classifiers.
J. Open Source Softw., 2020

GIMO: A multi-objective anytime rule mining system to ease iterative feedback from domain experts.
Expert Syst. Appl. X, 2020

A longitudinal study of static analysis warning evolution and the effects of PMD on software quality in Apache open source projects.
Empir. Softw. Eng., 2020

Correction to: On the feasibility of automated prediction of bug and non-bug issues.
Empir. Softw. Eng., 2020

On the feasibility of automated prediction of bug and non-bug issues.
Empir. Softw. Eng., 2020

Automatic source localization and spectra generation from deconvolved beamforming maps.
CoRR, 2020

Large-Scale Manual Validation of Bug Fixing Commits: A Fine-grained Analysis of Tangling.
CoRR, 2020

Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects.
CoRR, 2020

On the Feasibility of Automated Issue Type Prediction.
CoRR, 2020

Large-Scale Manual Validation of Bugfixing Changes.
Proceedings of the MSR '20: 17th International Conference on Mining Software Repositories, 2020

Static source code metrics and static analysis warnings for fine-grained just-in-time defect prediction.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2020

The SmartSHARK ecosystem for software repository mining.
Proceedings of the ICSE '20: 42nd International Conference on Software Engineering, Companion Volume, Seoul, South Korea, 27 June, 2020

With registered reports towards large scale data curation.
Proceedings of the ICSE-NIER 2020: 42nd International Conference on Software Engineering, New Ideas and Emerging Results, Seoul, South Korea, 27 June, 2020

2019
Correction of "A Comparative Study to Benchmark Cross-Project Defect Prediction Approaches".
IEEE Trans. Software Eng., 2019

Issues with SZZ: An empirical assessment of the state of practice of defect prediction data collection.
CoRR, 2019

Chapter Seven - Experiences With Replicable Experiments and Replication Kits for Software Engineering Research.
Adv. Comput., 2019

2018
A Comparative Study to Benchmark Cross-Project Defect Prediction Approaches.
IEEE Trans. Software Eng., 2018

Addressing problems with replicability and validity of repository mining studies through a smart data platform.
Empir. Softw. Eng., 2018

A Multi-Objective Anytime Rule Mining System to Ease Iterative Feedback from Domain Experts.
CoRR, 2018

An Industrial Case Study on Shrinking Code Review Changesets through Remark Prediction.
CoRR, 2018

Benchmarking cross-project defect prediction approaches with costs metrics.
CoRR, 2018

2017
Comments on ScottKnottESD in Response to "An Empirical Comparison of Model Validation Techniques for Defect Prediction Models".
IEEE Trans. Software Eng., 2017

Combining usage-based and model-based testing for service-oriented architectures in the industrial practice.
Int. J. Softw. Tools Technol. Transf., 2017

Model-based testing as a service.
Int. J. Softw. Tools Technol. Transf., 2017

Global vs. local models for cross-project defect prediction - A replication study.
Empir. Softw. Eng., 2017

A systematic mapping study on cross-project defect prediction.
CoRR, 2017

Performance tuning for automotive Software Fault Prediction.
Proceedings of the IEEE 24th International Conference on Software Analysis, 2017

On the Relatively Small Impact of Deep Dependencies on Cloud Application Reliability.
Proceedings of the 2017 IEEE 10th International Conference on Cloud Computing (CLOUD), 2017

2016
Hidden Markov Models for the Prediction of Developer Involvement Dynamics and Workload.
Proceedings of the The 12th International Conference on Predictive Models and Data Analytics in Software Engineering, 2016

Learning from Software Project Histories - Predictive Studies Based on Mining Software Repositories.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016

Adressing problems with external validity of repository mining studies through a smart data platform.
Proceedings of the 13th International Conference on Mining Software Repositories, 2016

2015
Effizientere IT-Sicherheitstest mit Hilfe von Usagebased Testing.
Softwaretechnik-Trends, 2015

Automated Deployment and Parallel Execution of Legacy Applications in Cloud Environments (Short Paper).
Proceedings of the 8th IEEE International Conference on Service-Oriented Computing and Applications, 2015

Improving Security Testing with Usage-Based Fuzz Testing.
Proceedings of the Risk Assessment and Risk-Driven Testing - Third International Workshop, 2015

Novel Insights on Cross Project Fault Prediction Applied to Automotive Software.
Proceedings of the Testing Software and Systems, 2015

Intuition vs. Truth: Evaluation of Common Myths about StackOverflow Posts.
Proceedings of the 12th IEEE/ACM Working Conference on Mining Software Repositories, 2015

Mining Software Dependency Networks for Agent-Based Simulation of Software Evolution.
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering Workshops, 2015

CrossPare: A Tool for Benchmarking Cross-Project Defect Predictions.
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering Workshops, 2015


2014
A Generalized Model of PAC Learning and its Applicability.
RAIRO Theor. Informatics Appl., 2014

2013
Training data selection for cross-project defect prediction.
Proceedings of the 9th International Conference on Predictive Models in Software Engineering, 2013

AutoQUEST - Automated Quality Engineering of Event-Driven Software.
Proceedings of the Sixth IEEE International Conference on Software Testing, 2013

2012
Usage-based Testing of Event-driven Software.
PhD thesis, 2012

Deployable Capture/Replay Supported by Internal Messages.
Adv. Comput., 2012

2011
Calculation and optimization of thresholds for sets of software metrics.
Empir. Softw. Eng., 2011

A Model for Usage-Based Testing of Event-Driven Software.
Proceedings of the Fifth International Conference on Secure Software Integration and Reliability Improvement, 2011

Improved Bug Reporting and Reproduction through Non-intrusive GUI Usage Monitoring and Automated Replaying.
Proceedings of the Fourth IEEE International Conference on Software Testing, 2011

2010
Retrospective Analysis of Software Projects using k-Means Clustering.
Softwaretechnik-Trends, 2010


  Loading...