Hongyu Zhang

Orcid: 0000-0002-3063-9425

Affiliations:
  • Chongqing University, China
  • University of Newcastle, School of Electrical Engineering and Computing, Callaghan, Australia (former)
  • Microsoft Research Asia, Beijing, China (former)
  • Tsinghua University, School of Software, Beijing, China (former)
  • National University of Singapore, School of Computing (PhD 2003)


According to our database1, Hongyu Zhang authored at least 302 papers between 2001 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit.
ACM Comput. Surv., December, 2024

The future of API analytics.
Autom. Softw. Eng., November, 2024

An empirical study of data sampling techniques for just-in-time software defect prediction.
Autom. Softw. Eng., November, 2024

Method-Level Test-to-Code Traceability Link Construction by Semantic Correlation Learning.
IEEE Trans. Software Eng., October, 2024

MTL-TRANSFER: Leveraging Multi-task Learning and Transferred Knowledge for Improving Fault Localization and Program Repair.
ACM Trans. Softw. Eng. Methodol., July, 2024

Try with Simpler - An Evaluation of Improved Principal Component Analysis in Log-based Anomaly Detection.
ACM Trans. Softw. Eng. Methodol., June, 2024

KADEL: Knowledge-Aware Denoising Learning for Commit Message Generation.
ACM Trans. Softw. Eng. Methodol., June, 2024

A New Look and Convergence Rate of Federated Multitask Learning With Laplacian Regularization.
IEEE Trans. Neural Networks Learn. Syst., June, 2024

Automatic recognizing relevant fragments of APIs using API references.
Autom. Softw. Eng., June, 2024

Reusing Convolutional Neural Network Models through Modularization and Composition.
ACM Trans. Softw. Eng. Methodol., March, 2024

High-dimensional Bayesian Optimization via Covariance Matrix Adaptation Strategy.
Trans. Mach. Learn. Res., 2024

Generative AI: Redefining the Future of Software Engineering.
IEEE Softw., 2024

LogSD: Detecting Anomalies from System Logs through Self-Supervised Learning and Frequency-Based Masking.
Proc. ACM Softw. Eng., 2024

BARO: Robust Root Cause Analysis for Microservices via Multivariate Bayesian Online Change Point Detection.
Proc. ACM Softw. Eng., 2024

IRCoCo: Immediate Rewards-Guided Deep Reinforcement Learning for Code Completion.
Proc. ACM Softw. Eng., 2024

Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study.
Proc. ACM Manag. Data, 2024

PreLog: A Pre-trained Model for Log Analytics.
Proc. ACM Manag. Data, 2024

Sifting through the Chaff: On Utilizing Execution Feedback for Ranking the Generated Code Candidates.
CoRR, 2024

Source Code Summarization in the Era of Large Language Models.
CoRR, 2024

Towards more realistic evaluation of LLM-based code generation: an experimental study and beyond.
CoRR, 2024

Hybrid Automated Program Repair by Combining Large Language Models and Program Analysis.
CoRR, 2024

Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach.
CoRR, 2024

VISION2UI: A Real-World Dataset with Layout for Code Generation from UI Designs.
CoRR, 2024

NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries.
CoRR, 2024

Between Lines of Code: Unraveling the Distinct Patterns of Machine and Human Programmers.
CoRR, 2024

SparseCoder: Identifier-Aware Sparse Transformer for File- Level Code Summarization.
Proceedings of the IEEE International Conference on Software Analysis, 2024

Guiding ChatGPT for Better Code Generation: An Empirical Study.
Proceedings of the IEEE International Conference on Software Analysis, 2024

An Empirical Study of Code Search in Intelligent Coding Assistant: Perceptions, Expectations, and Directions.
Proceedings of the Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering, 2024

FedDCSR: Federated Cross-domain Sequential Recommendation via Disentangled Representation Learning.
Proceedings of the 2024 SIAM International Conference on Data Mining, 2024

FedHCDR: Federated Cross-Domain Recommendation with Hypergraph Signal Decoupling.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

When to Stop? Towards Efficient Code Generation in LLMs with Excess Token Prevention.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

Graph Neural Networks for Vulnerability Detection: A Counterfactual Explanation.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

Bridge and Hint: Extending Pre-trained Language Models for Long-Range Code.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

Beyond Pairwise Testing: Advancing 3-wise Combinatorial Interaction Testing for Highly Configurable Systems.
Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2024

Efficient Log-based Anomaly Detection with Knowledge Distillation.
Proceedings of the IEEE International Conference on Web Services, 2024

OptLLM: Optimal Assignment of Queries to Large Language Models.
Proceedings of the IEEE International Conference on Web Services, 2024

DSFM: Enhancing Functional Code Clone Detection with Deep Subtree Interactions.
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

Modularizing while Training: A New Paradigm for Modularizing DNN Models.
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

LogShrink: Effective Log Compression by Leveraging Commonality and Variability of Log Data.
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

An Empirical Study on Low GPU Utilization of Deep Learning Jobs.
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

Local Search-based Approach for Cost-effective Job Assignment on Large Language Models.
Proceedings of the Genetic and Evolutionary Computation Conference Companion, 2024

Optimizing the Utilization of Large Language Models via Schedule Optimization: An Exploratory Study.
Proceedings of the 18th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, 2024

CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

NL2Formula: Generating Spreadsheet Formulas from Natural Language Queries.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Tackling Long Code Search with Splitting, Encoding, and Aggregating.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Prompt Tuning in Code Intelligence: An Experimental Evaluation.
IEEE Trans. Software Eng., November, 2023

CoCoAST: Representing Source Code via Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Empir. Softw. Eng., November, 2023

EnCoSum: enhanced semantic features for multi-scale multi-modal source code summarization.
Empir. Softw. Eng., October, 2023

Retrieving API Knowledge from Tutorials and Stack Overflow Based on Natural Language Queries.
ACM Trans. Softw. Eng. Methodol., September, 2023

DSSDPP: Data Selection and Sampling Based Domain Programming Predictor for Cross-Project Defect Prediction.
IEEE Trans. Software Eng., April, 2023

Challenging Machine Learning-Based Clone Detectors via Semantic-Preserving Code Transformations.
IEEE Trans. Software Eng., 2023

Understanding and predicting incident mitigation time.
Inf. Softw. Technol., 2023

Leveraging Stack Overflow to detect relevant tutorial fragments of APIs.
Empir. Softw. Eng., 2023

Enhancing Redundancy-based Automated Program Repair by Fine-grained Pattern Mining.
CoRR, 2023

On the Effectiveness of Large Language Models in Domain-Specific Code Generation.
CoRR, 2023

SoTaNa: The Open-Source Software Development Assistant.
CoRR, 2023

Provably Efficient Bayesian Optimization with Unbiased Gaussian Process Hyperparameter Estimation.
CoRR, 2023

An Evaluation of Log Parsing with ChatGPT.
CoRR, 2023

Improving ChatGPT Prompt for Code Generation.
CoRR, 2023

Semi-supervised and unsupervised anomaly detection by mining numerical workflow relations from system logs.
Autom. Softw. Eng., 2023

EDITS: An Easy-to-difficult Training Strategy for Cloud Failure Prediction.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Revisiting Code Search in a Two-Stage Paradigm.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

CAmpactor: A Novel and Effective Local Search Algorithm for Optimizing Pairwise Covering Arrays.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

xASTNN: Improved Code Representations for Industrial Practice.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

Revisiting and Improving Retrieval-Augmented Deep Assertion Generation.
Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering, 2023

Log Parsing: How Far Can ChatGPT Go?
Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering, 2023

What Makes Good In-Context Demonstrations for Code Intelligence Tasks with LLMs?
Proceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering, 2023

Detecting Condition-Related Bugs with Control Flow Graph Neural Network.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

Towards Efficient Fine-Tuning of Pre-trained Code Models: An Experimental Study and Beyond.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

You Augment Me: Exploring ChatGPT-based Data Augmentation for Semantic Code Search.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2023

TraceArk: Towards Actionable Performance Anomaly Alerting for Online Service Systems.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

Vulnerability Detection with Graph Simplification and Enhanced Graph Representation Learning.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

CoCoSoDa: Effective Contrastive Learning for Code Search.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Reusing Deep Neural Network Models through Model Re-engineering.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Template-based Neural Program Repair.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Incident-aware Duplicate Ticket Aggregation for Cloud Systems.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

CONAN: Diagnosing Batch Failures for Cloud Systems.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

Log Parsing with Prompt-based Few-shot Learning.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

Keeping Pace with Ever-Increasing Data: Towards Continual Learning of Code Intelligence Models.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering, 2023

An Empirical Study on Quality Issues of Deep Learning Platform.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

Runtime Performance Prediction for Deep Learning Models with Graph Neural Network.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

What are Pros and Cons? Stance Detection and Summarization on Feature Request.
Proceedings of the ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, 2023

Uncovering Limitations in Text-to-Image Generation: A Contrastive Approach with Structured Semantic Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Bi-channel Multiple Sparse Graph Attention Networks for Session-based Recommendation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
DreamLoc: A Deep Relevance Matching-Based Framework for bug Localization.
IEEE Trans. Reliab., 2022

A large-scale empirical study of commit message generation: models, datasets and evaluation.
Empir. Softw. Eng., 2022

Uncertainty-Aware Performance Prediction for Highly Configurable Software Systems via Bayesian Neural Networks.
CoRR, 2022

Enhanced Fairness Testing via Generating Effective Initial Individual Discriminatory Instances.
CoRR, 2022

LogGD: Detecting Anomalies from System Logs by Graph Neural Networks.
CoRR, 2022

Long Code for Code Search.
CoRR, 2022

Enhancing Semantic Code Search with Multimodal Contrastive Learning and Soft Data Augmentation.
CoRR, 2022

ECMG: Exemplar-based Commit Message Generation.
CoRR, 2022

Adaptive Performance Anomaly Detection for Online Service Systems via Pattern Sketching.
CoRR, 2022

Cross-Domain Deep Code Search with Few-Shot Meta Learning.
CoRR, 2022

UniParser: A Unified Log Parser for Heterogeneous Log Data.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Cross-Language Binary-Source Code Matching with Intermediate Representations.
Proceedings of the IEEE International Conference on Software Analysis, 2022

Diet code is healthy: simplifying programs for pre-trained models of code.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

SPINE: a scalable log parser with feedback guidance.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

No more fine-tuning? an experimental evaluation of prompt tuning in code intelligence.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

You see what I want you to see: poisoning vulnerabilities in neural code search.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

An empirical investigation of missing data handling in cloud node failure prediction.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

SamplingCA: effective and efficient sampling-based pairwise testing for highly configurable software systems.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

LogGD: Detecting Anomalies from System Logs with Graph Neural Networks.
Proceedings of the 22nd IEEE International Conference on Software Quality, 2022

A Collaboration-Aware Approach to Profiling Developer Expertise with Cross-Community Data.
Proceedings of the 22nd IEEE International Conference on Software Quality, 2022

LibDB: An Effective and Efficient Framework for Detecting Third-Party Libraries in Binaries.
Proceedings of the 19th IEEE/ACM International Conference on Mining Software Repositories, 2022

NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Multi-task Hierarchical Classification for Disk Failure Prediction in Online Service Systems.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Patching Weak Convolutional Neural Network Models through Modularization and Composition.
Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering, 2022

HELoC: hierarchical contrastive learning of source code representation.
Proceedings of the 30th IEEE/ACM International Conference on Program Comprehension, 2022

Where is Your App Frustrating Users?
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

What Do They Capture? - A Structural Analysis of Pre-Trained Language Models for Source Code.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

NaturalCC: An Open-Source Toolkit for Code Intelligence.
Proceedings of the 44th IEEE/ACM International Conference on Software Engineering: Companion Proceedings, 2022

On the Evaluation of Neural Code Summarization.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

Improving Fault Localization and Program Repair with Deep Semantic Features and Transferred Knowledge.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

Log-based Anomaly Detection with Deep Learning: How Far Are We?
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

REFTY: Refinement Types for Valid Deep Learning Models.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

Adaptive Performance Anomaly Detection for Online Service Systems via Pattern Sketching.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

Cross-Domain Deep Code Search with Meta Learning.
Proceedings of the 44th IEEE/ACM 44th International Conference on Software Engineering, 2022

RACE: Retrieval-augmented Commit Message Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Accelerating Code Search with Deep Hashing and Code Classification.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Coverage Prediction for Accelerating Compiler Testing.
IEEE Trans. Software Eng., 2021

CRaDLe: Deep code retrieval based on semantic Dependency Learning.
Neural Networks, 2021

Generating API tags for tutorial fragments from Stack Overflow.
Empir. Softw. Eng., 2021

Embedding API dependency graph for neural code generation.
Empir. Softw. Eng., 2021

A Survey of Compiler Testing.
ACM Comput. Surv., 2021

Neural Code Summarization: How Far Are We?
CoRR, 2021

CoCoSum: Contextual Code Summarization with Multi-Relational Graph Neural Network.
CoRR, 2021

FedU: A Unified Framework for Federated Multi-Task Learning with Laplacian Regularization.
CoRR, 2021

NTAM: Neighborhood-Temporal Attention Model for Disk Failure Prediction in Cloud Platforms.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Fighting the Fog of War: Automated Incident Detection for Cloud Systems.
Proceedings of the 2021 USENIX Annual Technical Conference, 2021

Onion: identifying incident-indicating logs for cloud systems.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

Intelligent container reallocation at Microsoft 365.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

LS-sampling: an effective local search based sampling approach for achieving high t-wise coverage.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

Effective low capacity status prediction for cloud systems.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

HALO: Hierarchy-aware Fault Localization for Cloud Systems.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Log-based Anomaly Detection Without Log Parsing.
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

Graph-based Incident Aggregation for Large-Scale Online Service Systems.
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

How Long Will it Take to Mitigate this Incident for Online Service Systems?
Proceedings of the 32nd IEEE International Symposium on Software Reliability Engineering, 2021

LogDP: Combining Dependency and Proximity for Log-Based Anomaly Detection.
Proceedings of the Service-Oriented Computing - 19th International Conference, 2021

On the Evaluation of Commit Message Generation Models: An Experimental Study.
Proceedings of the IEEE International Conference on Software Maintenance and Evolution, 2021

Fast Outage Analysis of Large-scale Production Clouds with Service Correlation Mining.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

AutoCCAG: An Automated Approach to Constrained Covering Array Generation.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

Resource-Guided Configuration Space Reduction for Deep Learning Models.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

Efficient Compiler Autotuning via Bayesian Optimization.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

CAST: Enhancing Code Summarization with Hierarchical Splitting and Reconstruction of Abstract Syntax Trees.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

APIRecX: Cross-Library API Recommendation via Pre-Trained Language Model.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Do Bugs Propagate? An Empirical Analysis of Temporal Correlations Among Software Bugs.
Proceedings of the 35th European Conference on Object-Oriented Programming, 2021

PULNS: Positive-Unlabeled Learning with Effective Negative Sample Selector.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Correlation-Aware Heuristic Search for Intelligent Virtual Machine Provisioning in Cloud Systems.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Data-driven approach to application programming interface documentation mining: A review.
WIREs Data Mining Knowl. Discov., 2020

Developer recommendation for Topcoder through a meta-learning based policy model.
Empir. Softw. Eng., 2020

Anomaly Detection via Mining Numerical Workflow Relations from Logs.
Proceedings of the International Symposium on Reliable Distributed Systems, 2020

How to mitigate the incident? an effective troubleshooting guide recommendation technique for online service systems.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Efficient incident identification from multi-dimensional issue reports via meta-heuristic search.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Estimating GPU memory consumption of deep learning models.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Identifying linked incidents in large-scale online service systems.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Towards intelligent incident management: why we need it and how we make it.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Learning to Handle Exceptions.
Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020

How Incidental are the Incidents? Characterizing and Prioritizing Incidents for Large-Scale Online Service Systems.
Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020

Cost-Effective Testing of a Deep Learning Model through Input Reduction.
Proceedings of the 31st IEEE International Symposium on Software Reliability Engineering, 2020

Intelligent Virtual Machine Provisioning in Cloud Computing.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

An empirical study on program failures of deep learning jobs.
Proceedings of the ICSE '20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June, 2020

Retrieval-based neural source code summarization.
Proceedings of the ICSE '20: 42nd International Conference on Software Engineering, Seoul, South Korea, 27 June, 2020

Perf-AL: Performance Prediction for Configurable Software through Adversarial Learning.
Proceedings of the ESEM '20: ACM / IEEE International Symposium on Empirical Software Engineering and Measurement, 2020

2019
On the Multiple Sources and Privacy Preservation Issues for Heterogeneous Defect Prediction.
IEEE Trans. Software Eng., 2019

Improving the reliability of implicit averaging methods using new conditional operators for robust optimization.
Swarm Evol. Comput., 2019

Does the fault reside in a stack trace? Assisting crash localization by predicting crashing fault residence.
J. Syst. Softw., 2019

Heterogeneous defect prediction with two-stage ensemble learning.
Autom. Softw. Eng., 2019

Outage Prediction and Diagnosis for Cloud Service Systems.
Proceedings of the World Wide Web Conference, 2019

Cross-dataset Time Series Anomaly Detection for Cloud Systems.
Proceedings of the 2019 USENIX Annual Technical Conference, 2019

A Novel U-Shaped Transfer Function for Binary Particle Swarm Optimisation.
Proceedings of the Soft Computing for Problem Solving 2019, 2019

Robust log-based anomaly detection on unstable log data.
Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2019

Towards more efficient meta-heuristic algorithms for combinatorial test generation.
Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2019

Ares: Inferring Error Specifications through Static Analysis.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

CodeKernel: A Graph Kernel Based Approach to the Selection of API Usage Examples.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

History-Guided Configuration Diversification for Compiler Test-Program Generation.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

Continuous Incident Triage for Large-Scale Online Service Systems.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

Local Search with Efficient Automatic Configuration for Minimum Vertex Cover.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

AutoMR: Automatic Discovery and Cleansing of Numerical Metamorphic Relations.
Proceedings of the 2019 IEEE International Conference on Software Maintenance and Evolution, 2019

Automatic Discovery and Cleansing of Numerical Metamorphic Relations.
Proceedings of the 2019 IEEE International Conference on Software Maintenance and Evolution, 2019

Performance-Influence Model for Highly Configurable Software with Fourier Learning and Lasso Regression.
Proceedings of the 2019 IEEE International Conference on Software Maintenance and Evolution, 2019

A novel neural source code representation based on abstract syntax tree.
Proceedings of the 41st International Conference on Software Engineering, 2019

DeepPerf: performance prediction for configurable software with deep sparse neural network.
Proceedings of the 41st International Conference on Software Engineering, 2019

An empirical investigation of incident triage for online service systems.
Proceedings of the 41st International Conference on Software Engineering: Software Engineering in Practice, 2019

Neural Feature Search: A Neural Architecture for Automated Feature Engineering.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

2018
Editorial: Special Section on Best Papers of PROMISE 2016.
Inf. Softw. Technol., 2018

ChangeLocator: locate crash-inducing changes based on crash reports.
Empir. Softw. Eng., 2018

Improving Service Availability of Cloud Systems by Predicting Disk Error.
Proceedings of the 2018 USENIX Annual Technical Conference, 2018

Predicting Node failure in cloud service systems.
Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2018

Identifying impactful service system problems via log analysis.
Proceedings of the 2018 ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2018

BigIN4: Instant, Interactive Insight Identification for Multi-Dimensional Big Data.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Shaping program repair space with existing patches and similar code.
Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis, 2018

Automatically answering API-related questions.
Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, 2018

Deep code search.
Proceedings of the 40th International Conference on Software Engineering, 2018

Live path control flow integrity.
Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, 2018

Live Path CFI Against Control Flow Hijacking Attacks.
Proceedings of the Information Security and Privacy - 23rd Australasian Conference, 2018

2017
Testing and Verification of Compilers (Dagstuhl Seminar 17502).
Dagstuhl Reports, 2017

Developer recommendation on bug commenting: a ranking approach for the developer crowd.
Sci. China Inf. Sci., 2017

DeepAM: Migrate APIs with Multi-modal Sequence to Sequence Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Heterogeneous Defect Prediction Through Multiple Kernel Learning and Ensemble Learning.
Proceedings of the 2017 IEEE International Conference on Software Maintenance and Evolution, 2017

Cross-project and within-project semi-supervised software defect prediction problems study using a unified solution.
Proceedings of the 39th International Conference on Software Engineering, 2017

Learning to prioritize test programs for compiler testing.
Proceedings of the 39th International Conference on Software Engineering, 2017

Neural Programming by Example.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Diversity maximization speedup for localizing faults in single-fault and multi-fault programs.
Autom. Softw. Eng., 2016

Bing developer assistant: improving developer productivity by recommending sample code.
Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2016

Deep API learning.
Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2016

Casper: an efficient approach to call trace collection.
Proceedings of the 43rd Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 2016

Test Case Prioritization for Compilers: A Text-Vector Based Approach.
Proceedings of the 2016 IEEE International Conference on Software Testing, 2016

Log clustering based problem identification for online service systems.
Proceedings of the 38th International Conference on Software Engineering, 2016

iDice: problem identification for emerging issues.
Proceedings of the 38th International Conference on Software Engineering, 2016

An empirical comparison of compiler testing techniques.
Proceedings of the 38th International Conference on Software Engineering, 2016

Improving Failure Detection by Automatically Generating Test Cases Near the Boundaries.
Proceedings of the 40th IEEE Annual Computer Software and Applications Conference, 2016

How to tame your online services.
Proceedings of the Perspectives on Data Science for Software Engineering, 2016

Visual analytics for software engineering data.
Proceedings of the Perspectives on Data Science for Software Engineering, 2016

2015
Improving Software Quality and Productivity Leveraging Mining Techniques: [Summary of the Second Workshop on Software Mining, at ASE 2013].
ACM SIGSOFT Softw. Eng. Notes, 2015

Log2: A Cost-Aware Logging Mechanism for Performance Diagnosis.
Proceedings of the 2015 USENIX Annual Technical Conference, 2015

CodeHow: Effective Code Search Based on API Understanding and Extended Boolean Model (E).
Proceedings of the 30th IEEE/ACM International Conference on Automated Software Engineering, 2015

Learning to Log: Helping Developers Make Informed Logging Decisions.
Proceedings of the 37th IEEE/ACM International Conference on Software Engineering, 2015

An Empirical Study on Quality Issues of Production Big Data Platform.
Proceedings of the 37th IEEE/ACM International Conference on Software Engineering, 2015

2014
Locating Crashing Faults based on Crash Stack Traces.
CoRR, 2014

Querying sequential software engineering data.
Proceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering, (FSE-22), Hong Kong, China, November 16, 2014

Mining key and referential constraints enforcement patterns.
Proceedings of the Symposium on Applied Computing, 2014

SymCrash: selective recording for reproducing crashes.
Proceedings of the ACM/IEEE International Conference on Automated Software Engineering, 2014

CrashLocator: locating crashing faults based on crash stacks.
Proceedings of the International Symposium on Software Testing and Analysis, 2014

Effective Bug Triage Based on Historical Bug-Fix Information.
Proceedings of the 25th IEEE International Symposium on Software Reliability Engineering, 2014

Boosting Bug-Report-Oriented Fault Localization with Segmentation and Stack-Trace Analysis.
Proceedings of the 30th IEEE International Conference on Software Maintenance and Evolution, Victoria, BC, Canada, September 29, 2014

ABOR: An Automatic Framework for Buffer Overflow Removal in C/C++Programs.
Proceedings of the Enterprise Information Systems - 16th International Conference, 2014

Automatic Removal of Buffer Overflow Vulnerabilities in C/C++ Programs.
Proceedings of the ICEIS 2014, 2014

Identifying Recurrent and Unknown Performance Issues.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Detecting infeasible branches based on code patterns.
Proceedings of the 2014 Software Evolution Week, 2014

Towards a Consistent Feature Model using OWL.
Proceedings of the Semantic Web Enabled Software Engineering, 2014

2013
Balancing Privacy and Utility in Cross-Company Defect Prediction.
IEEE Trans. Software Eng., 2013

Simulation of software maintenance process, with and without a work-in-process limit.
J. Softw. Evol. Process., 2013

Has this bug been reported?
Proceedings of the 20th Working Conference on Reverse Engineering, 2013

A cost-effectiveness criterion for applying software defect prediction models.
Proceedings of the Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering, 2013

BugMap: a topographic map of bugs.
Proceedings of the Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering, 2013

Mining succinct and high-coverage API usage patterns from source code.
Proceedings of the 10th Working Conference on Mining Software Repositories, 2013

Predicting bug-fixing time: an empirical study of commercial software projects.
Proceedings of the 35th International Conference on Software Engineering, 2013

Is This a Bug or an Obsolete Test?
Proceedings of the ECOOP 2013 - Object-Oriented Programming, 2013

2012
Sample-based software defect prediction with active and semi-supervised learning.
Autom. Softw. Eng., 2012

A Comparative Study of Scrum and Kanban Approaches on a Real Case Study Using Simulation.
Proceedings of the Agile Processes in Software Engineering and Extreme Programming, 2012

Diversity maximization speedup for fault localization.
Proceedings of the IEEE/ACM International Conference on Automated Software Engineering, 2012

Interactive fault localization leveraging simple user feedback.
Proceedings of the 28th IEEE International Conference on Software Maintenance, 2012

Where should the bugs be fixed? More accurate information retrieval-based bug localization based on bug reports.
Proceedings of the 34th International Conference on Software Engineering, 2012

ReBucket: A method for clustering duplicate crash reports based on call stack similarity.
Proceedings of the 34th International Conference on Software Engineering, 2012

Predicting defect numbers based on defect state transition models.
Proceedings of the 2012 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement, 2012

Detection of Buffer Overflow Vulnerabilities in C/C++ with Pattern Based Limited Symbolic Evaluation.
Proceedings of the 36th Annual IEEE Computer Software and Applications Conference Workshops, 2012

Learning to rank duplicate bug reports.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Modeling and Verification of Context-Aware Systems.
Proceedings of the 19th Asia-Pacific Software Engineering Conference, 2012

2011
Checking enforcement of integrity constraints in database applications based on code patterns.
J. Syst. Softw., 2011

ReLink: recovering links between bugs and changes.
Proceedings of the SIGSOFT/FSE'11 19th ACM SIGSOFT Symposium on the Foundations of Software Engineering (FSE-19) and ESEC'11: 13th European Software Engineering Conference (ESEC-13), 2011

Automated Extraction of Data Lifecycle Support from Database Applications.
Proceedings of the 23rd International Conference on Software Engineering & Knowledge Engineering (SEKE'2011), 2011

Integrating software engineering data using semantic web technologies.
Proceedings of the 8th International Working Conference on Mining Software Repositories, 2011

University-Industry Collaboration Journey towards Product Lines.
Proceedings of the Top Productivity through Software Reuse, 2011

Dealing with noise in defect prediction.
Proceedings of the 33rd International Conference on Software Engineering, 2011

Workshop on emerging trends in software metrics: (WETSoM 2011).
Proceedings of the 33rd International Conference on Software Engineering, 2011

2010
Monitoring Software Quality Evolution for Defects.
IEEE Softw., 2010

2010 ICSE workshop on emerging trends in software metrics.
ACM SIGSOFT Softw. Eng. Notes, 2010

Measuring design complexity of semantic web ontologies.
J. Syst. Softw., 2010

New Generation of Software Metrics.
Adv. Softw. Eng., 2010

A Hybrid Approach to Feature-Oriented Programming in XVCL.
Proceedings of the Software Product Lines: Going Beyond - 14th International Conference, 2010

On the value of learning from defect dense components for software defect prediction.
Proceedings of the 6th International Conference on Predictive Models in Software Engineering, 2010

Sampling program quality.
Proceedings of the 26th IEEE International Conference on Software Maintenance (ICSM 2010), 2010

Automatic checking of license compliance.
Proceedings of the 26th IEEE International Conference on Software Maintenance (ICSM 2010), 2010

Workshop on Emerging Trends in Software Metrics (WETSoM 2010).
Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering, 2010

2009
Conceptual data model-based software size estimation for information systems.
ACM Trans. Softw. Eng. Methodol., 2009

Discovering power laws in computer programs.
Inf. Process. Manag., 2009

The Distribution of Program Sizes and Its Implications: An Eclipse Case Study
CoRR, 2009

Understanding Chinese Characteristics of Requirements Engineering.
Proceedings of the RE 2009, 17th IEEE International Requirements Engineering Conference, Atlanta, Georgia, USA, August 31, 2009

An investigation of the relationships between lines of code and defects.
Proceedings of the 25th IEEE International Conference on Software Maintenance (ICSM 2009), 2009

Increasing usability of preprocessing for feature management in product lines with queries.
Proceedings of the 31st International Conference on Software Engineering, 2009

Preference Model Driven Services Selection.
Proceedings of the Advanced Information Systems Engineering, 21st International Conference, 2009

Avoiding Some Common Preprocessing Pitfalls with Feature Queries.
Proceedings of the 16th Asia-Pacific Software Engineering Conference, 2009

2008
On the Distribution of Software Faults.
IEEE Trans. Software Eng., 2008

The scale-free nature of semantic web ontology.
Proceedings of the 17th International Conference on World Wide Web, 2008

Exploring Regularity in Source Code: Software Science and Zipf's Law.
Proceedings of the WCRE 2008, 2008

An initial study of the growth of eclipse defects.
Proceedings of the 2008 International Working Conference on Mining Software Repositories, 2008

2007
Verifying feature models using OWL.
J. Web Semant., 2007

Comments on "Data Mining Static Code Attributes to Learn Defect Predictors".
IEEE Trans. Software Eng., 2007

Reuse of Database Access Layer Components in JEE Product Lines: Limitations and a Possible Solution (Case Study).
Proceedings of the Nineteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2007), 2007

Predicting Defective Software Components from Code Complexity Measures.
Proceedings of the 13th IEEE Pacific Rim International Symposium on Dependable Computing (PRDC 2007), 2007

An Empirical Study of Class Sizes for Large Java Systems.
Proceedings of the 14th Asia-Pacific Software Engineering Conference (APSEC 2007), 2007

2006
Analysis of Meta-programs: an Example.
Int. J. Softw. Eng. Knowl. Eng., 2006

Estimating LOC for information systems from their conceptual data models.
Proceedings of the 28th International Conference on Software Engineering (ICSE 2006), 2006

2005
A Bayesian Network Approach to Rational Architectural Design.
Int. J. Softw. Eng. Knowl. Eng., 2005

Verify Feature Models using protegeowl.
Proceedings of the 14th international conference on World Wide Web, 2005

Formal Semantics and Verification for Feature Modeling.
Proceedings of the 10th International Conference on Engineering of Complex Computer Systems (ICECCS 2005), 2005

2004
XVCL: a mechanism for handling variants in software product lines.
Sci. Comput. Program., 2004

Analysis of meta-programs: a case study.
Proceedings of the Sixteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2004), 2004

2003
Handling variant requirements in domain modeling.
J. Syst. Softw., 2003

An XVCL-based Approach to Software Product Line Development.
Proceedings of the Fifteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2003), 2003

XVCL: XML-based Variant Configuration Language.
Proceedings of the 25th International Conference on Software Engineering, 2003

Quality Prediction and Assessment for Product Lines.
Proceedings of the Advanced Information Systems Engineering, 15th International Conference, 2003

An XVCL Approach to Handling Variants: A KWIC Product Line Example.
Proceedings of the 10th Asia-Pacific Software Engineering Conference (APSEC 2003), 2003

2002
XVCL: a tutorial.
Proceedings of the 14th international conference on Software engineering and knowledge engineering, 2002

2001
XML implementation of frame processor.
Proceedings of the ACM SIGSOFT Symposium on Software Reusability: Putting Software Reuse in Context, 2001

Enhacing component reuse with control flow abstraction analysis.
Proceedings of the Thirteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2001), 2001

Handling Variant Requirements in 9omain Modeling.
Proceedings of the Thirteenth International Conference on Software Engineering & Knowledge Engineering (SEKE'2001), 2001

XML-Based Method and Tool for Handling Variant Requirements in Domain Models.
Proceedings of the 5th IEEE International Symposium on Requirements Engineering (RE 2001), 2001

XVCL Approach to Separating Concerns in Product Family Assets.
Proceedings of the Generative and Component-Based Software Engineering, 2001


  Loading...