Baishakhi Ray

Orcid: 0000-0003-3406-5235

According to our database1, Baishakhi Ray authored at least 120 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Yuga: Automatically Detecting Lifetime Annotation Bugs in the Rust Language.
IEEE Trans. Software Eng., October, 2024

Automated Code Editing With Search-Generate-Modify.
IEEE Trans. Software Eng., July, 2024

Code-Aware Prompting: A Study of Coverage-Guided Test Generation in Regression Setting using LLM.
Proc. ACM Softw. Eng., 2024

CYCLE: Learning to Self-Refine the Code Generation.
Proc. ACM Program. Lang., 2024

Comment on Revisiting Neural Program Smoothing for Fuzzing.
CoRR, 2024

On Mitigating Code LLM Hallucinations with API Documentation.
CoRR, 2024

Solving Zebra Puzzles Using Constraint-Guided Multi-Agent Systems.
CoRR, 2024

KGym: A Platform and Dataset to Benchmark Large Language Models on Linux Kernel Crash Resolution.
CoRR, 2024

SemCoder: Training Code Language Models with Comprehensive Semantics.
CoRR, 2024

Training LLMs to Better Self-Debug and Explain Code.
CoRR, 2024

SpecTra: Enhancing the Code Translation Ability of Language Models by Generating Multi-Modal Specifications.
CoRR, 2024

Automatic Programming: Large Language Models and Beyond.
CoRR, 2024

Vulnerability Detection with Code Language Models: How Far Are We?
CoRR, 2024

Towards Causal Deep Learning for Vulnerability Detection.
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

TRACED: Execution-aware Pre-training for Source Code.
Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, 2024

Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

CodeFort: Robust Training for Code Generation Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PropTest: Automatic Property Testing for Improved Visual Programming.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Neural Network Guided Evolutionary Fuzzing for Finding Traffic Violations of Autonomous Vehicles.
IEEE Trans. Software Eng., April, 2023

Learning Approximate Execution Semantics From Traces for Binary Function Similarity.
IEEE Trans. Software Eng., April, 2023

Programming Language Processing (Dagstuhl Seminar 23062).
Dagstuhl Reports, February, 2023

A Static Evaluation of Code Completion by Large Language Models.
CoRR, 2023

TraceFixer: Execution Trace-Driven Program Repair.
CoRR, 2023

Variation of Gender Biases in Visual Recognition Models Before and After Finetuning.
CoRR, 2023

Greener yet Powerful: Taming Large Code Generation Models with Quantization.
CoRR, 2023

IvySyn: Automated Vulnerability Discovery in Deep Learning Frameworks.
Proceedings of the 32nd USENIX Security Symposium, 2023

Towards Greener Yet Powerful Code Generation via Quantization: An Empirical Study.
Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023

CONCORD: Clone-Aware Contrastive Learning for Source Code.
Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis, 2023

Programming Language Processing : How AI can Revolutionize Software Development?
Proceedings of the 16th Innovations in Software Engineering Conference, 2023

On ML-Based Program Translation: Perils and Promises.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: New Ideas and Emerging Results, 2023

Guided Conditional Diffusion for Controllable Traffic Simulation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Summarize and Generate to Back-translate: Unsupervised Translation of Programming Languages.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Language-Guided Traffic Simulation via Scene-Level Diffusion.
Proceedings of the Conference on Robot Learning, 2023

CAMEO: A Causal Transfer Learning Approach for Performance Optimization of Configurable Computer Systems.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

ContraCLM: Contrastive Learning For Causal Language Model.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A Static Evaluation of Code Completion by Large Language Models.
Proceedings of the The 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

ReCode: Robustness Evaluation of Code Generation Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
ConEx: Efficient Exploration of Big-Data System Configurations for Better Performance.
IEEE Trans. Software Eng., 2022

Deep Learning Based Vulnerability Detection: Are We There Yet?
IEEE Trans. Software Eng., 2022

CODIT: Code Editing With Tree-Based Neural Models.
IEEE Trans. Software Eng., 2022

Multi-lingual Evaluation of Code Generation Models.
CoRR, 2022

ContraGen: Effective Contrastive Learning For Causal Language Model.
CoRR, 2022

Automatic Map Generation for Autonomous Driving System Testing.
CoRR, 2022

Repairing Group-Level Errors for DNNs Using Weighted Regularization.
CoRR, 2022

VELVET: a noVel Ensemble Learning approach to automatically locate VulnErable sTatements.
Proceedings of the IEEE International Conference on Software Analysis, 2022

NeuDep: neural binary memory dependence analysis.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

NatGen: generative pre-training by "naturalizing" source code.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

Predictive synthesis of API-centric code.
Proceedings of the MAPS@PLDI 2022: 6th ACM SIGPLAN International Symposium on Machine Programming, 2022

CARGO: AI-Guided Dependency Analysis for Migrating Monolithic Applications to Microservices Architecture.
Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering, 2022

Detecting multi-sensor fusion errors in advanced driver-assistance systems.
Proceedings of the ISSTA '22: 31st ACM SIGSOFT International Symposium on Software Testing and Analysis, Virtual Event, South Korea, July 18, 2022

Unicorn: reasoning about configurable system performance through the lens of causality.
Proceedings of the EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5, 2022

Towards Learning (Dis)-Similarity of Source Code from Program Contrasts.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
A Survey on Scenario-Based Testing for Automated Driving Systems in High-Fidelity Simulation.
CoRR, 2021

Contrastive Learning for Source Code with Structural and Functional Properties.
CoRR, 2021

Detecting Safety Problems of Multi-Sensor Fusion in Autonomous Driving.
CoRR, 2021

StateFormer: fine-grained type recovery from binaries using generative state modeling.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

Unified Pre-training for Program Understanding and Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

On Multi-Modal Learning of Editing Source Code.
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

Understanding Local Robustness of Deep Neural Networks under Natural Variations.
Proceedings of the Fundamental Approaches to Software Engineering, 2021

Retrieval Augmented Code Generation and Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

PMFuzz: test case generation for persistent memory programs.
Proceedings of the ASPLOS '21: 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021

2020
Trex: Learning Execution Semantics from Micro-Traces for Binary Similarity.
CoRR, 2020

CADET: A Systematic Method For Debugging Misconfigurations using Counterfactual Reasoning.
CoRR, 2020

Understanding Spatial Robustness of Deep Neural Networks.
CoRR, 2020

Deep Learning & Software Engineering: State of Research and Future Directions.
CoRR, 2020

Pythia: Grammar-Based Fuzzing of REST APIs with Coverage-guided Feedback and Learning-based Mutations.
CoRR, 2020

Neutaint: Efficient Dynamic Taint Analysis with Neural Networks.
Proceedings of the 2020 IEEE Symposium on Security and Privacy, 2020

MTFuzz: fuzzing with a multi-task neural network.
Proceedings of the ESEC/FSE '20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2020

Patching as Translation: the Data and the Metaphor.
Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering, 2020

Testing DNN image classifiers for confusion & bias errors.
Proceedings of the ICSE '20: 42nd International Conference on Software Engineering, Companion Volume, Seoul, South Korea, 27 June, 2020

An Empirical Study on the Use and Misuse of Java 8 Streams.
Proceedings of the Fundamental Approaches to Software Engineering, 2020

Multitask Learning Strengthens Adversarial Robustness.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Transformer-based Approach for Source Code Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Bringing Engineering Rigor to Deep Learning.
ACM SIGOPS Oper. Syst. Rev., 2019

Cross-project code clones in GitHub.
Empir. Softw. Eng., 2019

Rebuttal to Berger et al., TOPLAS 2019.
CoRR, 2019

Unrestricted Adversarial Attacks for Semantic Segmentation.
CoRR, 2019

Neutaint: Efficient Dynamic Taint Analysis with Neural Networks.
CoRR, 2019

Testing Deep Neural Network based Image Classifiers.
CoRR, 2019

NEUZZ: Efficient Fuzzing with Neural Program Smoothing.
Proceedings of the 2019 IEEE Symposium on Security and Privacy, 2019

Toward Optimal Selection of Information Retrieval Models for Software Engineering Tasks.
Proceedings of the 19th International Working Conference on Source Code Analysis and Manipulation, 2019

Metric Learning for Adversarial Robustness.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Tree2Tree Neural Translation Model for Learning Source Code Changes.
CoRR, 2018

A Case Study on the Impact of Similarity Measure on Information Retrieval based Software Engineering Tasks.
CoRR, 2018

NEUZZ: Efficient Fuzzing with Neural Program Learning.
CoRR, 2018

Entropy Guided Spectrum Based Bug Localization Using Statistical Language Model.
CoRR, 2018

Recommending GitHub Projects for Developer Onboarding.
IEEE Access, 2018

Obfuscation resilient search through executable classification.
Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages, 2018

Evaluating how developers use general-purpose web-search for code retrieval.
Proceedings of the 15th International Conference on Mining Software Repositories, 2018

Replay without recording of production bugs for service oriented applications.
Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering, 2018

DeepTest: automated testing of deep-neural-network-driven autonomous cars.
Proceedings of the 40th International Conference on Software Engineering, 2018

Which similarity metric to use for software documents?: a study on information retrieval based software engineering tasks.
Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, 2018

A recommender system for developer onboarding.
Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, 2018

Searching for high-performing software configurations with metaheuristic algorithms.
Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, 2018

Building Language Models for Text with Named Entities.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Interpreted Formalisms for Configurations.
CoRR, 2017

A large-scale study of programming languages and code quality in GitHub.
Commun. ACM, 2017

Automatically diagnosing and repairing error handling bugs in C.
Proceedings of the 2017 11th Joint Meeting on Foundations of Software Engineering, 2017

Some from here, some from there: cross-project code reuse in GitHub.
Proceedings of the 14th International Conference on Mining Software Repositories, 2017

GitcProc: a tool for processing and classifying GitHub commits.
Proceedings of the 26th ACM SIGSOFT International Symposium on Software Testing and Analysis, Santa Barbara, CA, USA, July 10, 2017

2016
Automatically Detecting Error Handling Bugs Using Error Specifications.
Proceedings of the 25th USENIX Security Symposium, 2016

APEx: automated inference of error specifications for C APIs.
Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering, 2016

On the "naturalness" of buggy code.
Proceedings of the 38th International Conference on Software Engineering, 2016

A large ecosystem study to understand the effect of programming languages on code quality.
Proceedings of the Perspectives on Data Science for Software Engineering, 2016

2015
On the "Naturalness" of Buggy Code.
CoRR, 2015

The Uniqueness of Changes: Characteristics and Applications.
Proceedings of the 12th IEEE/ACM Working Conference on Mining Software Repositories, 2015

Assert Use in GitHub Projects.
Proceedings of the 37th IEEE/ACM International Conference on Software Engineering, 2015

Gender and Tenure Diversity in GitHub Teams.
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015

2014
Using Frankencerts for Automated Adversarial Testing of Certificate Validation in SSL/TLS Implementations.
Proceedings of the 2014 IEEE Symposium on Security and Privacy, 2014

2013
Detecting and characterizing semantic inconsistencies in ported code.
Proceedings of the 2013 28th IEEE/ACM International Conference on Automated Software Engineering, 2013

An Empirical Study of API Stability and Adoption in the Android Ecosystem.
Proceedings of the 2013 IEEE International Conference on Software Maintenance, 2013

2012
REPERTOIRE: a cross-system porting analysis tool for forked software projects.
Proceedings of the 20th ACM SIGSOFT Symposium on the Foundations of Software Engineering (FSE-20), 2012

A case study of cross-system porting in forked projects.
Proceedings of the 20th ACM SIGSOFT Symposium on the Foundations of Software Engineering (FSE-20), 2012

An empirical study of supplementary bug fixes.
Proceedings of the 9th IEEE Working Conference of Mining Software Repositories, 2012

2011
PTask: operating system abstractions to manage GPUs as compute devices.
Proceedings of the 23rd ACM Symposium on Operating Systems Principles 2011, 2011

2009
Touch Me Wear: Getting Physical with Social Networks.
Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009

2008
WhozThat? evolving an ecosystem for context-aware mobile social networks.
IEEE Netw., 2008

A Protocol for Building Secure and Reliable Covert Channel.
Proceedings of the Sixth Annual Conference on Privacy, Security and Trust, 2008

Secure and reliable covert channel.
Proceedings of the 4th annual workshop on Cyber security and information intelligence research, 2008


  Loading...