Daya Guo

According to our database, Daya Guo authored at least 38 papers between 2018 and 2024.

Bibliography

2024
Contextualized Data-Wrangling Code Generation in Computational Notebooks.
CoRR, 2024

RLCoder: Reinforcement Learning for Repository-Level Code Completion.
CoRR, 2024

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence.
CoRR, 2024

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data.
CoRR, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model.
CoRR, 2024

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models.
CoRR, 2024

DeepSeek-Coder: When the Large Language Model Meets Programming - The Rise of Code Intelligence.
CoRR, 2024

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism.
CoRR, 2024

SparseCoder: Identifier-Aware Sparse Transformer for File-Level Code Summarization.
Proceedings of the IEEE International Conference on Software Analysis, 2024

2023
LongCoder: A Long-Range Pre-trained Language Model for Code Completion.
Proceedings of the International Conference on Machine Learning, 2023

Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Noisy Pair Corrector for Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
CodeReviewer: Pre-Training for Automating Code Review Activities.
CoRR, 2022

Automating code review activities by large-scale pre-training.
Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2022

Analytical Reasoning of Text.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Learning to Complete Code with Sketches.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Soft-Labeled Contrastive Pre-Training for Function-Level Code Representation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

ReACC: A Retrieval-Augmented Code Completion Framework.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

UniXcoder: Unified Cross-Modal Pre-training for Code Representation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Learning to Generate Code Sketches.
CoRR, 2021

AR-LSAT: Investigating Analytical Reasoning of Text.
CoRR, 2021

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Multi-modal Representation Learning for Video Advertisement Content Structuring.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

GraphCodeBERT: Pre-training Code Representations with Data Flow.
Proceedings of the 9th International Conference on Learning Representations, 2021

Syntax-Enhanced Pre-trained Model.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
CodeBLEU: a Method for Automatic Evaluation of Code Synthesis.
CoRR, 2020

Pre-training Text Representations as Meta Learning.
CoRR, 2020

Inferential Text Generation with Multiple Knowledge Sources and Meta-Learning.
CoRR, 2020

CodeBERT: A Pre-Trained Model for Programming and Natural Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Graph-Based Reasoning over Heterogeneous External Knowledge for Commonsense Question Answering.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-modal Representation Learning for Short Video Understanding and Recommendation.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Coupling Retrieval and Meta-Learning for Context-Dependent Semantic Parsing.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Knowledge Based Machine Reading Comprehension.
CoRR, 2018

Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Question Generation from SQL Queries Improves Neural Semantic Parsing.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

