Yuval Pinter

Orcid: 0000-0003-3174-1621

According to our database1, Yuval Pinter authored at least 46 papers between 2014 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
PragFormer: Data-Driven Parallel Source Code Classification with Transformers.
Int. J. Parallel Program., February, 2025

2024
Don't Touch My Diacritics.
CoRR, 2024

OMPar: Automatic Parallelization with AI-Driven Source-to-Source Compilation.
CoRR, 2024

Protecting Privacy in Classifiers by Token Manipulation.
CoRR, 2024

Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge.
CoRR, 2024

An Analysis of BPE Vocabulary Trimming in Neural Machine Translation.
CoRR, 2024

Greed is All You Need: An Evaluation of Tokenizer Inference Methods.
CoRR, 2024

MPIrigen: MPI Code Generation through Domain-Specific Language Models.
CoRR, 2024

Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language Technologies.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Tokenization Is More Than Compression.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

BiVert: Bidirectional Vocabulary Evaluation Using Relations for Machine Translation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Domain-Specific Code Language Models: Unraveling the Potential for HPC Codes and Tasks.
CoRR, 2023

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark.
CoRR, 2023

Scope is all you need: Transforming LLMs for HPC Code.
CoRR, 2023

The BGU-MeLeL System for the SIGMORPHON 2023 Shared Task on Morphological Inflection.
Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, 2023

MPI-RICAL: Data-Driven MPI Distributed Parallelism Assistance with Transformers.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Learning to Parallelize in a Shared-Memory Environment with Transformers.
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

Advising OpenMP Parallelization via A Graph-Based Approach with Transformers.
Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023

Quantifying OpenMP: Statistical Insights into Usage and Adoption.
Proceedings of the IEEE High Performance Extreme Computing Conference, 2023

Emptying the Ocean with a Spoon: Should We Edit Models?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Analyzing Cognitive Plausibility of Subword Tokenization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Incorporating Context into Subword Vocabularies.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022
Lost in Space Marking.
CoRR, 2022

CIAug: Equipping Interpolative Augmentation with Curriculum Learning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Restoring Hebrew Diacritics Without a Dictionary.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

UniMorph 4.0: Universal Morphology.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021
Integrating Approaches to Word Representation.
CoRR, 2021

Learning to Look Inside: Augmenting Token-Based Encoders with Character-Level Information.
CoRR, 2021

2020

Will it Unblend?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

NYTWIT: A Dataset of Novel Words in the New York Times.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Learning to Faithfully Rationalize by Construction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Attending Form and Context to Generate Specialized Out-of-VocabularyWords Representations.
CoRR, 2019

Attention is not not Explanation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Character Eyes: Seeing Language through Character-Level Taggers.
Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2019

2018
Si O No, Que Penses? Catalonian Independence and Linguistic Identity on Social Media.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Predicting Semantic Relations using Global Graph Properties.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Overview of the Medical Question Answering Task at TREC 2017 LiveQA.
Proceedings of The Twenty-Sixth Text REtrieval Conference, 2017

Mimicking Word Embeddings using Subword RNNs.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
The Yahoo Query Treebank, V. 1.0.
CoRR, 2016

Identifying Web Queries with Question Intent.
Proceedings of the 25th International Conference on World Wide Web, 2016

Overview of the TREC 2016 LiveQA Track.
Proceedings of The Twenty-Fifth Text REtrieval Conference, 2016

Syntactic Parsing of Web Queries with Question Intent.
Proceedings of the NAACL HLT 2016, 2016

2015
Overview of the TREC 2015 LiveQA Track.
Proceedings of The Twenty-Fourth Text REtrieval Conference, 2015

2014
Improving Term Weighting for Community Question Answering Search Using Syntactic Analysis.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014


  Loading...