Rodrigo Nogueira
ORCID: 0000-0002-2600-6035
Affiliations:
- State University of Campinas (UNICAMP), Campinas, SP, Brazil
- University of Waterloo, ON, Canada (2020-2022)
- New York University (NYU), New York City, NY, USA (2014-2019, PhD)
According to our database, Rodrigo Nogueira authored at least 99 papers between 2015 and 2025.
Bibliography
2025
CoRR, January, 2025
The interplay between domain specialization and model size: a case study in the legal domain.
CoRR, January, 2025
2024
BERT models for Brazilian Portuguese: Pretraining, evaluation and tokenization analysis.
Appl. Soft Comput., December, 2024
CoRR, 2024
INACIA: Integrating Large Language Models in Brazilian Audit Courts: Opportunities and Challenges.
CoRR, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2024
ptt5-v2: A Closer Look at Continued Pretraining of T5 Models for the Portuguese Language.
Proceedings of the Intelligent Systems - 34th Brazilian Conference, 2024
Proceedings of the Intelligent Systems - 34th Brazilian Conference, 2024
SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section.
Proceedings of the Intelligent Systems - 34th Brazilian Conference, 2024
2023
CoRR, 2023
An experiment on an automated literature survey of data-driven speech enhancement methods.
CoRR, 2023
Predictive Authoring for Brazilian Portuguese Augmentative and Alternative Communication.
CoRR, 2023
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval.
CoRR, 2023
CoRR, 2023
InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval.
CoRR, 2023
Proceedings of the Thirty-Second Text REtrieval Conference Proceedings (TREC 2023), 2023
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Proceedings of the International Joint Conference on Neural Networks, 2023
An Augmentative and Alternative Communication Synthetic Corpus for Brazilian Portuguese.
Proceedings of the IEEE International Conference on Advanced Learning Technologies, 2023
Proceedings of the Advances in Information Retrieval, 2023
Proceedings of the Intelligent Systems - 12th Brazilian Conference, 2023
Sabiá: Portuguese Large Language Models.
Proceedings of the Intelligent Systems - 12th Brazilian Conference, 2023
Proceedings of the Intelligent Systems - 12th Brazilian Conference, 2023
2022
Induced Natural Language Rationales and Interleaved Markup Tokens Enable Extrapolation in Large Language Models.
CoRR, 2022
A Boring-yet-effective Approach for the Product Ranking Task of the Amazon KDD Cup 2022.
CoRR, 2022
No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval.
CoRR, 2022
Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task.
CoRR, 2022
CoRR, 2022
Proceedings of the Thirty-First Text REtrieval Conference, 2022
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval.
Proceedings of the Thirty-First Text REtrieval Conference, 2022
NeuralMind-UNICAMP at 2022 TREC NeuCLIR: Large Boring Rerankers for Cross-lingual Retrieval.
Proceedings of the Thirty-First Text REtrieval Conference, 2022
Document Expansion Baselines and Learned Sparse Lexical Representations for MS MARCO V1 and V2.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
NeuralSearchX: Serving a Multi-billion-parameter Reranker for Multilingual Metasearch at a Low Cost.
Proceedings of the Third International Conference on Design of Experimental Search & Information REtrieval Systems, 2022
Sequence-to-Sequence Models for Extracting Information from Registration and Legal Documents.
Proceedings of the Document Analysis Systems - 15th IAPR International Workshop, 2022
Proceedings of the 29th International Conference on Computational Linguistics, 2022
2021
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02181-7, 2021
Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting.
ACM Trans. Inf. Syst., 2021
CoRR, 2021
CoRR, 2021
Pyserini: An Easy-to-Use Python Toolkit to Support Replicable IR Research with Sparse and Dense Representations.
CoRR, 2021
The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models.
CoRR, 2021
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021
Vera: Prediction Techniques for Reducing Harmful Misinformation in Consumer Health Search.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021
Pyserini: A Python Toolkit for Reproducible Information Retrieval Research with Sparse and Dense Representations.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021
Proceedings of the ICAIL '21: Eighteenth International Conference for Artificial Intelligence and Law, São Paulo, Brazil, June 21, 2021
Proceedings of the 12th International Workshop on Health Text Mining and Information Analysis, 2021
2020
Proceedings of the Precision Health and Medicine - A Digital Revolution in Healthcare, 2020
Navigation-based candidate expansion and pretrained language models for citation recommendation.
Scientometrics, 2020
Can questions summarize a corpus? Using question generation for characterizing COVID-19 research.
CoRR, 2020
CoRR, 2020
Query Reformulation using Query History for Passage Retrieval in Conversational Search.
CoRR, 2020
Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned.
CoRR, 2020
Conversational Question Reformulation via Sequence-to-Sequence Architectures and Pretrained Language Models.
CoRR, 2020
Proceedings of the Fifth Conference on Machine Translation, 2020
H2oloo at TREC 2020: When all you got is a hammer... Deep Learning, Health Misinformation, and Precision Medicine.
Proceedings of the Twenty-Ninth Text REtrieval Conference, 2020
Covidex: Neural Ranking Models and Keyword Search Infrastructure for the COVID-19 Open Research Dataset.
Proceedings of the First Workshop on Scholarly Document Processing, 2020
Proceedings of SustaiNLP: Workshop on Simple and Efficient Natural Language Processing, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Designing Templates for Eliciting Commonsense Knowledge from Pretrained Sequence-to-Sequence Models.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Proceedings of the Intelligent Systems - 9th Brazilian Conference, 2020
Proceedings of the 10th International Workshop on Bibliometric-enhanced Information Retrieval co-located with 42nd European Conference on Information Retrieval, 2020
2019
Proceedings of the Deep Reinforcement Learning Meets Structured Prediction, 2019
2018
Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation.
CoRR, 2018
Proceedings of the Twenty-Seventh Text REtrieval Conference, 2018
2017
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
2016
IEEE Trans. Inf. Forensics Secur., 2016
WebNav: A New Large-Scale Task for Natural Language based Sequential Decision Making.
CoRR, 2016
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016
2015
Evaluating software-based fingerprint liveness detection using Convolutional Networks and Local Binary Patterns.
CoRR, 2015