2024
Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization.
Trans. Assoc. Comput. Linguistics, 2024
Predicting and analyzing the popularity of false rumors in Weibo.
Expert Syst. Appl., 2024
How Private are Language Models in Abstractive Summarization?
CoRR, 2024
Vocabulary Expansion of Chat Models with Unlabeled Target Language Data.
CoRR, 2024
Self-calibration for Language Model Quantization and Pruning.
CoRR, 2024
Vocabulary Expansion for Low-resource Cross-lingual Transfer.
CoRR, 2024
An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Generative LLM Inference.
CoRR, 2024
Comparing Explanation Faithfulness between Multilingual and Monolingual Fine-tuned Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
An Empirical Study on Cross-lingual Vocabulary Adaptation for Efficient Language Model Inference.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024
RISE: Robust Early-exiting Internal Classifiers for Suicide Risk Evaluation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Who Is Bragging More Online? A Large Scale Analysis of Bragging in Social Media.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
On the Impact of Calibration Data in Post-training Quantization and Pruning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Bayesian Prompt Ensembles: Model Uncertainty Estimation for Black-Box Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
How Does Calibration Data Affect the Post-training Pruning and Quantization of Large Language Models?
CoRR, 2023
Lighter, yet More Faithful: Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization.
CoRR, 2023
Frustratingly Simple Memory Efficiency for Pre-trained Language Models via Dynamic Embedding Pruning.
CoRR, 2023
Self-training through Classifier Disagreement for Cross-Domain Opinion Target Extraction.
Proceedings of the ACM Web Conference 2023, 2023
A Multimodal Analysis of Influencer Content on Twitter.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023
We Need to Talk About Classification Evaluation Metrics in NLP.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023
Robust Weak Supervision with Variational Auto-Encoders.
Proceedings of the International Conference on Machine Learning, 2023
Pit One Against Many: Leveraging Attention-head Embeddings for Parameter-efficient Multi-head Attention.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Active Learning Principles for In-Context Learning with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Regulation and NLP (RegNLP): Taming Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Understanding the Role of Input Token Characters in Language Models: How Does Information Loss Affect Performance?
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
It's about Time: Rethinking Evaluation on Rumor Detection Benchmarks using Chronological Splits.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023
Incorporating Attribution Importance for Improving Faithfulness Metrics.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Rethinking Semi-supervised Learning with Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Trading Syntax Trees for Wordpieces: Target-oriented Opinion Words Extraction with Wordpieces and Aspect Enhancement.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
On the Limitations of Simulating Active Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Schema-Guided User Satisfaction Modeling for Task-Oriented Dialogues.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Node-Feature Convolution for Graph Convolutional Networks.
Pattern Recognit., 2022
Identifying and Characterizing Active Citizens who Refute Misinformation in Social Media.
Proceedings of the WebSci '22: 14th ACM Web Science Conference 2022, Barcelona, Spain, June 26, 2022
Towards Suicide Ideation Detection Through Online Conversational Context.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
Combining Humor and Sarcasm for Improving Political Parody Detection.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Improving Graph-Based Text Representations with Character and Word Level N-grams.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022
On the Impact of Temporal Concept Drift on Model Explanations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
HashFormers: Towards Vocabulary-independent Pre-trained Transformers.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
A Hierarchical N-Gram Framework for Zero-Shot Link Prediction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
On the Importance of Effectively Adapting Pretrained Language Models for Active Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022
Automatic Identification and Classification of Bragging in Social Media.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Translation Error Detection as Rationale Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
An Empirical Study on Explanations in Out-of-Domain Settings.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
Dynamically Refined Regularization for Improving Cross-corpora Hate Speech Detection.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
How does the pre-training objective affect what large language models learn about linguistic properties?
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022
Flexible Instance-Specific Rationalization of NLP Models.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Bayesian Active Learning with Pretrained Language Models.
CoRR, 2021
Variable Instance-Level Explainability for Text Classification.
CoRR, 2021
Modeling the Severity of Complaints in Social Media.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Paragraph-level Rationale Extraction through Regularization: A case study on European Court of Human Rights Cases.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Frustratingly Simple Pretraining Alternatives to Masked Language Modeling.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Point-of-Interest Type Prediction using Text and Images.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
An Empirical Study on Leveraging Position Embeddings for Target-oriented Opinion Words Extraction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Active Learning by Acquiring Contrastive Examples.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Enjoy the Salience: Towards Better Transformer-based Faithful Explanations with Word Salience.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Analyzing Online Political Advertisements.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
In Factuality: Efficient Integration of Relevant Facts for Visual Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
On the Ethical Limits of Natural Language Processing on Legal Text.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Knowledge Distillation for Quality Estimation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Unsupervised Quality Estimation for Neural Machine Translation.
Trans. Assoc. Comput. Linguistics, 2020
Identifying Twitter users who repost unreliable news sources with linguistic information.
PeerJ Comput. Sci., 2020
LEGAL-BERT: The Muppets straight out of Law School.
CoRR, 2020
Automatic Generation of Topic Labels.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020
Point-of-Interest Type Inference from Social Media Text.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020
LEGAL-BERT: "Preparing the Muppets for Court'".
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Quality In, Quality Out: Learning from Actual Mistakes.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020
Complaint Identification in Social Media with Transformer Networks.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
LegalOps: A Summarization Corpus of Legal Opinions.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020
Analyzing Political Parody in Social Media.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Extreme Multi-Label Legal Text Classification: A case study in EU Legislation.
CoRR, 2019
Re-Ranking Words to Improve Interpretability of Automatically Generated Topics.
Proceedings of the 13th International Conference on Computational Semantics, 2019
Journalist-in-the-Loop: Continuous Learning as a Service for Rumour Analysis.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Automatically Identifying Complaints in Social Media.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Neural Legal Judgment Prediction in English.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
Predicting Twitter User Socioeconomic Attributes with Network and Language Information.
Proceedings of the 29th on Hypertext and Social Media, 2018
Nowcasting the Stance of Social Media Users in a Sudden Vote: The Case of the Greek Referendum.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018
2017
Evaluating topic representations for exploring document collections.
J. Assoc. Inf. Sci. Technol., 2017
Labeling Topics with Images Using a Neural Network.
Proceedings of the Advances in Information Retrieval, 2017
Multimodal Topic Labelling.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017
2016
Predicting judicial decisions of the European Court of Human Rights: a Natural Language Processing perspective.
PeerJ Comput. Sci., 2016
Why are these similar? Investigating item similarity types in a large digital library.
J. Assoc. Inf. Sci. Technol., 2016
Labeling Topics with Images using Neural Networks.
CoRR, 2016
Inferring the Socioeconomic Status of Social Media Users Based on Behaviour and Language.
Proceedings of the Advances in Information Retrieval, 2016
2015
A Hybrid Distributional and Knowledge-based Model of Lexical Semantics.
Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, 2015
TM 2015 - Topic Models: Post-Processing and Applications Workshop.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015
An analysis of the user occupational class through Twitter content.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
2014
Interpreting document collections with topic models.
PhD thesis, 2014
Representing topics labels for exploring digital libraries.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014
Predicting and Characterising User Impact on Twitter.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014
Measuring the Similarity between Automatically Generated Topics.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014
Labelling Topics using Unsupervised Graph-based Methods.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
2013
UBC_UOS-TYPED: Regression for typed-similarity.
Proceedings of the Second Joint Conference on Lexical and Computational Semantics, 2013
Representing Topics Using Images.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013
Evaluating Topic Coherence Using Distributional Semantics.
Proceedings of the 10th International Conference on Computational Semantics, 2013
PATHS: A System for Accessing Cultural Heritage Collections.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
2012
Computing similarity between items in a digital library of cultural heritage.
ACM Journal on Computing and Cultural Heritage, 2012
Computing Similarity between Cultural Heritage Items using Multimodal Features.
Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, 2012
User-Centred Design to Support Exploration and Path Creation in Cultural Heritage Collections.
Proceedings of the 2nd European Workshop on Human-Computer Interaction and Information Retrieval, 2012
PATHS - Exploring Digital Cultural Heritage Spaces.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Theory and Practice of Digital Libraries, 2012