Trung Bui

Orcid: 0000-0002-0871-349X

According to our database1, Trung Bui authored at least 101 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models.
CoRR, 2024

VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs.
CoRR, 2024

Retrieval Augmented Generation for Domain-specific Question Answering.
CoRR, 2024

Pre-trained Vision-Language Models Learn Discoverable Visual Concepts.
CoRR, 2024

PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck.
CoRR, 2024

Fine-tuning CLIP Text Encoders with Two-step Paraphrasing.
CoRR, 2024

Multilingual Meta-Distillation Alignment for Semantic Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Multi-modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

LRM: Large Reconstruction Model for Single Image to 3D.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Fine-tuning CLIP Text Encoders with Two-step Paraphrasing.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Scaling Up Video Summarization Pretraining with Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Multilingual Sentence-Level Semantic Search using Meta-Distillation Learning.
CoRR, 2023

Boosting Punctuation Restoration with Data Generation and Reinforcement Learning.
CoRR, 2023

LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

PaperToPlace: Transforming Instruction Documents into Spatialized and Context-Aware Mixed Reality Experiences.
Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, 2023

Boosting Punctuation Restoration with Data Generation and Reinforcement Learning.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning Navigational Visual Representations with Semantic Map Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Moment Detection in Long Tutorial Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

PR-MCS: Perturbation Robust Metric for MultiLingual Image Captioning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

PiC: A Phrase-in-Context Dataset for Phrase Understanding and Semantic Search.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Align and Attend: Multimodal Summarization with Dual Contrastive Losses.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Aspect-based Meeting Transcript Summarization: A Two-Stage Approach with Weak Supervision on Sentence Classification.
Proceedings of the IEEE International Conference on Big Data, 2023

SCCS: Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

MeetingQA: Extractive Question-Answering on Meeting Transcripts.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment.
CoRR, 2022

Factual Error Correction for Abstractive Summaries Using Entity Retrieval.
CoRR, 2022

MHMS: Multimodal Hierarchical Multimedia Summarization.
CoRR, 2022

Multimodal Intent Discovery from Livestream Videos.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Fine-grained Image Captioning with CLIP Reward.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Double Trouble: How to not Explain a Text Classifier's Decisions Using Counterfactuals Synthesized by Masked Language Models?
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

End-To-End Neural Coreference Resolution Revisited: A Simple Yet Effective Baseline.
Proceedings of the IEEE International Conference on Acoustics, 2022

Keyphrase Prediction from Video Transcripts: New Dataset and Directions.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Virtual Knowledge Graph Construction for Zero-Shot Domain-Specific Document Retrieval.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Offensive Content Detection via Synthetic Code-Switched Text.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Medical Question Understanding and Answering with Knowledge Grounding and Semantic Self-Supervision.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

gScoreCAM: What Objects Is CLIP Looking At?
Proceedings of the Computer Vision - ACCV 2022, 2022

A Framework for Automated Text Generation Benchmarking.
Proceedings of the Workshop on Scientific Document Understanding co-located with 36th AAAI Conference on Artificial Inteligence, 2022

CAISE: Conversational Agent for Image Search and Editing.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Open-Domain Question Answering with Pre-Constructed Question Spaces.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, 2021

X-METRA-ADA: Cross-lingual Meta-Transfer learning Adaptation to Natural Language Understanding and Question Answering.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

KPQA: A Metric for Generative Question Answering Using Keyphrase Weights.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

A Context-Dependent Gated Module for Incorporating Symbolic Semantics into Event Coreference Resolution.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

StreamHover: Livestream Transcript Summarization and Annotation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Learning by Planning: Language-Guided Global Image Editing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

UCSD-Adobe at MEDIQA 2021: Transfer Learning and Answer Sentence Selection for Medical Summarization.
Proceedings of the 20th Workshop on Biomedical Language Processing, 2021

Out of Order: How important is the sequential order of words in a sentence in Natural Language Understanding tasks?
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

A Gradually Soft Multi-Task and Data-Augmented Approach to Medical Question Understanding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

UMIC: An Unreferenced Metric for Image Captioning via Contrastive Learning.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Bayesian Optimization for Selecting Efficient Machine Learning Models.
CoRR, 2020

Efficient Deployment of Conversational Natural Language Interfaces over Databases.
CoRR, 2020

KPQA: A Metric for Generative Question Answering Using Word Weights.
CoRR, 2020

DSTC8-AVSD: Multimodal Semantic Transformer Network with Retrieval Style Word Generator.
CoRR, 2020

A Multimodal Dialogue System for Conversational Image Editing.
CoRR, 2020

Variational Hierarchical Dialog Autoencoder for Dialogue State Tracking Data Augmentation.
CoRR, 2020

Propagate-Selector: Detecting Supporting Sentences for Question Answering via Graph Neural Networks.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Adjusting Image Attributes of Localized Regions with Low-level Dialogue.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

AutoNLU: An On-demand Cloud-based Natural Language Understanding System for Enterprises.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing: System Demonstrations, 2020

ISA: An Intelligent Shopping Assistant.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing: System Demonstrations, 2020

A Simple But Effective Bert Model for Dialog State Tracking on Resource-Limited Systems.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ViLBERTScore: Evaluating Image Caption Using Vision-and-Language BERT.
Proceedings of the First Workshop on Evaluation and Comparison of NLP Systems, 2020

Variational Hierarchical Dialog Autoencoder for Dialog State Tracking Data Augmentation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Rethinking Self-Attention: Towards Interpretability in Neural Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Scene Graph Modification Based on Natural Language Commands.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

PhraseCut: Language-Based Image Segmentation in the Wild.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Joint Learning Approach based on Self-Distillation for Keyphrase Extraction from Scientific Documents.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

History for Visual Dialog: Do we really need it?
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Benchmark and Baseline for Language-Driven Image Editing.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Rethinking Self-Attention: An Interpretable Self-Attentive Encoder-Decoder Parser.
CoRR, 2019

Exploiting Semi-Supervised Training Through a Dropout Regularization in End-to-End Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Markov Network Model for Natural Language Semantic Matching.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

Dance Dance Generation: Motion Transfer for Internet Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

A Gated Self-attention Memory Network for Answer Selection.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Compare-Aggregate Model with Latent Clustering for Answer Selection.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Expressing Visual Relationships via Language.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Towards a Service-Oriented Architecture for Knowledge Management in Big Data Era.
Int. J. Intell. Inf. Technol., 2018

A System for Automated Image Editing from Natural Language Commands.
CoRR, 2018

Conversational Image Editing: Incremental Intent Identification in a New Dialogue Task.
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018

The Context-Dependent Additive Recurrent Neural Net.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Edit me: A Corpus and a Framework for Understanding Natural Language Image Editing.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

PhotoshopQuiA: A Corpus of Non-Factoid Questions and Answers for Why-Question Answering.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A Framework for Speech Recognition Benchmarking.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Supervised Transfer Learning for Product Information Question Answering.
Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018

Visually Indicated Sound Generation by Perceptually Optimized Classification.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Visual to Sound: Generating Natural Sound for Videos in the Wild.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

A Review on Deep Learning Techniques Applied to Answer Selection.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

A Simple End-to-End Question Answering Model for Product Information.
Proceedings of the First Workshop on Economics and Natural Language Processing, 2018

2017
AMC: Attention guided Multi-modal Correlation Learning for Image Search.
CoRR, 2017

AMC: Attention Guided Multi-modal Correlation Learning for Image Search.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Adobe-MIT submission to the DSTC 4 Spoken Language Understanding pilot task.
CoRR, 2016

Proposing Plausible Answers for Open-ended Visual Question Answering.
CoRR, 2016

Robust Dialog State Tracking for Large Ontologies.
Proceedings of the Dialogues with Social Robots, 2016

Practical Linear Models for Large-Scale One-Class Collaborative Filtering.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

A Service-Oriented Framework for Big Data-Driven Knowledge Management Systems.
Proceedings of the Exploring Services Science - 7th International Conference, 2016

Towards an Architecture for Big Data-Driven Knowledge Management Systems.
Proceedings of the 22nd Americas Conference on Information Systems, 2016


  Loading...