Tamara L. Berg

Orcid: 0000-0002-1272-3359

According to our database1, Tamara L. Berg authored at least 79 papers between 2004 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Revealing Single Frame Bias for Video-and-Language Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval.
CoRR, 2022

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval.
CoRR, 2022

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

End-to-End Visual Editing with a Generatively Pre-trained Artist.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries.
CoRR, 2021

Large-Scale Attribute-Object Compositions.
CoRR, 2021

VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Detecting Moments and Highlights in Videos via Natural Language Queries.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Connecting What To Say With Where To Look by Modeling Human Attention Traces.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Less Is More: ClipBERT for Video-and-Language Learning via Sparse Sampling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

mTVR: Multilingual Moment Retrieval in Videos.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
What is More Likely to Happen Next? Video-and-Language Future Event Prediction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval.
Proceedings of the Computer Vision - ECCV 2020, 2020

Attention-Based Query Expansion Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

TVQA+: Spatio-Temporal Grounding for Video Question Answering.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Combining Multiple Cues for Visual Madlibs Question Answering.
Int. J. Comput. Vis., 2019

Dance Dance Generation: Motion Transfer for Internet Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

IMP: Instance Mask Projection for High Accuracy Semantic Segmentation of Things.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multi-Target Embodied Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Physics-Inspired Garment Recovery from a Single-View Image.
ACM Trans. Graph., 2018

From image to language and back again.
Nat. Lang. Eng., 2018

Image2GIF: Generating Cinemagraphs Using Recurrent Deep Q-Networks.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

TVQA: Localized, Compositional Video Question Answering.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Visual to Sound: Generating Natural Sound for Videos in the Wild.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

MAttNet: Modular Attention Network for Referring Expression Comprehension.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
When Was That Made?
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Hierarchically-Attentive RNN for Album Summarization and Storytelling.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Large Scale Retrieval and Generation of Image Descriptions.
Int. J. Comput. Vis., 2016

Detailed Garment Recovery from a Single-View Image.
CoRR, 2016

Learning to name objects.
Commun. ACM, 2016

Combining multiple sources of knowledge in deep CNNs for action recognition.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Learning Temporal Transformations from Time-Lapse Videos.
Proceedings of the Computer Vision - ECCV 2016, 2016

Modeling Context in Referring Expressions.
Proceedings of the Computer Vision - ECCV 2016, 2016

Solving VIsual Madlibs with Multiple Cues.
Proceedings of the British Machine Vision Conference 2016, 2016

Auto-Illustrating Poems and Songs with Style.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
Retrieving Similar Styles to Parse Clothing.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Predicting Entry-Level Categories.
Int. J. Comput. Vis., 2015

Visual Madlibs: Fill in the blank Image Generation and Question Answering.
CoRR, 2015

Runway to Realway: Visual Analysis of Fashion.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Temporal Perception and Prediction in Ego-Centric Video.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Visual Madlibs: Fill in the Blank Description Generation and Question Answering.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Where to Buy It: Matching Street Clothing Photos in Online Shops.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Refer-to-as Relations as Semantic Knowledge.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
TREETALK: Composition and Compression of Trees for Image Descriptions.
Trans. Assoc. Comput. Linguistics, 2014

Materials discovery: Fine-grained classification of X-ray scattering images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Chic or Social: Visual Popularity Analysis in Online Fashion Networks.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

ReferItGame: Referring to Objects in Photographs of Natural Scenes.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Learning High-Level Judgments of Urban Perception.
Proceedings of the Computer Vision - ECCV 2014, 2014

Hipster Wars: Discovering Elements of Fashion Styles.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
BabyTalk: Understanding and Generating Simple Image Descriptions.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing Items.
Proceedings of the IEEE International Conference on Computer Vision, 2013

From Large Scale Image Categorization to Entry-Level Categories.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Studying Relationships between Human Gaze, Description, and Computer Vision.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Generalizing Image Captions for Image-Text Parallel Corpus.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Detecting Visual Text.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Midge: Generating Image Descriptions From Computer Vision Detections.
Proceedings of the EACL 2012, 2012

Two-person interaction detection using body-pose features and multiple instance learning.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

Parsing clothing in fashion photographs.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Understanding and predicting importance in images.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Collective Generation of Natural Image Descriptions.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Can Computers Master the Art of Communication?: A Focus on Visual Analytics.
IEEE Computer Graphics and Applications, 2011

Iconizer: A Framework to Identify and Create Effective Representations for Visual Information Encoding.
Proceedings of the Smart Graphics - 11th International Symposium, 2011

Im2Text: Describing Images Using 1 Million Captioned Photographs.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Who are you with and where are you going?
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Baby talk: Understanding and generating simple image descriptions.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

High level describable attributes for predicting aesthetics and interestingness.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

Composing Simple Image Descriptions using Web-scale N-grams.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 2011

2010
It's All About the Data.
Proc. IEEE, 2010

iWalk: a tool for interacting with geo-located data through movement and gesture.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Automatic Attribute Discovery and Characterization from Noisy Web Data.
Proceedings of the Computer Vision, 2010

2009
Finding iconic images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2009

2006
Animals on the Web.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006


2005
Shape Matching and Object Recognition Using Low Distortion Correspondences.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Whos In the Picture.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Names and Faces in the News.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), with CD-ROM, 27 June, 2004


  Loading...