Rameswar Panda

Aadarsh Sahoo

Trans. Mach. Learn. Res., 2024

Stick-breaking Attention.

CoRR, 2024

Calibrating Expressions of Certainty.

CoRR, 2024

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler.

CoRR, 2024

Scaling Granite Code Models to 128K Context.

CoRR, 2024

The infrastructure powering IBM's Gen AI model development.

Constantinos Evangelinos

Bengi Karacali-Akyamac

Sophia Wen

Tatsuhiro Chiba

Sunyanan Choochotkaew

Swaminathan Sundararaman

CoRR, 2024

Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks.

CoRR, 2024

Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts.

CoRR, 2024

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention.

Jonathan Ragan-Kelley

CoRR, 2024

Granite Code Models: A Family of Open Foundation Models for Code Intelligence.

CoRR, 2024

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models.

CoRR, 2024

Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization.

CoRR, 2024

Scattered Mixture-of-Experts Implementation.

CoRR, 2024

API Pack: A Massive Multilingual Dataset for API Call Generation.

CoRR, 2024

Diversity Measurement and Subset Selection for Instruction Tuning Datasets.

CoRR, 2024

Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

LangNav: Language as a Perceptual Representation for Navigation.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

SITAR: Semi-supervised Image Transformer for Action Recognition.

Owais Iqbal

Aftab Hussain

Bishwaranjan Bhattacharjee

Proceedings of the Pattern Recognition - 27th International Conference, 2024

Gated Linear Attention Transformers with Hardware-Efficient Training.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Data Engineering for Scaling Language Models to 128K Context.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

2023

Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models.

CoRR, 2023

Select, Label, and Mix: Learning Discriminative Invariant Feature Representations for Partial Domain Adaptation.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Semi-Supervised Domain Adaptation with Auto-Encoder via Simultaneous Learning.

Md Mahmudur Rahman

Mohammad Arif Ul Alam

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Learning Human Action Recognition Representations Without Real Humans.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Energy Transformer.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Data Efficient Neural Scaling Law via Model Reusing.

Peihao Wang

Zhangyang Wang

Proceedings of the International Conference on Machine Learning, 2023

Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning to Grow Pretrained Models for Efficient Transformer Training.

Peihao Wang

Lucas Torroba Hennigen

Proceedings of the Eleventh International Conference on Learning Representations, 2023

AnyDA: Anytime Domain Adaptation.

Aadarsh Sahoo

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Going Beyond Nouns With Vision & Language Models Using Synthetic Data.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning.

James Seale Smith

Leonid Karlinsky

Vyshnavi Gutta

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ConStruct-VL: Data-Free Continual Structured VL Concepts Learning.

James Seale Smith

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Teaching Structured Vision & Language Concepts to Vision & Language Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Synthetic Pre-Training Tasks for Neural Machine Translation.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

A Maximal Correlation Framework for Fair Machine Learning.

Entropy, 2022

Teaching Structured Vision&Language Concepts to Vision&Language Models.

CoRR, 2022

FETA: Towards Specializing Foundation Models for Expert Task Applications.

CoRR, 2022

How Transferable are Video Representations Based on Synthetic Data?

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

FETA: Towards Specializing Foundational Models for Expert Task Applications.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Selective Regression under Fairness Criteria.

Proceedings of the International Conference on Machine Learning, 2022

Can an Image Classifier Suffice For Action Recognition?

Chun-Fu Chen

Proceedings of the Tenth International Conference on Learning Representations, 2022

RegionViT: Regional-to-Local Attention for Vision Transformers.

Chun-Fu Chen

Proceedings of the Tenth International Conference on Learning Representations, 2022

A Maximal Correlation Approach to Imposing Fairness in Machine Learning.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VALHALLA: Visual Hallucination for Machine Translation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Person Re-identification with Limited Supervision.

Synthesis Lectures on Computer Vision, Morgan & Claypool Publishers, ISBN: 978-3-031-01825-1, 2021

Exploiting Global Camera Network Constraints for Unsupervised Video Person Re-Identification.

IEEE Trans. Circuits Syst. Video Technol., 2021

An Image Classifier Can Suffice For Video Understanding.

Chun-Fu Chen

CoRR, 2021

IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers.

CoRR, 2021

All at Once Network Quantization via Collaborative Knowledge Transfer.

Kailash Gopalakrishnan

Aude Oliva

Rogério Feris

Kate Saenko

CoRR, 2021

VA-RED$^2$: Video Adaptive Redundancy Reduction.

CoRR, 2021

Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data.

Ashraful Islam

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Cascaded Multilingual Audio-Visual Learning from Videos.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Fair Selective Classification Via Sufficiency.

Proceedings of the 38th International Conference on Machine Learning, 2021

VA-RED$^2$: Video Adaptive Redundancy Reduction.

Proceedings of the 9th International Conference on Learning Representations, 2021

AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition.

Proceedings of the 9th International Conference on Learning Representations, 2021

Dynamic Network Quantization for Efficient Video Inference.

Ximeng Sun

Aude Oliva

Rogério Feris

Kate Saenko

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Broad Study on the Transferability of Visual Representations with Contrastive Learning.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Detector-Free Weakly Supervised Grounding by Separation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Semi-Supervised Action Recognition With Temporal Contrastive Learning.

Ankit Singh

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Deep Analysis of CNN-Based Spatio-Temporal Representations for Action Recognition.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search.

Bishwaranjan Bhattacharjee

Minsik Cho

Rogério Feris

David S. Kung

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Construction of Diverse Image Datasets From Web Collections With Limited Labeling.

Bishwaranjan Bhattacharjee

IEEE Trans. Circuits Syst. Video Technol., 2020

Large Scale Neural Architecture Search with Polyharmonic Splines.

CoRR, 2020

Measurement-driven Security Analysis of Imperceptible Impersonation Attacks.

Srikanth V. Krishnamurthy

Ananthram Swami

CoRR, 2020

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Adversarial Knowledge Transfer from Unlabeled Data.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Fairness of Classifiers Across Skin Tones in Dermatology.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Mitigating Dataset Imbalance via Joint Generation and Classification.

Aadarsh Sahoo

Ankit Singh

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

AR-Net: Adaptive Frame Resolution for Efficient Action Recognition.

Proceedings of the Computer Vision - ECCV 2020, 2020

Relationship Matters: Relation Guided Knowledge Transfer for Incremental Learning of Object Detectors.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Non-Adversarial Video Synthesis with Learned Priors.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Camera On-Boarding for Person Re-Identification Using Hypothesis Transfer Learning.

Sk Miraj Ahmed

Aske R. Lejbølle

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Adaptation of person re-identification models for on-boarding new camera(s).

Amran Bhuiyan

Vittorio Murino

Pattern Recognit., 2019

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning.

Ximeng Sun

CoRR, 2019

Estimating Skin Tone and Effects on Classification Performance in Dermatology Datasets.

CoRR, 2019

Consistent Cross-view Matching for Unsupervised Person Re-identification.

Xueping Wang

Min Liu

CoRR, 2019

2018

Visual Learning with Weak Supervision: Applications in Video Summarization and Person Re-Identification.

PhD thesis, 2018

Nyström Approximated Temporally Constrained Multisimilarity Spectral Clustering Approach for Movie Scene Detection.

IEEE Trans. Cybern., 2018

Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval.

Evangelos E. Papalexakis

Proceedings of the 2018 ACM Multimedia Conference, 2018

Contemplating Visual Emotions: Understanding and Overcoming Dataset Bias.

Proceedings of the Computer Vision - ECCV 2018, 2018

FFNet: Video Fast-Forwarding via Reinforcement Learning.

Shuyue Lan

Qi Zhu

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Multi-View Surveillance Video Summarization via Joint Embedding and Sparse Optimization.

IEEE Trans. Multim., 2017

Diversity-Aware Multi-Video Summarization.

IEEE Trans. Image Process., 2017

Continuous adaptation of multi-camera person identification models through sparse non-redundant representative selection.

Comput. Vis. Image Underst., 2017

Weakly Supervised Summarization of Web Videos.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Sparse modeling for topic-oriented video summarization.

Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 2017

Collaborative Summarization of Topic-Related Videos.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks.

Amran Bhuiyan

Vittorio Murino

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Generating Diverse Image Datasets with Limited Labeling.

Proceedings of the 2016 ACM Conference on Multimedia, 2016

Video summarization in a multi-view camera network.

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Embedded sparse coding for summarizing multi-view videos.

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

2015

Active image pair selection for continuous person re-identification.

Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

2014

Scalable Video Summarization Using Skeleton Graph and Random Walk.

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

2013

Video key frame extraction through dynamic Delaunay clustering with a structural constraint.

J. Vis. Commun. Image Represent., 2013

Video Key Frame Extraction through Canonical Correlation Analysis and Graph Modularity.
