Rameswar Panda

Orcid: 0000-0003-4359-2475

According to our database1, Rameswar Panda authored at least 107 papers between 2012 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
XPL: A Cross-Model framework for Semi-Supervised Prompt Learning in Vision-Language Models.
Trans. Mach. Learn. Res., 2024

Stick-breaking Attention.
CoRR, 2024

Calibrating Expressions of Certainty.
CoRR, 2024

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler.
CoRR, 2024

Scaling Granite Code Models to 128K Context.
CoRR, 2024

The infrastructure powering IBM's Gen AI model development.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks.
CoRR, 2024

Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts.
CoRR, 2024

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention.
CoRR, 2024

Granite Code Models: A Family of Open Foundation Models for Code Intelligence.
CoRR, 2024

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models.
CoRR, 2024

Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization.
CoRR, 2024

Scattered Mixture-of-Experts Implementation.
CoRR, 2024

API Pack: A Massive Multilingual Dataset for API Call Generation.
CoRR, 2024

Diversity Measurement and Subset Selection for Instruction Tuning Datasets.
CoRR, 2024

Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

LangNav: Language as a Perceptual Representation for Navigation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

SITAR: Semi-supervised Image Transformer for Action Recognition.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Gated Linear Attention Transformers with Hardware-Efficient Training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Data Engineering for Scaling Language Models to 128K Context.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

2023
Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models.
CoRR, 2023

Select, Label, and Mix: Learning Discriminative Invariant Feature Representations for Partial Domain Adaptation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Semi-Supervised Domain Adaptation with Auto-Encoder via Simultaneous Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Learning Human Action Recognition Representations Without Real Humans.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Energy Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Data Efficient Neural Scaling Law via Model Reusing.
Proceedings of the International Conference on Machine Learning, 2023

Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learning to Grow Pretrained Models for Efficient Transformer Training.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

AnyDA: Anytime Domain Adaptation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Going Beyond Nouns With Vision & Language Models Using Synthetic Data.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ConStruct-VL: Data-Free Continual Structured VL Concepts Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Teaching Structured Vision & Language Concepts to Vision & Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Synthetic Pre-Training Tasks for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
A Maximal Correlation Framework for Fair Machine Learning.
Entropy, 2022

Teaching Structured Vision&Language Concepts to Vision&Language Models.
CoRR, 2022

FETA: Towards Specializing Foundation Models for Expert Task Applications.
CoRR, 2022

How Transferable are Video Representations Based on Synthetic Data?
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

FETA: Towards Specializing Foundational Models for Expert Task Applications.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Selective Regression under Fairness Criteria.
Proceedings of the International Conference on Machine Learning, 2022

Can an Image Classifier Suffice For Action Recognition?
Proceedings of the Tenth International Conference on Learning Representations, 2022

RegionViT: Regional-to-Local Attention for Vision Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

A Maximal Correlation Approach to Imposing Fairness in Machine Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Task2Sim: Towards Effective Pre-training and Transfer from Synthetic Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VALHALLA: Visual Hallucination for Machine Translation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Person Re-identification with Limited Supervision
Synthesis Lectures on Computer Vision, Morgan & Claypool Publishers, ISBN: 978-3-031-01825-1, 2021

Exploiting Global Camera Network Constraints for Unsupervised Video Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2021

An Image Classifier Can Suffice For Video Understanding.
CoRR, 2021

IA-RED<sup>2</sup>: Interpretability-Aware Redundancy Reduction for Vision Transformers.
CoRR, 2021

All at Once Network Quantization via Collaborative Knowledge Transfer.
CoRR, 2021

VA-RED<sup>2</sup>: Video Adaptive Redundancy Reduction.
CoRR, 2021

Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

AVLnet: Learning Audio-Visual Language Representations from Instructional Videos.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Cascaded Multilingual Audio-Visual Learning from Videos.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Fair Selective Classification Via Sufficiency.
Proceedings of the 38th International Conference on Machine Learning, 2021

VA-RED2: Video Adaptive Redundancy Reduction.
Proceedings of the 9th International Conference on Learning Representations, 2021

AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition.
Proceedings of the 9th International Conference on Learning Representations, 2021

Dynamic Network Quantization for Efficient Video Inference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Broad Study on the Transferability of Visual Representations with Contrastive Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Detector-Free Weakly Supervised Grounding by Separation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Semi-Supervised Action Recognition With Temporal Contrastive Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Deep Analysis of CNN-Based Spatio-Temporal Representations for Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

NASTransfer: Analyzing Architecture Transferability in Large Scale Neural Architecture Search.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Construction of Diverse Image Datasets From Web Collections With Limited Labeling.
IEEE Trans. Circuits Syst. Video Technol., 2020

Large Scale Neural Architecture Search with Polyharmonic Splines.
CoRR, 2020

Measurement-driven Security Analysis of Imperceptible Impersonation Attacks.
CoRR, 2020

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Adversarial Knowledge Transfer from Unlabeled Data.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Fairness of Classifiers Across Skin Tones in Dermatology.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

Mitigating Dataset Imbalance via Joint Generation and Classification.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

AR-Net: Adaptive Frame Resolution for Efficient Action Recognition.
Proceedings of the Computer Vision - ECCV 2020, 2020

Relationship Matters: Relation Guided Knowledge Transfer for Incremental Learning of Object Detectors.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Non-Adversarial Video Synthesis with Learned Priors.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Camera On-Boarding for Person Re-Identification Using Hypothesis Transfer Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Adaptation of person re-identification models for on-boarding new camera(s).
Pattern Recognit., 2019

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning.
CoRR, 2019

Estimating Skin Tone and Effects on Classification Performance in Dermatology Datasets.
CoRR, 2019

Consistent Cross-view Matching for Unsupervised Person Re-identification.
CoRR, 2019

2018
Visual Learning with Weak Supervision: Applications in Video Summarization and Person Re-Identification.
PhD thesis, 2018

Nyström Approximated Temporally Constrained Multisimilarity Spectral Clustering Approach for Movie Scene Detection.
IEEE Trans. Cybern., 2018

Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Contemplating Visual Emotions: Understanding and Overcoming Dataset Bias.
Proceedings of the Computer Vision - ECCV 2018, 2018

FFNet: Video Fast-Forwarding via Reinforcement Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Multi-View Surveillance Video Summarization via Joint Embedding and Sparse Optimization.
IEEE Trans. Multim., 2017

Diversity-Aware Multi-Video Summarization.
IEEE Trans. Image Process., 2017

Continuous adaptation of multi-camera person identification models through sparse non-redundant representative selection.
Comput. Vis. Image Underst., 2017

Weakly Supervised Summarization of Web Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Sparse modeling for topic-oriented video summarization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Collaborative Summarization of Topic-Related Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Generating Diverse Image Datasets with Limited Labeling.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Video summarization in a multi-view camera network.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Embedded sparse coding for summarizing multi-view videos.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

2015
Active image pair selection for continuous person re-identification.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

2014
Scalable Video Summarization Using Skeleton Graph and Random Walk.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

2013
Video key frame extraction through dynamic Delaunay clustering with a structural constraint.
J. Vis. Commun. Image Represent., 2013

Video Key Frame Extraction through Canonical Correlation Analysis and Graph Modularity.
Proceedings of the Pattern Recognition and Machine Intelligence, 2013

A frequency domain approach to silhouette based gait recognition.
Proceedings of the Fourth National Conference on Computer Vision, 2013

2012
Video storyboard design using Delaunay graphs.
Proceedings of the 21st International Conference on Pattern Recognition, 2012


  Loading...