Raghuraman Krishnamoorthi

According to our database1, Raghuraman Krishnamoorthi authored at least 33 papers between 2007 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization.
CoRR, February, 2025

EdgeTAM: On-Device Track Anything Model.
CoRR, January, 2025

2024
Efficient Track Anything.
CoRR, 2024

Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations.
CoRR, 2024

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding.
CoRR, 2024

Agent-as-a-Judge: Evaluate Agents with Agents.
CoRR, 2024

SpinQuant: LLM quantization with learned rotations.
CoRR, 2024

Communication Efficient Distributed Training with Distributed Lion.
CoRR, 2024

Data Efficient Reflow for Few Step Audio Generation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Communication Efficient Distributed Training with Distributed Lion.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LLM-QAT: Data-Free Quantization Aware Training for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

PathFusion: Path-Consistent Lidar-Camera Deep Feature Fusion.
Proceedings of the International Conference on 3D Vision, 2024

2023
SqueezeSAM: User friendly mobile interactive segmentation.
CoRR, 2023

Gen2Det: Generate to Detect.
CoRR, 2023

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images.
CoRR, 2023

MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning.
CoRR, 2023

Fast Point Cloud Generation with Straight Flows.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Binary and Ternary Natural Language Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting.
CoRR, 2022

Learning a Dual-Mode Speech Recognition Model VIA Self-Pruning.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Check-N-Run: a Checkpointing System for Training Deep Learning Recommendation Models.
Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation, 2022

BiT: Robustly Binarized Multi-distilled Transformer.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks.
Proceedings of the International Conference on Machine Learning, 2022

2021
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale.
IEEE Micro, 2021

2020
Check-N-Run: A Checkpointing System for Training Recommendation Models.
CoRR, 2020

2019
Deep Learning Recommendation Model for Personalization and Recommendation Systems.
CoRR, 2019

2018
Quantizing deep convolutional networks for efficient inference: A whitepaper.
CoRR, 2018

2007
FLO Physical Layer: An Overview.
IEEE Trans. Broadcast., 2007


  Loading...