Raghuraman Krishnamoorthi

According to our database1, Raghuraman Krishnamoorthi authored at least 25 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SpinQuant: LLM quantization with learned rotations.
CoRR, 2024

Communication Efficient Distributed Training with Distributed Lion.
CoRR, 2024

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models.
Proceedings of the IEEE International Conference on Acoustics, 2024

LLM-QAT: Data-Free Quantization Aware Training for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

PathFusion: Path-Consistent Lidar-Camera Deep Feature Fusion.
Proceedings of the International Conference on 3D Vision, 2024

2023
SqueezeSAM: User friendly mobile interactive segmentation.
CoRR, 2023

Gen2Det: Generate to Detect.
CoRR, 2023

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images.
CoRR, 2023

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything.
CoRR, 2023

MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning.
CoRR, 2023

Fast Point Cloud Generation with Straight Flows.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Binary and Ternary Natural Language Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
LiCo-Net: Linearized Convolution Network for Hardware-efficient Keyword Spotting.
CoRR, 2022

Learning a Dual-Mode Speech Recognition Model VIA Self-Pruning.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Check-N-Run: a Checkpointing System for Training Deep Learning Recommendation Models.
Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation, 2022

BiT: Robustly Binarized Multi-distilled Transformer.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks.
Proceedings of the International Conference on Machine Learning, 2022

2021
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale.
IEEE Micro, 2021

2020
Check-N-Run: A Checkpointing System for Training Recommendation Models.
CoRR, 2020

2019
Deep Learning Recommendation Model for Personalization and Recommendation Systems.
CoRR, 2019

2018
Quantizing deep convolutional networks for efficient inference: A whitepaper.
CoRR, 2018

2007
FLO Physical Layer: An Overview.
IEEE Trans. Broadcast., 2007


  Loading...