Mayank Mishra

Orcid: 0000-0002-4654-1246

According to our database1, Mayank Mishra authored at least 62 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Selective Self-Rehearsal: A Fine-Tuning Approach to Improve Generalization in Large Language Models.
CoRR, 2024

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler.
CoRR, 2024

Scaling Granite Code Models to 128K Context.
CoRR, 2024

Enhancing Training Efficiency Using Packing with Flash Attention.
CoRR, 2024

The infrastructure powering IBM's Gen AI model development.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2024

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention.
CoRR, 2024

Granite Code Models: A Family of Open Foundation Models for Code Intelligence.
CoRR, 2024

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models.
CoRR, 2024

Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization.
CoRR, 2024

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order.
CoRR, 2024

StarCoder 2 and The Stack v2: The Next Generation.
CoRR, 2024

Leftovers for LLaMA.
Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering, 2024

LLaMPS: Large Language Models Placement System.
Proceedings of the Companion of the 15th ACM/SPEC International Conference on Performance Engineering, 2024

BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DeiT-LT: Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TASCA : Tool for Automatic SCalable Acceleration of ML pipelines✱.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

2023
A classification framework for Autism Spectrum Disorder detection using sMRI: Optimizer based ensemble of deep convolution neural network with on-the-fly data augmentation.
Biomed. Signal Process. Control., July, 2023

StarCoder: may the source be with you!
Trans. Mach. Learn. Res., 2023

SantaCoder: don't reach for the stars!
CoRR, 2023

Accelerating Model Training: Performance Antipatterns Eliminator Framework.
Proceedings of the 3rd Workshop on Machine Learning and Systems, 2023

Scalable High-Performance Architecture for Evolving Recommender System.
Proceedings of the 3rd Workshop on Machine Learning and Systems, 2023

Prompting with Pseudo-Code Instructions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Joint Reasoning on Hybrid-knowledge sources for Task-Oriented Dialog.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

2022
MAPLE: Model Aggregation and Prediction for Learned Ecosystem.
Proceedings of the ICPE '22: ACM/SPEC International Conference on Performance Engineering, Bejing, China, April 9, 2022

Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Variational Learning for Unsupervised Knowledge Grounded Dialogs.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

A Closer Look at Smoothness in Domain Adversarial Training.
Proceedings of the International Conference on Machine Learning, 2022

Pay-as-you-Train: Efficient ways of Serverless Training.
Proceedings of the IEEE International Conference on Cloud Engineering, 2022

Metamodel driven acceleration of actor-based simulation.
Proceedings of the BiDEDE '22: Proceedings of The International Workshop on Big Data in Emergent Distributed Environments, 2022

2021
A comparative study of regression, neural network and neuro-fuzzy inference system for determining the compressive strength of brick-mortar masonry by fusing nondestructive testing data.
Eng. Comput., 2021

Accelerating Gradient-based Meta Learner.
CoRR, 2021

Performance and Cost Comparison of Cloud Services for Deep Learning Workload.
Proceedings of the ICPE '21: ACM/SPEC International Conference on Performance Engineering, 2021

RUSLI: Real-time Updatable Spline Learned Index.
Proceedings of the aiDM '21: Fourth Workshop in Exploiting AI Techniques for Data Management, 2021

Distributed training for accelerating metalearning algorithms.
Proceedings of the BiDEDE '21: Proceedings of the International Workshop on Big Data in Emergent Distributed Environments, 2021

SLA-aware Workload Scheduling Using Hybrid Cloud Services.
Proceedings of the HiPS@HPDC 2021: Proceedings of the 1st Workshop on High Performance Serverless Computing, 2021

FASCA: Framework for Automatic Scalable Acceleration of ML Pipeline.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Benchmarking of Quantization Libraries in Popular Frameworks.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

Towards Accelerating Offline RL based Recommender Systems.
Proceedings of the AIMLSystems 2021: The First International Conference on AI-ML-Systems, Bangalore India, October 21, 2021

2020
Teaching-learning-based optimisation algorithm and its application in capturing critical slip surface in slope stability analysis.
Soft Comput., 2020

Performance Studies of 10 Metaheuristic Techniques in Determination of Damages for Large-Scale Spatial Trusses from Changes in Vibration Responses.
J. Comput. Civ. Eng., 2020

A Vision on Accelerating Enterprise IT System 2.0.
Proceedings of the Fourth Workshop on Data Management for End-To-End Machine Learning, 2020

Recommending in changing times.
Proceedings of the RecSys 2020: Fourteenth ACM Conference on Recommender Systems, 2020

2019
Adversarial Approximate Inference for Speech to Electroglottograph Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Adversarial Approximate Inference for Speech to Electroglottograph Conversion.
CoRR, 2019

Crack Detection on Inner Tunnel Surface Using Image Processing.
Proceedings of the Progress in Advanced Computing and Intelligent Engineering, 2019

2018
A Novel Ramp-based Pulse Shaping Filter for Reducing Out of Band Emission in 5G GFDM System.
Proceedings of the TENCON 2018, 2018

Cracking the monolith: challenges in data transitioning to cloud native architectures.
Proceedings of the 12th European Conference on Software Architecture: Companion Proceedings, 2018

2017
On-Disk Data Processing: Issues and Future Directions.
CoRR, 2017

A Clustering based Prediction Scheme for High Utility Itemsets.
Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, 2017

2016
Whither Tightness of Packing? The Case for Stable VM Placement.
IEEE Trans. Cloud Comput., 2016

Bulk I/O Storage Management for Big Data Applications.
Proceedings of the 24th IEEE International Symposium on Modeling, 2016

A Novel Clustering Algorithm to Capture Utility Information in Transactional Data.
Proceedings of the 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2016) - Volume 1: KDIR, Porto - Portugal, November 9, 2016

De-Fragmenting the Cloud.
Proceedings of the IEEE/ACM 16th International Symposium on Cluster, 2016

2013
Managing Network Reservation for Tenants in Oversubscribed Clouds.
Proceedings of the 2013 IEEE 21st International Symposium on Modelling, 2013

2012
Dynamic resource management using virtual machine migrations.
IEEE Commun. Mag., 2012

Low cost computing using virtualization for Remote Desktop.
Proceedings of the Fourth International Conference on Communication Systems and Networks, 2012

2011
On Theory of VM Placement: Anomalies in Existing Methodologies and Their Mitigation Using a Novel Vector Based Approach.
Proceedings of the IEEE International Conference on Cloud Computing, 2011

2009
Design and Implementation of HMM-VQ based Isolated Digit Recognition System.
Proceedings of the 4th Indian International Conference on Artificial Intelligence, 2009

2007
An 802.11 Based MAC Protocol for Providing QoS to Real Time Applications.
Proceedings of the 10th International Conference on Information Technology, 2007

2006
A Contention Window Based Differentiation Mechanism for providing QoS in Wireless LANs.
Proceedings of the 9th International Conference in Information Technology, 2006

2005
Design and implementation of an architecture supporting mobile CORBA servants under intermittent connectivity environment.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2005

2004
Architecture for Locating Mobile CORBA Objects in Wireless Mobile Environment.
Proceedings of the Advanced Distributed Systems: Third International School and Symposium, 2004


  Loading...