Aditya Krishna Menon

Affiliations:
  • NICTA, Canberra, Australia
  • Australian National University, College of Engineering & Computer Science


According to our database1, Aditya Krishna Menon authored at least 103 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
What do larger image classifiers memorise?
Trans. Mach. Learn. Res., 2024

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs.
CoRR, 2024

Efficient Document Ranking with Learnable Late Interactions.
CoRR, 2024

Cascade-Aware Training of Language Models.
CoRR, 2024

Faster Cascades via Speculative Decoding.
CoRR, 2024

Metric-aware LLM inference.
CoRR, 2024

USTAD: Unified Single-model Training Achieving Diverse Scores for Information Retrieval.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

DistillSpec: Improving Speculative Decoding via Knowledge Distillation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

The importance of feature preprocessing for differentially private linear optimization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Plugin estimators for selective classification with out-of-distribution detection.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning to Reject Meets Long-tail Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Language Model Cascades: Token-Level Uncertainty And Beyond.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Think before you speak: Training Language Models With Pause Tokens.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Regression Aware Inference with LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Learning to reject meets OOD detection: Are all abstentions created equal?
CoRR, 2023

EmbedDistill: A Geometric Knowledge Distillation for Information Retrieval.
CoRR, 2023

Robust distillation for worst-class performance: on the interplay between teacher and student objectives.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

ResMem: Learn what you can and memorize the rest.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On student-teacher deviations in distillation: does it pay to disobey?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

When Does Confidence-Based Cascade Deferral Suffice?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Supervision Complexity and its Role in Knowledge Distillation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Teacher's pet: understanding and mitigating biases in distillation.
Trans. Mach. Learn. Res., 2022

Interval-censored Hawkes processes.
J. Mach. Learn. Res., 2022

When does mixup promote local linearity in learned representations?
CoRR, 2022

Robust Distillation for Worst-class Performance.
CoRR, 2022

ELM: Embedding and Logit Margins for Long-Tail Learning.
CoRR, 2022

Post-hoc estimators for learning to defer to an expert.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

In defense of dual-encoders for neural ranking.
Proceedings of the International Conference on Machine Learning, 2022

2021
When in Doubt, Summon the Titans: Efficient Inference with Large Models.
CoRR, 2021

Interval-censored Hawkes processes.
CoRR, 2021

Distilling Double Descent.
CoRR, 2021

On the Reproducibility of Neural Network Predictions.
CoRR, 2021

Training Over-parameterized Models with Non-decomposable Objectives.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Disentangling Sampling and Labeling Bias for Learning in Large-output Spaces.
Proceedings of the 38th International Conference on Machine Learning, 2021

A statistical perspective on distillation.
Proceedings of the 38th International Conference on Machine Learning, 2021

Coping with Label Shift via Distributionally Robust Optimisation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Overparameterisation and worst-case generalisation: friend or foe?
Proceedings of the 9th International Conference on Learning Representations, 2021

Long-tail learning via logit adjustment.
Proceedings of the 9th International Conference on Learning Representations, 2021

Self-supervised Learning for Large-scale Item Recommendations.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

RankDistil: Knowledge Distillation for Ranking.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
SupMMD: A Sentence Importance Model for Extractive Summarization using Maximum Mean Discrepancy.
CoRR, 2020

Self-supervised Learning for Deep Models in Recommendations.
CoRR, 2020

Why distillation helps: a statistical perspective.
CoRR, 2020

Doubly-stochastic mining for heterogeneous retrieval.
CoRR, 2020

Robust large-margin learning in hyperbolic space.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Federated Learning with Only Positive Labels.
Proceedings of the 37th International Conference on Machine Learning, 2020

Supervised learning: no loss no cry.
Proceedings of the 37th International Conference on Machine Learning, 2020

Does label smoothing mitigate label noise?
Proceedings of the 37th International Conference on Machine Learning, 2020

Can gradient clipping mitigate label noise?
Proceedings of the 8th International Conference on Learning Representations, 2020

Semantic Label Smoothing for Sequence to Sequence Problems.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

SupMMD: A Sentence Importance Model for Extractive Summarisation using Maximum Mean Discrepancy.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019
The risk of trivial solutions in bipartite top ranking.
Mach. Learn., 2019

Online Hierarchical Clustering Approximations.
CoRR, 2019

Noise-tolerant fair classification.
CoRR, 2019

Fairness risk measures.
Proceedings of the 36th International Conference on Machine Learning, 2019

Complementary-Label Learning for Arbitrary Losses and Models.
Proceedings of the 36th International Conference on Machine Learning, 2019

Monge blunts Bayes: Hardness Results for Adversarial Training.
Proceedings of the 36th International Conference on Machine Learning, 2019

On the Minimal Supervision for Training Any Binary Classifier from Only Unlabeled Data.
Proceedings of the 7th International Conference on Learning Representations, 2019

Comparative Document Summarisation via Classification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Cold-start playlist recommendation with multitask learning.
PeerJ Prepr., 2018

Learning from binary labels with instance-dependent noise.
Mach. Learn., 2018

Monge beats Bayes: Hardness Results for Adversarial Training.
CoRR, 2018

Anomaly Detection using One-Class Neural Networks.
CoRR, 2018

The cost of fairness in binary classification.
Proceedings of the Conference on Fairness, Accountability and Transparency, 2018

Proper Loss Functions for Nonlinear Hawkes Processes.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
The cost of fairness in classification.
CoRR, 2017

Structured Recommendation.
CoRR, 2017

Revisiting revisits in trajectory recommendation.
Proceedings of International Workshop on Citizens for Recommender Systems, 2017

PathRec: Visual Analysis of Travel Route Recommendations.
Proceedings of the Eleventh ACM Conference on Recommender Systems, 2017

Robust, Deep and Inductive Anomaly Detection.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

f-GANs in an Information Geometric Nutshell.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Predicting Short-Term Public Transport Demand via Inhomogeneous Poisson Processes.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Low-Rank Linear Cold-Start Recommendation from Social Data.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Bipartite Ranking: a Risk-Theoretic Perspective.
J. Mach. Learn. Res., 2016

Making Neural Networks Robust to Label Noise: a Loss Correction Approach.
CoRR, 2016

Learning from Binary Labels with Instance-Dependent Corruption.
CoRR, 2016

A scaled Bregman theorem with applications.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Practical Linear Models for Large-Scale One-Class Collaborative Filtering.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Linking losses for density ratio and class-probability estimation.
Proceedings of the 33nd International Conference on Machine Learning, 2016

On the Effectiveness of Linear Models for One-Class Collaborative Filtering.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
An Average Classification Algorithm.
CoRR, 2015

AutoRec: Autoencoders Meet Collaborative Filtering.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Cross-Modal Retrieval: A Pairwise Classification Approach.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Learning with Symmetric Label Noise: The Importance of Being Unhinged.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Learning from Corrupted Binary Labels via Class-Probability Estimation.
Proceedings of the 32nd International Conference on Machine Learning, 2015

2014
Detecting inappropriate access to electronic health records using collaborative filtering.
Mach. Learn., 2014

Bayes-Optimal Scorers for Bipartite Ranking.
Proceedings of The 27th Conference on Learning Theory, 2014

2013
Latent feature models for dyadic prediction /.
PhD thesis, 2013

Beam search algorithms for multilabel learning.
Mach. Learn., 2013

A colorful approach to text processing by example.
Proceedings of the 26th Annual ACM Symposium on User Interface Software and Technology, 2013

A Machine Learning Framework for Programming by Example.
Proceedings of the 30th International Conference on Machine Learning, 2013

On the Statistical Consistency of Algorithms for Binary Classification under Class Imbalance.
Proceedings of the 30th International Conference on Machine Learning, 2013

2012
Textual Features for Programming by Example
CoRR, 2012

Learning and Inference in Probabilistic Classifier Chains with Beam Search.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Predicting accurate probabilities with a ranking loss.
Proceedings of the 29th International Conference on Machine Learning, 2012

2011
Fast Algorithms for Approximating the Singular Value Decomposition.
ACM Trans. Knowl. Discov. Data, 2011

Link Prediction via Matrix Factorization.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

Response prediction using collaborative filtering with hierarchies and side-information.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

2010
Predicting labels for dyadic data.
Data Min. Knowl. Discov., 2010

Dyadic Prediction Using a Latent Feature Log-Linear Model
CoRR, 2010

A Log-Linear Model with Latent Features for Dyadic Prediction.
Proceedings of the ICDM 2010, 2010

2007
An incremental data-stream sketch using sparse random projections.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007


  Loading...