Navdeep Jaitly

According to our database1, Navdeep Jaitly authored at least 78 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling.
CoRR, 2024

Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition.
CoRR, 2024

How Far Are We from Intelligent Visual Deductive Reasoning?
CoRR, 2024

Divide-or-Conquer? Which Part Should You Distill Your LLM?
CoRR, 2024

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling.
CoRR, 2024

REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Construction of Paired Knowledge Graph - Text Datasets Informed by Cyclic Evaluation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
KGLens: A Parameterized Knowledge Graph Solution to Assess What an LLM Does and Doesn't Know.
CoRR, 2023

Generating Molecular Conformer Fields.
CoRR, 2023

Matryoshka Diffusion Models.
CoRR, 2023

The Entity-Deduction Arena: A playground for probing the conversational reasoning and planning capabilities of LLMs.
CoRR, 2023

Robotic Table Tennis: A Case Study into a High Speed Learning System.
CoRR, 2023


PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Continuous pseudo-labeling from the start.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

More Speaking or More Speakers?
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Understanding the Robustness of Multi-Exit Models under Common Corruptions.
CoRR, 2022

Position Prediction as an Effective Pretraining Strategy.
Proceedings of the International Conference on Machine Learning, 2022

Efficient Representation Learning via Adaptive Context Pooling.
Proceedings of the International Conference on Machine Learning, 2022

Continuous Soft Pseudo-Labeling in ASR.
Proceedings of the Proceedings on "I Can't Believe It's Not Better!, 2022

2021
RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

2020
Robotic Table Tennis with Model-Free Reinforcement Learning.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Imputer: Sequence Modelling via Imputation and Dynamic Programming.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong.
CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

2018
Hierarchical Policy Design for Sample-Efficient Learning of Robot Table Tennis Through Self-Play.
CoRR, 2018

Speech Recognition for Medical Conversations.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Learning Hard Alignments with Variational Inference.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Deep Learning for Automated Occlusion Edge Detection in RGB-D Frames.
J. Signal Process. Syst., 2017

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions.
CoRR, 2017

Sequence-to-Sequence Models Can Directly Transcribe Foreign Speech.
CoRR, 2017

Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model.
CoRR, 2017

Discrete Sequential Prediction of Continuous Actions for Deep RL.
CoRR, 2017

An online sequence-to-sequence model for noisy speech recognition.
CoRR, 2017

Next-Step Conditioned Deep Convolutional Neural Networks Improve Protein Secondary Structure Prediction.
CoRR, 2017

Sequence-to-Sequence Models Can Directly Translate Foreign Speech.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Tacotron: Towards End-to-End Speech Synthesis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An RNN Model of Text Normalization.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

An Analysis of "Attention" in Sequence-to-Sequence Models.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

A Comparison of Sequence-to-Sequence Models for Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Towards Better Decoding and Language Model Integration in Sequence to Sequence Models.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Latent Sequence Decompositions.
Proceedings of the 5th International Conference on Learning Representations, 2017

Very deep convolutional networks for end-to-end speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Learning online alignments with continuous rewards policy gradient.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
RNN Approaches to Text Normalization: A Challenge.
CoRR, 2016

Protein Secondary Structure Prediction Using Deep Multi-scale Convolutional Neural Networks and Next-Step Conditioning.
CoRR, 2016

Reward Augmented Maximum Likelihood for Neural Structured Prediction.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

An Online Sequence-to-Sequence Model Using Partial Conditioning.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Listen, attend and spell: A neural network for large vocabulary conversational speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Chained Predictions Using Convolutional Neural Networks.
Proceedings of the Computer Vision - ECCV 2016, 2016

Deep Discriminative and Generative Models for speech Pattern Recognition.
Proceedings of the Handbook of Pattern Recognition and Computer Vision, 5th Ed., 2016

2015
Exploring Deep Learning Methods for Discovering Features in Speech Signals.
PhD thesis, 2015

Adversarial Autoencoders.
CoRR, 2015

A Simple Way to Initialize Recurrent Networks of Rectified Linear Units.
CoRR, 2015

An Online Sequence-to-Sequence Model Using Partial Conditioning.
CoRR, 2015

Listen, Attend and Spell.
CoRR, 2015

Object Recognition from Short Videos for Robotic Perception.
CoRR, 2015

Pointer Networks.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

2014
Occlusion Edge Detection in RGB-D Frames using Deep Convolutional Networks.
CoRR, 2014

Multi-task Neural Networks for QSAR Predictions.
CoRR, 2014

Autoregressive product of multi-frame predictions can improve the accuracy of hybrid models.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Towards End-To-End Speech Recognition with Recurrent Neural Networks.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
MultiAlign: a multiple LC-MS analysis tool for targeted omics analysis.
BMC Bioinform., 2013

Using an autoencoder with deformable templates to discover features for automated speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Hybrid speech recognition with Deep Bidirectional LSTM.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
Normalizing Molecular Docking Rankings using Virtually Generated Decoys.
J. Chem. Inf. Model., 2011

Learning a better representation of speech soundwaves using restricted boltzmann machines.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
A Bayesian method for 3D macromolecular structure inference using class average images from single particle electron microscopy.
Bioinform., 2010

Applications in Data-Intensive Computing.
Adv. Comput., 2010

2009
Decon2LS: An open-source software package for automated processing and visualization of high resolution mass spectrometry data.
BMC Bioinform., 2009

An Architecture for Real Time Data Acquisition and Online Signal Processing for High Throughput Tandem Mass Spectrometry.
Proceedings of the Fifth International Conference on e-Science, 2009

2008
DAnTE: a statistical tool for quantitative analysis of -omics data.
Bioinform., 2008

DeconMSn: a software tool for accurate parent ion monoisotopic mass determination for tandem mass spectra.
Bioinform., 2008

2007
VIPER: an advanced software package to support high-throughput LC-MS peptide identification.
Bioinform., 2007


  Loading...