Devang Naik

Orcid: 0009-0007-7838-1623

According to our database1, Devang Naik authored at least 31 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization.
CoRR, 2024

SLiCK: Exploiting Subsequences for Length-Constrained Keyword Spotting.
CoRR, 2024

An Efficient and Streaming Audio Visual Active Speaker Detection System.
CoRR, 2024

RepCNN: Micro-sized, Mighty Models for Wakeword Detection.
CoRR, 2024

eDKM: An Efficient and Accurate Train-Time Weight Clustering for Large Language Models.
IEEE Comput. Archit. Lett., 2024

KV-Runahead: Scalable Causal LLM Inference by Parallel Key-Value Cache Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Flexible Keyword Spotting Based on Homogeneous Audio-Text Embedding.
Proceedings of the IEEE International Conference on Acoustics, 2024

Improving Vision-Inspired Keyword Spotting Using Dynamic Module Skipping in Streaming Conformer Encoder.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Weight subcloning: direct initialization of transformers using larger pretrained ones.
CoRR, 2023

PDP: Parameter-free Differentiable Pruning is All You Need.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Matching Latent Encoding for Audio-Text based Keyword Spotting.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

I See What You Hear: A Vision-Inspired Method to Localize Words.
Proceedings of the IEEE International Conference on Acoustics, 2023

HEiMDaL: Highly Efficient Method for Detection and Localization of Wake-Words.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
I see what you hear: a vision-inspired method to localize words.
CoRR, 2022

2021
Optimize What Matters: Training DNN-Hmm Keyword Spotting Model Using End Metric.
Proceedings of the IEEE International Conference on Acoustics, 2021

Knowledge Transfer for Efficient on-Device False Trigger Mitigation.
Proceedings of the IEEE International Conference on Acoustics, 2021

On The Role of Visual Cues in Audiovisual Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Robust Multichannel Linear Prediction for Online Speech Dereverberation Using Weighted Householder Least Squares Lattice Adaptive Filter.
IEEE Trans. Signal Process., 2020

Double-Talk Robust Multichannel Acoustic Echo Cancellation Using Least-Squares MIMO Adaptive Filtering: Transversal, Array, and Lattice Forms.
IEEE Trans. Signal Process., 2020

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement.
CoRR, 2020

Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Multi-Task Learning for Speaker Verification and Voice Trigger Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Detecting Emotion Primitives from Speech and Their Use in Discerning Categorical Emotions.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Lattice-Based Improvements for Voice Triggering Using Graph Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Neural Text-to-Speech Adaptation from Low Quality Public Recordings.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2001
Language-independent, short-enrollment voice verification over a far-field microphone.
Proceedings of the IEEE International Conference on Acoustics, 2001

1999
Design and ccollection of a corpus of polyphones and prosodic contexts for speech synthesis research and development.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1997
Using talker location to detect spurious utterances in desktop command and control.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
A novel word clustering algorithm based on latent semantic analysis.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Pole-filtered cepstral mean subtraction.
Proceedings of the 1995 International Conference on Acoustics, 1995


  Loading...