Michiel Bacchiani

Orcid: 0000-0003-4527-0197

According to our database, Michiel Bacchiani authored at least 75 papers between 1994 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

FLEURS-R: A Restored Multilingual Speech Corpus for Generation Tasks.

,

,

,

,

,

Haruko Ishikawa

,

Michiel Bacchiani

CoRR, 2024

2023

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations.

,

,

,

,

,

Nobuyuki Morioka

,

,

,

,

Michiel Bacchiani

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus.

,

,

,

,

,

Nobuyuki Morioka

,

Michiel Bacchiani

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

Wavefit: an Iterative and Non-Autoregressive Neural Vocoder Based on Fixed-Point Iteration.

,

,

,

Michiel Bacchiani

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping.

,

,

,

,

Michiel Bacchiani

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

SNRi Target Training for Joint Speech Enhancement and Recognition.

,

,

,

Sankaran Panchapagesan

,

Michiel Bacchiani

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Knowledge Transfer from Large-Scale Pretrained Language Models to End-To-End Speech Recognizers.

,

,

Michiel Bacchiani

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement.

,

,

,

,

John R. Hershey

,

,

Michiel Bacchiani

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition.

,

,

Michiel Adriaan Unico Bacchiani

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Joint Phoneme-Grapheme Model for End-To-End Speech Recognition.

,

Michiel Bacchiani

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Speech Processing for Digital Home Assistants: Combining signal processing with deep-learning techniques.

Reinhold Haeb-Umbach

,

Shinji Watanabe

,

Tomohiro Nakatani

,

Michiel Bacchiani

,

Björn Hoffmeister

,

Michael L. Seltzer

,

,

IEEE Signal Process. Mag., 2019

Introduction to the Issue on Far-Field Speech Processing in the Era of Deep Learning: Speech Enhancement, Separation, and Recognition.

Shinji Watanabe

,

,

Michiel Bacchiani

,

Reinhold Haeb-Umbach

,

Michael L. Seltzer

IEEE J. Sel. Top. Signal Process., 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.

,

,

,

,

,

,

,

Tara N. Sainath

,

,

Chung-Cheng Chiu

,

,

,

,

Stella Laurenzo

,

,

,

Wolfgang Macherey

,

,

,

,

,

,

Rohit Prabhavalkar

,

,

,

,

,

,

Sébastien Jean

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Kuan-Chieh Wang

,

Ekaterina Gonina

,

,

,

,

,

,

,

,

,

George F. Foster

,

John Richardson

,

,

Antoine Bruguier

,

,

,

,

,

,

,

Vijayaditya Peddinti

,

,

Michiel Bacchiani

,

Thomas B. Jablin

,

Robert Suderman

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Dmitry Lepikhin

,

,

,

,

Shubham Toshniwal

,

,

Michael Nirschl

,

CoRR, 2019

2018

An Overview of the IEEE SPS Speech and Language Technical Committee [In the Spotlight].

Michiel Bacchiani

,

Eric Fosler-Lussier

IEEE Signal Process. Mag., 2018

Toward Domain-Invariant Speech Recognition via Large Scale Training.

,

,

,

,

Anshuman Tripathi

,

,

,

Trevor Strohman

,

Michiel Bacchiani

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

From Audio to Semantics: Approaches to End-to-End Spoken Language Understanding.

,

,

Michiel Bacchiani

,

,

,

Pedro J. Moreno

,

Rohit Prabhavalkar

,

,

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition.

,

,

,

Anshuman Tripathi

,

,

Tara N. Sainath

,

,

,

Michiel Bacchiani

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models.

,

,

,

Michiel Bacchiani

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Sampled Connectionist Temporal Classification.

,

,

,

,

Michiel Bacchiani

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Dialect Speech Recognition with a Single Sequence-to-Sequence Model.

,

Tara N. Sainath

,

,

Michiel Bacchiani

,

Eugene Weinstein

,

,

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Spectral Distortion Model for Training Phase-Sensitive Deep-Neural Networks for Far-Field Speech Recognition.

,

Tara N. Sainath

,

,

,

Rajeev C. Nongpiur

,

Michiel Bacchiani

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Sound Source Separation Using Phase Difference and Reliable Mask Selection.

,

,

Michiel Bacchiani

,

Richard M. Stern

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Performance of Mask Based Statistical Beamforming in a Smart Home Scenario.

,

Michiel Bacchiani

,

Tara N. Sainath

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.

Chung-Cheng Chiu

,

Tara N. Sainath

,

,

Rohit Prabhavalkar

,

,

,

,

,

,

Ekaterina Gonina

,

,

,

,

Michiel Bacchiani

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition.

Tara N. Sainath

,

,

Kevin W. Wilson

,

,

,

,

Michiel Bacchiani

,

,

Andrew W. Senior

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model.

,

Tara N. Sainath

,

,

Michiel Bacchiani

,

Eugene Weinstein

,

,

,

,

CoRR, 2017

End-to-End Training of Acoustic Models for Large Vocabulary Continuous Speech Recognition with TensorFlow.

,

,

,

Michiel Bacchiani

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Acoustic Modeling for Google Home.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home.

,

,

,

,

,

Tara N. Sainath

,

Michiel Bacchiani

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow.

,

,

,

Tara N. Sainath

,

Michiel Bacchiani

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Raw Multichannel Processing Using Deep Neural Networks.

Tara N. Sainath

,

,

Kevin W. Wilson

,

,

Michiel Bacchiani

,

,

,

,

Andrew W. Senior

,

,

,

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

Speech Research at Google to Enable Universal Speech Interfaces.

Michiel Bacchiani

,

Françoise Beaufays

,

Alexander Gruenstein

,

Pedro J. Moreno

,

Johan Schalkwyk

,

Trevor Strohman

,

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2016

Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling.

,

Tara N. Sainath

,

,

Michiel Bacchiani

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction.

Tara N. Sainath

,

,

,

,

Kevin W. Wilson

,

Michiel Bacchiani

,

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition.

,

Tara N. Sainath

,

,

Kevin W. Wilson

,

Michiel Bacchiani

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Factored spatial and spectral multichannel raw waveform CLDNNs.

Tara N. Sainath

,

,

Kevin W. Wilson

,

,

Michiel Bacchiani

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Large vocabulary automatic speech recognition for children.

,

,

,

Melissa K. Carroll

,

,

,

Tara N. Sainath

,

Andrew W. Senior

,

Françoise Beaufays

,

Michiel Bacchiani

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms.

Tara N. Sainath

,

,

Kevin W. Wilson

,

,

Michiel Bacchiani

,

Andrew W. Senior

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Asynchronous stochastic optimization for sequence training of deep neural networks: towards big data.

,

,

Pedro J. Moreno

,

Andrew W. Senior

,

Michiel Bacchiani

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Robust speech recognition using temporal masking and thresholding algorithm.

,

,

Michiel Bacchiani

,

Richard M. Stern

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Asynchronous, online, GMM-free training of a context dependent acoustic model for speech recognition.

Michiel Bacchiani

,

Andrew W. Senior

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

GMM-free DNN acoustic model training.

Andrew W. Senior

,

,

Michiel Bacchiani

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Asynchronous stochastic optimization for sequence training of deep neural networks.

,

,

Vincent Vanhoucke

,

Andrew W. Senior

,

Michiel Bacchiani

Proceedings of the IEEE International Conference on Acoustics, 2014

Context dependent state tying for speech recognition using deep neural network acoustic models.

Michiel Bacchiani

,

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

ivector-based acoustic data selection.

,

Michiel Bacchiani

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Rapid adaptation for mobile speech applications.

Michiel Bacchiani

Proceedings of the IEEE International Conference on Acoustics, 2013

2011

TechWare: Mobile Media Search Resources [Best of the Web].

,

Michiel Bacchiani

IEEE Signal Process. Mag., 2011

Discriminative Features for Language Identification.

Christopher Alberti

,

Michiel Bacchiani

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010

Decision tree state clustering with word and syllable features.

,

Christopher Alberti

,

Michiel Bacchiani

,

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Restoring punctuation and capitalization in transcribed speech.

Agustín Gravano

,

,

Michiel Bacchiani

Proceedings of the IEEE International Conference on Acoustics, 2009

An audio indexing system for election video material.

Christopher Alberti

,

Michiel Bacchiani

,

,

,

Anastassia Drofa

,

,

Pedro J. Moreno

,

,

Arnaud Sahuguet

,

,

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Confidence scores for acoustic model adaptation.

Christian Gollan

,

Michiel Bacchiani

Proceedings of the IEEE International Conference on Acoustics, 2008

Deploying GOOG-411: Early lessons in data, measurement, and testing.

Michiel Bacchiani

,

Françoise Beaufays

,

Johan Schalkwyk

,

,

Proceedings of the IEEE International Conference on Acoustics, 2008

2006

MAP adaptation of stochastic grammars.

Michiel Bacchiani

,

,

,

Comput. Speech Lang., 2006

2005

Fast vocabulary-independent audio search using path-based graph indexing.

,

Michiel Bacchiani

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Language Model Adaptation with MAP Estimation and the Perceptron Algorithm.

Michiel Bacchiani

,

,

Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Improved name recognition with meta-data dependent name networks.

,

Michiel Bacchiani

,

,

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Meta-data conditional language modeling.

Michiel Bacchiani

,

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Supervised and unsupervised PCFG adaptation to novel domains.

,

Michiel Bacchiani

Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Unsupervised language model adaptation.

Michiel Bacchiani

,

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Combining maximum likelihood and maximum a posteriori estimation for detailed acoustic modeling of context dependency.

Michiel Bacchiani

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

SCANMail: a voicemail interface that makes speech browsable, readable and searchable.

Steve Whittaker

,

Julia Hirschberg

,

,

,

Michiel Bacchiani

,

Philip L. Isenhour

,

,

,

Aaron E. Rosenberg

Proceedings of the CHI 2002 Conference on Human Factors in Computing Systems: Changing our World, 2002

2001

Audio Browsing and Search in the Voicemail Domain.

Julia Hirschberg

,

Michiel Bacchiani

,

Philip L. Isenhour

Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, 2001

SCANMail: Audio Navigation in the Voicemail Domain.

Michiel Bacchiani

,

Julia Hirschberg

,

Aaron E. Rosenberg

,

Steve Whittaker

,

,

Philip L. Isenhour

,

,

,

Proceedings of the First International Conference on Human Language Technology Research, 2001

Caller identification for the SCANMail voicemail browser.

Aaron E. Rosenberg

,

Julia Hirschberg

,

Michiel Bacchiani

,

Sarangarajan Parthasarathy

,

Philip L. Isenhour

,

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

SCANMail: browsing and searching speech data by content.

Julia Hirschberg

,

Michiel Bacchiani

,

,

Philip L. Isenhour

,

Aaron E. Rosenberg

,

,

,

Steve Whittaker

,

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Automatic transcription of voicemail at AT&T.

Michiel Bacchiani

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Using maximum likelihood linear regression for segment clustering and speaker identification.

Michiel Bacchiani

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999

Joint lexicon, acoustic unit inventory and model design.

Michiel Bacchiani

,

Speech Commun., 1999

AT&T at TREC-8.

,

Steven P. Abney

,

Michiel Bacchiani

,

Michael Collins

,

,

Fernando C. N. Pereira

Proceedings of The Eighth Text REtrieval Conference, 1999

1998

Using automatically-derived acoustic sub-word units in large vocabulary speech recognition.

Michiel Bacchiani

,

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1996

Speech recognition based on acoustically derived segment units.

Toshiaki Fukada

,

Michiel Bacchiani

,

Kuldip K. Paliwal

,

Yoshinori Sagisaka

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Design of a speech recognition system based on acoustically derived segmental units.

Michiel Bacchiani

,

,

Yoshinori Sagisaka

,

Kuldip K. Paliwal

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

Minimum classification error training algorithm for feature extractor and pattern classifier in speech recognition.

Kuldip K. Paliwal

,

Michiel Bacchiani

,

Yoshinori Sagisaka

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994

Optimization of time-frequency masking filters using the minimum classification error criterion.

Michiel Bacchiani

,

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
