We stand with Ukraine

We stand with Ukraine

Johan Schalkwyk

According to our database¹, Johan Schalkwyk authored at least 34 papers between 1994 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Coupling Speech Encoders with Downstream Text Models.

[BibT_eX]

[DOI]

,

Johan Schalkwyk

CoRR, 2024

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.

[BibT_eX]

[DOI]

CoRR, 2024

2023

Gemini: A Family of Highly Capable Multimodal Models.

[BibT_eX]

[DOI]

CoRR, 2023

SLM: Bridge the thin gap between speech and text foundation models.

[BibT_eX]

[DOI]

,

,

,

,

Chung-Cheng Chiu

,

,

,

,

,

,

Paul K. Rubenstein

,

,

,

,

,

Nikhil Siddhartha

,

Johan Schalkwyk

,

CoRR, 2023

AudioPaLM: A Large Language Model That Can Speak and Listen.

[BibT_eX]

[DOI]

CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.

[BibT_eX]

[DOI]

CoRR, 2023

Lego-Features: Exporting Modular Encoder Features for Streaming and Deliberation ASR.

[BibT_eX]

[DOI]

,

Rohit Prabhavalkar

,

Johan Schalkwyk

,

,

Tara N. Sainath

,

Françoise Beaufays

Proceedings of the IEEE International Conference on Acoustics, 2023

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models.

[BibT_eX]

[DOI]

,

,

,

,

Chung-Cheng Chiu

,

,

,

,

,

Paul K. Rubenstein

,

,

,

,

Nikhil Siddhartha

,

Johan Schalkwyk

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2017

On lattice generation for large vocabulary speech recognition.

[BibT_eX]

[DOI]

,

,

Johan Schalkwyk

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Speech Research at Google to Enable Universal Speech Interfaces.

[BibT_eX]

[DOI]

Michiel Bacchiani

,

Françoise Beaufays

,

Alexander Gruenstein

,

Pedro J. Moreno

,

Johan Schalkwyk

,

Trevor Strohman

,

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017

2015

Learning acoustic frame labeling for speech recognition with recurrent neural networks.

[BibT_eX]

[DOI]

,

Andrew W. Senior

,

,

,

,

Françoise Beaufays

,

Johan Schalkwyk

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Long short term memory neural network for keyboard gesture decoding.

[BibT_eX]

[DOI]

,

,

Françoise Beaufays

,

,

Thomas M. Breuel

,

Johan Schalkwyk

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2012

Voice Query Refinement.

[BibT_eX]

[DOI]

,

,

,

,

Johan Schalkwyk

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

A Filter-Based Algorithm for Efficient Composition of Finite-State Transducers.

[BibT_eX]

[DOI]

,

,

Johan Schalkwyk

Int. J. Found. Comput. Sci., 2011

2010

Filters for Efficient Composition of Weighted Finite-State Transducers.

[BibT_eX]

[DOI]

,

,

Johan Schalkwyk

Proceedings of the Implementation and Application of Automata, 2010

Query language modeling for voice search.

[BibT_eX]

[DOI]

,

Johan Schalkwyk

,

Thorsten Brants

,

,

,

,

Carolina Parada

,

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Voice search for development.

[BibT_eX]

[DOI]

Etienne Barnard

,

Johan Schalkwyk

,

Charl Johannes van Heerden

,

Pedro J. Moreno

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

On-demand language model interpolation for mobile speech input.

[BibT_eX]

[DOI]

Brandon Ballinger

,

,

Alexander Gruenstein

,

Johan Schalkwyk

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Semantic context effects in the recognition of acoustically unreduced and reduced words.

[BibT_eX]

[DOI]

,

Johan Schalkwyk

,

Roberto Sicconi

,

,

Marco van de Ven

,

Benjamin V. Tucker

,

Mirjam Ernestus

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Language modeling for what-with-where on GOOG-411.

[BibT_eX]

[DOI]

Charl Johannes van Heerden

,

Johan Schalkwyk

,

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A generalized composition algorithm for weighted finite-state transducers.

[BibT_eX]

[DOI]

,

,

Johan Schalkwyk

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Mobile media search.

[BibT_eX]

[DOI]

,

,

,

,

,

Johan Schalkwyk

Proceedings of the IEEE International Conference on Acoustics, 2009

OpenFst.

[BibT_eX]

[DOI]

Johan Schalkwyk

Proceedings of the Finite-State Methods and Natural Language Processing, 2009

2008

Deploying GOOG-411: Early lessons in data, measurement, and testing.

[BibT_eX]

[DOI]

Michiel Bacchiani

,

Françoise Beaufays

,

Johan Schalkwyk

,

,

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

OpenFst: A General and Efficient Weighted Finite-State Transducer Library.

[BibT_eX]

[DOI]

,

,

Johan Schalkwyk

,

,

Proceedings of the Implementation and Application of Automata, 2007

2003

Speech recognition with dynamic grammars using finite-state transducers.

[BibT_eX]

[DOI]

Johan Schalkwyk

,

I. Lee Hetherington

,

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

1998

Universal speech tools: the CSLU toolkit.

[BibT_eX]

[DOI]

,

,

Jacques de Villiers

,

Johan Schalkwyk

,

Pieter J. E. Vermeulen

,

Michael W. Macon

,

,

Edward C. Kaiser

,

,

Khaldoun Shobaki

,

John-Paul Hosom

,

,

,

Dominic W. Massaro

,

Michael M. Cohen

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997

Experiments with a spoken dialogue system for taking the US census.

[BibT_eX]

[DOI]

,

David G. Novick

,

Pieter J. E. Vermeulen

,

,

,

L. F. A. Wessels

,

Jacques de Villiers

,

Johan Schalkwyk

,

,

Daniel C. Burnett

Speech Commun., 1997

CSLUsh: an extendible research environment.

[BibT_eX]

[DOI]

Johan Schalkwyk

,

Jacques de Villiers

,

Sarel van Vuuren

,

Pieter J. E. Vermeulen

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996

Building 10, 000 spoken dialogue systems.

[BibT_eX]

[DOI]

,

David G. Novick

,

,

Pieter J. E. Vermeulen

,

Jacques de Villiers

,

Johan Schalkwyk

,

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Speech recognition using syllable-like units.

[BibT_eX]

[DOI]

,

Johan Schalkwyk

,

Etienne Barnard

,

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Speaker verification with low storage requirements.

[BibT_eX]

[DOI]

Johan Schalkwyk

,

,

Etienne Barnard

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1994

A prototype voice-response questionnaire for the u.s. census.

[BibT_eX]

[DOI]

,

David G. Novick

,

,

Pieter J. E. Vermeulen

,

,

Daniel C. Burnett

,

Johan Schalkwyk

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Detecting an imposter in telephone speech.

[BibT_eX]

[DOI]

Johan Schalkwyk

,

Etienne Barnard

,

Jeffrey R. Sachs

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Loading...