default search action
Johan Schalkwyk
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i7]Machel Reid, Nikolay Savinov, Denis Teplyashin, Dmitry Lepikhin, Timothy P. Lillicrap, Jean-Baptiste Alayrac, Radu Soricut, Angeliki Lazaridou, Orhan Firat, Julian Schrittwieser, Ioannis Antonoglou, Rohan Anil, Sebastian Borgeaud, Andrew M. Dai, Katie Millican, Ethan Dyer, Mia Glaese, Thibault Sottiaux, Benjamin Lee, Fabio Viola, Malcolm Reynolds, Yuanzhong Xu, James Molloy, Jilin Chen, Michael Isard, Paul Barham, Tom Hennigan, Ross McIlroy, Melvin Johnson, Johan Schalkwyk, Eli Collins, Eliza Rutherford, Erica Moreira, Kareem Ayoub, Megha Goel, Clemens Meyer, Gregory Thornton, Zhen Yang, Henryk Michalewski, Zaheer Abbas, Nathan Schucher, Ankesh Anand, Richard Ives, James Keeling, Karel Lenc, Salem Haykal, Siamak Shakeri, Pranav Shyam, Aakanksha Chowdhery, Roman Ring, Stephen Spencer, Eren Sezener, et al.:
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context. CoRR abs/2403.05530 (2024) - [i6]Ciprian Chelba, Johan Schalkwyk:
Coupling Speech Encoders with Downstream Text Models. CoRR abs/2407.17605 (2024) - 2023
- [c25]Mingqiu Wang, Wei Han, Izhak Shafran, Zelin Wu, Chung-Cheng Chiu, Yuan Cao, Nanxin Chen, Yu Zhang, Hagen Soltau, Paul K. Rubenstein, Lukas Zilka, Dian Yu, Golan Pundak, Nikhil Siddhartha, Johan Schalkwyk, Yonghui Wu:
SLM: Bridge the Thin Gap Between Speech and Text Foundation Models. ASRU 2023: 1-8 - [c24]Rami Botros, Rohit Prabhavalkar, Johan Schalkwyk, Ciprian Chelba, Tara N. Sainath, Françoise Beaufays:
Lego-Features: Exporting Modular Encoder Features for Streaming and Deliberation ASR. ICASSP 2023: 1-5 - [i5]Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara N. Sainath, Pedro J. Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu:
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages. CoRR abs/2303.01037 (2023) - [i4]Rami Botros, Rohit Prabhavalkar, Johan Schalkwyk, Ciprian Chelba, Tara N. Sainath, Françoise Beaufays:
Lego-Features: Exporting modular encoder features for streaming and deliberation ASR. CoRR abs/2304.00173 (2023) - [i3]Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara N. Sainath, Johan Schalkwyk, Matthew Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirovic, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats, Neil Zeghidour, Yu Zhang, Zhishuai Zhang, Lukas Zilka, Christian Havnø Frank:
AudioPaLM: A Large Language Model That Can Speak and Listen. CoRR abs/2306.12925 (2023) - [i2]Mingqiu Wang, Wei Han, Izhak Shafran, Zelin Wu, Chung-Cheng Chiu, Yuan Cao, Yongqiang Wang, Nanxin Chen, Yu Zhang, Hagen Soltau, Paul K. Rubenstein, Lukas Zilka, Dian Yu, Zhong Meng, Golan Pundak, Nikhil Siddhartha, Johan Schalkwyk, Yonghui Wu:
SLM: Bridge the thin gap between speech and text foundation models. CoRR abs/2310.00230 (2023) - [i1]Rohan Anil, Sebastian Borgeaud, Yonghui Wu, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Slav Petrov, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy P. Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul Ronald Barham, Tom Hennigan, Benjamin Lee, Fabio Viola, Malcolm Reynolds, Yuanzhong Xu, Ryan Doherty, Eli Collins, Clemens Meyer, Eliza Rutherford, Erica Moreira, Kareem Ayoub, Megha Goel, George Tucker, Enrique Piqueras, Maxim Krikun, Iain Barr, Nikolay Savinov, Ivo Danihelka, Becca Roelofs, Anaïs White, Anders Andreassen, Tamara von Glehn, Lakshman Yagati, Mehran Kazemi, Lucas Gonzalez, Misha Khalman, Jakub Sygnowski, et al.:
Gemini: A Family of Highly Capable Multimodal Models. CoRR abs/2312.11805 (2023)
2010 – 2019
- 2017
- [c23]David Rybach, Michael Riley, Johan Schalkwyk:
On lattice generation for large vocabulary speech recognition. ASRU 2017: 228-235 - [p1]Michiel Bacchiani, Françoise Beaufays, Alexander Gruenstein, Pedro J. Moreno, Johan Schalkwyk, Trevor Strohman, Heiga Zen:
Speech Research at Google to Enable Universal Speech Interfaces. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 385-399 - 2015
- [c22]Ouais Alsharif, Tom Ouyang, Françoise Beaufays, Shumin Zhai, Thomas M. Breuel, Johan Schalkwyk:
Long short term memory neural network for keyboard gesture decoding. ICASSP 2015: 2076-2080 - [c21]Hasim Sak, Andrew W. Senior, Kanishka Rao, Ozan Irsoy, Alex Graves, Françoise Beaufays, Johan Schalkwyk:
Learning acoustic frame labeling for speech recognition with recurrent neural networks. ICASSP 2015: 4280-4284 - 2012
- [c20]Cyril Allauzen, Edward Benson, Ciprian Chelba, Michael Riley, Johan Schalkwyk:
Voice Query Refinement. INTERSPEECH 2012: 2462-2465 - 2011
- [j2]Cyril Allauzen, Michael Riley, Johan Schalkwyk:
A Filter-Based Algorithm for Efficient Composition of Finite-State Transducers. Int. J. Found. Comput. Sci. 22(8): 1781-1795 (2011) - 2010
- [c19]Etienne Barnard, Johan Schalkwyk, Charl Johannes van Heerden, Pedro J. Moreno:
Voice search for development. INTERSPEECH 2010: 282-285 - [c18]Brandon Ballinger, Cyril Allauzen, Alexander Gruenstein, Johan Schalkwyk:
On-demand language model interpolation for mobile speech input. INTERSPEECH 2010: 1812-1815 - [c17]Ciprian Chelba, Johan Schalkwyk, Thorsten Brants, Vida Ha, Boulos Harb, Will Neveitt, Carolina Parada, Peng Xu:
Query language modeling for voice search. SLT 2010: 127-132 - [c16]Cyril Allauzen, Michael Riley, Johan Schalkwyk:
Filters for Efficient Composition of Weighted Finite-State Transducers. CIAA 2010: 28-38
2000 – 2009
- 2009
- [c15]Johan Schalkwyk:
OpenFst. FSMNLP 2009: 47 - [c14]Berna Erol, Jordan Cohen, Minoru Etoh, Hsiao-Wuen Hon, Jiebo Luo, Johan Schalkwyk:
Mobile media search. ICASSP 2009: 4897-4900 - [c13]Charl Johannes van Heerden, Johan Schalkwyk, Brian Strope:
Language modeling for what-with-where on GOOG-411. INTERSPEECH 2009: 991-994 - [c12]Cyril Allauzen, Michael Riley, Johan Schalkwyk:
A generalized composition algorithm for weighted finite-state transducers. INTERSPEECH 2009: 1203-1206 - [c11]Chao Wang, Johan Schalkwyk, Roberto Sicconi, Geoffrey Zweig, Marco van de Ven, Benjamin V. Tucker, Mirjam Ernestus:
Semantic context effects in the recognition of acoustically unreduced and reduced words. INTERSPEECH 2009: 1867-1870 - 2008
- [c10]Michiel Bacchiani, Françoise Beaufays, Johan Schalkwyk, Mike Schuster, Brian Strope:
Deploying GOOG-411: Early lessons in data, measurement, and testing. ICASSP 2008: 5260-5263 - 2007
- [c9]Cyril Allauzen, Michael Riley, Johan Schalkwyk, Wojciech Skut, Mehryar Mohri:
OpenFst: A General and Efficient Weighted Finite-State Transducer Library. CIAA 2007: 11-23 - 2003
- [c8]Johan Schalkwyk, I. Lee Hetherington, Ezra Story:
Speech recognition with dynamic grammars using finite-state transducers. INTERSPEECH 2003: 1969-1972
1990 – 1999
- 1998
- [c7]Stephen Sutton, Ronald A. Cole, Jacques de Villiers, Johan Schalkwyk, Pieter J. E. Vermeulen, Michael W. Macon, Yonghong Yan, Edward C. Kaiser, Brian Rundle, Khaldoun Shobaki, John-Paul Hosom, Alexander Kain, Johan Wouters, Dominic W. Massaro, Michael M. Cohen:
Universal speech tools: the CSLU toolkit. ICSLP 1998 - 1997
- [j1]Ronald A. Cole, David G. Novick, Pieter J. E. Vermeulen, Stephen Sutton, Mark A. Fanty, L. F. A. Wessels, Jacques de Villiers, Johan Schalkwyk, Brian Hansen, Daniel C. Burnett:
Experiments with a spoken dialogue system for taking the US census. Speech Commun. 23(3): 243-260 (1997) - [c6]Johan Schalkwyk, Jacques de Villiers, Sarel van Vuuren, Pieter J. E. Vermeulen:
CSLUsh: an extendible research environment. EUROSPEECH 1997: 689-692 - 1996
- [c5]Johan Schalkwyk, Neena Jain, Etienne Barnard:
Speaker verification with low storage requirements. ICASSP 1996: 693-696 - [c4]Stephen Sutton, David G. Novick, Ronald A. Cole, Pieter J. E. Vermeulen, Jacques de Villiers, Johan Schalkwyk, Mark A. Fanty:
Building 10, 000 spoken dialogue systems. ICSLP 1996: 709-712 - [c3]Zhihong Hu, Johan Schalkwyk, Etienne Barnard, Ronald A. Cole:
Speech recognition using syllable-like units. ICSLP 1996: 1117-1120 - 1994
- [c2]Johan Schalkwyk, Etienne Barnard, Jeffrey R. Sachs:
Detecting an imposter in telephone speech. ICASSP (1) 1994: 169-172 - [c1]Ronald A. Cole, David G. Novick, Mark A. Fanty, Pieter J. E. Vermeulen, Stephen Sutton, Daniel C. Burnett, Johan Schalkwyk:
A prototype voice-response questionnaire for the u.s. census. ICSLP 1994: 683-686
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-04 20:10 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint