ATE385024T1 - Multilinguale spracherkennung - Google Patents
Multilinguale spracherkennungInfo
- Publication number
- ATE385024T1 ATE385024T1 AT05003670T AT05003670T ATE385024T1 AT E385024 T1 ATE385024 T1 AT E385024T1 AT 05003670 T AT05003670 T AT 05003670T AT 05003670 T AT05003670 T AT 05003670T AT E385024 T1 ATE385024 T1 AT E385024T1
- Authority
- AT
- Austria
- Prior art keywords
- subword
- speech recognition
- items
- list
- subword unit
- Prior art date
Links
- 238000013518 transcription Methods 0.000 abstract 2
- 230000035897 transcription Effects 0.000 abstract 2
- 238000000034 method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP05003670A EP1693828B1 (de) | 2005-02-21 | 2005-02-21 | Multilinguale Spracherkennung |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE385024T1 true ATE385024T1 (de) | 2008-02-15 |
Family
ID=34933852
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT05003670T ATE385024T1 (de) | 2005-02-21 | 2005-02-21 | Multilinguale spracherkennung |
Country Status (4)
Country | Link |
---|---|
US (1) | US20060206331A1 (de) |
EP (1) | EP1693828B1 (de) |
AT (1) | ATE385024T1 (de) |
DE (1) | DE602005004503T2 (de) |
Families Citing this family (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1693829B1 (de) * | 2005-02-21 | 2018-12-05 | Harman Becker Automotive Systems GmbH | Sprachgesteuertes Datensystem |
US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
SG133419A1 (en) * | 2005-12-12 | 2007-07-30 | Creative Tech Ltd | A method and apparatus for accessing a digital file from a collection of digital files |
US7873517B2 (en) | 2006-11-09 | 2011-01-18 | Volkswagen Of America, Inc. | Motor vehicle with a speech interface |
DE102006057159A1 (de) | 2006-12-01 | 2008-06-05 | Deutsche Telekom Ag | Verfahren zur Klassifizierung der gesprochenen Sprache in Sprachdialogsystemen |
EP1975923B1 (de) | 2007-03-28 | 2016-04-27 | Nuance Communications, Inc. | Mehrsprachige nicht-muttersprachliche Spracherkennung |
US8099290B2 (en) * | 2009-01-28 | 2012-01-17 | Mitsubishi Electric Corporation | Voice recognition device |
US8949125B1 (en) * | 2010-06-16 | 2015-02-03 | Google Inc. | Annotating maps with user-contributed pronunciations |
US8489398B1 (en) | 2011-01-14 | 2013-07-16 | Google Inc. | Disambiguation of spoken proper names |
US9286894B1 (en) | 2012-01-31 | 2016-03-15 | Google Inc. | Parallel recognition |
US9431012B2 (en) | 2012-04-30 | 2016-08-30 | 2236008 Ontario Inc. | Post processing of natural language automatic speech recognition |
US9093076B2 (en) * | 2012-04-30 | 2015-07-28 | 2236008 Ontario Inc. | Multipass ASR controlling multiple applications |
US20140214401A1 (en) * | 2013-01-29 | 2014-07-31 | Tencent Technology (Shenzhen) Company Limited | Method and device for error correction model training and text error correction |
US9471567B2 (en) * | 2013-01-31 | 2016-10-18 | Ncr Corporation | Automatic language recognition |
DE102013005844B3 (de) * | 2013-03-28 | 2014-08-28 | Technische Universität Braunschweig | Verfahren und Vorrichtung zum Messen der Qualität eines Sprachsignals |
KR102084646B1 (ko) | 2013-07-04 | 2020-04-14 | 삼성전자주식회사 | 음성 인식 장치 및 음성 인식 방법 |
KR102394485B1 (ko) | 2013-08-26 | 2022-05-06 | 삼성전자주식회사 | 음성 인식을 위한 전자 장치 및 방법 |
DE112013007617B4 (de) | 2013-11-20 | 2020-06-18 | Mitsubishi Electric Corporation | Spracherkennungsvorrichtung und Spracherkennungsverfahren |
US9747897B2 (en) * | 2013-12-17 | 2017-08-29 | Google Inc. | Identifying substitute pronunciations |
US10339920B2 (en) * | 2014-03-04 | 2019-07-02 | Amazon Technologies, Inc. | Predicting pronunciation in speech recognition |
DE102014210716A1 (de) * | 2014-06-05 | 2015-12-17 | Continental Automotive Gmbh | Assistenzsystem, das mittels Spracheingaben steuerbar ist, mit einer Funktionseinrichtung und mehreren Spracherkennungsmodulen |
US9683862B2 (en) * | 2015-08-24 | 2017-06-20 | International Business Machines Corporation | Internationalization during navigation |
DE102015014206B4 (de) | 2015-11-04 | 2020-06-25 | Audi Ag | Verfahren und Vorrichtung zum Auswählen eines Navigationsziels aus einer von mehreren Sprachregionen mittels Spracheingabe |
US9959887B2 (en) * | 2016-03-08 | 2018-05-01 | International Business Machines Corporation | Multi-pass speech activity detection strategy to improve automatic speech recognition |
US10593321B2 (en) * | 2017-12-15 | 2020-03-17 | Mitsubishi Electric Research Laboratories, Inc. | Method and apparatus for multi-lingual end-to-end speech recognition |
US10565320B1 (en) | 2018-09-28 | 2020-02-18 | International Business Machines Corporation | Dynamic multilingual speech recognition |
WO2020226948A1 (en) | 2019-05-03 | 2020-11-12 | Google Llc | Phoneme-based contextualization for cross-lingual speech recognition in end-to-end models |
CN112364658B (zh) | 2019-07-24 | 2024-07-26 | 阿里巴巴集团控股有限公司 | 翻译以及语音识别方法、装置、设备 |
CN110634487B (zh) * | 2019-10-24 | 2022-05-17 | 科大讯飞股份有限公司 | 一种双语种混合语音识别方法、装置、设备及存储介质 |
CN111798836B (zh) * | 2020-08-03 | 2023-12-05 | 上海茂声智能科技有限公司 | 一种自动切换语种方法、装置、系统、设备和存储介质 |
CN113035171B (zh) * | 2021-03-05 | 2022-09-02 | 随锐科技集团股份有限公司 | 语音识别处理方法及系统 |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5602960A (en) * | 1994-09-30 | 1997-02-11 | Apple Computer, Inc. | Continuous mandarin chinese speech recognition system having an integrated tone classifier |
DE19636739C1 (de) * | 1996-09-10 | 1997-07-03 | Siemens Ag | Verfahren zur Mehrsprachenverwendung eines hidden Markov Lautmodelles in einem Spracherkennungssystem |
US6085160A (en) * | 1998-07-10 | 2000-07-04 | Lernout & Hauspie Speech Products N.V. | Language independent speech recognition |
US7120582B1 (en) * | 1999-09-07 | 2006-10-10 | Dragon Systems, Inc. | Expanding an effective vocabulary of a speech recognition system |
US6912499B1 (en) * | 1999-08-31 | 2005-06-28 | Nortel Networks Limited | Method and apparatus for training a multilingual speech model set |
EP1134726A1 (de) * | 2000-03-15 | 2001-09-19 | Siemens Aktiengesellschaft | Verfahren zur Erkennung von Sprachäusserungen nicht-muttersprachlicher Sprecher in einem Sprachverarbeitungssystem |
US7181395B1 (en) * | 2000-10-27 | 2007-02-20 | International Business Machines Corporation | Methods and apparatus for automatic generation of multiple pronunciations from acoustic data |
ATE297588T1 (de) * | 2000-11-14 | 2005-06-15 | Ibm | Anpassung des phonetischen kontextes zur verbesserung der spracherkennung |
GB0028277D0 (en) * | 2000-11-20 | 2001-01-03 | Canon Kk | Speech processing system |
EP1217610A1 (de) * | 2000-11-28 | 2002-06-26 | Siemens Aktiengesellschaft | Verfahren und System zur multilingualen Spracherkennung |
EP1233406A1 (de) * | 2001-02-14 | 2002-08-21 | Sony International (Europe) GmbH | Angepasste Spracherkennung für ausländische Sprecher |
US7043431B2 (en) * | 2001-08-31 | 2006-05-09 | Nokia Corporation | Multilingual speech recognition system using text derived recognition models |
DE10207895B4 (de) * | 2002-02-23 | 2005-11-03 | Harman Becker Automotive Systems Gmbh | Verfahren zur Spracherkennung und Spracherkennungssystem |
US7092883B1 (en) * | 2002-03-29 | 2006-08-15 | At&T | Generating confidence scores from word lattices |
US6932873B2 (en) * | 2002-07-30 | 2005-08-23 | Applied Materials Israel, Ltd. | Managing work-piece deflection |
US7149688B2 (en) * | 2002-11-04 | 2006-12-12 | Speechworks International, Inc. | Multi-lingual speech recognition with cross-language context modeling |
AU2003295682A1 (en) * | 2002-11-15 | 2004-06-15 | Voice Signal Technologies, Inc. | Multilingual speech recognition |
US8285537B2 (en) * | 2003-01-31 | 2012-10-09 | Comverse, Inc. | Recognition of proper nouns using native-language pronunciation |
US7689404B2 (en) * | 2004-02-24 | 2010-03-30 | Arkady Khasin | Method of multilingual speech recognition by reduction to single-language recognizer engine components |
US20050197837A1 (en) * | 2004-03-08 | 2005-09-08 | Janne Suontausta | Enhanced multilingual speech recognition system |
US20050267755A1 (en) * | 2004-05-27 | 2005-12-01 | Nokia Corporation | Arrangement for speech recognition |
-
2005
- 2005-02-21 AT AT05003670T patent/ATE385024T1/de not_active IP Right Cessation
- 2005-02-21 EP EP05003670A patent/EP1693828B1/de active Active
- 2005-02-21 DE DE602005004503T patent/DE602005004503T2/de active Active
-
2006
- 2006-02-21 US US11/360,024 patent/US20060206331A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
DE602005004503T2 (de) | 2009-01-22 |
EP1693828A1 (de) | 2006-08-23 |
EP1693828B1 (de) | 2008-01-23 |
US20060206331A1 (en) | 2006-09-14 |
DE602005004503D1 (de) | 2008-03-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE385024T1 (de) | Multilinguale spracherkennung | |
CN105869634B (zh) | 一种基于领域的带反馈语音识别后文本纠错方法及系统 | |
ATE527652T1 (de) | Mehrstufige spracherkennung | |
US20160336007A1 (en) | Speech search device and speech search method | |
ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
JP2006048628A5 (de) | ||
Bhaykar et al. | Speaker dependent, speaker independent and cross language emotion recognition from speech using GMM and HMM | |
CN108074562B (zh) | 语音识别装置、语音识别方法以及存储介质 | |
CN108091334B (zh) | 识别装置、识别方法以及存储介质 | |
Deng et al. | Improving accent identification and accented speech recognition under a framework of self-supervised learning | |
TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
WO2007015869A3 (en) | Spoken language proficiency assessment by computer | |
WO2006086511A8 (en) | Method and apparatus utilizing voice input to resolve ambiguous manually entered text input | |
WO2009008055A1 (ja) | 音声認識装置、音声認識方法、および、音声認識プログラム | |
Szöke et al. | BUT QUESST 2014 system description. | |
US8682668B2 (en) | Language model score look-ahead value imparting device, language model score look-ahead value imparting method, and program storage medium | |
Rastrow et al. | Towards using hybrid word and fragment units for vocabulary independent LVCSR systems | |
Marin et al. | Using syntactic and confusion network structure for out-of-vocabulary word detection | |
Gupta et al. | A language independent approach to audio search | |
WO2010018453A3 (en) | System and method for processing electronically generated text | |
Dzhambazov et al. | Automatic lyrics-to-audio alignment in classical Turkish music | |
WO2007119221A3 (en) | Method and apparatus for extracting musical score from a musical signal | |
JP2018151413A (ja) | 音声認識装置、音声認識方法およびプログラム | |
JP2009271117A (ja) | 音声検索装置および音声検索方法 | |
Kertkeidkachorn et al. | Using tone information in Thai spelling speech recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |