DE602004020572D1 - Method and apparatus for reducing latency for automatic speech recognition using partial multi-pass results

Info

Publication number
DE602004020572D1
Authority
DE
Germany
Prior art keywords
latency
reducing
speech recognition
automatic speech
partial results
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602004020572T
Other languages
English (en)
Inventor
Michiel Adriaan Unic Bacchiani
Brian Scott Amento
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
AT&T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2003-12-23
Filing date
2004-12-15
Publication date
2009-05-28
Application filed by AT&T Corp
Publication of DE602004020572D1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/28: Constructional details of speech recognition systems
    • G10L15/32: Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
DE602004020572T 2003-12-23 2004-12-15 Method and apparatus for reducing latency for automatic speech recognition using partial multi-pass results Active DE602004020572D1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/742,852 US7729912B1 (en) 2003-12-23 2003-12-23 System and method for latency reduction for automatic speech recognition using partial multi-pass results

Publications (1)

Publication Number Publication Date
DE602004020572D1 (de) 2009-05-28

Family

ID=34552817

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602004020572T Active DE602004020572D1 (de) 2003-12-23 2004-12-15 Method and apparatus for reducing latency for automatic speech recognition using partial multi-pass results

Country Status (4)

Country Link
US (3) US7729912B1 (de)
EP (1) EP1548705B1 (de)
CA (1) CA2489903A1 (de)
DE (1) DE602004020572D1 (de)

Families Citing this family (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7899671B2 (en) * 2004-02-05 2011-03-01 Avaya, Inc. Recognition results postprocessor for use in voice recognition systems
US9009046B1 (en) * 2005-09-27 2015-04-14 At&T Intellectual Property Ii, L.P. System and method for disambiguating multiple intents in a natural language dialog system
JP4757599B2 (ja) * 2005-10-13 2011-08-24 日本電気株式会社 Speech recognition system, speech recognition method, and program
US7941316B2 (en) * 2005-10-28 2011-05-10 Microsoft Corporation Combined speech and alternate input modality to a mobile device
US9436951B1 (en) 2007-08-22 2016-09-06 Amazon Technologies, Inc. Facilitating presentation by mobile device of additional content for a word or phrase upon utterance thereof
US8510109B2 (en) 2007-08-22 2013-08-13 Canyon Ip Holdings Llc Continuous speech transcription performance indication
US8117268B2 (en) 2006-04-05 2012-02-14 Jablokov Victor R Hosted voice recognition system for wireless devices
US8352261B2 (en) * 2008-03-07 2013-01-08 Canyon IP Holdings, LLC Use of intermediate speech transcription results in editing final speech transcription results
US9973450B2 (en) 2007-09-17 2018-05-15 Amazon Technologies, Inc. Methods and systems for dynamically updating web service profile information by parsing transcribed message strings
US8352264B2 (en) 2008-03-19 2013-01-08 Canyon IP Holdings, LLC Corrective feedback loop for automated speech recognition
US9053489B2 (en) 2007-08-22 2015-06-09 Canyon Ip Holdings Llc Facilitating presentation of ads relating to words of a message
US8335829B1 (en) 2007-08-22 2012-12-18 Canyon IP Holdings, LLC Facilitating presentation by mobile device of additional content for a word or phrase upon utterance thereof
US8301454B2 (en) 2008-08-22 2012-10-30 Canyon Ip Holdings Llc Methods, apparatuses, and systems for providing timely user cues pertaining to speech recognition
US8386251B2 (en) * 2009-06-08 2013-02-26 Microsoft Corporation Progressive application of knowledge sources in multistage speech recognition
WO2011071484A1 (en) * 2009-12-08 2011-06-16 Nuance Communications, Inc. Guest speaker robust adapted speech recognition
US8494852B2 (en) 2010-01-05 2013-07-23 Google Inc. Word-level correction of speech input
US9798653B1 (en) * 2010-05-05 2017-10-24 Nuance Communications, Inc. Methods, apparatus and data structure for cross-language speech adaptation
US9009040B2 (en) * 2010-05-05 2015-04-14 Cisco Technology, Inc. Training a transcription system
US20120089392A1 (en) * 2010-10-07 2012-04-12 Microsoft Corporation Speech recognition user interface
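KR101208166B1 (ko) * 2010-12-16 2012-12-04 엔에이치엔(주) Speech recognition client system for processing online speech recognition, speech recognition server system, and speech recognition method
JP6109927B2 (ja) 2012-05-04 2017-04-05 カオニックス ラブス リミテッド ライアビリティ カンパニー System and method for source signal separation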
US10497381B2 (en) 2012-05-04 2019-12-03 Xmos Inc. Methods and systems for improved measurement, entity and parameter estimation, and path propagation effect measurement and mitigation in source signal separation
US8515750B1 (en) * 2012-06-05 2013-08-20 Google Inc. Realtime acoustic adaptation using stability measures
US8645138B1 (en) * 2012-12-20 2014-02-04 Google Inc. Two-pass decoding for speech recognition of search and action requests
WO2014145960A2 (en) * 2013-03-15 2014-09-18 Short Kevin M Method and system for generating advanced feature discrimination vectors for use in speech recognition
US9437186B1 (en) * 2013-06-19 2016-09-06 Amazon Technologies, Inc. Enhanced endpoint detection for speech recognition
US9514747B1 (en) * 2013-08-28 2016-12-06 Amazon Technologies, Inc. Reducing speech recognition latency
US9734820B2 (en) 2013-11-14 2017-08-15 Nuance Communications, Inc. System and method for translating real-time speech using segmentation based on conjunction locations
US8719032B1 (en) * 2013-12-11 2014-05-06 Jefferson Audio Video Systems, Inc. Methods for presenting speech blocks from a plurality of audio input data streams to a user in an interface
US11094320B1 (en) * 2014-12-22 2021-08-17 Amazon Technologies, Inc. Dialog visualization
US10395658B2 (en) 2017-05-22 2019-08-27 International Business Machines Corporation Pre-processing partial inputs for accelerating automatic dialog response
US20190204998A1 (en) * 2017-12-29 2019-07-04 Google Llc Audio book positioning
WO2020111676A1 (ko) * 2018-11-28 2020-06-04 삼성전자 주식회사 Speech recognition device and method
US10573312B1 (en) * 2018-12-04 2020-02-25 Sorenson Ip Holdings, Llc Transcription generation from multiple speech recognition systems
WO2020153736A1 (en) 2019-01-23 2020-07-30 Samsung Electronics Co., Ltd. Method and device for speech recognition
KR20200091797A (ko) * 2019-01-23 2020-07-31 삼성전자주식회사 Speech recognition device and method
EP3888084A4 (de) * 2019-05-16 2022-01-05 Samsung Electronics Co., Ltd. Method and device for providing a speech recognition service
CN110310643B (zh) * 2019-05-18 2021-04-30 江苏网进科技股份有限公司 License plate speech recognition system and method thereof
JP7566789B2 (ja) * 2019-06-04 2024-10-15 グーグル エルエルシー Two-pass end-to-end speech recognition
US12073824B2 (en) 2019-12-04 2024-08-27 Google Llc Two-pass end to end speech recognition
US11562731B2 (en) 2020-08-19 2023-01-24 Sorenson Ip Holdings, Llc Word replacement in transcriptions
US20230178079A1 (en) * 2021-12-07 2023-06-08 International Business Machines Corporation Adversarial speech-text protection against automated analysis

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6122613A (en) * 1997-01-30 2000-09-19 Dragon Systems, Inc. Speech recognition using multiple recognizers (selectively) applied to the same input sample
WO2000058946A1 (en) 1999-03-26 2000-10-05 Koninklijke Philips Electronics N.V. Client-server speech recognition
US7058573B1 (en) * 1999-04-20 2006-06-06 Nuance Communications Inc. Speech recognition system to selectively utilize different speech recognition techniques over multiple speech recognition passes
DE10138408A1 (de) 2001-08-04 2003-02-20 Philips Corp Intellectual Pty Method for supporting the proofreading of a speech-recognized text with a playback speed profile adapted to the recognition reliability
US6950795B1 (en) * 2001-10-11 2005-09-27 Palm, Inc. Method and system for a recognition system having a verification recognition system
US7328155B2 (en) * 2002-09-25 2008-02-05 Toyota Infotechnology Center Co., Ltd. Method and system for speech recognition using grammar weighted based upon location information
US7184957B2 (en) * 2002-09-25 2007-02-27 Toyota Infotechnology Center Co., Ltd. Multiple pass speech recognition method and system
US20040138885A1 (en) 2003-01-09 2004-07-15 Xiaofan Lin Commercial automatic speech recognition engine combinations
US7440895B1 (en) * 2003-12-01 2008-10-21 Lumenvox, Llc. System and method for tuning and testing in a speech recognition system

Also Published As

Publication number Publication date
US20100094628A1 (en) 2010-04-15
US20110313764A1 (en) 2011-12-22
US8010360B2 (en) 2011-08-30
US7729912B1 (en) 2010-06-01
CA2489903A1 (en) 2005-06-23
EP1548705B1 (de) 2009-04-15
US8209176B2 (en) 2012-06-26
EP1548705A1 (de) 2005-06-29

Similar Documents

Publication Publication Date Title
DE602004020572D1 (de) Method and apparatus for reducing latency for automatic speech recognition using partial multi-pass results
DE60309822D1 (de) Method and apparatus for speech recognition
DE602005000628D1 (de) Method and apparatus for multi-layer distributed speech recognition
DE602004023364D1 (de) Apparatus and method for speech recognition
DE602006009160D1 (de) Apparatus and method for speech enhancement
DE60310785D1 (de) Method and apparatus for translating spoken language
DE602004027325D1 (de) Apparatus and method for preprocessing for image character recognition
DE602006006952D1 (de) Apparatus and method for person identification
DE602004006654D1 (de) Apparatus and method for surface finishing
DE602005006412D1 (de) Method and apparatus for fundamental frequency determination
DE602005005550D1 (de) Method and apparatus for fatigue testing
DE60316912D1 (de) Method for speech recognition
DE50302434D1 (de) Apparatus for determining fuel quality and associated method
DE602004025322D1 (de) Method and apparatus for spunbonded nonwoven production
DE602004014675D1 (de) Method and apparatus for speech recognition
DE602005014294D1 (de) Apparatus and method for traffic shaping
DE60221158D1 (de) Apparatus and method for surface properties
DE602006020971D1 (de) Apparatus and method for bandwidth allocation
DE602006000487D1 (de) Method and apparatus for speech detection
DE602004028322D1 (de) System and method for metadata-dependent language modeling for automatic speech recognition
DE602005023270D1 (de) Apparatus and method for modeling relationships between signals
DE602004007418D1 (de) Apparatus and method for surface finishing
DE60229315D1 (de) Method and apparatus for speech recognition
DE602004010627D1 (de) Apparatus and method for interference suppression
DE602004001467D1 (de) Apparatus and method for surface finishing

Legal Events

Date Code Title Description
8364 No opposition during term of opposition