WO2000046787A3 - System and method for automating transcription services - Google Patents

System and method for automating transcription services

Info

Publication number
WO2000046787A3
Authority
WO
WIPO (PCT)
Prior art keywords
training
file
speech recognition
status
recognition program
Prior art date
Application number
PCT/US2000/002808
Other languages
French (fr)
Other versions
WO2000046787A2 (en)
Inventor
Jonathan Kahn
Charles Qin
Thomas P Flynn
Robert J Tippe
Original Assignee
Custom Speech Usa Inc
Jonathan Kahn
Charles Qin
Thomas P Flynn
Robert J Tippe
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Custom Speech Usa Inc, Jonathan Kahn, Charles Qin, Thomas P Flynn, Robert J Tippe
Priority to GB0118231A (GB2361569B)
Priority to AU35882/00A (AU3588200A)
Priority to US09/889,870 (US7006967B1)
Priority to CA002362462A (CA2362462A1)
Publication of WO2000046787A2
Publication of WO2000046787A3
Priority to US10/014,677 (US20020095290A1)
Priority to HK02101880.9A (HK1041086A1)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221 Announcement of recognition results

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A system for substantially automating transcription services for multiple users (10, 11, 12), including a manual transcription station (50), a speech recognition program (40) and a routing program (200). The system generates a uniquely identified voice dictation file from a user and, based on that user's training status, routes the voice dictation file to the manual transcription station and the speech recognition program. A human transcriptionist creates a transcribed file for each voice dictation file. The speech recognition program creates written text for each voice dictation file if the training status is training or automated. If the training status of the current user is enrollment or training, a verbatim file is manually established and the speech recognition program is trained with an acoustic model using the verbatim and voice dictation files. The transcribed file is returned to the user if the training status is enrollment or training; the written text is returned if the status is automated.
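
The status-dependent behavior described in the abstract can be summarized as a small decision table. The following is a minimal sketch in Python, not the patented implementation: the names `TrainingStatus`, `RoutingPlan` and `plan_for` are hypothetical and chosen only for illustration, and the sketch follows the abstract's wording that a transcribed file is created for each voice dictation file.

```python
from dataclasses import dataclass
from enum import Enum


class TrainingStatus(Enum):
    """The three training statuses named in the abstract."""
    ENROLLMENT = "enrollment"
    TRAINING = "training"
    AUTOMATED = "automated"


@dataclass(frozen=True)
class RoutingPlan:
    """Hypothetical summary of what happens to one voice dictation file."""
    manual_transcription: bool      # a human transcriptionist creates a transcribed file
    speech_recognition_text: bool   # the speech recognition program creates written text
    train_acoustic_model: bool      # a verbatim file is made and used for training
    returned_to_user: str           # "transcribed_file" or "written_text"


def plan_for(status: TrainingStatus) -> RoutingPlan:
    """Map a user's training status to the processing steps in the abstract."""
    learning = status in (TrainingStatus.ENROLLMENT, TrainingStatus.TRAINING)
    return RoutingPlan(
        # Assumption: taken literally from "for each voice dictation file".
        manual_transcription=True,
        speech_recognition_text=status
        in (TrainingStatus.TRAINING, TrainingStatus.AUTOMATED),
        train_acoustic_model=learning,
        returned_to_user="transcribed_file" if learning else "written_text",
    )


if __name__ == "__main__":
    for status in TrainingStatus:
        print(status.value, plan_for(status))
```

Keeping the plan as an immutable record separates the routing decision from the work of transcription and training, which the abstract assigns to separate components (the routing program, the manual transcription station and the speech recognition program).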
PCT/US2000/002808 1999-02-05 2000-02-04 System and method for automating transcription services WO2000046787A2 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
GB0118231A GB2361569B (en) 1999-02-05 2000-02-04 System and method for automating transcription services
AU35882/00A AU3588200A (en) 1999-02-05 2000-02-04 System and method for automating transcription services
US09/889,870 US7006967B1 (en) 1999-02-05 2000-02-04 System and method for automating transcription services
CA002362462A CA2362462A1 (en) 1999-02-05 2000-02-04 System and method for automating transcription services
US10/014,677 US20020095290A1 (en) 1999-02-05 2001-12-11 Speech recognition program mapping tool to align an audio file to verbatim text
HK02101880.9A HK1041086A1 (en) 1999-02-05 2002-03-12 System and method for automating transcription services

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11894999P 1999-02-05 1999-02-05
US60/118,949 1999-02-05

Publications (2)

Publication Number Publication Date
WO2000046787A2 WO2000046787A2 (en) 2000-08-10
WO2000046787A3 WO2000046787A3 (en) 2000-12-14

Family

ID=22381731

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/002808 WO2000046787A2 (en) 1999-02-05 2000-02-04 System and method for automating transcription services

Country Status (5)

Country Link
AU (1) AU3588200A (en)
CA (1) CA2362462A1 (en)
GB (1) GB2361569B (en)
HK (1) HK1041086A1 (en)
WO (1) WO2000046787A2 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7383187B2 (en) 2001-01-24 2008-06-03 Bevocal, Inc. System, method and computer program product for a distributed speech recognition tuning platform
WO2002075724A1 (en) 2001-03-16 2002-09-26 Koninklijke Philips Electronics N.V. Transcription service stopping automatic transcription
DE10126020A1 (en) * 2001-05-28 2003-01-09 Olaf Berberich Automatic conversion of words spoken by speaker into digitally coded terms for processing by computer involves displaying term rejections in correction window for direct entry correction
GB2381688B (en) 2001-11-03 2004-09-22 Dremedia Ltd Time ordered indexing of audio-visual data
GB2388738B (en) 2001-11-03 2004-06-02 Dremedia Ltd Time ordered indexing of audio data
US20080086305A1 (en) * 2006-10-02 2008-04-10 Bighand Ltd. Digital dictation workflow system and method
US8024289B2 (en) 2007-07-31 2011-09-20 Bighand Ltd. System and method for efficiently providing content over a thin client network
CN109285548A (en) * 2017-07-19 2019-01-29 阿里巴巴集团控股有限公司 Information processing method, system, electronic equipment and computer storage medium
CN116074150B (en) * 2023-03-02 2023-06-09 广东浩博特科技股份有限公司 Switch control method and device for intelligent home and intelligent home

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5799273A (en) * 1996-09-24 1998-08-25 Allvoice Computing Plc Automated proofreading using interface linking recognized words to their audio data while text is being changed
US5875448A (en) * 1996-10-08 1999-02-23 Boys; Donald R. Data stream editing system including a hand-held voice-editing apparatus having a position-finding enunciator

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"DRAGON DICTATE FOR WINDOWS", DRAGON DICTATE USER'S GUIDE, XX, XX, 1 January 1995 (1995-01-01), XX, pages 01A - 01L + 01, XP002929983 *

Also Published As

Publication number Publication date
GB0118231D0 (en) 2001-09-19
GB2361569A (en) 2001-10-24
GB2361569B (en) 2003-12-24
AU3588200A (en) 2000-08-25
WO2000046787A2 (en) 2000-08-10
CA2362462A1 (en) 2000-08-10
HK1041086A1 (en) 2002-06-28

Similar Documents

Publication Publication Date Title
CA2351705A1 (en) System and method for automating transcription services
AP2001002243A0 (en) Automated transcription system and method using two speech converting instances and computer-assisted correction.
JP3282075B2 (en) Apparatus and method for automatically generating punctuation in continuous speech recognition
Traunmüller Conventional, biological and environmental factors in speech communication: a modulation theory
WO2002054033A3 (en) Hierarchical language models for speech recognition
ATE297588T1 (en) ADJUSTING PHONETIC CONTEXT TO IMPROVE SPEECH RECOGNITION
EP1022722A3 (en) Speaker adaptation based on eigenvoices
US6138099A (en) Automatically updating language models
HK1054813A1 (en) Language independent voice-based user interface
WO2006023631A3 (en) Document transcription system training
CN103903627A (en) Voice-data transmission method and device
WO2004003688A3 (en) A method for comparing a transcribed text file with a previously created file
CN105304080A (en) Speech synthesis device and speech synthesis method
ATE314718T1 (en) SPEAKER ADAPTED VOICE RECOGNITION
SG128406A1 (en) Character recognizing and translating system and voice recognizing and translating system
WO2001097213A8 (en) Speech recognition using utterance-level confidence estimates
WO2002097590A3 (en) Language independent and voice operated information management system
DE69822179D1 (en) METHOD FOR LEARNING PATTERNS FOR VOICE OR SPEAKER RECOGNITION
EP0867857A3 (en) Enrolment in speech recognition
WO2007140047A3 (en) Grammar adaptation through cooperative client and server based speech recognition
CN108766441A (en) A kind of sound control method and device based on offline Application on Voiceprint Recognition and speech recognition
WO2002071391A3 (en) Hierarchichal language models
EP0788090A3 (en) Transcription of speech data with segments from acoustically dissimilar environments
DE69818231D1 (en) METHOD FOR THE DISCRIMINATIVE TRAINING OF VOICE RECOGNITION MODELS
EP1349145A3 (en) System and method for providing information using spoken dialogue interface

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

WWE WIPO information: entry into national phase

Ref document number: 09889870

Country of ref document: US

ENP Entry into the national phase

Ref document number: 200118231

Country of ref document: GB

Kind code of ref document: A

WWE WIPO information: entry into national phase

Ref document number: 35882/00

Country of ref document: AU

WWE WIPO information: entry into national phase

Ref document number: IN/PCT/2001/780/KOL

Country of ref document: IN

ENP Entry into the national phase

Ref document number: 2362462

Country of ref document: CA

Ref document number: 2362462

Country of ref document: CA

Kind code of ref document: A

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 EP: PCT application non-entry in European phase