WO2011011224A1 - Hand-held speech generation device - Google Patents
Hand-held speech generation device Download PDFInfo
- Publication number
- WO2011011224A1 WO2011011224A1 PCT/US2010/041799 US2010041799W WO2011011224A1 WO 2011011224 A1 WO2011011224 A1 WO 2011011224A1 US 2010041799 W US2010041799 W US 2010041799W WO 2011011224 A1 WO2011011224 A1 WO 2011011224A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- generation device
- speech generation
- speech
- user
- computer
- Prior art date
Links
- 238000004891 communication Methods 0.000 claims description 56
- 238000000034 method Methods 0.000 claims description 56
- 230000000694 effects Effects 0.000 claims description 25
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 claims description 23
- 229910052749 magnesium Inorganic materials 0.000 claims description 23
- 239000011777 magnesium Substances 0.000 claims description 23
- 230000001413 cellular effect Effects 0.000 claims description 16
- 230000002093 peripheral effect Effects 0.000 claims description 13
- 238000012512 characterization method Methods 0.000 claims description 12
- 230000000007 visual effect Effects 0.000 claims description 11
- 239000000463 material Substances 0.000 claims description 9
- 229920003023 plastic Polymers 0.000 claims description 6
- 239000004033 plastic Substances 0.000 claims description 6
- 230000003190 augmentative effect Effects 0.000 claims description 3
- 239000011540 sensing material Substances 0.000 claims description 3
- VYZAMTAEIAYCRO-UHFFFAOYSA-N Chromium Chemical compound [Cr] VYZAMTAEIAYCRO-UHFFFAOYSA-N 0.000 claims description 2
- 239000004698 Polyethylene Substances 0.000 claims description 2
- 230000002996 emotional effect Effects 0.000 claims description 2
- 238000001746 injection moulding Methods 0.000 claims description 2
- 239000003562 lightweight material Substances 0.000 claims description 2
- -1 polyethylene Polymers 0.000 claims description 2
- 229920000573 polyethylene Polymers 0.000 claims description 2
- 229920005989 resin Polymers 0.000 claims description 2
- 239000011347 resin Substances 0.000 claims description 2
- 230000001755 vocal effect Effects 0.000 claims description 2
- 229910000861 Mg alloy Inorganic materials 0.000 claims 2
- 239000002861 polymer material Substances 0.000 claims 1
- 239000012815 thermoplastic material Substances 0.000 claims 1
- 239000000945 filler Substances 0.000 description 24
- 230000008901 benefit Effects 0.000 description 13
- 230000006870 function Effects 0.000 description 10
- 230000009471 action Effects 0.000 description 7
- 230000004913 activation Effects 0.000 description 6
- 238000001994 activation Methods 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 5
- 235000021152 breakfast Nutrition 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 239000011521 glass Substances 0.000 description 3
- 230000017525 heat dissipation Effects 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 230000002452 interceptive effect Effects 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 230000001681 protective effect Effects 0.000 description 3
- 206010011224 Cough Diseases 0.000 description 2
- 206010011469 Crying Diseases 0.000 description 2
- WHXSMMKQMYFTQS-UHFFFAOYSA-N Lithium Chemical compound [Li] WHXSMMKQMYFTQS-UHFFFAOYSA-N 0.000 description 2
- 241001282135 Poromitra oscitans Species 0.000 description 2
- 206010048232 Yawning Diseases 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 238000013500 data storage Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 229910052744 lithium Inorganic materials 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000009747 swallowing Effects 0.000 description 2
- 239000012780 transparent material Substances 0.000 description 2
- 241000270728 Alligator Species 0.000 description 1
- 102000016904 Armadillo Domain Proteins Human genes 0.000 description 1
- 108010014223 Armadillo Domain Proteins Proteins 0.000 description 1
- 229920002799 BoPET Polymers 0.000 description 1
- 241000282817 Bovidae Species 0.000 description 1
- 241000289632 Dasypodidae Species 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- 239000005041 Mylar™ Substances 0.000 description 1
- 241000289692 Myrmecophagidae Species 0.000 description 1
- 241000282881 Orycteropodidae Species 0.000 description 1
- 235000006029 Prunus persica var nucipersica Nutrition 0.000 description 1
- 244000017714 Prunus persica var. nucipersica Species 0.000 description 1
- 230000002730 additional effect Effects 0.000 description 1
- 239000000956 alloy Substances 0.000 description 1
- 229910045601 alloy Inorganic materials 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000004397 blinking Effects 0.000 description 1
- 239000003990 capacitor Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000005345 chemically strengthened glass Substances 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 239000004020 conductor Substances 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 230000008451 emotion Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000020169 heat generation Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- AMGQUBHHOARCQH-UHFFFAOYSA-N indium;oxotin Chemical compound [In].[Sn]=O AMGQUBHHOARCQH-UHFFFAOYSA-N 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000003754 machining Methods 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 239000002991 molded plastic Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000005693 optoelectronics Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000010079 rubber tapping Methods 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 230000003678 scratch resistant effect Effects 0.000 description 1
- 238000006748 scratching Methods 0.000 description 1
- 230000002393 scratching effect Effects 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000008961 swelling Effects 0.000 description 1
- 229920001169 thermoplastic Polymers 0.000 description 1
- 239000004416 thermosoftening plastic Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000003466 welding Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
Definitions
- the present invention generally pertains to speech generation devices, and more particularly to portable speech generation devices.
- Some speech generation devices while portable, consume large amounts of power and must be located near an electrical outlet, and this requirement limits freedom of movement of the user.
- Other conventional speech generation devices seek to overcome this problem with the provision of a battery, but in order to provide sufficient battery life, these so-called portable SGDs weigh five (5) pounds to seven (7) pounds and/or must be recharged within two (2) or three (3) hours. Because of the substantial power consumption, heat generation can be a problem in some conventional SGDs. While this problem can be addressed by the provision of fans, this solution tends to increase the size and weight of the SGD.
- the present subject matter is directed to an improved speech generation device (SGD) having electronic components that enable the SGD to transmit and receive messages to assist a user in communicating with others.
- the SGD may correspond to a particular special-purpose electronic device that permits a user to communicate with others by producing digitized or synthesized speech based on configured messages.
- Such messages may be preconfigured and/or selected and/or composed by a user within a message window provided as part of the speech generation device user interface.
- a variety of physical input devices and software interface features may be provided to facilitate the capture of user input to define what information should be displayed in a message window and ultimately communicated to others as spoken output, text message, phone call, e-mail or other outgoing
- One exemplary SGD embodiment includes casing features that enable the SGD to be operated as a hand-held device that is both portable and easy for a user to hold in his hands and operate to generate speech and accomplish additional functions.
- Specific casing components to accomplish such features may include first and second opposing sides of an outer shell that are shaped in a convex fashion and with a predetermined thickness to fit within the palms of a user's hands.
- First and second generally convex extensions may extend from a bottom casing component to provide an ergonomic configuration for facilitated location and grip of a user's fingers.
- the first and second extensions may be positioned on the bottom surface of the SGD and an outer speaker grille may be positioned to face outward from the user so that audio output from the SGD radiates outwardly from the user to others within a user's environment.
- one or more outer casing components are made from magnesium or an alioy thereof.
- magnesium When magnesium is used in the casing components, it enables the case to be an active functional component and not just a protective component.
- the magnesium case provides conductive and radiated immunity to the components mounted within the magnesium case without requiring separate shielding.
- the magnesium case also provides heat dissipation features for an SGD by coupling the microprocessor to the case.
- a capacitive touch screen is provided as an input device for the SGD instead of a keyboard, a mouse or a pressure sensitive touchscreen.
- a capacitive touch screen is provided as an input device for the SGD instead of a keyboard, a mouse or a pressure sensitive touchscreen.
- active sensing material can be applied to the back of any transparent material to achieve a thin and lightweight design. Multi-touch interaction is enabled and the need for
- hardware components may also include various communications devices and/or modules and related communications functionality.
- a wireless network adapter may be included to provide integrated Internet access.
- An RF device may be included as an integrated cell phone to enable the user to send and receive text messages and phone calls directly and speak during the phone conversation using the SGD, thereby eliminating the need for a separate telephone device,
- An infrared (IR) transceiver may be provided to function as a universal remote control for the SGD that can operate devices in the user's
- a Bluetooth interface may be included to provide Bluetooth radio signals that can be used to control a desktop computer, which appears on the SGD's display as a mouse and keyboard, or other Bluetooth-enabled devices.
- Additional features that may be included in exemplary SGD embodiments concern computer readable medium that, when executed by a processor, instruct the SGD to perform specific functionality.
- instructions stored in SGD memory may adapt the device to perform integrated web access for vocabulary generation in which content is selected from the Internet, and useful terms, phrases and/or images are extracted and available for immediate use in aided communication via the SGD.
- Another feature that may be provided in exemplary SGD embodiments corresponds to a voice inflection mark up feature that can add inflection or other speech style to the speech output of the SGD or that can add a predefined speech effect.
- Exemplary speech styles for modifying text may be defined in accordance with a variety of parameters, including but not limited to voice, volume, pitch, speech rate, spell mode, emphasis and context.
- communicated message may include but are not limited to coughing, laughing, crying, kissing, sniffing, swallowing, sighing, whistling, yawning, or making noises such as "eh", “oh”, “mmm”, etc.
- the selected speech styles and effects can help communicate different meanings just based on the different inflections imposed on different syllables of the words being spoken by the SGD.
- speech modifiers provide more interactive and enjoyable communication by an SGD user.
- Exemplary SGD embodiments may further include executable instructions stored on a computer-readable medium that implement a unique method of
- FIG. 1 provides a perspective view of a first exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention
- FIG. 2 provides another perspective view of a first exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention
- FIG. 3 provides an exploded view of a first exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention
- FIG. 4 provides a perspective view of a second exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention
- FIG. 5 provides a perspective view of a second exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention
- FIG. 6 provides an exploded view of a second exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention
- Fig. 7 provides a schematic diagram of exemplary hardware components for use within a speech generation device in accordance with an aspect of the present invention
- Fig. 8 provides an exemplary embodiment of a graphical user interface menu for providing a phone contacts menu for use with a cellular phone integrated with a speech generation device;
- Fig. 9 provides an exemplary embodiment of a graphical user interface for providing a contact editing menu for use with a cellular phone integrated with a speech generation device;
- Fig. 10 provides an exemplary embodiment of a graphical user interface menu for providing a phone menu for use with a cellular phone integrated with a speech generation device;
- Fig. 11 provides an exemplary embodiment of a graphical user interface menu for providing a text messaging menu for use with a cellular phone integrated with a speech generation device;
- Fig. 12 provides an exemplary embodiment of a graphical use interface menu for providing a web browser with menu bar use to access the internet with a speech generation device;
- Fig. 13 provides an exemplary embodiment of a graphical user interface menu for selecting text from a web page
- Fig. 14 provides an exemplary embodiment of a graphical user interface menu for providing a vocabulary list box based on information captured from the internet;
- Fig. 15 provides an exemplary embodiment of a graphical user interface menu for importing an online picture for use as symbols;
- Fig. 16 provides a flow chart of an exemplary method for integrating text from a vocabulary source into a user vocabulary list
- Fig. 17 provides an exemplary embodiment of a graphical user interface menu for displaying an embedded vocabulary list in a vocabulary list box;
- Fig. 18 provides a first view of an embodiment of a graphical user interface menu provided via software features for implementing slots and fillers;
- Fig. 19 provides a second view of an embodiment of a graphical user interface menu provided via software features for implementing slots and fillers;
- Fig. 20 provides a flow chart of an exemplary method for implementing speech styles and effects in spoken text;
- Fig. 21 provides an exemplary embodiment of a graphical user interface menu for displaying integrated speech effects and styles as marked up text
- Fig. 22 provides an exemplary embodiment of a graphical user interface menu for displaying a speech style editor
- Fig. 23 provides an exemplary embodiment of a graphical user interface menu for displaying a speech effect editor
- Fig. 24 provides an exemplary embodiment of a graphical user interface menu for implementing Bluetooth functionality associated with an SGD
- Fig. 25A provides an exploded view of portions the speech generation device illustrated in Fig. 1 in accordance with an aspect of the present invention.
- FIG. 25B provides an inverted exploded view of the portions of the speech generation device illustrated in FIG. 25A in accordance with an aspect of the present invention.
- Figs. 1-3 illustrate the various structural components of a first exemplary embodiment of a speech generation device (SGD) 100 in accordance with an aspect of the present invention.
- Structural components of a second exemplary SGD embodiment 400 are shown in and discussed with reference to Figs. 4-6.
- the internal hardware and software components and related functionality of both SGDs 100 and 400 may be as described with reference to Figs, 7-24. It should be appreciated that specific embodiments of an SGD in accordance with the disclosed technology should not be limited by the complete spectrum of hardware and software components described herein. Instead, an SGD may selectively incorporate one or more of such disclosed features and steps and others to create additional and
- a first exemplary SGD 100 corresponds to a hand-held device that is both portable and easy for a user to hold in his hands and operate to generate speech and accomplish additional functions as described herein.
- SGD 100 may include one or more outer housing, or casing components, such as a top casing component 102, a first bottom casing component 104 and a second bottom casing component 106.
- a battery is stored between the first and second bottom casing components 104 and 106, with or without the inclusion of a metal plating cover to help secure the battery therein.
- the top casing component 102 and second bottom casing component 106, together with speaker grille 108 collectively form the outer shell for SGD 100.
- Second bottom casing component 106 forms the bottommost surface of the outer shell.
- First and second opposing sides 110 and 112 of the outer shell formed by casing components 102-106 are generally shaped in a convex fashion and with a predetermined thickness to fit within the palms of a user's hands.
- First and second generally convex extensions 114 and 116 extend from the second bottom casing component 106 to provide an ergonomic configuration for facilitated location and grip of a user's fingers.
- First and second convex extensions 114 and 116 generally extend along second bottom casing component 106 from the speaker grille 108 forming a front surface of SGD 100 towards a rear surface of SGD 100.
- Extensions 114 and 116 may taper towards the rear surface of SGD 100.
- First extension 114 may be provided generally adjacent or near to the first side 110 of second bottom casing component 106, while second extension 116 may be provided generally adjacent or near to the second side 112 of the second bottom casing component 106.
- top casing component 102 may include a recessed area 118 or other feature for accommodating an outer touch screen 120. Top casing component 102 may also be formed to define a relatively large opening 122 therethrough to provide access to a display panel 124 which may be viewed through touch screen 120.
- Touch screen 120 may generally serve as an input feature for SGD 100, and is described later in more detail.
- Display panel 124 may generally serve as an output feature for SGD 100 and is also described later in more detail.
- the casing components forming the outer shell of SGD 100 may also be formed to define additional openings to accommodate data input and output.
- an opening 126 within top casing component 102 may be provided through which an LED or other light source can provide a device power indicator light.
- An opening 128 can provide a location for a button by which a user can toggle the power for SGD 100 in an "on" or "off” position.
- Another opening 130 formed within one or more casing components can provide a location for data input/output ports.
- Speaker grille 108 may be formed to define a first opening 132 for accommodating a volume control knob to control the volume of speakers associated with SGD 100, as well as a second opening 134 to provide a location for a USB port through which various peripheral devices may be coupled to SGD 100.
- PCBs printed circuit boards
- a PCB may be provided as a mounting surface or motherboard for such hardware and/or electronics components as a computer processing unit, hard drive(s) and/or other associated memory or media device(s).
- One or more PCBs may also serve as a mounting surface for radio or wireless communications modules, antennas, data buses, related integrated circuits and passive devices, and the like.
- First and second speakers 136 may also be positioned among and/or securely mounted to the PCBs and/or casing components. As shown in Fig. 3, speakers 136 may be respectively coupled via a first connection to first bottom casing component 104 and via a second connection to an associated PCB.
- Various connectors may be used to establish relative positioning among the various casing components and secure the components to form an integrated assembly.
- mating holes, pins and standoffs may be used to orient adjacent casing components to one another.
- Snap-fit features, threaded screws and/or other connectors may be used to secure and connect the casing components together.
- FIG. 4-6 A second exemplary embodiment of a speech generation device is shown in Figs. 4-6, respectively. From a functional perspective, such embodiment 400 may include some or all of the same or similar internal hardware and software features as the first exemplary embodiment of Fig. 1-3.
- the overall size and dimension of SGD 400 compared to SGD 100 may vary. For example, one exemplary embodiment of SGD 100 corresponds to a size with dimensions 6.15" x 4" x 1.9".
- One exemplary embodiment of SGD 100 corresponds to a size with dimensions 6.15" x 4" x 1.9".
- One exemplary embodiment of SGD 100 corresponds to a size with dimensions 6.15" x 4" x 1.9".
- One exemplary embodiment of SGD 100 corresponds to a size with dimensions 6.15" x 4" x 1.9".
- One exemplary embodiment of SGD 100 corresponds to a size with dimensions 6.15" x 4" x 1.9".
- One exemplary embodiment of SGD 100 corresponds to a size with dimensions
- SGD 400 corresponds to a size with dimensions 10.8" x 8.5" x 1.9".
- the SGD can incorporate one or more of the foregoing features and yet weigh less than 2 pounds when including a 5 1 /4-inch diagonal display and less than 3 pounds when incorporating at least a 10-inch display screen.
- SGD 400 generally includes several structural components, including a display lens 402, a top casing component 404, an internal casing component 406, a bottom casing component 408, a speaker grille 410 and a battery door 412.
- Display lens 402 may be a protective covering or may correspond to the outer surface of a touch-screen LCD or other display device for use with SGD 400, as described later.
- Top casing component 404 is formed to define an opening therein for accepting the display lens 402 and for providing viewing access to a display device.
- top casing component 404 may be formed with a material (e.g., chrome) selected to highlight the outer perimeter of SGD 400 as viewed by a user and to place emphasis on the relatively thin edge profile of SGD 400.
- Internal casing component 406 provides a structural support and backer for the device display. Internal hardware components separate from the display may be mounted and secured between the internal casing component 406 and the bottom casing component 408.
- Bottom casing component 408 may be formed to define a first opening 414 to access data input/output (I/O) ports. Opening 414 may be secured with a data door that can be flush with the surrounding surface of bottom casing component 408.
- Bottom casing component 408 may also be formed with a second opening by which a user may access a battery associated with SGD 400. The battery door 412 with latch feature is provided for securing the battery within second opening.
- first and second opposing sides 430 and 432 of the outer shell formed by casing components 404 and 408 are generally shaped in a convex fashion and with a predetermined thickness to fit within the palms of a user's hands.
- the bottom casing component 408 of SGD 400 may also include first and second generally convex extensions 418 and 420 projecting from a lower surface of bottom casing component 408.
- First and second convex extensions 418 and 420 generally extend along second bottom casing component 408 from the speaker grille 410 forming a front surface of SGD 400 towards a rear surface of SGD 400. Extensions 418 and 420 may taper towards the rear surface of SGD 400.
- First extension 418 may be provided generally adjacent or near to the first side of bottom casing component 408, while second extension 420 may be provided generally adjacent or near to the second side of the bottom casing component 408. Further, the first and second extensions 418 and 420 may define generally concaveiy shaped recessed areas 434 and 436, which may also serve as ergonomic features of the SGD 400. For example, the concave recessed areas 434 and 436 may be configured to facilitate location and grip of a user's fingers as they are wrapped around the first and second extensions 418 and 420.
- the outer casing components used in first and second exemplary SGD embodiments 100 and 400 may be molded from any substantially rigid and lightweight material.
- one or more outer casing components are made from a material such as but not limited to plastic, thermoplastic, polymer, polyethylene, or resin material.
- one or more outer casing components are made from magnesium or an alloy thereof.
- magnesium is used in the casing components, it provides several advantages for an SGD and enables the case to be an active functional component and not just a protective component.
- a magnesium casing provides conductive and radiated immunity to the components mounted within the magnesium case without requiring separate shielding.
- the magnesium prevents intrusion of radiated conducted emissions that otherwise might effect the electronic components of the SGD disposed within the case.
- the physical properties of the magnesium also provide a superior shielding from
- EMI electromagnetic interference
- a magnesium case also provides heat dissipation features for an SGD.
- one or more of the heat generating components of the SGD may be thermally coupled to the magnesium case to enable the case to function as a heat sink.
- Figs. 25A and 25B exploded views of several of the components of the SGD 100 illustrated in Figs. 1-3 are depicted. As shown, the SGD's
- microprocessors 180 are mounted on a circuit board 182 of the SGD 100. Additionally, to enable heat generated from the microprocessors 180 to be conducted through the magnesium case, the microprocessors 180 may be coupled to any suitable thermally conductive plates 184, such as plates formed from aluminum or various other metals. The thermally conductive plates 184 may then be directly coupled to a portion of the magnesium case, such as an inner surface 186 of the the bottom casing component 106 of the magnesium case. Thus, any heat geneated by the microprocessors 180 may be conducted through the thermally conductive plates 184 and into the bottom casing conmponent 106.
- processors 180 may be coupled to the plates 184 and the plates 184 may be coupled to the casing component 106 using any suitable means, such as by welding the components together or using mechanical fasteners (e.g., screws, bolts and the like). Additionally, in an alternative embodiment, the microprocessors 180 may be directly coupled to the magnesium case to provide heat transfer directly from the microprocessors 180 to the case.
- heat dissipation features can eliminate the need for cooling fans, which conserves the power demands on the SGD's lithium polymer battery, reduces the SGD's background noise level, which is especially desirable for a speaking device, and eliminates moving parts of the fan that otherwise are subject to breakage upon dropping of the SGD and therefore improves the overall durability of the SGD.
- the magnesium case provides greater structural integrity with low weight and high strength than conventional plastic casings.
- it permits the walls of the case to be thinner than would be the situation in a fixotropic molded plastic case in a conventional SGD.
- the strength to weight ratio of the magnesium case thus is also greater than might otherwise be if molded from fixotropic plastic requiring thicker walls in order to achieve the same durability or even less durability than is possible with the magnesium case of the present invention.
- Injection molding magnesium produces better complex net shape requiring less machining and secondary operations. Bosses can be drilled and tapped thus eliminating the need for inserts or self-tapping screws which allow the case to be vibration and resonance free, thus improving the sound quality.
- the electronic components may include a combination of hardware, software and/or firmware elements, all of which either correspond to physical tangible apparatuses or which are embedded as instructions on a physical and tangible apparatus such as a computer-readable storage medium.
- the electronic components of an SGD enable the device to transmit and receive messages to assist a user in communicating with others.
- the SGD may correspond to a particular special-purpose electronic device that permits a user to communicate with others by producing digitized or synthesized speech based on configured messages.
- Such messages may be preconfigured and/or selected and/or composed by a user within a message window provided as part of the speech generation device user interface.
- a variety of physical input devices and software interface features may be provided to facilitate the capture of user input to define what information should be displayed in a message window and ultimately communicated to others as spoken output, text message, phone call, e-mail or other outgoing communication.
- central computing device 700 may include a variety of internal and/or peripheral components. Power to such devices may be provided from a battery 703, such as but not limited to a lithium polymer battery or other rechargeable energy source. A power switch or button 705 may be provided as an interface to toggle the power connection between the battery 703 and the other hardware components.
- any peripheral hardware device 707 may be provided and interfaced to the speech generation device via a USB port 709 or other communicative coupling.
- the components shown in Fig. 7 may be provided in different configurations and may be provided with different arrangements of direct and/or indirect physical and communicative links to perform the desired functionality of such
- a central computing device 700 is provided to function as the central controller within a SGD and may generally include such components as at least one memory/media element or database for storing data and software instructions as well as at least one processor.
- one or more processor(s) 702 and associated memory/media devices 704a and 704b are configured to perform a variety of computer-implemented functions (i.e., software-based data services).
- One or more processor(s) 702 within computing device 700 may be configured for operation with any predetermined operating systems, such as, but not limited to, Windows XP, and thus is an open system that is capable of running any application that can be run on Windows XP.
- Other possible operating systems include BSD UNIX, Darwin (Mac OS X), Linux, SunOS (Solaris/OpenSolaris), and Windows NT (XP/Vista/7).
- At least one memory/media device (e.g., device 704a in Fig. 7) is dedicated to storing software and/or firmware in the form of computer-readable and executable instructions that will be implemented by the one or more processor(s) 702.
- Other memory/media devices e.g., memory/media devices 704b are used to store data which will also be accessible by the processors) 702 and which will be acted on per the software instructions stored in memory/media device 704a.
- FIG. 7 may be provided as a single or multiple portions of one or more varieties of computer-readable media, such as, but not limited to, any combination of volatile memory (e.g., random access memory (RAM, such as DRAM 1 SRAM, etc.)) and nonvolatile memory (e.g., ROM, flash, hard drives, magnetic tapes, CD-ROM, DVD- ROM, etc.) or any other memory devices including diskettes, drives, other magnetic- based storage media, optical storage media and others.
- at least one memory device corresponds to an electromechanical hard drive and/or or a solid state drive (e.g., a flash drive) that easily withstands shocks, for example that may occur if the SGD is dropped.
- Fig. 7 shows two separate memory/media devices 704a and 704b, the content dedicated to such devices may actually be stored in one memory/media device or in multiple devices. Any such possible variations and other variations of data storage will be appreciated by one of ordinary skill in the art.
- a first portion of memory/media device 704b is configured to store input data received from a user for performing the desired functional steps associated with a speech generation device.
- data in memory 704b may include inputs received from one or more peripheral devices, including but not limited to touch screen 706, microphone 708 and other peripheral devices 710, which indicate a user's selections of text to be spoken by the SGD or other related actions.
- Memory device 704a includes computer-executable software instructions that can be read and executed by processor(s) 702 to act on the data stored in memory/media device 704b to create new output data (e.g., audio signals, display signals, RF communication signals and the like) for temporary or permanent storage in one of the memory/media devices.
- output data may be communicated to a peripheral output device, such as display device 712, speakers 714, cellular phone or RF device 716, wireless network adapter 718, or as control signals to still further components.
- Computing/processing device(s) 702 may be adapted to operate as a special-purpose machine by executing the software instructions rendered in a computer-readable form stored in memory/media element 704a.
- any suitable programming, scripting, or other type of language or combinations of languages may be used to implement the teachings contained herein.
- the methods disclosed herein may alternatively be implemented by hardwired logic or other circuitry, including, but not limited to application-specific circuits.
- the steps, features and algorithms described later, including but not limited to algorithms for extracting web words, implementing slots and fillers, utilizing vocabulary lists, applying speech inflection styles and effects, and the like, are provided as executable software instructions stored in the computer-readable medium of elements 704a and/or 704b or others.
- various input devices may be part of an SGD and communicatively coupled to the computing device 700.
- a touch screen 706 may be provided to capture user inputs directed to a display location by a user hand or stylus.
- Other exemplary input devices e.g., peripheral device 710) may include but are not limited to a peripheral keyboard, peripheral touch-screen monitor, peripheral microphone, mouse and the like.
- touch screen 706 includes a projective capacitive touchscreen instead of a keyboard, a mouse or a pressure sensitive touchscreen.
- a capacitive touchscreen panel is a sensor typically made of glass coated with a transparent conductor such as indium tin oxide (ITO). This type of sensor is basically a capacitor in which the plates are the overlapping areas between the horizontal and vertical axes in a grid pattern. Since the human body also conducts electricity, a touch on the surface of the sensor will affect the electric field and create a measurable change in the capacitance of the device. Thus, a transparent rigid panel covers the display screen and prevents false activations of the SGD when the transparent panel inadvertently is subject to pressure.
- ITO indium tin oxide
- Each of the icons on the display of the SGD is a capacitive switch, which is activated by the capacitance of the user's body rather than by pressure against the screen.
- a touch screen for use in accordance with the subject SGD corresponds to a capacitive touch screen such as offered for sale by Touch International of Austin, Texas.
- the incorporation of projective capacitive technology into a dedicated speech generating device has several advantages, especially relative to a typical resistive touch panel.
- a primary advantage stems from the fact that active sensing material can be applied to the back of any transparent material, in this case a super thin (0.043") chemically strengthened glass.
- the overall thinness and light weight are cruciai in a lightweight, handheld ambulatory design.
- the top glass surface can be sealed to a bezel associated with a front casing component, providing ingress protection to moisture and dirt as well as a flush mounting surface so there is no recess for debris or liquids to accumulate.
- the active electronics and graphics are applied to the back surface of the glass which makes the top scratch resistant and increases the durability of the panel.
- High resolution devices are generally available, such as on the order of 10,000 dpi (dots per square inch).
- Capacitive touch screens also don't typically require recalibrations, as may be needed with resistive or pressure-sensitive touch screens.
- Capacitive touch screens also provide advantages for conventional users of a speech generation device.
- a capacitive touch panel requires no activation force but only a slight contact, which is an advantage for a user who may have motor control limitations.
- Typical resistive sensors have a predetermined activation force required to physically short two conductive layers together. These top layers are external and can suffer failure due to scratching or physical breakdown of the spacing layer that separates the two conductive surfaces.
- Capacitive touch screens also accommodate multi-touch applications (i.e., a set of interaction techniques which ailow a user to control graphical applications with several fingers) as well as scrolling.
- the different types of input devices are configured with software instructions to accept user inputs in accordance with one or more access methods, including the following: a “Touch Enter”, “Touch Exit”, “Touch Auto Zoom”, “Scanning”, “Joystick”, “Audio Touch”, “Mouse
- Pause/Headtrackers "Morse Code” and/or "Eye Tracking" access methods.
- Touch Enter selection is made upon contact with the touch screen, with highlight and bold options to visually indicate selection.
- Touch Exit selection is made upon release as a user moves from selection to selection by dragging a finger as a stylus across the screen.
- Touch Auto Zoom a portion of the screen that was selected is automatically enlarged for better visual recognition by a user.
- highlighting is used in a specific pattern so that individuals can use a switch (or other device) to make a seiection when the desired object is highlighted.
- Selection can be made with a variety of customization options such as a 1- switch autoscan, 2-switch directed scan, 2-switch directed scan, 1 -switch directed scan with dwell, inverse scanning, and auditory scanning.
- a button on the joystick which is used as a pointer and moved around the touch screen. Users can receive audio feedback while navigating with the joystick.
- an “Audio Touch” mode the speed of directed selection is combined with auditory cues used in the "Scanning” mode.
- selection is made by pausing on an object for a specified amount of time with a computer mouse or track ball that moves the cursor on the touch screen.
- An external switch exists for individuals who have the physical ability to direct a cursor with a mouse, but cannot press down on the mouse button to make selections.
- a "Morse Code” option is used to support one or two switches with visual and audio feedback. In “Eye Tracking” modes, selections are made simply by gazing at the device screen when outfitted with eye controller features and implementing selection based on dwell time, eye blinking or external switch activation.
- SGD hardware components may also include one or more integrated output devices communicatively coupled to the central processing unit, such as but not limited to a display device 712 and speakers 714.
- Display device 712 may correspond to suitable monitor, screen or other output device for visual presentation of to a user.
- Suitable examples include but are not limited to a light-emitting diode (LED) display, electroluminescent display (ELD), plasma display panel (PDP), and liquid crystal display (LCD).
- LED light-emitting diode
- ELD electroluminescent display
- PDP plasma display panel
- LCD liquid crystal display
- One particular embodiment corresponds to an LCD such as an LW500AC9001 component offered for sale by Chi Mei Optoelectronics (CMO) of Taiwan.
- Speakers 714 may generally correspond to any compact high power audio output device, for example, a GC0401 low profile paper and mylar speaker such as offered for sale by Amaoto Industrial Co., Ltd. (A.I. C.) of Taiwan.
- Speakers 714 may function as an audible interface for the speech generation device when computer processor(s) 702 utilize text-to-speech functionality.
- a user provides text, symbols corresponding to text, and/or related or additional information in a "Message Window" which may then be interpreted by a text-to-speech engine and provided as audio output via the speakers 714.
- the SGD also includes an e-book reader that can be controlled by the user to read the e-book and have the e-book speak the words being read to the user.
- Speech output may be generated in accordance with one or more preconfigured text-to-speech generation tools in male or female and adult or child voices, such as but not limited to such products as offered for sale by Cepstral, HQ Voices offered by Acapela, Flexvoice offered by Mindmaker, DECtalk offered by Fonix, Loquendo products, VoiceText offered by NeoSpeech, products by, AT&T's Natural Voices offered by Wizzard, Microsoft Voices, digitized voice (digitally recorded voice clips) or others.
- a volume control module 722 may be controlled by one or more scrolling switches or touch-screen buttons.
- SGD hardware components may also include various communications devices and/or modules, such as but not limited to an antenna 715, cellular phone or RF device 716 and wireless network adapter 718.
- Antenna 715 can support one or more of a variety of RF communications protocols.
- a cellular phone or other RF device 716 may be provided to enable the user to make phone calls directly and speak during the phone conversation using the SGD, thereby eliminating the need for a separate telephone device.
- a wireless network adapter 718 may be provided to enable access to a network, such as but not limited to a dial-in network, a local area network (LAN), wide area network (WAN), public switched telephone network (PSTN), the Internet, intranet or ethernet type networks or others.
- These communications modules (or others, such as but not limited to an infrared (IR) transceiver) may be provided to function as a universal remote control for the SGD that can operate devices in the user's
- TV Portable music player
- DVD player Portable music player
- a dedicated communications interface module 720 may be provided within central computing device 700 to provide a software interface from the processing components of computer 700 to the communication device(s).
- communications interface module 720 includes computer instructions stored on a computer-readable medium as previously described that instruct the communications devices how to send and receive communicated wireless or data signals.
- Antenna 715 may be provided to facilitate wireless communications with other devices in accordance with one or more wireless communications protocols, including but not limited to BLUETOOTH, WI-FI (802.11 b/g) and ZIGBEE wireless communication protocols.
- the antenna 715 enables a user to use the SGD with a BLUETOOTH headset for making phone calls or otherwise providing audio input to the SGD.
- the SGD can also generate BLUETOOTH radio signals that can be used to control a desktop computer, which appears on the SGD's display as a mouse and keyboard.
- An example of a suitable antenna for such devices is a surface mount Rufa brand antenna such as offered for sale by Antenova Ltd. of Cambridge, United Kingdom.
- the communications interface module 720 includes functional features by which the SGD can function as a Bluetooth radio, users can utilize the flexible input methods and software configurations of an SGD to control and operate a separate desktop or laptop computer.
- the SGD appears to the desktop as a Human Interface Device (HID) device which allows the user who may not otherwise be able to type or control a mouse to operate the computer by taking advantage of the accessibility options provided by the SGD and its specialized hardware and software features.
- HID Human Interface Device
- a user plugs a Bluetooth access control module into a USB port on the user's personal computer and performs a communication pairing procedure that establishes short-range wireless communication connectivity between the SGD and the personal computer. Similar steps may be followed to establish Biuetooth connections between an SGD and Bluetooth-enabled headset of keyboard/mouse, etc.
- a switch may be provided for users to mechanically actuate a selection on the SGD and then communicate that selection via Bluetooth protocols. Switching is often used when an SGD operates in a user-input mode where choices are scanned across the display as visual options or sequenced within an audio output and a user can then select one of the scanned options upon selection by switch. Scanning users often rely on switches located wherever the user has consistent and reliable motor control.
- Switches may be located on a head rest, seat, leg support, etc. Many conventional switches are hard-wired and include cables that are routed from the device around a wheelchair or other mounting location to the accessibility points. Provision of a
- Bluetooth-communicating input switch eliminates the potential for wire tangling, thus providing a more convenient and safer environment for the user in a wheelchair with moving parts.
- Bluetooth communications features involves the benefits of a Bluetooth audio pathway. Many users utilize an option of auditory scanning to operate their SGD. A user can choose to use a Bluetooth-enabled headphone to listen to the scanning, thus affording a more private listening environment that eliminates or reduces potential disturbance in a classroom environment without public broadcasting of a user's communications.
- a Bluetooth (or other wirelessly configured headset) can provide advantages over traditional wired headsets, again by overcoming the cumbersome nature of the traditional headsets and their associated wires.
- Fig. 24 shows an exemplary user interface menu provided via software features of the present invention that provides specific functional selection mechanisms for a user to implement Bluetooth functionality.
- the Bluetooth menu shown in Fig. 24 may include a variety of features, including a Bluetooth devices viewport 2400, an Unpair Bluetooth device button 2402, a Refresh button 2404, a locator button 2406, a Bluetooth device power button 2408, a device details group box 2410, a
- Bluetooth devices viewport 2400 displays icons for all Bluetooth-enabled devices to which an SGD is currently connected, paired to, or that are available for connection.
- Unpair Bluetooth device button 2402 breaks the Bluetooth connection between an SGD and the Bluetooth- enabled device selected in the viewport. In other words, button 2402 unpairs a device and removes the device's icon from the viewport.
- Refresh button 2404 starts a new search for available Bluetooth-enabled devices.
- Locator button 2406 is selected if the Bluetooth-enabled device to which a user wants to connect is not displayed in the viewport 2400.
- Actuation of button 2406 opens a dialog box instructing the user to ready the device for pairing and then triggers a new search for available Bluetooth- enabled devices.
- Bluetooth device power button 2408 toggles the Bluetooth radio in the SGD on and off to enable or disable all Bluetooth connections between the SGD and Bluetooth-enabled devices.
- Device details group box 2410 lists details of the Bluetooth- enabled device selected in viewport 2400.
- Connect/disconnect button 2412 toggles between enabling and disabling a Bluetooth connection with the specific device selected in the viewport 2400.
- Flash button 2414 causes the Bluetooth access control module plugged into a USB port of the Bluetooth device to flash an indicator light so that the Bluetooth connection can be verified.
- an exemplary SGD embodiment includes an integrated cell phone
- a user is able to send and receive wireless phone calls and text messages.
- the cell phone component 716 shown in Fig. 7 may include an access port for a subscriber identity module (SIM) card by which a user can provide requisite information for identifying user information and cellular service provider, contact numbers, and other data for cellular phone use.
- SIM subscriber identity module
- associated data storage within the SGD itself can maintain a list of frequently-contacted phone numbers and individuals as well as a phone history or phone call and text messages.
- One or more memory devices or databases within a speech generation device may correspond to computer-readable medium that may include computer-executable instructions for performing various steps/tasks associated with a celiular phone and for providing related graphical user interface menus to a user for initiating the execution of such tasks.
- the input data received from a user via such graphical user interfaces can then be transformed into a visual display or audio output that depicts various information to a user regarding the phone call, such as the contact information, cail status and/or other identifying information.
- General icons available on SGD or displays provided by the SGD can offer access points for quick access to the cell phone menus and functionality, as well as information about the integrated cell phone such as the cellular phone signal strength, battery life and the like.
- CDEC coder/decoder
- DSP digital signal processor
- an exemplary graphical user interface menu provided via software features of the present invention provides for an exemplary phone contacts menu for use with a cellular phone integrated with a speech generation device.
- a contact list can contain identifying information (e.g., address, phone number(s), e-mail address(es), etc.) for people in one or more groups such as family, friends, physicians, clinicians, or others. Once a contact list is established, a user can selectively employ the contact information to streamline cell phone, text messaging and e-mail
- the exemplary contacts menu can include the following items: a contacts viewport 800, contact symbol display 801 , contacts add button 802, contacts edit button 804, contacts delete button 806, contacts search button 808, contacts place call or send text button 810, or others.
- the contacts viewport generally displays the particular information about a contact as stored in computer-readable memory associated with the SGD.
- contacts viewport 800 may display each contact in a user's list, including the phone number(s), e-mail address(es), and a predetermined or selected symbol.
- a larger version of the contact symbol is provided as display 801 , to the right of the viewport 800.
- Contacts add button 802 is provided for selection by a user in order to add a new contact to the contact list.
- Contacts edit button 804 is provided for selection by a user in order to revise a currently selected contact.
- Contacts delete button 806 is provided for selection by a user in order to remove the currently selected contact.
- Contacts search button 808 is provided for selection by a user in order to search through the contact list using first or last name.
- Button 810 is provided for a user to place a call or send a text message to one of the numbers for the currently selected contact.
- a user Before a user can use the contacts menu shown in Fig. 8, a user first provides general contact information about a user.
- An example of the user interface menu for providing initial or subsequent edits for a particular contact is shown in Fig. 9.
- the edit contact menu of Fig. 9 can be accessed and then opened to enter information for the new contact or to edit information for an existing contact.
- contact information may selectively include one or more of the following: name, address, specifically designated ringtone, personalized custom greeting, home phone number, work phone number, cellular phone number, one or more e-mail addresses, and the like.
- a picture may also be selected by a user to provide a symbol
- a phone menu enables a user to dial outgoing phone calls, access the contacts list and view calling history.
- a standard 12-button number pad 1000 is provided by which a user can manually dial a phone number to which an outgoing phone call may be placed.
- a power button 1002 may be used to turn the integrated cell phone on or off. Turning the phone off can extend battery life when the user is not using the integrated phone feature.
- a hide button 1004 can minimize the phone menu so a user can navigate through the communication pages while the phone is active.
- a close button 1006 closes the phone menu, thus disconnecting any active phone calls.
- a viewport 1008 displays the phone number that a user is calling.
- a dial button 1010 enables a user to place an outgoing call to the number shown in the viewport.
- a backspace button 1012 enables a user to erase the last number in the viewport.
- a clear button 1014 enables a user to clear all numbers from the viewport.
- a contacts button 1016 enables a user to open the contacts menu (e.g., as shown in Fig. 8) to search through the user's list of contacts.
- a view call history button 1018 enables a user to display records of all incoming and outgoing phone calls.
- a user can select the dial button 1010 available on the phone menu of Fig. 10.
- a user can use the place call button 810 available from the contacts menu of Fig. 8.
- a user may have programmed a customized button on the SGD to perform the Dial Cell Phone action. Further selection of a programmed "Speak” button can be used to select a message window with content in it so that a user can use the SGD to "speak" the user's message through the integrated cell phone. Similar features can be used to answer an incoming call. In one example, a user will receive an incoming call via the integrated cell phone feature causing the SGD to ring like a telephone.
- An alert box may also be shown on the SGD's display device with information such as the incoming call name and number and selectable input buttons for the user to select either to answer or ignore the incoming call. If the incoming call is from an entity listed in the user's contacts menu, then the contact's symbol may also be provided within an incoming call alert box.
- a text messaging menu enables a user to send a text message and to review the history of all or selected text messages that a user has sent and received.
- a viewport 1100 enables a user to select from a variety of tabs (Unread, Read, Sent, Drafts or Recent) to display a category of text messages in the viewport. Once messages are shown, buttons displayed to the right of viewport 1100 (e.g., View button 1102, Delete button 1104, and Delete All button 1106) will become available to help a user manage the text messages within each tab.
- a number field 1108 enables a user to send a text message to the number shown in the number field 1108.
- a text field 1110 visually displays to a user the body of a text message as it is entered.
- a contact button 1112 provides access to the current contact's numbers. If no contact is selected, the contact button 1112 enables a user to select a contact from available contacts.
- a send message button 1114 enables a user to send the created text message.
- a user may have a variety of options. For example, a user can create a new text message by opening the text messaging menu of Fig. 11 , entering the text message manually and either entering the phone number manually or finding a contact number using the contacts button. A user may alternatively send a quick text message by sending the current contents of a message window as a text message to a specific phone number. In this fashion, a user will be prompted to enter the phone number when the user selects the quick text programmed button. A user can also enter the phone number manually or select a number from the contacts menu to send a quick text message.
- an exemplary SGD embodiment may also include features for connecting to the Internet, for example features including a wireless network adapter 718 such as shown in Fig. 7 or other communications module.
- Additional executable instructions stored in memory associated with central computing device provide a web browser to serve as a graphical user interface for interacting with the Internet.
- software instructions may be provide to call preconfigured web browsers such as Microsoft Internet Explorer or Firefox® internet browser 3Od available from Mozilla software.
- menus and features related to SGD Internet access are now discussed with reference to Figs. 12-15.
- such menus and features allow a user to not only achieve access to the Internet for customary purposes, but also to utilize information accessed via the Internet to further enhance the functionality of an SGD.
- an SGD web browser can be provided to enable the user to go to a webpage and extract key vocabulary from that web page to use the extracted
- the SGD can also upgrade its own software by connecting to the internet and enabling the vendor of the SGD to update the SGD's software from the vendor's dedicated servers. Again, it should be appreciated that all graphical user interfaces and menus that display
- buttons or other features that are selectable by a user correspond to user input features that when selected trigger control signals being sent to the central computing device within an SGD to perform an action in accordance with the selection of the user buttons.
- a set 1201 of go back/forward buttons enables a user to move backwards or forwards by one or more previously selected pages through the web browsing history.
- a reload current page button 1202 refreshes the current web page.
- a stop loading this page button 1203 is selected to provide an instruction to stop loading the current web page.
- a home button 1204 enables a user to open up the web page that the user has designated as a "Home" page.
- a Zoom in button 1205 enables a user to increase the font size of the current web page.
- a zoom out button 1206 enables a user to decrease the font size of the current web page.
- a reset button 1207 enables a user to return the font size on the current web page to standard or default size.
- An address text button 1208 enables a user to select this text box to enter a URL (web page address) that the user wants to open. When the address button 1208 is selected, such action may open up the system keyboard so that a user can enter the URL (web address) for the page the user wants to visit.
- a search text box button 1209 is selectable by a user to enter a keyword to use a search engine (e.g., Google, Yahoo, or other preconfigured searching tool available via the Internet) to search on the keyword.
- a Grab and Drag mode button 1210 enables a user to toggle the "grab and drag" setting in a web browser, such as may enable a user to select text and "flick" the screen to scroll up or down.
- interface menus may be provided that enable a user to incorporate key words located on the Internet (i.e., "web words") into a user- customized vocabulary list. Such identified web words may then be incorporated into a user's custom message by selecting such words from a vocabulary list.
- a user may first enable the wireless network connection (e.g., Wi-Fi connection) available via the SGD, launch a provided web browser (e.g., MOZILLA FIREFOX) and navigate to a web page that includes the words of interest. On the page of interest, a user can select one or more portions of interest (e.g., the highlighted text box shown in Fig. 13).
- a text menu 1300 and possible sub-menu 1310 are provided, for example as also shown in Fig. 13. Such menus may be provided automatically (e.g., via pop-up window upon selection) or by selecting a button after the text has been selected by a user.
- Various options available from the text menu 1300 include a selectable "speak text” feature that instructs an SGD to immediately speak the selected text.
- Another feature is a "send to message window” feature that sends the selected text to a message window where additional actions can be selected.
- An "add vocabulary list as one item” feature enables a user to add the selected text to a vocabulary list as a single block.
- An "add to vocabulary list as multiple items" feature enables a user to add the selected text to a vocabulary list as individual words or phrases.
- a sub-menu 1310 may appear, listing all of the existing vocabulary lists available to a user that are enabled to accept web content. The specific vocabulary list desired by a user may then be selected. Selection may include a previously configured list or a new list.
- a user may view the words in a vocabulary list box, for example, as shown in Fig. 14.
- an SGD may be configured to either speak the word immediately or add the selected word into a message window, or both. Additional details regarding vocabulary list boxes and related features are presented below.
- a still further internet-related feature available via an SGD corresponds to a user interface for importing symbols from the Internet,
- such feature enables a user to select a picture available from an authorized source on a web page and import it into a custom symbol set for use by the user.
- authorized sources include sources whose pictures and symbols are owned by the user, sources with pictures and symbols which the user has permission to use and access, and sources who provide pictures and symbols in the public domain. In order to perform such symbol
- a user may first enable the wireless network connection (e.g., Wi- Fi connection) available via the SGD, launch a provided web browser (e.g., Mozilla Firefox) and navigate to a web page that includes the picture(s) or graphic(s) of interest.
- a user can select the picture or graphic to initiate the display of a web symbol menu 1500 as shown in Fig. 15.
- An "add to my symbols” feature saves the picture with an automatically generated name.
- An "add to my symbols with name” feature prompts the user to enter a unique and meaningful name for the symbol and then saves it to the SGD symbol library.
- the imported symbol is now available to use as background on a page or a popup, as a symbol on a user button, or in another interactive fashion.
- exemplary steps performed in a software algorithm for creating web words is presented.
- integrated web access for vocabulary generation in a speech generating device affords a method to select content from the web, automatically extract useful terms and phrases, and use them immediately in aided communication.
- software steps to perform such a method are stored as executable instructions in a computer-readable medium such as previously described. Execution of the steps is selectively activated by a user based on the user's selection of text and options available via menus, such as those illustrated in Figs. 13, 14 and 15.
- a vocabulary source accessed by the user.
- the vocabulary source may correspond, for example, to text available from a document, web page or other data source. It should be appreciated that information available from the Internet may greatly expand the potential type and amount of information that is available to a user as words that can be integrated into messages and either spoken or communicatively transmitted via a speech generation device. For example, extraction of useful web words may facilitate rapid communication about presently relevant information, including current events, news stories, sports stories, and/or information of a specific technical nature with specialized terminology.
- the web serves as the source of vocabulary augmenting the information typically delivered with a communication device, such as the names of people, places, businesses, sports teams, etc.
- Users can use the integrated web browser to find information related to that which the user wants to communicate about.
- Electronic input is received by an SGD in step 1602 indicating selected text from which to extract useful vocabulary.
- the user may collect one or more pieces of such text, and provide input to the SGD that tags selected pieces or each piece in step 1604 with a user-defined tag indicating a general category of interest, separating the text into tagged buckets.
- each such text bucket may be separately analyzed to determine words and phrases of interest, as described in more detail below. The words and phrases are then made available for use in communication in a scrolling view that presents the words and phrases.
- the words and phrases may be augmented in step 1608 with a picture if a suitable picture is found in an internal database whose label matches the word or phrase.
- Discovered vocabulary may also be used in other ways once it has been discovered.
- the lists of discovered vocabulary may be added to a permanent internal database of vocabulary available to the user at all times.
- One use of the vocabulary lists is with the slot mechanism described below.
- the lists of discovered vocabulary may serve as source material for constructing more traditional
- Vocabulary items can be directly selected and inserted onto buttons on communication pages, along with any matching symbols.
- Discovered vocabulary may be entered into an internal dictionary and used to augment word prediction. The discovered words will become part of the database of words that get predicted in response to partial inputs.
- the text bucket analysis of step 1606 may include a variety of specific software-based rules. For example, selected text may be examined to discover interesting new words and phrases by extracting various identified items: Any new word not in an SGD's internal dictionary; Any noun, adjective, or verb from an internal dictionary that has a frequency of occurrence rating below some threshold; Phrases consisting of adjacent words from the text that contain either interesting words (by the above definitions) or capitalized words. All such words and/or phrases, up to a threshold length, are extracted from the text. Any such phrases that occur more than some threshold number of times in the text are added to the list of extracted words and phrases.
- Audio samples will often be used to augment synthesized text-to-speech for
- Words, phrases, symbols and other information either stored on a SGD or imported as described above can be grouped and used in preconfigured or customized compilations referred to herein as vocabulary lists or vocabulary list boxes.
- Vocabulary list boxes enable a user to have a wide variety of words and phrases immediately available. By listing groups of related words and phrases, vocabulary list boxes enable a user to quickly search through a wide range of text options when composing a message. For example, a user can select a particular group of words and/or phrases and associate all selected items into a new vocabulary list, which may be named and optionally assigned a unique symbol to visually represent the vocabulary list.
- Setting options characterizing each vocabulary list may also be defined, for example, an indication of whether the vocabulary list should accept new words and phrases from sources available on the Internet. Once a vocabulary list is made, new words or phrases can be added to the vocabulary list and/or existing words or phrases may be edited or deleted. Features may also be provided such that existing vocabulary lists can be embedded into other vocabulary lists, thus providing a hierarchy of related sets and subsets of vocabulary lists. Features may be provided to add vocabulary searches into a vocabulary list or to sort and display the contents of a vocabulary list according to selected parameters such as alphabetization and frequency of use in messages.
- Features may also be provided to trigger actions performed by the SGD upon selection of an item from a vocabulary list, for example, to automatically "speak” or provide as audio output the words/phrases from a vocabulary list box immediately as it is selected by a user, or to send the words/phrases from the vocabulary list box to the Message Window as it is selected by a user.
- a user can get better access to more specific vocabulary on a related subject.
- software instructions may be configured to display an embedded vocabulary list in a similar fashion (i.e., as an icon) as any other item in a given vocabulary list box.
- the example in Fig. 17 shows a user interface displaying a vocabulary list box with the vocabulary list "Animals" having the "Animal Sounds” vocabulary list embedded within it.
- an embedded vocabulary list is selected by a user, its contents will then be displayed in the vocabulary list box. All the words in the original vocabulary list
- Animals e.g., aardvark, alligator, animal, ant, anteater, antelope, ape, armadillo, etc.
- Animal Sounds e.g., bark, bleat, buzz, chirp, cluck, croak, gobble, growl, hiss, etc.
- a search for words and phrases will automaticaiiy scan through a dictionary of words and phrases provided on the SGD and display ail words and phrases that meet the user-defined search criteria. For example, suppose that a user adds twelve words to an SGD dictionary and marks them all with the "Clothing" category. The next time a user opens a page with a vocabulary list containing a vocabulary search on "Clothing", those twelve new words will automatically appear in the vocabulary list box. By inserting a search itself to a vocabulary list, a search can be performed dynamically.
- results of an embedded vocabulary search can be intermixed with the other vocabulary items within the vocabulary list. They will appear side by side with the individual vocabulary items, and they can be sorted based on the settings of the vocabulary list box ⁇ alphabetically, as is, or by frequency of use).
- the SGD may further include executable instructions stored on a computer-readable medium that implement a unique method of communication rate enhancement that combines sentences and phrases with slot fillers, in general, implementation of a "slots" feature triggers software provision of mechanisms to insert a "slot" or placeholder in a text phrase. Each one of these placeholders is associated with a list of "fillers” that can potentially fill in this place in the text.
- the list of slots and their associated fillers are stored in a database internal to the software. When a user inserts the text containing the slot into a document, the slot position is filled with a default filler, and that filler is underlined in the text.
- Another variation on the slots protocol allows a user to declare to speak a text phrase containing one or more fillers. At which point the user is prompted to specify the filler values for each slot, and then the entire text phrase, with the filler values chosen by the user, is spoken.
- Slot fillers can optionally have pictures associated with them in which case those pictures can be used to augment the display of th ⁇ filler value. Slots can be added and deleted by the user, and filler values can also be added and deleted by the user.
- the slots can be populated with literal terms, words or phrases, vocabulary lists and related features as described above, embedded searches and/or embedded recursive slots. Moreover, this unique method optionally can be combined with symbols.
- a "slot" placeholder can also be included in button text, button labels and phrases. Specific interface options may be provided for a user to create a phrase with slots, adding slots to buttons, and work with slots in a Message Window.
- the capability embodied in the so-called “slots" protocol provides the user with the ability to fill in certain words in otherwise static text.
- the slots protocol provides the user with easy access to common words that can be used to complete a message in a variety of settings and situations. For example, in the phrase: "Can we have dinner now?", the slot is "dinner". Other commonly-used words like “breakfast” or “coffee” are slots that can be used interchangeably to complete similar messages like: “Can we have breakfast now?". Slots are additional tools that minimize necessary navigation to save the user's time and energy during message composition.
- slots are designed to provide a variety of vocabulary options while reducing the number of selections that a user must make to create a whole message. Slots also help to conserve space on the touch screen. Slots provide a user with easy access to all of the words associated with a particular vocabulary concept (or category). When a user selects a slot, the user can choose to replace the word that is currently filling the slot with another word from the same concept. Rather than build a dynamic message one word at a time, a user can create sentences that contain slots in key locations. When the phrase is added to the Message Window, a user can then select the slots (which are visually indicated in some fashion, for example displayed as blue underlined words) and replace the current words with different options.
- the message in the Message Window is the label text for a button.
- the first slot is associated with the "breakfast” concept and the second slot is associated with the "fruit” concept.
- the slots allow a user to create dynamic messages with a reduced number of selections. By selecting the slots and changing the filler text, the example phrase "I want oatmeal and a banana for breakfast” can quickly and easily be changed to read as follows: "I want toast and a nectarine for breakfast”.
- a user may be able to add slots to his customized phrase database - My Phrases.
- Adding slots to phrases is one way to maximize the potential of both a slot/filler rate enhancement feature as well as a phrase prediction rate enhancement feature.
- This technique provides a user with rapid access to complete statements, while still enabling the user to vary what he is going to say. For example, if a user tells an assistant what he wants to wear every morning, then he may want to create a phrase to say "I want to wear my jeans today.” Then, simply turn the word “jeans" into a slot that accesses the "clothing" concept.
- one or more areas of a user display can be filled with phrases as a user enters text into a document.
- the SGD can match the partially entered text to an internal database of text phrases and present those phrases whose starting characteristics match the partially entered text.
- Each phrase may also be assigned a priority rating (e.g., based on frequency of use or other characterizing feature). In the case where more phrases are found that match the partially entered text than there are buttons to fill, those phrases with the highest priority ratings are shown.
- the phrase database can be added to or deleted from by a user. Phrases may optionally have pictures associated with them in which case those pictures can be used to augment the display of the phrases on the buttons. Phrases may also contain slots with their associated fillers.
- a still further feature that may be provided in exemplary embodiments of a speech generation device corresponds to a voice inflection mark up feature that can add inflection to the speech output of the SGD.
- the speech output not only can be controlled as to the volume of the spoken words, as to the pitch of the voice
- Fig. 20 illustrates exemplary steps that may be performed in a method of implementing speech characterizations such as speech styles and/or speech effects.
- an SGD receives input from a user associating one or more speech styles to selected text.
- a user can selectively mark text that the user wants to be spoken with a specific stylistic variation and then apply the speech style, much like a user would apply a font style in a typical word processing application.
- User interface locations by which a user can apply these speech styles via actions can be embedded in user communication pages, making this feature available to all communication device users regardless of the user's physical disabilities or access needs.
- Input defining one or more associated speech styles can be embedded as markups directly in documents. As such, users can store their documents with stylistic variants for later recall and use. Once entered, a style markup can be selected and its properties modified.
- Supported speech styles may include variations defined by the following non-exhaustive list of exemplary parameters: voice, volume, pitch, speech rate, spell mode, emphasis and context.
- voice When defining a speech style according to voice, a user can select the specific text-to-speech voice to use to speak the selected text.
- volume When defining a speech style according to volume, a user can select the specific volume at which to speak the selected text.
- pitch When defining a speech style according to pitch, a user can select one or more specific pitch changes to apply to selected text.
- defining a speech style according to speech rate a user can select one or more rates of speed at which to speak the selected text.
- a speech style according to spell mode When defining a speech style according to spell mode, the user can select to spell out the selected text rather than pronouncing it normally as a word - e.g., "H-E-L-L-O" instead of "Hello.”
- a speech style according to emphasis a user can select the specific portions of text to emphasize in a spoken portion of text, for example by increasing the amount of voice inflection given to the emphasized text portions.
- a user can define when to declare the context of a selected portion of speech, e.g., to speak the selected portion as if it were a date, currency, or other contextually
- FIG. 22 An example of an interface menu that may be provided to a user for selecting speech styles is shown in Fig. 22.
- input may also be received by an SGD
- steps 2010 and 2012 are shown as separate and subsequent steps, steps 2010 and 2012 can actually occur intermixed in time and each can selectively occur multiple times as desired by a user in a method of implementing voice inflection.
- speech effects may correspond, for example, to different emotional effects, non-verbal communication noises, etc. such as but not limited to coughing, laughing, crying, kissing, sniffing, swallowing, sighing, whistling, yawning, or making noises such as "eh", "oh", “mmm”, etc.
- features may be provided such that speech effects can be embedded in user communication pages.
- An example of an interface menu that may be provided to a user for selecting speech effects is shown in Fig. 23, where selections can be made for a desired speech effect to be entered into a message window or spoken immediately upon selection.
- the marked up text may be displayed in a message window, for example as shown in Fig. 21.
- a first portion of text "Hi Jim!” has been selected and associated with a speech style defining a selected level of volume.
- a second portion of text, "That's” has been selected and associated with a speech style indicating desired emphasis on this first word in the sentence.
- a speech effect namely laughing, is inserted at the end of the given text to simulate a user's laugh.
- Such enhanced speech features provide a more interactive and enjoyable method of communicating for a user of a speech generation device.
- the marked-up text is relayed to an interpretation engine such that the marked text can be analyzed by software rules in step 2014 to determine an optimum speech-to-text engine for speaking the marked-up selected text and associated speech styles/effects.
- an interpretation engine may be configured to examine the graphical markup inclusions and target them for a specific text-to-speech engine, selecting the best match for a given speech style or effect and the underlying text-to-speech engine. Once the optimum text-to-speech engine is determined in step 2014, the actual text-to-speech spoken output may then be generated in step 2016.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephone Function (AREA)
Abstract
A speech generation device is disclosed. The device includes casing features that allow it to be operated as a hand-held device that is both portable and easy for a user to hold in his hands and operate to generate speech. Specific casing components may include opposing sides of an outer shell that are shaped in a convex fashion and with a predetermined thickness to fit within the palms of a user's hands. When a user positions his fingers/hands around the casing components of the device, thus operating the device as a hand-held device, the first and second extensions may be positioned on the bottom surface of the device and an outer speaker grille may be positioned to face outward from the user so that audio output from the device radiates outwardly from the user.
Description
HAND-HELD SPEECH GENERATION DEVICE
PRIORITY CLAIM
[0001] This application claims the benefit of previously filed U.S. Provisional Patent Application entitled "HAND-HELD SPEECH GENEfRATION DEVICE," assigned USSN 61/228,256, filed July 24, 2009, and which is fully incorporated herein by reference for all purposes.
BACKGROUND OF THE INVENTION
[0002] The present invention generally pertains to speech generation devices, and more particularly to portable speech generation devices.
[0003] Various debilitating physical conditions, whether resulting from disease or injuries, can deprive the afflicted person of the ability to communicate audibly with persons or devices in one's environment in real time. Injuries suffered during armed combat, whether by domestic police officers or by soldiers engaged in battle zones in foreign theaters, are swelling the population of potential users. Persons lacking the ability to communicate audibly can compensate for this deficiency by the use of speech generation devices. Some conventional speech generation devices are desktop devices, while others are rendered portable by being mounted on vehicles such as wheel chairs, and still other conventional portable speech generation devices can be carried by user as one would carry a brief case.
[0004] It is estimated that less than 10% of the potential users of speech generation devices currently is being served by conventional speech generation devices. This population is highly variable from the standpoint of a range in ages from preschool children through elderly adults and a variety of lifestyles, geographic locations, educational attainments, language sophistication, and available physical motor skills to operate the speech generation device.
[0005] Some speech generation devices, while portable, consume large amounts of power and must be located near an electrical outlet, and this requirement limits freedom
of movement of the user. Other conventional speech generation devices seek to overcome this problem with the provision of a battery, but in order to provide sufficient battery life, these so-called portable SGDs weigh five (5) pounds to seven (7) pounds and/or must be recharged within two (2) or three (3) hours. Because of the substantial power consumption, heat generation can be a problem in some conventional SGDs. While this problem can be addressed by the provision of fans, this solution tends to increase the size and weight of the SGD.
[0006] Conventional speech generation devices typically have dedicated processors, and the user is not given the ability to run other applications on the device. While some speech generation devices provide a touch panel screen and thus have eiiminated the need for a keyboard separate from the display panel, such touch screens are subject to false activations, which is especially a problem given the limited fine motor skills of some of the intended users. Notwithstanding the use of a molded fixotropic plastic, thick-walled case, most of the conventional SGDs are rendered inoperable and require repair upon being dropped or subjected to a moisture rich environment.
[0007] Due to the desire to limit the overall size, weight, and power consumption requirements, conventional SGDs are largely limited to the speech generation function that enables the user to communicate with other persons in the immediate vicinity of the user.
BRIEF SUMMARY OF THE INVENTION
[0008] In general, the present subject matter is directed to an improved speech generation device (SGD) having electronic components that enable the SGD to transmit and receive messages to assist a user in communicating with others. For example, the SGD may correspond to a particular special-purpose electronic device that permits a user to communicate with others by producing digitized or synthesized speech based on configured messages. Such messages may be preconfigured and/or selected and/or composed by a user within a message window provided as part of the speech generation device user interface. A variety of physical input devices and software interface features may be provided to facilitate the capture of user input to define what information should be displayed in a message window and ultimately communicated to others as spoken output, text message, phone call, e-mail or other outgoing
communication.
[0009] One exemplary SGD embodiment includes casing features that enable the SGD to be operated as a hand-held device that is both portable and easy for a user to hold in his hands and operate to generate speech and accomplish additional functions. Specific casing components to accomplish such features may include first and second opposing sides of an outer shell that are shaped in a convex fashion and with a predetermined thickness to fit within the palms of a user's hands. First and second generally convex extensions may extend from a bottom casing component to provide an ergonomic configuration for facilitated location and grip of a user's fingers. When a user positions his fingers/hands around the casing components of the SGD device, thus operating the SGD as a hand-held device, the first and second extensions may be positioned on the bottom surface of the SGD and an outer speaker grille may be positioned to face outward from the user so that audio output from the SGD radiates outwardly from the user to others within a user's environment.
[0010] In one exemplary SGD embodiment, one or more outer casing components are made from magnesium or an alioy thereof. When magnesium is used in the casing components, it enables the case to be an active functional component and not just a protective component. For example, the magnesium case provides conductive and radiated immunity to the components mounted within the magnesium case without requiring separate shielding. The magnesium case also provides heat dissipation features for an SGD by coupling the microprocessor to the case.
[0011] In another exemplary SGD embodiment, a capacitive touch screen is provided as an input device for the SGD instead of a keyboard, a mouse or a pressure sensitive touchscreen. Inclusion of such a feature is advantageous because active sensing material can be applied to the back of any transparent material to achieve a thin and lightweight design. Multi-touch interaction is enabled and the need for
recalibrations that may be needed with resistive or pressure-sensitive touch screens is obviated. In addition, use of a capacitive touch panel which requires no activation force but only a slight contact, is an advantage for a user who may have motor control limitations.
[0012] In still further exemplary SGD embodiments, hardware components may also include various communications devices and/or modules and related communications functionality. For example, a wireless network adapter may be included to provide integrated Internet access. An RF device may be included as an integrated cell phone to enable the user to send and receive text messages and phone calls directly and
speak during the phone conversation using the SGD, thereby eliminating the need for a separate telephone device, An infrared (IR) transceiver may be provided to function as a universal remote control for the SGD that can operate devices in the user's
environment, for example including TV, DVD player, and CD player. A Bluetooth interface may be included to provide Bluetooth radio signals that can be used to control a desktop computer, which appears on the SGD's display as a mouse and keyboard, or other Bluetooth-enabled devices.
[0013] Additional features that may be included in exemplary SGD embodiments concern computer readable medium that, when executed by a processor, instruct the SGD to perform specific functionality. For example, instructions stored in SGD memory may adapt the device to perform integrated web access for vocabulary generation in which content is selected from the Internet, and useful terms, phrases and/or images are extracted and available for immediate use in aided communication via the SGD.
[0014] Another feature that may be provided in exemplary SGD embodiments corresponds to a voice inflection mark up feature that can add inflection or other speech style to the speech output of the SGD or that can add a predefined speech effect.
Exemplary speech styles for modifying text may be defined in accordance with a variety of parameters, including but not limited to voice, volume, pitch, speech rate, spell mode, emphasis and context. Exemplary speech effects that may be inserted into a
communicated message may include but are not limited to coughing, laughing, crying, kissing, sniffing, swallowing, sighing, whistling, yawning, or making noises such as "eh", "oh", "mmm", etc. The selected speech styles and effects can help communicate different meanings just based on the different inflections imposed on different syllables of the words being spoken by the SGD. In addition, such speech modifiers provide more interactive and enjoyable communication by an SGD user.
[0015] Exemplary SGD embodiments may further include executable instructions stored on a computer-readable medium that implement a unique method of
communication rate enhancement that combines sentences and phrases with slot fillers. In general, implementation of a "slots" feature triggers software provision of
mechanisms to insert a "slot" or placeholder in a text phrase. Each one of these placeholders is associated with a list of "fillers" that can potentially fill in this place in the text. The list of slots and their associated fillers are stored in a database associated with other computer readable medium storing computer executable instructions for implementing the rate enhancement functionality.
[0016] Generally, the technical effect of the disclosed computer-implemented methods is to enhance the operation and functionality of the speech generation device of the present subject matter.
[0017] Additional objects and advantages of the invention will be set forth in part in the description that follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objects and advantages of the invention may be realized and attained by means of the instrumentalities and combinations particularly pointed out in the appended claims.
BRIEF DESCRIPTION OF THE DRAWINGS
[0018] The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate at least one presently preferred embodiment of the invention as well as some alternative embodiments. These drawings, together with the description, serve to explain the principles of the invention but by no means are intended to be exhaustive of all of the possible manifestations of the invention.
[0019] Fig. 1 provides a perspective view of a first exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention;
[0020] Fig. 2 provides another perspective view of a first exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention;
[0021] Fig. 3 provides an exploded view of a first exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention;
[0022] Fig. 4 provides a perspective view of a second exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention;
[0023] Fig. 5 provides a perspective view of a second exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention;
[0024] Fig. 6 provides an exploded view of a second exemplary embodiment of a portable speech generation device in accordance with an aspect of the present invention;
[0025] Fig. 7 provides a schematic diagram of exemplary hardware components for use within a speech generation device in accordance with an aspect of the present invention;
[0026] Fig. 8 provides an exemplary embodiment of a graphical user interface menu for providing a phone contacts menu for use with a cellular phone integrated with a speech generation device;
[0027] Fig. 9 provides an exemplary embodiment of a graphical user interface for providing a contact editing menu for use with a cellular phone integrated with a speech generation device;
[0028] Fig. 10 provides an exemplary embodiment of a graphical user interface menu for providing a phone menu for use with a cellular phone integrated with a speech generation device;
[0029] Fig. 11 provides an exemplary embodiment of a graphical user interface menu for providing a text messaging menu for use with a cellular phone integrated with a speech generation device;
[0030] Fig. 12 provides an exemplary embodiment of a graphical use interface menu for providing a web browser with menu bar use to access the internet with a speech generation device;
[0031] Fig. 13 provides an exemplary embodiment of a graphical user interface menu for selecting text from a web page;
[0032] Fig. 14 provides an exemplary embodiment of a graphical user interface menu for providing a vocabulary list box based on information captured from the internet;
[0033] Fig. 15 provides an exemplary embodiment of a graphical user interface menu for importing an online picture for use as symbols;
[0034] Fig. 16 provides a flow chart of an exemplary method for integrating text from a vocabulary source into a user vocabulary list;
[0035] Fig. 17 provides an exemplary embodiment of a graphical user interface menu for displaying an embedded vocabulary list in a vocabulary list box;
[0036] Fig. 18 provides a first view of an embodiment of a graphical user interface menu provided via software features for implementing slots and fillers;
[0037] Fig. 19 provides a second view of an embodiment of a graphical user interface menu provided via software features for implementing slots and fillers;
[0038] Fig. 20 provides a flow chart of an exemplary method for implementing speech styles and effects in spoken text;
[0039] Fig. 21 provides an exemplary embodiment of a graphical user interface menu for displaying integrated speech effects and styles as marked up text;
[0040] Fig. 22 provides an exemplary embodiment of a graphical user interface menu for displaying a speech style editor;
[0041] Fig. 23 provides an exemplary embodiment of a graphical user interface menu for displaying a speech effect editor;
[0042] Fig. 24 provides an exemplary embodiment of a graphical user interface menu for implementing Bluetooth functionality associated with an SGD;
[0043] Fig. 25A provides an exploded view of portions the speech generation device illustrated in Fig. 1 in accordance with an aspect of the present invention; and
[0044] Fig. 25B provides an inverted exploded view of the portions of the speech generation device illustrated in FIG. 25A in accordance with an aspect of the present invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[0045] Reference now will be made in detail to the presently preferred embodiments of the invention, one or more examples of which are illustrated in the accompanying drawings. Each example is provided by way of explanation of the invention, which is not restricted to the specifics of the examples. In fact, it will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the scope or spirit of the invention. For instance, features illustrated or described as part of one embodiment, can be used on another
embodiment to yield a still further embodiment. Thus, it is intended that the present invention covers such modifications and variations as come within the scope of the appended claims and their equivalents.
[0046] Referring now to the drawings, Figs. 1-3 illustrate the various structural components of a first exemplary embodiment of a speech generation device (SGD) 100 in accordance with an aspect of the present invention. Structural components of a second exemplary SGD embodiment 400 are shown in and discussed with reference to Figs. 4-6. The internal hardware and software components and related functionality of both SGDs 100 and 400 may be as described with reference to Figs, 7-24. It should be appreciated that specific embodiments of an SGD in accordance with the disclosed
technology should not be limited by the complete spectrum of hardware and software components described herein. Instead, an SGD may selectively incorporate one or more of such disclosed features and steps and others to create additional and
alternative embodiments,
[0047] As shown in Figs. 1-3, a first exemplary SGD 100 corresponds to a hand-held device that is both portable and easy for a user to hold in his hands and operate to generate speech and accomplish additional functions as described herein. SGD 100 may include one or more outer housing, or casing components, such as a top casing component 102, a first bottom casing component 104 and a second bottom casing component 106. In one embodiment, a battery is stored between the first and second bottom casing components 104 and 106, with or without the inclusion of a metal plating cover to help secure the battery therein. The top casing component 102 and second bottom casing component 106, together with speaker grille 108 collectively form the outer shell for SGD 100. Second bottom casing component 106 forms the bottommost surface of the outer shell.
[0048] Various features of the outer shell of SGD 100 are provided to accommodate portability of SGD 100 within a user's hands. First and second opposing sides 110 and 112 of the outer shell formed by casing components 102-106 are generally shaped in a convex fashion and with a predetermined thickness to fit within the palms of a user's hands. First and second generally convex extensions 114 and 116 extend from the second bottom casing component 106 to provide an ergonomic configuration for facilitated location and grip of a user's fingers. First and second convex extensions 114 and 116 generally extend along second bottom casing component 106 from the speaker grille 108 forming a front surface of SGD 100 towards a rear surface of SGD 100.
Extensions 114 and 116 may taper towards the rear surface of SGD 100. First extension 114 may be provided generally adjacent or near to the first side 110 of second bottom casing component 106, while second extension 116 may be provided generally adjacent or near to the second side 112 of the second bottom casing component 106. When a user positions his fingers/hands around the casing
components of the SGD device, thus operating the SGD as a hand-held device, the first and second extensions 114 and 116 may be positioned on the bottom surface of the SGD and the outer speaker grille 108 may be positioned to face outward from the user so that audio output from the SGD radiates outwardly to others within a user's environment.
[0049] Referring still to the outer shell formed by the casing components of SGD 100, top casing component 102 may include a recessed area 118 or other feature for accommodating an outer touch screen 120. Top casing component 102 may also be formed to define a relatively large opening 122 therethrough to provide access to a display panel 124 which may be viewed through touch screen 120. Touch screen 120 may generally serve as an input feature for SGD 100, and is described later in more detail. Display panel 124 may generally serve as an output feature for SGD 100 and is also described later in more detail.
[0050] The casing components forming the outer shell of SGD 100 may also be formed to define additional openings to accommodate data input and output. For example, an opening 126 within top casing component 102 may be provided through which an LED or other light source can provide a device power indicator light. An opening 128 can provide a location for a button by which a user can toggle the power for SGD 100 in an "on" or "off" position. Another opening 130 formed within one or more casing components can provide a location for data input/output ports. Speaker grille 108 may be formed to define a first opening 132 for accommodating a volume control knob to control the volume of speakers associated with SGD 100, as well as a second opening 134 to provide a location for a USB port through which various peripheral devices may be coupled to SGD 100.
[0051] One or more printed circuit boards (PCBs) may also be provided as structural components within SGD 100, and may be housed between top casing component 102 and first bottom casing component 104. For example, a PCB may be provided as a mounting surface or motherboard for such hardware and/or electronics components as a computer processing unit, hard drive(s) and/or other associated memory or media device(s). One or more PCBs may also serve as a mounting surface for radio or wireless communications modules, antennas, data buses, related integrated circuits and passive devices, and the like. First and second speakers 136 may also be positioned among and/or securely mounted to the PCBs and/or casing components. As shown in Fig. 3, speakers 136 may be respectively coupled via a first connection to first bottom casing component 104 and via a second connection to an associated PCB.
[0052] Various connectors may be used to establish relative positioning among the various casing components and secure the components to form an integrated assembly. For example, mating holes, pins and standoffs may be used to orient
adjacent casing components to one another. Snap-fit features, threaded screws and/or other connectors may be used to secure and connect the casing components together.
[0053] A second exemplary embodiment of a speech generation device is shown in Figs. 4-6, respectively. From a functional perspective, such embodiment 400 may include some or all of the same or similar internal hardware and software features as the first exemplary embodiment of Fig. 1-3. The overall size and dimension of SGD 400 compared to SGD 100 may vary. For example, one exemplary embodiment of SGD 100 corresponds to a size with dimensions 6.15" x 4" x 1.9". One exemplary
embodiment of SGD 400 corresponds to a size with dimensions 10.8" x 8.5" x 1.9". The SGD can incorporate one or more of the foregoing features and yet weigh less than 2 pounds when including a 5 1 /4-inch diagonal display and less than 3 pounds when incorporating at least a 10-inch display screen.
[0054] Referring now to Figs. 4-6, SGD 400 generally includes several structural components, including a display lens 402, a top casing component 404, an internal casing component 406, a bottom casing component 408, a speaker grille 410 and a battery door 412. Display lens 402 may be a protective covering or may correspond to the outer surface of a touch-screen LCD or other display device for use with SGD 400, as described later. Top casing component 404 is formed to define an opening therein for accepting the display lens 402 and for providing viewing access to a display device. An upper peripheral surface of top casing component 404 may be formed with a material (e.g., chrome) selected to highlight the outer perimeter of SGD 400 as viewed by a user and to place emphasis on the relatively thin edge profile of SGD 400. Internal casing component 406 provides a structural support and backer for the device display. Internal hardware components separate from the display may be mounted and secured between the internal casing component 406 and the bottom casing component 408. Bottom casing component 408 may be formed to define a first opening 414 to access data input/output (I/O) ports. Opening 414 may be secured with a data door that can be flush with the surrounding surface of bottom casing component 408. Bottom casing component 408 may also be formed with a second opening by which a user may access a battery associated with SGD 400. The battery door 412 with latch feature is provided for securing the battery within second opening.
[0055] Similar to the generally ergonomic features of SGD 100, first and second opposing sides 430 and 432 of the outer shell formed by casing components 404 and 408 are generally shaped in a convex fashion and with a predetermined thickness to fit
within the palms of a user's hands. Additionally, the bottom casing component 408 of SGD 400 may also include first and second generally convex extensions 418 and 420 projecting from a lower surface of bottom casing component 408. First and second convex extensions 418 and 420 generally extend along second bottom casing component 408 from the speaker grille 410 forming a front surface of SGD 400 towards a rear surface of SGD 400. Extensions 418 and 420 may taper towards the rear surface of SGD 400. First extension 418 may be provided generally adjacent or near to the first side of bottom casing component 408, while second extension 420 may be provided generally adjacent or near to the second side of the bottom casing component 408. Further, the first and second extensions 418 and 420 may define generally concaveiy shaped recessed areas 434 and 436, which may also serve as ergonomic features of the SGD 400. For example, the concave recessed areas 434 and 436 may be configured to facilitate location and grip of a user's fingers as they are wrapped around the first and second extensions 418 and 420.
[0056] The outer casing components used in first and second exemplary SGD embodiments 100 and 400 may be molded from any substantially rigid and lightweight material. In one embodiment, one or more outer casing components are made from a material such as but not limited to plastic, thermoplastic, polymer, polyethylene, or resin material. In another embodiment, one or more outer casing components (particularly the bottom casing components) are made from magnesium or an alloy thereof. When magnesium is used in the casing components, it provides several advantages for an SGD and enables the case to be an active functional component and not just a protective component.
[0057] First, a magnesium casing provides conductive and radiated immunity to the components mounted within the magnesium case without requiring separate shielding. The magnesium prevents intrusion of radiated conducted emissions that otherwise might effect the electronic components of the SGD disposed within the case. The physical properties of the magnesium also provide a superior shielding from
electromagnetic interference (EMI) signals. Typically plastic enclosures need a metal shield or shielding directly on components on the PCB to provide protection from radiated and conducted emissions.
[0058] A magnesium case also provides heat dissipation features for an SGD. In particular, one or more of the heat generating components of the SGD may be thermally coupled to the magnesium case to enable the case to function as a heat sink. For
example, referring to Figs. 25A and 25B, exploded views of several of the components of the SGD 100 illustrated in Figs. 1-3 are depicted. As shown, the SGD's
microprocessors 180 are mounted on a circuit board 182 of the SGD 100. Additionally, to enable heat generated from the microprocessors 180 to be conducted through the magnesium case, the microprocessors 180 may be coupled to any suitable thermally conductive plates 184, such as plates formed from aluminum or various other metals. The thermally conductive plates 184 may then be directly coupled to a portion of the magnesium case, such as an inner surface 186 of the the bottom casing component 106 of the magnesium case. Thus, any heat geneated by the microprocessors 180 may be conducted through the thermally conductive plates 184 and into the bottom casing conmponent 106. It should be apprecited that the processors 180 may be coupled to the plates 184 and the plates 184 may be coupled to the casing component 106 using any suitable means, such as by welding the components together or using mechanical fasteners (e.g., screws, bolts and the like). Additionally, in an alternative embodiment, the microprocessors 180 may be directly coupled to the magnesium case to provide heat transfer directly from the microprocessors 180 to the case. Regardless, such heat dissipation features can eliminate the need for cooling fans, which conserves the power demands on the SGD's lithium polymer battery, reduces the SGD's background noise level, which is especially desirable for a speaking device, and eliminates moving parts of the fan that otherwise are subject to breakage upon dropping of the SGD and therefore improves the overall durability of the SGD.
[0059] Additionally, the magnesium case provides greater structural integrity with low weight and high strength than conventional plastic casings. In particular, it permits the walls of the case to be thinner than would be the situation in a fixotropic molded plastic case in a conventional SGD. The strength to weight ratio of the magnesium case thus is also greater than might otherwise be if molded from fixotropic plastic requiring thicker walls in order to achieve the same durability or even less durability than is possible with the magnesium case of the present invention. Injection molding magnesium produces better complex net shape requiring less machining and secondary operations. Bosses can be drilled and tapped thus eliminating the need for inserts or self-tapping screws which allow the case to be vibration and resonance free, thus improving the sound quality.
[0060] Electronic components intended for selective use with a speech generation device in accordance with aspects of the present invention are shown in Fig. 7. The
electronic components may include a combination of hardware, software and/or firmware elements, all of which either correspond to physical tangible apparatuses or which are embedded as instructions on a physical and tangible apparatus such as a computer-readable storage medium.
[0061] In general, the electronic components of an SGD enable the device to transmit and receive messages to assist a user in communicating with others. For example, the SGD may correspond to a particular special-purpose electronic device that permits a user to communicate with others by producing digitized or synthesized speech based on configured messages. Such messages may be preconfigured and/or selected and/or composed by a user within a message window provided as part of the speech generation device user interface. As will be described in more detail below, a variety of physical input devices and software interface features may be provided to facilitate the capture of user input to define what information should be displayed in a message window and ultimately communicated to others as spoken output, text message, phone call, e-mail or other outgoing communication.
[0062] Referring now to Fig. 7, central computing device 700 may include a variety of internal and/or peripheral components. Power to such devices may be provided from a battery 703, such as but not limited to a lithium polymer battery or other rechargeable energy source. A power switch or button 705 may be provided as an interface to toggle the power connection between the battery 703 and the other hardware components. In addition to the specific devices discussed herein, it should be appreciated that any peripheral hardware device 707 may be provided and interfaced to the speech generation device via a USB port 709 or other communicative coupling. It should be further appreciated that the components shown in Fig. 7 may be provided in different configurations and may be provided with different arrangements of direct and/or indirect physical and communicative links to perform the desired functionality of such
components.
[0063] Referring more particularly to the exemplary hardware shown in Fig. 7, a central computing device 700 is provided to function as the central controller within a SGD and may generally include such components as at least one memory/media element or database for storing data and software instructions as well as at least one processor. In the particular example of Fig. 7, one or more processor(s) 702 and associated memory/media devices 704a and 704b are configured to perform a variety of computer-implemented functions (i.e., software-based data services). One or more
processor(s) 702 within computing device 700 may be configured for operation with any predetermined operating systems, such as, but not limited to, Windows XP, and thus is an open system that is capable of running any application that can be run on Windows XP. Other possible operating systems include BSD UNIX, Darwin (Mac OS X), Linux, SunOS (Solaris/OpenSolaris), and Windows NT (XP/Vista/7).
[0064] At least one memory/media device (e.g., device 704a in Fig. 7) is dedicated to storing software and/or firmware in the form of computer-readable and executable instructions that will be implemented by the one or more processor(s) 702. Other memory/media devices (e.g., memory/media devices 704b) are used to store data which will also be accessible by the processors) 702 and which will be acted on per the software instructions stored in memory/media device 704a. The various memory/media devices of Fig. 7 may be provided as a single or multiple portions of one or more varieties of computer-readable media, such as, but not limited to, any combination of volatile memory (e.g., random access memory (RAM, such as DRAM1 SRAM, etc.)) and nonvolatile memory (e.g., ROM, flash, hard drives, magnetic tapes, CD-ROM, DVD- ROM, etc.) or any other memory devices including diskettes, drives, other magnetic- based storage media, optical storage media and others. In some embodiments, at least one memory device corresponds to an electromechanical hard drive and/or or a solid state drive (e.g., a flash drive) that easily withstands shocks, for example that may occur if the SGD is dropped. Although Fig. 7 shows two separate memory/media devices 704a and 704b, the content dedicated to such devices may actually be stored in one memory/media device or in multiple devices. Any such possible variations and other variations of data storage will be appreciated by one of ordinary skill in the art.
[0065] In one particular embodiment of the present subject matter, a first portion of memory/media device 704b is configured to store input data received from a user for performing the desired functional steps associated with a speech generation device. For example, data in memory 704b may include inputs received from one or more peripheral devices, including but not limited to touch screen 706, microphone 708 and other peripheral devices 710, which indicate a user's selections of text to be spoken by the SGD or other related actions. Memory device 704a includes computer-executable software instructions that can be read and executed by processor(s) 702 to act on the data stored in memory/media device 704b to create new output data (e.g., audio signals, display signals, RF communication signals and the like) for temporary or permanent storage in one of the memory/media devices. Such output data may be
communicated to a peripheral output device, such as display device 712, speakers 714, cellular phone or RF device 716, wireless network adapter 718, or as control signals to still further components. Computing/processing device(s) 702 may be adapted to operate as a special-purpose machine by executing the software instructions rendered in a computer-readable form stored in memory/media element 704a. When software is used, any suitable programming, scripting, or other type of language or combinations of languages may be used to implement the teachings contained herein. In other embodiments, the methods disclosed herein may alternatively be implemented by hardwired logic or other circuitry, including, but not limited to application-specific circuits. The steps, features and algorithms described later, including but not limited to algorithms for extracting web words, implementing slots and fillers, utilizing vocabulary lists, applying speech inflection styles and effects, and the like, are provided as executable software instructions stored in the computer-readable medium of elements 704a and/or 704b or others.
[0066] Referring still to Fig. 7, various input devices may be part of an SGD and communicatively coupled to the computing device 700. For example, a touch screen 706 may be provided to capture user inputs directed to a display location by a user hand or stylus. A microphone 708, for example a surface mount CMOS/MEMS silicon- based microphone such as offered for sale by Knowles Acoustics, a division of Knowles Electronics of Itasca, Illinois, may be provided to capture user audio inputs. Other exemplary input devices (e.g., peripheral device 710) may include but are not limited to a peripheral keyboard, peripheral touch-screen monitor, peripheral microphone, mouse and the like.
[0067] In one particular example, touch screen 706 includes a projective capacitive touchscreen instead of a keyboard, a mouse or a pressure sensitive touchscreen. A capacitive touchscreen panel is a sensor typically made of glass coated with a transparent conductor such as indium tin oxide (ITO). This type of sensor is basically a capacitor in which the plates are the overlapping areas between the horizontal and vertical axes in a grid pattern. Since the human body also conducts electricity, a touch on the surface of the sensor will affect the electric field and create a measurable change in the capacitance of the device. Thus, a transparent rigid panel covers the display screen and prevents false activations of the SGD when the transparent panel inadvertently is subject to pressure. Each of the icons on the display of the SGD is a capacitive switch, which is activated by the capacitance of the user's body rather than
by pressure against the screen. One particular example of a touch screen for use in accordance with the subject SGD corresponds to a capacitive touch screen such as offered for sale by Touch International of Austin, Texas.
[0068] When employed in accordance with an aspect of the present invention, the incorporation of projective capacitive technology into a dedicated speech generating device has several advantages, especially relative to a typical resistive touch panel. A primary advantage stems from the fact that active sensing material can be applied to the back of any transparent material, in this case a super thin (0.043") chemically strengthened glass. The overall thinness and light weight are cruciai in a lightweight, handheld ambulatory design. The top glass surface can be sealed to a bezel associated with a front casing component, providing ingress protection to moisture and dirt as well as a flush mounting surface so there is no recess for debris or liquids to accumulate. The active electronics and graphics are applied to the back surface of the glass which makes the top scratch resistant and increases the durability of the panel. High resolution devices are generally available, such as on the order of 10,000 dpi (dots per square inch). Capacitive touch screens also don't typically require recalibrations, as may be needed with resistive or pressure-sensitive touch screens.
[0069] Capacitive touch screens also provide advantages for conventional users of a speech generation device. A capacitive touch panel requires no activation force but only a slight contact, which is an advantage for a user who may have motor control limitations. Typical resistive sensors have a predetermined activation force required to physically short two conductive layers together. These top layers are external and can suffer failure due to scratching or physical breakdown of the spacing layer that separates the two conductive surfaces. Capacitive touch screens also accommodate multi-touch applications (i.e., a set of interaction techniques which ailow a user to control graphical applications with several fingers) as well as scrolling.
[0070] In general, the different types of input devices (including optional peripheral devices) are configured with software instructions to accept user inputs in accordance with one or more access methods, including the following: a "Touch Enter", "Touch Exit", "Touch Auto Zoom", "Scanning", "Joystick", "Audio Touch", "Mouse
Pause/Headtrackers", "Morse Code" and/or "Eye Tracking" access methods. In a Touch Enter" access method, selection is made upon contact with the touch screen, with highlight and bold options to visually indicate selection. In a "Touch Exit" method, selection is made upon release as a user moves from selection to selection by dragging
a finger as a stylus across the screen. In a "Touch Auto Zoom" method, a portion of the screen that was selected is automatically enlarged for better visual recognition by a user. In a "Scanning" mode, highlighting is used in a specific pattern so that individuals can use a switch (or other device) to make a seiection when the desired object is highlighted. Selection can be made with a variety of customization options such as a 1- switch autoscan, 2-switch directed scan, 2-switch directed scan, 1 -switch directed scan with dwell, inverse scanning, and auditory scanning. In a "Joystick" mode, selection is made with a button on the joystick, which is used as a pointer and moved around the touch screen. Users can receive audio feedback while navigating with the joystick. In an "Audio Touch" mode, the speed of directed selection is combined with auditory cues used in the "Scanning" mode. In the "Mouse Pause/Headtrackers" mode, selection is made by pausing on an object for a specified amount of time with a computer mouse or track ball that moves the cursor on the touch screen. An external switch exists for individuals who have the physical ability to direct a cursor with a mouse, but cannot press down on the mouse button to make selections. A "Morse Code" option is used to support one or two switches with visual and audio feedback. In "Eye Tracking" modes, selections are made simply by gazing at the device screen when outfitted with eye controller features and implementing selection based on dwell time, eye blinking or external switch activation.
[0071] SGD hardware components may also include one or more integrated output devices communicatively coupled to the central processing unit, such as but not limited to a display device 712 and speakers 714. Display device 712 may correspond to suitable monitor, screen or other output device for visual presentation of to a user.
Suitable examples, include but are not limited to a light-emitting diode (LED) display, electroluminescent display (ELD), plasma display panel (PDP), and liquid crystal display (LCD). One particular embodiment corresponds to an LCD such as an LW500AC9001 component offered for sale by Chi Mei Optoelectronics (CMO) of Taiwan. Speakers 714 may generally correspond to any compact high power audio output device, for example, a GC0401 low profile paper and mylar speaker such as offered for sale by Amaoto Industrial Co., Ltd. (A.I. C.) of Taiwan.
[0072] Speakers 714 may function as an audible interface for the speech generation device when computer processor(s) 702 utilize text-to-speech functionality. In accordance with general functionality of a speech generation device, a user provides text, symbols corresponding to text, and/or related or additional information in a
"Message Window" which may then be interpreted by a text-to-speech engine and provided as audio output via the speakers 714. In one embodiment, The SGD also includes an e-book reader that can be controlled by the user to read the e-book and have the e-book speak the words being read to the user. Speech output may be generated in accordance with one or more preconfigured text-to-speech generation tools in male or female and adult or child voices, such as but not limited to such products as offered for sale by Cepstral, HQ Voices offered by Acapela, Flexvoice offered by Mindmaker, DECtalk offered by Fonix, Loquendo products, VoiceText offered by NeoSpeech, products by, AT&T's Natural Voices offered by Wizzard, Microsoft Voices, digitized voice (digitally recorded voice clips) or others. A volume control module 722 may be controlled by one or more scrolling switches or touch-screen buttons.
[0073] SGD hardware components may also include various communications devices and/or modules, such as but not limited to an antenna 715, cellular phone or RF device 716 and wireless network adapter 718. Antenna 715 can support one or more of a variety of RF communications protocols. A cellular phone or other RF device 716 may be provided to enable the user to make phone calls directly and speak during the phone conversation using the SGD, thereby eliminating the need for a separate telephone device. A wireless network adapter 718 may be provided to enable access to a network, such as but not limited to a dial-in network, a local area network (LAN), wide area network (WAN), public switched telephone network (PSTN), the Internet, intranet or ethernet type networks or others. These communications modules (or others, such as but not limited to an infrared (IR) transceiver) may be provided to function as a universal remote control for the SGD that can operate devices in the user's
environment, for example including TV, DVD player, and CD player.
[0074] When different wireless communication devices are included within an SGD1 a dedicated communications interface module 720 may be provided within central computing device 700 to provide a software interface from the processing components of computer 700 to the communication device(s). In one embodiment, communications interface module 720 includes computer instructions stored on a computer-readable medium as previously described that instruct the communications devices how to send and receive communicated wireless or data signals.
[0075] Antenna 715 may be provided to facilitate wireless communications with other devices in accordance with one or more wireless communications protocols, including
but not limited to BLUETOOTH, WI-FI (802.11 b/g) and ZIGBEE wireless communication protocols. In one example, the antenna 715 enables a user to use the SGD with a BLUETOOTH headset for making phone calls or otherwise providing audio input to the SGD. The SGD can also generate BLUETOOTH radio signals that can be used to control a desktop computer, which appears on the SGD's display as a mouse and keyboard. An example of a suitable antenna for such devices is a surface mount Rufa brand antenna such as offered for sale by Antenova Ltd. of Cambridge, United Kingdom.
[0076] When the hardware components within an SGD embodiment, particularly the communications interface module 720, includes functional features by which the SGD can function as a Bluetooth radio, users can utilize the flexible input methods and software configurations of an SGD to control and operate a separate desktop or laptop computer. In such a fashion, the SGD appears to the desktop as a Human Interface Device (HID) device which allows the user who may not otherwise be able to type or control a mouse to operate the computer by taking advantage of the accessibility options provided by the SGD and its specialized hardware and software features. To access and control a personal computer using the Bluetooth features internal to the SGD1 a user plugs a Bluetooth access control module into a USB port on the user's personal computer and performs a communication pairing procedure that establishes short-range wireless communication connectivity between the SGD and the personal computer. Similar steps may be followed to establish Biuetooth connections between an SGD and Bluetooth-enabled headset of keyboard/mouse, etc.
[0077] In addition to controlling a desktop, integrated Bluetooth features afford a user the opportunity to take advantage of several optional Bluetooth accessories. For example, a switch may be provided for users to mechanically actuate a selection on the SGD and then communicate that selection via Bluetooth protocols. Switching is often used when an SGD operates in a user-input mode where choices are scanned across the display as visual options or sequenced within an audio output and a user can then select one of the scanned options upon selection by switch. Scanning users often rely on switches located wherever the user has consistent and reliable motor control.
Switches may be located on a head rest, seat, leg support, etc. Many conventional switches are hard-wired and include cables that are routed from the device around a wheelchair or other mounting location to the accessibility points. Provision of a
Bluetooth-communicating input switch eliminates the potential for wire tangling, thus
providing a more convenient and safer environment for the user in a wheelchair with moving parts.
[0078] Another option afforded by Bluetooth communications features involves the benefits of a Bluetooth audio pathway. Many users utilize an option of auditory scanning to operate their SGD. A user can choose to use a Bluetooth-enabled headphone to listen to the scanning, thus affording a more private listening environment that eliminates or reduces potential disturbance in a classroom environment without public broadcasting of a user's communications. A Bluetooth (or other wirelessly configured headset) can provide advantages over traditional wired headsets, again by overcoming the cumbersome nature of the traditional headsets and their associated wires.
[0079] Fig. 24 shows an exemplary user interface menu provided via software features of the present invention that provides specific functional selection mechanisms for a user to implement Bluetooth functionality. The Bluetooth menu shown in Fig. 24 may include a variety of features, including a Bluetooth devices viewport 2400, an Unpair Bluetooth device button 2402, a Refresh button 2404, a locator button 2406, a Bluetooth device power button 2408, a device details group box 2410, a
connect/disconnect button 2412 and a flash button 2414. Bluetooth devices viewport 2400 displays icons for all Bluetooth-enabled devices to which an SGD is currently connected, paired to, or that are available for connection. Unpair Bluetooth device button 2402 breaks the Bluetooth connection between an SGD and the Bluetooth- enabled device selected in the viewport. In other words, button 2402 unpairs a device and removes the device's icon from the viewport. Refresh button 2404 starts a new search for available Bluetooth-enabled devices. Locator button 2406 is selected if the Bluetooth-enabled device to which a user wants to connect is not displayed in the viewport 2400. Actuation of button 2406 opens a dialog box instructing the user to ready the device for pairing and then triggers a new search for available Bluetooth- enabled devices. Bluetooth device power button 2408 toggles the Bluetooth radio in the SGD on and off to enable or disable all Bluetooth connections between the SGD and Bluetooth-enabled devices. Device details group box 2410 lists details of the Bluetooth- enabled device selected in viewport 2400. Connect/disconnect button 2412 toggles between enabling and disabling a Bluetooth connection with the specific device selected in the viewport 2400. Flash button 2414 causes the Bluetooth access control module
plugged into a USB port of the Bluetooth device to flash an indicator light so that the Bluetooth connection can be verified.
[0080] When an exemplary SGD embodiment includes an integrated cell phone, a user is able to send and receive wireless phone calls and text messages. The cell phone component 716 shown in Fig. 7 may include an access port for a subscriber identity module (SIM) card by which a user can provide requisite information for identifying user information and cellular service provider, contact numbers, and other data for cellular phone use. In addition, associated data storage within the SGD itself can maintain a list of frequently-contacted phone numbers and individuals as well as a phone history or phone call and text messages. One or more memory devices or databases within a speech generation device may correspond to computer-readable medium that may include computer-executable instructions for performing various steps/tasks associated with a celiular phone and for providing related graphical user interface menus to a user for initiating the execution of such tasks. The input data received from a user via such graphical user interfaces can then be transformed into a visual display or audio output that depicts various information to a user regarding the phone call, such as the contact information, cail status and/or other identifying information. General icons available on SGD or displays provided by the SGD can offer access points for quick access to the cell phone menus and functionality, as well as information about the integrated cell phone such as the cellular phone signal strength, battery life and the like.
[0081] Examples of features variously provided for a user in relation to the use of an integrated cell phone within a speech generation device are now discussed with reference to Figs. 8-11. It should be further appreciated that in addition to the software features described herein, specific hardware features included within a cellular phone may also be provided, such as but not limited to an RF transceiver module,
coder/decoder (CODEC) module, digital signal processor (DSP) module,
communications interfaces, microcontroller(s) and/or subscriber identity module (SIM) cards. It should be appreciated that all graphical user interfaces and menus that display "buttons" or other features that are selectable by a user correspond to user input features that when selected trigger control signals being sent to the central computing device within an SGD to perform an action in accordance with the selection of the user buttons.
[0082] Referring now to Fig. 8, an exemplary graphical user interface menu provided via software features of the present invention provides for an exemplary phone contacts menu for use with a cellular phone integrated with a speech generation device. A contact list can contain identifying information (e.g., address, phone number(s), e-mail address(es), etc.) for people in one or more groups such as family, friends, physicians, clinicians, or others. Once a contact list is established, a user can selectively employ the contact information to streamline cell phone, text messaging and e-mail
communication activities via the SGD. As shown in Fig. 8, the exemplary contacts menu can include the following items: a contacts viewport 800, contact symbol display 801 , contacts add button 802, contacts edit button 804, contacts delete button 806, contacts search button 808, contacts place call or send text button 810, or others. The contacts viewport generally displays the particular information about a contact as stored in computer-readable memory associated with the SGD. For example, contacts viewport 800 may display each contact in a user's list, including the phone number(s), e-mail address(es), and a predetermined or selected symbol. A larger version of the contact symbol is provided as display 801 , to the right of the viewport 800. Contacts add button 802 is provided for selection by a user in order to add a new contact to the contact list. Contacts edit button 804 is provided for selection by a user in order to revise a currently selected contact. Contacts delete button 806 is provided for selection by a user in order to remove the currently selected contact. Contacts search button 808 is provided for selection by a user in order to search through the contact list using first or last name. Button 810 is provided for a user to place a call or send a text message to one of the numbers for the currently selected contact.
[0083] Before a user can use the contacts menu shown in Fig. 8, a user first provides general contact information about a user. An example of the user interface menu for providing initial or subsequent edits for a particular contact is shown in Fig. 9. The edit contact menu of Fig. 9 can be accessed and then opened to enter information for the new contact or to edit information for an existing contact. As shown in Fig. 9, contact information may selectively include one or more of the following: name, address, specifically designated ringtone, personalized custom greeting, home phone number, work phone number, cellular phone number, one or more e-mail addresses, and the like. A picture may also be selected by a user to provide a symbol
corresponding to a visual reference for a particular contact.
[0084] Referring now to Fig. 10, a phone menu enables a user to dial outgoing phone calls, access the contacts list and view calling history. In particular, a standard 12-button number pad 1000 is provided by which a user can manually dial a phone number to which an outgoing phone call may be placed. A power button 1002 may be used to turn the integrated cell phone on or off. Turning the phone off can extend battery life when the user is not using the integrated phone feature. A hide button 1004 can minimize the phone menu so a user can navigate through the communication pages while the phone is active. A close button 1006 closes the phone menu, thus disconnecting any active phone calls. A viewport 1008 displays the phone number that a user is calling. A dial button 1010 enables a user to place an outgoing call to the number shown in the viewport. A backspace button 1012 enables a user to erase the last number in the viewport. A clear button 1014 enables a user to clear all numbers from the viewport. A contacts button 1016 enables a user to open the contacts menu (e.g., as shown in Fig. 8) to search through the user's list of contacts. A view call history button 1018 enables a user to display records of all incoming and outgoing phone calls.
[0085] To make an outgoing phone call, a user can select the dial button 1010 available on the phone menu of Fig. 10. Alternatively, a user can use the place call button 810 available from the contacts menu of Fig. 8. Still further, a user may have programmed a customized button on the SGD to perform the Dial Cell Phone action. Further selection of a programmed "Speak" button can be used to select a message window with content in it so that a user can use the SGD to "speak" the user's message through the integrated cell phone. Similar features can be used to answer an incoming call. In one example, a user will receive an incoming call via the integrated cell phone feature causing the SGD to ring like a telephone. An alert box may also be shown on the SGD's display device with information such as the incoming call name and number and selectable input buttons for the user to select either to answer or ignore the incoming call. If the incoming call is from an entity listed in the user's contacts menu, then the contact's symbol may also be provided within an incoming call alert box.
[0086] Referring now to Fig. 11 , a text messaging menu enables a user to send a text message and to review the history of all or selected text messages that a user has sent and received. In particular, a viewport 1100 enables a user to select from a variety of tabs (Unread, Read, Sent, Drafts or Recent) to display a category of text messages in the viewport. Once messages are shown, buttons displayed to the right of viewport
1100 (e.g., View button 1102, Delete button 1104, and Delete All button 1106) will become available to help a user manage the text messages within each tab. A number field 1108 enables a user to send a text message to the number shown in the number field 1108. A text field 1110 visually displays to a user the body of a text message as it is entered. A contact button 1112 provides access to the current contact's numbers. If no contact is selected, the contact button 1112 enables a user to select a contact from available contacts. A send message button 1114 enables a user to send the created text message.
[0087] To send a text message (via SMS or other protocol), a user may have a variety of options. For example, a user can create a new text message by opening the text messaging menu of Fig. 11 , entering the text message manually and either entering the phone number manually or finding a contact number using the contacts button. A user may alternatively send a quick text message by sending the current contents of a message window as a text message to a specific phone number. In this fashion, a user will be prompted to enter the phone number when the user selects the quick text programmed button. A user can also enter the phone number manually or select a number from the contacts menu to send a quick text message.
[0088] As previously mentioned, an exemplary SGD embodiment may also include features for connecting to the Internet, for example features including a wireless network adapter 718 such as shown in Fig. 7 or other communications module.
Additional executable instructions stored in memory associated with central computing device provide a web browser to serve as a graphical user interface for interacting with the Internet. For example, software instructions may be provide to call preconfigured web browsers such as Microsoft Internet Explorer or Firefox® internet browser 3Od available from Mozilla software.
[0089] Examples of menus and features related to SGD Internet access are now discussed with reference to Figs. 12-15. In general, such menus and features allow a user to not only achieve access to the Internet for customary purposes, but also to utilize information accessed via the Internet to further enhance the functionality of an SGD. For example, an SGD web browser can be provided to enable the user to go to a webpage and extract key vocabulary from that web page to use the extracted
vocabulary so that the user can talk about current events or more sophisticated topics than is currently within the capability of the vocabulary of conventional SGDs. The SGD can also upgrade its own software by connecting to the internet and enabling the vendor
of the SGD to update the SGD's software from the vendor's dedicated servers. Again, it should be appreciated that all graphical user interfaces and menus that display
"buttons" or other features that are selectable by a user correspond to user input features that when selected trigger control signals being sent to the central computing device within an SGD to perform an action in accordance with the selection of the user buttons.
[0090] Referring again to the drawings, a specially designed web browser for use with exemplary SGD embodiments of the present invention is shown in Fig. 12. A set 1201 of go back/forward buttons enables a user to move backwards or forwards by one or more previously selected pages through the web browsing history. A reload current page button 1202 refreshes the current web page. A stop loading this page button 1203 is selected to provide an instruction to stop loading the current web page. A home button 1204 enables a user to open up the web page that the user has designated as a "Home" page. A Zoom in button 1205 enables a user to increase the font size of the current web page. A zoom out button 1206 enables a user to decrease the font size of the current web page. A reset button 1207 enables a user to return the font size on the current web page to standard or default size. An address text button 1208 enables a user to select this text box to enter a URL (web page address) that the user wants to open. When the address button 1208 is selected, such action may open up the system keyboard so that a user can enter the URL (web address) for the page the user wants to visit. A search text box button 1209 is selectable by a user to enter a keyword to use a search engine (e.g., Google, Yahoo, or other preconfigured searching tool available via the Internet) to search on the keyword. A Grab and Drag mode button 1210 enables a user to toggle the "grab and drag" setting in a web browser, such as may enable a user to select text and "flick" the screen to scroll up or down.
[0091] Referring now to Fig. 13, interface menus may be provided that enable a user to incorporate key words located on the Internet (i.e., "web words") into a user- customized vocabulary list. Such identified web words may then be incorporated into a user's custom message by selecting such words from a vocabulary list. In order to perform such word incorporation feature, a user may first enable the wireless network connection (e.g., Wi-Fi connection) available via the SGD, launch a provided web browser (e.g., MOZILLA FIREFOX) and navigate to a web page that includes the words of interest. On the page of interest, a user can select one or more portions of interest (e.g., the highlighted text box shown in Fig. 13). Upon selection of text, a text menu
1300 and possible sub-menu 1310 are provided, for example as also shown in Fig. 13. Such menus may be provided automatically (e.g., via pop-up window upon selection) or by selecting a button after the text has been selected by a user. Various options available from the text menu 1300 include a selectable "speak text" feature that instructs an SGD to immediately speak the selected text. Another feature is a "send to message window" feature that sends the selected text to a message window where additional actions can be selected. An "add vocabulary list as one item" feature enables a user to add the selected text to a vocabulary list as a single block. An "add to vocabulary list as multiple items" feature enables a user to add the selected text to a vocabulary list as individual words or phrases. When either of the options are selected from menu 1300 to add the text to a vocabulary list, a sub-menu 1310 may appear, listing all of the existing vocabulary lists available to a user that are enabled to accept web content. The specific vocabulary list desired by a user may then be selected. Selection may include a previously configured list or a new list.
[0092] Once web words have been selected and added to a vocabulary list, a user may view the words in a vocabulary list box, for example, as shown in Fig. 14. When a word is selected from the vocabulary list box, an SGD may be configured to either speak the word immediately or add the selected word into a message window, or both. Additional details regarding vocabulary list boxes and related features are presented below.
[0093] A still further internet-related feature available via an SGD corresponds to a user interface for importing symbols from the Internet, In general, such feature enables a user to select a picture available from an authorized source on a web page and import it into a custom symbol set for use by the user. In general, authorized sources include sources whose pictures and symbols are owned by the user, sources with pictures and symbols which the user has permission to use and access, and sources who provide pictures and symbols in the public domain. In order to perform such symbol
incorporation feature, a user may first enable the wireless network connection (e.g., Wi- Fi connection) available via the SGD, launch a provided web browser (e.g., Mozilla Firefox) and navigate to a web page that includes the picture(s) or graphic(s) of interest. On the page of interest, a user can select the picture or graphic to initiate the display of a web symbol menu 1500 as shown in Fig. 15. An "add to my symbols" feature saves the picture with an automatically generated name. An "add to my symbols with name" feature prompts the user to enter a unique and meaningful name for the symbol and
then saves it to the SGD symbol library. The imported symbol is now available to use as background on a page or a popup, as a symbol on a user button, or in another interactive fashion.
[0094] Referring now to Fig. 16, exemplary steps performed in a software algorithm for creating web words is presented. In general, integrated web access for vocabulary generation in a speech generating device affords a method to select content from the web, automatically extract useful terms and phrases, and use them immediately in aided communication. In particular, software steps to perform such a method are stored as executable instructions in a computer-readable medium such as previously described. Execution of the steps is selectively activated by a user based on the user's selection of text and options available via menus, such as those illustrated in Figs. 13, 14 and 15.
[0095] In a first step 1600, electronic input is received indicating a vocabulary source accessed by the user. The vocabulary source may correspond, for example, to text available from a document, web page or other data source. It should be appreciated that information available from the Internet may greatly expand the potential type and amount of information that is available to a user as words that can be integrated into messages and either spoken or communicatively transmitted via a speech generation device. For example, extraction of useful web words may facilitate rapid communication about presently relevant information, including current events, news stories, sports stories, and/or information of a specific technical nature with specialized terminology. The web serves as the source of vocabulary augmenting the information typically delivered with a communication device, such as the names of people, places, businesses, sports teams, etc.
[0096] Users can use the integrated web browser to find information related to that which the user wants to communicate about. Electronic input is received by an SGD in step 1602 indicating selected text from which to extract useful vocabulary. The user may collect one or more pieces of such text, and provide input to the SGD that tags selected pieces or each piece in step 1604 with a user-defined tag indicating a general category of interest, separating the text into tagged buckets. In step 1606, each such text bucket may be separately analyzed to determine words and phrases of interest, as described in more detail below. The words and phrases are then made available for use in communication in a scrolling view that presents the words and phrases. The words and phrases may be augmented in step 1608 with a picture if a suitable picture is
found in an internal database whose label matches the word or phrase. Once terms have been discovered and offered to the user in the scrolling view described above, the user may select the individual items to be spoken. Alternatively, they may select the items to be inserted into a document, which may or may not be spoken.
[0097] Discovered vocabulary may also be used in other ways once it has been discovered. For example, the lists of discovered vocabulary may be added to a permanent internal database of vocabulary available to the user at all times. One use of the vocabulary lists is with the slot mechanism described below. The lists of discovered vocabulary may serve as source material for constructing more traditional
communication pages. Vocabulary items can be directly selected and inserted onto buttons on communication pages, along with any matching symbols. Discovered vocabulary may be entered into an internal dictionary and used to augment word prediction. The discovered words will become part of the database of words that get predicted in response to partial inputs.
[0098] The text bucket analysis of step 1606 may include a variety of specific software-based rules. For example, selected text may be examined to discover interesting new words and phrases by extracting various identified items: Any new word not in an SGD's internal dictionary; Any noun, adjective, or verb from an internal dictionary that has a frequency of occurrence rating below some threshold; Phrases consisting of adjacent words from the text that contain either interesting words (by the above definitions) or capitalized words. All such words and/or phrases, up to a threshold length, are extracted from the text. Any such phrases that occur more than some threshold number of times in the text are added to the list of extracted words and phrases.
[0099] In addition to vocabulary discovery, other types of media encountered while traversing the web with an SGD's integrated browser can be integrated with the communication device's internal data library. Images encountered on communication pages can be selected and directly incorporated into the internal symbol database of the communication device user. Similarly, video and audio files encountered can also be added to the internal media databases for use by the communication device user.
Audio samples will often be used to augment synthesized text-to-speech for
communication and video samples can serve as a means of shared communication experience and to ground communication between a device user and the user's communication recipient(s).
[00100] Words, phrases, symbols and other information either stored on a SGD or imported as described above can be grouped and used in preconfigured or customized compilations referred to herein as vocabulary lists or vocabulary list boxes. Vocabulary list boxes enable a user to have a wide variety of words and phrases immediately available. By listing groups of related words and phrases, vocabulary list boxes enable a user to quickly search through a wide range of text options when composing a message. For example, a user can select a particular group of words and/or phrases and associate all selected items into a new vocabulary list, which may be named and optionally assigned a unique symbol to visually represent the vocabulary list. Setting options characterizing each vocabulary list may also be defined, for example, an indication of whether the vocabulary list should accept new words and phrases from sources available on the Internet. Once a vocabulary list is made, new words or phrases can be added to the vocabulary list and/or existing words or phrases may be edited or deleted. Features may also be provided such that existing vocabulary lists can be embedded into other vocabulary lists, thus providing a hierarchy of related sets and subsets of vocabulary lists. Features may be provided to add vocabulary searches into a vocabulary list or to sort and display the contents of a vocabulary list according to selected parameters such as alphabetization and frequency of use in messages.
Features may also be provided to trigger actions performed by the SGD upon selection of an item from a vocabulary list, for example, to automatically "speak" or provide as audio output the words/phrases from a vocabulary list box immediately as it is selected by a user, or to send the words/phrases from the vocabulary list box to the Message Window as it is selected by a user.
[00101] When embedding one vocabulary list in another, a user can get better access to more specific vocabulary on a related subject. For example, as shown in Fig. 17, software instructions may be configured to display an embedded vocabulary list in a similar fashion (i.e., as an icon) as any other item in a given vocabulary list box. The example in Fig. 17 shows a user interface displaying a vocabulary list box with the vocabulary list "Animals" having the "Animal Sounds" vocabulary list embedded within it. When an embedded vocabulary list is selected by a user, its contents will then be displayed in the vocabulary list box. All the words in the original vocabulary list
"Animals" (e.g., aardvark, alligator, animal, ant, anteater, antelope, ape, armadillo, etc.) as well as the words in the embedded vocabulary list "Animal Sounds" (e.g., bark, bleat,
buzz, chirp, cluck, croak, gobble, growl, hiss, etc.) are then available for a user to use in a message.
[00102] When embedding a vocabulary search within a vocabulary list, a search for words and phrases will automaticaiiy scan through a dictionary of words and phrases provided on the SGD and display ail words and phrases that meet the user-defined search criteria. For example, suppose that a user adds twelve words to an SGD dictionary and marks them all with the "Clothing" category. The next time a user opens a page with a vocabulary list containing a vocabulary search on "Clothing", those twelve new words will automatically appear in the vocabulary list box. By inserting a search itself to a vocabulary list, a search can be performed dynamically. As such, when other items are added to a dictionary that matches a user's search criteria, a user does not have to go back and separately add the new words to different preexisting vocabulary lists. The results of an embedded vocabulary search can be intermixed with the other vocabulary items within the vocabulary list. They will appear side by side with the individual vocabulary items, and they can be sorted based on the settings of the vocabulary list box {alphabetically, as is, or by frequency of use).
[00103] The SGD may further include executable instructions stored on a computer-readable medium that implement a unique method of communication rate enhancement that combines sentences and phrases with slot fillers, in general, implementation of a "slots" feature triggers software provision of mechanisms to insert a "slot" or placeholder in a text phrase. Each one of these placeholders is associated with a list of "fillers" that can potentially fill in this place in the text. The list of slots and their associated fillers are stored in a database internal to the software. When a user inserts the text containing the slot into a document, the slot position is filled with a default filler, and that filler is underlined in the text. The user then may select this underlined word, and upon the user's selection of the underlined word, the user is presented with the entire list of fillers for that slot and may choose another filler, at which point that filler is inserted in that place in the text, which remains underlined so that the filler may be changed again in the future. Another variation on the slots protocol allows a user to declare to speak a text phrase containing one or more fillers. At which point the user is prompted to specify the filler values for each slot, and then the entire text phrase, with the filler values chosen by the user, is spoken. Slot fillers can optionally have pictures associated with them in which case those pictures can be used to augment the display
of thθ filler value. Slots can be added and deleted by the user, and filler values can also be added and deleted by the user.
[00104] The slots can be populated with literal terms, words or phrases, vocabulary lists and related features as described above, embedded searches and/or embedded recursive slots. Moreover, this unique method optionally can be combined with symbols. A "slot" placeholder can also be included in button text, button labels and phrases. Specific interface options may be provided for a user to create a phrase with slots, adding slots to buttons, and work with slots in a Message Window. The capability embodied in the so-called "slots" protocol provides the user with the ability to fill in certain words in otherwise static text. The slots protocol provides the user with easy access to common words that can be used to complete a message in a variety of settings and situations. For example, in the phrase: "Can we have dinner now?", the slot is "dinner". Other commonly-used words like "breakfast" or "coffee" are slots that can be used interchangeably to complete similar messages like: "Can we have breakfast now?". Slots are additional tools that minimize necessary navigation to save the user's time and energy during message composition.
[00105] In general, slots are designed to provide a variety of vocabulary options while reducing the number of selections that a user must make to create a whole message. Slots also help to conserve space on the touch screen. Slots provide a user with easy access to all of the words associated with a particular vocabulary concept (or category). When a user selects a slot, the user can choose to replace the word that is currently filling the slot with another word from the same concept. Rather than build a dynamic message one word at a time, a user can create sentences that contain slots in key locations. When the phrase is added to the Message Window, a user can then select the slots (which are visually indicated in some fashion, for example displayed as blue underlined words) and replace the current words with different options.
[00106] Referring to the following example of Figs. 18 and 19, the message in the Message Window is the label text for a button. The first slot is associated with the "breakfast" concept and the second slot is associated with the "fruit" concept. The slots allow a user to create dynamic messages with a reduced number of selections. By selecting the slots and changing the filler text, the example phrase "I want oatmeal and a banana for breakfast" can quickly and easily be changed to read as follows: "I want toast and a nectarine for breakfast".
[00107] In accordance with the concepts slot technology, a user may be able to add slots to his customized phrase database - My Phrases. Adding slots to phrases is one way to maximize the potential of both a slot/filler rate enhancement feature as well as a phrase prediction rate enhancement feature. This technique provides a user with rapid access to complete statements, while still enabling the user to vary what he is going to say. For example, if a user tells an assistant what he wants to wear every morning, then he may want to create a phrase to say "I want to wear my jeans today." Then, simply turn the word "jeans" into a slot that accesses the "clothing" concept.
Every morning the user can quickly access the same phrase, no matter what page or popup is active, and say "I want to wear my sweater today." or "I want to wear my boots today." A user could even add more slots, such as one that accesses the "colors" concept or the "textures" concept to add more description to such statements.
[00108] In accordance with phrase prediction rate enhancement functionality, one or more areas of a user display (i.e., buttons) can be filled with phrases as a user enters text into a document. For example, as a user enters text, the SGD can match the partially entered text to an internal database of text phrases and present those phrases whose starting characteristics match the partially entered text. Each phrase may also be assigned a priority rating (e.g., based on frequency of use or other characterizing feature). In the case where more phrases are found that match the partially entered text than there are buttons to fill, those phrases with the highest priority ratings are shown. The phrase database can be added to or deleted from by a user. Phrases may optionally have pictures associated with them in which case those pictures can be used to augment the display of the phrases on the buttons. Phrases may also contain slots with their associated fillers.
[00109] A still further feature that may be provided in exemplary embodiments of a speech generation device corresponds to a voice inflection mark up feature that can add inflection to the speech output of the SGD. Thus, the speech output not only can be controlled as to the volume of the spoken words, as to the pitch of the voice
(childlike, female, masculine), but also individual syllables can be differently
emphasized to convey different emotions and different voices that communicate different meanings just based on the different inflections imposed on different syllables of the words being spoken by the SGD. This feature overcomes limitations of the kinds of standard language that is employed by most conventional SGDs.
[00110] Fig. 20 illustrates exemplary steps that may be performed in a method of implementing speech characterizations such as speech styles and/or speech effects. In step 2010, an SGD receives input from a user associating one or more speech styles to selected text. In particular, a user can selectively mark text that the user wants to be spoken with a specific stylistic variation and then apply the speech style, much like a user would apply a font style in a typical word processing application. User interface locations by which a user can apply these speech styles via actions can be embedded in user communication pages, making this feature available to all communication device users regardless of the user's physical disabilities or access needs. Input defining one or more associated speech styles can be embedded as markups directly in documents. As such, users can store their documents with stylistic variants for later recall and use. Once entered, a style markup can be selected and its properties modified.
[00111] Supported speech styles may include variations defined by the following non-exhaustive list of exemplary parameters: voice, volume, pitch, speech rate, spell mode, emphasis and context. When defining a speech style according to voice, a user can select the specific text-to-speech voice to use to speak the selected text. When defining a speech style according to volume, a user can select the specific volume at which to speak the selected text. When defining a speech style according to pitch, a user can select one or more specific pitch changes to apply to selected text. When defining a speech style according to speech rate, a user can select one or more rates of speed at which to speak the selected text. When defining a speech style according to spell mode, the user can select to spell out the selected text rather than pronouncing it normally as a word - e.g., "H-E-L-L-O" instead of "Hello." When defining a speech style according to emphasis, a user can select the specific portions of text to emphasize in a spoken portion of text, for example by increasing the amount of voice inflection given to the emphasized text portions. When defining a speech style according to context, a user can define when to declare the context of a selected portion of speech, e.g., to speak the selected portion as if it were a date, currency, or other contextually
appropriate format. An example of an interface menu that may be provided to a user for selecting speech styles is shown in Fig. 22.
[00112] Referring still to Fig. 20, input may also be received by an SGD
associating one or more speech effects to selected text. It should be appreciated that although steps 2010 and 2012 are shown as separate and subsequent steps, steps 2010 and 2012 can actually occur intermixed in time and each can selectively occur
multiple times as desired by a user in a method of implementing voice inflection.
Examples of speech effects may correspond, for example, to different emotional effects, non-verbal communication noises, etc. such as but not limited to coughing, laughing, crying, kissing, sniffing, swallowing, sighing, whistling, yawning, or making noises such as "eh", "oh", "mmm", etc. Again, features may be provided such that speech effects can be embedded in user communication pages. An example of an interface menu that may be provided to a user for selecting speech effects is shown in Fig. 23, where selections can be made for a desired speech effect to be entered into a message window or spoken immediately upon selection.
[00113] As input defining one or more speech styles and/or speech effects is received for a given portion of text, the marked up text may be displayed in a message window, for example as shown in Fig. 21. A first portion of text, "Hi Jim!" has been selected and associated with a speech style defining a selected level of volume. A second portion of text, "That's" has been selected and associated with a speech style indicating desired emphasis on this first word in the sentence. A speech effect, namely laughing, is inserted at the end of the given text to simulate a user's laugh. Such enhanced speech features provide a more interactive and enjoyable method of communicating for a user of a speech generation device.
[00114] After input defining one or more speech styles and/or speech effects is received for a given portion of text (e.g., text displayed in a message window), the marked-up text is relayed to an interpretation engine such that the marked text can be analyzed by software rules in step 2014 to determine an optimum speech-to-text engine for speaking the marked-up selected text and associated speech styles/effects.
Because several different text-to-speech engines may be included with SGD software, such an interpretation engine may be configured to examine the graphical markup inclusions and target them for a specific text-to-speech engine, selecting the best match for a given speech style or effect and the underlying text-to-speech engine. Once the optimum text-to-speech engine is determined in step 2014, the actual text-to-speech spoken output may then be generated in step 2016.
[00115] While at least one presently preferred embodiment of the invention has been described using specific terms, such description is for illustrative purposes only, and it is to be understood that changes and variations may be made without departing from the spirit or scope of the present invention.
Claims
1. A speech generation device, comprising:
a display device for displaying images to a viewer;
a capacitive touch screen for capturing user inputs directed to the display device; and
a processor communicatively coupled to the display device and the touch screen; a computer-readable medium storing instructions executable by the processor; and
wherein the instructions stored on the computer-readable medium configure the speech generation device to generate text-to-speech output.
2. The speech generation device of claim 1 , wherein the capacitive touch screen comprises a projective capacitive touch screen.
3. The speech generation device of claim 1 , wherein the capacitive touch screen comprises a transparent panel covering the display device.
4. The speech generation device of claim 3, wherein an active sensing material of the capacitive touch screen is applied to a back surface of the transparent panel.
5. The speech generation device of claim 3, further comprising an outer housing, the outer housing being configured to accommodate the transparent panel, the display device being disposed within the outer housing such that visual access to the display pane! is provided through the transparent panel.
6. The speech generation device of claim 5, wherein a top surface of the transparent panel is sealed to a bezel associated with the outer housing.
7. A speech generation device, comprising:
a display device for displaying images to a viewer;
an integrated cellular phone device;
a processor communicatively coupled to the display device and the integrated cellular phone device; and
a computer-readable medium for storing instructions executable by the
processor;
wherein the instructions stored on the computer- read able medium configure the speech generation device to send and receive at least one of phone calls or text messages.
8. The speech generation device of claim 7, wherein the instructions stored on the computer-readable medium further configure the speech generation device to generate a contacts menu displaying identifying information of one or more contacts.
9. The speech generation device of claim 8, wherein the instructions stored on the computer-readable medium enable a user to place a call or send a text message by selecting a button on the contacts menu.
10. The speech generation device of claim 8, wherein the instructions stored on the computer-readable medium further configure the speech generation device to generate an edit contacts menu enabling a user to enter identifying information of a new contact or edit identifying information of an existing contact.
11. The speech generation device of claim 7, wherein the instructions stored on the computer-readable medium further configure the speech generation device to generate a phone menu enabling a user to perform at least one of dialing a phone number, accessing a contacts menu, viewing phone call history or turning off the integrated cellular phone device.
12. The speech generation device of claim 7, wherein the instructions stored on the computer-readable medium further configure the speech generation device to speak through the integrated celi phone device.
13. The speech generation device of claim 7, wherein the instructions stored on the computer-readable medium further configure the speech generation device to display call information to a user relating to an incoming or outgoing call.
14. The speech generation device of claim 7, wherein the instructions stored on the computer-readable medium further configure the speech generation device to generate a text message menu enabling a user to perform at least one of sending a text message or viewing a history of received and sent text messages.
15. A speech generation device, comprising:
a display device for displaying images to a viewer;
at least one wireless communications device;
a processor communicatively coupled to the display device and the at least one wireless communications device; and
a computer-readable medium for storing instructions executable by the processor, wherein the instructions stored on the computer- read able medium configure the speech generation device to wireiessly communicate with one or more secondary devices.
16. The speech generation device of claim 15, wherein the instructions stored on the computer-readable medium more particularly configure the speech generation device to wireiessly communicate with and control a secondary computer.
17. The speech generation device of claim 16, wherein the secondary computer comprises a desktop computer or a laptop computer.
18. The speech generation device of claim 16, wherein the secondary computer is displayed on the display device of the speech generation device as a mouse and keyboard.
19. The speech generation device of claim 15, wherein the one or more secondary devices comprises a wireless headset.
20. The speech generation device of claim 15, wherein the at least one wireless communications device comprises at least one antenna.
21. The speech generation device of claim 15, wherein the at least one wireless communication device comprises at least one infrared transceiver.
22. The speech generation device of claim 15, wherein the at least one wireless communication device comprises a wireless network adaptor for connecting to the internet.
23. The speech generation device of claim 15, wherein the instructions stored on the computer-readable medium configure the speech generation device to upgrade its instructions via the connection to the Internet provided by the wireless network adaptor.
24. The speech generation device of claim 15, wherein the one or more secondary devices comprise a wireless-communicating input switch.
25. The speech generation device of claim 24, wherein the instructions stored on the computer readable medium configure the speech generation device to scan visual options across the display device and to detect user actuation of the wireless- communicating input switch as selection of a scanned visual option.
26. The speech generation device of claim 15, wherein the instructions stored on the computer readable medium configure the speech generation device to perform a communications pairing procedure for the speech generation device and the one or more secondary devices.
27. A method of extracting internet-based content for use with a speech generation device, the method comprising:
establishing a connection between the speech generation device and the internet;
receiving an input indicating selected internet-based content from which to extract at least one type of media;
automatically extracting the at least one type of media for use with the speech generation device; and
incorporating the at least one type of media into a library or database stored on a computer-readable medium of the speech generation device.
28. The method of claim 27, wherein the at least one type of media comprises one or more of images, video files, audio files or web words.
29. The method of claim 27, further comprising one or more of speaking the at least one type of media as audio output or displaying the at least one type of media as visual output.
30. A speech generation device, comprising:
a display device for displaying images to a viewer;
a wireless network adaptor for providing access to a network;
a processor communicatively coupled to the display device and the wireless network adaptor; and
a computer-readable medium for storing instructions executable by the processor;
wherein the instructions stored on the computer-readable medium configure the speech generation device to implement the method of claim 27.
31. A method of extracting internet-based content for use with a speech generation device, the method comprising:
establishing a connection between the speech generation device and the internet;
receiving an input identifying selected text from which to extract vocabulary from an internet-based vocabulary source;
receiving an input identifying one or more pieces of the selected text to be analyzed; and
analyzing the one or more pieces of the selected text to determine words and phrases of interest.
32. The method of claim 31 , wherein the words and phrases of interest comprise at least one interesting word, the at ieast one interesting word comprising any word not included in an internal dictionary of the speech generation device or any noun, adjective or verb included in an internal dictionary of the speech generation device which has a frequency of occurrence rating below a predetermined threshold.
33. The method of claim 32, wherein the words and phrases of interest further comprise any phrase including adjacent words from the one or more pieces of the selected text that contain capitalized words or the at least one interesting word.
34. The method of claim 31 , wherein the words and phrases of interest comprise any phrase that occurs more than a threshold number of times within the one or more pieces of the selected text.
35. The method of claim 31 , further comprising displaying the words and phrases of interest to a user for use in communication.
36. The method of claim 35, wherein the words and phrases of interest are displayed to the user in a scrolling view.
37. The method of claim 36, wherein the speech generation device is configured to generate a text-to-speech output when at least one of the words and phrases of interest is selected from the scrolling view.
38. The method of claim 36, wherein the speech generation device is configured to insert at least one of the words and phrases of interest into a document when the at least one of the words and phrases of interest is selected from the scrolling view.
39. The method of claim 31 , further comprising adding the words and phrases of interest to one or more vocabulary lists stored on a computer-readable medium of the speech generation device.
40. The method of claim 31 , further comprising adding the words and phrases of interest into an internal dictionary stored on a computer-readable medium of the speech generation device for use with word prediction.
41. The method of claim 31 , further comprising receiving an input indicating access of an internet-based vocabulary source using an integrated web browser of the speech generation device.
42. The method of claim 31 , further comprising augmenting the words and phrases of interest with identification information.
43. The method of claim 42, wherein the identification information comprises an image or symbol stored on a computer readable medium of the speech generation device.
44. A speech generation device, comprising:
a display device for displaying images to a viewer;
a wireless network adaptor for providing access to a network;
a processor communicatively coupled to the display device and the wireless network adaptor; and
a computer-readable medium for storing instructions executable by the
processor;
wherein the instructions stored on the computer-readable medium configure the speech generation device to implement the method of claim 31.
45. A hand-held speech generation device, comprising:
an outer housing, the outer housing including a top casing component and a bottom casing component;
first and second convex extensions extending from the bottom casing
component, the first and second convex extensions being ergonomically configured to facilitate location and grip of a user's fingers; and
a processor and related computer-readable medium for storing instructions executable by the processor, the processor and computer-readable medium being disposed in the outer housing;
wherein the instructions stored on the computer- read able medium configure the speech generation device to generate text-to-speech output.
46. The hand-held speech generation device of claim 45, further comprising a speaker grille forming a front surface of the speech generation device.
47. The hand-held speech generation device of claim 46, wherein the first and second convex extensions extend along the bottom casing component from the speaker grille in a direction of a rear surface of the speech generation device.
48. The hand-held speech generation device of claim 47, wherein the first and second convex extensions taper in size from the speaker grille in the direction of the rear surface of the speech generation device.
49. The hand-held speech generation device of claim 45, further comprising first and second convexly shaped opposing sides, the first and second opposing sides being at least partially defined by the top and bottom casing components.
50. The hand-held speech generation device of claim 49, wherein the first and second opposing sides define a predetermined thickness, the predetermined thickness being configured such that the first and second opposing sides fit within the palms of a user's hands.
51. The hand-held speech generation device of claim 49, wherein the first convex extension is disposed substantially adjacent to the first opposing side and the second convex extension is disposed substantially adjacent to the second opposing side.
52. The hand-held speech generation device of claim 45, further comprising one or more printed circuit boards disposed within the outer housing, the one or more printed circuit boards being configured as structural components and providing a mounting surface for one or more internal components of the speech generation device.
53. The hand-held speech generation device of claim 45, wherein an upper peripheral surface of the top casing component is formed from a material selected to highlight an outer perimeter of the speech generation device.
54. The hand-held speech generation device of claim 53, wherein the material comprises chrome.
55. The hand-held speech generation device of claim 45, wherein the outer housing is formed from a substantially rigid and lightweight materia!.
56. The hand-held speech generation device of claim 55, wherein the substantially rigid and lightweight material comprises a plastic material, a thermoplastic material, a polymer material, a polyethylene material or a resin material.
57. The hand-held speech generation device of claim 45, wherein the outer housing is formed from magnesium or a magnesium alloy.
58. The hand-held speech generation device of claim 57, wherein the processor is thermally coupled to a portion of the outer housing.
59. The hand-held speech generation device of claim 45, wherein the outer housing is configured such that audio output from the speech generation device radiates outwardly from the user when the user properly positions his hands around the speech generation device.
60. The hand-held speech generation device of claim 45, wherein the top casing component defines an opening configured to provide visual access to a display panel of the speech generation device.
61. The hand-held speech generation device of claim 60, wherein the top casing component further defines a recessed area configured to accommodate a touch screen, the touch screen being transparent so as to permit the display panel to be viewed through the touch screen,
62. The hand-held speech generation device of claim 61 , wherein the touch screen comprises a capacitive touch screen.
63. The hand-held speech generation device of claim 45, further comprising: a battery for powering the speech generation device;
wherein the bottom casing component comprises first and second bottom casing components; and
wherein the battery is stored between the first and second bottom casing components such that the second bottom casing component forms the bottommost surface of the outer housing.
64. The hand-held speech generation device of claim 45, wherein the first and second convex extensions each define a concavely shaped recessed area.
65. A method of implementing speech characterizations to the speech output of a speech generation device, the method comprising:
receiving an input associating one or more speech characterizations to selected text;
analyzing the selected text and the one or more associated speech
characterizations to determine a text-to-speech engine for outputting the selected text and the one or more associated speech characterizations; and
generating a text-to-speech output utilizing the determined text-to-speech engine.
66. The method of claim 65, wherein the one or more speech
characterizations comprise at least one of speech styles or speech effects.
67. The method of claim 65, wherein the one or more speech
characterizations comprise one or more speech styles, the one or more speech styles being defined by at least one of voice, volume, pitch, speech rate, spell mode, emphasis or context.
68. The method of claim 65, wherein the one or more speech
characterizations comprise one or more speech effects, the one or more speech effects comprising at least one of an emotional effect or a non-verbal communication noise.
69. The method of claim 65, wherein the input associating the one or more speech characterizations to the selected text is embedded directly as an inflection markup of the selected text.
70. The method of claim 69, wherein the inflection markup corresponds to a speech style or speech effect to be applied to the selected text when generating the text-to-speech output.
71. The method of claim 69, wherein the inflection markup of the selected text is stored as a document in a computer readable medium of the speech generation device to permit the inflection markup to be subsequently recalled and modified.
72. The method of claim 69, wherein the inflection markup of the selected text is displayed in a message window of the speech generation device.
73. The method of claim 65, wherein the selected text and the one or more associated speech characterizations are analyzed by an interpretation engine,
74. The method of claim 73, wherein the interpretation engine is configured to examine the selected text and the one or more associated speech characterizations to determine an optimum text-to-speech engine for outputting the selected text and the one or more associated speech characterizations.
75. The method of claim 65, wherein a plurality of text-to-speech engines are stored as software instructions on a computer-readable medium of the speech generation device.
76. The method of claim 65, wherein the text-to-speech output is generated using one or more speakers of the speech generation device.
77. A speech generation device, comprising:
a display device for displaying images to a viewer;
one or more speakers for generating audio outputs;
a processor communicatively coupled to the display device and the one or more speakers; and
a computer-readable medium for storing instructions executable by the processor;
wherein the instructions stored on the computer-readable medium configure the speech generation device to implement the method of claim 65.
78. A speech generation device, comprising
an outer housing, at least a portion of the outer housing being formed from magnesium or a magnesium alloy; and a processor and related computer-readable medium for storing instructions executable by the processor, the processor and computer-readable medium being disposed in the outer housing;
wherein the instructions stored on the computer-readable medium configure the speech generation device to generate text-to-speech output.
79. The speech generation device of claim 78, wherein the outer housing comprises a top casing component and a bottom casing component, the at least a portion of the outer housing comprising at least one of the top casing component or the bottom casing component.
80. The speech generation device of claim 79, further comprising:
a battery for powering the speech generation device;
wherein the bottom casing component comprises first and second bottom casing components; and
wherein the battery is stored between the first and second bottom casing components such that the second bottom casing component forms the bottommost surface of the outer housing.
81. The speech generation device of claim 78, wherein the outer housing is formed from an injection molding process.
82. The speech generation device of claim 78, wherein the processor is thermally coupled to the at least a portion of the outer housing.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2010/041799 WO2011011224A1 (en) | 2009-07-24 | 2010-07-13 | Hand-held speech generation device |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US22825609P | 2009-07-24 | 2009-07-24 | |
US61/228,256 | 2009-07-24 | ||
PCT/US2010/041799 WO2011011224A1 (en) | 2009-07-24 | 2010-07-13 | Hand-held speech generation device |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2011011224A1 true WO2011011224A1 (en) | 2011-01-27 |
Family
ID=48670158
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2010/041799 WO2011011224A1 (en) | 2009-07-24 | 2010-07-13 | Hand-held speech generation device |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2011011224A1 (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150195653A1 (en) * | 2014-01-03 | 2015-07-09 | Fugoo Corporation | Audio architecture for a portable speaker system |
WO2015103312A1 (en) * | 2014-01-04 | 2015-07-09 | Fugoo Corporation | Configurable portable sound systems with interchangeable enclosures |
USD777146S1 (en) | 2013-12-10 | 2017-01-24 | Fugoo Corporation | Jacket for portable speaker |
USD777147S1 (en) | 2013-12-10 | 2017-01-24 | Fugoo Corporation | Jacket for a portable speaker |
USD777709S1 (en) | 2013-12-10 | 2017-01-31 | Fugoo Corporation | Jacket for a portable speaker |
US9641921B2 (en) | 2014-01-03 | 2017-05-02 | Fugoo Corporation | Speaker core with removable jacket |
CN113485551A (en) * | 2014-05-31 | 2021-10-08 | 苹果公司 | Message user interface for capture and transmission of media and location content |
US12089121B2 (en) | 2015-08-14 | 2024-09-10 | Apple Inc. | Easy location sharing |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030022701A1 (en) * | 2001-07-25 | 2003-01-30 | Aloke Gupta | Buttonless communication device with touchscreen display |
US20060069567A1 (en) * | 2001-12-10 | 2006-03-30 | Tischer Steven N | Methods, systems, and products for translating text to speech |
US20060169314A1 (en) * | 2005-01-31 | 2006-08-03 | Yamaha Corporation | Artificial inner ear and thermoelectric generator therefor |
US20060284839A1 (en) * | 1999-12-15 | 2006-12-21 | Automotive Technologies International, Inc. | Vehicular Steering Wheel with Input Device |
US20070124144A1 (en) * | 2004-05-27 | 2007-05-31 | Johnson Richard G | Synthesized interoperable communications |
US20070260460A1 (en) * | 2006-05-05 | 2007-11-08 | Hyatt Edward C | Method and system for announcing audio and video content to a user of a mobile radio terminal |
US20080154601A1 (en) * | 2004-09-29 | 2008-06-26 | Microsoft Corporation | Method and system for providing menu and other services for an information processing system using a telephone or other audio interface |
US20090063154A1 (en) * | 2007-04-26 | 2009-03-05 | Ford Global Technologies, Llc | Emotive text-to-speech system and method |
-
2010
- 2010-07-13 WO PCT/US2010/041799 patent/WO2011011224A1/en active Application Filing
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060284839A1 (en) * | 1999-12-15 | 2006-12-21 | Automotive Technologies International, Inc. | Vehicular Steering Wheel with Input Device |
US20030022701A1 (en) * | 2001-07-25 | 2003-01-30 | Aloke Gupta | Buttonless communication device with touchscreen display |
US20060069567A1 (en) * | 2001-12-10 | 2006-03-30 | Tischer Steven N | Methods, systems, and products for translating text to speech |
US20070124144A1 (en) * | 2004-05-27 | 2007-05-31 | Johnson Richard G | Synthesized interoperable communications |
US20080154601A1 (en) * | 2004-09-29 | 2008-06-26 | Microsoft Corporation | Method and system for providing menu and other services for an information processing system using a telephone or other audio interface |
US20060169314A1 (en) * | 2005-01-31 | 2006-08-03 | Yamaha Corporation | Artificial inner ear and thermoelectric generator therefor |
US20070260460A1 (en) * | 2006-05-05 | 2007-11-08 | Hyatt Edward C | Method and system for announcing audio and video content to a user of a mobile radio terminal |
US20090063154A1 (en) * | 2007-04-26 | 2009-03-05 | Ford Global Technologies, Llc | Emotive text-to-speech system and method |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
USD777146S1 (en) | 2013-12-10 | 2017-01-24 | Fugoo Corporation | Jacket for portable speaker |
USD777147S1 (en) | 2013-12-10 | 2017-01-24 | Fugoo Corporation | Jacket for a portable speaker |
USD777709S1 (en) | 2013-12-10 | 2017-01-31 | Fugoo Corporation | Jacket for a portable speaker |
US20150195653A1 (en) * | 2014-01-03 | 2015-07-09 | Fugoo Corporation | Audio architecture for a portable speaker system |
US20150195652A1 (en) * | 2014-01-03 | 2015-07-09 | Fugoo Corporation | Portable stereo sound system |
US9641921B2 (en) | 2014-01-03 | 2017-05-02 | Fugoo Corporation | Speaker core with removable jacket |
US9668054B2 (en) | 2014-01-03 | 2017-05-30 | Fugoo Corporation | Audio architecture for a portable speaker system |
US9668039B2 (en) | 2014-01-03 | 2017-05-30 | Fugoo Corporation | Shock absorbent speaker system |
US10091584B2 (en) | 2014-01-03 | 2018-10-02 | Fugoo Corporation | Audio architecture for a portable speaker system |
WO2015103312A1 (en) * | 2014-01-04 | 2015-07-09 | Fugoo Corporation | Configurable portable sound systems with interchangeable enclosures |
CN113485551A (en) * | 2014-05-31 | 2021-10-08 | 苹果公司 | Message user interface for capture and transmission of media and location content |
US12089121B2 (en) | 2015-08-14 | 2024-09-10 | Apple Inc. | Easy location sharing |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11003325B2 (en) | Mobile terminal and method for controlling the same | |
WO2011011224A1 (en) | Hand-held speech generation device | |
US10592103B2 (en) | Mobile terminal and method for controlling the same | |
CN105320736B (en) | For providing the device and method of information | |
RU2420028C2 (en) | Mobile terminal and method of user interface provision when used | |
KR101545582B1 (en) | Terminal and method for controlling the same | |
KR101612788B1 (en) | Mobile terminal and method for controlling the same | |
EP2138928B1 (en) | Haptic effect provisioning for a mobile communication terminal | |
CN107463247B (en) | Text reading processing method and device and terminal | |
US20190340200A1 (en) | Multi-modal interaction between users, automated assistants, and other computing services | |
US20090166098A1 (en) | Non-visual control of multi-touch device | |
US11553070B2 (en) | Dynamic user interface schemes for an electronic device based on detected accessory devices | |
US20110161067A1 (en) | System and method of using pos tagging for symbol assignment | |
KR20090129192A (en) | Mobile terminal and voice recognition method | |
CN111524501B (en) | Voice playing method, device, computer equipment and computer readable storage medium | |
US9342213B2 (en) | Mobile terminal and method of controlling the same | |
CN105556937A (en) | Mobile terminal and control method therefor | |
KR101914548B1 (en) | Mobile terminal having function of voice recognition and method for providing search results thereof | |
CN110634336A (en) | Method and device for generating audio electronic book | |
WO2011082053A1 (en) | System and method of using a sense model for symbol assignment | |
CN108256108A (en) | A kind of processing method and terminal for showing content | |
CN103003874A (en) | Provisioning text services based on assignment of language attributes to contact entry | |
KR101151865B1 (en) | Portable communication device for auditory disabled | |
KR101495183B1 (en) | Terminal and method for controlling the same | |
CN109725798A (en) | The switching method and relevant apparatus of Autonomous role |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10802665 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 10802665 Country of ref document: EP Kind code of ref document: A1 |