
US20060080083A1 - Method and apparatus for integrated, user-directed web site text translation - Google Patents


Info

Publication number
US20060080083A1
Authority
US
United States
Prior art keywords
user
text
information
server
related information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/292,871
Inventor
Carol Lin
Jacob Kjelstrup
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wordsniffer Inc
Original Assignee
Wordsniffer Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wordsniffer Inc
Priority to US11/292,871
Publication of US20060080083A1
Legal status: Abandoned

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00: Handling natural language data
    • G06F40/40: Processing or translation of natural language
    • G06F40/58: Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation

Definitions

  • This invention relates to an apparatus and integrated method for acquisition of information relating to specific user-selected text of World Wide Web site pages, and more particularly to an apparatus and site-integrated method for providing high quality multi-lingual translations, explanations, and consolidated automatic multi-dictionary definitions of such text, in text, voice, image or multi-media format.
  • Globalization and strategic partnering provide businesses with means for lowering costs and increasing revenue. Globalization of commerce is increasingly viewed as a requirement for businesses to remain viable in competitive markets.
  • The Internet, in this competitive context, has developed not only as an important medium for entertainment and information exchange, but also as a forum for facilitating or conducting commerce, including global commerce.
  • Web sites can be grouped roughly into one of two system categories: (1) provider-driven (i.e., content provider- or product seller-driven) or (2) consumer-driven (i.e., content or product consumer-driven).
  • the difference between provider- and consumer-driven web sites or systems lies not only in which party takes the initiative to find the other, but also in which data set is used as the basis for relating the products, services or information.
  • In a provider-driven system, the provider posts content in a single language, and holds itself out to the consumer using untargeted or generalized consumer data.
  • In a consumer-driven system, the consumer seeks out the seller using real (i.e., qualified, tailored or targeted) consumer data.
  • a typical provider's Internet site is a passive advertising site that provides information about the content/product in a single language (usually English) and directs a generalized base of potential consumers to places where they can obtain the content/product.
  • a smaller number of providers have active sites that not only provide information, but also allow potential consumers to conduct a transaction over the Internet to obtain consumer-specific information or to purchase a product.
  • the provider categorizes the content/prices, packages the information/product, and then holds it out for consumption or sale to potential consumers via the Internet. Typically, this information is presented and accessed in a single language on a given site.
  • Locating the desired content/product in provider-driven systems is typically a time-consuming process and often frustrating. Potential consumers must initially find the provider site, search and identify the content/product(s), and either aggregate (e.g., down-load and store) the desired information, or accept the seller's product terms or find another seller whose terms are acceptable. Information content on provider-driven sites is not typically tailored to the needs of particular consumers. For example, a potential consumer may fail to identify product information, or complete a product-related transaction if such consumer does not understand the language (e.g., English, Chinese, etc.) or technical terms of the commercial site.
  • a typical consumer has neither the time nor the reference resources (e.g., specialized dictionaries, etc.) to effectively translate and research the language and technical or specialized terms of art associated with a given commercial web site to enable educated, informed decision making.
  • the consumer would need to exit the site and spend considerable time accessing and consolidating the reference information. This has a negative impact on the “stickiness” of a site, because after such off-site activity, there is a high likelihood that the potential consumer will never return to the commercial web site, having been effectively deterred by the daunting tasks of translation and consolidation of information.
  • Full text machine translation (e.g., using Systran, IBM, SDL, etc.) is relatively less expensive, but typically yields laughable results.
  • Machine translation attempts to translate full sentences and paragraphs, but currently provides poor results because of contextual complexity and ambiguity.
  • Significantly, such flawed machine translations can damage corporate image and/or incur liability risk resulting from mistranslation.
  • Some improvement in machine translation, and/or liability reduction, might potentially be gained if the translation, or parts thereof, were to proceed by executable modules downloaded to the user. However, such modules would likely be large and take a long time for a user to download, especially if the user has a slow connection to the Internet.
  • Consumer-driven systems. The use of client- or consumer-driven systems to reach a more global consumer base has also been contemplated.
  • In a consumer-driven system, the consumer typically specifies the desired content requirements, or the product purchase terms, and then “posts” this specification for provider access.
  • Some examples of consumer-driven systems on the Internet are “news groups” and “bulletin boards,” where any potential consumer can post “wanted” advertisements at little or no cost.
  • Consumer-driven Internet systems have inherent benefits that other systems do not. For example, where a large number of potential providers exist, but those providers do not have the resources to advertise globally it makes sense for consumers, if they can, to take the initiative in communicating their needs to them. Theoretically, the Internet is an ideal medium upon which to address this situation because Internet postings are global, and the posted consumer data theoretically communicates a need/offer to a large number of potential providers. Moreover, such consumers can, at least in principle, exercise more control in limiting content to desired categories, or in setting the terms and conditions of their specific purchases, by strategically tailoring their posted consumer data.
  • each consumer typically has different content requirements or purchasing specifications, and communicates his or her needs to providers using a non-standard format using, e.g., different languages, features, attributes, conditions, terms, or language styles. Translating, interpreting and addressing such non-standard specifications simply requires too much time of the typical provider.
  • Consumers are likewise deterred from using current consumer-driven systems. For example, the potential consumer may refrain from using such a system where he or she has little or no control over the nature and amount of content provided in response to the particular consumer data posted. Additionally, the consumer may be reluctant to post content/product/service needs where he or she has little or no experience with the particular content/product/service and is unable to effectively address the lack of knowledge. These concerns are compounded where the desired information content or the product being sold is available in multiple, highly technical, complex data or product configurations.
  • Client/consumer-driven systems, like full-text machine translation or human localization approaches, do not address the globalization problems for businesses with commercial web pages that post content and/or transact in a single language format.
  • RichLink is a product that allows a client to customize definitions of particular words and phrases that occur on a web page.
  • the process is not automatic and to do so, the client must first leave the web page to obtain the definition from individual separate sources, and then the definition must be manually entered and maintained on the web page.
  • Arabic provides limited translation of user-selected words, but is not integrated into providers' web pages, and requires downloaded software to be resident and executable on the client side.
  • An object of the present invention is to provide an integrated method, over a wide area network comprising a server side and a client side, for user-directed acquisition of information relating to user-selected text of World Wide Web site pages.
  • the inventive apparatus and methods provide a cost-effective apparatus and web site-integrated method for providing high quality simultaneous searching of multiple dictionaries to obtain multi-lingual translations, explanations, and consolidated automatic multi-dictionary definitions of user-selected text, in text, voice, image or multi-media format.
  • the method comprises: selecting, by a user, a word of web page text, whereby the user-selected word is accessible by an information retrieval function; selecting, by a user, from a plurality of reference sources or translation language pairs, a reference source or a translation language pair, whereby the user-selected reference source or user-selected translation language pair is accessible by an information retrieval function; accessing, in accordance with the user-selected reference source or user-selected translation language pair, and using the information retrieval function, information relating to the user-selected word; and displaying to the user the information related to the user-selected word, whereby user-directed acquisition of information relating to user-selected text of World Wide Web site pages is achieved.
  • the method steps are integrated into a server-side web site by either incorporating into the site a reference window linked to the information retrieval function, or linked pull down menus, or by installing programmed script on all web pages of the site to link and enable the information retrieval function.
  • the method steps are integrated into a client-side system using downloadable software, or a browser plug-in that is linked to the information retrieval function.
  • selecting a word of web page text is achieved by either typing or pasting a copied word into a window linked to an information retrieval function, or by highlighting the word whereby such highlighting renders the user-selected word accessible by an information retrieval function.
  • the reference source comprises one or a plurality of dictionaries.
  • information relating to user-selected text of World Wide Web site pages consists of high quality multi-lingual translations, explanations, or consolidated automatic multi-dictionary definitions, wherein such information is displayed to the user in text, voice, image or multi-media formats.
  • the information retrieval function further comprises an Internet search engine operative with the information retrieval function to search the Internet in accordance with the user-selected text, and the user-selected reference source or user-selected translation language pair, whereby the information retrieval function is augmented.
  • An alternate embodiment provides a method, over a wide area network comprising a server and a client, for user-directed acquisition of information relating to user-selected text of World Wide Web site pages, comprising: receiving from the client a user-selected word of World Wide Web page text; receiving from the client a user-selected reference source or user-selected translation language pair; accessing information relating to the user-selected word, in accordance with the user-selected reference source or user-selected translation language pair; and transmitting to the client the accessed information relating to the user-selected word.
  • the method further comprises an Internet search engine operative with server to search the Internet in accordance with the user-selected text, and the user-selected reference source or user-selected translation language pair.
  • An additional embodiment provides an apparatus for user-directed acquisition, over a wide area network comprising a server side and a client side, of information relating to user-selected text of World Wide Web site pages.
  • the apparatus comprises: a server connected to the network, wherein the server comprises a processor, a memory, and a storage device, and wherein the processor and memory are operative with a program stored on the storage device to receive a user-selected word of World Wide Web page text; receive a user-selected reference source or user-selected translation language pair; access, in accordance with the user-selected reference source or user-selected translation language pair, information relating to the user-selected word; and transmit the accessed information relating to the user-selected word to the client-side user.
  • the apparatus comprises an Internet search engine operative with the server to search the Internet in accordance with the user-selected text, and the user-selected reference source or user-selected translation language pair.
  • communication proceeds between a “master” user, having a subscribing wireless device (e.g., phone), and a “slave” user, having either a plug-in headset connected to the master device, or having another wireless device accessible by the master device.
  • the system is implemented using a plurality of “slave” devices, in connection with one or more master “devices” receiving messages in one or more translated languages.
  • master users may retrieve information related to selected voice or text, where such related information comprises high quality multi-lingual translations, explanations, illustrations and/or consolidated automatic multi-dictionary definitions (“look-ups”), and wherein such information is displayed to the user in text, voice, image or multi-media formats that are compatible with the user's wireless device, and with the user's native language or character set.
  • FIG. 1 is a screen shot of a typical provider-driven Web page, illustrating a server-side Web site-integrated embodiment of the inventive method for user-directed acquisition of user-selected Web page text.
  • Various fields (10, 12 and 14) in a Web page-integrated “reference” window 16, or one or more fields 20 in an integrated “result” window 18, are used to select text and user preferences, and to display retrieved information, such as a language translation (e.g., English→Chinese), related to the user-selected text (e.g., word-by-word translation, according to the present invention).
  • the method involves user selection of Web page text, further user selection of preferences for specific reference sources and/or translation language pairs from a plurality of such reference sources and/or languages, and acquisition of related information from a linked information retrieval function, according to the user selected preferences.
  • the method occurs in real-time, and does not require the user to exit the provider-driven Web site.
  • FIG. 2 is a screen shot, similar to that of FIG. 1, of a typical provider-driven Web page, illustrating another server-side Web site-integrated embodiment of the inventive method for user-directed acquisition of user-selected Web page text.
  • Information related to user-selected Web page text (i.e., a dictionary “look-up,” or definition, of “otolaryngologist”) that is acquired via the linked information retrieval function is displayed in a field 20 of the Web site-integrated “reference” window 16.
  • FIG. 3 is a screen shot, similar to that of FIGS. 1 and 2, of a typical provider-driven Web page, illustrating an additional server-side Web site-integrated embodiment of the inventive method for user-directed acquisition of user-selected Web page text.
  • Web site integration is accomplished by installing executable script (e.g., JavaScript™) on all web pages of the site to link and enable the information retrieval function.
  • A ‘right-click’ on highlighted text 30 using a mouse can be used to activate one or more pop-up menus for user selection of particular reference source and/or translation language preferences (in this example, the English→Chinese language pair preference was selected), and to enable activation of the information retrieval function to provide for a displayed translation 36 in one or more window fields using a Chinese character set.
  • FIG. 4 is a screen shot of another server-side Web site-integrated embodiment of the present invention similar to that illustrated in FIG. 3 .
  • the integrated web page is displayed in the upper half of the figure.
  • a right click of a mouse was used to activate pop-up menus, select user preferences, and acquire information related to user-selected Web site text (in this case, translation of legal-related text from a court opinion into Chinese characters).
  • FIG. 5 shows a high-level flow diagram of the inventive method steps associated with the server-side Web page-integrated embodiments shown in FIGS. 3 and 4 .
  • a “mouse” right-click is made on selected text 50 .
  • One or more resulting “pop-up” menus 52 allow a user to select preferences 54 for reference source (e.g., dictionary) and/or translation language pair (e.g., English→German), and to activate 56 an information retrieval function to look up and translate the selected word into the selected language, which is displayed in an integrated window field 58.
  • the English word “dog” is translated into German (“Hund”), according to one embodiment of the inventive method.
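The flow of FIG. 5 (select text, choose a language pair, activate the look-up, display the result) can be sketched as a small function. This is a minimal sketch and not the patent's implementation: `LookupRequest`, `lookup`, and the in-memory `DICTIONARIES` table are hypothetical names, and the only entry is the dog/Hund example given above.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical in-memory reference data keyed by (source, target) language pair.
# A real information retrieval function would consult server-side dictionaries.
DICTIONARIES = {
    ("en", "de"): {"dog": "Hund"},
}


@dataclass(frozen=True)
class LookupRequest:
    word: str         # the user-selected text
    source_lang: str  # first half of the user-selected language pair
    target_lang: str  # second half of the user-selected language pair


def lookup(request: LookupRequest) -> Optional[str]:
    """Return the translation of the selected word, or None if it is unknown."""
    table = DICTIONARIES.get((request.source_lang, request.target_lang), {})
    return table.get(request.word.strip().lower())
```

Calling `lookup(LookupRequest("dog", "en", "de"))` returns the string that would be shown in the integrated window field; an unknown word or language pair yields `None`, which a caller could render as "no result".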
  • FIG. 6 is a high-level flow diagram of the inventive method steps associated with an alternate client-side embodiment of the present invention.
  • An end user downloads a browser “plug-in” which allows the user to use “mouse” right-clicks 60 on user-selected text 62 to select reference 64 and language 66 preferences, and to retrieve information 68 (here, a translation into German) related to user-selected Web page text on any Web page, including pages on those Web sites that do not have a server-side embodiment of the inventive method.
  • FIG. 7A shows an embodiment of the present invention enabling combined translation and Internet search functions in accordance with particular user-selected Web site text and/or information source and/or language pair preferences.
  • A Chinese-speaking user activates (according to the present invention) the information retrieval function 70, in accordance with one or more particular translation language preferences, to simultaneously translate a user-selected text (“key word”) into one or more languages/character sets 72 (e.g., using TurboDictionary™, according to the present invention).
  • The translations are accessed by an Internet search engine (e.g., I-Search®) to enable a user to simultaneously search the Internet in multiple languages 74, based on the user's native language and selected key word.
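The combined translate-and-search idea of FIG. 7A can be illustrated by expanding one key word into one query per language. The sketch below is only an illustration under stated assumptions: `expand_query` and the `TRANSLATIONS` table are hypothetical, an English key word stands in for the Chinese-speaking user's key word, and no real search engine is called.

```python
from typing import Dict, Iterable

# Hypothetical translation tables keyed by (source, target) language pair.
TRANSLATIONS = {
    ("en", "de"): {"dog": "Hund"},
    ("en", "fr"): {"dog": "chien"},
    ("en", "es"): {"dog": "perro"},
}


def expand_query(keyword: str, source: str, targets: Iterable[str]) -> Dict[str, str]:
    """Map each language code to the query string to submit in that language.

    The source-language key word is always included; target languages with no
    known translation are skipped rather than guessed.
    """
    queries = {source: keyword}
    for target in targets:
        translated = TRANSLATIONS.get((source, target), {}).get(keyword.lower())
        if translated is not None:
            queries[target] = translated
    return queries
```

Each value in the returned mapping would then be passed to the search engine, and the per-language results merged for display alongside the translations.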
  • FIG. 7B shows a more detailed diagram of the combined translation and Internet search embodiment of FIG. 7A , including certain structural elements.
  • the core “translate” and display embodiment of the inventive method is shown in combination with an automated Internet search, using the related information data set to obtain related Internet search information which is displayed to the user along with the related information data set (combined path arrows 77 and 79 ).
  • the core server-side translation and search functionality of the information retrieval function is shown enclosed in dotted lines 73 , with user selection and display occurring on the client side.
  • FIG. 8 shows apparatus for a wireless embodiment of the present invention. Communication proceeds between a “master” user, having a cellular phone 80 subscribing to the TurboTalk™ service (described herein), and a “slave” user, having either a plug-in headset 82 connected to the master device, or having another wireless device accessible by the master device.
  • FIG. 8 shows such a WAP server-based wireless method embodiment.
  • a “master” user 84 selects one or more translation language pairs using a GSM phone 86 accessible by a WAP server 88 hosting the inventive information retrieval function.
  • the user sends a voice message using the cellular phone, and the message is processed using voice-recognition software stored in the memory of the WAP server and operative with a processor of the server to recognize the voice and process it into text.
  • the information retrieval function translates the text, which is then converted to voice by the voice recognition function, and delivered as a translated voice message over the wireless network to the “slave” users 88 .
  • FIG. 9 shows a simpler WAP-based wireless embodiment involving a single “master” user and a single “slave” user.
  • The master user selects a single translation language pair, and sends a voice message 90 using the cellular phone 92, and the message is processed using voice-recognition software stored in the memory of a WAP server and operative with a processor of the server to recognize the voice and process it into text.
  • the information retrieval function translates the text, which is then converted to voice by the voice recognition function, and delivered as a translated voice message 94 over the wireless network to the “slave” user.
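The wireless embodiments describe a three-stage relay: voice to text, text translation, and text back to voice. A minimal sketch of that pipeline follows, with each stage injected as a callable. The function name `relay_voice_message` and its parameters are hypothetical, and treating speech synthesis as a stage separate from recognition is an assumption of this sketch; no real speech or translation library is used.

```python
from typing import Callable


def relay_voice_message(
    audio: bytes,
    recognize: Callable[[bytes], str],
    translate: Callable[[str], str],
    synthesize: Callable[[str], bytes],
) -> bytes:
    """Run one message through the recognize -> translate -> synthesize stages."""
    text = recognize(audio)        # server-side voice recognition: voice to text
    translated = translate(text)   # information retrieval function: translate text
    return synthesize(translated)  # convert translated text back to voice
```

Because the stages are passed in, the same function covers the single-recipient case of FIG. 9 and can be called once per recipient language in the multi-recipient case.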
  • the embodiments illustrate various ways in which the present invention can be implemented—in this case, an apparatus and integrated method for user-directed acquisition of information relating to user-selected text of World Wide Web (“WEB”) site pages, and an apparatus and integrated method, over a wireless network comprising a server side (e.g., a WAP-based server) and a client side (e.g., cellular phones, or other PDA wireless devices), for user-directed acquisition of real-time translation and reference services.
  • the invention is particularly useful in those instances where the language and character-set of the Web site differ from the native language and character set of the user.
  • The present invention affords typical provider-driven web sites an immediate, accurate, and low-cost multilingual presence by expanding their site audience to include other language speakers. Additionally, integration of reference and/or result windows and fields increases the number of site page views, thus providing more advertising revenue. The present invention will enhance revenue by increasing sales to other language speakers, because such users will be enabled to navigate the Web site and educate themselves about the products in their own languages to make informed transactional decisions. Furthermore, the present invention provides a novel method to collect business intelligence and understand foreign markets, based on the words translated and/or on which products or related information are selectively accessed by particular language groups.
  • the embodiments of the inventive apparatus and method provide a practical and inexpensive means by which Web sites, such as typical provider-driven commercial Web sites (see “Background,” herein above), may be rendered more accessible by a global consumer base comprised of individual users, communicating in one of a plurality of native languages and native character sets. More specifically, the embodiments provide an apparatus and integrated method, over a wide area network comprising a server side and a client side, for user-directed acquisition of information relating to user-selected text of Web site pages, using the portion of the Internet known as the Web.
  • Such related information comprises high quality multi-lingual translations, explanations, illustrations and/or consolidated automatic simultaneous multi-dictionary definitions (“look-ups”), wherein such information is displayed to the user in text, voice, image or multi-media formats that correspond to, or are at least compatible with, the user's native language or character set.
  • a user connects to a Web site from their client computer (internet compatible PDA or other cellular or hand-held device).
  • The inventive apparatus and method achieve their efficiency, at least in part, by enabling a user to acquire reference information related to specific Web site text in the absence of full-text machine translation or human “localization” (see “Background,” herein above). This is because most users who are not native English speakers speak some English as their second or third language, and they need assistance primarily with vocabulary and not grammar.
  • the present invention offers text translation (e.g., word-by-word translation) and illustration that will effectively achieve the same purpose as a full-text machine translation or localization.
  • the method is integrated into a server-side web site, by either incorporating into the site a reference window or pull down menus linked to the inventive information retrieval function, or by installing programmed script on all web pages of the site to link and/or enable the information retrieval function.
  • the method steps are integrated into a client-side system using downloadable software or browser plug-ins that link user-selected text and preferences to the information retrieval function.
  • the method involves the user “selecting” or highlighting specific Web page text (e.g., word-by-word selection), and then further selecting specific reference sources and/or translation language pairs from a plurality of reference sources and/or translation language pairs.
  • the user-selected text and preferences are accessible by an information retrieval function comprising a server connected to the network, wherein the server comprises a processor, a memory, and a storage device, and wherein the processor and memory are operative with a software program stored on the storage device to: receive the user-selected Web page text (e.g., user-selected word); receive a user-selected reference source and/or user-selected translation language pair; access, in accordance with the user-selected reference source and/or user-selected translation language pair, information relating to the user-selected text; and transmit the accessed information relating to the user-selected text to the client-side user.
  • such information transmittal is in the user-selected translation language, which can be the user's native language.
  • the memory element of the information retrieval function comprises an extensive collection of dictionaries and reference books to enable retrieval of related information using a variety of languages, including but not limited to all European languages, Chinese, Korean, Japanese, Russian and Hebrew.
  • The information retrieval function may comprise one or more linked servers comprising one or a plurality of processors, a memory, and storage devices operably programmed and configured to carry out the information retrieval function.
  • FIG. 1 and FIG. 2 illustrate a first embodiment of the overall method a user goes through to acquire information relating to specific user-selected Web page text.
  • a Chinese user desires a translation into Chinese of the English term “osteoporosis,” present as text on a provider-driven, server-side Web site.
  • the user establishes a network connection between the client and the server.
  • the user “selects” the desired text by “copying” the word “osteoporosis” from the upper-left quadrant of the Web page, and “pasting” it to a text field 10 in a reference window 16 (present in the lower-left quadrant of the Web page) that is integrated into the Web page, and that is linked to an information retrieval function (discussed below).
  • the desired text may be typed into the text field 10 .
  • The user selects, from a plurality of reference sources (e.g., a professional medical dictionary) and/or translation language pairs (e.g., English→Chinese), a reference source and/or a translation language pair, whereby the user-selected reference source or user-selected translation language pair is accessible by the information retrieval function.
  • the user preference for a particular translation language pair is selected using a scrollable language pair field 12 of the linked dictionary window 16 .
  • Other selection means such as right-clickable “pop-up” menus (discussed herein, below) are also encompassed by embodiments of the present invention, and are familiar in the art.
  • the user accesses information relating to the user-selected text, in accordance with the user-selected reference and/or translation language preferences, by activating the information retrieval function. This is accomplished in the example of FIG. 1 by clicking on a “look up” button 14 within the dictionary window.
  • The information retrieval function comprises a server connected to the network, wherein the server comprises a processor, a memory, and a storage device, and wherein the processor and memory are operative with a program stored on the storage device to: receive user-selected Web page text (e.g., a selected word); receive a user-selected reference source preference (e.g., a professional medical dictionary) and/or a user-selected translation language pair preference (e.g., English→Chinese); access information relating to the user-selected text in accordance with the user-selected reference and/or language preferences; and transmit the accessed related information to the client-side user.
  • a list of desired dictionaries is selected from a dictionary list by the user to define (e.g. narrow) the search.
  • this is a standard linear list selected either explicitly by the user, or implicitly by the user by virtue of pre-stored user preferences accessible to the information retrieval function (e.g., by means of a stored ‘cookie’).
  • the inventive information retrieval function accesses the user-selected text and user preferences (e.g., reference sources, language pairs) to search (using art-recognized search engine technology), process, rank and sort related information stored in one or more data bases, and then transmits the accessed related information to the user. This is accomplished using various art-recognized algorithms and data structures.
  • hashing and/or Btree data structures are used to efficiently process matching (relevant) data sets from an initial search.
  • ‘hashing’ is used to quickly reduce or minimize the data set that matches a query term, enabling a simpler search within a relatively smaller data set.
  • ‘BTree’ data structures are used to reduce the frequency of disk access during a search.
  • a ‘BTree’ data structure uses nodes arranged in a tree structure to search for data that is contained in its nodes. ‘BTree’ nodes, compared with those of other tree-based data structures, are contained within relatively larger ‘pages’ that match the physical size of disk blocks, thereby reducing the number of disk access events for a given search.
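  • The page-oriented idea behind a ‘BTree’ can be illustrated with a deliberately simplified two-level index: keys are grouped into fixed-size sorted pages, and a small in-memory list of each page's first key selects the single page that must be read. This is a sketch under stated assumptions, not a full B-tree (there is no node splitting or rebalancing), and `PAGE_SIZE`, `build_pages` and `search` are hypothetical names.

```python
from bisect import bisect_right

PAGE_SIZE = 4  # hypothetical; a real B-tree page would match the disk block size


def build_pages(sorted_keys):
    """Group already-sorted keys into pages and record each page's first key."""
    pages = [sorted_keys[i:i + PAGE_SIZE]
             for i in range(0, len(sorted_keys), PAGE_SIZE)]
    first_keys = [page[0] for page in pages]
    return first_keys, pages


def search(first_keys, pages, key):
    """Return (found, pages_read); at most one page is read per search."""
    i = bisect_right(first_keys, key) - 1  # last page whose first key <= key
    if i < 0:
        return False, 0  # key sorts before every page: nothing to read
    return key in pages[i], 1
```

  • With ten keys and a page size of four, any search touches at most one of the three pages, which is the reduction in disk access events described above.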
  • the information retrieval function uses pointers. Each word within a phrase becomes a ‘child’ node that points back to the ‘parent’ node, which contains the phrase. For phrases consisting of a single word, there is no ‘child.’
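  • The parent/child relationship between phrases and their words can be sketched as a mapping from each word to the phrases that contain it. The function names below are hypothetical, and a dictionary of sets stands in for whatever pointer structure an actual implementation would use.

```python
from collections import defaultdict


def build_word_index(phrases):
    """Map each word ('child') to the multi-word phrases ('parents') containing it."""
    children = defaultdict(set)
    for phrase in phrases:
        words = phrase.lower().split()
        if len(words) < 2:
            continue  # a single-word phrase has no 'child' nodes
        for word in words:
            children[word].add(phrase)
    return children


def phrases_containing(children, word):
    """Follow the child-to-parent pointers for one word."""
    return sorted(children.get(word.lower(), set()))
```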
  • terms (words) used for searching are further refined by applying a number of transformations, including but not limited to: (i) stemming, to find roots of words (e.g., “jumped” becomes “jump”); (ii) stop word removal (e.g., in the case of English, removal of “a,” “the,” “an,” etc.); and (iii) word splitting into individual sub-words (e.g., “superheated” becomes “super” and “heated”).
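  • The three transformations can be sketched as below. The suffix-stripping rule is a naive stand-in for a real stemming algorithm, and the stop word, suffix and prefix lists are small hypothetical samples chosen only to reproduce the examples given above.

```python
STOP_WORDS = {"a", "an", "the"}       # sample English stop words
SUFFIXES = ("ing", "ed")              # naive stemming rules (assumption)
PREFIXES = ("super", "under", "over")  # sample splittable prefixes (assumption)


def stem(word):
    """Strip one known suffix, keeping a root of at least three letters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[:-len(suffix)]
    return word


def split_word(word):
    """Split a compound into a known prefix and the remaining sub-word."""
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= 3:
            return [prefix, word[len(prefix):]]
    return [word]


def refine(query):
    """Apply stop word removal, word splitting, then stemming to a query."""
    terms = []
    for raw in query.lower().split():
        if raw in STOP_WORDS:
            continue
        for part in split_word(raw):
            terms.append(stem(part))
    return terms
```

  • Under these rules, `refine("the superheated steam jumped")` produces the terms `super`, `heat`, `steam` and `jump`.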
  • the retrieved data set is analyzed for relevancy using various criteria including, but not limited to, hit counting within the term itself, proximity of words, exact vs. sub-phrase matching, and hit counting within word definition areas.
  • Each term within each dictionary is assigned a score, based on the relevancy algorithm.
  • a sort algorithm is applied to the ranked data set and the top N results are displayed to the user. The user may optionally select to see the next N results, and so on.
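  • Scoring, sorting and paging of the top N results might look like the following sketch. The point weights are arbitrary assumptions, word proximity is omitted for brevity, and sub-phrase matching is approximated by a plain substring test; none of these choices is taken from the disclosure.

```python
def score(query, headword, definition):
    """Score one dictionary entry against a query (weights are hypothetical)."""
    q = query.lower()
    q_words = q.split()
    head = headword.lower()
    points = 0
    if head == q:
        points += 10  # exact match
    elif q in head:
        points += 5   # sub-phrase match (substring approximation)
    head_words = head.split()
    def_words = definition.lower().split()
    points += 2 * sum(head_words.count(w) for w in q_words)  # hits in the term
    points += sum(def_words.count(w) for w in q_words)       # hits in the definition
    return points


def ranked_page(query, entries, n, page=0):
    """Return the page-th group of N headwords, best score first."""
    scored = [(score(query, h, d), h) for h, d in entries]
    scored = [item for item in scored if item[0] > 0]
    scored.sort(key=lambda item: (-item[0], item[1]))  # ties broken alphabetically
    start = page * n
    return [h for _, h in scored[start:start + n]]
```

  • Requesting `page=1` corresponds to the user electing to see the next N results.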
  • the retrieved ranked and sorted information data set relating to the user-selected word is displayed to the user using an integrated result window 18, or in the alternative using one or more result fields 20 within the integrated reference window 16 ( FIG. 2 ) or integrated result window 18 ( FIG. 1 ).
  • users have the ability not only to select which information sources are accessed, but may compare the results from alternate reference sources in real time without leaving the Web site.
  • Various types of information retrieval are encompassed by the present invention, including but not limited to high quality multi-lingual translations, explanations, illustrations and consolidated automatic multi-dictionary definitions, wherein such information is displayed to the user in text, voice, image or multi-media formats that correspond to, or are at least compatible with, the user's native language and/or character set.
  • the above-described method steps are integrated into a server-side Web site.
  • the provider-driven site enables its users to acquire information relating to user-selected text by hosting or integrating the inventive method and information retrieval function on the provider-driven site.
  • Such integration is achieved, for example, by incorporating into the site a reference window (e.g., like reference window 16 of FIG. 1 and FIG. 2 ) linked to the information retrieval function.
  • selecting Web page text is achieved by “pasting” a “copied” word into a text field 10 of a reference window 16 , linked to an information retrieval function. This requires that the server-side Web site be integrated with a window having a link to the information retrieval system and software.
  • FIGS. 3 and 4 illustrate an alternative embodiment of server-side Web site integration of the inventive method.
  • Web site integration is accomplished by installing executable script (e.g., JavaScript™) on all web pages of the site to link and enable the information retrieval function.
  • a right-click of highlighted text 30 using a mouse can be used to activate a pop-up menu for user selection of one or more particular reference source and/or translation language preferences (in these examples, the English→Chinese preference was selected), and to enable activation of the information retrieval function to provide a displayed translation 36 in one or more window fields using a Chinese character set.
  • FIG. 5 shows a high-level flow diagram of the inventive method steps associated with the server-side Web page-integrated embodiments shown in FIGS. 3 and 4 .
  • a “mouse” right-click is made on selected text 50 .
  • One or more resulting “pop-up” menus 52 allow a user to select preferences 54 for reference source (e.g., dictionary) and/or translation language pair (e.g., English→German), and to activate 56 an information retrieval function to look up and translate the selected word into the selected language, displayed in an integrated window field 58 .
  • the user takes the initiative to install or integrate the method steps into a client-side system using, for example, a downloadable browser “plug-in” that facilitates linking to the above-described information retrieval function.
  • a user can then use the inventive method on any provider-driven Web site, regardless of whether or not a server-side embodiment of the inventive method has been integrated into the provider-driven Web site.
  • FIG. 6 is a high-level flow diagram of the inventive method steps associated with an alternate client-side embodiment of the present invention.
  • An end user downloads a browser “plug-in,” which allows the user to use “mouse” right-clicks 60 on user-selected text 62 to select reference 64 and language 66 preferences, and to retrieve information 68 (here, a translation into German) related to user-selected Web page text on any Web page, including pages on those Web sites that do not have a server-side embodiment of the inventive method.
  • the user-selected text translation (or “look-up”) and retrieved information display language and character set may be the same as, or different from the language and character set of the Web page being translated or interpreted.
  • particular embodiments of the present invention have the character of a personalized (user-directed) education center, whereby users can effectively navigate, translate, interpret and perform transactions on provider-driven Web sites regardless of their native language or character set.
  • User-selectable reference sources include, but are not limited to, electronic databases comprising dictionaries, thesauruses, encyclopedias, digital images, and multi-media content (e.g., sound and video). Such reference sources may comprise data corresponding to a variety of languages and character sets, including but not limited to all European languages, Chinese, Korean, Japanese, Russian and Hebrew, and the character sets thereof.
  • An additional novel feature of the present invention is “data mining.”
  • the present invention can be combined using art-recognized statistical methods and algorithms to provide consumer data mining statistics according, for example, to country, language, topic and product.
  • data mining capability enhances the globalization utility of the present invention by allowing provider-driven sites, which have been traditionally limited to presenting content to an amorphous generalized consumer base (see under “Background,” above), to understand their audiences and market, and optimize presentation of posted content.
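  • As a minimal sketch of such statistics, a log of look-up requests could be tallied by any recorded attribute. The record fields (`country`, `language_pair`, `term`) and the function name `lookup_statistics` are assumptions made for illustration; the disclosure does not specify a log format.

```python
from collections import Counter


def lookup_statistics(log, field):
    """Count look-up requests grouped by one recorded attribute."""
    return Counter(record[field] for record in log)


# Hypothetical request log, one record per look-up.
LOG = [
    {"country": "CN", "language_pair": "en-zh", "term": "osteoporosis"},
    {"country": "CN", "language_pair": "en-zh", "term": "calcium"},
    {"country": "DE", "language_pair": "en-de", "term": "osteoporosis"},
]
```

  • Here `lookup_statistics(LOG, "term").most_common(1)` would identify the most frequently looked-up word, which a provider could use to decide where clearer wording or fuller localization is most needed.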
  • TurboDictionary™ is software that efficiently allows “on-line” acquisition of information (e.g., translation and dictionary look-ups) related to user-selected text, and according to user-selected preferences.
  • the method derives its novelty and utility, at least in part, by taking advantage of the fact that most on-line users that do not speak English as their native language are nonetheless familiar with it as a second or third language, and thus need primarily only help with vocabulary.
  • translation of user-selected text circumvents the necessity of full-text machine or human localization (see under “Background,” above), which is expensive and typically flawed.
  • the invention is implemented as an integrated method on the server-side, requiring little, if any client-side processing.
  • a consumer can access the Internet using a computer or electronic hand-held device.
  • the software program of the present invention is usable over a network, preferably over the Internet, and most preferably over the World Wide Web (“Web”).
  • the apparatus of the present invention encompasses a computer network comprising a server and at least one user (client) subsystem connected to the server via a network connecting means (e.g., a user modem).
  • the user modem can be any other communication means that enables network communication, for example, ethernet links.
  • the modem can be connected to the server by a variety of connecting means, including public telephone land lines, dedicated data lines, cellular links, microwave links, or satellite communication.
  • the server is essentially a high-capacity, high-speed computer that includes a processing unit connected to one or more relatable data bases or content sources, comprising, for example, electronic dictionaries, thesauruses, encyclopedias, digital images, multi-media content (e.g., sound and video, streaming video).
  • Such content may comprise data corresponding to a variety of languages and character sets, including but not limited to all European languages, Chinese, Korean, Japanese, Russian and Hebrew, and the character sets thereof. Additional databases are optionally added to the server.
  • the memory can be a separate, but connected component.
  • the communication hardware may be modems, ethernet connections, or any other suitable communication hardware.
  • although the server can be a single computer having a single processing unit, the server function can also be spread over several networked computers, each having its own processor and having one or more databases resident thereon.
  • the server can be a WAP-based server of a wireless network (see “Wireless Device Applications,” below).
  • the server further comprises an operating system and communication software allowing the server to communicate with other computers.
  • Various operating systems and communication software may be employed.
  • the operating system may be Microsoft Windows NT™, and the communication software may be a Microsoft IIS™ (Internet Information Server) server with associated programs.
  • the server is connected to the network, wherein the server comprises a processor, a memory, and a storage device, and wherein the processor and memory are operative with a program (e.g., herein referred to as TurboDictionary™) stored on the storage device to: receive a user-selected word of Web page text; receive user-selected reference source and/or user-selected translation language pair preferences; access information relating to the user-selected word in real time, and in accordance with the user-selected reference source or user-selected translation language pair; and transmit the accessed information relating to the user-selected word to the client-side user.
  • the software program of the present invention can be implemented as a stand-alone program on a particular Web site provider's server. Alternatively, the software may be implemented on an independent server that is linked to the provider's server.
  • the databases on, or connected to the server contain the information necessary to make the inventive apparatus and method work.
  • the information retrieval databases of the present invention are relatable, and are assembled and accessed using any commercially-available database software, such as Microsoft Access™, Oracle™, Microsoft SQL™ Version 6.5, etc., and other search engine technologies and algorithms as described herein above.
  • a user (client) subsystem generally includes a processor attached to a storage unit, a communication controller, and a display controller.
  • the display controller runs a display unit through which the user interacts with the subsystem.
  • the user subsystem is a computer able to run software providing a means for communicating with the server.
  • This software is, for example, an Internet web browser such as Microsoft Internet Explorer, Netscape Navigator, or another suitable Internet web browser.
  • the user subsystem can be a computer or hand-held electronic device, such as a telephone or wireless telephone or other PDA-type device that enables Internet access.
  • the information retrieval function of the present method and apparatus, including storage devices and server-side databases, is operative with Internet search engines, such as I-Search®, stored on the storage device to provide combined translation and Internet search functions in accordance with particular user-selected Web site text and/or information source and/or language pair preferences.
  • I-Search®, an art-recognized multi-language search engine (currently powering World-Trade-Search.com), is used in combination with software of the present invention (e.g., TurboDictionary™), but other art-recognized Internet search engines (e.g., Yahoo™, Google™, and AltaVista®) are productively used in combination with the instant method and apparatus, and are encompassed within the scope of the present invention.
  • both integrated server- and client-side implementations of such combined translation/search functions are contemplated.
  • FIGS. 7A and 7B illustrate an embodiment of the present invention enabling combined translation and Internet search functions in accordance with particular user-selected Web site text and/or information source and/or language pair preferences.
  • a Chinese-speaking user activates the information retrieval function 70 , in accordance with one or more particular translation language preferences, to simultaneously translate user-selected text (“key word”) into one or more languages/character sets 72 (e.g., using TurboDictionary™, according to the present invention).
  • the translations are accessed by an Internet search engine (e.g., I-Search®) to enable a user to simultaneously search the Internet in multiple languages 74 , based on the user's native language and selected key word.
  • FIG. 7B shows a more detailed diagram of the combined translation and Internet search embodiment of FIG. 7A , including certain structural elements.
  • the elements and steps above and below dashed lines 76 and 78 respectively, illustrate the core “translate” and display embodiment of the inventive method, comprising user selection of text and preferences for reference sources and translation language, accessing by the information retrieval function of a related information data set, based on related data stored in reference data base(s), and display of the data set to the user (path arrow 77 ).
  • the related information data set is used to perform an automated Internet search to obtain related Internet search information which is displayed to the user along with the related information data set (combined path arrows 77 and 79 ).
  • the core server-side translation and search functionality of the information retrieval function is shown enclosed in dotted lines 73 , with user selection and display occurring on the client side.
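  • The combined translate-then-search flow of FIGS. 7A and 7B can be sketched as two steps: translate the key word into each selected language, then issue one search per language. The translation table, the language codes, and the `search_fn` callable (standing in for a real Internet search engine, whose interface is not described here) are all hypothetical.

```python
# Hypothetical translation table keyed by (source language, target language).
TRANSLATIONS = {
    ("zh", "en"): {"骨质疏松症": "osteoporosis"},
    ("zh", "de"): {"骨质疏松症": "Osteoporose"},
}


def translate_keyword(keyword, source_lang, target_langs):
    """Return the key word in its own language plus every available translation."""
    queries = {source_lang: keyword}
    for target in target_langs:
        translated = TRANSLATIONS.get((source_lang, target), {}).get(keyword)
        if translated is not None:
            queries[target] = translated
    return queries


def multilingual_search(keyword, source_lang, target_langs, search_fn):
    """Run one search per language; search_fn(query, lang) is caller-supplied."""
    queries = translate_keyword(keyword, source_lang, target_langs)
    return {lang: search_fn(query, lang) for lang, query in queries.items()}
```

  • Languages for which no translation is found are skipped in this sketch, so the user still receives results for the remaining languages.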
  • the integrated user-directed information retrieval function of the present invention (using, e.g., TurboDictionary™) is implemented in the context of a wireless network (e.g., WAP server-based) and a voice recognition function to provide a real-time voice translator and reference tool.
  • This novel implementation is herein referred to as TurboTalk™.
  • the wireless embodiments provide an apparatus and integrated method, over a wireless network comprising a server side (e.g., WAP-based) and a client side (e.g., cellular phones, or other PDA (personal data assistant) wireless devices, e.g., Palm PC, Pocket PC, PSION, etc.), for user-directed acquisition of real-time translation and reference services in both text and voice, using standard cellular phones and PDA devices (voice-to-text, voice-to-voice, text-to-voice and text-to-text).
  • TurboTalk™ embodiments further comprise voice recognition/conversion software stored in the memory of the WAP server and operative with a processor of the server to recognize the voice and process it into text (or process text to voice).
  • this embodiment is offered as a wireless service by wireless service providers to subscribing users.
  • a wireless carrier offers the TurboTalk™ service, which enables a subscribing user, speaking in their native tongue, to spontaneously speak, for example, Greek while in Greece, and Japanese while in Japan without knowing either Greek or Japanese. That is, travelers can use the TurboTalk™ service to communicate with local people.
  • FIG. 8 shows a preferred embodiment, wherein communication proceeds between a “master” user, having a cellular phone 80 subscribing to the TurboTalk™ service, and a “slave” user, having either a plug-in headset 82 connected to the master device, or having another wireless device accessible by the master device ( FIG. 8 , upper half).
  • the system is compatible with the use of a plurality of “slave” devices in connection with one or more “master” devices receiving messages in one or more translated languages. In such instances, the master user selects language preferences for each slave.
  • FIG. 8 shows such a WAP-based wireless embodiment.
  • a “master” user 84 selects one or more translation language pairs (in this case, English→Chinese and English→German) using a GSM (Global System for Mobile Communications) phone 86 accessible by a WAP server 88 hosting the inventive information retrieval function (e.g., the method described herein using TurboDictionary™).
  • the user sends a voice message (“train station”) using the cellular phone, and the message is processed using voice-recognition software stored in the memory of the WAP server and operative with a processor of the server to recognize the voice and process it into text.
  • the information retrieval function translates the text, which is then converted to voice by the voice recognition function, and delivered as a voice message (in Chinese and/or German) over the wireless network to the “slave” users 88 .
  • FIG. 9 shows a simpler WAP-based wireless embodiment involving a single “master” user and a single “slave” user.
  • the master user selects a single translation language pair (in this case, English→Chinese), and sends a voice message 90 (“Where can I find the restrooms?”) using the cellular phone 92 , and the message is processed using voice-recognition software stored in the memory of a WAP server and operative with a processor of the server to recognize the voice and process it into text.
  • the information retrieval function translates the text, which is then converted to voice by the voice recognition/conversion function, and delivered as a voice message 94 (in Chinese) over the wireless network to the “slave” user.
  • the translated information is delivered to the slave user in text, or multimedia formats compatible with the slave user's native language and character set.
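  • The voice relay of FIGS. 8 and 9 is a three-stage pipeline: recognize the master's speech as text, translate the text once per slave language, and convert each translation back to voice. In the sketch below the three stages are caller-supplied callables, because the disclosure does not name particular recognition, translation or synthesis components; `relay_voice_message` and its parameter names are hypothetical.

```python
def relay_voice_message(audio, master_lang, target_by_slave,
                        recognize, translate, synthesize):
    """Deliver one master voice message to each slave in that slave's language.

    recognize(audio, lang) -> text
    translate(text, source_lang, target_lang) -> text
    synthesize(text, lang) -> audio
    """
    text = recognize(audio, master_lang)  # voice-to-text, done once
    delivered = {}
    for slave, target_lang in target_by_slave.items():
        translated = translate(text, master_lang, target_lang)
        delivered[slave] = synthesize(translated, target_lang)  # text-to-voice
    return delivered
```

  • Passing a `synthesize` callable that simply returns its text argument gives the text-delivery variant described above, in which the slave user receives the translation as text rather than voice.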
  • WAP-based embodiments primarily involve the use of standard WML and WMLScript, but many different cellular phones (User Agents) have slight modifications to the WAP standard. Therefore, a database of User Agents and their respective differences is operative with the inventive method and apparatus to ensure that all devices are compatible with TurboTalk™.
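  • One plausible shape for such a User Agent database is a default device profile plus per-device overrides, looked up by User Agent string prefix. Every name and value below (the profile keys, the byte limit, and the `ExamplePhone/1.0` identifier) is invented for illustration and describes no real device.

```python
# Hypothetical baseline capabilities assumed for a standard-compliant device.
DEFAULT_PROFILE = {"max_deck_bytes": 1400, "supports_wmlscript": True}

# Hypothetical per-device differences, keyed by User Agent string prefix.
USER_AGENT_OVERRIDES = {
    "ExamplePhone/1.0": {"supports_wmlscript": False},
    "ExamplePDA/2": {"max_deck_bytes": 8000},
}


def profile_for(user_agent):
    """Return the default profile with any matching device overrides applied."""
    profile = dict(DEFAULT_PROFILE)  # copy so the defaults are never mutated
    for prefix, overrides in USER_AGENT_OVERRIDES.items():
        if user_agent.startswith(prefix):
            profile.update(overrides)
    return profile
```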
  • master users may retrieve information related to selected voice or text messages, in addition to voice message translations.
  • Such related information comprises high quality multi-lingual translations, explanations, illustrations and/or consolidated automatic multi-dictionary definitions (“look-ups”), wherein such information is displayed to the user in text, voice, image or multi-media formats that correspond to, or are at least compatible with, the user's wireless device and the user's native language or character set.
  • server-integrated user-directed information retrieval function of the present invention can be implemented in the context of a wireless network (e.g., WAP server-based), and voice recognition function to provide a real-time voice translator and reference tool that can be used to communicate between two, or among a plurality of “master” and “slave” users.
  • a person skilled in the art will, however, recognize that many additional embodiments are encompassed within the scope of the present invention. In particular, other embodiments are possible that incorporate additional or different art-recognized features and that apply to additional types and formats of retrievable information.


Abstract

The present invention provides an integrated method, implemented over a wide area network or wireless network comprising a server side and a client side, for user-directed acquisition of reference information, such as dictionary definitions (“look-ups”) or language translation, relating to user-selected messages or user-selected text of World Wide Web (“Web”) site pages. The method steps, for Web use, are integrated into either a server-side web site or a client-side system. In particular embodiments, reference information relating to user-selected Web page text includes high quality multi-lingual translations, explanations, illustrations and/or consolidated automatic multi-dictionary definitions, wherein such information is displayed to the user in text, voice, image or multi-media formats. The present invention also provides an apparatus for user-directed acquisition of information relating to user-selected Web page text. The present invention further provides an integrated method, over a wireless network, for user-directed acquisition of real-time translation and reference services in both text and voice.

Description

    FIELD OF THE INVENTION
  • This invention relates to an apparatus and integrated method for acquisition of information relating to specific user-selected text of World Wide Web site pages, and more particularly to an apparatus and site-integrated method for providing high quality multi-lingual translations, explanations, and consolidated automatic multi-dictionary definitions of such text, in text, voice, image or multi-media format.
    BACKGROUND
  • Globalization and strategic partnering provide businesses with means for lowering costs and increasing revenue. Globalization of commerce is increasingly viewed as a requirement for businesses to remain viable in competitive markets. The Internet, in this competitive context, has developed not only as an important medium for entertainment and information exchange, but also as a forum for facilitating or conducting commerce, including global commerce.
  • Presently, commercial and informational World Wide Web sites (“web sites”) can be grouped roughly into one of two system categories: (1) provider-driven (i.e., content provider- or product seller-driven) or (2) consumer-driven (i.e., content or product consumer-driven). The difference between provider- and consumer-driven web sites or systems lies not only in which party takes the initiative to find the other, but also in which data set is used as the basis for relating the products, services or information. In a typical provider-driven system, the provider posts content in a single language, and holds itself out to the consumer using untargeted or generalized consumer data. By contrast, the consumer, in a consumer-driven system, seeks out the seller using real (i.e., qualified, tailored or targeted) consumer data.
  • Provider-driven systems; passive and active formats. Most current Internet informational and commerce systems are provider-driven and are either passive or active sites. A typical provider's Internet site is a passive advertising site that provides information about the content/product in a single language (usually English) and directs a generalized base of potential consumers to places where they can obtain the content/product.
  • A smaller number of providers have active sites that not only provide information, but also allow potential consumers to conduct a transaction over the Internet to obtain consumer-specific information or to purchase a product. The provider categorizes the content/prices, packages the information/product, and then holds it out for consumption or sale to potential consumers via the Internet. Typically, this information is presented and accessed in a single language on a given site.
  • Locating the desired content/product in provider-driven systems is typically a time-consuming process and often frustrating. Potential consumers must initially find the provider site, search and identify the content/product(s), and either aggregate (e.g., down-load and store) the desired information, or accept the seller's product terms or find another seller whose terms are acceptable. Information content on provider-driven sites is not typically tailored to the needs of particular consumers. For example, a potential consumer may fail to identify product information, or complete a product-related transaction if such consumer does not understand the language (e.g., English, Chinese, etc.) or technical terms of the commercial site. A typical consumer has neither the time nor the reference resources (e.g., specialized dictionaries, etc.) to effectively translate and research the language and technical or specialized terms of art associated with a given commercial web site to enable educated, informed decision making. Moreover, even if the reference resources were independently available to a potential consumer, the consumer would need to exit the site and spend considerable time accessing and consolidating the reference information. This has a negative impact on the “stickiness” of a site, because after such off-site activity, there is a high likelihood that the potential consumer will never return to the commercial web site, having been effectively deterred by the formidable tasks of translation and consolidation of information.
  • The magnitude of this problem, for a global consumer base, is readily appreciated by considering present web site demographics. Currently, 75% of all web sites are in English only. Even more significantly, 96% of E-commerce web sites are in English only. However, by 2003, 75% (696 million) of Internet users will not be native English speakers, and 50% of the projected worldwide $1.2 trillion online B2B commerce will come from outside the United States.
  • Because of difficulty in user acquisition of explanatory information relating to Web site content, products sold on provider-driven sites are typically limited to lower-priced standardized goods such as books, compact disks, and videos that are suitably distributed to a generalized consumer base, comprised of those consumers that understand the language of the particular site.
  • Prior approaches to solving e-commerce web site globalization problems in provider-driven systems. Currently, approaches to globalization of commercial web sites include translation, and limited use of consumer-driven systems. For example, globalization of provider-driven sites is typically approached (at least by those companies that can afford it) by full text translation, either via “localization” (human translation) or “machine” translation. Globalization of a web site via localization is very costly, averaging $250,000 per site per language, and with annual maintenance costs exceeding $50,000 per site per language. To many companies it is cost prohibitive. Furthermore, many companies need to have sufficient business intelligence before making the investment to launch into a foreign market.
  • Full text machine translation (e.g., using Systran, IBM, SDL, etc.) is relatively less expensive, but typically yields poor results. Machine translation attempts to translate full sentences and paragraphs, and currently fails because of contextual complexity and ambiguity. Significantly, such flawed machine translations can damage corporate image and/or incur liability risk resulting from mistranslation. Some improvement in machine translation, and/or liability reduction, might potentially be gained if the translation, or parts thereof, were to proceed by executable modules downloaded to the user. However, such modules would likely be large and take a long time for a user to download, especially if the user has a slow connection to the Internet. Even users with high-speed Internet access may have problems with the executable modules, because many network firewalls are designed to keep these modules from being downloaded onto the user's system. In addition, the downloaded executables pose substantial browser compatibility problems. Finally, if the commercial web site were to make any changes to its configuration process, a new module would need to be downloaded every time the process changes. Thus, like “localization,” machine translation is not desirable for companies wishing to post alternative-language versions of their web sites.
  • Consumer-driven systems. The use of client- or consumer-driven systems to reach a more global consumer base has also been contemplated. In a consumer-driven system, the consumer typically specifies the desired content requirements, or the product purchase terms, and then “posts” this specification for provider access. Some examples of consumer-driven systems on the Internet are “news groups” and “bulletin boards,” where any potential consumer can post “wanted” advertisements at little or no cost.
  • Consumer-driven Internet systems have inherent benefits that other systems do not. For example, where a large number of potential providers exist, but those providers do not have the resources to advertise globally, it makes sense for consumers, if they can, to take the initiative in communicating their needs to them. Theoretically, the Internet is an ideal medium upon which to address this situation because Internet postings are global, and the posted consumer data theoretically communicates a need/offer to a large number of potential providers. Moreover, such consumers can, at least in principle, exercise more control in limiting content to desired categories, or in setting the terms and conditions of their specific purchases, by strategically tailoring their posted consumer data.
  • However, despite these advantages, existing consumer-driven systems have substantial drawbacks for both consumers and providers. Significantly, it is difficult, if not impossible, for providers to efficiently target the needs of a consumer base that is scattered across the Internet, and comprised of amorphous and undiscriminating newsgroups, bulletin boards, and the like that host unqualified postings that may be in a variety of languages.
  • Moreover, even in the case of qualified consumers, each consumer typically has different content requirements or purchasing specifications, and communicates his or her needs to providers using a non-standard format using, e.g., different languages, features, attributes, conditions, terms, or language styles. Translating, interpreting and addressing such non-standard specifications simply requires too much time of the typical provider.
  • Consumers are likewise deterred from using current consumer-driven systems. For example, the potential consumer may refrain from using such a system where he or she has little or no control over the nature and amount of content provided in response to the particular consumer data posted. Additionally, the consumer may be reluctant to post content/product/service needs where he or she has little or no experience with the particular content/product/service and is unable to effectively address the lack of knowledge. These concerns are compounded where the desired information content or the product being sold is available in multiple, highly technical, complex data or product configurations.
  • Thus client/consumer driven systems, like full-text machine translation or human localization approaches, do not address the globalization problems for businesses with commercial web pages that post content and/or transact in a single language format.
  • Prior approaches to solving e-commerce web site globalization problems in consumer-driven systems. Various approaches in the art have been employed to aid the client/consumer in translation. RichLink and Babylon represent typical prior art approaches. RichLink is a product that allows a client to customize definitions of particular words and phrases that occur on a web page. However, the process is not automatic: the client must first leave the web page to obtain the definition from individual separate sources, and then the definition must be manually entered and maintained on the web page. Likewise, Babylon provides limited translation of user-selected words, but is not integrated into providers' web pages, and requires downloaded software to be resident and executable on the client side.
  • Therefore, there is a need in the art for cost-effective methods of high quality globalization of commercial web sites. There is a need in the art to render commercial web sites, such as provider-directed web sites, more accessible and responsive to the multi-lingual characteristics of individual global clients. There is a need in the art for methods of web site translation that do not require full-text human or machine translation. There is a need in the art for web site-integrated translation and data-mining methods for tailoring content on otherwise generalized provider-driven commercial web sites to address the needs of a global client base. There is a need in the art to reduce the amount of client-side processing involved with conducting commercial web site translations and transactions. There is a need in the art to enhance commercial web site “stickiness” for an increasingly global client base whose first language is not that of the site. There is a need in the art for a cost-effective apparatus and web site-integrated method for providing high quality simultaneous searching of multiple dictionaries to obtain multi-lingual translations, explanations, and consolidated automatic multi-dictionary definitions of user-selected text, in text, voice, image or multi-media format.
  • SUMMARY OF THE INVENTION
  • An object of the present invention is to provide an integrated method, over a wide area network comprising a server side and a client side, for user-directed acquisition of information relating to user-selected text of World Wide Web site pages. The inventive apparatus and methods provide a cost-effective apparatus and web site-integrated method for providing high quality simultaneous searching of multiple dictionaries to obtain multi-lingual translations, explanations, and consolidated automatic multi-dictionary definitions of user-selected text, in text, voice, image or multi-media format.
  • The method comprises: selecting, by a user, a word of web page text, whereby the user-selected word is accessible by an information retrieval function; selecting, by a user, from a plurality of reference sources or translation language pairs, a reference source or a translation language pair, whereby the user-selected reference source or user-selected translation language pair is accessible by an information retrieval function; accessing, in accordance with the user-selected reference source or user-selected translation language pair, and using the information retrieval function, information relating to the user-selected word; and displaying to the user the information related to the user-selected word, whereby user-directed acquisition of information relating to user-selected text of World Wide Web site pages is achieved.
  • In particular embodiments, the method steps are integrated into a server-side web site by either incorporating into the site a reference window linked to the information retrieval function, or linked pull down menus, or by installing programmed script on all web pages of the site to link and enable the information retrieval function. In alternate embodiments, the method steps are integrated into a client-side system using downloadable software, or a browser plug-in that is linked to the information retrieval function. Preferably, selecting a word of web page text is achieved by either typing or pasting a copied word into a window linked to an information retrieval function, or by highlighting the word whereby such highlighting renders the user-selected word accessible by an information retrieval function. In particular embodiments the reference source comprises one or a plurality of dictionaries. Preferably, information relating to user-selected text of World Wide Web site pages consists of high quality multi-lingual translations, explanations, or consolidated automatic multi-dictionary definitions, wherein such information is displayed to the user in text, voice, image or multi-media formats. Preferably, the information retrieval function further comprises an Internet search engine operative with the information retrieval function to search the Internet in accordance with the user-selected text, and the user-selected reference source or user-selected translation language pair, whereby the information retrieval function is augmented.
  • An alternate embodiment provides a method, over a wide area network comprising a server and a client, for user-directed acquisition of information relating to user-selected text of World Wide Web site pages, comprising: receiving from the client a user-selected word of World Wide Web page text; receiving from the client a user-selected reference source or user-selected translation language pair; accessing information relating to the user-selected word, in accordance with the user-selected reference source or user-selected translation language pair; and transmitting to the client the accessed information relating to the user-selected word. Preferably, the method further comprises an Internet search engine operative with the server to search the Internet in accordance with the user-selected text, and the user-selected reference source or user-selected translation language pair.
  • An additional embodiment provides an apparatus for user-directed acquisition, over a wide area network comprising a server side and a client side, of information relating to user-selected text of World Wide Web site pages. The apparatus comprises: a server connected to the network, wherein the server comprises a processor, a memory, and a storage device, and wherein the processor and memory are operative with a program stored on the storage device to receive a user-selected word of World Wide Web page text; receive a user-selected reference source or user-selected translation language pair; access, in accordance with the user-selected reference source or user-selected translation language pair, information relating to the user-selected word; and transmit the accessed information relating to the user-selected word to the client-side user. Preferably, the apparatus comprises an Internet search engine operative with the server to search the Internet in accordance with the user-selected text, and the user-selected reference source or user-selected translation language pair.
  • Further embodiments provide an apparatus and integrated method, over a wireless network comprising a server side (e.g., WAP-based) and a client side (e.g., cellular phones, or other PDA wireless devices, such as Palm PC, Pocket PC, PSION, etc.), for user-directed acquisition of real-time translation and reference services (including voice-to-text, voice-to-voice, text-to-voice and text-to-text translations and reference services) in both text and voice, using standard cellular phones and PDA devices. Preferably, this embodiment is implemented as part of a wireless subscription service for end users by wireless service providers. Preferably, communication proceeds between a “master” user, having a subscribing wireless device (e.g., phone), and a “slave” user, having either a plug-in headset connected to the master device, or having another wireless device accessible by the master device. Preferably, the system is implemented using a plurality of “slave” devices, in connection with one or more “master” devices, receiving messages in one or more translated languages. Preferably, master users may retrieve information related to selected voice or text, where such related information comprises high quality multi-lingual translations, explanations, illustrations and/or consolidated automatic multi-dictionary definitions (“look-ups”), and wherein such information is displayed to the user in text, voice, image or multi-media formats that are compatible with the user's wireless device, and with the user's native language or character set.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a screen shot of a typical provider-driven Web page, illustrating a server-side Web site-integrated embodiment of the inventive method for user-directed acquisition of information relating to user-selected Web page text. Various fields (10, 12 and 14) in a Web page-integrated “reference” window 16, or one or more fields 20 in an integrated “result” window 18, are used to select text and user preferences, and to display retrieved information, such as a language translation (e.g., English→Chinese), related to the user-selected text (e.g., word-by-word translation, according to the present invention). The method involves user selection of Web page text, further user selection of preferences for specific reference sources and/or translation language pairs from a plurality of such reference sources and/or languages, and acquisition of related information from a linked information retrieval function, according to the user-selected preferences. The method occurs in real-time, and does not require the user to exit the provider-driven Web site.
  • FIG. 2 is a screen shot, similar to that of FIG. 1, of a typical provider-driven Web page, illustrating another server-side Web site-integrated embodiment of the inventive method for user-directed acquisition of information relating to user-selected Web page text. In this embodiment, information related to user-selected Web page text (i.e., a dictionary “look-up,” or definition of “otolaryngologist”) that is acquired via the linked information retrieval function is displayed in a field 20 of the Web site-integrated “reference” window 16.
  • FIG. 3 is a screen shot, similar to that of FIGS. 1 and 2, of a typical provider-driven Web page, illustrating an additional server-side Web site-integrated embodiment of the inventive method for user-directed acquisition of information relating to user-selected Web page text. Here, Web site integration is accomplished by installing executable script (e.g., JavaScript™) on all web pages of the site to link and enable the information retrieval function. A ‘right-click’ of highlighted text 30 using a mouse can be used to activate one or more pop-up menus for user selection of particular reference source and/or translation language preferences (in this example, the English→Chinese language pair preference was selected), and to enable activation of the information retrieval function to provide for a displayed translation 36 in one or more window fields using a Chinese character set.
  • FIG. 4 is a screen shot of another server-side Web site-integrated embodiment of the present invention similar to that illustrated in FIG. 3. The integrated web page is displayed in the upper half of the figure. In the lower half of the figure, a right click of a mouse was used to activate pop-up menus, select user preferences, and acquire information related to user-selected Web site text (in this case, translation of legal-related text from a court opinion into Chinese characters).
  • FIG. 5 shows a high-level flow diagram of the inventive method steps associated with the server-side Web page-integrated embodiments shown in FIGS. 3 and 4. A “mouse” right-click is made on selected text 50. One or more resulting “pop-up” menus 52 allow a user to select preferences 54 for reference source (e.g., dictionary) and/or translation language pair (e.g., English→German), and to activate 56 an information retrieval function to look-up and translate the selected word into the selected language, which is displayed in an integrated window field 58. Here, the English word “dog” is translated into German (“Hund”), according to one embodiment of the inventive method.
  • FIG. 6 is a high-level flow diagram of the inventive method steps associated with an alternate client-side embodiment of the present invention. An end user downloads a browser “plug-in” which allows the user to use “mouse” right-clicks 60 on user-selected text 62 to select reference 64 and language 66 preferences, and to retrieve information 68 (here, a translation into German) related to user-selected Web page text on any Web page, including pages on those Web sites that do not have a server-side embodiment of inventive method.
  • FIG. 7A shows an embodiment of the present invention enabling combined translation and Internet search functions in accordance with particular user-selected Web site text and/or information source and/or language pair preferences. Here, a Chinese-speaking user activates (according to the present invention) the information retrieval function 70, in accordance with one or more particular translation language preferences, to simultaneously translate a user-selected text (“key word”) into one or more languages/character sets 72 (e.g., using TurboDictionary™, according to the present invention). The translations are accessed by an Internet search engine (e.g., I-Search®) to enable a user to simultaneously search the Internet in multiple languages 74, based on the user's native language and selected key word.
  • FIG. 7B shows a more detailed diagram of the combined translation and Internet search embodiment of FIG. 7A, including certain structural elements. The core “translate” and display embodiment of the inventive method is shown in combination with an automated Internet search, using the related information data set to obtain related Internet search information which is displayed to the user along with the related information data set (combined path arrows 77 and 79). The core server-side translation and search functionality of the information retrieval function is shown enclosed in dotted lines 73, with user selection and display occurring on the client side.
  • FIG. 8 (upper panel) shows apparatus for a wireless embodiment of the present invention. Communication proceeds between a “master” user, having a cellular phone 80 subscribing to the TurboTalk™ service (described herein), and a “slave” user, having either a plug-in headset 82 connected to the master device, or having another wireless device accessible by the master device. FIG. 8 (lower panel) shows such a WAP server-based wireless method embodiment. A “master” user 84 selects one or more translation language pairs using a GSM phone 86 accessible by a WAP server 88 hosting the inventive information retrieval function. The user sends a voice message using the cellular phone, and the message is processed using voice-recognition software stored in the memory of the WAP server and operative with a processor of the server to recognize the voice and process it into text. The information retrieval function translates the text, which is then converted to voice by the voice recognition function, and delivered as a translated voice message over the wireless network to the “slave” users 88.
  • FIG. 9 shows a simpler WAP-based wireless embodiment involving a single “master” user and a single “slave” user. The master user selects a single translation language pair, and sends a voice message 90 using the cellular phone 92, and the message is processed using voice-recognition software stored in the memory of a WAP server and operative with a processor of the server to recognize the voice and process it into text. The information retrieval function translates the text, which is then converted to voice by the voice recognition function, and delivered as a translated voice message 94 over the wireless network to the “slave” user.
  • DETAILED DESCRIPTION OF THE INVENTION
  • Described below are various embodiments of the present invention. The embodiments illustrate various ways in which the present invention can be implemented—in this case, an apparatus and integrated method for user-directed acquisition of information relating to user-selected text of World Wide Web (“WEB”) site pages, and an apparatus and integrated method, over a wireless network comprising a server side (e.g., a WAP-based server) and a client side (e.g., cellular phones, or other PDA wireless devices), for user-directed acquisition of real-time translation and reference services. The invention is particularly useful in those instances where the language and character-set of the Web site differ from the native language and character set of the user.
  • The present invention affords typical provider-driven web sites an immediate, accurate, and low cost multilingual presence by expanding their site audience to include other language speakers. Additionally, integration of reference and/or result windows and fields increases the number of site page views, thus providing more advertising revenue. The present invention will enhance revenue by increasing sales to other language speakers, because such users will be enabled to navigate the Web site and educate themselves about the products in their own languages to make informed transactional decisions. Furthermore, the present invention provides a novel method to collect business intelligence and understand foreign markets, based on the words translated and/or on which products or related information are selectively accessed by particular language groups.
  • The embodiments of the inventive apparatus and method provide a practical and inexpensive means by which Web sites, such as typical provider-driven commercial Web sites (see “Background,” herein above), may be rendered more accessible by a global consumer base comprised of individual users, communicating in one of a plurality of native languages and native character sets. More specifically, the embodiments provide an apparatus and integrated method, over a wide area network comprising a server side and a client side, for user-directed acquisition of information relating to user-selected text of Web site pages, using the portion of the Internet known as the Web. In particular embodiments, such related information comprises high quality multi-lingual translations, explanations, illustrations and/or consolidated automatic simultaneous multi-dictionary definitions (“look-ups”), wherein such information is displayed to the user in text, voice, image or multi-media formats that correspond to, or are at least compatible with, the user's native language or character set. The invention thus allows for globalization of Web sites to proceed in the absence of expensive and inaccurate full-text translations.
  • Using the inventive apparatus and method, a user connects to a Web site from their client computer (or from an Internet-compatible PDA or other cellular or hand-held device). The inventive apparatus and method achieve their efficiency, at least in part, by enabling a user to acquire reference information related to specific Web site text in the absence of full-text machine translation or human “localization” (see “Background,” herein above). This is because most users who are not native English speakers speak some English as their second or third language, and they need assistance primarily with vocabulary and not grammar. Thus, the present invention offers text translation (e.g., word-by-word translation) and illustration that will effectively achieve the same purpose as a full-text machine translation or localization. Additional efficiency is gained, in preferred embodiments, where the method is integrated into a server-side web site, by either incorporating into the site a reference window or pull down menus linked to the inventive information retrieval function, or by installing programmed script on all web pages of the site to link and/or enable the information retrieval function. In alternate embodiments, the method steps are integrated into a client-side system using downloadable software or browser plug-ins that link user-selected text and preferences to the information retrieval function.
  • The method involves the user “selecting” or highlighting specific Web page text (e.g., word-by-word selection), and then further selecting specific reference sources and/or translation language pairs from a plurality of reference sources and/or translation language pairs. The user-selected text and preferences (variables) are accessible by an information retrieval function comprising a server connected to the network, wherein the server comprises a processor, a memory, and a storage device, and wherein the processor and memory are operative with a software program stored on the storage device to: receive the user-selected Web page text (e.g., user-selected word); receive a user-selected reference source and/or user-selected translation language pair; access, in accordance with the user-selected reference source and/or user-selected translation language pair, information relating to the user-selected text; and transmit the accessed information relating to the user-selected text to the client-side user. Preferably, such information transmittal is in the user-selected translation language, which can be the user's native language. The memory element of the information retrieval function comprises an extensive collection of dictionaries and reference books to enable retrieval of related information using a variety of languages, including but not limited to all European languages, Chinese, Korean, Japanese, Russian and Hebrew.
  • The information retrieval function, as will be obvious to those of ordinary skill in the art, may comprise one or more linked servers comprising one or a plurality of processors, a memory, and storage devices operably programmed and configured to carry out the information retrieval function.
  • FIG. 1 and FIG. 2 illustrate a first embodiment of the overall method a user goes through to acquire information relating to specific user-selected Web page text. In this example, a Chinese user desires a translation into Chinese of the English term “osteoporosis,” present as text on a provider-driven, server-side Web site. To begin the method, the user establishes a network connection between the client and the server. Using a client-side Web browser, the user “selects” the desired text by “copying” the word “osteoporosis” from the upper-left quadrant of the Web page, and “pasting” it to a text field 10 in a reference window 16 (present in the lower-left quadrant of the Web page) that is integrated into the Web page, and that is linked to an information retrieval function (discussed below). Alternatively, the desired text may be typed into the text field 10.
  • The user selects, from a plurality of reference sources (e.g., a professional medical dictionary) and/or translation language pairs (e.g., English→Chinese), a reference source and/or a translation language pair, whereby the user-selected reference source or user-selected translation language pair is accessible by the information retrieval function. For example, in the present embodiment, the user preference for a particular translation language pair, from among a plurality of such pairs, is selected using a scrollable language pair field 12 of the linked dictionary window 16. Other selection means such as right-clickable “pop-up” menus (discussed herein, below) are also encompassed by embodiments of the present invention, and are familiar in the art.
  • The user accesses information relating to the user-selected text, in accordance with the user-selected reference and/or translation language preferences, by activating the information retrieval function. This is accomplished in the example of FIG. 1 by clicking on a “look up” button 14 within the dictionary window. The information retrieval function comprises a server connected to the network, wherein the server comprises a processor, a memory, and a storage device, and wherein the processor and memory are operative with a program stored on the storage device to: receive user-selected Web page text (e.g., a selected word); receive a user-selected reference source preference (e.g., a professional medical dictionary) and/or a user-selected translation language pair preference (e.g., English→Chinese); access information relating to the user-selected text in accordance with the user-selected reference and/or language preferences; and transmit the accessed related information to the client-side user.
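  • The receive, access and transmit steps described above can be sketched as follows. This is a minimal illustration only, not the patented implementation: the in-memory `REFERENCE_STORE`, the `LookupRequest` and `LookupResponse` types, the `handle_lookup` function, and the source and language-pair labels are all hypothetical names introduced here.

```python
from dataclasses import dataclass

# Hypothetical in-memory stand-in for the server's storage device:
# (reference source, language pair) -> {word: related information}
REFERENCE_STORE = {
    ("general", "en-de"): {"dog": "Hund"},
    ("general", "en-en"): {"dog": "a domesticated canine"},
}


@dataclass
class LookupRequest:
    word: str              # user-selected Web page text
    reference_source: str  # user-selected reference source
    language_pair: str     # user-selected translation language pair


@dataclass
class LookupResponse:
    word: str
    found: bool
    information: str


def handle_lookup(request, store=REFERENCE_STORE):
    """Receive the selection and preferences, access the related
    information, and build the response sent back to the client."""
    entries = store.get((request.reference_source, request.language_pair), {})
    info = entries.get(request.word.strip().lower())
    return LookupResponse(request.word, info is not None, info or "")


if __name__ == "__main__":
    print(handle_lookup(LookupRequest("dog", "general", "en-de")))
```

  • In a deployed system the store would be backed by the dictionary collection on the server, and the request and response would travel over the network rather than as direct function calls.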
  • In preferred embodiments, a list of desired dictionaries is selected from a dictionary list by the user to define (e.g. narrow) the search. Preferably, this is a standard linear list selected either explicitly by the user, or implicitly by the user by virtue of pre-stored user preferences accessible to the information retrieval function (e.g., by means of a stored ‘cookie’).
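  • One way to resolve the dictionary list, explicitly from the user or implicitly from a stored cookie, is sketched below. The cookie name `dicts`, the `|` separator, the `AVAILABLE` list and the function name are assumptions made for illustration; the source does not specify a cookie format.

```python
from http.cookies import SimpleCookie

# Hypothetical list of dictionaries hosted by the information retrieval function.
AVAILABLE = ["general", "medical", "legal", "technical"]


def resolve_dictionaries(explicit=None, cookie_header=""):
    """Return the linear list of dictionaries to search.

    An explicit user selection wins; otherwise pre-stored preferences are
    read from a (hypothetical) 'dicts' cookie; otherwise all are searched.
    """
    if explicit:
        chosen = list(explicit)
    else:
        cookie = SimpleCookie()
        cookie.load(cookie_header)
        morsel = cookie.get("dicts")
        chosen = morsel.value.split("|") if morsel else list(AVAILABLE)
    # Drop names the server does not host, preserving the user's order.
    return [name for name in chosen if name in AVAILABLE]


if __name__ == "__main__":
    print(resolve_dictionaries(cookie_header="dicts=medical|legal; lang=en-zh"))
```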
  • The inventive information retrieval function accesses the user-selected text and user preferences (e.g., reference sources, language pairs) to search (using art-recognized search engine technology), process, rank and sort related information stored in one or more data bases, and then transmits the accessed related information to the user. This is accomplished using various art-recognized algorithms and data structures.
  • In preferred embodiments, hashing and/or ‘BTree’ data structures are used to efficiently process matching (relevant) data sets from an initial search. Preferably, ‘hashing’ is used to quickly reduce or minimize the data set that matches a query term, enabling a simpler search within a relatively smaller data set. Preferably, ‘BTree’ data structures are used to reduce the frequency of disk access during a search. A ‘BTree’ data structure uses nodes arranged in a tree structure to search for data that is contained in its nodes. ‘BTree’ nodes, compared with those of other tree-based data structures, are contained within relatively larger ‘pages’ that match the physical size of disk blocks, thereby reducing the number of disk access events for a given search.
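  • The two ideas can be illustrated with a small sketch. It is a simplification under stated assumptions: `build_hash_buckets` uses Python's built-in hash table (`dict`) to narrow the candidate set, and `PagedIndex` is a static, read-only two-level paged index in the spirit of a B-tree (it does not implement insertion or node splitting). All names and the page size are hypothetical.

```python
from bisect import bisect_left, bisect_right
from collections import defaultdict


def build_hash_buckets(headwords):
    """Hash each word to the set of headwords containing it, so a query
    term only needs to be compared against a small candidate set."""
    buckets = defaultdict(set)
    for headword in headwords:
        for word in headword.lower().split():
            buckets[word].add(headword)
    return buckets


class PagedIndex:
    """Static two-level index: one root page of separator keys plus leaf
    pages holding several sorted entries each. Every lookup touches
    exactly one root page and at most one leaf page."""

    def __init__(self, items, page_size=4):
        pairs = sorted(items.items())
        # In a real B-tree the page size would match the disk block size.
        self.leaves = [pairs[i:i + page_size]
                       for i in range(0, len(pairs), page_size)]
        self.root = [page[0][0] for page in self.leaves]
        self.page_reads = 0  # counts simulated disk accesses

    def get(self, key):
        self.page_reads += 1                    # read the root page
        i = bisect_right(self.root, key) - 1
        if i < 0:
            return None                         # key sorts before every entry
        self.page_reads += 1                    # read one leaf page
        page = self.leaves[i]
        keys = [k for k, _ in page]
        j = bisect_left(keys, key)
        if j < len(keys) and keys[j] == key:
            return page[j][1]
        return None
```

  • Because many keys share a page, the number of page reads per lookup stays small even as the dictionary grows, which is the property the passage above attributes to ‘BTree’ pages.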
  • To find words within phrases, the information retrieval function uses pointers. Each word within a phrase becomes a ‘child’ node that points back to the ‘parent’ node, which contains the phrase. For phrases consisting of a single word, there is no ‘child.’
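  • The parent and child pointer arrangement can be sketched as below. The class names `PhraseNode`, `WordNode` and `PhraseIndex` are hypothetical, and whitespace splitting is an assumption; the source does not say how phrases are tokenized.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)
class PhraseNode:
    phrase: str
    children: list = field(default_factory=list)


@dataclass(eq=False)
class WordNode:
    word: str
    parent: PhraseNode  # pointer back to the phrase containing this word


class PhraseIndex:
    def __init__(self):
        self.phrases = {}  # phrase text -> PhraseNode ('parent' nodes)
        self.words = {}    # word -> list of WordNode ('child' nodes)

    def add(self, phrase):
        node = PhraseNode(phrase)
        self.phrases[phrase] = node
        parts = phrase.split()
        if len(parts) > 1:  # a single-word phrase has no child
            for word in parts:
                child = WordNode(word, node)
                node.children.append(child)
                self.words.setdefault(word, []).append(child)
        return node

    def phrases_containing(self, word):
        """Return the word itself if it is a phrase, then every phrase
        reached by following child -> parent pointers."""
        found = [word] if word in self.phrases else []
        for child in self.words.get(word, []):
            if child.parent.phrase not in found:
                found.append(child.parent.phrase)
        return found
```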
  • Preferably, terms (words) used for searching are further refined by applying a number of transformations, including but not limited to: (i) stemming, to find roots of words (e.g., “jumped” becomes “jump”); (ii) stop word removal (e.g., in the case of English, removal of “a,” “the,” “an,” etc.); and (iii) word splitting into individual sub-words (e.g., “superheated” becomes “super” and “heated”).
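  • The three transformations can be sketched as a small pipeline. This is a deliberately naive illustration: the suffix list, the stop-word set, the sub-word vocabulary, and the order in which the steps are applied are all assumptions, and a production system would use a proper stemming algorithm.

```python
STOP_WORDS = {"a", "an", "the"}                     # illustrative English stop words
SUFFIXES = ("ing", "ed", "s")                       # naive suffix list (assumption)
SUBWORD_VOCAB = {"super", "heated", "jump", "dog"}  # hypothetical vocabulary


def stem(word):
    """Strip one known suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[:-len(suffix)]
    return word


def split_compound(word, vocab=SUBWORD_VOCAB):
    """Split a word into two sub-words when both halves are known."""
    for i in range(3, len(word) - 2):
        head, tail = word[:i], word[i:]
        if head in vocab and tail in vocab:
            return [head, tail]
    return [word]


def refine(query):
    """Remove stop words, split compounds, then stem each remaining part."""
    terms = []
    for token in query.lower().split():
        if token in STOP_WORDS:
            continue
        for part in split_compound(token):
            terms.append(stem(part))
    return terms


if __name__ == "__main__":
    print(refine("The superheated dog jumped"))
```

  • Note that when stemming follows splitting, as assumed here, the sub-word “heated” is itself reduced to “heat”.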
  • In preferred embodiments, the retrieved data set is analyzed for relevancy using various criteria including, but not limited to, hit counting within the term itself, proximity of words, exact v. sub-phrase matching, and hit counting within word definition areas. Each term within each dictionary is assigned a score, based on the relevancy algorithm. A sort algorithm is applied to the ranked data set and the top N results are displayed to the user. The user may optionally select to see the next N results, and so on.
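  • A minimal sketch of scoring, sorting and paging follows. The weights are invented for illustration, word proximity is omitted for brevity, and the function names are hypothetical; the source names the criteria but not a formula.

```python
def score(query, headword, definition):
    """Score one dictionary entry against a single-word query."""
    q = query.lower()
    head_words = headword.lower().split()
    points = 0
    if headword.lower() == q:
        points += 10                     # exact match on the term
    elif q in head_words:
        points += 5                      # sub-phrase match
    points += 2 * head_words.count(q)    # hits within the term itself
    points += definition.lower().split().count(q)  # hits in the definition
    return points


def rank(query, entries):
    """entries: iterable of (dictionary, headword, definition) tuples.
    Returns matching (score, dictionary, headword) tuples, best first."""
    scored = [(score(query, head, definition), name, head)
              for name, head, definition in entries]
    scored = [item for item in scored if item[0] > 0]
    scored.sort(key=lambda item: (-item[0], item[1], item[2]))
    return scored


def page(results, n, page_index=0):
    """Return the page_index-th group of N results (0 = the top N)."""
    return results[n * page_index:n * (page_index + 1)]
```

  • Calling `page(results, n, 1)` after `page(results, n, 0)` corresponds to the user asking to see the next N results.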
  • The retrieved, ranked and sorted information data set relating to the user-selected word is displayed to the user using an integrated result window 18, or in the alternative using one or more result fields 20 within the integrated reference window 16 (FIG. 2) or integrated result window 18 (FIG. 1). Thus, users have the ability not only to select which information sources are accessed, but may also compare the results from alternate reference sources in real time without leaving the Web site. Various types of information retrieval are encompassed by the present invention, including but not limited to high quality multi-lingual translations, explanations, illustrations and consolidated automatic multi-dictionary definitions, wherein such information is displayed to the user in text, voice, image or multi-media formats that correspond to, or are at least compatible with, the user's native language and/or character set.
  • In particular embodiments the above-described method steps are integrated into a server-side Web site. In these embodiments the provider-driven site enables its users to acquire information relating to user-selected text by hosting or integrating the inventive method and information retrieval function on the provider-driven site. Such integration is achieved, for example, by incorporating into the site a reference window (e.g., like reference window 16 of FIG. 1 and FIG. 2) linked to the information retrieval function. For example, in the example of FIG. 1, selecting Web page text is achieved by “pasting” a “copied” word into a text field 10 of a reference window 16, linked to an information retrieval function. This requires that the server-side Web site be integrated with a window having a link to the information retrieval system and software.
  • FIGS. 3 and 4 illustrate an alternative embodiment of server-side Web site integration of the inventive method. In the these examples, Web site integration is accomplished by installing executable script (e.g., JavaScript™) on all web pages of the site to link and enable the information retrieval function. In such embodiments, a right-click of highlighted text 30 using a mouse can be used to activate a pop-up menu for user selection of one or more particular reference source and/or translation language preferences (in these examples, the English→Chinese preference was selected), and enable activation of the information retrieval function to provide for a displayed translation 36 in one or more window fields using a Chinese character set.
  • FIG. 5 shows a high-level flow diagram of the inventive method steps associated with the server-side Web page-integrated embodiments shown in FIGS. 3 and 4. A “mouse” right-click is made on selected text 50. One or more resulting “pop-up” menus 52 allow a user to select preferences 54 for reference source (e.g., dictionary) and/or translation language pair (e.g., English→German), and to activate 56 an information retrieval function to look-up and translate the selected word into the selected language, displayed in an integrated window field 58. Here, the English word “dog” is translated into German (“Hund”), according to one embodiment of the inventive method.
  • In alternative embodiments, the user takes the initiative to install or integrate the method steps into a client-side system using, for example, a downloadable browser “plug-in” that facilitates linking to the above-described information retrieval function. A user can then use the inventive method on any provider-driven Web site, regardless of whether or not a server-side embodiment of the inventive method has been integrated into the provider-driven Web site.
  • FIG. 6 is a high-level flow diagram of the inventive method steps associated with an alternate client-side embodiment of the present invention. An end user downloads a browser “plug-in,” which allows the user to use “mouse” right-clicks 60 on user-selected text 62 to select reference 64 and language 66 preferences, and to retrieve information 68 (here, a translation into German) related to user-selected Web page text on any Web page, including pages on those Web sites that do not have a server-side embodiment of the inventive method.
  • In the foregoing embodiments, the user-selected text translation (or “look-up”) and retrieved information display language and character set may be the same as, or different from the language and character set of the Web page being translated or interpreted. In either case, particular embodiments of the present invention have the character of a personalized (user-directed) education center, whereby users can effectively navigate, translate, interpret and perform transactions on provider-driven Web sites regardless of their native language or character set. User selectable reference sources include, but are not limited to electronic databases comprising dictionaries, thesauruses, encyclopedias, digital images, multi-media content (e.g., sound and video). Such reference sources may comprise data corresponding to a variety of languages and character sets, including but not limited to all European languages, Chinese, Korean, Japanese, Russian and Hebrew, and the character sets thereof.
  • An additional novel feature of the present invention is “data mining.” The present invention can be combined using art-recognized statistical methods and algorithms to provide consumer data mining statistics according, for example, to country, language, topic and product. Such data mining capability enhances the globalization utility of the present invention by allowing provider-driven sites, which have been traditionally limited to presenting content to an amorphous generalized consumer base (see under “Background,” above), to understand their audiences and market, and optimize presentation of posted content.
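As a rough illustration of the kind of statistics described above, look-up requests could be tallied by country, language, topic or product. The sketch below uses plain frequency counts, which is an assumption (the specification only says "art-recognized statistical methods"); the record fields and the function names `count_lookups` and `top_values` are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple


def count_lookups(log: Iterable[Dict[str, str]], field: str) -> Counter:
    """Count logged look-up requests grouped by one field (e.g. "country")."""
    return Counter(record[field] for record in log)


def top_values(log: Iterable[Dict[str, str]], field: str, n: int) -> List[Tuple[str, int]]:
    """Return the n most frequent values of a field with their counts."""
    return count_lookups(log, field).most_common(n)


# Hypothetical log of look-up requests made on a provider-driven site.
lookup_log = [
    {"country": "CN", "language": "zh", "topic": "shipping", "product": "laptop"},
    {"country": "DE", "language": "de", "topic": "warranty", "product": "laptop"},
    {"country": "TW", "language": "zh", "topic": "shipping", "product": "camera"},
]
```

Here `top_values(lookup_log, "language", 1)` would report which language the site's look-ups are most often translated into.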
  • System hardware. E-commerce consumers do not have an intelligent, fast and reliable method for accessing and performing transactions on Web sites that do not post content in their native languages. The present invention addresses this need by providing software (herein described as TurboDictionary™) that efficiently allows for “on-line” acquisition of information (e.g., translation and dictionary look-ups) related to user-selected text, and according to user-selected preferences. The method derives its novelty and utility, at least in part, by taking advantage of the fact that most on-line users who do not speak English as their native language are nonetheless familiar with it as a second or third language, and thus need primarily only help with vocabulary. Thus, translation of user-selected text, according to the present invention, circumvents the necessity of full-text machine or human localization (see under “Background,” above), which is expensive and typically flawed. In preferred embodiments, as described above, the invention is implemented as an integrated method on the server-side, requiring little, if any, client-side processing. Thus, a consumer can access the Internet using a computer or electronic hand-held device. The software program of the present invention is usable over a network, preferably over the Internet, and most preferably over the World Wide Web (“Web”).
  • The apparatus of the present invention encompasses a computer network comprising a server and at least one user (client) subsystem connected to the server via a network connecting means (e.g., user modem). Although referred to as a modem, the user modem can be any other communication means that enables network communication, for example, ethernet links. The modem can be connected to the server by a variety of connecting means, including public telephone land lines, dedicated data lines, cellular links, microwave links, or satellite communication.
  • The server is essentially a high-capacity, high-speed computer that includes a processing unit connected to one or more relatable data bases or content sources, comprising, for example, electronic dictionaries, thesauruses, encyclopedias, digital images, multi-media content (e.g., sound and video, streaming video). Such content may comprise data corresponding to a variety of languages and character sets, including but not limited to all European languages, Chinese, Korean, Japanese, Russian and Hebrew, and the character sets thereof. Additional databases are optionally added to the server. Also connected to the processing unit are sufficient memory and appropriate communication hardware. The memory can be a separate, but connected component. The communication hardware may be modems, ethernet connections, or any other suitable communication hardware. Although the server can be a single computer having a single processing unit, it is also possible that the server function could be spread over several networked computers, each having its own processor and having one or more databases resident thereon. In particular embodiments of the present invention the server can be a WAP-based server of a wireless network (see “Wireless Device Applications,” below).
  • In addition to the elements described above, the server further comprises an operating system and communication software allowing the server to communicate with other computers. Various operating systems and communication software may be employed. For example, the operating system may be Microsoft Windows NT™, and the communication software Microsoft IIS™ (Internet Information Server) server with associated programs.
  • According to the present invention, the server is connected to the network, wherein the server comprises a processor, a memory, and a storage device, and wherein the processor and memory are operative with a program (e.g., herein referred to as TurboDictionary™) stored on the storage device to: receive a user-selected word of Web page text; receive user-selected reference source and/or user-selected translation language pair preferences; access information relating to the user-selected word in real time, and in accordance with the user-selected reference source or user-selected translation language pair; and transmit the accessed information relating to the user-selected word to the client-side user. The software program of the present invention can be implemented as a stand-alone program on a particular Web site provider's server. Alternatively, the software may be implemented on an independent server that is linked to the provider's server.
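The four server-side steps listed above (receive the word, receive the preferences, access the related information, transmit it to the client) can be sketched as a single request handler. This is a minimal sketch, not the TurboDictionary™ implementation: the JSON message format, the in-memory `Sources` layout and the name `handle_lookup` are all assumptions made for illustration.

```python
import json
from typing import Dict

# Hypothetical layout: source name -> "src->dst" language pair -> word -> entry.
Sources = Dict[str, Dict[str, Dict[str, str]]]


def handle_lookup(body: str, sources: Sources) -> str:
    """Handle one look-up request and return the response as JSON text."""
    request = json.loads(body)
    word = request["word"].strip()              # user-selected word
    pair = request["language_pair"]             # user-selected language pair
    results = {}
    for name in request["reference_sources"]:   # user-selected reference sources
        entry = sources.get(name, {}).get(pair, {}).get(word.lower())
        if entry is not None:
            results[name] = entry
    # The response is what would be transmitted back to the client-side user.
    return json.dumps(
        {"word": word, "language_pair": pair, "results": results},
        ensure_ascii=False,
    )
```

Keeping `ensure_ascii=False` leaves non-Latin character sets readable in the response, which matters when the target language is, for example, Chinese.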
  • The databases on, or connected to the server, contain the information necessary to make the inventive apparatus and method work. The information retrieval databases of the present invention are relatable, and are assembled and accessed using any commercially-available database software, such as Microsoft Access™, Oracle™, Microsoft SQL™ Version 6.5, etc., and other search engine technologies and algorithms as described herein above.
  • A user (client) subsystem generally includes a processor attached to a storage unit, a communication controller, and a display controller. The display controller runs a display unit through which the user interacts with the subsystem. In essence, the user subsystem is a computer able to run software providing a means for communicating with the server. This software, for example, is an Internet web browser such as Microsoft Internet Explorer, Netscape Navigator, or other suitable Internet web browsers. The user subsystem can be a computer or hand-held electronic device, such as a telephone or wireless telephone or other PDA-type device that enables Internet access.
  • Combination Translation and Search Functions. The information retrieval function of the present method and apparatus, including storage devices and server-side databases, is operative with Internet search engines, such as I-Search®, stored on the storage device to provide combined translation and Internet search functions in accordance with particular user-selected Web site text and/or information source and/or language pair preferences. In preferred embodiments, I-Search®, an art-recognized multi-language search engine (currently powering World-Trade-Search.com) is used in combination with software of the present invention (e.g., TurboDictionary™), but other art-recognized Internet search engines (e.g., Yahoo™, Google™, and AltaVista®) are productively used in combination with the instant method and apparatus, and are encompassed within the scope of the present invention. As described above for the inventive information retrieval process, both integrated server- and client-side implementations of such combined translation/search functions are contemplated.
  • FIGS. 7A and 7B illustrate an embodiment of the present invention enabling combined translation and Internet search functions in accordance with particular user-selected Web site text and/or information source and/or language pair preferences. In FIG. 7A, a Chinese-speaking user activates the information retrieval function 70, in accordance with one or more particular translation language preferences, to simultaneously translate user-selected text (“key word”) into one or more languages/character sets 72 (e.g., using TurboDictionary™, according to the present invention). The translations are accessed by an Internet search engine (e.g., I-Search®) to enable a user to simultaneously search the Internet in multiple languages 74, based on the user's native language and selected key word. FIG. 7B shows a more detailed diagram of the combined translation and Internet search embodiment of FIG. 7A, including certain structural elements. The elements and steps above and below dashed lines 76 and 78, respectively, illustrate the core “translate” and display embodiment of the inventive method, comprising user selection of text and preferences for reference sources and translation language, accessing by the information retrieval function of a related information data set, based on related data stored in reference data base(s), and display of the data set to the user (path arrow 77). Optionally, as shown between the dashed lines 76 and 78, the related information data set is used to perform an automated Internet search to obtain related Internet search information which is displayed to the user along with the related information data set (combined path arrows 77 and 79). The core server-side translation and search functionality of the information retrieval function is shown enclosed in dotted lines 73, with user selection and display occurring on the client side.
  • Additional Wireless Device Embodiments. In alternate embodiments, the integrated user-directed information retrieval function of the present invention (using, e.g., TurboDictionary™) is implemented in the context of a wireless network (e.g., WAP server-based), and voice recognition function to provide a real-time voice translator and reference tool. This novel implementation is herein referred to as TurboTalk™. Specifically, the wireless embodiments provide an apparatus and integrated method, over a wireless network comprising a server side (e.g., WAP-based) and a client side (e.g., cellular phones, or other PDA (personal data assistant) wireless devices, e.g., Palm PC, Pocket PC, PSION, etc.), for user-directed acquisition of real-time translation and reference services in both text and voice, using standard cellular phones and PDA devices (voice-to-text, voice-to-voice, text-to-voice and text-to-text). Generally, the implementation of the inventive information retrieval function, comprising reference access and generation of a related information data set, is as described in detail herein above for TurboDictionary™. However, TurboTalk™ embodiments further comprise voice recognition/conversion software stored in the memory of the WAP server and operative with a processor of the server to recognize the voice and process it into text (or process text to voice). Preferably, this embodiment is offered as a wireless service by wireless service providers to subscribing users.
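The combined flow of FIG. 7A, in which a key word is first translated into several languages and each translation is then used as a search term, might be sketched as below. The dictionary layout, the function names and the `https://search.example/query` endpoint with its `q` and `lang` parameters are hypothetical placeholders; they do not describe the interface of I-Search® or any other real search engine.

```python
from typing import Dict, List, Tuple
from urllib.parse import urlencode

# Hypothetical layout: (source language, target language) -> word -> translation.
Dictionaries = Dict[Tuple[str, str], Dict[str, str]]


def translate_keyword(keyword: str, source_lang: str, target_langs: List[str],
                      dictionaries: Dictionaries) -> Dict[str, str]:
    """Map each language code to the key word in that language."""
    translations = {source_lang: keyword}       # keep the user's own key word
    for lang in target_langs:
        hit = dictionaries.get((source_lang, lang), {}).get(keyword)
        if hit is not None:                     # skip languages with no entry
            translations[lang] = hit
    return translations


def build_search_urls(translations: Dict[str, str],
                      endpoint: str = "https://search.example/query") -> Dict[str, str]:
    """Build one search URL per language from the translated key words."""
    return {
        lang: f"{endpoint}?{urlencode({'q': term, 'lang': lang})}"
        for lang, term in translations.items()
    }
```

The returned URLs stand in for the multi-language search step 74; an actual embodiment would submit them to the search engine and merge the results for display.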
  • In preferred embodiments, a wireless carrier offers the TurboTalk™ service, which enables a subscribing user, speaking in their native tongue, to spontaneously speak, for example, Greek while in Greece, and Japanese while in Japan without knowing either Greek or Japanese. That is, travelers can use the TurboTalk™ service to communicate with local people.
  • FIG. 8 shows a preferred embodiment, wherein communication proceeds between a “master” user, having a cellular phone 80 subscribing to the TurboTalk™ service, and a “slave” user, having either a plug-in headset 82 connected to the master device, or having another wireless device accessible by the master device (FIG. 8, upper half). The system is compatible with the use of a plurality of “slave” devices in connection with one or more “master” devices receiving messages in one or more translated languages. In such instances, the master user selects language preferences for each slave.
  • FIG. 8 (lower panel) shows such a WAP-based wireless embodiment. A “master” user 84 selects one or more translation language pairs (in this case, English→Chinese and English→German) using a GSM (Global System for Mobiles) phone 86 accessible by a WAP server 88 hosting the inventive information retrieval function (e.g., the method described herein using TurboDictionary™). The user sends a voice message (“train station”) using the cellular phone, and the message is processed using voice-recognition software stored in the memory of the WAP server and operative with a processor of the server to recognize the voice and process it into text. The information retrieval function translates the text, which is then converted to voice by the voice recognition function, and delivered as a voice message (in Chinese and/or German) over the wireless network to the “slave” users 88.
  • FIG. 9 shows a simpler WAP-based wireless embodiment involving a single “master” user and a single “slave” user. The master user selects a single translation language pair (in this case, English→Chinese), and sends a voice message 90 (“Where can I find the restrooms?”) using the cellular phone 92, and the message is processed using voice-recognition software stored in the memory of a WAP server and operative with a processor of the server to recognize the voice and process it into text. Preferably, the information retrieval function translates the text, which is then converted to voice by the voice recognition/conversion function, and delivered as a voice message 94 (in Chinese) over the wireless network to the “slave” user. Alternatively, depending upon the capabilities of the slave device, the translated information is delivered to the slave user in text, or multimedia formats compatible with the slave user's native language and character set.
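The voice-to-voice path of FIGS. 8 and 9 (recognize speech, translate the text, convert the translation back to voice, deliver one message per selected language) can be sketched as a small pipeline. The three stages are passed in as functions because the specification does not name particular speech or translation engines; `relay_voice_message` and its parameter names are hypothetical.

```python
from typing import Callable, Dict, List

Recognizer = Callable[[bytes, str], str]     # (audio, language) -> text
Translator = Callable[[str, str, str], str]  # (text, source, target) -> text
Synthesizer = Callable[[str, str], bytes]    # (text, language) -> audio


def relay_voice_message(audio: bytes, source_lang: str, target_langs: List[str],
                        recognize: Recognizer, translate: Translator,
                        synthesize: Synthesizer) -> Dict[str, bytes]:
    """Turn one spoken message into one voice message per target language."""
    text = recognize(audio, source_lang)          # voice-to-text, done once
    messages = {}
    for target in target_langs:                   # one selected pair per "slave"
        translated = translate(text, source_lang, target)
        messages[target] = synthesize(translated, target)   # text-to-voice
    return messages
```

With a single target language this corresponds to the FIG. 9 case; passing several target languages corresponds to the multi-recipient case of FIG. 8. Returning `translated` instead of synthesized audio would give the text delivery alternative.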
  • WAP-based embodiments primarily involve the use of standard WML and WMLScript, but many different cellular phones (User Agents) have slight modifications to the WAP standard. Therefore, a database of User Agents and their respective differences is operative with the inventive method and apparatus to ensure that all devices are compatible with TurboTalk™.
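A database of User Agents and their differences could be consulted as sketched below before content is generated for a device. Everything in this example is invented for illustration: the profile fields, the numeric limits and the `ExamplePhone` and `SamplePDA` User Agent strings are hypothetical and do not describe any real handset or any value from the WAP standard.

```python
from typing import Dict, Union

Profile = Dict[str, Union[int, bool]]

# Behaviour assumed for a device that follows the standard (made-up values).
DEFAULT_PROFILE: Profile = {"max_deck_bytes": 1400, "supports_wmlscript": True}

# Hypothetical per-device deviations, keyed by User Agent prefix.
USER_AGENT_OVERRIDES: Dict[str, Profile] = {
    "ExamplePhone/1.0": {"max_deck_bytes": 1000},
    "SamplePDA/2": {"supports_wmlscript": False},
}


def profile_for(user_agent: str) -> Profile:
    """Return the default profile with any device-specific overrides applied."""
    profile = dict(DEFAULT_PROFILE)            # copy so the defaults stay intact
    for prefix, overrides in USER_AGENT_OVERRIDES.items():
        if user_agent.startswith(prefix):
            profile.update(overrides)
            break
    return profile
```

The server would then use the returned profile to decide, for example, how much content to send at once or whether to rely on WMLScript.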
  • In preferred embodiments, master users may retrieve information related to selected voice or text messages, in addition to voice message translations. Such related information, as described herein above, comprises high quality multi-lingual translations, explanations, illustrations and/or consolidated automatic multi-dictionary definitions (“look-ups”), wherein such information is displayed to the user in text, voice, image or multi-media formats that correspond, or are at least compatible with the user's wireless device, and with the user's native language or character set.
  • Various integrated server-side and client-side embodiments of the present invention have been described above, including embodiments combined with multi-language search engines. Additionally the server-integrated user-directed information retrieval function of the present invention can be implemented in the context of a wireless network (e.g., WAP server-based), and voice recognition function to provide a real-time voice translator and reference tool that can be used to communicate between two, or among a plurality of “master” and “slave” users. A person skilled in the art will, however, recognize that many additional embodiments are encompassed within the scope of the present invention. In particular, other embodiments are possible that incorporate additional or different art-recognized features and that apply to additional types and formats of retrievable information.

Claims (30)

1. An integrated method, over a wide area network having a server side and a client side, for user-directed acquisition of information relating to user-selected text of World Wide Web site pages, comprising:
selecting by a user a word of web page text, whereby the user-selected word is accessible by an information retrieval function;
selecting by a user from a plurality of reference sources and translation language pairs, at least one reference source and a translation language pair, whereby the user-selected reference source and user-selected translation language pair are accessible by an information retrieval function;
accessing and retrieving from the user-selected reference source, using the information retrieval function, information relating to the user-selected word in accordance with the user-selected reference source and user-selected translation language pair; and
displaying to the user, based on the accessed and retrieved related information, a related information data set, wherein the selecting and displaying are integrated into the web page, and whereby user-directed acquisition of information relating to user-selected text of World Wide Web site pages is achieved.
2. The method of claim 1, wherein the method steps are integrated into a server-side web site by either incorporating into the site a reference window linked to the information retrieval function, or by installing programmed script on all web pages of the site to link and enable the information retrieval function.
3. The method of claim 1, wherein the method steps are integrated into a client-side system using downloadable software or a browser plug-in that is operatively linked to the information retrieval function.
4. The method of claim 1, wherein selecting of a word of web page text is achieved by typing, or pasting of a copy of the word into a window linked to an information retrieval function, or by highlighting the word, whereby the typing, pasting, or highlighting renders the user-selected word accessible by an information retrieval function.
5. The method of claim 1, wherein at least one reference source is a dictionary.
6. The method of claim 1, wherein the information relating to user-selected text of World Wide Web site pages is selected from the group consisting of high quality multi-lingual translations, explanations, and consolidated automatic multi-dictionary definitions, and wherein such information is displayed to the user in text, voice, image or multi-media formats.
7. The method of claim 1, wherein the information retrieval function further comprises an Internet search engine operative with the information retrieval function to search the Internet in accordance with the user-selected text, retrieved related information, or related information data set, thereby augmenting the information retrieval function.
8. The method of claim 1, wherein selecting of a reference source or a translation language pair comprises pre-selecting either or both of the reference source or the translation language pair for storing as a default user preference, and wherein accessing and retrieving by the information retrieval function is according to the stored default user preference.
9. The method of claim 8, wherein the user preference is stored on the server side.
10. The method of claim 1, wherein selecting by the user comprises selecting a plurality of reference sources for simultaneous accessing by the information retrieval function.
11. The method of claim 1, further comprising storing of either or both of the retrieved related information, or the related information data set.
12. The method of claim 11, wherein the storing of either or both of the retrieved related information or the related information data set is on the server-side.
13. The method of claim 1, further comprising ranking of either or both of the retrieved related information or the related information data set according to relevancy, using one or more relevancy algorithms.
14. The method of claim 13, further comprising sorting, using one or more sorting algorithms, of either or both of the retrieved related information or the related information data set into a plurality of relevancy ranked sub-data sets, whereby the sub-data sets are alternately or sequentially displayable to the user.
15. The method of claim 1, wherein at least one of hashing or BTree data structures are used to process data accessed by the information retrieval function.
16. An integrated method, over a wide area network having a server and a client, for user directed acquisition of information relating to user-selected text of World Wide Web site pages, comprising:
receiving from the client a user-selected word of World Wide Web page text;
receiving from the client at least one user-selected reference source and a user-selected translation language pair;
accessing and retrieving from the user-selected reference source stored on a server, information relating to the user-selected word, in accordance with the user-selected reference source and the user-selected translation language pair; and
transmitting to the client for display to the user, a related information data set, based on the accessed and retrieved related information, wherein selection by and display to the user are integrated into the World Wide Web page.
17. The method of claim 16, wherein accessing and retrieving information relating to the user-selected word further comprises an Internet search engine operative with the server to search the Internet in accordance with the user-selected text, retrieved related information, or related information data set.
18. The method of claim 16, wherein the information relating to user-selected text of World Wide Web site pages is selected from the group consisting of high quality multi-lingual translations, explanations, and consolidated automatic multi-dictionary definitions, and wherein such information is displayed to the user in text, voice, image or multi-media formats.
19. The method of claim 16, wherein user selecting of a reference source or a translation language pair comprises pre-selecting either or both of the reference source or the translation language pair for storing as a default user preference, and wherein the accessing and retrieving by the information retrieval function is according to the stored default user preference.
20. The method of claim 16, wherein a plurality of user-selected reference sources are received from the client.
21. The method of claim 16, further comprising storing of either or both of the retrieved related information, or the related information data set.
22. The method of claim 16, further comprising ranking of either or both of the retrieved related information or the related information data set according to relevancy, using one or more relevancy algorithms.
23. The method of claim 22, further comprising sorting, using one or more sorting algorithms, of either or both of the retrieved related information or the related information data set into a plurality of relevancy ranked sub-data sets, whereby the sub-data sets are alternately or sequentially displayable to the user.
24. The method of claim 16, wherein at least one of hashing or BTree data structures are used to process retrieved data.
25. An apparatus for user-directed acquisition, over a wide area network having a server side and a client side, of information relating to user-selected text of World Wide Web site pages, comprising:
a server connected to the network, wherein the server comprises a processor, a memory, and a storage device, and wherein the processor and memory are operative with a program stored on the storage device to receive from the client a user-selected word of World Wide Web page text;
receive from the client at least one user-selected reference source and user-selected translation language pair;
access and retrieve from the user-selected reference source stored on the server, information relating to the user-selected word, in accordance with the user-selected reference source and the user-selected translation language pair; and
transmit to the client for display to the user, a related information data set, based on the accessed and retrieved related information, wherein selection by and display to the user are integrated into the World Wide Web page.
26. The apparatus of claim 25, further comprising an Internet search engine operative with the server to search the Internet in accordance with the user-selected text, retrieved related information, or related information data set.
27. An integrated method, over a wireless network having a server side and a client side, for user-directed acquisition of real-time translation and reference services in both text and voice, comprising:
selecting, by a client-side user, at least one translation language pair and at least one reference source, whereby the user-selected translation language pair and reference source are accessible by a WAP-based server having information retrieval and voice-recognition functions;
sending by the client-side user a voice message comprising one or more spoken words to the WAP-based server;
accessing and retrieving from the user-selected reference source, using the information retrieval and voice recognition functions, information relating to the voice message in accordance with the user-selected reference source and user-selected translation language pair; and
displaying to the client-side user or to someone connected thereto, based on the accessed and retrieved related information, related text or voice, whereby user-directed acquisition of real-time translation and reference services in both text and voice is achieved.
28. The method of claim 27, wherein selecting of a reference source or a translation language pair comprises pre-selecting either or both of the reference source or the translation language pair for storing as a default user preference.
29. The method of claim 27, wherein at least one reference source is a dictionary.
30. The method of claim 27, wherein selecting by the user comprises selecting a plurality of reference sources for simultaneous accessing and retrieving by the information retrieval function.
US11/292,871 2001-04-20 2005-12-02 Method and apparatus for integrated, user-directed web site text translation Abandoned US20060080083A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US11/292,871 US20060080083A1 (en) 2001-04-20 2005-12-02 Method and apparatus for integrated, user-directed web site text translation

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US28558001P 2001-04-20 2001-04-20
US10/128,952 US6999916B2 (en) 2001-04-20 2002-04-22 Method and apparatus for integrated, user-directed web site text translation
US11/292,871 US20060080083A1 (en) 2001-04-20 2005-12-02 Method and apparatus for integrated, user-directed web site text translation

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US10/128,952 Continuation US6999916B2 (en) 2001-04-20 2002-04-22 Method and apparatus for integrated, user-directed web site text translation

Publications (1)

Publication Number Publication Date
US20060080083A1 (en) 2006-04-13

Family

ID=23094864

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/128,952 Expired - Fee Related US6999916B2 (en) 2001-04-20 2002-04-22 Method and apparatus for integrated, user-directed web site text translation
US11/292,871 Abandoned US20060080083A1 (en) 2001-04-20 2005-12-02 Method and apparatus for integrated, user-directed web site text translation

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US10/128,952 Expired - Fee Related US6999916B2 (en) 2001-04-20 2002-04-22 Method and apparatus for integrated, user-directed web site text translation

Country Status (2)

Country Link
US (2) US6999916B2 (en)
WO (1) WO2002086737A1 (en)


Families Citing this family (71)

Publication number Priority date Publication date Assignee Title
US7246118B2 (en) * 2001-07-06 2007-07-17 International Business Machines Corporation Method and system for automated collaboration using electronic book highlights and notations
EP1308856A3 (en) * 2001-11-01 2006-01-18 Matsushita Electric Industrial Co., Ltd. Information providing system and information providing server apparatus for use therein, information terminal unit, and information providing method using to user profile
US7634397B2 (en) * 2002-06-27 2009-12-15 Siebel Systems, Inc. Single server instance, multi-lingual applications based on loosely coupled metadata and presentation layers
US20040004599A1 (en) * 2002-07-03 2004-01-08 Scott Shepard Systems and methods for facilitating playback of media
US20040024585A1 (en) * 2002-07-03 2004-02-05 Amit Srivastava Linguistic segmentation of speech
WO2004049110A2 (en) * 2002-11-22 2004-06-10 Transclick, Inc. Language translation system and method
KR20040059240A (en) * 2002-12-28 2004-07-05 엔에이치엔(주) A method for providing multi-language translation service and a system of enabling the method
US20040153512A1 (en) * 2003-01-16 2004-08-05 Friend Jeffrey Edward Dynamic online email catalog and trust relationship management system and method
US20050010392A1 (en) * 2003-07-10 2005-01-13 International Business Machines Corporation Traditional Chinese / simplified Chinese character translator
US20050010391A1 (en) * 2003-07-10 2005-01-13 International Business Machines Corporation Chinese character / Pin Yin / English translator
US20050027547A1 (en) * 2003-07-31 2005-02-03 International Business Machines Corporation Chinese / Pin Yin / english dictionary
US8090402B1 (en) * 2003-09-26 2012-01-03 Iwao Fujisaki Communication device
US20050097180A1 (en) * 2003-10-31 2005-05-05 Aaron Abdelhak System and method for improved customized portal web pages
US20050097190A1 (en) * 2003-10-31 2005-05-05 Aaron Abdelhak System and method for customized portal web pages
US7310605B2 (en) * 2003-11-25 2007-12-18 International Business Machines Corporation Method and apparatus to transliterate text using a portable device
US8296126B2 (en) * 2004-02-25 2012-10-23 Research In Motion Limited System and method for multi-lingual translation
CN1950820A (en) * 2004-03-02 2007-04-18 梅林格有限公司 Embedded translation document method and system
US8020101B2 (en) * 2004-05-20 2011-09-13 International Business Machines Corporation User specified transfer of data between applications
GB0414293D0 (en) * 2004-06-26 2004-07-28 Ibm System and method for reducing contention in an environment using optimistic locks
WO2006072882A2 (en) * 2005-01-10 2006-07-13 Melingo, Ltd. Embedded translation-enhanced search
JP4325577B2 (en) * 2005-03-25 2009-09-02 富士ゼロックス株式会社 Translation apparatus and program
US7516062B2 (en) * 2005-04-19 2009-04-07 International Business Machines Corporation Language converter with enhanced search capability
US20070174286A1 (en) * 2005-05-17 2007-07-26 Yahoo!, Inc. Systems and methods for providing features and user interface in network browsing applications
US9582602B2 (en) 2005-05-17 2017-02-28 Excalibur Ip, Llc Systems and methods for improving access to syndication feeds in network browsing applications
US7958446B2 (en) * 2005-05-17 2011-06-07 Yahoo! Inc. Systems and methods for language translation in network browsing applications
US20060286527A1 (en) * 2005-06-16 2006-12-21 Charles Morel Interactive teaching web application
US20070011160A1 (en) * 2005-07-07 2007-01-11 Denis Ferland Literacy automation software
US8225312B2 (en) * 2005-08-12 2012-07-17 Sony Corporation Method and apparatus for downloading software updates
US8689134B2 (en) * 2006-02-24 2014-04-01 Dana W. Paxson Apparatus and method for display navigation
US20070219782A1 (en) * 2006-03-14 2007-09-20 Qing Li User-supported multi-language online dictionary
US20070226321A1 (en) * 2006-03-23 2007-09-27 R R Donnelley & Sons Company Image based document access and related systems, methods, and devices
US7668859B2 (en) * 2006-04-18 2010-02-23 Foy Streetman Method and system for enhanced web searching
US7991608B2 (en) * 2006-04-19 2011-08-02 Raytheon Company Multilingual data querying
GB0608552D0 (en) * 2006-04-29 2006-06-07 Enabled London Ltd Information system
US8924194B2 (en) 2006-06-20 2014-12-30 At&T Intellectual Property Ii, L.P. Automatic translation of advertisements
US7783622B1 (en) 2006-07-21 2010-08-24 Aol Inc. Identification of electronic content significant to a user
WO2008120036A1 (en) * 2007-03-29 2008-10-09 Nokia Corporation Method at a central server for managing a translation dictionary and a translation server system
US9002869B2 (en) * 2007-06-22 2015-04-07 Google Inc. Machine translation for query expansion
US20090132257A1 (en) * 2007-11-19 2009-05-21 Inventec Corporation System and method for inputting edited translation words or sentence
US7974832B2 (en) * 2007-12-12 2011-07-05 Microsoft Corporation Web translation provider
US20090162818A1 (en) * 2007-12-21 2009-06-25 Martin Kosakowski Method for the determination of supplementary content in an electronic device
KR20100091923A (en) * 2009-02-10 2010-08-19 오의진 Method of servicing translation of web page written in many languages
KR101642449B1 (en) * 2009-03-18 2016-07-25 구글 인코포레이티드 Web translation with display replacement
US10671698B2 (en) 2009-05-26 2020-06-02 Microsoft Technology Licensing, Llc Language translation using embeddable component
US9405745B2 (en) * 2009-06-01 2016-08-02 Microsoft Technology Licensing, Llc Language translation using embeddable component
US8543598B2 (en) * 2010-03-01 2013-09-24 Microsoft Corporation Semantic object characterization and search
JP4983943B2 (en) * 2010-03-05 2012-07-25 カシオ計算機株式会社 Text display device and program
US9384283B2 (en) * 2010-04-19 2016-07-05 Tynt Multimedia Inc. System and method for deterring traversal of domains containing network resources
ES2369822B1 (en) * 2010-05-11 2012-10-15 Scio Soft, S.L. VIRTUAL VIDEOTRADUCTOR.
US8527259B1 (en) * 2011-02-28 2013-09-03 Google Inc. Contextual translation of digital content
US20120226500A1 (en) * 2011-03-02 2012-09-06 Sony Corporation System and method for content rendering including synthetic narration
US8843360B1 (en) * 2011-03-04 2014-09-23 Amazon Technologies, Inc. Client-side localization of network pages
US20130138421A1 (en) * 2011-11-28 2013-05-30 Micromass Uk Limited Automatic Human Language Translation
ITRM20120018U1 (en) * 2012-02-06 2013-08-07 Alessio Biancheri IT SYSTEM FOR THE MEMORIZATION, PROCESSING AND CONSULTATION ON REQUEST OF CLINICAL AND ANAMNESTIC INFORMATION
US20130323693A1 (en) * 2012-05-31 2013-12-05 International Business Machines Corporation Providing an uninterrupted reading experience
US9519641B2 (en) * 2012-09-18 2016-12-13 Abbyy Development Llc Photography recognition translation
US9323736B2 (en) * 2012-10-05 2016-04-26 Successfactors, Inc. Natural language metric condition alerts generation
JP6096489B2 (en) * 2012-11-30 2017-03-15 株式会社東芝 Foreign language text creation support apparatus, method, and program
US8914395B2 (en) 2013-01-03 2014-12-16 Uptodate, Inc. Database query translation system
US20140222413A1 (en) * 2013-02-01 2014-08-07 Klip, Inc. Method and user interface for controlling language translations using touch sensitive display screens
US10649619B2 (en) * 2013-02-21 2020-05-12 Oath Inc. System and method of using context in selecting a response to user device interaction
JP6417649B2 (en) * 2013-08-22 2018-11-07 株式会社リコー Text processing device, text display system, program
US9852129B2 (en) * 2013-11-26 2017-12-26 International Business Machines Corporation Language independent processing of logs in a log analytics system
US9639526B2 (en) * 2014-01-10 2017-05-02 Microsoft Technology Licensing, Llc Mobile language translation of web content
CN104123085B (en) * 2014-01-14 2015-08-12 腾讯科技(深圳)有限公司 By the method and apparatus of voice access multimedia interaction website
US20160150038A1 (en) * 2014-11-26 2016-05-26 Microsoft Technology Licensing, Llc. Efficiently Discovering and Surfacing Content Attributes
US9430466B1 (en) 2015-08-26 2016-08-30 Google Inc. Techniques for crowd sourcing human translations to provide translated versions of web pages with additional content
KR101827773B1 (en) * 2016-08-02 2018-02-09 주식회사 하이퍼커넥트 Device and method of translating a language
AU2017393668A1 (en) * 2017-01-17 2019-08-29 Loveland Co., Ltd. Multilingual communication system and multilingual communication provision method
US10885286B2 (en) * 2018-10-12 2021-01-05 Microsoft Technology Licensing, Llc Simultaneous and real time translation and language switching across a set of features
US11477286B2 (en) 2020-05-01 2022-10-18 Microsoft Technology Licensing, Llc Combined local and server context menus

Citations (4)

Publication number Priority date Publication date Assignee Title
US5987402A (en) * 1995-01-31 1999-11-16 Oki Electric Industry Co., Ltd. System and method for efficiently retrieving and translating source documents in different languages, and other displaying the translated documents at a client device
US6112174A (en) * 1996-11-13 2000-08-29 Hitachi, Ltd. Recognition dictionary system structure and changeover method of speech recognition system for car navigation
US6615178B1 (en) * 1999-02-19 2003-09-02 Sony Corporation Speech translator, speech translating method, and recorded medium on which speech translation control program is recorded
US20050149315A1 (en) * 1995-11-13 2005-07-07 America Online, Inc. Integrated multilingual browser

Family Cites Families (4)

Publication number Priority date Publication date Assignee Title
US5523946A (en) 1992-02-11 1996-06-04 Xerox Corporation Compact encoding of multi-lingual translation dictionaries
JPH0981569A (en) * 1995-09-12 1997-03-28 Hitachi Ltd System for providing services handling many countries
JP3959180B2 (en) * 1998-08-24 2007-08-15 東芝ソリューション株式会社 Communication translation device
US6338033B1 (en) 1999-04-20 2002-01-08 Alis Technologies, Inc. System and method for network-based teletranslation from one natural language to another

Patent Citations (5)

Publication number Priority date Publication date Assignee Title
US5987402A (en) * 1995-01-31 1999-11-16 Oki Electric Industry Co., Ltd. System and method for efficiently retrieving and translating source documents in different languages, and other displaying the translated documents at a client device
US20050149315A1 (en) * 1995-11-13 2005-07-07 America Online, Inc. Integrated multilingual browser
US7292987B2 (en) * 1995-11-13 2007-11-06 America Online, Inc. Integrated multilingual browser
US6112174A (en) * 1996-11-13 2000-08-29 Hitachi, Ltd. Recognition dictionary system structure and changeover method of speech recognition system for car navigation
US6615178B1 (en) * 1999-02-19 2003-09-02 Sony Corporation Speech translator, speech translating method, and recorded medium on which speech translation control program is recorded

Cited By (42)

Publication number Priority date Publication date Assignee Title
US20060218292A1 (en) * 2001-05-09 2006-09-28 Woodward Mark L Method, apparatus and program product for media identification and tracking associated user preferences
US20080147715A1 (en) * 2001-05-09 2008-06-19 Woodward Mark L Method, apparatus and program product for media identification and tracking associated user preferences
US8244896B2 (en) 2001-05-09 2012-08-14 Emission Limited Liability Company Method, apparatus and program product for media identification and tracking associated user preferences
US7844722B2 (en) * 2001-05-09 2010-11-30 Woodward Mark L Method, apparatus and program product for media identification and tracking associated user preferences
US20050043938A1 (en) * 2001-10-08 2005-02-24 Emil Viken Mutilingual support in web servers for embedded systems
US20040148158A1 (en) * 2002-12-27 2004-07-29 Casio Computer Co., Ltd. Information display control device and recording media that stores information display control programs
US8328558B2 (en) 2003-07-31 2012-12-11 International Business Machines Corporation Chinese / English vocabulary learning tool
US20060149528A1 (en) * 2005-01-05 2006-07-06 Inventec Corporation System and method of automatic Japanese kanji labeling
US7676357B2 (en) * 2005-02-17 2010-03-09 International Business Machines Corporation Enhanced Chinese character/Pin Yin/English translator
US20060184352A1 (en) * 2005-02-17 2006-08-17 Yen-Fu Chen Enhanced Chinese character/Pin Yin/English translator
US20080294424A1 (en) * 2006-02-10 2008-11-27 Fujitsu Limited Information display system, information display method, and program
US8065134B2 (en) * 2006-02-10 2011-11-22 Fujitsu Limited Multi-lingual information display system comprising public area and individual areas
US8660244B2 (en) * 2006-02-17 2014-02-25 Microsoft Corporation Machine translation instant messaging applications
US20070208813A1 (en) * 2006-02-17 2007-09-06 Microsoft Corporation Machine translation instant messaging applications
US10460021B2 (en) 2006-05-04 2019-10-29 Samsung Electronics Co., Ltd. Method and device for selecting a word to be defined in mobile communication terminal having an electronic dictionary
US9400772B2 (en) * 2006-05-04 2016-07-26 Samsung Electronics Co., Ltd. Method and device for selecting a word to be defined in mobile communication terminal having an electronic dictionary
US9092403B2 (en) * 2006-05-04 2015-07-28 Samsung Electronics Co., Ltd. Method and device for selecting a word to be defined in mobile communication terminal having an electronic dictionary
US20140229825A1 (en) * 2006-05-04 2014-08-14 Samsung Electronics Co., Ltd. Method and device for selecting a word to be defined in mobile communication terminal having an electronic dictionary
US20100169367A1 (en) * 2006-05-04 2010-07-01 Samsung Electronics Co., Ltd. Method and device for selecting a word to be defined in mobile communication terminal having an electronic dictionary
US20140229824A1 (en) * 2006-05-04 2014-08-14 Samsung Electronics Co., Ltd. Method and device for selecting a word to be defined in mobile communication terminal having an electronic dictionary
US20070260452A1 (en) * 2006-05-04 2007-11-08 Samsung Electronics Co., Ltd. Method and device for selecting a word to be defined in mobile communication terminal having an electronic dictionary
US20080229218A1 (en) * 2007-03-14 2008-09-18 Joon Maeng Systems and methods for providing additional information for objects in electronic documents
US20080319942A1 (en) * 2007-05-14 2008-12-25 Samir Courdy Method and system for report generation including extensible data
US9224179B2 (en) * 2007-05-14 2015-12-29 The University Of Utah Research Foundation Method and system for report generation including extensible data
US20090222428A1 (en) * 2008-03-02 2009-09-03 Microsoft Corporation Multi-lingual information display in a single language portal
US7970784B2 (en) 2008-03-02 2011-06-28 Microsoft Corporation Multi-lingual information display in a single language portal
WO2009111132A1 (en) * 2008-03-02 2009-09-11 Microsoft Corporation Multi-lingual information display in a single language portal
US20090248396A1 (en) * 2008-03-28 2009-10-01 International Business Machines Corporation Method for automating an internationalization test in a multilingual web application
US7698688B2 (en) * 2008-03-28 2010-04-13 International Business Machines Corporation Method for automating an internationalization test in a multilingual web application
US20090287471A1 (en) * 2008-05-16 2009-11-19 Bennett James D Support for international search terms - translate as you search
US8650561B2 (en) * 2008-07-10 2014-02-11 Apple Inc. System and method for localizing display of applications for download
US20100011354A1 (en) * 2008-07-10 2010-01-14 Apple Inc. System and method for localizing display of applications for download
US20100049752A1 (en) * 2008-08-22 2010-02-25 Inventec Corporation Dynamic word translation system and method thereof
US20100324887A1 (en) * 2009-06-17 2010-12-23 Dong Mingchui System and method of online user-cycled web page vision instant machine translation
US8311800B1 (en) * 2010-04-01 2012-11-13 Symantec Corporation Systems and methods for detecting incorrect translations of terms in a computing string
US20110251837A1 (en) * 2010-04-07 2011-10-13 eBook Technologies, Inc. Electronic reference integration with an electronic reader
WO2012174703A1 (en) * 2011-06-20 2012-12-27 Microsoft Corporation Hover translation of search result captions
US9122673B2 (en) 2012-03-07 2015-09-01 International Business Machines Corporation Domain specific natural language normalization
US9424253B2 (en) 2012-03-07 2016-08-23 International Business Machines Corporation Domain specific natural language normalization
CN103716453A (en) * 2012-10-02 2014-04-09 Lg电子株式会社 Mobile terminal and control method for the mobile terminal
EP2717148A3 (en) * 2012-10-02 2014-11-26 LG Electronics, Inc. Mobile terminal and control method for the mobile terminal
US20220129949A1 (en) * 2013-01-15 2022-04-28 Motionpoint Corporation Dynamic determination of localization source for web site content

Also Published As

Publication number Publication date
WO2002086737A1 (en) 2002-10-31
US20020194300A1 (en) 2002-12-19
US6999916B2 (en) 2006-02-14

Similar Documents

Publication Publication Date Title
US6999916B2 (en) Method and apparatus for integrated, user-directed web site text translation
US6735592B1 (en) System, method, and computer program product for a network-based content exchange system
US10032207B2 (en) Product placement engine and method
US9323848B2 (en) Search system using search subdomain and hints to subdomains in search query statements and sponsored results on a subdomain-by-subdomain basis
US6498921B1 (en) Method and system to answer a natural-language question
US8041601B2 (en) System and method for automatically targeting web-based advertisements
US8775396B2 (en) Method and system for searching a wide area network
US20060074853A1 (en) Canonicalization of terms in a keyword-based presentation system
US20020010639A1 (en) Computer-based interpretation and location system
US20050283473A1 (en) Apparatus, method and system of artificial intelligence for data searching applications
US20020154162A1 (en) Systems and methods for context personalized web browsing based on a browser companion agent and associated services
US7668859B2 (en) Method and system for enhanced web searching
US20120030014A1 (en) Automatic Ad Creative Generation
WO2007033468A2 (en) System and method configuring contextual based content with publisher content for display on a user interface
US11620143B2 (en) Interface and mode selection for digital action execution
JP2001076001A (en) Method for providing event information
US20070124200A1 (en) Systems and methods for providing online contextual advertising in multilingual environments
US20120016741A1 (en) Targeting content without keywords
KR20050045650A (en) Information suppling system and method with info-box
WO2016046650A1 (en) Method of and server for processing a message to determine intent
Gangopadhyay et al. Multilingual Electronic Commerce in a Global Economy
KR20040078631A (en) Apparatus and method for reconstructuring search result using hardware unique number

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation. Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION