CN103646017B - Acronym generating system for naming and working method thereof - Google Patents
Acronym generating system for naming and working method thereof Download PDFInfo
- Publication number
- CN103646017B CN103646017B CN201310673706.XA CN201310673706A CN103646017B CN 103646017 B CN103646017 B CN 103646017B CN 201310673706 A CN201310673706 A CN 201310673706A CN 103646017 B CN103646017 B CN 103646017B
- Authority
- CN
- China
- Prior art keywords
- acronym
- sentence
- acronyms
- classification
- database
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 14
- 239000000284 extract Substances 0.000 claims description 7
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
本发明公开了一种用于命名的缩略词生成系统及其工作方法,该系统通过对输入长字符串的分析,给出适当的缩略词命名。该系统包括输入输出页面及后台支撑服务平台,输入输出页面用于输入待生成缩略词的长字符串及输出用于命名的缩略词;后台支撑服务平台包括分类数据库、缩略词生成系统及推荐系统,分类数据库中储存有便于计算出各个单词的分类倾向的数据,可以使得后台支持程序分析用户输入的语句并且产生语义相关的缩略词,用于各领域的命名。本发明的产生就是为了改变现状,使得用户能够得到与原字符串语义相近的缩略词以用于命名。
The invention discloses an abbreviation generating system for naming and a working method thereof. The system gives appropriate abbreviation names by analyzing input long character strings. The system includes input and output pages and a background support service platform. The input and output pages are used to input long strings of acronyms to be generated and output acronyms for naming; the background support service platform includes a classification database and an acronym generation system And the recommendation system, the classification database stores data that is convenient for calculating the classification tendency of each word, which can make the background support program analyze the sentences entered by the user and generate semantically related acronyms for the naming of various fields. The purpose of the present invention is to change the status quo, so that users can obtain abbreviations that are semantically similar to the original character strings for naming.
Description
技术领域:Technical field:
本发明涉及一种名字生成系统及其工作方法,尤其涉及用于命名的缩略词生成系统及其工作方法,其为一个由多个单词组成的字符串提供与字符串本身意义相符合的缩略词。The present invention relates to a name generation system and its working method, in particular to an acronym generation system and its working method for naming, which provides a string composed of multiple words with an abbreviation that matches the meaning of the string itself. Abbreviations.
背景技术:Background technique:
缩略词生成技术是一个有创造性的、模拟人类思维对字符串进行分析的自动生成技术。由于各行业、各领域都需要打造家喻户晓的好品牌,所以取一个朗朗上口而又体现专业特色的名字就显得尤为重要。所以构建一个用于命名的缩略词生成系统在各行业、各领域都具有巨大的应用潜力和广阔前景。Acronym generation technology is a creative automatic generation technology that simulates human thinking to analyze character strings. Since every industry and field needs to create a well-known good brand, it is particularly important to choose a name that is catchy and professional. Therefore, building an acronym generation system for naming has great application potential and broad prospects in various industries and fields.
目前互联网上已有缩略词生成与缩略词查询系统,其命名规则基本上采用首字母匹配,即提取该字符串内所有单词的第一个字母构成缩略语命名。但是,如何生成与字符串意义相关的缩略词,尚未有可行的解决方法。一些缩略词生成系统通过用户投票的方式为生成的缩略词排序,但由于投票这种方式需要用户的配合,在查询次数较小的字符串上表现得并不令人满意。而且考虑到不同领域的用户需求不同,投票方式生成的缩略词在缩略词表达语义方面的表现不尽如人意。At present, there is an acronym generation and acronym query system on the Internet, and its naming rules basically use initial letter matching, that is, the first letter of all words in the string is extracted to form an acronym name. However, there is no feasible solution for how to generate acronyms related to the meaning of strings. Some acronym generation systems rank the generated acronyms by voting by users, but because voting requires the cooperation of users, it is not satisfactory for strings with a small number of queries. Moreover, considering the different needs of users in different fields, the performance of the acronyms generated by the voting method in the semantics of the acronyms is not satisfactory.
在现有的用于命名的缩略词生成系统中,我们利用计算机分析出字符串的语义,通过相关性匹配规则构造出与之相近语义的缩略词。通过查询后台数据库检查这样的组合是否构成单词,如果不构成单词,则在其中添加或删除一些字母以达到生成缩略词的目的。In the existing acronym generation system for naming, we use computers to analyze the semantics of character strings, and construct acronyms with similar semantics through correlation matching rules. Check whether such a combination constitutes a word by querying the background database, and if not, add or delete some letters to achieve the purpose of generating an acronym.
发明内容:Invention content:
本发明提供一种用于命名的缩略词生成系统及其工作方法,其通过改变现有的缩略词生成系统,使之能够生成与原字符串语义相匹配的缩略词。The present invention provides an abbreviation generating system for naming and its working method. By changing the existing abbreviation generating system, it can generate abbreviations matching the semantics of the original character string.
本发明采用如下技术方案:一种用于命名的缩略词生成系统,该系统包括输入输出页面及后台支撑服务平台;其中,输入输出页面用于输入待生成缩略词的长字符串及输出用于命名的缩略词;后台支撑服务平台包括:The present invention adopts the following technical solutions: an acronym generation system for naming, the system includes an input and output page and a background support service platform; wherein, the input and output page is used for inputting a long string of acronyms to be generated and outputting Acronyms used for naming; background support service platforms include:
分类数据库,分类数据库使用后台支持程序建立,分类数据库中储存有计算出各个单词的分类倾向的数据,用于查找用户输入的语句所属分类,并用于在相应分类所属的数据库中匹配所需的缩略词;Classification database, the classification database is established using background support programs, and the classification database stores the data that calculates the classification tendency of each word, which is used to find the classification of the sentence entered by the user, and is used to match the required abbreviation in the database to which the corresponding classification belongs. Abbreviations;
缩略词生成系统,通过查询分类数据库分析原语句的语义以及所属类别,从而在语句字符串的子序列中找出语义匹配的缩略词并按照语义相关程度给出排序;The acronym generation system analyzes the semantics and category of the original sentence by querying the classification database, so as to find out the acronyms that match the semantics in the subsequence of the sentence string and sort them according to the degree of semantic relevance;
推荐系统,缩略词均不匹配情况下,能够判断输入语句的语义,并在不影响语句语义的基础上修改语句中的某些单词或是调换语句中单词的顺序,再与分类数据库中单词进行匹配,使之能产生匹配的缩略词,并推荐给用户。In the recommendation system, when the acronyms do not match, it can judge the semantics of the input sentence, and modify some words in the sentence or exchange the order of the words in the sentence without affecting the semantics of the sentence, and then compare it with the words in the classification database Matching is performed so that it can generate matching acronyms and recommend them to users.
本发明还采用如下技术方案:一种用于命名的缩略词生成系统的工作方法,其包括如下步骤:The present invention also adopts following technical scheme: a kind of working method for the acronym generating system of naming, it comprises the steps:
1).输入待生成缩略词的长字符串,并确认生成;1). Enter the long character string of the acronym to be generated, and confirm the generation;
2).根据上述输入的长字符串,提取字符串中的每一个单词,并依次与分类数据库中的单词进行匹配并计算出各单词的类型;2). According to the long string entered above, extract each word in the string, match it with the words in the classification database in turn, and calculate the type of each word;
3).判断出该字符串属于的类型,然后保存下该类型;3). Determine the type of the string, and then save the type;
4).分析长字符串中有实意单词的首字母作为固定字母,并在此基础上保持字符原序并插入长字符串中的其它字母,找出所有可能的缩略词,并依次与步骤3)中查找出的类型所对应的数据库中的单词进行匹配,若匹配成功,则作为候选缩略词保存;4). Analyze the first letter of the real word in the long string as a fixed letter, and on this basis, keep the original order of the characters and insert other letters in the long string, find out all possible acronyms, and follow the steps in turn 3) match the word in the database corresponding to the type found in, if the match is successful, then save it as a candidate acronym;
5).为所有候选缩略词按类型相关程度排序,类型相关程度在类型数据库中获取;5). Sort all the candidate acronyms according to the degree of type correlation, and the degree of type correlation is obtained in the type database;
6).在缩略词输出框中显示排序之后的缩略词,转步骤7),若无法生成相关类型的缩略词,则转步骤8);6). Display the sorted acronyms in the acronym output box, go to step 7), if the relevant type of acronym cannot be generated, go to step 8);
7).进行复位动作,准备下一次缩略词生成;7). Perform a reset action to prepare for the next generation of acronyms;
8).进入缩略词推荐系统,不改变语句语义对语句进行修改,生成缩略词,并把修改的语句反馈给用户。8). Enter the acronym recommendation system, modify the sentence without changing the semantics of the sentence, generate acronyms, and feed back the modified sentence to the user.
本发明具有如下有益效果:The present invention has following beneficial effect:
(1)本发明通过单词的分类计算出输入字符串的分类,在该分类下匹配输入字符串的各个子序列,从而达到缩略词语义与输入字符串相近的目的;(1) The present invention calculates the classification of the input character string through the classification of words, and matches each subsequence of the input character string under the classification, so as to achieve the purpose that the semantics of the acronym is similar to the input character string;
(2)本发明与已有的缩略词生成系统相比,其能够大大提高生成命名缩略词与原语句的相关程度。(2) Compared with the existing acronym generation system, the present invention can greatly improve the degree of correlation between the generated named acronym and the original sentence.
附图说明:Description of drawings:
图1为本发明用于命名的缩略词生成系统的结构图。FIG. 1 is a structural diagram of an acronym generation system for naming in the present invention.
图2为本发明缩略词生成流程图。Fig. 2 is a flow chart of abbreviation generation in the present invention.
图3为本发明分类数据库生成流程图。Fig. 3 is a flow chart for generating the classification database of the present invention.
具体实施方式:detailed description:
请参照图1至图3所示,本发明用于命名的缩略词生成系统包括输入输出页面及后台支持程序。其中输入输出页面包括语句输入框、命名生成按钮、复位按钮、推荐按钮以及缩略词输出框。后台支持程序分为三个部分:分类数据库的生成、缩略词的生成及推荐系统。分类数据库用于查找用户输入的语句所属分类,并用于在相应分类所属的数据库中匹配所需的缩略词。下面将具体介绍这三个部分:Please refer to FIG. 1 to FIG. 3 , the acronym generation system for naming in the present invention includes input and output pages and background support programs. The input and output pages include a statement input box, a naming generation button, a reset button, a recommendation button, and an acronym output box. The background support program is divided into three parts: the generation of classification database, the generation of acronyms and the recommendation system. The taxonomy database is used to find the category to which the sentence entered by the user belongs, and to match the required acronym in the database to which the corresponding category belongs. These three parts are described in detail below:
(1)分类数据库的生成(1) Generation of classification database
本发明所述的分类数据库使用后台支持程序事先建立,存储的数据包括大量单词和各单词的分类倾向。建立分类数据库需要大量训练文本。我们首先对训练文本中出现的所有单词进行字数统计,然后计算各单词对每个独立文本的重要性,最后使用余弦相似性的原理对文本进行分类,从而得到单词的分类。通过训练文本产生分类数据库,步骤如下:The classification database of the present invention is established in advance using a background support program, and the stored data includes a large number of words and the classification tendencies of each word. Building a classification database requires a large amount of training text. We first count all the words that appear in the training text, then calculate the importance of each word to each independent text, and finally use the principle of cosine similarity to classify the text to obtain the word classification. Generate a classification database through the training text, the steps are as follows:
A:由处理程序分析文本本件,得到各个单词在各文本中出现的次数,在预备数据库中储存为<单词,文件ID[出现次数]>这样的格式;A: Analyze the original text file by the processing program to obtain the number of occurrences of each word in each text, and store it in the format of <word, file ID [number of occurrences]> in the preliminary database;
B:为预备数据库中的每一个元组计算该单词对各个文件的重要性ti,这里用到了TF-IDF(term frequency-inverse document frequency)技术:B: Calculate the importance ti of the word to each file for each tuple in the preliminary database. Here, TF-IDF (term frequency-inverse document frequency) technology is used:
tndfi,j=tfi,j×idfi tndf i, j = tf i, j × idf i
以上式子中ni,j是该词在文件dj中的出现次数,而分母则是在文件dj中所有字词的出现次数之和。|D|表示文件总数,|{j:ti∈dj}|表示包含单词ti的文件数。In the above formula, n i, j is the number of occurrences of the word in file d j , and the denominator is the sum of the number of occurrences of all words in file d j . |D| indicates the total number of documents, and |{j:t i ∈ d j }| indicates the number of documents containing the word ti.
如此计算出预备数据库中所有单词与文件之间的相关程度并存入define数据库格式为<单词,与文件1相关度,与文件2相关度,....,与文件n相关度>。In this way, the degree of correlation between all words and files in the preliminary database is calculated and stored in the define database. The format is <word, degree of correlation with file 1, degree of correlation with file 2, ..., degree of correlation with file n>.
C:把B中所述的define数据库每个单词与文件i的相关程度构成一个向量,计算各文件之间的余弦相似性:C: The degree of correlation between each word in the define database described in B and file i forms a vector, and the cosine similarity between files is calculated:
D:根据C中计算的余弦相似性为文件分类,再根据文件的分类为各个单词分类,如此就得到了各单词与某一类型的相关程度,构成了分类数据库。D: Classify files according to the cosine similarity calculated in C, and then classify each word according to the classification of files, so that the degree of correlation between each word and a certain type is obtained, and a classification database is formed.
(2)缩略词的生成(2) Generation of acronyms
缩略词的生成分为两个阶段:Acronyms are generated in two stages:
(A):分析用户输入字符串的语义:(A): Analyze the semantics of user input strings:
首先提取用户输入语句中的各个单词,通过查询(1)中建立的分类数据库分析各单词的类别,从而得到并记录语句的类别。Firstly, extract each word in the sentence input by the user, and analyze the category of each word by querying the classification database established in (1), so as to obtain and record the category of the sentence.
(B):产生语义相关的缩略词并排序:(B): Generate semantically related acronyms and sort them:
首字母缩略词匹配:Acronym matching:
考虑到好的缩略词往往尽可能使用原语句中的首字母,本发明首先提取用户输入语句的首字母组成缩略词与(A)中记录的分类下的数据库进行匹配,若匹配成功,则此缩略词记为最佳缩略词。Considering that good acronyms often use the initials in the original sentence as much as possible, the present invention first extracts the initials of the user input sentence to form an acronym and matches the database under the classification recorded in (A), if the matching is successful, This acronym is recorded as the best acronym.
在首字母基础上插入句中其它单词匹配:Insert other word matches in the sentence based on the first letter:
首字母缩略词匹配成功的概率并不是很高。当匹配失败时,我们考虑选取原语句中的部分单词,按原顺序插入到首字母序列之中,再次在(A)中记录的分类下的数据库中进行匹配。The odds of a successful acronym match are not very high. When the matching fails, we consider selecting some words in the original sentence, inserting them into the initial sequence in the original order, and matching again in the database under the classification recorded in (A).
其中缩略词生成部分,由用户发起,输入一个包含N个单词的字符串,点击生成按钮后,交由后台处理程序,步骤如下:Among them, the acronym generation part is initiated by the user, input a string containing N words, click the generate button, and hand it over to the background processing program, the steps are as follows:
A.提取输入字符串中的所有单词组成序列a,提取输入字符串中的所有字符组成序列b;A. Extract all the words in the input string to form a sequence a, and extract all the characters in the input string to form a sequence b;
B.根据步骤A中所述序列a,利用前期准备部分提到的TF-IDF技术计算出原字符串所属分类;B. According to the sequence a described in step A, use the TF-IDF technology mentioned in the preparatory part to calculate the classification of the original character string;
C.根据步骤B中分类,把该类型数据库下的所有单词取出,构成一棵Trie树,以便缩略词的匹配;C. according to classification in step B, all words under this type database are taken out, form a Trie tree, so that the matching of abbreviations;
D.对于步骤A中所述序列b,按发明内容部分步骤4)中要求,遍历其所有长度小于等于N+2的子序列,并在步骤C中所述Trie树中查找该子序列,若找到,保存所有结果到序列c;D. For the sequence b described in step A, according to the requirements in step 4) of the summary of the invention, traverse all subsequences whose length is less than or equal to N+2, and search for the subsequence in the Trie tree described in step C, if Find and save all results to sequence c;
E.根据分类数据库中与类型相关的程度,为序列c排序并打印到屏幕。若无法生成相关类型的缩略词,或是接收到用户要求,则转步骤G;E. Sort and print the sequence c to the screen according to the degree to which it is associated with the type in the taxonomy database. If the relevant type of acronym cannot be generated, or a user request is received, go to step G;
F.点击复位按钮,准备下一次缩略词生成;F. Click the reset button to prepare for the next generation of acronyms;
G.进入缩略词推荐系统,不改变语句语义对语句进行修改,生成缩略词,并把修改的语句反馈给用户。G. Enter the acronym recommendation system, modify the sentence without changing the semantics of the sentence, generate an acronym, and feed back the modified sentence to the user.
(3)推荐系统(3) Recommendation system
若用户对(2)中产生的缩略词不满意,则通过推荐系统以修改原语句的方式提供更好的缩略词。该推荐系统能够保证在不改变原语句意思的前提下对语句进行修改。修改语句后,再次与(2)中记录的分类数据库中的单词进行匹配。以下为本系统中对语句修改的三种可行方法:If the user is dissatisfied with the acronyms generated in (2), a better acronym will be provided by modifying the original sentence through the recommendation system. The recommendation system can guarantee that the sentence can be modified without changing the meaning of the original sentence. After modifying the sentence, match it again with the words in the classification database recorded in (2). The following are three feasible methods for modifying sentences in this system:
A.对原语句中的单词进行换序A. Reorder the words in the original sentence
这种方法对原语句中的单词进行简单的顺序的调换,可以极大地保证原语句意思的完整性。例如:提取原语句的首字母,进行顺序的调换:为了保留原语句的意思以及保证语法的正确,可以考虑调换and或是or连接的单词。This method can simply change the order of the words in the original sentence, which can greatly guarantee the integrity of the meaning of the original sentence. For example: extract the first letter of the original sentence, and exchange the order: in order to retain the meaning of the original sentence and ensure the correct grammar, you can consider exchanging the words connected by and or or.
B.在原语句中插入同义单词B. Insert synonyms into the original sentence
使用这种方法的时候可以考虑在同义单词之间使用and或是or之类的连接词以保证语法的正确。When using this method, you can consider using conjunctions such as and or or between synonyms to ensure grammatical correctness.
C.在原语句中修改同义或同类型单词C. Modify synonyms or words of the same type in the original sentence
利用现有的字典查找同义单词对长字符串中实词进行替换。Use existing dictionaries to find synonymous words and replace content words in long character strings.
请参照图1至图3所示,本发明用于命名的缩略词生成系统的工作方法包括如下步骤:Please refer to shown in Fig. 1 to Fig. 3, the working method of the acronym generation system that the present invention is used for naming comprises the steps:
1).用户输入待生成缩略词的长字符串,并点击生成按钮;1). The user enters a long string of acronyms to be generated and clicks the generate button;
2).后台处理程序根据用户的输入,提取字符串中的每一个单词,并依次与分类数据库中的单词进行匹配并计算出各单词的类型;2). The background processing program extracts each word in the string according to the user's input, and sequentially matches with the words in the classification database and calculates the type of each word;
3).后台处理程序判断出该字符串最有可能属于的类型,然后保存下该类型;3). The background processing program determines the most likely type of the string, and then saves the type;
4).后台处理程序把长字符串的所有子序列找出来,并依次与步骤3)中查找出的类型所对应的数据库中的单词进行匹配,若匹配成功,则作为候选缩略词保存;4). The background processing program finds out all the subsequences of the long string, and matches them with the words in the database corresponding to the type found in step 3). If the match is successful, it is saved as a candidate acronym;
5).为所有候选缩略词按类型相关程度排序,类型相关程度可以在类型数据库中获取;5). Sort all the candidate acronyms according to the degree of type correlation, which can be obtained in the type database;
6).在缩略词输出框中显示排序之后的缩略词,转步骤7),若无法生成相关类型的缩略词,则转步骤8);6). Display the sorted acronyms in the acronym output box, go to step 7), if the relevant type of acronym cannot be generated, go to step 8);
7).点击复位按钮,准备下一次缩略词生成;7). Click the reset button to prepare for the next generation of acronyms;
8).进入缩略词推荐系统,不改变语句语义对语句进行修改,生成缩略词,并把修改的语句反馈给用户。8). Enter the acronym recommendation system, modify the sentence without changing the semantics of the sentence, generate acronyms, and feed back the modified sentence to the user.
本发明用于命名的缩略词生成系统及其工作方法通过改变现有的缩略词生成系统,使之能够生成与原字符串语义相匹配的缩略词。The system for generating abbreviations used for naming and the working method thereof of the present invention change the existing system for generating abbreviations so that they can generate abbreviations that match the semantics of the original character strings.
以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下还可以作出若干改进,这些改进也应视为本发明的保护范围。The above is only a preferred embodiment of the present invention, it should be pointed out that for those of ordinary skill in the art, some improvements can also be made without departing from the principle of the present invention, and these improvements should also be regarded as the invention. protected range.
Claims (2)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310673706.XA CN103646017B (en) | 2013-12-11 | 2013-12-11 | Acronym generating system for naming and working method thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310673706.XA CN103646017B (en) | 2013-12-11 | 2013-12-11 | Acronym generating system for naming and working method thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103646017A CN103646017A (en) | 2014-03-19 |
CN103646017B true CN103646017B (en) | 2017-01-04 |
Family
ID=50251236
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310673706.XA Active CN103646017B (en) | 2013-12-11 | 2013-12-11 | Acronym generating system for naming and working method thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103646017B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10380247B2 (en) | 2016-10-28 | 2019-08-13 | Microsoft Technology Licensing, Llc | Language-based acronym generation for strings |
JP6881991B2 (en) * | 2017-01-30 | 2021-06-02 | キヤノン株式会社 | Image processing device and its control method and program |
CN110231955B (en) * | 2019-05-13 | 2024-05-07 | 平安科技(深圳)有限公司 | Code processing method, device, computer equipment and storage medium |
CN113534972B (en) * | 2020-04-14 | 2025-01-03 | 北京搜狗科技发展有限公司 | A method and device for prompting an entry and a device for prompting an entry |
CN112632909B (en) * | 2020-10-30 | 2024-06-11 | 中核核电运行管理有限公司 | English coding method and device for data object |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1983271A (en) * | 2005-12-16 | 2007-06-20 | 国际商业机器公司 | System and method for defining and translating chat abbreviations |
CN101365012A (en) * | 2008-10-06 | 2009-02-11 | 深圳华为通信技术有限公司 | Abbreviation operating method and hand-hold communication terminal |
CN101599075A (en) * | 2009-07-02 | 2009-12-09 | 清华大学 | Chinese abbreviation processing method and device |
WO2012047214A2 (en) * | 2010-10-06 | 2012-04-12 | Virtuoz, Sa | Visual display of semantic information |
CN102902660A (en) * | 2011-07-26 | 2013-01-30 | 苗玉水 | Holographic Chinese information processing method by Quanpin and Jianpin of Chinese phonetic codes |
CN103020164A (en) * | 2012-11-26 | 2013-04-03 | 华北电力大学 | Semantic search method based on multi-semantic analysis and personalized sequencing |
-
2013
- 2013-12-11 CN CN201310673706.XA patent/CN103646017B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1983271A (en) * | 2005-12-16 | 2007-06-20 | 国际商业机器公司 | System and method for defining and translating chat abbreviations |
CN101365012A (en) * | 2008-10-06 | 2009-02-11 | 深圳华为通信技术有限公司 | Abbreviation operating method and hand-hold communication terminal |
CN101599075A (en) * | 2009-07-02 | 2009-12-09 | 清华大学 | Chinese abbreviation processing method and device |
WO2012047214A2 (en) * | 2010-10-06 | 2012-04-12 | Virtuoz, Sa | Visual display of semantic information |
CN102902660A (en) * | 2011-07-26 | 2013-01-30 | 苗玉水 | Holographic Chinese information processing method by Quanpin and Jianpin of Chinese phonetic codes |
CN103020164A (en) * | 2012-11-26 | 2013-04-03 | 华北电力大学 | Semantic search method based on multi-semantic analysis and personalized sequencing |
Also Published As
Publication number | Publication date |
---|---|
CN103646017A (en) | 2014-03-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112035730B (en) | Semantic retrieval method and device and electronic equipment | |
CN104933027B (en) | A kind of open Chinese entity relation extraction method of utilization dependency analysis | |
CN103049435B (en) | Text fine granularity sentiment analysis method and device | |
CN103646088B (en) | Product comment fine-grained emotional element extraction method based on CRFs and SVM | |
CN106649597B (en) | Method for auto constructing is indexed after a kind of books book based on book content | |
CN110399457A (en) | An intelligent question answering method and system | |
CN103226580B (en) | A kind of topic detection method of interaction text | |
CN107247780A (en) | A kind of patent document method for measuring similarity of knowledge based body | |
US20150227505A1 (en) | Word meaning relationship extraction device | |
US20160132572A1 (en) | Collecting, organizing, and searching knowledge about a dataset | |
CN111488466B (en) | Chinese language marking error corpus generating method, computing device and storage medium | |
CN102663139A (en) | Method and system for constructing emotional dictionary | |
Zu et al. | Resume information extraction with a novel text block segmentation algorithm | |
CN103646017B (en) | Acronym generating system for naming and working method thereof | |
CN111444713B (en) | Method and device for extracting entity relationship in news event | |
CN108920455A (en) | A kind of Chinese automatically generates the automatic evaluation method of text | |
CN107818081A (en) | Sentence similarity appraisal procedure based on deep semantic model and semantic character labeling | |
Laddha et al. | Extracting aspect specific opinion expressions | |
JP5426292B2 (en) | Opinion classification device and program | |
Wang et al. | Microblog summarization using paragraph vector and semantic structure | |
CN114580557A (en) | Method and device for determining the similarity of documents based on semantic analysis | |
CN103177126B (en) | For pornographic user query identification method and the equipment of search engine | |
CN105677684A (en) | Method for making semantic annotations on content generated by users based on external data sources | |
CN117609477A (en) | Large model question-answering method and device based on domain knowledge | |
Ma et al. | Combining n-gram and dependency word pair for multi-document summarization |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |