CN101075942A - Method and system for processing social network expert information based on expert value progation algorithm - Google Patents
Method and system for processing social network expert information based on expert value progation algorithm Download PDFInfo
- Publication number
- CN101075942A CN101075942A CNA2007101177193A CN200710117719A CN101075942A CN 101075942 A CN101075942 A CN 101075942A CN A2007101177193 A CNA2007101177193 A CN A2007101177193A CN 200710117719 A CN200710117719 A CN 200710117719A CN 101075942 A CN101075942 A CN 101075942A
- Authority
- CN
- China
- Prior art keywords
- expert
- paper
- value
- network
- relation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 48
- 238000004422 calculation algorithm Methods 0.000 title claims description 21
- 238000012545 processing Methods 0.000 title claims description 7
- 238000013507 mapping Methods 0.000 claims abstract description 12
- 238000004364 calculation method Methods 0.000 claims abstract description 8
- 239000011159 matrix material Substances 0.000 claims abstract description 8
- 238000011160 research Methods 0.000 claims description 14
- 230000008569 process Effects 0.000 claims description 4
- 238000009499 grossing Methods 0.000 claims description 3
- 239000000203 mixture Substances 0.000 claims description 3
- 230000001902 propagating effect Effects 0.000 claims description 3
- 230000002457 bidirectional effect Effects 0.000 claims 2
- 238000003672 processing method Methods 0.000 claims 1
- 238000010606 normalization Methods 0.000 abstract description 6
- 238000013459 approach Methods 0.000 description 19
- 238000007418 data mining Methods 0.000 description 18
- 230000000052 comparative effect Effects 0.000 description 17
- 238000012360 testing method Methods 0.000 description 15
- 238000000605 extraction Methods 0.000 description 9
- 239000003795 chemical substances by application Substances 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 6
- 238000010801 machine learning Methods 0.000 description 6
- 238000003058 natural language processing Methods 0.000 description 6
- 238000012706 support-vector machine Methods 0.000 description 6
- 238000009412 basement excavation Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 238000011156 evaluation Methods 0.000 description 5
- 238000013528 artificial neural network Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 230000001537 neural effect Effects 0.000 description 4
- 230000000644 propagated effect Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000013473 artificial intelligence Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000008676 import Effects 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 1
- 230000006854 communication Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 235000019640 taste Nutrition 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The method comprises: establishing a society relation network graph by using society relation network generation server; said graph is described by using personal description information, human relation, relation type and the mapping function between human relation and relation type; the expert value calculation server reads the information from the database server, and according to the weighting, analyzes the correlation between the key word, the person description information and the thesis information to get a initiative expert value; then according to the importance and osculation of human relation, constructs a propagator matrix; according to the initiative expert value and the propagator matrix, makes iteration to get the expert value of all experts; after normalization, arranging it in descending order; outputs it to the Web server to provide the option for user.
Description
Technical field
The invention belongs to the community network field of information processing, relate in particular to the community network search under the Internet.
Background technology
Along with rapid development of Internet and the Web1.0 transformation to Web2.0, community network becomes an extremely important Web gradually and uses.For example, we can utilize the community network job search, seek the employee, seek the friend of common interest, and seek the affiliate.
For we provide the lot of data source, help to carry out on this basis some data minings, Knowledge Discovery based on the community network of Web, these researchs have great challenge in the Web2.0 epoch.For example, the excavation of trust chain, personalized recommendation and expert's discovery etc.
The target that expert info is handled is the expert who finds to have a certain domain-specific knowledge automatically.By finding the expert, we can obtain the answer of some professional problems apace by the expert, have remedied " The High Cost ofNot Finding Information " problem that traditional search engines exists, and have improved effectiveness of retrieval greatly.This technology will be brought huge user and frequency of utilization to the Internet, is the important means that embodies the Internet new technology and economic worth.
The expert's discover method that has proposed mainly is conceived to carry out expert info and handles from the data of Web or non-structureization.Usual way is that the expert is found to regard as traditional information retrieval: some individuals that at first everyone are associated describe documents (as personal homepage, Email, publish thesis) and comprehensively generate a document, and everyone just can represent with this document like this; Use the method for conventional information retrieval then, these documents are sorted, promptly can obtain corresponding expert and tabulate according to the degree of correlation of document and searching keyword.Yet investigation finds, the work of finding the expert based on community network also seldom.In community network, except the individual descriptor that everyone depended on, the interpersonal relation that also has complexity, these relation informations are for recommending the expert that important effect is arranged.Traditional method has often been ignored the importance of community network relation, and we think and utilize interpersonal relation in community network to find that the expert has crucial meaning.
At the problems referred to above, the present invention proposes expert info treatment system and method in the community network that is worth propagation algorithm based on the expert.This method comprises two stages.In the phase I, mainly utilize individual descriptor to carry out candidate expert's discovery, and give initial expert's value for each candidate expert.In second stage, (each node is represented a people among the figure to utilize initial expert's value of the candidate expert that the phase I obtains and graph structure of the structure of the incidence relation between them, every interpersonal a kind of relation is represented on the limit), propagate expert's value along the direction on limit in the drawings then, promptly the incidence relation of representing according to the expert's value and the limit of node is revised expert's value of adjacent node, make each node obtain the new expert value relevant, thereby realize more high-precision expert info processing based on community network with community network.
Summary of the invention
The object of the present invention is to provide a kind of community network expert info treatment system and method that in community network, is worth propagation algorithm based on the expert.
The thinking of system and method proposed by the invention is: adopt a kind of general community network descriptive model, based on the target of this model definition expert info processing.Under the guidance of this target, at first utilize individual descriptor to retrieve the relevant expert in certain field (as: data mining field), as candidate expert, and calculate initial expert's value for each candidate expert.This step is based on a hypothesis, very many (for example: a people has delivered a lot of papers in the data mining field if people is about the descriptive information in certain field, repeatedly mention data mining in a people's the personal homepage), so very possible this people is exactly the expert in this field; Utilize candidate expert's initial expert's value and the incidence relation between them to make up community network subgraphs then, and everyone expert's value is propagated neighbours to him along the direction on limit based on this figure for these candidate's brainstrusts.This communication process iteration is always gone down up to algorithmic statement, finally obtains expert's tabulation that all expert's value all no longer changes, and feeds back to the user after being worth ordering according to the expert.This step is also based on a hypothesis, if people is familiar with the expert in many certain fields, and recommended by these experts, and he probably also is the expert in this field so.
Our idea comes from the observation of actual life being sought the expert.In reality, we seek the introduction that the expert a) reads a people by following two kinds of methods usually and judge whether he reaches the standard of expert's qualification; B) the seek help expert of our understanding helps and recommends other expert.Yet,, be difficult to directly to introduce the expert level of judging a people and determine the degree (for example: the strong recommendation, the general recommendation) that other people recommend by the individual for machine.Therefore, we are with a people's the introduction individual descriptor specific to the expert level that can embody him, comprise individual essential information (as: position, unit, research interest, homepage address, phone, E-mail address) and paper information that he delivers (as: title, deliver meeting title, partner); Simultaneously, the recommendation degree can be embodied as and concern weight (calculating of weight will specifically be set forth in step 3) between nominator's expert level and nominator and the presentee, so-called here " nominator " refer to and the presentee between have the candidate expert of certain relation.In a word, we judge a people's expert level, mainly factor aspect the individual descriptor by taking all factors into consideration him and he and other people these two of the incidence relations in community network.
Described method is based on community network that has existed specific implementation according to the following steps successively, and the step block diagram that is adopted is asked for an interview Fig. 1, and this method comprises the steps.
Step 1: make up community network.
In the present invention, we are described as a figure (Graph) to community network.
If: community network is G=(V, E, T, τ), wherein V is a set of node, and each node v ∈ V represents a people in the community network, and a people can have multiple descriptor, as individual essential information (comprising position, unit, research interest, homepage address, phone, E-mail address) and paper information that he delivers (as: title, deliver meeting title, partner); E V * V is the set on limit, e
t Ij∈ E represents the people v in the community network
iAnd v
jBetween a kind of t that concerns of existing; T is relationship type set, t ∈ T represent interpersonal a kind of social relationships type (as " with the article author ", " and quilt ... instruct ", " in same project, working ", " being friend "); τ: E → T is a mapping function from the limit to the relationship type, τ (e
t Ij)=t.Limit among the figure can be unidirectional, also can be two-way.A kind of relation of symmetry has been represented on two-way limit.
After community network of formalization, the task that expert info is handled just can be defined as: a given field keyword q
In the present invention, we have used the keyword of 13 computer realms to do test, these 13 fields comprise Ontology Mapping, Semantic Web, data mining, information extraction, Boosting study, SVMs, Planning study, intelligent agent, machine learning, natural language processing, encrypt and learn, computer vision, neural net, corresponding keyword is OntologyAlignment, Semantic Web, Data Mining, Information Extraction, Boosting, Support VectorMachine, Planning, Intelligent Agent, Machine Learning, Natural Language Processing, Cryptography, Computer Vision, Neural Networks.), from entire society's network G, search expert collection, and with them with a tabulation R={ (v that ordering is good
i, s (v
i)) return s (v wherein
i) represented a people v
IExpert value.
Illustrate a community network below, one really based on breadboard community network as shown in Figure 2.All relations and the weight that exist among Fig. 2 are as shown in table 1:
Table 1, all relations and the weight handled among Fig. 2
Relation | Expressed meaning | Weight |
With the article author | The author of two artificial same piece of | 2 |
Quilt ... instruct | A people is another person's | 4 |
In same project, work | Two people work in | 3 |
Be friend | Two people are | 1 |
In this drawing, we can find for " doctor Tang ":
1) he has a sensing " Wang professor's " unidirectional quilt ... the relation that instructs;
2) he has four two-way relations, as and " Xiao Hong " between exist with the article author relationships.
May there be multiple relation between two people, as existing between " doctor Tang " and " Wang professor " with article author and quilt ... the two kinds of relations that instruct.
1) relation and weight table, all relations and weight in this table storage social relationships net, as shown in table 2:
Table 2, the relationship type among the community network figure and concern weight table
Relation identity | Concern | Explanation | Weight | |
1 | With the article author | The author of two artificial same piece of | 2 | |
2 | Quilt ... instruct | A people is another person's | 4 | |
3 | In same project, work | Two people work in | 3 | |
4 | Be friend | Two people are | 1 |
2) individual Basic Information Table, the proprietary essential information in this table storage community network, in the present invention, we have stored " position ", " unit " " research interest ", and are as shown in table 3:
Table 3, the individual Basic Information Table among the community network figure
People's sign | Name | Position | Unit | Research |
1 | Doctor Tang | Ph.D (doctor) | Keg, tsinghua (Tsing-Hua University's knowledge engineering group) | Semantic Web (semantic network) |
2 | Xiao Hong | Master (master) | Keg, tsinghua (Tsing-Hua University's knowledge engineering group) | Information Extraction (information extraction) |
3 | Wang professor | Ph.Dmentor (doctoral advisor) | Keg, ts inghua (Tsing-Hua University's knowledge engineering group) | Semantic Web (semantic network) |
4 | The Cai professor | Master mentor (master supervisor; ) | Keg, tsinghua (Tsing-Hua University's knowledge engineering group) | Data Mining (data mining) |
… | … | … | … | … |
3) paper information table, the paper information that everyone delivers in this table storage community network, in the present invention, we have stored " paper title " and " delivering meeting ".As shown in table 4:
Table 4, the paper information table among the community network figure
The paper sign | The paper title | Deliver meeting |
1 | A Unified Tagging Approach to Text Normalization (the normalized unified labeling method of text) | ACL ' 2007 (computational linguistics annual meeting) |
2 | Semantic annotation using horizontal and vertical contexts (utilizing level to carry out semantic tagger) with vertical context | ASWC ' 2006 (Asia semantic network meeting) |
3 | Multiple strategies detection in ontology mapping (many strategies of Ontology Mapping are found) | WWW ' 2005 (WWW meeting) |
… | … | … |
4) people and the paper table of comparisons, people and the contrast information that publishes thesis in this table storage community network, as shown in table 5:
Table 5, the people among the community network figure and the paper table of comparisons
Sign | People's sign | The |
1 | 1 | 1 |
2 | 1 | 2 |
3 | 1 | 3 |
Sign | People's sign | The |
4 | 2 | 2 |
5 | 3 | 3 |
… | … | … |
Biao first line display wherein, people's identification nodes 1 (doctor Tang) is the author of paper identification nodes 1 (A Unified TaggingApproach to Text Normalization).
5) node relationships table, the relation between the people in the sign community network, as shown in table 6:
Table 6, community network node relationships table
Relationship type | Source | Destination node | |
2 | 2 | 4 | |
3 | 4 | 3 | |
1 | 2 | 1 | |
2 | 1 | 3 | |
… | … | … |
Wherein the row of first in the table has between identification nodes 2 (Xiao Hong) and the node 4 (professor Cai) to concern 2 (quilts ... instruct), second line display node 4 (professor Cai) and node 3 (Wang professor) have concern 3 (working) in same projects.
So far, a community network makes up and finishes.
Step 2: retrieve the candidate expert relevant, obtain a community network subgraph with a certain field.
In this step, the user (for example: data mining) imports any one field keyword q, from entire society's network, search the expert relevant as candidate expert, and calculate initial expert's value, constitute the community network subgraph in this field for each candidate expert with this field.In this step, we have only utilized individual descriptor described in the step 1 (comprising table 3, the content of table 4 and table 5) to go to retrieve candidate expert.
For everyone v
iHis individual essential information is connected into a big document d (in the present invention, we have used the connection of " position ", " unit " in the table 3, " research interest " three fields as d, at this time we no longer distinguish the field restriction of " position ", " unit ", " research interest ", handle but regard d as a common document); And use p
kRepresent that (in the present invention, we have only used " the paper title " of table 4, and the connection of " delivering meeting " two fields is as p for the information of his each piece paper
k, we no longer distinguish " paper title " equally, the field restriction of " delivering meeting ", but with p
kRegarding a common document as handles).
Probabilistic model in the user imports a field keyword q, and exploit information is retrieved is estimated the probability that field keyword q occurs in document d, represent this probability with p (q|d); Estimate that simultaneously field keyword q is at every piece of paper p
kThe probability that occurs is with p (q|p
k) represent this probability.
Because the field keyword q of user's input does might comprise a plurality of speech after the participle, for example: comprise " data " and " excavation " two speech behind " data mining " participle.At this time, the probability that " data " this speech that obtains behind the field that the is actually keyword q participle that we will estimate occurs in d, and the probability that in d, occurs of " excavation " this speech, and then these two probable values are combined with (formula 1 has been represented the method that we are used) someway, promptly obtain the p (q|d) that we finally will estimate, p
kIn like manner.We represent certain speech (for example: " data " in " data mining " or " excavation ") behind the q participle with t.We say to have only certain the speech t that occurs among the q also can appear at d and p simultaneously
kIn, p (q|d) and p (q|p
k) these two probable values just can be greater than 0, otherwise probable value is 0.
Suppose that each speech t is separate in document d, then can connect each t and appear at Probability p (t|d) among the d with connecting multiplication sign, so the Probability p (q|d) that field keyword q appears among the document d can be expressed as:
Wherein (t, q) expression t appears at the number of times of q (for example, q is " data mining ", and speech t is " data ", then the frequency n (t is 1 q)) that occurs in q of t n.In this formula, the Probability p (t|d) that speech t appears among the document d can be estimated except that the total speech number that comprises among the d with the number of times that t occurs in d.Consider that p (t|d) might be 0, can cause the result who connects in the formula (1) after taking advantage of like this is 0, so need do smoothing processing to p (t|d):
p(t|d)=(1-λ)p(t|d)+λp(t) 2)
Wherein, the λ value is [0,1], and the number of times that p (t) can word t occurs in proprietary document d removes the total speech number that comprises among the proprietary document d and estimates.The expansion of formula (2) has been arranged, and formula (1) just can be write as:
Equally, use another probabilistic model p (q|p
k) estimate that field keyword q is at v
iEvery piece of paper p
kThe middle probability that occurs.Model definition following (the wherein same p of the estimation of the definition of symbol and probability (q|d)):
Based on formula (3) and formula (4), we can calculate v
iEssential information and the degree of correlation of each piece paper and field keyword q, this two parts degree of correlation is carried out linear fit:
Wherein the α span is [0,1] (we establish a=0.5 in experiment); P represents v
iWhole papers of delivering; p
kOne piece of paper among the expression P; If (p
k) expression p
KThe factor of influence of the meeting of delivering or magazine, this factor of influence has reflected the authoritative degree of meeting or magazine, be generally and determine by hand (we in experiment from
Http:// citeseer.ist.psu.edu/impact.htmlThe meeting factor of influence that last this network of collection is put in order).Finally, s (v
i)
0Be everyone v
iInitial expert value.In experiment, we select initial expert's value to come the candidate expert of preceding 1000 people as the algorithm second stage.
Be that example is calculated " doctor Tang " initial expert's value about keyword " Semantic Web " with " doctor Tang " and " Semantic Web " below:
1) obtains t according to " Semantic Web "
1Be " Semantic ", t
2Be " Web ", suppose p (t
1)=0.01, p (t
2)=0.05.
2) according to table 3,4,5, the d of " doctor Tang " is expressed as " ph.D Keg tsinghua Semantic Web ", p
1Be " A Unified Tagging Approach to Text Normalization ACL ' 2007 ", p
2Be " Semanticannotation using horizontal and vertical contexts ASWC ' 2006 ", p
3Be " Multiplestrategies detection in ontology mapping WWW ' 2005 ".
3) p (t|d) estimates except that the total speech number that comprises among the d with the number of times that t occurs in d, so p (t
1| d)=and 1/5=0.2, p (t
2| d)=1/5=0.2; P (t
1| p
1)=0, p (t
2| p
1)=0; P (t
1| p
2)=1/7=0.143 (denominator is that 7 reason is to have filtered out high frequency words " and ", so only remaining 7 speech), p (t
2| p
2)=0; P (t
1| p
3)=0, p (t
2| p
3)=0.
4) carry out smoothly according to formula (2), establishing λ is 0.5, so obtain new p (t
1| d)=and 0.5*0.2+0.5*0.01=0.105, p (t
2| d)=0.5*0.2+0.5*0.05=0.125; P (t
1| p
1)=0.5*0+0.5*0.01=0.005, p (t
2| p
1)=0.5*0+0.5*0.05=0.025; P (t
1| p
2)=0.5*0.143+0.5*0.01=0.077, p (t
2| p
2)=0.5*0+0.5*0.05=0.025; P (t
1| p
3)=0.5*0+0.5*0.01=0.005, p (t
2| p
3)=0.5*0+0.5*0.05=0.025.
5) according to formula (3) and formula (4) (n (t wherein
1, q)=1, n (t
2, q)=1), p (q|d)=p (t then
1| d)
1* p (t
2| d)
1=0.105*0.125=0.013, p (q|p
1)=p (t
1| p
1)
1* p (t
2| p
1)
1=0.005*0.025=0.000, p (q|p
2)=p (t
1| p
2)
1* p (t
2| p
2)
1=0.077*0.025=0.002, p (q|p
3)=p (t
1| p
3)
1* p (t
2| p
3)
1=0.005*0.025=0.000.
6) (suppose wherein if (p according to formula (5) at last
1)=3, if (p
1)=2, if (p
1)=1, α=0.5), so obtain s (v
i)
0=0.5*0.013+0.5* (3*0.000+2*0.002+1*0.000)=0.009 is initial expert's value of doctor Tang.
Step 3: utilize one based on the iterative algorithm of propagating thought, upgrade expert's value of candidate expert.
The candidate expert's of phase I initial expert's value and the relation between them have constituted a community network subgraph about field q (wherein each node is represented a candidate expert, and a kind of relation between two experts is represented on every limit).On the basis of this community network subgraph, we have proposed to propagate the iterative algorithm of expert's value.In iteration each time, v
iThe expert be worth s (v
i)
nPropagate to be given and the candidate expert of relevant relation between him.Therefore, the renewal of expert's value of each candidate expert is influenced by two factors: 1) nominator's expert is worth 2) weight that concerns between nominator and the presentee
We indicate expert's value of a node to propagate into the degree of his neighbors with propagation coefficient.M is used for representing the propagation coefficient matrix; M
IjExpression is from v
iTo v
jPropagation coefficient (value is between 0 to 1).M
IjBe defined as follows:
Wherein, c (τ (e
t Ij)) a kind of e that concerns of representative
t IjWeight (at present we manual these weights are provided with).W (e
t Ij) represent v
iAnd v
jBetween concern e
t IjLevel of intimate.(level of intimate can be calculated by multiple distinct methods, and as the relation for " with the article author ", we are used as level of intimate to the quantity of cooperation paper).
In the present invention, because we only collect a kind of relation data, " with the article author ", so formula (6) just is simplified to M
Ij=w (e
Ij With the article author), w (e
Ij With the article author) be defined as follows:
Wherein | U
i| expression v
iAll " with article authors " concern number (we regard a kind of two-way relation as with " with the article author ").
In our algorithm,, only need expert's value is passed to destination node from source node for unidirectional relation; For two-way relation, the expert is worth needs and propagates mutually.
The expert is worth vectorial S during the n+1 time iteration
N+1(S
N+1Represent that all experts' expert is worth the vector of composition, each dimension S wherein
i nRepresent v
iExpert value) computing formula is as follows:
S
n+1=(1-ω)S
n+ωM
TS
n 8)
Wherein S represents that the expert is worth vector (being proprietary expert's value); The ω span is between 0 to 1, and it has represented the punishment to propagation distance, and promptly number of iterations is many more, and the propagation distance of relation is just far away more, its confidence level just low more (in experiment, we it be set to 0.85).After each iteration finished, proprietary expert's degree will be done normalization, made that maximum expert's value is always 1 among everyone, was calculated as follows:
This algorithm can iterate to a certain end condition always.The variation that the present end condition that is provided with is proprietary expert's value in the subgraph all is limited in the threshold range (the present invention is set to 0.05), and perhaps algorithm iteration stops (the present invention is set to 100) to a certain fixing number of times.
Proposed by the invention be worth the community network expert info treatment system of propagation algorithm and the outstanding contributions of method are based on the expert, we have not only utilized the degree of correlation of individual descriptor and field keyword to differentiate a people's expert level, also utilize in the community network interpersonal incidence relation to propagate expert's value simultaneously, thereby reached the effect that the expert recommends the expert.
Illustrate an iterative computation process of step 3 below, for example as shown in Figure 3.
Suppose that Fig. 3 left-half is expert's value of the n time each node,
" with the article author " relation is represented to exist between two nodes in limit among the figure.
Can get w (e by formula (7)
12)=w (e
13)=w (e
14)=1/3, w (e
21)=w (e
31)=w (e
41)=1 is then got by formula (8):
Wherein
Can get according to formula (9) at last:
Step 4: the expert that will propagate after finishing sorts according to expert's value, exports to the user.
Fig. 4 is the example of relevant " data mining " inquiry output.
The method summary that we are proposed is following handling process:
Input: field keyword q (for example: data mining) and community network G=who builds (V, E, T, λ)
(the community network G here is static structure, and promptly our step 1 needs to carry out once, and is irrelevant with inquiry)
Output: expert's tabulation of sorting by expert's value
// be that example is explained with the academic research network here
1. for everyone v
i, the utilization probabilistic model calculates his the individual essential information d and the degree of correlation of q;
2. use another probabilistic model to calculate v
iEvery piece of paper p
kThe degree of correlation with q;
3. 2 and 3 the degree of correlation is integrated consideration, the value that calculates is as v
iInitial expert value;
4. select initial expert to be worth higher some people, and utilize the relation between them to make up a community network subgraph as candidate;
5.do{
6. calculate the propagation coefficient matrix M according to formula (6);
7. for each v in the subgraph
i{
8. upgrade his expert's degree s (v according to formula (8)
i);
9. }
10. do normalization according to formula (9);
11. while (end condition satisfies);
The expert that step 4. will be propagated after finishing sorts according to expert's value, exports to the user
The combine closely new feature of current Web data of the present invention, promptly Web goes up this feature of emerge in multitude community network focusing on people.Utilize person to person's complicated social relation in the community network, strengthen Web and go up the expert info treatment effect.Novelty of the present invention is embodied in, and we propose to find the expert in community network.Traditional expert's discovery only is to utilize the similarity of document and searching keyword to judge expert's degree of a people, and the method that we propose is not only considered the document similarity, but also considers that interpersonal relation is strengthened the effect that expert info is handled in the community network.Its creativeness shows propose how to utilize the method that interpersonal relation is carried out the expert info processing in the community network.We turn to a graph structure with the community network form, propose the propagation renewal that a kind of propagation algorithm carries out expert's value based on this figure then.The function of utilizing expert info to handle, we can obtain the answer of some professional problems apace by the expert, thereby have improved effectiveness of retrieval greatly.Simultaneously, we can also handle by expert info and go to make friends with the friend who has similar tastes and interests, and seek affiliate, employee, consultant.This technology will be brought huge user and frequency of utilization to the Internet, is the important means that embodies the Internet new technology and economic worth.
Description of drawings
Fig. 1. the entire block diagram of system;
Fig. 2. the example of individual descriptor and human relationships in community network;
Individual's descriptor is as follows:
Individual's essential information
Position: master unit: department of computer science, Tsinghua university knowledge engineering group
Homepage address:
Http:// hmc.arnetmianer.orgPhone: 62788788
Email:
Hmc@keg.cs.tsinghua.edu.cnResearch interest: information labeling
Paper information
Title: Semantic Annotation Using Horizontal and Vertical Contexts
Deliver meeting: ASWC2006 partner: Tang Jie, Li Juanzi
Fig. 3. the expert is worth the example that iteration is propagated;
Fig. 4. based on the interface of the expert info treatment system of propagation algorithm;
Fig. 5. the hardware structure diagram of expert retrieval system.
Embodiment
Utilize step 1-4 of the present invention, created a researcher's community network, and find that at this community network the expert in certain field verifies the invention of this paper.All experiments of the present invention realize with the Java programming, and move on the server that is configured to double-core Intel Xeon3.0GHz processor, 2GB internal memory.
(1) modeling of researcher's community network
The structure of our researcher's community network mainly comprises two parts information at present: each researcher has the individual descriptor (comprising position, unit, research interest, paper title and the corresponding meeting of delivering) of oneself; The relation that has " with the article author " between researcher and the researcher is promptly worked as two people and was once delivered article together, then can set up the relation of " with the article author " between them.
(2) generation of researcher's community network
By (data analysis of http://www.informatik.uni-trier.de/ ~ ley/db/) at first obtains the paper tabulation, and paper information is added in the paper information table 4 to specific scientific paper website; Then obtain the author of each piece paper, author for each piece paper does following steps: if there is not this person in the individual Basic Information Table 3, then adopt the method for information extraction that the Web data are excavated, obtain his corresponding essential information, and add in the table 3; Corresponding relation with this author and corresponding paper adds in the people and the paper table of comparisons 5 simultaneously; Then, for per two authors of this paper, in node relationships table 6, insert " with the article author " relation.
Through this construction method, collected the researcher of 448,289 computer realms, 725,655 pieces publish thesis, and the interpersonal number that concerns amounts to 2,413, and 208, on average everyone and other people have 5.38 relations.
(3) foundation of test set
For algorithm is tested, we have collected the test set in 13 fields from Web, and each test set comprises expert's tabulation in a certain field respectively.Table 8 has been listed the statistical information and the source of these 13 test sets.Wherein, " Ontology Mapping " tabulates with the committee member that " Semantic Web " comes from relevant meeting; " data mining " is from the personnel of the relevant data excavation of kmining.com arrangement; The researcher who extracts for information about that " information extraction " collected from Dr.Ion Muslea; " encrypt and learn " researcher of the relevant encryption of collecting from Kevin McCurley; " computer vision " is from the researcher of the relevant calculation machine vision of Dr.Margaret Fleck collection; " neural net " is from the relevant study of neural networks person of Open Directory; " Boosting study " and " SVMs " are respectively from their official website." Planning study ", " intelligent agent ", " machine learning " and " natural language processing " all come from the website of an artificial intelligence.
Table 8, expert's test set in 13 fields
The field | The field keyword | Expert's number | The source |
Ontology Mapping | Ontology Alignment | 57 | EON2003﹠2004; OAEI2005﹠2006, OM workshop2006 etc. are about the meeting committee member of Ontology Mapping conference |
Semantic Web | Semantic Web | 412 | The meeting committee member of calendar year 2001 to 2006 year international semantic network conference |
Data mining | Data Mining | 351 | http://www.kmining.com/info_people.html |
Information extraction | Information Extraction | 91 | http://www.isi.edu/info-agents/RISE/people.html |
Boosting study | Boosting | 57 | http://www boosting.org/people.html |
SVMs | Support Vector Machine | 111 | http://www.svms.org/people-frames.html |
Planning study | Planning | 26 | http://aima.cs.berkeley.edu/ai.html#learning |
Intelligent agent | Intelligent Agent | 35 | http://aima.cs.berkeley.edu/ai.html#learning |
Machine learning | Machine Learning | 76 | http://aima.cs.berkeley.edu/ai.html#learning |
Natural language processing | Natural Language Processing | 54 | http://aima.cs.berkeley.edu/ai.html#learning |
Encrypt and learn | Cryptography | 174 | http://www.swcp.com/~mccurley/cryptographers/cryptographers.html |
Computer vision | Computer Vision | 215 | http://www.cs.hmc.edu/~fleck/computer-vision-handbook/vision-people.html |
Neural net | Neural Networks | 122 | http://dmoz.org/Computers/Artificial_Intelligence/Neural_Networks/People/ |
(4) test philosophy
We adopt P@5, P@10, and P@20, P@30, R-pre, MAP, bref is as the evaluation and test criterion.The definition accuracy rate is the shared ratio of the correct result in the expert info result (correct result refers to the expert who occurs in test set).P@5 evaluates and tests the accuracy rate of preceding 5 return results, P@10, and P@20, P@30 are in like manner; The accuracy rate of R return results before the R-pre evaluation and test, wherein R represents the expert's sum in the test set; The mean value of the accuracy rate when MAP evaluates and tests each correct result and occurs; Bref mainly is conceived to the average proportions that the wrong expert who finds comes correct expert front, and formula definition is as follows:
Wherein, R represents the expert's sum in the test set, and r is a correct expert in the test set, and n is one of preceding R wrong expert who finds in the lookup result.
(5) experimental result
Method described in the present invention and another kind of method compare, and this comparative approach is only considered to use individual descriptor, and does not consider relation information, and is closely similar with the step 2 of our method.Table 9 has been listed these two kinds of methods and respectively the evaluation result that expert info is handled has been carried out in 13 fields.
Table 9 is searched evaluation result (%) with our method and two kinds of method experts in 13 fields of comparative approach
The field | Method | P@5 | P@10 | P@20 | P@30 | R-prec | MAP | bpref |
Ontology Mapping | Comparative approach | 20.00 | 20.00 | 15.00 | 23.33 | 15.39 | 3.32 | 9.51 |
Our method | 40.00 | 40.00 | 35.00 | 23.33 | 16.00 | 5.60 | 11.73 | |
Semantic Web | Comparative approach | 80.00 | 70.00 | 70.00 | 60.00 | 71.43 | 9.55 | 15.75 |
Our method | 80.00 | 90.00 | 70.00 | 66.67 | 90.91 | 13.77 | 20.83 | |
Data mining | Comparative approach | 80.00 | 70.00 | 60.00 | 56.67 | 7142 | 9.88 | 15.27 |
Our method | 10000 | 80.00 | 75.00 | 66.67 | 66.67 | 12.63 | 19.16 | |
Information extraction | Comparative approach | 8000 | 70.00 | 65.00 | 53.33 | 71.43 | 19.52 | 22.89 |
Our method | 100.00 | 60.00 | 60.00 | 53.33 | 58.82 | 19.19 | 23.15 | |
Boosting study | Comparative approach | 60.00 | 60.00 | 4500 | 33.33 | 37.04 | 19.57 | 22.01 |
Our method | 80.00 | 50.00 | 40.00 | 33.33 | 35.71 | 17.39 | 18.78 | |
Planning study | Comparative approach | 60.00 | 40.00 | 30.00 | 23.33 | 23.26 | 23.12 | 20.56 |
Our method | 40.00 | 30.00 | 30.00 | 26.67 | 19.23 | 14.97 | 20.71 | |
Intelligent agent | Comparative approach | 20.00 | 40.00 | 25.00 | 23.33 | 6.33 | 8.86 | 15.59 |
Our method | 80.00 | 50.00 | 30.00 | 26.67 | 19.23 | 14.30 | 18.69 | |
SVMs | Comparative approach | 40.00 | 20.00 | 15.00 | 13.33 | 3.72 | 2.70 | 6.09 |
Our method | 20.00 | 20.00 | 10.00 | 13.33 | 13.70 | 1.82 | 6.47 | |
Machine learning | Comparative approach | 40.00 | 20.00 | 15.00 | 16.67 | 14.71 | 4.78 | 8.59 |
Our method | 40.00 | 40.00 | 25.00 | 16.67 | 14.93 | 6.54 | 9.92 | |
Natural language processing | Comparative approach | 20.00 | 10.00 | 15.00 | 26.67 | 17.24 | 7.50 | 11.69 |
Our method | 40.00 | 20.00 | 25.00 | 26.67 | 29.41 | 9.99 | 18.24 | |
Encrypt and learn | Comparative approach | 80.00 | 70.00 | 60.00 | 50.00 | 76.92 | 12.15 | 15.04 |
Our method | 100.00 | 70.00 | 55.00 | 56.67 | 58.82 | 17.71 | 23.21 | |
Computer vision | Comparative approach | 20.00 | 10.00 | 35.00 | 36.67 | 37.04 | 4.92 | 11.13 |
Our method | 60.00 | 50.00 | 55.00 | 43.33 | 62.50 | 7.93 | 13.77 | |
Neural net | Comparative approach | 0.00 | 0.00 | 15.00 | 10.00 | 3.85 | 0.65 | 3.20 |
Our method | 20.00 | 30.00 | 15.00 | 16.67 | 5.75 | 1.50 | 4.75 | |
Mean value | Comparative approach | 46.15 | 38.00 | 35.80 | 32.82 | 34.60 | 9.73 | 13.64 |
Our method | 61.54 | 48.00 | 40.40 | 36.15 | 37.82 | 11.03 | 16.11 |
From experimental result as can be seen, the evaluation result of our method on most of field all is better than comparative approach.Experiment shows, the community network expert info treatment system and the method that are worth propagation algorithm based on the expert that we propose are effective.
This shows that the present invention has reached intended purposes.
Claims (2)
1, the community network expert info treatment system that is worth propagation algorithm based on the expert, it is characterized in that this system by the social relation network of series connection successively generate server, database server, the expert is worth calculation server and Web server constitutes, wherein:
Social relation network generates server, makes up a social relation network figure G successively according to the following steps:
Step (1), community network be G=(V, E, T, τ),
Wherein, V is a set of node, and v ∈ V, each node v represent a people in the community network, and he has following individual descriptor:
Individual's essential information wherein has: position, unit, research interest and homepage address at least;
The paper information that the individual delivers wherein contains: the paper title, deliver meeting title and cooperation author at least; Relation between individual and the paper is described with its contribution of delivering corresponding expression with a people, comprising: described people's the sign and the sign of paper;
E V * V is the set on limit among the social relation network figure G, e
t Ij∈ E represents the people v among the G
iAnd v
jBetween a kind of relation of existing, represent with t;
T is the set of various relationship types among the described set E, t ∈ T represents interpersonal a kind of social relationships type, describe with a social relationships type list, wherein contain at least: relation, be no less than paper author cooperative relationship, the directive relationship of paper or project, four kinds of the cooperative relationship of research project and friends; The weight of various relations and the sign of relation;
τ: E → T is an interpersonal mapping function that is related to described relationship type, with τ (e
t Ij)=t represents, when described interpersonal relationships is bidirectional relationship, represents a kind of symmetric relation;
In described social relation network figure G, all interpersonal relationships constitutes a social relation network node table, comprising: type of interpersonal relations τ, source node v
iWith destination node v
j
Step (2), the social relation network figure G that step (1) is obtained is input to database server;
Step (3), the expert is worth calculation server and retrieves all candidate experts in entire society's relational network, and the relevant expert's value in each candidate expert's calculating field of giving:
Step (3.1) is for everyone v
i, his individual basic descriptor is connected into a big document d, and uses p
kRepresent this v
iThe information of each piece paper;
Step (3.2), a given field keyword q, the probabilistic model in the exploit information retrieval is estimated the degree of correlation p (q|d) of this field keyword q and individual essential information e and the degree of correlation (q|p of the information of field keyword q and each piece paper
k):
Wherein: t represents after field keyword q is divided into participle, the wherein symbolic representation of each speech;
N (t, q) expression participle t appears at the number of times among the q;
λ is the smoothing processing coefficient, and value is [0,1];
The number of times that p (t) occurs in proprietary big document d for participle t removes the total speech number that comprises among the proprietary big document d and estimates;
P (t|d) is that participle t is at v
iBig document d in the number of times that occurs remove v
iBig document d in the total number of word that comprises estimate;
Wherein: p (q|p
k) be that participle t is at v
iThe information p of every piece of paper
kThe middle number of times that occurs is divided by having filtered the paper information p after the high frequency words
kIn the total number of word that comprises estimate;
Step (3.2), the p (q|d) and the p (q|p that obtain according to step (3.1)
k) calculate everyone v
jIndividual essential information and the information of every piece of paper separately with the degree of correlation of field keyword q, again these two degrees of correlation are carried out linear fit, obtain everyone represented v of following formula
iExpert's initial value s (v
i)
0:
Wherein, the α span is [0,1],
P represents this v
jWhole papers of delivering, p
kOne piece of paper among the expression P;
If (p
k) expression paper p
kThe meeting of being delivered or the factor of influence of magazine are given value;
Step (3.3), the expert who selects initial expert's value to come preceding N position forms candidate expert group, and N is a set point;
Step (4), described expert is worth the calculation server utilization based on the iterative algorithm of propagating thought, makes up a community network subgraph according to the following steps, and upgrades expert's value of obtaining candidate expert on the basis of this subgraph:
Step (4.1), the candidate expert group that obtains according to step (3.3) obtains a social relation network subnet about field keyword q, and wherein node is represented a candidate expert, concerns between two experts that every limit is represented to connect;
The social relation network subgraph that step (4.2) obtains for step (4.1) makes up a broadcasting system matrix M, element M wherein
IjExpression is from v
iTo v
jPropagation coefficient, M
IjSpan is [0,1], described M
IjAs follows:
Wherein, c (τ (e
t Ij) a kind of e that concerns of representative
t IjWeight, be set point,
W (e
t Ij) represent v
iAnd v
jBetween concern e
t IjLevel of intimate, according to concerning among the described social relation network figure that the big person of weight chooses the corresponding title that concerns, calculate the corresponding weight w (e that concerns in view of the above
Ij t):
Wherein, | U
i| expression v
iThe number of all this relations;
Step (4.3), the expert when computational algorithm iterates to n+1 is worth vectorial S
N+1(S
N+1Represent that all experts' expert is worth the vector of composition, each dimension S wherein
i nRepresent v
iExpert value):
S
n+1=(1-ω)S
n+ωM
TS
n
Wherein, ω represents the penalty coefficient of propagation distance, and span is [0,1], and the propagation coefficient matrix M is changeless falling for process;
Step (4.4) is normalized to the expert's value of all experts in the step (4.3):
Expert's value is 1 to the maximum;
Step (4.5) is exported to the user to the recommendation expert collection that step (4.4) obtains by the Web network.
2. the community network expert info processing method that is worth propagation algorithm based on the expert, be characterised in that, this method is to generate server by social relation network successively at one, database server, the expert is worth expert that the series connection of calculation server and Web server constitutes and collects and progressively realize according to the following steps in the searching system:
Step (1), community network be G=(V, E, T, τ),
Wherein, V is a set of node, and v ∈ V, each node v represent a people in the community network, and he has following individual descriptor:
Individual's essential information wherein has: position, unit, research interest and homepage address at least;
The paper information that the individual delivers wherein contains: the paper title, deliver meeting title and cooperation author at least; Relation between individual and the paper is described with its contribution of delivering corresponding expression with a people, comprising: described people's the sign and the sign of paper;
E V * V is the set on limit among the social relation network figure G, e
t Ij∈ E represents the people v among the G
iAnd v
jBetween a kind of relation of existing, represent with t;
T is the set of various relationship types among the described set E, t ∈ T represents interpersonal a kind of social relationships type, describe with a social relationships type list, wherein contain at least: relation, be no less than paper author cooperative relationship, the directive relationship of paper or project, four kinds of the cooperative relationship of research project and friends; The weight of various relations and the sign of relation;
τ: E → T is an interpersonal mapping function that is related to described relationship type, with τ (e
t Ij)=t represents, when described interpersonal relationships is bidirectional relationship, represents a kind of symmetric relation;
In described social relation network figure G, all interpersonal relationships constitutes a social relation network node table, comprising: type of interpersonal relations τ, source node v
iWith destination node v
j
Step (2), the social relation network figure G that step (1) is obtained is input to database server;
Step (3), the expert is worth calculation server and retrieves all candidate experts in entire society's relational network, and the relevant expert's value in each candidate expert's calculating field of giving:
Step (3.1) is for everyone v
i, his individual basic descriptor is connected into a big document d, and uses p
kRepresent this v
iThe information of each piece paper;
Step (3.2), a given field keyword q, the probabilistic model in the exploit information retrieval is estimated the degree of correlation p (q|d) of this field keyword q and individual essential information e and the degree of correlation (q|p of the information of field keyword q and each piece paper
k):
Wherein: t represents after field keyword q is divided into participle, the wherein symbolic representation of each speech;
N (t, q) expression participle t appears at the number of times among the q;
λ is the smoothing processing coefficient, and value is [0,1];
The number of times that p (t) occurs in proprietary big document d for participle t removes the total speech number that comprises among the proprietary big document d and estimates;
P (t|d) is that participle t is at v
iBig document d in the number of times that occurs remove v
iBig document d in the total number of word that comprises estimate;
Wherein: p (q|p
k) be that participle t is at v
iThe information p of every piece of paper
kThe middle number of times that occurs is divided by having filtered the paper information p after the high frequency words
kIn the total number of word that comprises estimate;
Step (3.2), the p (q|d) and the p (q|p that obtain according to step (3.1)
k) calculate everyone v
iIndividual essential information and the information of every piece of paper separately with the degree of correlation of field keyword q, again these two degrees of correlation are carried out linear fit, obtain everyone represented v of following formula
iExpert's initial value s (v
i)
0:
Wherein, a span is [0,1],
P represents this v
jWhole papers of delivering, p
kOne piece of paper among the expression P;
If (p
k) expression paper p
kThe meeting of being delivered or the factor of influence of magazine are given value;
Step (3.3), the expert who selects initial expert's value to come preceding N position forms candidate expert group, and N is a set point;
Step (4), described expert is worth the calculation server utilization based on the iterative algorithm of propagating thought, makes up a community network subgraph according to the following steps, and upgrades expert's value of obtaining candidate expert on the basis of this subgraph:
Step (4.1), the candidate expert group that obtains according to step (3.3) obtains a social relation network subnet about field keyword q, and wherein node is represented a candidate expert, concerns between two experts that every limit is represented to connect;
The social relation network subgraph that step (4.2) obtains for step (4.1) makes up a broadcasting system matrix M, element M wherein
IjExpression is from v
iTo v
jPropagation coefficient, M
IjSpan is [0,1], described M
IjAs follows:
Wherein, c (τ (e
t Ij) a kind of e that concerns of representative
t IjWeight, be set point,
W (e
t Ij) represent v
iAnd v
jBetween concern e
t IjLevel of intimate, according to concerning among the described social relation network figure that the big person of weight chooses the corresponding title that concerns, calculate the corresponding weight w (e that concerns in view of the above
Ij t):
Wherein, | U
i| expression v
iThe number of all this relations;
Step (4.3), the expert when computational algorithm iterates to n+1 is worth vectorial S
N+1(S
N+1Represent that all experts' expert is worth the vector of composition, each dimension S wherein
i nRepresent v
iExpert value):
S
n+1=(1-ω)S
n+ωM
TS
n
Wherein, ω represents the penalty coefficient of propagation distance, and span is [0,1], and the propagation coefficient matrix M is changeless falling for process;
Step (4.4) is normalized to the expert's value of all experts in the step (4.3):
Expert's value is 1 to the maximum;
Step (4.5) is exported to the user to the recommendation expert collection that step (4.4) obtains by the Web network.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200710117719A CN100583804C (en) | 2007-06-22 | 2007-06-22 | Method and system for processing social network expert information based on expert value propagation algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200710117719A CN100583804C (en) | 2007-06-22 | 2007-06-22 | Method and system for processing social network expert information based on expert value propagation algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101075942A true CN101075942A (en) | 2007-11-21 |
CN100583804C CN100583804C (en) | 2010-01-20 |
Family
ID=38976768
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200710117719A Expired - Fee Related CN100583804C (en) | 2007-06-22 | 2007-06-22 | Method and system for processing social network expert information based on expert value propagation algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN100583804C (en) |
Cited By (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101904151A (en) * | 2007-12-17 | 2010-12-01 | 皇家飞利浦电子股份有限公司 | Method of controlling communications between at least two users of a communication system |
CN102043817A (en) * | 2009-10-12 | 2011-05-04 | 腾讯科技(深圳)有限公司 | Method and device for displaying figure associated word |
CN101321183B (en) * | 2008-06-30 | 2011-07-06 | 刘鑫 | Dependable social relationship recommending system and its operation method |
CN101521655B (en) * | 2008-02-29 | 2011-11-16 | 华为技术有限公司 | Method for searching and releasing information, system and synonymy node clustering method and device therefor |
CN102262681A (en) * | 2011-08-19 | 2011-11-30 | 南京大学 | Method for identifying key blog sets in blog information spreading |
US8244711B2 (en) | 2009-09-28 | 2012-08-14 | Chin Lung Fong | System, method and apparatus for information retrieval and data representation |
CN102737059A (en) * | 2011-04-14 | 2012-10-17 | 北京百度网讯科技有限公司 | Method, apparatus and device for determining accuracy information of resource description information |
CN102880657A (en) * | 2012-08-31 | 2013-01-16 | 电子科技大学 | Expert recommending method based on searcher |
CN103064837A (en) * | 2011-10-19 | 2013-04-24 | 西安邮电学院 | Retrieval of leading figures in academic fields and visualized navigation system |
CN103136267A (en) * | 2011-12-01 | 2013-06-05 | 腾讯科技(深圳)有限公司 | Expert mining method and device based on community |
CN103309962A (en) * | 2013-05-31 | 2013-09-18 | 华东师范大学 | Microblog service expert positioning method based on content relevance and social contact influence |
CN103324641A (en) * | 2012-03-23 | 2013-09-25 | 日电(中国)有限公司 | Information record recommendation method and device |
CN103559262A (en) * | 2013-11-04 | 2014-02-05 | 北京邮电大学 | Community-based author and academic paper recommending system and recommending method |
CN103631859A (en) * | 2013-10-24 | 2014-03-12 | 杭州电子科技大学 | Intelligent review expert recommending method for science and technology projects |
CN105550307A (en) * | 2015-12-14 | 2016-05-04 | 北京锐安科技有限公司 | Method for generating net citizen identity relationship network graph |
CN105763588A (en) * | 2014-12-18 | 2016-07-13 | 阿里巴巴集团控股有限公司 | Relational network data maintenance method, off-line server and real-time server |
CN102737059B (en) * | 2011-04-14 | 2016-12-14 | 北京百度网讯科技有限公司 | For determining the method for the accuracy information of resource description information, device and equipment |
CN106874289A (en) * | 2015-12-11 | 2017-06-20 | 阿里巴巴集团控股有限公司 | A kind of associated nodes determine method and apparatus |
CN107527295A (en) * | 2017-08-24 | 2017-12-29 | 中南大学 | Dynamics community of Academic Teams based on tense coauthorship network finds method and its method for evaluating quality |
CN107729569A (en) * | 2017-11-17 | 2018-02-23 | 杭州师范大学 | A kind of social networks Forecasting Methodology of UNE structure and text message |
CN107808223A (en) * | 2016-09-08 | 2018-03-16 | 杭州海康威视数字技术股份有限公司 | Data processing method and device |
CN110119424A (en) * | 2019-04-02 | 2019-08-13 | 天津大学 | Sparse trust method for digging based on round table influenza algorithm |
CN110377815A (en) * | 2019-06-06 | 2019-10-25 | 创易汇(北京)科技有限公司 | A kind of production, teaching & research recommender system and method |
CN110688405A (en) * | 2019-08-23 | 2020-01-14 | 上海科技发展有限公司 | Expert recommendation method, device, terminal and medium based on artificial intelligence |
CN111008285A (en) * | 2019-11-29 | 2020-04-14 | 中科院计算技术研究所大数据研究院 | Author disambiguation method based on thesis key attribute network |
CN112269909A (en) * | 2020-09-15 | 2021-01-26 | 淮阴工学院 | Expert recommendation method based on multi-source information fusion technology |
CN113434644A (en) * | 2021-05-25 | 2021-09-24 | 北京农业信息技术研究中心 | Agricultural technology knowledge service method and system |
-
2007
- 2007-06-22 CN CN200710117719A patent/CN100583804C/en not_active Expired - Fee Related
Cited By (42)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101904151A (en) * | 2007-12-17 | 2010-12-01 | 皇家飞利浦电子股份有限公司 | Method of controlling communications between at least two users of a communication system |
CN101521655B (en) * | 2008-02-29 | 2011-11-16 | 华为技术有限公司 | Method for searching and releasing information, system and synonymy node clustering method and device therefor |
CN101321183B (en) * | 2008-06-30 | 2011-07-06 | 刘鑫 | Dependable social relationship recommending system and its operation method |
US8244711B2 (en) | 2009-09-28 | 2012-08-14 | Chin Lung Fong | System, method and apparatus for information retrieval and data representation |
CN102043817A (en) * | 2009-10-12 | 2011-05-04 | 腾讯科技(深圳)有限公司 | Method and device for displaying figure associated word |
CN102043817B (en) * | 2009-10-12 | 2014-11-12 | 深圳市世纪光速信息技术有限公司 | Method and device for displaying figure associated word |
CN102737059A (en) * | 2011-04-14 | 2012-10-17 | 北京百度网讯科技有限公司 | Method, apparatus and device for determining accuracy information of resource description information |
CN102737059B (en) * | 2011-04-14 | 2016-12-14 | 北京百度网讯科技有限公司 | For determining the method for the accuracy information of resource description information, device and equipment |
CN102262681B (en) * | 2011-08-19 | 2015-12-02 | 南京大学 | A kind of blog information identifies the method for crucial blog collection in propagating |
CN102262681A (en) * | 2011-08-19 | 2011-11-30 | 南京大学 | Method for identifying key blog sets in blog information spreading |
CN103064837A (en) * | 2011-10-19 | 2013-04-24 | 西安邮电学院 | Retrieval of leading figures in academic fields and visualized navigation system |
CN103136267A (en) * | 2011-12-01 | 2013-06-05 | 腾讯科技(深圳)有限公司 | Expert mining method and device based on community |
CN103136267B (en) * | 2011-12-01 | 2017-12-08 | 腾讯科技(深圳)有限公司 | A kind of community-based expert's method for digging and device |
CN103324641B (en) * | 2012-03-23 | 2016-07-13 | 日电(中国)有限公司 | Information record recommendation method and device |
CN103324641A (en) * | 2012-03-23 | 2013-09-25 | 日电(中国)有限公司 | Information record recommendation method and device |
CN102880657B (en) * | 2012-08-31 | 2015-09-30 | 电子科技大学 | Based on the expert recommendation method of searchers |
CN102880657A (en) * | 2012-08-31 | 2013-01-16 | 电子科技大学 | Expert recommending method based on searcher |
CN103309962A (en) * | 2013-05-31 | 2013-09-18 | 华东师范大学 | Microblog service expert positioning method based on content relevance and social contact influence |
CN103631859A (en) * | 2013-10-24 | 2014-03-12 | 杭州电子科技大学 | Intelligent review expert recommending method for science and technology projects |
CN103631859B (en) * | 2013-10-24 | 2017-01-11 | 杭州电子科技大学 | Intelligent review expert recommending method for science and technology projects |
CN103559262B (en) * | 2013-11-04 | 2016-10-05 | 北京邮电大学 | Community-based author and scientific paper commending system thereof and recommend method |
CN103559262A (en) * | 2013-11-04 | 2014-02-05 | 北京邮电大学 | Community-based author and academic paper recommending system and recommending method |
CN105763588B (en) * | 2014-12-18 | 2020-02-04 | 阿里巴巴集团控股有限公司 | Relational network data maintenance method, offline server and real-time server |
CN105763588A (en) * | 2014-12-18 | 2016-07-13 | 阿里巴巴集团控股有限公司 | Relational network data maintenance method, off-line server and real-time server |
CN106874289A (en) * | 2015-12-11 | 2017-06-20 | 阿里巴巴集团控股有限公司 | A kind of associated nodes determine method and apparatus |
CN106874289B (en) * | 2015-12-11 | 2020-04-24 | 阿里巴巴集团控股有限公司 | Associated node determination method and equipment |
CN105550307B (en) * | 2015-12-14 | 2019-07-30 | 北京锐安科技有限公司 | A kind of generation method of netizen's personal status relationship network |
CN105550307A (en) * | 2015-12-14 | 2016-05-04 | 北京锐安科技有限公司 | Method for generating net citizen identity relationship network graph |
CN107808223A (en) * | 2016-09-08 | 2018-03-16 | 杭州海康威视数字技术股份有限公司 | Data processing method and device |
CN107808223B (en) * | 2016-09-08 | 2021-04-20 | 杭州海康威视数字技术股份有限公司 | Data processing method and device |
CN107527295B (en) * | 2017-08-24 | 2021-04-30 | 中南大学 | Academic team dynamic community discovery method based on temporal co-occurrence network and quality evaluation method thereof |
CN107527295A (en) * | 2017-08-24 | 2017-12-29 | 中南大学 | Dynamics community of Academic Teams based on tense coauthorship network finds method and its method for evaluating quality |
CN107729569B (en) * | 2017-11-17 | 2020-01-17 | 杭州师范大学 | Social relationship prediction method fusing network structure and text information |
CN107729569A (en) * | 2017-11-17 | 2018-02-23 | 杭州师范大学 | A kind of social networks Forecasting Methodology of UNE structure and text message |
CN110119424A (en) * | 2019-04-02 | 2019-08-13 | 天津大学 | Sparse trust method for digging based on round table influenza algorithm |
CN110377815A (en) * | 2019-06-06 | 2019-10-25 | 创易汇(北京)科技有限公司 | A kind of production, teaching & research recommender system and method |
CN110688405A (en) * | 2019-08-23 | 2020-01-14 | 上海科技发展有限公司 | Expert recommendation method, device, terminal and medium based on artificial intelligence |
CN111008285A (en) * | 2019-11-29 | 2020-04-14 | 中科院计算技术研究所大数据研究院 | Author disambiguation method based on thesis key attribute network |
CN112269909A (en) * | 2020-09-15 | 2021-01-26 | 淮阴工学院 | Expert recommendation method based on multi-source information fusion technology |
CN112269909B (en) * | 2020-09-15 | 2022-06-03 | 淮阴工学院 | Expert recommendation method based on multi-source information fusion technology |
CN113434644A (en) * | 2021-05-25 | 2021-09-24 | 北京农业信息技术研究中心 | Agricultural technology knowledge service method and system |
CN113434644B (en) * | 2021-05-25 | 2023-09-12 | 北京农业信息技术研究中心 | Agricultural technology knowledge service method and system |
Also Published As
Publication number | Publication date |
---|---|
CN100583804C (en) | 2010-01-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101075942A (en) | Method and system for processing social network expert information based on expert value progation algorithm | |
Wang et al. | Position bias estimation for unbiased learning to rank in personal search | |
CA2788704C (en) | Method and system for ranking intellectual property documents using claim analysis | |
Amitay et al. | Social search and discovery using a unified approach | |
US20120101808A1 (en) | Sentiment analysis from social media content | |
US10606853B2 (en) | Systems and methods for intelligent prospect identification using online resources and neural network processing to classify organizations based on published materials | |
CN1882943A (en) | Systems and methods for search processing using superunits | |
CN1517914A (en) | Searching of structural file | |
US9569525B2 (en) | Techniques for entity-level technology recommendation | |
AU2011210742A1 (en) | Method and system for conducting legal research using clustering analytics | |
Mahdabi et al. | The effect of citation analysis on query expansion for patent retrieval | |
CN102289514B (en) | The method of Social Label automatic marking and Social Label automatic marking device | |
Yu et al. | Query-driven discovery of semantically similar substructures in heterogeneous networks | |
Kang et al. | Learning to rank related entities in web search | |
Singhal et al. | Data extract: Mining context from the web for dataset extraction | |
Balog et al. | Category-based query modeling for entity search | |
Katsurai et al. | Topic Representation of Researchers' Interests in a Large-Scale Academic Database and Its Application to Author Disambiguation | |
Toba et al. | Enhanced unsupervised person name disambiguation to support alumni tracer study | |
Endrullis et al. | Entity search strategies for mashup applications | |
KR20160120583A (en) | Knowledge Management System and method for data management based on knowledge structure | |
JP5315726B2 (en) | Information providing method, information providing apparatus, and information providing program | |
Ma et al. | A multiple relevance feedback strategy with positive and negative models | |
US8886651B1 (en) | Thematic clustering | |
Batra et al. | Content based hidden web ranking algorithm (CHWRA) | |
Santos et al. | Mimicking web search engines for expert search |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20100120 Termination date: 20100622 |