CN102289430B - Method for analyzing latent semantics of fusion probability of multi-modality data - Google Patents
Method for analyzing latent semantics of fusion probability of multi-modality data Download PDFInfo
- Publication number
- CN102289430B CN102289430B CN2011101800250A CN201110180025A CN102289430B CN 102289430 B CN102289430 B CN 102289430B CN 2011101800250 A CN2011101800250 A CN 2011101800250A CN 201110180025 A CN201110180025 A CN 201110180025A CN 102289430 B CN102289430 B CN 102289430B
- Authority
- CN
- China
- Prior art keywords
- mode
- observed value
- probability
- latent semantic
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a method for analyzing latent semantics of the fusion probability of multi-modality data in the technical field of latent semantics analysis of probability. By starting from the essence of the multi-modality data and introducing a crossed structure, a standard probability latent semantic analysis model which is only applicable to single-modality is expanded to multi-modality, and difference between contributes of each modality to a latent semantic space and relevance of contents of each modality are modeled, so more accurate analysis and description are provided for the multi-modality data. The method has the advantages that: more accurate parameter estimation is realized by global parameter updating; reference is provided for selection of a proper subject number value range for each modality; and the working load of manual selection is reduced.
Description
Technical field
The invention belongs to probability latent semantic analysis technical field, relate in particular to a kind of fusion probability latent semantic analysis method of multi-modal data.
Background technology
The probability latent semantic analysis is the production model of a kind of observed value (co-occurrence of vocabulary-document), and by the introducing of latent semantic space, the mixing that utilizes multinomial distribution and condition to distribute carrys out the probability of modeling co-occurrence.At present, it has been widely used in the research fields such as information retrieval, natural language processing, audio frequency and video processing.
Usually, probability latent semantic analysis model is only applicable to the single mode data, for more complicated multi-modal data, has certain limitation.Multi-modal data comprise a plurality of mode that are mutually related, and they attempt to describe same content, but there are differences in the contribution that the data contents table is reached.In this case, for the co-occurrence matrix that each mode obtains, existing probability latent semantic analysis method can adopt following two kinds of disposal routes: the one, with the co-occurrence matrix simultaneous of identical weight with different modalities, then carry out the semantic analysis of standard; The 2nd, adopt asymmetrical mode only the latent semantic space of a mode to be estimated, then produced the observed value of all mode by this space.
Above two kinds of methods or ignored the essential difference of different modalities, or only unilateral description the relevance of data itself, all can not give the essence of multi-modal data, namely each mode is to the otherness of latent semantic space contribution and the relevance of content between them, to describe fully.Therefore, this just needs a kind of latent semantic analysis of probability for multi-modal data model.
Summary of the invention
The deficiencies such as essential difference of having ignored different modalities for the existing method of mentioning in the above-mentioned background technology, the present invention proposes a kind of fusion probability latent semantic analysis method of multi-modal data.
Technical scheme of the present invention is that the fusion probability latent semantic analysis method of multi-modal data is characterized in that the method comprises the following steps:
Step 1: set up the standard probability latent semantic analysis model of each mode, set up on this basis Fusion Model;
Step 2: determine the work space of Fusion Model, and selected number of topics;
Step 3: Fusion Model is decomposed into asymmetrical probability latent semantic analysis model,, according to input value and the selected number of topics of Fusion Model, calculates the initial parameter value of asymmetrical probability latent semantic analysis model;
Step 4: by the greatest hope algorithm, initial parameter value is upgraded, obtained final argument;
Step 5: utilize final argument to analyze mode to be detected.
The computing formula of described greatest hope algorithm is:
Wherein:
L is the likelihood function value;
Co-occurrence probabilities for mode V;
N
AVocabulary number for mode A;
N
VVocabulary number for mode V;
d
iBe i document;
M is the number of document in document sets.
Compared with the conventional method, the present invention has the following advantages:
The present invention, by each mode is carried out modeling, has embodied the contribution difference of different modalities to latent semantic space, and the relevance between mode has been described again in the introducing of decussate texture simultaneously, finally by the parameter of the overall situation, more newly arrives and realizes more accurate parameter estimation.Therefore, this Fusion Model is from multi-modal data essence, for it provides more rationally, analysis result accurately.In addition,, for the estimation of this model work space, also, for the selected suitable number of topics span of each mode provides foundation, reduced the workload of manually choosing.
Description of drawings
Fig. 1 is the structural representation of fusion probability latent semantic analysis model of the present invention;
Fig. 2 is the structural representation of fusion probability latent semantic analysis model of the present invention embodiment under two mode;
Figure a is the process by the probability latent semantic analysis model construction Fusion Model modeling of two standards; The process of figure b for by number of topics and the maximal value of standard probability latent semantic analysis model, determining the work space of Fusion Model; Figure c is for to take from Fusion Model the process that obtains the model final argument apart; Figure d is the application process of new mode.
Embodiment
Below in conjunction with accompanying drawing, preferred embodiment is elaborated.Should be emphasized that, following explanation is only exemplary, rather than in order to limit the scope of the invention and to apply.
Technical matters to be solved by this invention is to provide a kind of latent semantic analysis of probability for multi-modal data method, can represent simultaneously that each mode in multi-modal data is to the otherness of semantic space contribution and the relevance of content between them, thereby it is multi-modal that the probability latent semantic analysis is rationally expanded to from the single mode data, for it provides more accurately, describes.
The present invention includes following steps:
Step 1: set up the standard probability latent semantic analysis model of each mode, set up on this basis Fusion Model;
Step 2: determine the work space of Fusion Model, and selected number of topics;
Step 3: Fusion Model is decomposed into asymmetrical probability latent semantic analysis model,, according to input value and the selected number of topics of Fusion Model, calculates the initial parameter value of asymmetrical probability latent semantic analysis model;
Step 4: by the greatest hope algorithm, initial parameter value is upgraded, obtained final argument;
Step 5: utilize final argument to analyze mode to be detected.
The present invention is further detailed explanation below in conjunction with the drawings and specific embodiments.
With reference to Fig. 1, structural representation of the present invention is shown, wherein: K is the sum of mode; D is the document sets in observed value; M represents the number of document in document sets; w
kWord finder for k mode in observed value; N
kIt is the number of vocabulary in the word finder of k mode; z
kTheme space for k mode in observed value; L
F-kFor estimated the number of topics in this space that obtains by work space.
As shown in Figure 2, two modal datas of this instructions during take K=2 carry out module declaration as example,, for the multi-modal data of K>2 o'clock, can carry out corresponding modeling and calculation of parameter according to same principle.
Detailed process of the present invention is:
The modeling of Fusion Model:
As shown in Figure 2 a, be respectively mode A and mode V and build the probability latent semantic analysis model of standard, can be expressed as: d → z
A→ w
AAnd d → z
V→ w
VOn this basis, then the intersection of setting up theme and vocabulary between mode A and mode V generate relation, i.e. z
A→ w
VAnd z
V→ w
A, realize the modeling of Fusion Model.
Work space is estimated:
According to number of topics (L selected in the standard probability latent semantic analysis model of mode A and mode V
A, L
V) and maximal value
Determine the work space of Fusion Model, i.e. each mode number of topics (L in multi-modal data
F-A, L
F-V) span, they should meet simultaneously
With
Parameter estimation after number of topics in selected this scope carries out again, as shown in Figure 2 b.
Model parameter estimation:
As shown in Figure 2 c, at first, this Fusion Model is taken apart, regard two asymmetrical probability latent semantic analysis models as, be expressed as respectively: d → z
A→ w
A+ w
VAnd d → z
V→ w
A+ w
V, and based on known observed value co-occurrence matrix
With selected number of topics (L
F-AAnd L
F-V), the initial parameter value of estimation fusion model, comprise theme conditional probability (P (z
A| d), P (z
V| d)) and vocabulary conditional probability (P (w
A| z
A), P (w
V| z
A), P (w
V| z
V), P (w
A| z
V)), then, based on the greatest hope algorithm, from overall angle, above initial value is upgraded, obtain the model final argument.Wherein, based on the global parameter of greatest hope algorithm, upgrade and mainly comprise expectation value calculating and parameter revaluation two parts, specific as follows:
Greatest hope algorithm (based on maximum likelihood function) is as shown in formula (1), by iterating to try to achieve model parameter:
Wherein:
L is the likelihood function value;
N
AVocabulary number for mode A;
N
VVocabulary number for mode V;
Q vocabulary for mode V;
d
iBe i document;
M is the number of document in document sets.
1. expectation value is calculated:, according to initial parameter value, calculate four expectation values
Obtain the computing formula of the expectation value of this mode or another mode theme while representing respectively known certain mode observed value:
In formula (2):
p(d
i) be the probability that i document occurs;
Formula (3) arrives formula (5) by that analogy, that is:
2. parameter revaluation:, according to (2)-(5) formula, calculate the probable value theme after upgrading
And theme
The computing formula of conditional probability be respectively:
Wherein:
L
F-AFor the number of topics in the theme space of mode A;
L
F-VFor the number of topics in the theme space of mode V.
Formula (7) by that analogy.
The computing formula that obtains the conditional probability of this mode or another mode vocabulary during known certain mode theme is:
Formula (9) arrives formula (11) by that analogy, that is:
Alternately repeat expectation value calculation procedure and parameter revaluation step,, until formula (1) reaches convergence, just obtain final model parameter.
The model measurement calculation of parameter:
As shown in Figure 2 d, for new multi-modal data d
New, the known conditions of this moment comprises the co-occurrence matrix of this each mode of data
With the vocabulary conditional probability (P (w that is obtained by module 3
A| z
A), P (w
V| z
A), P (w
V| z
V), P (w
A| z
V)).Utilize equally the greatest hope algorithm to calculate, just keep the vocabulary conditional probability constant in this process, only iteration is carried out in formula (2)-(5) and (6)-(7),, until formula (1) reaches convergence, at this moment just obtained the theme conditional probability (P (z of new data
A| d
New), P (z
V| d
New)).
Fusion for multi-modal data probability latent semantic model of the present invention, by the theme space (z to each mode
A, z
V) modeling separately, reflect the contribution difference of different modalities to semantic space; Simultaneously, the vocabulary conditional probability (P (w of its decussate texture introducing
V| z
A), P (w
A| z
V)), the relevance between each mode has also been described respectively.Therefore, this Fusion Model is that the own characteristic for multi-modal data carries out modeling, thereby for it provides more rationally, data results accurately.In addition, the desirable theme number that is estimated as each mode of work space defines scope, has so both avoided the blindly inaccuracy of value, has reduced again by repeatedly enumerating to choose the calculated amount of optimal value, thereby has improved work efficiency.
The above; only for the better embodiment of the present invention, but protection scope of the present invention is not limited to this, anyly is familiar with those skilled in the art in the technical scope that the present invention discloses; the variation that can expect easily or replacement, within all should being encompassed in protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of claim.
Claims (1)
1. the fusion probability latent semantic analysis method of multi-modal data is characterized in that the method comprises the following steps:
Step 1: set up the standard probability latent semantic analysis model of each mode, set up on this basis Fusion Model, namely be respectively mode A and mode V and build the probability latent semantic analysis model of standard, can be expressed as: d → z
A→ w
AAnd d → z
V→ w
VOn this basis, then the intersection of setting up theme and vocabulary between mode A and mode V generate relation, i.e. z
A→ w
VAnd z
V→ w
A, realize the modeling of Fusion Model;
Step 2: determine the work space of Fusion Model, and selected number of topics;
Step 3: Fusion Model is decomposed into asymmetrical probability latent semantic analysis model,, according to input value and the selected number of topics of Fusion Model, calculates the initial parameter value of asymmetrical probability latent semantic analysis model;
Step 4: by the greatest hope algorithm, initial parameter value is upgraded, obtained final argument;
The computing formula of described greatest hope algorithm is:
Wherein:
L is the likelihood function value;
Known observed value for mode A;
Observed value co-occurrence matrix for mode A;
N
AVocabulary number for mode A;
N
VVocabulary number for mode V;
P vocabulary for mode A;
d
iBe i document;
M is the number of document in document sets;
D is the document sets in observed value;
z
ATheme space for A mode in observed value;
w
AWord finder for A mode in observed value;
z
VTheme space for V mode in observed value;
w
VWord finder for V mode in observed value;
Step 5: utilize final argument to analyze mode to be detected, obtained the theme conditional probability of new data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011101800250A CN102289430B (en) | 2011-06-29 | 2011-06-29 | Method for analyzing latent semantics of fusion probability of multi-modality data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011101800250A CN102289430B (en) | 2011-06-29 | 2011-06-29 | Method for analyzing latent semantics of fusion probability of multi-modality data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102289430A CN102289430A (en) | 2011-12-21 |
CN102289430B true CN102289430B (en) | 2013-11-13 |
Family
ID=45335870
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2011101800250A Expired - Fee Related CN102289430B (en) | 2011-06-29 | 2011-06-29 | Method for analyzing latent semantics of fusion probability of multi-modality data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102289430B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9396433B2 (en) | 2013-05-29 | 2016-07-19 | International Business Machines Corporation | Determining related data points from multi-modal inputs |
CN103984756B (en) * | 2014-05-29 | 2017-04-12 | 重庆大学 | Semi-supervised probabilistic latent semantic analysis based software change log classification method |
CN112232082B (en) * | 2020-10-14 | 2022-02-18 | 海南大学 | Multimode DIKW content multi-semantic analysis method for essential computing |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1920818A (en) * | 2006-09-14 | 2007-02-28 | 浙江大学 | Transmedia search method based on multi-mode information convergence analysis |
CN101334796A (en) * | 2008-02-29 | 2008-12-31 | 浙江师范大学 | Personalized and synergistic integration network multimedia search and enquiry method |
CN101388022A (en) * | 2008-08-12 | 2009-03-18 | 北京交通大学 | Web portrait search method for fusing text semantic and vision content |
US7844449B2 (en) * | 2006-03-30 | 2010-11-30 | Microsoft Corporation | Scalable probabilistic latent semantic analysis |
-
2011
- 2011-06-29 CN CN2011101800250A patent/CN102289430B/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7844449B2 (en) * | 2006-03-30 | 2010-11-30 | Microsoft Corporation | Scalable probabilistic latent semantic analysis |
CN1920818A (en) * | 2006-09-14 | 2007-02-28 | 浙江大学 | Transmedia search method based on multi-mode information convergence analysis |
CN101334796A (en) * | 2008-02-29 | 2008-12-31 | 浙江师范大学 | Personalized and synergistic integration network multimedia search and enquiry method |
CN101388022A (en) * | 2008-08-12 | 2009-03-18 | 北京交通大学 | Web portrait search method for fusing text semantic and vision content |
Non-Patent Citations (2)
Title |
---|
李志欣等.融合语义主题的图像自动标注.《软件学报》.2011,第22卷(第4期), |
融合语义主题的图像自动标注;李志欣等;《软件学报》;20110415;第22卷(第4期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN102289430A (en) | 2011-12-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Egger et al. | Discrete and phase field methods for linear elastic fracture mechanics: a comparative study and state-of-the-art review | |
Herrema et al. | A framework for parametric design optimization using isogeometric analysis | |
Sarkar et al. | Multifidelity and multiscale Bayesian framework for high-dimensional engineering design and calibration | |
CN103207856B (en) | A kind of Ontological concept and hierarchical relationship generation method | |
Liu et al. | Sequential optimization using multi-level cokriging and extended expected improvement criterion | |
CN108717423B (en) | Code segment recommendation method based on deep semantic mining | |
US20100004769A1 (en) | Method of designing a structure | |
CN108417210A (en) | A kind of word insertion language model training method, words recognition method and system | |
CN111127246A (en) | Intelligent prediction method for transmission line engineering cost | |
Zhao et al. | Research on rolling bearing virtual-real fusion life prediction with digital twin | |
Iuliano et al. | Aerodynamic shape optimization via non-intrusive POD-based surrogate modelling | |
Bai et al. | Design pattern modeling and extraction for CAD models | |
CN103678714B (en) | Construction method and device for entity knowledge base | |
CN111101929A (en) | Method, device and system for calculating average formation pressure of oil and gas reservoir | |
CN102289430B (en) | Method for analyzing latent semantics of fusion probability of multi-modality data | |
Stein et al. | Operator-and template-based modeling of solid geometry for Isogeometric Analysis with application to Vertical Axis Wind Turbine simulation | |
Runchal et al. | CFD of the Future: Year 2025 and Beyond | |
Khatouri et al. | Metamodeling techniques for CPU-intensive simulation-based design optimization: a survey | |
Hu et al. | An adaptive finite volume solver for steady Euler equations with non-oscillatory k-exact reconstruction | |
CN106777616A (en) | A kind of high-speed rice pot seedling planting machine seedling box mover Multidisciplinary Optimization method | |
Wang et al. | Research of the IFC-based transformation methods of geometry information for structural elements | |
Zhou et al. | Component mode synthesis order-reduction for dynamic analysis of structure modeled with NURBS finite element | |
Selvan | On the effect of shape parameterization on aerofoil shape optimization | |
Streuber et al. | Dynamic Geometry Control for Robust Aerodynamic Shape Optimization | |
CN109300504B (en) | Protein structure prediction method based on variable isoelite selection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20131113 Termination date: 20140629 |
|
EXPY | Termination of patent right or utility model |