CN115905617A - Video scoring prediction method based on deep neural network and double regularization - Google Patents
Video scoring prediction method based on deep neural network and double regularization Download PDFInfo
- Publication number
- CN115905617A CN115905617A CN202310187456.2A CN202310187456A CN115905617A CN 115905617 A CN115905617 A CN 115905617A CN 202310187456 A CN202310187456 A CN 202310187456A CN 115905617 A CN115905617 A CN 115905617A
- Authority
- CN
- China
- Prior art keywords
- user
- video
- matrix
- users
- score
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 36
- 238000013528 artificial neural network Methods 0.000 title claims abstract description 25
- 239000011159 matrix material Substances 0.000 claims abstract description 117
- 238000000354 decomposition reaction Methods 0.000 claims abstract description 27
- 230000000694 effects Effects 0.000 claims abstract description 15
- 239000010410 layer Substances 0.000 claims description 44
- 239000013598 vector Substances 0.000 claims description 39
- 238000004364 calculation method Methods 0.000 claims description 18
- 230000006870 function Effects 0.000 claims description 17
- 238000012545 processing Methods 0.000 claims description 10
- 239000002356 single layer Substances 0.000 claims description 6
- 230000004913 activation Effects 0.000 claims description 5
- 238000011156 evaluation Methods 0.000 claims description 3
- 230000001537 neural effect Effects 0.000 claims description 3
- 238000012552 review Methods 0.000 claims description 3
- 150000001875 compounds Chemical class 0.000 claims description 2
- 230000009977 dual effect Effects 0.000 claims description 2
- 238000003062 neural network model Methods 0.000 abstract description 4
- 238000005065 mining Methods 0.000 abstract description 2
- 238000004422 calculation algorithm Methods 0.000 description 12
- 238000001914 filtration Methods 0.000 description 7
- 210000002569 neuron Anatomy 0.000 description 3
- 238000012937 correction Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000011478 gradient descent method Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000007621 cluster analysis Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/735—Filtering based on additional data, e.g. user or group profiles
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0499—Feedforward networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/466—Learning process for intelligent management, e.g. learning user preferences for recommending movies
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/475—End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Signal Processing (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Human Computer Interaction (AREA)
- Library & Information Science (AREA)
- Image Analysis (AREA)
Abstract
The invention relates to a video scoring prediction method based on a deep neural network and double regularization, which comprises the steps of reconstructing a user-video scoring matrix, introducing a video associated regular item fused with user activity and a reliable nearest neighbor regular item, constructing a matrix decomposition recommendation model fused with the video associated regular item fused with the user activity and the reliable nearest neighbor regular item, inputting potential features into the deep neural network to obtain a result of a deep neural network model, and combining the result of the deep neural network model with a matrix decomposition structure to obtain a final prediction score, so that the precision of the prediction score is improved; and mining related information in user video comments by using an LDA (latent Dirichlet Allocation) model to generate a user type latent feature matrix and a video type latent feature matrix, combining the user type latent feature matrix and the video type latent feature matrix to obtain a hidden information matrix, and combining the hidden information matrix and an original user video scoring matrix to generate a new user-video scoring matrix, so that the problems of cold start and data sparsity are solved.
Description
Technical Field
The invention relates to a video scoring prediction method based on a deep neural network and double regularization, and belongs to the field of scoring prediction.
Background
With the rapid development of the internet technology, video resources in each network platform are more and more, abundant video resources are provided for users, and when more choices are provided for the users, troubles and puzzles are brought to the users, and the huge video resources not only increase the difficulty of searching favorite videos of the users, but also lead the process of searching the videos to be very time-consuming. In order to solve the information overload problem, the personalized recommendation system becomes an effective tool for solving the problem. The score prediction is again an important component of the recommendation algorithm. The existing recommendation algorithms mainly have three main categories: collaborative filtering based recommendation algorithms, content based recommendation algorithms, and hybrid recommendation algorithms. At present, the most used collaborative filtering-based recommendation algorithm is the collaborative filtering-based recommendation algorithm, and the most used collaborative filtering-based recommendation algorithm is the model-based collaborative filtering recommendation algorithm, and several algorithms which are relatively common in the model-based collaborative filtering recommendation algorithm include: matrix decomposition model, singular value decomposition, cluster analysis, etc. However, the existing collaborative filtering recommendation algorithm has problems of data sparseness, cold start and the like, which causes inaccurate score prediction of recommended video resources, thereby affecting personalized recommendation results, improving accuracy of prediction scoring of video resources, and further improving recommendation precision becomes one of the hot spots of current research.
Disclosure of Invention
The invention aims to provide a video scoring prediction method based on a deep neural network and double regularization aiming at the defects of the prior art, a user-video scoring matrix is reconstructed, a video association regular term and a reliable nearest neighbor regular term which are fused with user liveness are introduced during matrix decomposition to restrict the learning of a potential feature matrix, the deep neural network is introduced, the nonlinear feature of the deep neural network is utilized to relieve the limitation of linear dot products in the matrix decomposition process, the result of a deep neural network model is combined with the result of double regularization matrix decomposition, and the precision of video scoring prediction is improved.
The technical scheme adopted by the invention is as follows: a video scoring prediction method based on a deep neural network and double regularization is used for improving the precision of scoring prediction of recommended videos, and specifically comprises the following steps:
step S1: processing the video comments, excavating hidden information, combining a hidden information matrix with an original user-video scoring matrix to generate a new user-video scoring matrix, and entering the step S2;
step S2: the method comprises the following steps of adding a biregular term constraint potential feature matrix into a user-video scoring matrix for learning when decomposing a user-video scoring matrix, wherein each user scores a video to make a certain contribution to video similarity, and the user contributions are not completely the same, so that the users can be divided into active users and inactive users according to the liveness of the users, the active users refer to users who score and record a large number of videos, the inactive users refer to users who score and record only a small number of videos, so that the contributions of the active users and the inactive users are separated when the video similarity is calculated, and the liveness of the users is defined as:
In the formula 1, the first and second groups of the compound,the total score of the user u is represented, so that the video similarity calculation method obtained by combining the activity coefficient of the user and the modified cosine similarity comprises the following steps: />
In the formula 2, the first and second groups,represents the rating of user u on video i, </or>Represents the rating of user u on video j, and>represents a rating score for user u, and->Representing a set of users who have simultaneously scored videos i and j; and (3) introducing the learning of the video associated regularization term constraint project potential feature matrix integrated with the user activity during matrix decomposition, wherein the video associated regularization constraint function formula integrated with the user activity at the moment is as follows:
In equation 3, V represents a video feature matrix, V j Is a potential feature vector of video j,V i Is a potential feature vector of video i and proceeds to step S3;
and step S3: taking the potential characteristic vector decomposed by the matrix as the input of the multilayer perceptron, processing the potential characteristic vector by the multilayer perceptron to obtain a result predicted by a multilayer perceptron model, and entering the step S4;
step S4; and combining the result predicted by the multilayer perceptron model with the result of matrix decomposition in the merging layer, and optimizing the model by using a normalized cross entropy method to obtain the final predicted score.
As a preferred technical scheme of the invention: in the step S1, firstly, the LDA model is used to mine the hidden information of the relevant types in the user video review, generate the user type potential feature matrix LU and the video type potential feature matrix LV, and combine the user type potential feature matrix with the video type potential feature matrix to obtain the user type potential feature matrix LU and the video type potential feature matrix LV
To the hidden information matrix L, the calculation formula is:
And combining the hidden information matrix L with the original user-video scoring matrix R to generate a new user-video scoring matrixThe calculation formula is as follows:
As a preferred technical scheme of the invention: in step S2, users with the same interests and hobbies may affect each other, and the user similarity may be calculated using a weighted pearson correlation coefficient:
In the formula 6, the first and second groups,and &>Represents the mean rating, based on the user u and v, respectively>Represents the rating of user u on video i, </or>Represents the rating of user v on video i, <' > or>Represents a set of videos that user u has commented on, and @>Represents a set of videos that user v has commented on, <' > or>The Jaccard correlation coefficient of the items which can influence the user similarity calculation is the weight, and the calculation formula is as follows:
Wherein,represents a set of videos that user u has commented on, and @>Representing a video set which is commented by a user v;
the user's score for an item depends on the influence of neighboring users and may also be influenced by neighboring users of neighboring users, but neighboring users at a certain distance do not influence the user, i.e. become unreliable, so a reliable value is introduced, and neighboring users with a reliable value greater than a certain value have an influence on the user item score, and the reliable value is calculated by:
In the formula 8, the process is described,represents a score of u on video i, <' > or>Represents the rating of user v on video i, <' > or>Represents a set of videos that user u has commented on, and @>Represents a set of videos that user v has commented on, <' > or>Represents the maximum value of the score, based on the evaluation>Indicates a trust distance, i.e. the number of users present between user u and user v, is present>Represents the maximum distance allowed between two users, and>is a correction parameter, is a number greater than 0 and less than 1, and is reliable
Introducing a reliable nearest neighbor regular term to constrain the learning of the potential feature matrix of the user during matrix decomposition, wherein a reliable nearest neighbor regular term constraint function is as follows:
As a preferred technical scheme of the invention: in the step S3, the user potential feature vector and the video potential feature vector are used as inputs of a multilayer perceptron, wherein the deep neural network is composed of the multilayer perceptron and a single-layer perceptron, the multilayer perceptron includes an input layer, a plurality of hidden layers allowing neural structure nonlinearity and an output layer, and a result of a multilayer perceptron model is obtained by processing the multilayer perceptron through nonlinear features of the hidden layers.
As a preferred technical scheme of the invention: in step S4, the single-layer perceptron in the deep neural network structure is a merging layer, and the prediction result of the multilayer perceptron model is combined with the result of the regularized dual matrix decomposition model in the merging layer, where the calculation formula is:
In the formula 11For an activation function, <' >>For the set of matrix weights between the output layer and the merging layer, <' >>Is the result of the output layer, is>For a user potential vector, based on the number of potential vectors in the selected candidate set>For a video potential vector>And optimizing the model by using a normalized cross entropy method for merging the deviation items of the layers to finally obtain the prediction score.
Has the beneficial effects that:
1. the method introduces the video association regular term and the reliable nearest neighbor regular term which are blended into the user activity to restrain the learning of the potential feature matrix, utilizes the nonlinear structure of the deep neural network to relieve the limitation of linear dot products in the matrix decomposition process, combines the result of the deep neural network model with the result of the double regularization matrix decomposition, and improves the precision of video scoring prediction.
2. According to the method, the LDA model is utilized to mine relevant information in the user video comments, a user type potential feature matrix and a video type potential feature matrix are generated, the user type potential feature matrix and the video type potential feature matrix are combined to obtain a hidden information matrix, the hidden information matrix and an original user video scoring matrix are combined to generate a new user-video scoring matrix, and the problems of cold start and data sparsity are solved.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a block diagram of the multi-level perceptron of the present invention;
FIG. 3 is a diagram of the deep neural network architecture of the present invention.
Detailed Description
The following description will explain embodiments of the present invention in further detail with reference to the accompanying drawings.
The method is characterized in that a user-video scoring matrix is reconstructed on the basis of a traditional matrix decomposition model, and a video association regular term and a reliable nearest neighbor regular term which are blended into the user activity are introduced to limit the learning of a potential feature matrix. The nonlinear structure of the deep neural network is utilized, the limit of linear dot products in the matrix decomposition process is relieved, the potential feature vector decomposed by the matrix is used as the input of the deep neural network, the result of the MLP model is obtained through the processing of a plurality of layers of perceptrons, the result of the MLP model is combined with the result of the biregularized matrix decomposition model in a single-layer perceptron layer, namely a merging layer, the model is optimized through a normalized cross entropy method, and therefore the scoring prediction precision is improved.
As shown in fig. 1.
The invention designs a video scoring prediction method based on a deep neural network and double regularization, which is used for improving the precision of the scoring prediction of a recommended video, and comprises the following steps:
step S1: processing the video comments, excavating hidden information, combining a hidden information matrix with an original user-video scoring matrix to generate a new user-video scoring matrix, and entering the step S2;
step S2: adding a double regular term for restricting the learning of the potential feature matrix when decomposing the user-video scoring matrix, and entering a step S3;
and step S3: taking the potential characteristic vector decomposed by the matrix as the input of the multilayer perceptron, processing the potential characteristic vector by the multilayer perceptron to obtain a result predicted by a multilayer perceptron model, and entering the step S4;
and step S4: and combining the result predicted by the multilayer perceptron model with the result of matrix decomposition in the merging layer, and optimizing the model by using a normalized cross entropy method to obtain the final predicted score.
The method comprises the following specific steps:
the step S1 comprises the following steps: firstly, mining related types of hidden information in user video review by using an LDA (latent Dirichlet Allocation) model to generate a user type potential feature matrix LU and a video type potential feature matrix LV, and combining the user type potential feature matrix with the video type potential feature matrix to obtain a hidden information matrix L, wherein the calculation formula is as followsAnd combining the hidden information matrix L with the original user-video scoring matrix R to reconstruct a user-video scoring matrix R, wherein the calculation formula is as follows:。
The step S2 comprises the following steps: carrying out matrix decomposition on the user-video scoring matrix, and decomposing the high-dimensional user-video scoring matrix into a low-dimensional user characteristic matrix and a video characteristic matrix, wherein the formula is as follows:
wherein U represents a user feature matrix, U i Potential feature vector representing user i, V represents a video feature matrix, V j Is the potential feature vector for video j. The low-dimensional matrix decomposition method approximately calculates the scoring matrix R by the product of the d-rank factors. The prediction score of user i for video j is expressed asThe square of the error between the predicted score and the raw score is taken as a loss function, which is minimized to approximate the score matrix R. The loss function is:
in the above-mentioned formula,is an indicator function that indicates that if user i scores item j, it equals 1, otherwise it equals 0.And &>Over-fitting is prevented for two regular terms. Since each user scores the video to make a certain contribution to the video similarity, but each user contribution is not exactly the same, and the video similarity can be divided into active users and inactive users by considering the activity of the users, the active users refer to users who have a large number of score records on the video, and the inactive users refer to users who only have score records on a small number of videos, so the contributions of the active users and the inactive users should be distinguished when calculating the video similarity, and the activity of the users can be defined as:
in the above-mentioned formula,representing the total number of scores for user u. Therefore, the video similarity calculation method obtained by combining the liveness coefficient of the user with the modified cosine similarity is as follows:
in the above-mentioned formula,represents the rating of user u on video i, </or>Represents the rating of user u on video j, and>representing the user u's score. The video association regularization item integrated with the user activity is introduced during matrix decomposition to constrain the learning of a project potential feature matrix, and the video association regularization constraint function formula integrated with the user activity is as follows:
wherein V represents a video feature matrix, V j Is a potential feature vector, V, of video j i Are potential feature vectors for video i. Users with the same interest will influence each other, and the user similarity can be calculated using weighted pearson correlation coefficients:
in the above-mentioned formula,and &>Represents the mean score of users u and v, respectively>The Jaccard correlation coefficient of the items which can influence the user similarity calculation is the weight, and the calculation formula is as follows:
wherein,represents a set of videos that user u has commented on, and @>Representing a set of videos that user v has commented on.
The user's score for an item depends on the influence of neighboring users and may also be influenced by neighboring users of neighboring users, but neighboring users at a certain distance do not influence the user, i.e. become unreliable, so a reliable value is introduced, and neighboring users with a reliable value greater than a certain value have an influence on the user item score, and the reliable value is calculated by:
in the above-mentioned formula,represents a score of u on video i, <' > or>Represents the rating of user v on video i, <' > or>Represents a set of videos that user u has commented on, and @>Represents a set of videos that a user v has commented on,/>>Represents the maximum value of the score, based on the evaluation>Representing a trust distance, i.e. the number of users present between user u and user v, in conjunction with a user v>Represents the maximum distance allowed between two users, and>is a number greater than 0 and less than 1 for the correction parameter. Reliable nearest neighbor users are
Introducing a reliable nearest neighbor regular term to constrain the learning of a potential feature matrix of a user during matrix decomposition, wherein a reliable nearest neighbor regular term constraint function is as follows:
Adding a video associated regularization term fused with user activity and a reliable nearest neighbor regularization term, wherein the final optimization loss function is as follows:
and searching an optimal solution by adopting a random gradient descent method, and finding out an optimal potential feature matrix.
As shown in fig. 2.
The step S3 comprises the following steps: user potential feature vector U u And video latent feature vector V i As input to a multi-layered perceptron, as shown in FIG. 2, comprising an input layer L in A plurality of hidden layers allowing the nonlinearity of the neural structure and an output layer L out An input layer L in The output vector of (a) is:
outputting a vector after the first hidden layer processing:
whereinIs a set of weights, included in a matrix between the input layer and the first layer hidden layer L1, is->Is a aberration of level L1>Is an activation function of
So as to hide the layer L k The output vector of (a) is:
whereinIs an activation function of the neuron>Is a weight matrix, is based on>Is a deviation. Multilayer perceptron output layer L out The output vector is:
as shown in fig. 3.
The step S4 comprises the following steps: the single-layer perceptron in the deep neural network structure is a merging layer, wherein the deep neural network structure is shown in fig. 3, the prediction result of the multilayer perceptron model and the result of the double regularization matrix decomposition model are combined in the merging layer, and the calculation formula is as follows:
wherein,for an activation function, <' >>For the set of matrix weights between the output layer and the merging layer, <' >>Is the result of the output layer, is>For a user potential vector, based on the number of potential vectors in the selected candidate set>For a potential vector of video, be>To merge the deviation terms of the layers
The proposed model is continuously optimized by using a normalized cross entropy method through the following cost functions:
in the above-mentioned formula, the first and second,for merging layer neuron number, based on the combined layer neuron number>For a predicted score>For a real point of training instance, <' > based on the number of true points>The maximum value of the score is indicated. The cost function continuously optimizes the model by using a gradient descent method to obtain the final prediction score.
The embodiments of the present invention have been described in detail with reference to the drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the gist of the present invention.
Claims (5)
1. A video scoring prediction method based on a deep neural network and double regularization is characterized by comprising the following steps:
step S1: processing the video comments, excavating hidden information, combining a hidden information matrix with an original user-video scoring matrix to generate a new user-video scoring matrix, and entering the step S2;
step S2: the learning of a double regular term constraint potential feature matrix is added during the decomposition of a user-video scoring matrix, the scoring of each user on a video makes a certain contribution to the video similarity, the user contributions are not completely the same, the video scoring matrix can be divided into active users and inactive users according to the liveness of the users, the active users refer to users who record a large number of scoring records on the video, the inactive users refer to users who record only a small number of scoring records on the video, so the contributions of the active users and the inactive users are separated when the video similarity is calculated, and the liveness of the users is defined as:
In the formula 1, the first and second groups of the compound,the total score of the user u is represented, so that the video similarity calculation method obtained by combining the activity coefficient of the user and the modified cosine similarity comprises the following steps:
In the case of the formula 2, the,represents the rating of user u on video i, </or>Represents the rating of user u on video j, and>represents a rating score for user u, and->Representing a set of users who have simultaneously scored videos i and j; and (3) introducing the learning of the video associated regularization term constraint project potential feature matrix blended with the user activity during matrix decomposition, wherein the video associated regularization constraint function formula blended with the user activity at the moment is as follows:
In equation 3, V represents the video feature matrix, V j Is a potential feature vector, V, of video j i Is a potential feature vector of video i and proceeds to step S3;
and step S3: taking the potential characteristic vector decomposed by the matrix as the input of the multilayer perceptron, processing the potential characteristic vector by the multilayer perceptron to obtain a result predicted by a multilayer perceptron model, and entering the step S4;
and step S4: and combining the result predicted by the multilayer perceptron model with the result of matrix decomposition in the merging layer, and optimizing the model by using a normalized cross entropy method to obtain the final predicted score.
2. The method of claim 1, wherein the video score prediction method based on deep neural network and regularization comprises: in the step S1, firstly, the LDA model is used to mine the hidden information of the relevant type in the user video review, the user type potential feature matrix LU and the video type potential feature matrix LV are generated, and the user type potential feature matrix is combined with the video type potential feature matrix to obtain the hidden information matrix L, wherein the calculation formula is as follows:
And combining the hidden information matrix L with the original user-video scoring matrix R to generate a new user-video scoring matrixThe calculation formula is as follows:
3. The method of claim 1, wherein the video score prediction method based on deep neural network and regularization comprises: in step S2, users with the same interests and hobbies affect each other, and the user similarity is calculated using the weighted pearson correlation coefficient:
In the formula 6, the first and second groups,and &>Represents the mean rating, based on the user u and v, respectively>Represents the rating of user u on video i, </or>Represents the rating of user v on video i, <' > or>Represents a set of videos that user u has commented on, and @>Represents a set of videos that user v has commented on,the Jaccard correlation coefficient of the items which can influence the user similarity calculation is the weight, and the calculation formula is as follows:
Wherein,represents a set of videos that user u has commented on, and @>Representing a video set which is commented by a user v;
the user's score for an item depends on the influence of neighboring users and is also influenced by neighboring users of neighboring users, but neighboring users at a certain distance do not influence the user, i.e. become unreliable, so a reliable value is introduced, and neighboring users with a reliable value greater than a certain value have an influence on the user item score, and the reliable value is calculated by:
In the formula 8, the process is described,represents a score of u on video i, <' > or>Represents the rating on video i of user v>Represents a set of videos that user u has commented on, and @>Represents a set of videos that user v has commented on, <' > or>Represents the maximum value of the score, based on the evaluation>Representing a trust distance, i.e. the number of users present between user u and user v, in conjunction with a user v>Represents the maximum distance allowed between two users, and>is to correct the parameter, is oneA number greater than 0 and less than 1, the reliable nearest neighbor user is
Introducing a reliable nearest neighbor regular term to constrain the learning of the potential feature matrix of the user during matrix decomposition, wherein a reliable nearest neighbor regular term constraint function is as follows:
4. The method of claim 1, wherein the video score prediction method based on deep neural network and regularization comprises: in the step S3, the user potential feature vector and the video potential feature vector are used as inputs of a multilayer perceptron, wherein the deep neural network is composed of the multilayer perceptron and a single-layer perceptron, the multilayer perceptron includes an input layer, a plurality of hidden layers allowing the neural structure to be nonlinear, and an output layer, and the result of the multilayer perceptron model is obtained through processing by the multilayer perceptron by utilizing the nonlinear features of the hidden layers.
5. The method of claim 1, wherein the video score prediction method based on deep neural network and regularization comprises: in step S4, the single-layer perceptron in the deep neural network structure is a merging layer, and the prediction result of the multilayer perceptron model is combined with the result of the regularized dual matrix decomposition model in the merging layer, where the calculation formula is:
In equation 11For an activation function, <' >>For the set of matrix weights between the output layer and the merging layer, <' >>Is the result of the output layer, is>For a user potential vector>For a potential vector of video, be>And optimizing the model by using a normalized cross entropy method for merging the deviation items of the layers to finally obtain the prediction score. />
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310187456.2A CN115905617B (en) | 2023-03-02 | 2023-03-02 | Video scoring prediction method based on deep neural network and double regularization |
PCT/CN2023/135369 WO2024179089A1 (en) | 2023-03-02 | 2023-11-30 | Video score prediction method based on deep neural network and double regularization |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310187456.2A CN115905617B (en) | 2023-03-02 | 2023-03-02 | Video scoring prediction method based on deep neural network and double regularization |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115905617A true CN115905617A (en) | 2023-04-04 |
CN115905617B CN115905617B (en) | 2023-05-30 |
Family
ID=85744775
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310187456.2A Active CN115905617B (en) | 2023-03-02 | 2023-03-02 | Video scoring prediction method based on deep neural network and double regularization |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN115905617B (en) |
WO (1) | WO2024179089A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024179089A1 (en) * | 2023-03-02 | 2024-09-06 | 南京邮电大学 | Video score prediction method based on deep neural network and double regularization |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112328900A (en) * | 2020-11-27 | 2021-02-05 | 北京工业大学 | Deep learning recommendation method integrating scoring matrix and comment text |
CN112668374A (en) * | 2019-10-16 | 2021-04-16 | 北京灵汐科技有限公司 | Image processing method and device, re-recognition network training method and electronic equipment |
CN113157957A (en) * | 2021-03-05 | 2021-07-23 | 北京工业大学 | Attribute graph document clustering method based on graph convolution neural network |
CN113807422A (en) * | 2021-09-07 | 2021-12-17 | 南京邮电大学 | Weighted graph convolutional neural network score prediction model fusing multi-feature information |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111581333B (en) * | 2020-05-07 | 2023-05-26 | 重庆大学 | Text-CNN-based video and audio play list pushing method and video and audio play list pushing system |
CN114117191A (en) * | 2020-08-26 | 2022-03-01 | 付昳漫 | Context-aware recommendation method based on deep neural network |
CN114625980A (en) * | 2020-12-13 | 2022-06-14 | 国网吉林省电力有限公司培训中心 | Dual-regularization collaborative filtering recommendation method |
CN115905617B (en) * | 2023-03-02 | 2023-05-30 | 南京邮电大学 | Video scoring prediction method based on deep neural network and double regularization |
-
2023
- 2023-03-02 CN CN202310187456.2A patent/CN115905617B/en active Active
- 2023-11-30 WO PCT/CN2023/135369 patent/WO2024179089A1/en unknown
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112668374A (en) * | 2019-10-16 | 2021-04-16 | 北京灵汐科技有限公司 | Image processing method and device, re-recognition network training method and electronic equipment |
CN112328900A (en) * | 2020-11-27 | 2021-02-05 | 北京工业大学 | Deep learning recommendation method integrating scoring matrix and comment text |
CN113157957A (en) * | 2021-03-05 | 2021-07-23 | 北京工业大学 | Attribute graph document clustering method based on graph convolution neural network |
CN113807422A (en) * | 2021-09-07 | 2021-12-17 | 南京邮电大学 | Weighted graph convolutional neural network score prediction model fusing multi-feature information |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024179089A1 (en) * | 2023-03-02 | 2024-09-06 | 南京邮电大学 | Video score prediction method based on deep neural network and double regularization |
Also Published As
Publication number | Publication date |
---|---|
WO2024179089A1 (en) | 2024-09-06 |
CN115905617B (en) | 2023-05-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110929164B (en) | Point-of-interest recommendation method based on user dynamic preference and attention mechanism | |
Pan et al. | A correlative denoising autoencoder to model social influence for top-N recommender system | |
CN111127142B (en) | Article recommendation method based on generalized nerve attention | |
Ni et al. | An effective recommendation model based on deep representation learning | |
Ni et al. | A two-stage embedding model for recommendation with multimodal auxiliary information | |
CN110866145B (en) | Co-preference-assisted deep single-class collaborative filtering recommendation method | |
CN107507073A (en) | Based on the service recommendation method for trusting extension and the sequence study of list level | |
CN112800344B (en) | Deep neural network-based movie recommendation method | |
CN111125540A (en) | Recommendation method integrating knowledge graph representation learning and bias matrix decomposition | |
CN115905617B (en) | Video scoring prediction method based on deep neural network and double regularization | |
Lin et al. | Lifelong Text-Audio Sentiment Analysis learning | |
Lin et al. | Transfer learning for collaborative recommendation with biased and unbiased data | |
CN113836393B (en) | Cold start recommendation method based on preference self-adaptive meta-learning | |
CN110889505A (en) | Cross-media comprehensive reasoning method and system for matching image-text sequences | |
Nazari et al. | Scalable and data-independent multi-agent recommender system using social networks analysis | |
Mu et al. | Auxiliary stacked denoising autoencoder based collaborative filtering recommendation | |
Rauf et al. | BCE4ZSR: Bi-encoder empowered by teacher cross-encoder for zero-shot cold-start news recommendation | |
CN115310004A (en) | Graph nerve collaborative filtering recommendation method fusing project time sequence relation | |
CN116049377A (en) | Context-aware recommendation system and method based on matrix decomposition and CRNN | |
Xiao et al. | A better understanding of the interaction between users and items by knowledge graph learning for temporal recommendation | |
Lu et al. | GTPAN: Global Target Preference Attention Network for session-based recommendation | |
Dashti et al. | Hybrid recommender system based on fuzzy neural algorithm | |
Gan et al. | A WEIGHTED NEURAL MATRIX FACTORIZATION HEALTH MANAGEMENT RECOMMENDATION ALGORITHM INTEGSCORING DEEP LEARNING TECHNOLOGY | |
AL SBOU et al. | Performance comparison of three different types of autoencoders using recommendation systems | |
Tomar et al. | Twin support vector machine for multiple instance learning based on bag dissimilarities |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |