CN115905617B - Video scoring prediction method based on deep neural network and double regularization - Google Patents
Video scoring prediction method based on deep neural network and double regularization Download PDFInfo
- Publication number
- CN115905617B CN115905617B CN202310187456.2A CN202310187456A CN115905617B CN 115905617 B CN115905617 B CN 115905617B CN 202310187456 A CN202310187456 A CN 202310187456A CN 115905617 B CN115905617 B CN 115905617B
- Authority
- CN
- China
- Prior art keywords
- user
- video
- matrix
- representing
- users
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 30
- 238000013528 artificial neural network Methods 0.000 title claims abstract description 22
- 239000011159 matrix material Substances 0.000 claims abstract description 114
- 238000000354 decomposition reaction Methods 0.000 claims abstract description 23
- 230000000694 effects Effects 0.000 claims abstract description 16
- 239000010410 layer Substances 0.000 claims description 74
- 239000013598 vector Substances 0.000 claims description 37
- 230000006870 function Effects 0.000 claims description 18
- 238000004364 calculation method Methods 0.000 claims description 15
- 238000012545 processing Methods 0.000 claims description 10
- 239000002356 single layer Substances 0.000 claims description 6
- 238000005065 mining Methods 0.000 claims description 4
- 230000000452 restraining effect Effects 0.000 claims description 4
- 238000012937 correction Methods 0.000 claims description 3
- 230000009977 dual effect Effects 0.000 claims description 3
- 230000001537 neural effect Effects 0.000 claims description 3
- 238000003062 neural network model Methods 0.000 abstract description 4
- 238000004422 calculation algorithm Methods 0.000 description 11
- 238000001914 filtration Methods 0.000 description 6
- 230000004913 activation Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000011478 gradient descent method Methods 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000007621 cluster analysis Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/735—Filtering based on additional data, e.g. user or group profiles
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0499—Feedforward networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/466—Learning process for intelligent management, e.g. learning user preferences for recommending movies
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/475—End-user interface for inputting end-user data, e.g. personal identification number [PIN], preference data
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Signal Processing (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Human Computer Interaction (AREA)
- Library & Information Science (AREA)
- Image Analysis (AREA)
Abstract
The invention relates to a video scoring prediction method based on a deep neural network and double regularization, which comprises the steps of reconstructing a user-video scoring matrix, introducing a video association regular term fused with user activity and a reliable nearest neighbor regular term, constructing a matrix decomposition recommendation model fused with the video association regular term fused with user activity and the reliable nearest neighbor regular term, inputting potential features into the deep neural network to obtain a result of the deep neural network model, combining the result of the deep neural network model with a matrix decomposition structure to obtain a final prediction score, and improving the accuracy of the prediction score; and the LDA model is utilized to mine relevant information in the user video comments, a user type potential feature matrix and a video type potential feature matrix are generated, the user type potential feature matrix and the video type potential feature matrix are combined to obtain a hidden information matrix, and then the hidden information matrix is combined with the original user video scoring matrix to generate a new user-video scoring matrix, so that the problems of cold start and data sparsity are relieved.
Description
Technical Field
The invention relates to a video scoring prediction method based on a deep neural network and double regularization, and belongs to the field of scoring prediction.
Background
Along with the rapid development of internet technology, video resources in each network platform are more and more, so that abundant video resources are provided for users, and when more choices are provided for users, trouble and trouble are brought to the users, and huge video resources not only increase the difficulty of finding favorite videos of the users, but also make the process of finding videos quite time-consuming. In order to solve the information overload problem, the personalized recommendation system becomes an effective tool for solving the problem. Score prediction is an important component of the recommendation algorithm. The existing recommendation algorithm mainly comprises three main categories: collaborative filtering-based recommendation algorithms, content-based recommendation algorithms, and hybrid recommendation algorithms. The most used collaborative filtering-based recommendation algorithm at present, and the most used collaborative filtering recommendation algorithm is the model-based collaborative filtering recommendation algorithm, and several more common algorithms in the model-based collaborative filtering recommendation algorithm include: matrix decomposition model, singular value decomposition, cluster analysis, etc. However, the existing collaborative filtering recommendation algorithm has the problems of sparse data, cold start and the like, so that the recommendation video resource scoring prediction is inaccurate, the personalized recommendation result is affected, the accuracy of the video resource scoring prediction is improved, and the recommendation precision is further improved to be one of hot spots of the current research.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a video scoring prediction method based on a depth neural network and double regularization, a user-video scoring matrix is reconstructed, a video association regular term and a reliable nearest neighbor regular term which are integrated with the activity of a user are introduced to restrain the learning of a potential feature matrix during matrix decomposition, the depth neural network is introduced, the restriction of linear dot products in the matrix decomposition process is relieved by utilizing the nonlinear features of the depth neural network, the result of a depth neural network model is combined with the result of the double regularization matrix decomposition, and the accuracy of video scoring prediction is improved.
The technical scheme adopted by the invention is as follows: a video scoring prediction method based on a deep neural network and double regularization is used for improving the accuracy of scoring prediction of recommended videos, and specifically comprises the following steps:
step S1: processing the video comments, mining hidden information, combining the hidden information matrix with the original user-video scoring matrix to generate a new user-video scoring matrix, and entering into step S2;
step S2: the method comprises the steps that a dual regularization term is added to learn a potential feature matrix when a user-video scoring matrix is decomposed, each user can make a certain contribution to video similarity, the user contributions are not identical, the user can be divided into active users and inactive users from the consideration of the liveness of the users, the active users refer to users with a large number of scoring records on the video, the inactive users refer to users with scoring records on a small number of videos, so that the contributions of the active users and the inactive users are distinguished when the video similarity is calculated, and the liveness of the users is defined as follows:
In the case of the formula 1 of the present invention,the total score of the user u is represented, so that the video similarity calculation method obtained by combining the activity coefficient of the user with the corrected cosine similarity is as follows: />
In the case of the formula 2 of the present invention,representing the score of user u for video i, +.>Representing the score of user u for video j, +.>Score representing user u, ++>Representing a set of users who have scored video i and j simultaneously; the learning of the potential feature matrix of the video association regularization term constraint project which is integrated with the user activity is introduced during matrix decomposition, and the video association regularization constraint function formula which is integrated with the user activity at the moment is as follows:
In equation 3, V represents the video feature matrix, V j Is the potential feature vector of video j, V i Is a potential feature vector of the video i, and proceeds to step S3;
step S3: taking the potential feature vectors decomposed by the matrix as the input of the multi-layer perceptron, processing the potential feature vectors by the multi-layer perceptron to obtain a multi-layer perceptron model prediction result, and entering into step S4;
s4, a step of S4; and combining the result of the multi-layer perceptron model prediction with the result of matrix decomposition at the merging layer, and optimizing the model by using a normalized cross entropy method to obtain a final prediction score.
As a preferred technical scheme of the invention: in the step S1, firstly, the LDA model is utilized to mine relevant type hidden information in the user video evaluation, generate a user type latent feature matrix LU and a video type latent feature matrix LV, and combine the user type latent feature matrix and the video type latent feature matrix to obtain
To the hidden information matrix L, the calculation formula is:
And combining the hidden information matrix L with the original user-video scoring matrix R to generate a new user-video scoring matrixThe calculation formula is as follows:
As a preferred technical scheme of the invention: in the step S2, users with the same interests may affect each other, and the user similarity may be calculated using the weighted pearson correlation coefficient:
In the case of the formula 6 of the present invention,and->Represents the average score of users u and v, respectively, < >>Representing the score of user u for video i, +.>Representing the score of user v for video i, +.>Representing a video set reviewed by user u, +.>Representing a video set reviewed by user v, +.>The Jaccard correlation coefficient, which is a term capable of affecting the user similarity calculation, is calculated as follows:
Wherein,,representing a video set reviewed by user u, +.>Representing a video set reviewed by the user v;
the scoring of the item by the user depends on the influence of the adjacent user, and the influence of the adjacent user by the adjacent user can be also possible, but the adjacent user after a certain distance cannot be influenced by the user, namely the adjacent user becomes unreliable, so that a reliable value is introduced, the scoring of the item by the adjacent user with the reliable value being larger than a certain value can be influenced, and the reliable value is calculated in the following way:
In the case of the formula 8 of the present invention,representing the score of video i with u, +.>Representing the score of user v for video i, +.>Representing a video set reviewed by user u, +.>Representing a video set reviewed by user v, +.>Represents the maximum value of the score,/->Indicating trust distance, i.e. presence between user u and user vIs->Represents the maximum distance allowed between two users, +.>Is a correction parameter, is a number greater than 0 and less than 1, and is a reliable nearest neighbor user
The reliable nearest-neighbor regular term is introduced to restrain the learning of the potential feature matrix of the user when the matrix is decomposed, and the reliable nearest-neighbor regular term restraining function is as follows:
As a preferred technical scheme of the invention: in the step S3, the user potential feature vector and the video potential feature vector are used as input of a multi-layer perceptron, wherein the deep neural network is composed of the multi-layer perceptron and a single-layer perceptron, the multi-layer perceptron comprises an input layer, a plurality of hidden layers allowing nonlinearity of the neural structure and an output layer, and the result of the multi-layer perceptron model is obtained through processing of the multi-layer perceptron by utilizing the nonlinearity characteristics of the hidden layers.
As a preferred technical scheme of the invention: in the step S4, in the deep neural network structure, the single-layer perceptron is a merging layer, and in the merging layer, the prediction result of the multi-layer perceptron model is combined with the result of the dual regularization matrix decomposition model, and the calculation formula is as follows:
In equation 11To activate the function +.>For the matrix weight set between the output layer and the merge layer,/for the matrix weight set between the output layer and the merge layer>For outputting the result of the layer->For user potential vector, ++>For video potential vector, ++>And optimizing the model by using a normalized cross entropy method for merging the bias terms of the layers, and finally obtaining the prediction scores.
The beneficial effects are that:
1. according to the method, the video association regular term and the reliable nearest neighbor regular term which are integrated with the user activity are introduced to restrain the learning of the potential feature matrix, the nonlinear structure of the deep neural network is utilized to relieve the limit of linear dot products in the matrix decomposition process, and the result of the deep neural network model is combined with the result of the double regularization matrix decomposition, so that the accuracy of video scoring prediction is improved.
2. The invention utilizes the LDA model to mine the relevant information in the user video comment, generates the user type potential feature matrix and the video type potential feature matrix, combines the user type potential feature matrix and the video type potential feature matrix to obtain the hidden information matrix, combines the hidden information matrix and the original user video scoring matrix to generate the new user-video scoring matrix, and alleviates the problems of cold start and data sparsity.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a diagram of a multi-layer perceptron of the present invention;
fig. 3 is a block diagram of a deep neural network of the present invention.
Detailed Description
The following describes the embodiments of the present invention in further detail with reference to the drawings.
The invention reconstructs a user-video scoring matrix based on a traditional matrix decomposition model, and introduces video association regular terms and reliable nearest neighbor regular terms which are integrated with the activity of the user to limit the learning of potential feature matrices. The nonlinear structure of the deep neural network is utilized, the limit of linear dot products in the matrix decomposition process is relieved, potential feature vectors decomposed by the matrix are used as input of the deep neural network, the result of the MLP model is obtained through multi-layer perceptron processing, the result of the MLP model is combined with the result of the double regularization matrix decomposition model in a single-layer perceptron layer, namely a merging layer, and the model is optimized through a normalized cross entropy method, so that the accuracy of scoring prediction is improved.
As shown in fig. 1.
The invention designs a video scoring prediction method based on a deep neural network and double regularization, which is used for improving the precision of scoring prediction of recommended videos and comprises the following steps of:
step S1: processing the video comments, mining hidden information, combining the hidden information matrix with the original user-video scoring matrix to generate a new user-video scoring matrix, and entering into step S2;
step S2: adding a double-regular term for restraining the learning of the potential feature matrix when decomposing the user-video scoring matrix, and entering into step S3;
step S3: taking the potential feature vectors decomposed by the matrix as the input of the multi-layer perceptron, processing the potential feature vectors by the multi-layer perceptron to obtain a multi-layer perceptron model prediction result, and entering into step S4;
step S4: and combining the result of the multi-layer perceptron model prediction with the result of matrix decomposition at the merging layer, and optimizing the model by using a normalized cross entropy method to obtain a final prediction score.
The method comprises the following specific steps:
the step S1 comprises the following steps: firstly, utilizing an LDA model to mine relevant type hidden information in a user video evaluation, generating a user type latent feature matrix LU and a video type latent feature matrix LV, combining the user type latent feature matrix and the video type latent feature matrix to obtain a hidden information matrix L, wherein the calculation formula is as followsAnd combining the hidden information matrix L with the original user-video scoring matrix R to reconstruct the user-video scoring matrix R, wherein the calculation formula is as follows:。
The step S2 comprises the following steps: performing matrix decomposition on the user-video scoring matrix, and decomposing the high-dimensional user-video scoring matrix into a low-dimensional user feature matrix and a video feature matrix, wherein the formula is as follows:
wherein U represents a user feature matrix, U i Representing the potential feature vector of user i, V represents the video feature matrix, V j Is a potential feature vector for video j. The low-dimensional matrix decomposition method approximately calculates a scoring matrix R by the product of d rank factors. The predictive score of user i for video j is expressed asThe square of the error between the predicted score and the original score is taken as a loss function, and the loss function is minimized to approach the scoring matrix R. The loss function is:
in the above-mentioned formula(s),is an indication function that indicates that 1 is equal if user i scores item j, and 0 is otherwise equal.And->Overfitting is prevented for two regular terms. Since each user can make a certain contribution to the video similarity, but each user contribution is not exactly the same, the user can be classified into active users and inactive users from the viewpoint of the activity of the user, the active users refer to users with a large number of scoring records on the video, and the inactive users refer to users with only a small number of scoring records on the video, so that the contribution of the active users and the inactive users should be distinguished when the video similarity is calculated, and the activity of the user can be defined as:
in the above-mentioned formula(s),representing the total amount of scoring for user u. Therefore, the video similarity calculation method obtained by combining the activity coefficient of the user with the modified cosine similarity comprises the following steps:
in the above-mentioned formula(s),representing the score of user u for video i, +.>Representing the score of user u for video j, +.>A score representing user u. Introducing a video association regularization term which is integrated with the user activity during matrix decomposition to restrict the learning of the project potential feature matrix, wherein a video association regularization constraint function formula which is integrated with the user activity at the moment is as follows:
wherein V represents a video feature matrix, V j Is the potential feature vector of video j, V i Is a potential feature vector for video i. Users of the same interest will interact and user similarity can be calculated using weighted pearson correlation coefficients:
in the above-mentioned formula(s),and->Represents the average score of users u and v, respectively, < >>The Jaccard correlation coefficient, which is a term capable of affecting the user similarity calculation, is calculated as follows:
wherein,,representing a video set reviewed by user u, +.>Representing the video collection reviewed by user v.
The scoring of the item by the user depends on the influence of the adjacent user, and the influence of the adjacent user by the adjacent user can be also possible, but the adjacent user after a certain distance cannot be influenced by the user, namely the adjacent user becomes unreliable, so that a reliable value is introduced, the scoring of the item by the adjacent user with the reliable value being larger than a certain value can be influenced, and the reliable value is calculated in the following way:
in the above-mentioned formula(s),representing the score of video i with u, +.>Representing the score of user v for video i, +.>Representing a video set reviewed by user u, +.>Representing a video set reviewed by user v, +.>Represents the maximum value of the score,/->Representing the trust distance, i.e. the number of users present between user u and user v>Represents the maximum distance allowed between two users, +.>Is a correction parameter, which is a number greater than 0 and less than 1. Reliable nearest neighbor user is
The reliable nearest-neighbor regular term is introduced to restrain the learning of the potential feature matrix of the user when the matrix is decomposed, and the reliable nearest-neighbor regular term restraining function is as follows:
Adding a video association regularization term fused with user liveness and a reliable nearest neighbor regularization term, wherein the final optimization loss function is as follows:
and searching an optimal solution by adopting a random gradient descent method, and finding out an optimal potential feature matrix.
As shown in fig. 2.
The step S3 comprises the following steps: latent feature vector U of user u And video potential feature vector V i As input to the multi-layer perceptron, as shown in FIG. 2, comprising an input layer L in Several hidden layers and output layers L allowing for neural structure nonlinearity out Input layer L in The output vector of (2) is:
the vector is output after the first hidden layer processing:
wherein the method comprises the steps ofIs the set of weights contained in the matrix between the input layer and the first layer hidden layer L1,/and>is the deviation of the L1 layer, < >>Is an activation function of
So conceal layer L k The output vector of (2) is:
wherein the method comprises the steps ofFor the activation function of neurons, +.>Is a weight matrix>Is the deviation. Output layer L of multi-layer perceptron out The output vector is:
as shown in fig. 3.
The step S4 includes: the single-layer perceptron is a merging layer in the deep neural network structure, wherein the deep neural network structure is shown in fig. 3, the multi-layer perceptron model prediction result is combined with the result of the double regularization matrix decomposition model in the merging layer, and the calculation formula is as follows:
wherein,,to activate the function +.>For the matrix weight set between the output layer and the merge layer,/for the matrix weight set between the output layer and the merge layer>For outputting the result of the layer->For user potential vector, ++>For video potential vector, ++>Bias term for merging layers
The proposed model is continuously optimized by adopting a normalized cross entropy method through the following cost functions:
in the above-mentioned formula(s),for merging layer neuron number, +.>For the predicted score, ++>For training example true score, +.>Representing the maximum value of the score. And continuously optimizing the model by using a gradient descent method to obtain a final prediction score by the cost function.
The embodiments of the present invention have been described in detail with reference to the drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the spirit of the present invention.
Claims (2)
1. The video scoring prediction method based on the deep neural network and the double regularization is characterized by comprising the following steps of:
step S1: processing the video comments, mining out hidden information, combining the hidden information matrix with the original user-video scoring matrix to generate a new user-video scoring matrix, and entering into step S2, wherein the step S2 comprises the following steps:
firstly, mining relevant type hidden information in a user video comment by using an LDA model, generating a user type potential feature matrix LU and a video type potential feature matrix LV, and combining the user type potential feature matrix and the video type potential feature matrix to obtain a hidden information matrix L, wherein the calculation formula is as follows:
and combining the hidden information matrix L with the original user-video scoring matrix R to generate a new user-video scoring matrixThe calculation formula is as follows:
step S2: the method comprises the steps that a dual regularization term is added to learn a potential feature matrix when a user-video scoring matrix is decomposed, each user can make a certain contribution to video similarity, the user contributions are not identical, the user can be divided into active users and inactive users from the consideration of the liveness of the users, the active users refer to users with a large number of scoring records on the video, the inactive users refer to users with scoring records on a small number of videos, so that the contributions of the active users and the inactive users are distinguished when the video similarity is calculated, and the liveness of the users is defined as follows:
in formula 1, |m (u) | represents the total score of user u, so the video similarity calculation method obtained by combining the activity coefficient of the user with the modified cosine similarity is as follows:
in formula 2, R ui Representing the score of user u on video i, R uj Representing the score of user u for video j,representing the average score for user U, which represents the set of users that have both video i and j scored; the learning of the potential feature matrix of the video association regularization term constraint project which is integrated with the user activity is introduced during matrix decomposition, and the video association regularization constraint function formula which is integrated with the user activity at the moment is as follows:
in equation 3, V represents the video feature matrix, V j Is the potential feature vector of video j, V i Is a potential feature vector of video i, and proceeds to step S3, which includes:
users of the same interest will interact and user similarity is calculated using weighted pearson correlation coefficients:
in the case of the formula 6 of the present invention,and->Representing the average score, r, of users u and v, respectively ui Representing the score of user u on video i, r vi Representing the score of user v on video I u Representing a video set reviewed by user u, I v Video collection representing user v commented, JCC uv The Jaccard correlation coefficient, which is a term capable of affecting the user similarity calculation, is calculated as follows:
wherein I is u Representing a video set reviewed by user u, I v Representing a video set reviewed by the user v;
the scoring of the item by the user depends on the influence of the adjacent user and can be influenced by the adjacent user of the adjacent user, but the adjacent user after a certain distance cannot be influenced by the user, namely becomes unreliable, so that a reliable value is introduced, the scoring of the item by the adjacent user with the reliable value larger than a certain value can be influenced, and the reliable value is calculated in the following way:
in formula 8, R ui Representing the score of video i with u, R vi Representing the score of user v on video I u Representing a video set reviewed by user u, I v Representing video collections reviewed by user v, R max Represents the maximum value of the score, d uv Representing the trust distance, i.e. the number of users present between user u and user v, d max Representing the maximum distance allowed between two users, ε being a correction parameter, a number greater than 0 and less than 1, may beThe nearest neighbor user is
Near={v∈U|sim uv Not less than gamma and C uv ∈delta } formula 9
The reliable nearest-neighbor regular term is introduced to restrain the learning of the potential feature matrix of the user when the matrix is decomposed, and the reliable nearest-neighbor regular term restraining function is as follows:
wherein U is u As a potential vector for user U, U v Potential feature vectors for user v;
step S3: taking the potential feature vectors decomposed by the matrix as the input of the multi-layer perceptron, processing the potential feature vectors by the multi-layer perceptron to obtain a multi-layer perceptron model prediction result, and entering into step S4;
step S4: combining the result of the multi-layer perceptron model prediction with the result of matrix decomposition at a merging layer, optimizing the model by using a normalized cross entropy method to obtain a final prediction score, wherein the method comprises the following steps:
in the deep neural network structure, a single-layer perceptron is a merging layer, a prediction result of a multi-layer perceptron model is combined with a result of a double regularization matrix decomposition model at the merging layer, and a calculation formula is as follows:
ρ in equation 11 m To activate the function, w m For the matrix weight set between the output layer and the merging layer, y out To output the result of the layer, U u For the user potential vector, V i A is a video potential vector m And optimizing the model by using a normalized cross entropy method for merging the bias terms of the layers, and finally obtaining the prediction scores.
2. The video scoring prediction method based on deep neural network and double regularization according to claim 1, wherein the method is characterized in that: in the step S3, the user potential feature vector and the video potential feature vector are used as input of a multi-layer perceptron, wherein the deep neural network is composed of the multi-layer perceptron and a single-layer perceptron, the multi-layer perceptron comprises an input layer, a plurality of hidden layers allowing nonlinearity of the neural structure and an output layer, and the result of the multi-layer perceptron model is obtained through processing of the multi-layer perceptron by utilizing the nonlinearity characteristics of the hidden layers.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310187456.2A CN115905617B (en) | 2023-03-02 | 2023-03-02 | Video scoring prediction method based on deep neural network and double regularization |
PCT/CN2023/135369 WO2024179089A1 (en) | 2023-03-02 | 2023-11-30 | Video score prediction method based on deep neural network and double regularization |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310187456.2A CN115905617B (en) | 2023-03-02 | 2023-03-02 | Video scoring prediction method based on deep neural network and double regularization |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115905617A CN115905617A (en) | 2023-04-04 |
CN115905617B true CN115905617B (en) | 2023-05-30 |
Family
ID=85744775
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310187456.2A Active CN115905617B (en) | 2023-03-02 | 2023-03-02 | Video scoring prediction method based on deep neural network and double regularization |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN115905617B (en) |
WO (1) | WO2024179089A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115905617B (en) * | 2023-03-02 | 2023-05-30 | 南京邮电大学 | Video scoring prediction method based on deep neural network and double regularization |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112668374A (en) * | 2019-10-16 | 2021-04-16 | 北京灵汐科技有限公司 | Image processing method and device, re-recognition network training method and electronic equipment |
CN111581333B (en) * | 2020-05-07 | 2023-05-26 | 重庆大学 | Text-CNN-based video and audio play list pushing method and video and audio play list pushing system |
CN114117191A (en) * | 2020-08-26 | 2022-03-01 | 付昳漫 | Context-aware recommendation method based on deep neural network |
CN112328900A (en) * | 2020-11-27 | 2021-02-05 | 北京工业大学 | Deep learning recommendation method integrating scoring matrix and comment text |
CN114625980A (en) * | 2020-12-13 | 2022-06-14 | 国网吉林省电力有限公司培训中心 | Dual-regularization collaborative filtering recommendation method |
CN113157957A (en) * | 2021-03-05 | 2021-07-23 | 北京工业大学 | Attribute graph document clustering method based on graph convolution neural network |
CN113807422B (en) * | 2021-09-07 | 2024-05-31 | 南京邮电大学 | Weighted graph convolutional neural network scoring prediction model integrating multi-feature information |
CN115905617B (en) * | 2023-03-02 | 2023-05-30 | 南京邮电大学 | Video scoring prediction method based on deep neural network and double regularization |
-
2023
- 2023-03-02 CN CN202310187456.2A patent/CN115905617B/en active Active
- 2023-11-30 WO PCT/CN2023/135369 patent/WO2024179089A1/en unknown
Also Published As
Publication number | Publication date |
---|---|
WO2024179089A1 (en) | 2024-09-06 |
CN115905617A (en) | 2023-04-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Pan et al. | A correlative denoising autoencoder to model social influence for top-N recommender system | |
Bansal et al. | Ask the gru: Multi-task learning for deep text recommendations | |
Ni et al. | An effective recommendation model based on deep representation learning | |
CN111079409B (en) | Emotion classification method utilizing context and aspect memory information | |
CN110309195B (en) | FWDL (full Width Domain analysis) model based content recommendation method | |
CN110866145B (en) | Co-preference-assisted deep single-class collaborative filtering recommendation method | |
CN111461157A (en) | Self-learning-based cross-modal Hash retrieval method | |
CN107507073A (en) | Based on the service recommendation method for trusting extension and the sequence study of list level | |
CN112800344B (en) | Deep neural network-based movie recommendation method | |
CN111104601B (en) | Antagonistic multi-feedback-level paired personalized ranking method | |
Chen et al. | A survey on heterogeneous one-class collaborative filtering | |
CN111241425A (en) | POI recommendation method based on hierarchical attention mechanism | |
CN116071128A (en) | Multitask recommendation method based on multi-behavioral feature extraction and self-supervision learning | |
CN115905617B (en) | Video scoring prediction method based on deep neural network and double regularization | |
Li et al. | Coltr: Semi-supervised learning to rank with co-training and over-parameterization for web search | |
Li et al. | Capsule neural tensor networks with multi-aspect information for Few-shot Knowledge Graph Completion | |
CN116662564A (en) | Service recommendation method based on depth matrix decomposition and knowledge graph | |
Lin et al. | Transfer learning for collaborative recommendation with biased and unbiased data | |
CN110889505A (en) | Cross-media comprehensive reasoning method and system for matching image-text sequences | |
Lian et al. | Financial product recommendation system based on transformer | |
Rauf et al. | BCE4ZSR: Bi-encoder empowered by teacher cross-encoder for zero-shot cold-start news recommendation | |
CN114841778B (en) | Commodity recommendation method based on dynamic graph neural network | |
CN117056609A (en) | Session recommendation method based on multi-layer aggregation enhanced contrast learning | |
CN117539999A (en) | Cross-modal joint coding-based multi-modal emotion analysis method | |
Kim et al. | COVID-19 outbreak prediction using Seq2Seq+ Attention and Word2Vec keyword time series data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |