CN115905617B

CN115905617B - Video scoring prediction method based on deep neural network and double regularization

Info

Publication number: CN115905617B
Application number: CN202310187456.2A
Authority: CN
Inventors: 赵学健; 张晶晶; 孙知信; 孙哲; 曹亚东; 宫婧; 汪胡青; 胡冰; 徐玉华
Original assignee: Nanjing University of Posts and Telecommunications
Current assignee: Nanjing University of Posts and Telecommunications
Priority date: 2023-03-02
Filing date: 2023-03-02
Publication date: 2023-05-30
Anticipated expiration: 2043-03-02
Also published as: CN115905617A; WO2024179089A1

Abstract

The present invention relates to a video scoring prediction method based on deep neural network and double regularization, which reconstructs the user-video scoring matrix, introduces video association regularization items incorporating user activity and reliable nearest neighbor regularization items, and constructs a fusion The matrix factorization recommendation model of the video association regularization item with user activity and the reliable nearest neighbor regularization item, the latent features are input into the deep neural network to obtain the result of the deep neural network model, and the result of the deep neural network model is compared with the matrix factorization Combining the structure to get the final prediction score improves the accuracy of the prediction score; and using the LDA model to mine relevant information in user video comments, generate user type latent feature matrix and video type latent feature matrix, and combine the two to get the hidden information matrix , and then combine the hidden information matrix with the original user video rating matrix to generate a new user-video rating matrix, which alleviates the cold start and data sparsity problems.

Description

A video rating prediction method based on deep neural network and double regularization

技术领域Technical Field

本发明涉及一种基于深度神经网络与双正则化的视频评分预测方法，属于评分预测领域。The invention relates to a video rating prediction method based on a deep neural network and double regularization, and belongs to the field of rating prediction.

背景技术Background Art

随着互联网技术的快速发展，各个网络平台中视频资源越来越多，为用户提供丰富的视频资源，为用户提供更多的选择时，也给我们带来了麻烦和困扰，庞大的视频资源不仅增加了我们寻找自己喜好视频的难度，而且使得寻找视频的过程变得十分耗费时间。为了解决信息过载问题，个性化推荐系统成为解决这一个问题的有效工具。评分预测又是推荐算法的重要组成部分。现有的推荐算法主要有三大类：基于协同过滤的推荐算法、基于内容的推荐算法和混合推荐算法。目前使用最多的是基于协同过滤的推荐算法，而协同过滤推荐算法中使用最多的是基于模型的协同过滤推荐算法，基于模型的协同过滤推荐算法中比较常见的几种算法包括：矩阵分解模型、奇异值分解、聚类分析等。但是现存的协同过滤推荐算法存在数据稀疏及冷启动等问题，导致对推荐的视频资源评分预测不准确，从而影响个性化推荐的结果，如何提高对视频资源的预测评分的准确性，进一步提高推荐精度成为当前研究的热点之一。With the rapid development of Internet technology, there are more and more video resources in various network platforms. While providing users with rich video resources and more choices, it also brings us troubles and troubles. The huge video resources not only increase the difficulty of finding videos we like, but also make the process of finding videos very time-consuming. In order to solve the problem of information overload, personalized recommendation system has become an effective tool to solve this problem. Rating prediction is an important part of recommendation algorithm. There are three main categories of existing recommendation algorithms: recommendation algorithm based on collaborative filtering, recommendation algorithm based on content and hybrid recommendation algorithm. At present, the most used recommendation algorithm is based on collaborative filtering, and the most used in collaborative filtering recommendation algorithm is model-based collaborative filtering recommendation algorithm. The more common algorithms in model-based collaborative filtering recommendation algorithm include: matrix decomposition model, singular value decomposition, cluster analysis, etc. However, the existing collaborative filtering recommendation algorithm has problems such as data sparsity and cold start, which leads to inaccurate prediction of the recommended video resource rating, thus affecting the results of personalized recommendation. How to improve the accuracy of the predicted rating of video resources and further improve the recommendation accuracy has become one of the current research hotspots.

发明内容Summary of the invention

本发明的目的在于针对上述现有技术的不足，提供了一种基于深度神经网络与双正则化的视频评分预测方法，重构用户-视频评分矩阵，在矩阵分解时引入融入用户活跃度的视频关联正则项和可靠的最近邻正则项来约束潜在特征矩阵的学习，并引入深度神经网络，利用深度神经网络的非线性特征减轻矩阵分解过程中的线性点积的限制，将深度神经网络模型的结果与双正则化矩阵分解的结果相结合，提高视频评分预测的精度。The purpose of the present invention is to address the deficiencies of the above-mentioned prior art and provide a video rating prediction method based on deep neural network and double regularization, reconstruct the user-video rating matrix, introduce video association regularization terms that incorporate user activity and reliable nearest neighbor regularization terms during matrix decomposition to constrain the learning of the latent feature matrix, introduce deep neural network, utilize the nonlinear characteristics of deep neural network to alleviate the limitation of linear dot product in the matrix decomposition process, combine the results of deep neural network model with the results of double regularization matrix decomposition, and improve the accuracy of video rating prediction.

本发明采用的技术方案如下：一种基于深度神经网络与双正则化的视频评分预测方法，用于提高对推荐视频评分预测的精度，具体包括如下步骤：The technical solution adopted by the present invention is as follows: a video rating prediction method based on deep neural network and double regularization, which is used to improve the accuracy of predicting the rating of recommended videos, and specifically includes the following steps:

步骤S1：对视频评论进行处理，挖掘出隐藏信息，并将隐藏信息矩阵与原始用户-视频评分矩阵相结合生成新的用户-视频评分矩阵，并进入步骤S2；Step S1: Process the video comments, mine hidden information, and combine the hidden information matrix with the original user-video rating matrix to generate a new user-video rating matrix, and then proceed to step S2;

步骤S2：对用户-视频评分矩阵分解时加入双正则项约束潜在特征矩阵的学习，每个用户对视频的评分都会对视频相似度做出一定的贡献，用户贡献却不完全相同，从用户的活跃度考虑能够分为活跃用户和不活跃用户，活跃用户指的是对视频有大量的评分记录的用户，而不活跃用户是指只对少数视频进行评分记录的用户，所以在计算视频相似度时应将活跃用户和不活跃用户的贡献区分开，用户的活跃度定义为：Step S2: When decomposing the user-video rating matrix, double regularization terms are added to constrain the learning of the latent feature matrix. Each user's rating of a video will make a certain contribution to the video similarity, but the user contributions are not exactly the same. From the perspective of user activity, they can be divided into active users and inactive users. Active users refer to users who have a large number of rating records for videos, while inactive users refer to users who have only rated a few videos. Therefore, when calculating video similarity, the contributions of active users and inactive users should be distinguished. The user's activity is defined as:

公式1

Formula 1

在公式1中，

表示用户u的评分总量，因此用户的活跃度系数结合修正的余弦相似度得到的视频相似度计算方法为：In formula 1,

represents the total number of ratings of user u, so the video similarity calculation method obtained by combining the user's activity coefficient with the modified cosine similarity is:

公式2

Formula 2

在公式2中，

表示用户u对视频i的评分，

表示用户u对视频j的评分，

表示用户u的评分评分，

表示同时对视频i和j有过评分的用户集合；在矩阵分解时引入融入用户活跃度的视频关联正则项约束项目潜在特征矩阵的学习，此时融入用户活跃度的视频关联正则化约束函数公式为：In formula 2,

represents the rating of video i by user u,

represents the rating of video j by user u,

represents the rating of user u,

represents the set of users who have rated videos i and j at the same time; when decomposing the matrix, the video association regularization term that incorporates user activity is introduced to constrain the learning of the project's latent feature matrix. At this time, the formula of the video association regularization constraint function that incorporates user activity is:

公式3

Formula 3

在公式3中，V表示视频特征矩阵，V_j是视频j的潜在特征向量，V_i是视频i的潜在特征向量，并进入步骤S3；In Formula 3, V represents the video feature matrix, _Vj is the potential feature vector of video j, _Vi is the potential feature vector of video i, and the process proceeds to step S3;

步骤S3：将矩阵分解出的潜在特征向量作为多层感知机的输入，经过多层感知机处理得到多层感知机模型预测的结果，并进入步骤S4；Step S3: The potential feature vector decomposed from the matrix is used as the input of the multi-layer perceptron, and the result predicted by the multi-layer perceptron model is obtained after being processed by the multi-layer perceptron, and then the process goes to step S4;

步骤S4；在合并层将多层感知机模型预测的结果与矩阵分解的结果相结合，使用归一化交叉熵法来优化模型，得到最终预测得分。Step S4: Combine the prediction results of the multilayer perceptron model with the results of matrix decomposition at the merging layer, use the normalized cross entropy method to optimize the model, and obtain the final prediction score.

作为本发明的一种优选技术方案：所述步骤S1中，首先利用LDA模型挖掘用户视频评论中相关类型的隐藏信息，生成用户类型潜在特征矩阵LU和视频类型潜在特征矩阵LV，并将用户类型潜在特征矩阵与视频类型潜在特征矩阵相结合得As a preferred technical solution of the present invention: in the step S1, the LDA model is first used to mine the hidden information of related types in the user video comments, generate the user type potential feature matrix LU and the video type potential feature matrix LV, and combine the user type potential feature matrix with the video type potential feature matrix to obtain

到隐藏信息矩阵L，其计算公式为：To the hidden information matrix L, its calculation formula is:

公式4

Formula 4

并将隐藏信息矩阵L与原始用户-视频评分矩阵R相结合生成新的用户-视频评分矩阵

，其计算公式为：The hidden information matrix L is combined with the original user-video rating matrix R to generate a new user-video rating matrix

, and its calculation formula is:

公式5

Formula 5

作为本发明的一种优选技术方案：所述步骤S2中，相同兴趣爱好的用户会互相影响，用户相似度可以使用加权的皮尔逊相关系数进行计算：As a preferred technical solution of the present invention: in step S2, users with the same interests and hobbies will influence each other, and the user similarity can be calculated using the weighted Pearson correlation coefficient:

公式6

Formula 6

在公式6中，

和

分别代表用户u和v的平均评分，

表示用户u对视频i的评分，

表示用户v对视频i的评分，

表示用户u评论过的视频集合，

表示用户v评论过的视频集合，

为权重即能够影响用户相似度计算的项目的Jaccard相关系数，其计算公式如下：In Formula 6,

and

Represent the average ratings of users u and v respectively,

represents the rating of video i by user u,

represents the rating of user v on video i,

represents the set of videos commented by user u,

represents the set of videos commented by user v,

The Jaccard correlation coefficient of the item that can affect the calculation of user similarity is the weight, and its calculation formula is as follows:

公式7

Formula 7

其中，

表示用户u评论过的视频集合，

表示用户v评论过的视频集合；in,

represents the set of videos commented by user u,

represents the set of videos commented by user v;

用户对项目的评分取决于邻近用户的影响，也可能受到邻近用户的邻近用户的影响，但是一定距离后的邻近用户不会在对用户有所影响即变得不在可靠，因此引进可靠值，可靠值大于一定值的邻近用户会对用户项目评分有影响，可靠值计算方式为：The user's rating of an item depends on the influence of neighboring users, and may also be affected by the neighboring users of the neighboring users. However, neighboring users beyond a certain distance will no longer have an impact on the user and become unreliable. Therefore, a reliability value is introduced. Neighboring users with a reliability value greater than a certain value will have an impact on the user's item rating. The reliability value is calculated as follows:

公式8

Formula 8

在公式8中，

表示用u对视频i的评分，

表示用户v对视频i的评分，

表示用户u评论过的视频集合，

表示用户v评论过的视频集合，

表示评分的最大值，

表示信任距离即用户u和用户v之间存在的用户人数，

表示两个用户之间允许的最大距离，

是为修正参数，是一个大于0小于1的数，可靠的最近邻用户为In formula 8,

represents the rating of video i by u,

represents the rating of user v on video i,

represents the set of videos commented by user u,

represents the set of videos commented by user v,

represents the maximum value of the score,

represents the trust distance, that is, the number of users between user u and user v.

Indicates the maximum distance allowed between two users.

is a correction parameter, which is a number greater than 0 and less than 1. The reliable nearest neighbor user is

公式9

Formula 9

在矩阵分解时引入可靠的最近邻正则项来约束用户潜在特征矩阵的学习，可靠的最近邻正则项约束函数为：During matrix decomposition, a reliable nearest neighbor regularization term is introduced to constrain the learning of the user's potential feature matrix. The constraint function of the reliable nearest neighbor regularization term is:

。公式10

Formula 10

其中，

为用户u的潜在向量，

为用户v的潜在特征向量。in,

is the potential vector of user u,

is the potential feature vector of user v.

作为本发明的一种优选技术方案：所述步骤S3中，将用户潜在特征向量与视频潜在特征向量作为多层感知机的输入，其中该深度神经网络由多层感知机、单层感知机组成，其中多层感知机包括输入层、若干个允许神经结构非线性的隐藏层和输出层，利用隐藏层的非线性特征，通过多层感知机处理得到多层感知机模型的结果。As a preferred technical solution of the present invention: in the step S3, the user potential feature vector and the video potential feature vector are used as inputs of a multi-layer perceptron, wherein the deep neural network is composed of a multi-layer perceptron and a single-layer perceptron, wherein the multi-layer perceptron includes an input layer, several hidden layers that allow the nonlinearity of the neural structure, and an output layer, and the result of the multi-layer perceptron model is obtained by processing the multi-layer perceptron using the nonlinear characteristics of the hidden layer.

作为本发明的一种优选技术方案：所述步骤S4中，在该深度神经网络结构中单层感知机为合并层，在合并层将多层感知机模型的预测结果与双正则化矩阵分解模型的结果相结合，其计算公式为：As a preferred technical solution of the present invention: in the step S4, the single-layer perceptron in the deep neural network structure is a merging layer, and the prediction results of the multi-layer perceptron model are combined with the results of the double regularization matrix decomposition model in the merging layer, and the calculation formula is:

公式11

Formula 11

在公式11中

为激活函数，

为输出层与合并层之间的矩阵权重集，

为输出层的结果，

为用户潜在向量，

为视频潜在向量，

为合并层的偏差项，使用归一化交叉熵法来优化模型，最终得到预测评分。In formula 11

is the activation function,

is the matrix weight set between the output layer and the merging layer,

is the result of the output layer,

is the user latent vector,

is the video latent vector,

The normalized cross entropy method is used to optimize the model for the bias term of the merging layer, and finally the predicted score is obtained.

有益效果：Beneficial effects:

1. 本发明引入融入用户活跃度的视频关联正则项和可靠的最近邻正则项来约束潜在特征矩阵的学习，利用深度神经网络的非线性结构来减轻矩阵分解过程中线性点积的限制，并将深度神经网络模型的结果与双正则化矩阵分解的结果相结合，提高了视频评分预测的精度。1. The present invention introduces a video association regularization term that incorporates user activity and a reliable nearest neighbor regularization term to constrain the learning of the latent feature matrix, uses the nonlinear structure of the deep neural network to alleviate the limitation of the linear dot product in the matrix decomposition process, and combines the results of the deep neural network model with the results of the double regularization matrix decomposition, thereby improving the accuracy of video score prediction.

2. 本发明利用LDA模型挖掘用户视频评论中相关信息，生成用户类型潜在特征矩阵和视频类型潜在特征矩阵，并将用户类型潜在特征矩阵与视频类型潜在特征矩阵相结合得到隐藏的信息矩阵，并将隐藏的信息矩阵与原始用户视频评分矩阵相结合生成新的用户-视频评分矩阵，缓解了冷启动和数据稀疏性问题。2. The present invention utilizes the LDA model to mine relevant information in user video comments, generates a user type latent feature matrix and a video type latent feature matrix, combines the user type latent feature matrix with the video type latent feature matrix to obtain a hidden information matrix, and combines the hidden information matrix with the original user video rating matrix to generate a new user-video rating matrix, thereby alleviating the cold start and data sparsity problems.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1是本发明的方法流程图；Fig. 1 is a flow chart of the method of the present invention;

图2是本发明的多层感知机结构图；FIG2 is a diagram of the structure of a multi-layer perceptron of the present invention;

图3是本发明的深度神经网络结构图。FIG3 is a structural diagram of a deep neural network of the present invention.

具体实施方式DETAILED DESCRIPTION

下面结合说明书附图对本发明的具体实施方式作进一步详细的说明。The specific implementation modes of the present invention will be further described in detail below in conjunction with the accompanying drawings.

本发明是在传统的矩阵分解模型的基础上，重构用户-视频评分矩阵，引入融入用户活跃度的视频关联正则项和可靠的最近邻正则项用来限制潜在特征矩阵的学习。利用深度神经网络的非线性结构，缓解了矩阵分解过程中线性点积的限制，将矩阵分解出的潜在特征向量作为深度神经网络的的输入，经过多层感知机处理得到MLP模型的结果，并在单层感知机层即合并层将MLP模型结果与双正则化矩阵分解模型的结果相结合，通过归一化交叉熵法优化模型，从而提高评分预测的精度。The present invention reconstructs the user-video rating matrix on the basis of the traditional matrix decomposition model, introduces the video association regularization term integrating the user activity and the reliable nearest neighbor regularization term to limit the learning of the potential feature matrix. The nonlinear structure of the deep neural network is utilized to alleviate the limitation of the linear dot product in the matrix decomposition process, and the potential feature vector decomposed by the matrix is used as the input of the deep neural network. The result of the MLP model is obtained through multi-layer perceptron processing, and the MLP model result is combined with the result of the double regularization matrix decomposition model in the single-layer perceptron layer, i.e., the merging layer, and the model is optimized by the normalized cross entropy method, thereby improving the accuracy of the rating prediction.

如图1所示。As shown in Figure 1.

本发明设计了一种基于深度神经网络与双正则化的视频评分预测方法，用于提高对推荐视频评分预测的精度，包括如下步骤：The present invention designs a video rating prediction method based on deep neural network and double regularization to improve the accuracy of recommended video rating prediction, comprising the following steps:

步骤S2：对用户-视频评分矩阵分解时加入双正则项用来约束潜在特征矩阵的学习，并进入步骤S3；Step S2: When decomposing the user-video rating matrix, a double regularization term is added to constrain the learning of the latent feature matrix, and then proceed to step S3;

步骤S4：在合并层将多层感知机模型预测的结果与矩阵分解的结果相结合，使用归一化交叉熵法来优化模型，得到最终预测得分。Step S4: In the merging layer, the predicted results of the multilayer perceptron model are combined with the results of matrix decomposition, and the normalized cross entropy method is used to optimize the model to obtain the final prediction score.

具体步骤如下：The specific steps are as follows:

步骤S1包括：首先利用LDA模型挖掘用户视频评论中相关类型的隐藏信息，生成用户类型潜在特征矩阵LU和视频类型潜在特征矩阵LV，并将用户类型潜在特征矩阵与视频类型潜在特征矩阵相结合得到隐藏信息矩阵L，其计算公式为

，并将隐藏信息矩阵L与原始用户-视频评分矩阵R相结合重构用户-视频评分矩阵R，其计算公式为：

。Step S1 includes: firstly, using the LDA model to mine the hidden information of relevant types in user video comments, generating a user type potential feature matrix LU and a video type potential feature matrix LV, and combining the user type potential feature matrix with the video type potential feature matrix to obtain a hidden information matrix L, whose calculation formula is:

, and combine the hidden information matrix L with the original user-video rating matrix R to reconstruct the user-video rating matrix R, whose calculation formula is:

.

步骤S2包括：将用户-视频评分矩阵进行矩阵分解，将高维用户-视频评分矩阵分解为低维用户特征矩阵和视频特征矩阵，其公式为：Step S2 includes: performing matrix decomposition on the user-video rating matrix, decomposing the high-dimensional user-video rating matrix into a low-dimensional user feature matrix and a video feature matrix, and the formula is:

其中U表示用户特征矩阵，U_i表示用户i的潜在特征向量，V表示视频特征矩阵，V_j是视频j的潜在特征向量。低维矩阵分解方法通过d秩因子的乘积近似计算评分矩阵R。用户i对视频j的预测评分表示为

，将预测评分与原始评分之间的误差平方作为损失函数，最小化损失函数来逼近评分矩阵R。损失函数为：Where U represents the user feature matrix, _Ui represents the potential feature vector of user i, V represents the video feature matrix, and _Vj is the potential feature vector of video j. The low-dimensional matrix decomposition method approximates the rating matrix R by multiplying the d-rank factors. The predicted rating of user i for video j is expressed as

, the square error between the predicted score and the original score is used as the loss function, and the loss function is minimized to approximate the score matrix R. The loss function is:

，

,

在上述公式中，

是指示函数，表示如果用户i对项目j进行评分则等于1，否则等于0。

和

为两个正则项，防止过拟合。由于每个用户对视频的评分都会对视频相似度做出一定的贡献，但是每个用户贡献却不完全相同，从用户的活跃度考虑可以分为活跃用户和不活跃用户，活跃用户指的是对视频有大量的评分记录的用户，而不活跃用户是指只对少数视频进行评分记录的用户，所以在计算视频相似度时应将活跃用户和不活跃用户的贡献区分开，用户的活跃度可以定义为：In the above formula,

is an indicator function, which equals 1 if user i rates item j, and 0 otherwise.

and

are two regularization terms to prevent overfitting. Since each user's rating of a video will make a certain contribution to the video similarity, but each user's contribution is not exactly the same, users can be divided into active users and inactive users based on their activity. Active users refer to users who have a large number of rating records for videos, while inactive users refer to users who have only rated a few videos. Therefore, when calculating video similarity, the contributions of active users and inactive users should be distinguished. The activity of users can be defined as:

在上述公式中，

表示用户u的评分总量。因此用户的活跃度系数结合修正的余弦相似度得到的视频相似度计算方法为：In the above formula,

Represents the total number of ratings of user u. Therefore, the video similarity calculation method obtained by combining the user's activity coefficient with the modified cosine similarity is:

在上述公式中，

表示用户u对视频i的评分，

表示用户u对视频j的评分，

表示用户u的评分评分。在矩阵分解时引入融入用户活跃度的视频关联正则项用来约束项目潜在特征矩阵的学习，此时融入用户活跃度的视频关联正则化约束函数公式为：In the above formula,

represents the rating of video i by user u,

represents the rating of video j by user u,

Represents the rating of user u. When decomposing the matrix, the video association regularization term that incorporates user activity is introduced to constrain the learning of the project's latent feature matrix. At this time, the video association regularization constraint function formula that incorporates user activity is:

其中，V表示视频特征矩阵，V_j是视频j的潜在特征向量，V_i是视频i的潜在特征向量。相同兴趣爱好的用户会互相影响，用户相似度可以使用加权的皮尔逊相关系数进行计算：Where V represents the video feature matrix, _Vj is the potential feature vector of video j, and _Vi is the potential feature vector of video i. Users with the same interests and hobbies will influence each other, and user similarity can be calculated using the weighted Pearson correlation coefficient:

在上述公式中，

和

分别代表用户u和v的平均评分，

为权重即能够影响用户相似度计算的项目的Jaccard相关系数，其计算公式如下：In the above formula,

and

Represent the average ratings of users u and v respectively,

其中，

表示用户u评论过的视频集合，

表示用户v评论过的视频集合。in,

represents the set of videos commented by user u,

Represents the set of videos commented by user v.

，

,

在上述公式中，

表示用u对视频i的评分，

表示用户v对视频i的评分，

表示用户u评论过的视频集合，

表示用户v评论过的视频集合，

表示评分的最大值，

表示信任距离即用户u和用户v之间存在的用户人数，

表示两个用户之间允许的最大距离，

是为修正参数，是一个大于0小于1的数，。可靠的最近邻用户为In the above formula,

represents the rating of video i by u,

represents the rating of user v on video i,

represents the set of videos commented by user u,

represents the set of videos commented by user v,

represents the maximum value of the score,

Indicates the maximum distance allowed between two users.

，

,

其中，

为用户u的潜在向量，

为用户v的潜在特征向量。in,

is the potential vector of user u,

is the potential feature vector of user v.

加入融有用户活跃度的视频关联正则化项和可靠的最近邻正则项，最后的优化损失函数为：By adding the video association regularization term that incorporates user activity and the reliable nearest neighbor regularization term, the final optimization loss function is:

采用随机梯度下降法来寻找最优解，找出最优潜在特征矩阵。The stochastic gradient descent method is used to find the optimal solution and find the optimal latent feature matrix.

如图2所示。As shown in Figure 2.

步骤S3包括：将用户潜在特征向量U_u和视频潜在特征向量V_i作为多层感知机的输入，其中多层感知机如图2所示，包括输入层L_in、若干个允许神经结构非线性的隐藏层和输出层L_out，输入层L_in的输出向量为:Step S3 includes: taking the user potential feature vector U _u and the video potential feature vector V _i as the input of a multilayer perceptron, wherein the multilayer perceptron is shown in FIG2 , including an input layer L _in , a plurality of hidden layers that allow the nonlinearity of the neural structure, and an output layer L _out , and the output vector of the input layer L _in is:

经过第一个隐藏层处理后输出向量：The output vector after processing by the first hidden layer is:

其中

是包含在输入层和第一层隐藏层L1之间的矩阵中的权重集合，

是L1层的偏差，

是激活函数，激活函数为in

is the set of weights contained in the matrix between the input layer and the first hidden layer L1,

is the bias of the L1 layer,

is the activation function, the activation function is

所以隐藏层L_k的输出向量为：So the output vector of the hidden layer L _k is:

其中

为神经元的激活函数，

为权重矩阵，

为偏差。多层感知机输出层L_out输出向量为:in

is the neuron activation function,

is the weight matrix,

is the deviation. The output vector of the multi-layer perceptron output layer L _out is:

如图3所示。As shown in Figure 3.

步骤S4包括：在该深度神经网络结构中单层感知机为合并层，其中该深度神经网络结构如图3所示，在合并层将多层感知机模型预测结果与双正则化矩阵分解模型的结果相结合，其计算公式为：Step S4 includes: in the deep neural network structure, the single-layer perceptron is a merging layer, wherein the deep neural network structure is shown in FIG3 , and the prediction result of the multi-layer perceptron model is combined with the result of the double regularization matrix decomposition model in the merging layer, and the calculation formula is:

其中，

为激活函数，

为输出层与合并层之间的矩阵权重集，

为输出层的结果，

为用户潜在向量，

为视频潜在向量，

为合并层的偏差项in,

is the activation function,

is the matrix weight set between the output layer and the merging layer,

is the result of the output layer,

is the user latent vector,

is the video latent vector,

is the bias term of the merging layer

采用归一化交叉熵法通过以下代价函数不断优化所提的出的模型，其代价函数为:The proposed model is continuously optimized by the normalized cross entropy method through the following cost function, and the cost function is:

在上述公式中，

为合并层神经元数量，

为预测出的分数，

为训练实例真实分数，

表示评分的最大值。代价函数使用梯度下降方法不断优化模型，得到最终的预测评分。In the above formula,

is the number of neurons in the merging layer,

is the predicted score,

is the true score of the training instance,

Represents the maximum value of the score. The cost function uses the gradient descent method to continuously optimize the model and obtain the final predicted score.

上面结合附图对本发明的实施方式作了详细说明，但是本发明并不限于上述实施方式，在本领域普通技术人员所具备的知识范围内，还可以在不脱离本发明宗旨的前提下做出各种变化。The embodiments of the present invention are described in detail above in conjunction with the accompanying drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge scope of ordinary technicians in this field without departing from the purpose of the present invention.

Claims

1. A video rating prediction method based on deep neural network and double regularization, characterized in that the method comprises the following steps:

Step S1: Process the video comments to mine hidden information, and combine the hidden information matrix with the original user-video rating matrix to generate a new user-video rating matrix, and then proceed to step S2, including:

First, the LDA model is used to mine the hidden information of relevant types in user video comments, generate the user type potential feature matrix LU and the video type potential feature matrix LV, and combine the user type potential feature matrix with the video type potential feature matrix to obtain the hidden information matrix L, whose calculation formula is:

The hidden information matrix L is combined with the original user-video rating matrix R to generate a new user-video rating matrix

The calculation formula is:

Step S2: When decomposing the user-video rating matrix, double regularization terms are added to constrain the learning of the latent feature matrix. Each user's rating of a video will make a certain contribution to the video similarity, but the user contributions are not exactly the same. From the perspective of user activity, they can be divided into active users and inactive users. Active users refer to users who have a large number of rating records for videos, while inactive users refer to users who have only rated a few videos. Therefore, when calculating video similarity, the contributions of active users and inactive users should be distinguished. The user's activity is defined as:

In formula 1, |M(u)| represents the total number of ratings of user u, so the video similarity calculation method obtained by combining the user's activity coefficient with the modified cosine similarity is:

In Formula 2, _Rui represents the rating of video i given by user u, and _Ruj represents the rating of video j given by user u.

represents the average rating of user u, and U represents the set of users who have rated videos i and j at the same time. When decomposing the matrix, the video association regularization term that incorporates user activity is introduced to constrain the learning of the project's latent feature matrix. At this time, the formula of the video association regularization constraint function that incorporates user activity is:

In Formula 3, V represents the video feature matrix, _Vj is the potential feature vector of video j, _Vi is the potential feature vector of video i, and the process proceeds to step S3, including:

Users with the same interests and hobbies will influence each other, and user similarity is calculated using the weighted Pearson correlation coefficient:

In Formula 6,

and

Represent the average ratings of users u and v respectively, r _ui represents the rating of video i by user u, r _vi represents the rating of video i by user v, I _u represents the set of videos commented on by user u, I _v represents the set of videos commented on by user v, JCC _uv is the weight, that is, the Jaccard correlation coefficient of the items that can affect the calculation of user similarity, and its calculation formula is as follows:

Where _Iu represents the set of videos commented on by user u, and _Iv represents the set of videos commented on by user v;

The user's rating of the project depends on the influence of neighboring users, and is also affected by the neighboring users of the neighboring users. However, the neighboring users after a certain distance will no longer have an impact on the user, that is, they become unreliable. Therefore, a reliability value is introduced. Neighboring users with a reliability value greater than a certain value will have an impact on the user's project rating. The reliability value is calculated as follows:

In Formula 8, _Rui represents the score of video i given by user u, _Rvi represents the score of video i given by user v, _Iu represents the set of videos commented on by user u, _Iv represents the set of videos commented on by user v, _Rmax represents the maximum score, _duv represents the trust distance, i.e., the number of users between user u and user v, _dmax represents the maximum distance allowed between two users, ε is a correction parameter, which is a number greater than 0 and less than 1, and the reliable nearest neighbor user is

Near＝{v∈U|sim _uv ≥γ and C _uv ≥δ} Formula 9

During matrix decomposition, a reliable nearest neighbor regularization term is introduced to constrain the learning of the user's potential feature matrix. The constraint function of the reliable nearest neighbor regularization term is:

Among them, U _u is the potential vector of user u, and U _v is the potential feature vector of user v;

Step S3: The potential feature vector decomposed from the matrix is used as the input of the multi-layer perceptron, and the result predicted by the multi-layer perceptron model is obtained after being processed by the multi-layer perceptron, and then the process goes to step S4;

Step S4: Combine the prediction results of the multi-layer perceptron model with the results of matrix decomposition at the merging layer, use the normalized cross entropy method to optimize the model, and obtain the final prediction score, including:

In this deep neural network structure, the single-layer perceptron is a merging layer, in which the prediction results of the multi-layer perceptron model are combined with the results of the double regularization matrix decomposition model. The calculation formula is:

In Formula 11, ρ _m is the activation function, w _m is the matrix weight set between the output layer and the merging layer, y _out is the result of the output layer, U _u is the user latent vector, V _i is the video latent vector, and a _m is the bias term of the merging layer. The normalized cross entropy method is used to optimize the model and finally obtain the predicted score.

2. According to claim 1, a video rating prediction method based on deep neural network and double regularization is characterized in that: in the step S3, the user potential feature vector and the video potential feature vector are used as inputs of a multilayer perceptron, wherein the deep neural network is composed of a multilayer perceptron and a single-layer perceptron, wherein the multilayer perceptron includes an input layer, several hidden layers that allow the nonlinearity of the neural structure, and an output layer, and the nonlinear characteristics of the hidden layer are utilized to obtain the result of the multilayer perceptron model through multilayer perceptron processing.