CN104598720B - Cmp time setting method based on cluster and multi-task learning - Google Patents
Cmp time setting method based on cluster and multi-task learning Download PDFInfo
- Publication number
- CN104598720B CN104598720B CN201410805040.3A CN201410805040A CN104598720B CN 104598720 B CN104598720 B CN 104598720B CN 201410805040 A CN201410805040 A CN 201410805040A CN 104598720 B CN104598720 B CN 104598720B
- Authority
- CN
- China
- Prior art keywords
- msub
- mrow
- mfrac
- msubsup
- mtd
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 66
- 239000000126 substance Substances 0.000 claims abstract description 19
- 238000000605 extraction Methods 0.000 claims abstract description 12
- 239000013598 vector Substances 0.000 claims description 44
- 239000011159 matrix material Substances 0.000 claims description 34
- 238000009826 distribution Methods 0.000 claims description 15
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 claims description 11
- 238000012360 testing method Methods 0.000 claims description 10
- 238000004364 calculation method Methods 0.000 claims description 5
- 239000000463 material Substances 0.000 claims description 5
- 238000013499 data model Methods 0.000 claims description 4
- 238000012804 iterative process Methods 0.000 claims description 4
- 239000000203 mixture Substances 0.000 claims description 4
- 238000007476 Maximum Likelihood Methods 0.000 claims description 3
- 238000009825 accumulation Methods 0.000 claims description 2
- 238000005315 distribution function Methods 0.000 claims description 2
- 230000005540 biological transmission Effects 0.000 claims 1
- 230000001186 cumulative effect Effects 0.000 claims 1
- 238000003801 milling Methods 0.000 claims 1
- 238000002360 preparation method Methods 0.000 claims 1
- 238000004519 manufacturing process Methods 0.000 abstract description 28
- 238000005516 engineering process Methods 0.000 abstract description 2
- 238000004422 calculation algorithm Methods 0.000 description 16
- 239000000523 sample Substances 0.000 description 9
- 238000012549 training Methods 0.000 description 7
- 238000005457 optimization Methods 0.000 description 5
- 238000012706 support-vector machine Methods 0.000 description 5
- 238000012545 processing Methods 0.000 description 4
- 230000002159 abnormal effect Effects 0.000 description 3
- 238000002790 cross-validation Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000004377 microelectronic Methods 0.000 description 3
- 238000005498 polishing Methods 0.000 description 2
- 238000004088 simulation Methods 0.000 description 2
- RNAMYOYQYRYFQY-UHFFFAOYSA-N 2-(4,4-difluoropiperidin-1-yl)-6-methoxy-n-(1-propan-2-ylpiperidin-4-yl)-7-(3-pyrrolidin-1-ylpropoxy)quinazolin-4-amine Chemical compound N1=C(N2CCC(F)(F)CC2)N=C2C=C(OCCCN3CCCC3)C(OC)=CC2=C1NC1CCN(C(C)C)CC1 RNAMYOYQYRYFQY-UHFFFAOYSA-N 0.000 description 1
- 241000282414 Homo sapiens Species 0.000 description 1
- 239000003082 abrasive agent Substances 0.000 description 1
- 238000010923 batch production Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000007517 polishing process Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
Abstract
基于聚类和多任务学习的化学机械研磨时间设定方法,属于自动控制、信息技术和先进制造领域,其特征在于,为实现多品种混合下微电子制造过程化学机械研磨时间优化设定,本方法将工艺指标及产品状态作为输入,影响工艺指标的关键因素‑研磨时间作为输出建立逆向模型用于对研磨时间进行优化设定。在构建上述逆向模型过程中,针对生产品种种类多,单个品种数据少的问题,将相似品种按产品特征进行聚类,在每个类别中采用基于共有参数提取的多任务学习方法进行建模;所计算出的模型参数分为本类品种共有部分和单品种私有部分。
The chemical mechanical grinding time setting method based on clustering and multi-task learning belongs to the field of automatic control, information technology and advanced manufacturing. The method takes the process index and product state as input, and the key factor affecting the process index - grinding time is used as the output to establish a reverse model to optimize the setting of grinding time. In the process of constructing the above reverse model, in view of the problem of many types of production varieties and little data of a single variety, similar varieties are clustered according to product characteristics, and a multi-task learning method based on common parameter extraction is used for modeling in each category; The calculated model parameters are divided into the common part of this kind of variety and the private part of single variety.
Description
技术领域technical field
本发明属于自动控制、信息技术和先进制造领域。为解决现代微电子生产制造中多品种混合生产模式下人工设定化学机械研磨时间所导致的返工率高,整条生产线的生产效率偏低的问题,本发明采用基于逆向模型的优化设定方法,将工艺指标及产品状态作为输入,影响工艺指标的关键因素一研磨时间作为输出建立逆向模型用于对研磨时间进行优化设定。针对生产数据中所存在的产品品种多且单个品种数据量少的问题,提出一种基于聚类和多任务学习的化学机械研磨时间设定方法,可实现对化学机械研磨时间的优化设定,提高化学机械研磨工序的生产效率。The invention belongs to the fields of automatic control, information technology and advanced manufacturing. In order to solve the problems of high rework rate and low production efficiency of the entire production line caused by manual setting of chemical mechanical grinding time in the multi-variety mixed production mode in modern microelectronics production and manufacturing, the present invention adopts an optimal setting method based on a reverse model , taking the process index and product status as input, and the key factor affecting the process index—grinding time as output—to establish a reverse model to optimize the setting of grinding time. Aiming at the problem that there are many product varieties in the production data and the data volume of a single variety is small, a chemical mechanical grinding time setting method based on clustering and multi-task learning is proposed, which can realize the optimal setting of chemical mechanical grinding time. Increases productivity in chemical mechanical polishing processes.
背景技术Background technique
化学机械研磨是微电子制造过程中的关键工序,影响着整条生产线的生产效率。由于现代微电子制造业多品种小批量的生产特点,企业常采用多品种混合生产的模式。在这种混合生产模式下,传统的Run-to-Run(RtR)优化控制方法难以取得理想效果。RtR的主要思想是利用上一批次乃至几批次的生产信息指导新来批次的生产,一般假设相同品种的产品在一台设备上连续加工或假设若干台设备有规律的循环加工少量品种。但由于多品种混合生产下若干台设备可能连续加工任意品种的产品,不同品种和设备间存在差异,传统RtR方法效果较差。因此,目前研磨时间仍然依靠人工经验设定及先行片测试,一旦出现不合格产品,整个批次都要进行返工,难以实现较高的生产效率,亟需对化学机械研磨的加工时间进行优化设定。传统的操作参数优化设定方法需要首先对工艺指标进行建模,然后在指标模型的基础上优化操作参数。工艺指标模型是以产品状态及操作参数为输入、工艺指标为输出的正向模型,可是在数据较少的时候,很难准确建立指标模型,导致操作优化效果不佳。借鉴RtR中用历史批次指导未来生产的想法,针对化学机械研磨数据品种多、每个品种中数据少的特点,本发明将工艺指标及产品状态作为输入,影响工艺指标的关键因素-研磨时间作为输出建立逆向模型用于对研磨时间进行优化设定,基于此提出一种基于聚类和多任务学习的化学机械研磨时间设定方法。Chemical mechanical polishing is a key process in the microelectronics manufacturing process, which affects the production efficiency of the entire production line. Due to the multi-variety and small-batch production characteristics of the modern microelectronics manufacturing industry, enterprises often adopt the mode of multi-variety mixed production. In this mixed production mode, the traditional Run-to-Run (RtR) optimization control method is difficult to achieve ideal results. The main idea of RtR is to use the production information of the previous batch or even several batches to guide the production of new batches. Generally, it is assumed that the products of the same variety are processed continuously on one equipment or several equipments are assumed to process a small number of varieties in a regular cycle. . However, due to the multi-variety mixed production, several equipment may continuously process any variety of products, and there are differences between different varieties and equipment, and the effect of the traditional RtR method is poor. Therefore, at present, the grinding time still depends on manual experience setting and advanced sheet testing. Once unqualified products appear, the entire batch will have to be reworked. It is difficult to achieve high production efficiency. It is urgent to optimize the processing time of chemical mechanical grinding. Certainly. The traditional optimization setting method of operating parameters needs to firstly model the process index, and then optimize the operating parameters on the basis of the index model. The process index model is a forward model with product status and operating parameters as input and process index as output. However, when there is less data, it is difficult to accurately establish the index model, resulting in poor operation optimization results. Drawing lessons from the idea of using historical batches to guide future production in RtR, and aiming at the characteristics of many types of chemical mechanical grinding data and less data in each type, the present invention uses process indicators and product status as input, and the key factor affecting process indicators-grinding time As an output, an inverse model is established to optimize the setting of grinding time, and based on this, a chemical-mechanical grinding time setting method based on clustering and multi-task learning is proposed.
发明内容Contents of the invention
为解决多品种混合生产环境下的化学机械研磨时间优化设定问题,本发明将工艺指标及产品状态作为输入,影响工艺指标的关键因素-研磨时间作为输出建立逆向模型用于对研磨时间进行优化设定,基于此提出一种基于聚类和多任务学习的化学机械研磨时间设定方法。为处理实际生产数据中产品品种多,每个品种数据少的问题,提出一种基于聚类和多任务学习的两步建模方法,将传统聚类多任务学习中的聚类过程和参数学习过程分开考虑。品种聚类过程假设每个品种中的数据符合一个多维高斯分布,采用最大似然估计对多维高斯分布的均值向量和方差矩阵进行估计。根据估计结果,采用巴氏距离表示两个多维高斯分布的相似度。通过计算不同品种概率分布之间的相似度得到相似度矩阵。以相似度矩阵作为输入采用仿射传播算法完成品种聚类。同时为了保证小样本下聚类所得的结果中每个类别中的样本数量都足够大,将每个品种的样本数量作为一种先验知识嵌入到品种聚类过程中。In order to solve the problem of optimal setting of chemical mechanical grinding time in a multi-variety mixed production environment, the present invention takes the process index and product status as input, and the key factor affecting the process index-grinding time as output to establish a reverse model for optimizing the grinding time Based on this, a chemical mechanical grinding time setting method based on clustering and multi-task learning is proposed. In order to deal with the problem of many varieties of products and little data of each variety in the actual production data, a two-step modeling method based on clustering and multi-task learning is proposed, which integrates the clustering process and parameter learning in traditional clustering multi-task learning The process is considered separately. The variety clustering process assumes that the data in each variety conforms to a multidimensional Gaussian distribution, and uses maximum likelihood estimation to estimate the mean vector and variance matrix of the multidimensional Gaussian distribution. According to the estimated results, the Bhattacharyachian distance is used to represent the similarity of two multidimensional Gaussian distributions. The similarity matrix is obtained by calculating the similarity between the probability distributions of different varieties. Using the similarity matrix as input, the affine propagation algorithm was used to complete the clustering of varieties. At the same time, in order to ensure that the number of samples in each category in the results of clustering under small samples is large enough, the number of samples of each variety is embedded in the process of variety clustering as a priori knowledge.
在每个类别中采用所提出的基于共有参数提取的多任务学习算法计算模型参数。为了解决单个任务中样本数量少的问题,所提出的基于共有参数提取的多任务学习算法将模型参数分解为共有部分和私有部分并同时学习这两部分。共有部分可以弥补样本数据不足带来的模型偏差。The model parameters are computed in each category using the proposed multi-task learning algorithm based on common parameter extraction. To solve the problem of small number of samples in a single task, the proposed multi-task learning algorithm based on common parameter extraction decomposes model parameters into common and private parts and learns both parts simultaneously. The common part can make up for the model bias caused by insufficient sample data.
基于聚类和多任务学习的化学机械研磨时间设定方法,其特征在于,所述方法是在计算机上依次按以下步骤实现的:The chemical mechanical grinding time setting method based on clustering and multi-task learning is characterized in that the method is implemented on a computer in the following steps:
步骤(1):数据整理Step (1): Data curation
本方法中所建立的优化设定模型以4项工艺指标及产品状态组成模型输入行向量x,其中包括:研磨材料去除速率、来片厚度、抽测先行片出片厚度和lot抽测出片厚度,以化学机械研磨时间为模型输出y;假设模型输入输出之间的关系满足下式:The optimal setting model established in this method uses 4 process indicators and product status to form a model input row vector x, including: abrasive material removal rate, incoming sheet thickness, random measurement of the thickness of the preceding sheet and the thickness of the sheet measured by lot, Take the chemical mechanical grinding time as the model output y; assume that the relationship between the model input and output satisfies the following formula:
y=xw+δy=xw+δ
其中列向量w表示待确定的模型参数,δ为噪声。The column vector w represents the model parameters to be determined, and δ is the noise.
不失一般性的,假设有m个产品品种;第i个产品品种的Ni个样本输入记为矩阵其第j行xi(j)表示第i个产品品种的第j个样本的输入向量,d为模型输入变量个数,本方法中d=4;第i个产品品种的Ni个样本输出记为列向量其第j个元素表示第i个产品品种的第j个样本的输出,即化学机械研磨时间;为便于后续表示,用X表示由输入矩阵[X1,X2,...,Xm-1,Xm]纵向排列所构成的矩阵;同样用Y表示由输出向量[y1,y2,...,ym-1,ym]纵向排列构成的列向量;X和Y均有Ni行,N表示所有的样本总数;列向量表示每个样本所属品种的标号,取值范围为{1,2,...,m-1,m};Without loss of generality, suppose there are m product varieties; N i sample inputs of the i-th product variety are recorded as matrix Its j-th row x i(j) represents the input vector of the j-th sample of the i-th product variety, d is the number of model input variables, and d=4 in this method; N i samples of the i-th product variety output denoted as a column vector its jth element Indicates the output of the j-th sample of the i-th product variety, that is, the chemical-mechanical grinding time; for subsequent representation, X is used to represent the input matrix [X 1 , X 2 ,..., X m-1 , X m ] The matrix formed by the vertical arrangement; Y is also used to represent the column vector formed by the vertical arrangement of the output vector [y 1 , y 2 ,..., y m-1 , y m ]; both X and Y have N i rows, N represents the total number of samples; column vector Indicates the label of the variety to which each sample belongs, and the value range is {1, 2, ..., m-1, m};
步骤(2):计算不同品种的相似度矩阵Step (2): Calculate the similarity matrix of different varieties
假设每个品种的数据服从不同的多维高斯分布,采用极大似然估计方法计算每个品种的概率分布函数;例如,对第i个品种,其多维高斯分布的均值向量和和方差矩阵的估计值:Assuming that the data of each variety obeys different multidimensional Gaussian distributions, the probability distribution function of each variety is calculated using the maximum likelihood estimation method; for example, for the i-th variety, the mean vector of its multidimensional Gaussian distribution and and variance matrix Estimated value of :
其中矩阵行向量zi(j)表示矩阵Zi的第j行,包含4个输入变量和1个输出变量;where matrix The row vector z i(j) represents the jth row of the matrix Z i , including 4 input variables and 1 output variable;
采用巴氏距离比较不同品种之间的相似度,即比较不同品种多维高斯分布的相似度:The similarity between different varieties is compared by using the Barrett's distance, that is, the similarity of multidimensional Gaussian distribution of different varieties is compared:
在多维高斯分布的假设下,巴氏距离有解析表达式,假设两个多维高斯分布的概率分布为G1~N(μ1,∑1),G2~N(μ2,∑2)的距离计算方法为:Under the assumption of multidimensional Gaussian distribution, the Bhattacharyian distance has an analytical expression, assuming that the probability distribution of two multidimensional Gaussian distributions is G 1 ~ N(μ 1 , ∑ 1 ), G 2 ~ N(μ 2 , ∑ 2 ) The distance calculation method is:
其中|A|表示矩阵A的行列式;in |A| represents the determinant of matrix A;
根据上述的差异度计算方法,基于每个品种多元高斯分布均值向量和方差矩阵的估计值,计算得到相似度矩阵;因后续仿射传播聚类算法所需输入为相似度,所以差异度取负数得到相似度;According to the above calculation method of difference degree, based on the mean vector of multivariate Gaussian distribution of each variety and variance matrix The estimated value of is calculated to obtain the similarity matrix; because the input required by the subsequent affine propagation clustering algorithm is the similarity, the difference is negative to obtain the similarity;
步骤(3):基于仿射传播的产品特征聚类Step (3): Product feature clustering based on affine propagation
仿射传播聚类是一种基于信息累积的聚类算法,根据不同点的累积信息量确定聚类中心,主要利用相似度矩阵计算两种信息量,γ(i,k),a(i,k):Affine propagation clustering is a clustering algorithm based on information accumulation. The cluster center is determined according to the accumulated information of different points, and the similarity matrix is mainly used to calculate two kinds of information, γ(i, k), a(i, k):
开始设置a(i,k)=0,然后根据上式迭代的更新γ(i,k)和a(i,k)直到收敛;Start to set a(i,k)=0, then iteratively update γ(i,k) and a(i,k) according to the above formula until convergence;
仿射传播聚类中用偏好向量表示先验知识中每个任务成为聚类中心的可能性;在迭代中用偏好向量代替相似度矩阵中的对角线元素进而影响聚类中心的选择;因为样本数目较多的产品品种后续会得到更加准确的模型,更适合作为聚类中心,为了将样本数目这一先验用于聚类的过程中,采用如下方法设置仿射传播算法中的偏好向量;In the affine propagation clustering, the preference vector is used to represent the possibility of each task in the prior knowledge becoming the cluster center; in the iteration, the preference vector is used to replace the diagonal elements in the similarity matrix to affect the selection of the cluster center; because Products with a large number of samples will get more accurate models later, which are more suitable as cluster centers. In order to use the prior of the number of samples in the process of clustering, the following method is used to set the preference vector in the affine propagation algorithm ;
设偏好向量的设定值为p=[p1,p2,...,pm-1,pm]:Assume that the set value of the preference vector is p=[p 1 , p 2 ,..., p m-1 , p m ]:
其中Ni代表每个任务的样本数,L表示希望样本数量大于L的品种更倾向于成为聚类中心;典型值设定a=0.005,b=2000,L=50;Among them, N i represents the number of samples for each task, and L indicates that varieties with a sample size larger than L are more likely to become cluster centers; typical values are set to a=0.005, b=2000, L=50;
步骤(4):基于共有参数提取的多任务学习Step (4): Multi-task learning based on common parameter extraction
在聚类后得到L个类别,对每个类别中的品种使用基于共有参数提取的多任务学习算法,其算法的主要思想是将每个品种的模型参数分为两部分:共享参数和私有参数;共享参数是每个类别中所有品种的数据模型中相同的部分,用列向量表示;而私有参数是每个类别中每个品种的数据模型中不同的部分,用列向量表示;如果一个类别中有γ个品种,则γ个列向量vi可构成一个γ列的矩阵V;基于共有参数提取的多任务学习算法可根据每个类别中的数据学习出这两部分的参数从而得到最终模型的参数,例如对第i个任务,最终的模型参数为:After clustering, L categories are obtained, and a multi-task learning algorithm based on common parameter extraction is used for the varieties in each category. The main idea of the algorithm is to divide the model parameters of each variety into two parts: shared parameters and private parameters. ; Shared parameters are the same parts of the data model for all varieties in each category, with a column vector Represents; while the private parameters are different parts of the data model for each variety in each category, with a column vector If there are γ varieties in a category, then γ column vectors v i can form a matrix V of γ columns; the multi-task learning algorithm based on the extraction of common parameters can learn the two parts according to the data in each category Parameters to get the parameters of the final model, for example, for the i-th task, the final model parameters are:
wi=u+vi w i =u+v i
在进行模型学习以前,需要先对数据进行归一化处理;然后设定模型的参数,包括λ1,λ2,λ3,随机初始化共有参数向量u和私有参数矩阵V;Before learning the model, it is necessary to normalize the data; then set the parameters of the model, including λ 1 , λ 2 , λ 3 , and randomly initialize the common parameter vector u and the private parameter matrix V;
迭代过程如下,其中X表示一类中γ个品种的输入矩阵[X1,X2,...,XXr-1,Xr]纵向排列所构成的矩阵;同样用Y表示一类中γ个品种的输出向量[y1,y2,...,yr-1,yr]纵向排列构成的列向量对第k次迭代计算:The iterative process is as follows, where X represents the matrix formed by the vertical arrangement of input matrices [X 1 , X 2 , ..., XX r-1 , X r ] of γ varieties in one class; Y also represents γ in one class The output vectors [y 1 , y 2 , ..., y r-1 , y r ] of varieties are arranged vertically to form a column vector for the k-th iteration calculation:
上式中:In the above formula:
根据pk,uk-1和Qk,Vk-1更新uk,Vk Update u k , V k according to p k , u k-1 and Q k , V k -1
其中α∈[0,1],可令α0=0;t0=1, Where α∈[0,1], α 0 =0 can be set; t 0 =1,
在迭代过程中,步长lk采用如下方法确定:In the iterative process, the step size l k is determined by the following method:
lk=2jklk-1,其中jk为使得下式成立的最小非负正整数:l k =2 jk l k-1 , where j k is the smallest non-negative positive integer that makes the following formula true:
对L个聚类所得的类别分别使用上述方法即可得到L个模型库,其包含了所有m个品种的模型。Using the above methods for the categories obtained by the L clusters respectively, L model libraries can be obtained, which include all models of m varieties.
附图说明Description of drawings
图1:基于聚类和多任务学习的化学机械研磨时间设定方法流程图Figure 1: Flow chart of the method for setting chemical mechanical grinding time based on clustering and multi-task learning
图2:基于聚类和多任务学习的化学机械研磨时间设定方法软硬件组成图。Figure 2: The hardware and software composition diagram of the chemical mechanical grinding time setting method based on clustering and multi-task learning.
具体实施方式Detailed ways
本发明提出基于聚类和多任务学习的化学机械研磨时间设定方法,其主要优势在于可用于多品种混合生产,相比人工设定可提高生产效率。在实际应用过程中,如果有新的生产批次到来,可根据其品种和加工层种类及其他批次信息计算研磨时间。本发明的基于聚类和多任务学习算法依赖于相关的硬件设备,包括:数据采集系统、算法服务器和用户客户端,并由基于智能优化软件实现。本发明提出方法流程图如图2所示。The present invention proposes a chemical mechanical grinding time setting method based on clustering and multi-task learning. Its main advantage is that it can be used in multi-variety mixed production, and can improve production efficiency compared with manual setting. In the actual application process, if a new production batch arrives, the grinding time can be calculated according to its variety and processing layer type and other batch information. The learning algorithm based on clustering and multi-task of the present invention depends on related hardware equipment, including: data acquisition system, algorithm server and user client, and is realized by intelligent optimization software. The flow chart of the method proposed by the present invention is shown in FIG. 2 .
步骤(1):数据采集Step (1): Data Acquisition
采集的生产信息包括lot的产品品种、加工层次、材料去除率、来片厚度、先行片出片厚度、抽检出片厚度、来片厚度范围、出片厚度范围,将初始化信息存储至生产过程数据库中;The collected production information includes lot product variety, processing level, material removal rate, incoming sheet thickness, leading sheet output thickness, random inspection output sheet thickness, incoming sheet thickness range, and output sheet thickness range, and the initialization information is stored in the production process database middle;
步骤(2):数据整理Step (2): Data Wrangling
在模型训练服务器上,首先去除异常历史记录,包括历史数据中人为输入的明显异常记录和个别数据项缺失的异常记录。然后根据模型学习的要求整理数据,其中将产品品种和品种最为一个组合定位为一种广义品种,及相同品种的不同层次也认为是不同品种。所组成的数据集为其中第j行由4维数据构成,包括材料去除速率、来片厚度、先行片出片厚度和lot抽测出片厚度;第j个元素为化学机械研磨时间;品种编号向量I。On the model training server, first remove the abnormal historical records, including the obvious abnormal records entered by human beings in the historical data and the abnormal records missing individual data items. Then organize the data according to the requirements of model learning, in which the combination of product variety and variety is positioned as a generalized variety, and different levels of the same variety are also considered as different varieties. The dataset composed of where line j Consists of 4-dimensional data, including material removal rate, thickness of the incoming piece, thickness of the leading piece and the thickness of the piece measured by lot; jth element is the chemical mechanical grinding time; the variety number vector I.
步骤(3):模型训练Step (3): Model training
在模型训练服务器上,根据整理后的数据集{Xi,yi},i={1,2,...,m-1,m}和品种编号向量I进行模型学习。其中包括两步:产品特征聚类和共有参数提取的多任务学习。根据发明描述中的计算流程完成整个模型训练的过程。本发明提出方法流程图如图1所示。On the model training server, model learning is performed according to the sorted data set {X i , y i }, i={1, 2, . . . , m−1, m} and the variety number vector I. It includes two steps: product feature clustering and multi-task learning of common parameter extraction. Complete the entire model training process according to the calculation process in the invention description. The flow chart of the method proposed by the present invention is shown in FIG. 1 .
步骤(4):参数优化Step (4): Parameter Optimization
在基于产品特征聚类和共有参数提取的多任务学习中,主要的模型参数包括样本数阈值L、聚类数参数a和多任务学习参数λ1,λ2,λ3。样本阈值L可通过对整个数据集中每个品种的样本数统计来确定。其余三个参数采用交叉验证的方法进行优化。其主要过程是对整个数据集进行拆分并分别作为训练集和测试集。例如如果进行3交叉验证,则将每个品种的数据分为3份,然后取1份作为测试数据,2分作为训练数据。用训练数据训练得到模型,然后用测试数据测试模型的优化设定效果。通过设定不同的a,λ1,λ2,λ3选择效果最好的参数。最后将最佳的模型传输到现场服务器上。In the multi-task learning based on product feature clustering and common parameter extraction, the main model parameters include sample number threshold L, cluster number parameter a and multi-task learning parameters λ 1 , λ 2 , λ 3 . The sample threshold L can be determined by counting the number of samples of each variety in the entire data set. The remaining three parameters are optimized by cross-validation method. The main process is to split the entire data set and use it as training set and test set respectively. For example, if 3 cross-validation is performed, the data of each variety is divided into 3 parts, and then 1 part is used as test data and 2 parts are used as training data. Use the training data to train the model, and then use the test data to test the optimization setting effect of the model. By setting different a, λ 1 , λ 2 , λ 3 to select the parameter with the best effect. Finally the best model is transferred to an on-site server.
步骤(4):在线应用Step (4): Online Application
在现场服务器上,根据实际生产数据传过来的当前加工批次数据,选择对应品种和层次的数据确定模型参数。当前批次中的数据x包括材料去除速率、来片厚度,缺少先行片出片厚度和lot抽测出片厚度。此处用此品种和层次的标准出片厚度代替先行片出片厚度和lot抽测出片厚度。用模型参数点乘向量x可得到化学机械研磨时间的优化设定值。On the on-site server, according to the current processing batch data transmitted from the actual production data, select the data corresponding to the variety and level to determine the model parameters. The data x in the current batch includes the material removal rate, the thickness of the incoming piece, the thickness of the missing piece of the leading piece, and the thickness of the piece measured by lot. Here, the standard sheet thickness of this variety and level is used to replace the sheet thickness of the preceding sheet and the thickness of the sheet measured by lot sampling. The optimal setting value of the chemical mechanical polishing time can be obtained by multiplying the vector x with the model parameters.
基于上述所提出的基于产品特征聚类和共有参数提取多任务学习方法,本发明做了大量的仿真试验,由于篇幅所限,这里仅给出该发明应用到化学机械研磨时间优化设定的仿真结果。输入数据由4维数据组成,包括材料去除速率、来片厚度,先行片出片厚度和lot抽测出片厚度。数据取自2011-1-1至2013-7-25之间的工业现场数据,441个品种,共11250条记录,采用3交叉验证确定模型参数。Based on the above-mentioned proposed multi-task learning method based on product feature clustering and common parameter extraction, the present invention has done a large number of simulation experiments. Due to space limitations, only the simulation of the application of the invention to the optimal setting of chemical mechanical grinding time is given here. result. The input data consists of 4-dimensional data, including the material removal rate, the thickness of the incoming piece, the thickness of the piece out of the first piece and the thickness of the piece measured by lot. The data is taken from the industrial field data between 2011-1-1 and 2013-7-25, 441 varieties, a total of 11250 records, using 3 cross-validation to determine the model parameters.
本发明与典型的多任务学习算法CMTL-convex(Clustered multi-task learningA convex formulation)和非线性建模方法核极限学习机(KELM)、支持向量机(SVM)进行了比较。其中为了证明品种聚类算法的有效性,分别对不采用品种聚类方法的KELM、SVM与进行品种聚类以后的Cluster-KELM和Cluster-SVM进行比较。KELM与SVM中所用的是高斯核函数,表达式为:The present invention is compared with typical multi-task learning algorithm CMTL-convex (Clustered multi-task learningA convex formulation) and nonlinear modeling methods Kernel Extreme Learning Machine (KELM) and Support Vector Machine (SVM). Among them, in order to prove the effectiveness of the variety clustering algorithm, KELM and SVM without the variety clustering method are compared with Cluster-KELM and Cluster-SVM after the variety clustering. The Gaussian kernel function used in KELM and SVM is expressed as:
exp(-γ*||u-v||2)exp(-γ*||uv|| 2 )
另外的一个参数为模型中的正则项权重。相关参数取值如下表所示。Another parameter is the weight of the regular term in the model. The relevant parameter values are shown in the table below.
表1 算法相关参数设置Table 1 Algorithm-related parameter settings
随机抽取不同比例(0.2,0.4,0.6)的数据作为测试数据进行算法性能比较。性能指标选择正则化均方误差(normalized Mean Squared Error,nMSE),平均均方误差(averaged Means Squared Error,aMSE)和10%误差以外所占比例(error>10%)比较结果如表2所示Data with different ratios (0.2, 0.4, 0.6) were randomly selected as test data for algorithm performance comparison. The performance index selection regularized mean square error (normalized Mean Squared Error, nMSE), average mean square error (averaged Means Squared Error, aMSE) and the proportion of 10% error (error> 10%) comparison results are shown in Table 2
表2 LS-IELM与OS-ELM、Fixed-LSSVM算法性能比较结果Table 2 Performance comparison results of LS-IELM and OS-ELM, Fixed-LSSVM algorithm
从表中可看出,本发明提出的TwCMTL相比CMTL-convex、SVM、KELM、Cluster-SVM、Cluster-KELM具有更好的测试精度,具有更好的泛化能力。It can be seen from the table that TwCMTL proposed by the present invention has better test accuracy and better generalization ability than CMTL-convex, SVM, KELM, Cluster-SVM, and Cluster-KELM.
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410805040.3A CN104598720B (en) | 2014-12-23 | 2014-12-23 | Cmp time setting method based on cluster and multi-task learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410805040.3A CN104598720B (en) | 2014-12-23 | 2014-12-23 | Cmp time setting method based on cluster and multi-task learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104598720A CN104598720A (en) | 2015-05-06 |
CN104598720B true CN104598720B (en) | 2018-04-10 |
Family
ID=53124499
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410805040.3A Active CN104598720B (en) | 2014-12-23 | 2014-12-23 | Cmp time setting method based on cluster and multi-task learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104598720B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US12164270B2 (en) | 2021-10-19 | 2024-12-10 | International Business Machines Corporation | Process optimization with joint-level inflow model |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6845278B2 (en) * | 2002-08-07 | 2005-01-18 | Kimberly-Clark Worldwide, Inc. | Product attribute data mining in connection with a web converting manufacturing process |
CN101853507A (en) * | 2010-06-03 | 2010-10-06 | 浙江工业大学 | A Cell Classification Method Based on Affine Propagation Clustering |
CN103093078A (en) * | 2012-12-18 | 2013-05-08 | 湖南大唐先一科技有限公司 | Data inspection method for improved 53H algorithm |
-
2014
- 2014-12-23 CN CN201410805040.3A patent/CN104598720B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6845278B2 (en) * | 2002-08-07 | 2005-01-18 | Kimberly-Clark Worldwide, Inc. | Product attribute data mining in connection with a web converting manufacturing process |
CN101853507A (en) * | 2010-06-03 | 2010-10-06 | 浙江工业大学 | A Cell Classification Method Based on Affine Propagation Clustering |
CN103093078A (en) * | 2012-12-18 | 2013-05-08 | 湖南大唐先一科技有限公司 | Data inspection method for improved 53H algorithm |
Non-Patent Citations (4)
Title |
---|
A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems;A Beck 等;《Siam Journal on Imaging Sciences》;20091231;第183-202页 * |
A Few Useful Things to Know about Machine Learning;Pedro Domingos;《Communications of the Acm》;20121031;第1-9页 * |
Clustering by Passing Messages Between Data Points;Brendan J.Frey;《Science》;20070216;第942-951页 * |
Regularized mult-task learning;T.Evgeniou;《Tench Acm Sigkdd International Conference on Knowledge Discovery & Data Mining》;20041231;第109-117页 * |
Also Published As
Publication number | Publication date |
---|---|
CN104598720A (en) | 2015-05-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107272655B (en) | Batch process fault monitoring method based on multi-stage ICA-SVDD | |
CN109116834B (en) | A Deep Learning-Based Method for Intermittent Process Fault Detection | |
TWI794157B (en) | Automatic multi-threshold feature filtering method and device | |
CN103217960B (en) | Automatic selection method of dynamic scheduling strategy of semiconductor production line | |
CN104914723B (en) | Soft sensor modeling method for industrial process based on collaborative training partial least squares model | |
CN107273445A (en) | The apparatus and method that missing data mixes multiple interpolation in a kind of big data analysis | |
CN107861492A (en) | A kind of broad sense Non-negative Matrix Factorization fault monitoring method based on nargin statistic | |
Arif et al. | Cascade quality prediction method using multiple PCA+ ID3 for multi-stage manufacturing system | |
CN108875771A (en) | A kind of failure modes model and method being limited Boltzmann machine and Recognition with Recurrent Neural Network based on sparse Gauss Bernoulli Jacob | |
CN109085805A (en) | A kind of industrial process fault detection method based on multi-sampling rate Factor Analysis Model | |
CN107357269A (en) | Multistage process quality prediction method based on mixing MPLS | |
CN110020680B (en) | A PMU Data Classification Method Based on Random Matrix Theory and Fuzzy C-Means Clustering Algorithm | |
CN106599367A (en) | Method for detecting abnormal state of spacecraft | |
Lee et al. | Statistical comparison of fault detection models for semiconductor manufacturing processes | |
CN114330549A (en) | Chemical process fault diagnosis method based on depth map network | |
CN110334478A (en) | Machinery equipment abnormality detection model building method, detection method and model | |
CN110008914A (en) | A kind of pattern recognition system neural network based and recognition methods | |
CN103885867A (en) | Online evaluation method of performance of analog circuit | |
CN109933040A (en) | A fault monitoring method based on hierarchical density peak clustering and the most similar mode | |
CN106547899B (en) | Intermittent process time interval division method based on multi-scale time-varying clustering center change | |
CN104598720B (en) | Cmp time setting method based on cluster and multi-task learning | |
CN115759552A (en) | Multi-agent architecture-based real-time scheduling method for intelligent factory | |
CN110084301A (en) | A kind of multiple operating modes process industry and mining city method based on hidden Markov model | |
Li et al. | Variable moving windows based non‐Gaussian dissimilarity analysis technique for batch processes fault detection and diagnosis | |
Van der Laan et al. | Quantile-function based null distribution in resampling based multiple testing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20181031 Address after: 200120 Shanghai free trade pilot area 707 Zhang Yang road two floor West 205 room Patentee after: Zhengda Industrial Biotechnology (Shanghai) Co., Ltd. Address before: 100084 Tsinghua Yuan, Beijing, Haidian District Patentee before: Tsinghua University |
|
TR01 | Transfer of patent right | ||
DD01 | Delivery of document by public notice | ||
DD01 | Delivery of document by public notice |
Addressee: Zhengda Industrial Biotechnology (Shanghai) Co., Ltd. Document name: Notification to Pay the Fees |
|
DD01 | Delivery of document by public notice | ||
DD01 | Delivery of document by public notice |
Addressee: Zhang Wei Document name: payment instructions |
|
DD01 | Delivery of document by public notice | ||
DD01 | Delivery of document by public notice |
Addressee: Zhang Wei Document name: payment instructions |