CN113766229A - Encoding method, decoding method, device, equipment and readable storage medium - Google Patents
- Publication number
- CN113766229A (application CN202111160289.XA)
- Authority
- CN
- China
- Prior art keywords
- point
- matrix
- point cloud
- target
- sub
- Prior art date
- Legal status
- Granted
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/44—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Abstract
The application discloses an encoding method, a decoding method, an apparatus, a device, and a readable storage medium, which relate to the technical field of image processing and aim to improve processing performance. The method comprises the following steps: clustering point cloud data to be processed of a current frame to obtain a plurality of sub-point clouds; for any target sub-point cloud among the plurality of sub-point clouds, generating a generalized Laplacian matrix according to Euclidean distances between a plurality of point pairs in the target sub-point cloud and Euclidean distances between target points in the target sub-point cloud and the corresponding points of the target points; performing inter-frame prediction and graph Fourier residual transform on the target sub-point cloud by using the generalized Laplacian matrix; and quantizing and coding the transformed sub-point clouds respectively to obtain a coded code stream. The corresponding points are located in a reference point cloud of the target sub-point cloud, and the reference point cloud is located in a reference frame of the current frame. The embodiment of the application can improve processing performance.
Description
Technical Field
The present application relates to the field of image processing technologies, and in particular, to an encoding method, a decoding method, an apparatus, a device, and a readable storage medium.
Background
With the development of computer hardware and algorithms, three-dimensional point cloud data has become increasingly convenient to acquire, and its data volume keeps growing. Point cloud data consists of a large number of unordered three-dimensional points, each of which includes position information (X, Y, Z) and several kinds of attribute information (color, normal vector, etc.).
To facilitate the storage and transmission of point cloud data, point cloud compression technology has become a focus of attention. The prior art provides a scheme that selectively encodes one or more 3D point cloud blocks using inter-coding techniques (e.g., motion compensation) based on previously encoded/decoded frames. However, this scheme suffers from poor processing performance, such as poor encoding performance.
Disclosure of Invention
The embodiments of the present application provide an encoding method, a decoding method, an apparatus, a device, and a readable storage medium, so as to improve processing performance.
In a first aspect, an embodiment of the present application provides an encoding method, which is applied to an encoding device, and includes:
clustering point cloud data to be processed of a current frame to obtain a plurality of sub-point clouds;
generating a generalized Laplace matrix for any target sub-point cloud in the plurality of sub-point clouds according to Euclidean distances between a plurality of point pairs in the target sub-point cloud and Euclidean distances between target points in the target sub-point cloud and corresponding points of the target points;
performing inter-frame prediction and graph Fourier residual transformation on the target sub-point cloud by using the generalized Laplace matrix;
quantizing and coding the transformed multiple sub-point clouds respectively to obtain a coded code stream;
and the corresponding point is positioned in a reference point cloud of the target sub-point cloud, and the reference point cloud is positioned in a reference frame of the current frame.
In a second aspect, an embodiment of the present application further provides a decoding method, which is applied to a decoding device, and the method includes:
acquiring a coded code stream;
performing an inverse graph Fourier transform based on Euclidean distance weights on the coded code stream to obtain a transform result;
obtaining a decoded code stream based on the transform result;
wherein the coded code stream is obtained by the encoding device by coding a result of performing inter-frame prediction and graph Fourier residual transform on sub-point clouds.
In a third aspect, an embodiment of the present application further provides an encoding apparatus, including:
the first acquisition module is used for clustering point cloud data to be processed of a current frame to obtain a plurality of sub-point clouds;
the first generation module is used for generating a generalized Laplace matrix for any target sub-point cloud in the plurality of sub-point clouds according to Euclidean distances between a plurality of point pairs in the target sub-point cloud and Euclidean distances between target points in the target sub-point cloud and corresponding points of the target points;
the first transformation module is used for performing inter-frame prediction and graph Fourier residual transform on the target sub-point cloud by using the generalized Laplace matrix;
the first coding module is used for quantizing and coding the transformed sub-point clouds respectively to obtain a coded code stream;
and the corresponding point is positioned in a reference point cloud of the target sub-point cloud, and the reference point cloud is positioned in a reference frame of the current frame.
In a fourth aspect, an embodiment of the present application further provides a decoding apparatus, including:
the first acquisition module is used for acquiring a coded code stream;
the first transformation module is used for performing an inverse graph Fourier transform based on Euclidean distance weights on the coded code stream to obtain a transform result;
the first decoding module is used for obtaining a decoded code stream based on the transform result;
wherein the coded code stream is obtained by the encoding device by coding a result of performing inter-frame prediction and graph Fourier residual transform on sub-point clouds.
In a fifth aspect, an embodiment of the present application further provides an electronic device, including: a memory, a processor and a program stored on the memory and executable on the processor, the processor implementing the steps in the encoding method or the decoding method as described above when executing the program.
In a sixth aspect, the present application further provides a readable storage medium, on which a program is stored, where the program, when executed by a processor, implements the steps in the encoding method or the decoding method as described above.
In the embodiment of the present application, the point cloud data to be processed of the current frame is clustered to obtain a plurality of sub-point clouds; for any target sub-point cloud, a generalized Laplacian matrix is generated according to the Euclidean distances between a plurality of point pairs in the target sub-point cloud and the Euclidean distances between target points in the target sub-point cloud and their corresponding points; and inter-frame prediction and graph Fourier residual transform are performed on the sub-point clouds using the generalized Laplacian matrix, so that a coded code stream is obtained based on the transform results. Because the generalized Laplacian matrix is generated from the Euclidean distances between points, the embodiment of the present application can exploit global correlation characteristics to express the correlation between points more fully, remove the similarity between point cloud data as far as possible, and thus improve encoding performance.
With the performance of the encoding end improved, the data to be decoded is correspondingly optimized, so the decoding efficiency and performance can be improved accordingly.
Drawings
Fig. 1 is a flowchart of an encoding method provided in an embodiment of the present application;
FIGS. 2 and 3 are schematic diagrams comparing the effect of the method of the embodiment of the present application and the method of the prior art;
fig. 4 is a flowchart of a decoding method provided in an embodiment of the present application;
fig. 5 is a block diagram of an encoding apparatus according to an embodiment of the present application;
fig. 6 is a block diagram of a decoding device according to an embodiment of the present application.
Detailed Description
In the embodiment of the present application, the term "and/or" describes an association relationship between associated objects and means that there may be three relationships; for example, "A and/or B" may mean: A exists alone, A and B exist simultaneously, or B exists alone. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship.
In the embodiments of the present application, the term "plurality" means two or more, and other terms are similar thereto.
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
Referring to fig. 1, fig. 1 is a flowchart of an encoding method provided in an embodiment of the present application, and is applied to an encoding apparatus. As shown in fig. 1, the method comprises the following steps:
Step 101: clustering the point cloud data to be processed of the current frame to obtain a plurality of sub-point clouds.
In this step, the point cloud data to be processed is voxelized to obtain point cloud voxels, and the voxelized point cloud data is then clustered to obtain the plurality of sub-point clouds.
Specifically, a three-dimensional grid of a preset size is constructed, and the point cloud data to be processed is placed into the grid to obtain the coordinates of each point; each grid cell that contains points is taken as a point cloud voxel, yielding a plurality of point cloud voxels. In addition, the coordinate and attribute information of each point cloud voxel can be obtained, where the attribute information includes intensity, color, and the like. In the embodiment of the present application, the coordinate of a point cloud voxel is specifically the center of the points within that voxel, and the color information of a point cloud voxel is specifically the average of the color information of the points within that voxel. In practical applications, the point cloud data to be processed can be voxelized by means such as an octree to obtain the plurality of point cloud voxels.
The voxelized point cloud data is clustered using a spatially uniform division method; for example, K-means clustering may be employed.
In the embodiment of the present application, the point cloud data to be processed is divided into a plurality of sub-point clouds based on position information, so that the space is divided uniformly. Each sub-point cloud may be independently encoded.
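By way of illustration, the following Python sketch shows one way this step could be implemented: voxelization on a uniform grid (the voxel coordinate is the centroid of its points and the voxel color is the mean of its points' colors) followed by K-means clustering of the voxel coordinates. The grid size, cluster count, and function names are illustrative assumptions, not values from the patent.

```python
import numpy as np
from scipy.cluster.vq import kmeans2


def voxelize(points, colors, voxel_size=1.0):
    """Snap points into a uniform 3D grid; each occupied cell becomes one
    point cloud voxel whose coordinate is the centroid of its points and
    whose color is the mean of its points' colors."""
    keys = np.floor(points / voxel_size).astype(np.int64)
    _, inverse = np.unique(keys, axis=0, return_inverse=True)
    counts = np.bincount(inverse).astype(float)
    vox_xyz = np.stack(
        [np.bincount(inverse, weights=points[:, d]) / counts for d in range(3)],
        axis=1)
    vox_rgb = np.stack(
        [np.bincount(inverse, weights=colors[:, c]) / counts
         for c in range(colors.shape[1])], axis=1)
    return vox_xyz, vox_rgb


def cluster_sub_point_clouds(vox_xyz, n_clusters=16):
    """Spatially uniform division of the voxelized cloud: K-means on the
    voxel coordinates; each label identifies one sub-point cloud."""
    _, labels = kmeans2(vox_xyz, n_clusters, minit='++')
    return labels
```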
Step 102: for any target sub-point cloud among the plurality of sub-point clouds, generating a generalized Laplacian matrix according to the Euclidean distances between a plurality of point pairs in the target sub-point cloud and the Euclidean distances between target points in the target sub-point cloud and the corresponding points of the target points. The corresponding points are located in a reference point cloud of the target sub-point cloud, and the reference point cloud is located in a reference frame of the current frame.
Any sub-point cloud in the plurality of sub-point clouds can be used as the target sub-point cloud. In practical application, the processing mode of each target sub-point cloud is the same.
Specifically, the following contents may be included in this step:
and S1021, obtaining a weight matrix according to Euclidean distances among a plurality of point pairs in the target sub-point cloud.
The target sub-point cloud may include a plurality of points, and in this embodiment every two points constitute one point pair. The Euclidean distance between the two points of each point pair is calculated; for example, for the $i$-th point and the $j$-th point in the target sub-point cloud, the Euclidean distance between them is calculated. Specifically, for point $i = (x_1, x_2, \ldots, x_n)$ and point $j = (y_1, y_2, \ldots, y_n)$, the Euclidean distance $d(i,j)$ between them can in practical applications be calculated according to the following formula:

$$d(i,j) = \sqrt{(x_1 - y_1)^2 + (x_2 - y_2)^2 + \cdots + (x_n - y_n)^2}$$

where $1 \le i \le M$ and $1 \le j \le M$; $i$, $j$, and $M$ are integers, and $M$ is the total number of points included in the target sub-point cloud.
Then, a weight is calculated for each point pair from its Euclidean distance, and the weight matrix $W$ is formed from these weights, where $W_{ij}$ denotes the weight corresponding to the edge from the $i$-th point to the $j$-th point in the target sub-point cloud, $d(i,j)$ denotes the Euclidean distance from the $i$-th point to the $j$-th point, and $\sigma$ is a non-zero constant serving as a tuning parameter.
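A minimal sketch of S1021, with one stated assumption: the source text does not reproduce the patent's weight formula, so the Gaussian kernel $W_{ij} = e^{-d(i,j)^2/\sigma^2}$ used below is an assumed, commonly used form; `sigma` plays the role of the tuning parameter σ.

```python
import numpy as np


def weight_matrix(points, sigma=1.0):
    """Pairwise Euclidean distances d(i, j) over one sub-point cloud and the
    edge weights W_ij; the Gaussian kernel is an ASSUMED form of the weight
    formula, with sigma as the tuning parameter."""
    diff = points[:, None, :] - points[None, :, :]   # shape (M, M, 3)
    dist = np.sqrt((diff ** 2).sum(axis=-1))         # d(i, j)
    W = np.exp(-dist ** 2 / sigma ** 2)              # assumed Gaussian kernel
    np.fill_diagonal(W, 0.0)                         # no self-loop weights
    return W
```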
S1022: obtaining a Laplacian matrix according to the degree matrix and the weight matrix.
In this step, the difference between the degree matrix and the weight matrix is used as the Laplacian matrix. Specifically, $L = D - W$, where $L$ denotes the Laplacian matrix, $D$ denotes the degree matrix, and $W$ denotes the weight matrix.
The diagonal elements of the degree matrix are $d_i = \sum_j W_{ij}$ and its other elements are 0, where $d_i$ denotes the $i$-th diagonal element of the degree matrix and $W_{ij}$ denotes the weight corresponding to the edge from the $i$-th point to the $j$-th point in the target sub-point cloud.
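The degree matrix and the Laplacian $L = D - W$ translate directly into a few lines; a sketch:

```python
import numpy as np


def laplacian(W):
    """L = D - W, where the degree matrix D is diagonal with d_i = sum_j W_ij."""
    D = np.diag(W.sum(axis=1))
    return D - W
```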
S1023: generating a diagonal matrix.
The diagonal matrix is generated according to the Euclidean distances between target points in the target sub-point cloud and the corresponding points of the target points.
Specifically, the reference point cloud of the target sub-point cloud may first be determined in the reference frame, for example by performing motion estimation in the reference frame to find a matching reference point cloud; the points of the target sub-point cloud and the reference point cloud are in one-to-one correspondence. For example, using an iterative closest point algorithm, the reference point cloud of the target sub-point cloud may be determined in the reference frame based on Euclidean distances. Then, the diagonal matrix $D_w$ is generated based on the Euclidean distance between each point in the target sub-point cloud and its corresponding point in the reference point cloud: the $i$-th diagonal element of the diagonal matrix is the reciprocal of the Euclidean distance between the $i$-th point and point $p$, and the other elements are 0, where point $p$ is the corresponding point of the $i$-th point in the reference point cloud.
S1024: obtaining the generalized Laplacian matrix according to the diagonal matrix and the Laplacian matrix.
In this step, the sum of the diagonal matrix and the Laplacian matrix is used as the generalized Laplacian matrix. Specifically, $L_g = L + D_w$, where $L_g$ denotes the generalized Laplacian matrix, $L$ denotes the Laplacian matrix, and $D_w$ denotes the diagonal matrix.
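A sketch of S1023 and S1024 together: build the diagonal matrix $D_w$ from the reciprocal point-to-corresponding-point distances and form $L_g = L + D_w$. A plain nearest-neighbor query stands in here for the iterative closest point correspondence; that substitution, like the function names, is an assumption made for brevity.

```python
import numpy as np
from scipy.spatial import cKDTree


def generalized_laplacian(L, points_cur, points_ref):
    """Lg = L + Dw, where the i-th diagonal entry of Dw is the reciprocal of
    the Euclidean distance between point i and its corresponding point in the
    reference point cloud (nearest neighbor used as the correspondence)."""
    dist, _ = cKDTree(points_ref).query(points_cur)
    Dw = np.diag(1.0 / np.maximum(dist, 1e-12))  # guard against zero distance
    return L + Dw, Dw
```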
Step 103: performing inter-frame prediction and graph Fourier residual transform on the target sub-point cloud by using the generalized Laplacian matrix.
In this step, the inter-frame prediction and graph Fourier residual transform may be understood as being based on Euclidean distance weights, and may include the following:
S1031: obtaining an attribute prediction value of the target attribute of the current frame from the reference frame.
In the embodiment of the present application, an inter-frame prediction method is adopted, and the attribute values of the current frame are predicted using the reference frame. The attributes may include color, intensity, normal vector, and the like, and the target attribute may be any of these attributes.
Specifically, in this step, the attribute prediction value $\hat{x}_t$ of the target attribute of the current frame is obtained from the reference frame using the generalized Laplacian matrix, where $\hat{x}_t$ denotes the attribute prediction value of the target attribute of the current frame from the reference frame, $x_{t-1}$ denotes the attribute value of the target attribute of the reference frame, and $L_g$ denotes the generalized Laplacian matrix.
S1032: generating a residual of the target attribute of the current frame according to the attribute value of the target attribute of the current frame and the attribute prediction value of the target attribute of the current frame from the reference frame.
Specifically, the difference between the attribute value of the target attribute of the current frame and the attribute prediction value may be used as the residual:

$$\delta = x_t - \hat{x}_t$$

where $\delta$ denotes the residual of the target attribute of the current frame, $\hat{x}_t$ denotes the attribute prediction value of the target attribute of the current frame from the reference frame, and $x_t$ denotes the attribute value of the target attribute of the current frame.
Obtaining the residual through inter-frame prediction captures the difference between the two frames as fully as possible; since the parts shared by the two frames need no additional processing, computing the residual saves code rate.
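A sketch of S1031 and S1032 under one explicit assumption: the source text does not reproduce the patent's prediction formula, so the predictor below uses $\hat{x}_t = L_g^{-1} D_w x_{t-1}$, a natural choice for a generalized Laplacian of the form $L_g = L + D_w$; the residual $\delta = x_t - \hat{x}_t$ follows the text directly.

```python
import numpy as np


def predict_and_residual(Lg, Dw, x_ref, x_cur):
    """Inter-frame prediction of the current frame's attribute from the
    reference frame, followed by the residual delta = x_t - x_hat.
    ASSUMPTION: x_hat = Lg^{-1} Dw x_ref; the patent's exact prediction
    formula is not reproduced in the source text."""
    x_hat = np.linalg.solve(Lg, Dw @ x_ref)  # Lg is positive definite here
    delta = x_cur - x_hat
    return x_hat, delta
```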
S1033: transforming the residual of the target attribute of the current frame based on the generalized Laplacian matrix.
In this step, a transformation matrix is obtained from the generalized Laplacian matrix, and the residual of the target attribute of the current frame is then transformed using the transformation matrix.
Specifically, the transformation matrix $\Phi$ is obtained by solving the eigendecomposition of the generalized Laplacian matrix,

$$L_g = \Phi^{\mathsf{T}} \Lambda \Phi,$$

and the transform result is obtained as

$$\theta = \Phi\,\delta,$$

where $\theta$ denotes the transform result, $\Phi$ denotes the transformation matrix, and $\delta$ denotes the residual of the target attribute of the current frame.
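A sketch of S1033: the transformation matrix comes from the eigendecomposition of the generalized Laplacian, and the residual is projected onto the eigenvector basis. Note that numpy's `eigh` returns eigenvectors as columns, so `Phi.T` plays the role of the transformation matrix $\Phi$ in the text.

```python
import numpy as np


def gft(Lg, delta):
    """Graph Fourier residual transform: eigendecompose the (symmetric)
    generalized Laplacian and project the residual onto its eigenvectors."""
    eigvals, Phi = np.linalg.eigh(Lg)  # columns of Phi are eigenvectors
    theta = Phi.T @ delta              # transform result
    return theta, Phi
```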
In the embodiment of the present application, the concept of a generalized graph Fourier transform is introduced on the basis of the conventional graph Fourier transform, and prediction and residual transformation are performed on the inter-frame attributes of the point cloud data, which further removes redundancy between data and improves coding efficiency.
The other sub-point clouds are processed in the same way as the target sub-point cloud.
Step 104: quantizing and coding the transformed plurality of sub-point clouds respectively to obtain a coded code stream.
In this step, the transformed sub-point clouds are uniformly quantized and arithmetically coded to obtain the coded code stream.
Taking the target attribute as color as an example, the color can be decomposed into three component vectors (YUV or RGB). Taking the Y component as an example, the attribute value of the current frame is predicted according to the procedure in S1031, a residual is generated according to S1032, and the residual is then transformed using S1033. The transformed Y component is uniformly quantized and arithmetically coded to obtain the code stream; the other components are processed in the same way as the Y component.
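Putting step 104 together with the helpers sketched above: per-component prediction, residual, transform, and uniform quantization. The arithmetic-coding stage is omitted, and the quantization step size is an illustrative parameter.

```python
import numpy as np


def quantize(theta, step=1.0):
    """Uniform quantization of the transform coefficients; the integers would
    then be entropy-coded (arithmetic coding, omitted here)."""
    return np.round(theta / step).astype(np.int32)


def encode_attributes(Lg, Dw, attrs_ref, attrs_cur, step=1.0):
    """Run predict -> residual -> transform -> quantize on each attribute
    component (e.g. Y, U, V), mirroring the per-component example above."""
    coeffs = []
    for c in range(attrs_cur.shape[1]):
        _, delta = predict_and_residual(Lg, Dw, attrs_ref[:, c], attrs_cur[:, c])
        theta, _ = gft(Lg, delta)
        coeffs.append(quantize(theta, step))
    return coeffs
```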
In the embodiment of the present application, because the generalized Laplacian matrix is generated using the Euclidean distances between points, global correlation characteristics can be exploited to express the correlation between points more fully, so that the similarity between point cloud data is removed as far as possible and encoding performance is improved.
In practical applications, tests were performed on an actual point cloud sequence. The tests were conducted on 16 frames of dynamic point clouds, and fig. 2 shows a comparison between the performance of the method of the embodiment of the present application and that of the RAHT (Region-Adaptive Hierarchical Transform) and NWGFT (main-direction-weighted graph Fourier transform) methods. To quantify the gain, a comparison of the data with the RAHT method was performed in the experiment, as shown in fig. 3.
As can be seen from fig. 2 and 3, the embodiment of the present application introduces the concept of a generalized graph Fourier transform on the basis of the conventional graph Fourier transform and performs prediction and residual transformation on the inter-frame attributes of the point cloud, which further removes redundancy between data and improves coding efficiency. The experimental results show that the method improves both subjective and objective performance and can be applied to practical point cloud compression, transmission, and storage systems.
Referring to fig. 4, fig. 4 is a flowchart of a decoding method provided in an embodiment of the present application, which is applied to a decoding device. As shown in fig. 4, the method comprises the following steps:
Step 401: acquiring a coded code stream.
The coded code stream is obtained by coding the result of performing inter-frame prediction and graph Fourier residual transform on sub-point clouds using a generalized Laplacian matrix.
Step 402: performing an inverse graph Fourier transform based on Euclidean distance weights on the coded code stream to obtain a transform result.
At the decoding end, the coded code stream is entropy-decoded and then inverse-quantized. Afterwards, the inverse graph Fourier transform based on Euclidean distance weights is performed on the inverse-quantized coded code stream to obtain the transform result.
Specifically, the inverse graph Fourier transform based on Euclidean distance weights may be applied to the inverse-quantized coded code stream, where $\hat{\delta}$ denotes the inverse-transformed residual value, $\Phi$ denotes the transformation matrix, $\hat{\theta}$ denotes the quantized residual value of the target attribute of the current frame, and $\varepsilon$ denotes the inverse quantization coefficient.
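A decoder-side sketch matching the encoder sketches above: dequantize with the step size (playing the role of the inverse quantization coefficient ε) and invert the graph Fourier transform. Treating the inverse as multiplication by the eigenvector matrix is an assumption consistent with the forward transform sketched earlier, since the source text does not reproduce the patent's inverse-transform formula.

```python
import numpy as np


def inverse_gft(Phi, theta_q, step=1.0):
    """Dequantize the coefficients and apply the inverse graph Fourier
    transform; Phi holds the eigenvectors of Lg as columns, so this inverts
    theta = Phi.T @ delta."""
    return Phi @ (step * theta_q.astype(np.float64))
```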
Step 403: obtaining a decoded code stream based on the transform result.
In the embodiment of the present application, because the generalized Laplacian matrix is generated using the Euclidean distances between points, global correlation characteristics can be exploited to express the correlation between points more fully, so that the similarity between point cloud data is removed as far as possible and encoding performance is improved. With the performance of the encoding end improved, the data to be decoded is correspondingly optimized, so the decoding efficiency and performance improve accordingly.
The embodiment of the application also provides a coding device. Referring to fig. 5, fig. 5 is a structural diagram of an encoding apparatus according to an embodiment of the present application. Because the principle of the coding device for solving the problem is similar to the coding method in the embodiment of the present application, the implementation of the coding device can refer to the implementation of the method, and repeated details are not repeated.
As shown in fig. 5, the encoding apparatus 500 includes:
a first obtaining module 501, configured to cluster the point cloud data to be processed of a current frame to obtain a plurality of sub-point clouds; a first generating module 502, configured to generate a generalized Laplacian matrix for any target sub-point cloud of the plurality of sub-point clouds according to the Euclidean distances between a plurality of point pairs in the target sub-point cloud and the Euclidean distances between target points in the target sub-point cloud and the corresponding points of the target points; a first transformation module 503, configured to perform inter-frame prediction and graph Fourier residual transform on the target sub-point cloud by using the generalized Laplacian matrix; a first encoding module 504, configured to quantize and encode the transformed sub-point clouds respectively to obtain a coded code stream; wherein the corresponding points are located in a reference point cloud of the target sub-point cloud, and the reference point cloud is located in a reference frame of the current frame.
Optionally, the first obtaining module includes: a first processing submodule, configured to voxelize the point cloud data to be processed to obtain point cloud voxels; and a first obtaining submodule, configured to cluster the voxelized point cloud data to obtain the plurality of sub-point clouds.
Optionally, the first generating module includes:
the first obtaining submodule is used for obtaining a weight matrix according to Euclidean distances among a plurality of point pairs in the target sub-point cloud; the second obtaining submodule is used for obtaining a Laplace matrix according to the degree matrix and the weight matrix; a first generation submodule for generating a diagonal matrix; and the second generation submodule is used for obtaining the generalized Laplace matrix according to the diagonal matrix and the Laplace matrix.
Specifically, the second obtaining sub-module is configured to use a difference between the degree matrix and the weight matrix as a laplacian matrix; the second generation submodule is configured to use a sum of the diagonal matrix and the laplacian matrix as the generalized laplacian matrix.
The diagonal elements of the degree matrix are $d_i = \sum_j W_{ij}$, where $d_i$ denotes the $i$-th diagonal element of the degree matrix and $W_{ij}$ denotes the weight corresponding to the edge from the $i$-th point to the $j$-th point in the target sub-point cloud; $1 \le i \le M$ and $1 \le j \le M$, where $i$, $j$, and $M$ are integers and $M$ is the total number of points included in the target sub-point cloud.
The diagonal matrix is generated according to the Euclidean distances between target points in the target sub-point cloud and the corresponding points of the target points.
Optionally, the first obtaining sub-module includes:
the first calculating unit is used for calculating the Euclidean distance between the ith point and the jth point in the target sub-point cloud;
a first obtaining unit, configured to calculate the weights from the Euclidean distances and form the weight matrix from the weights,
where $W_{ij}$ denotes the weight corresponding to the edge from the $i$-th point to the $j$-th point in the target sub-point cloud, $d(i,j)$ denotes the Euclidean distance from the $i$-th point to the $j$-th point, and $\sigma$ is a non-zero constant serving as a tuning parameter; $1 \le i \le M$ and $1 \le j \le M$, where $i$, $j$, and $M$ are integers and $M$ is the total number of points included in the target sub-point cloud.
Optionally, the first generation submodule includes:
a first determination unit, configured to determine a reference point cloud of the target sub-point cloud in the reference frame; and a first generating unit, configured to generate the diagonal matrix based on the Euclidean distance between each point in the target sub-point cloud and its corresponding point in the reference point cloud, wherein the $i$-th diagonal element of the diagonal matrix is the reciprocal of the Euclidean distance between the $i$-th point and point $p$, and point $p$ is the corresponding point of the $i$-th point in the reference point cloud.
Optionally, the first determination unit is configured to determine the reference point cloud of the target sub-point cloud in the reference frame by using an iterative closest point algorithm.
Optionally, the first transformation module includes:
the first obtaining submodule is used for obtaining an attribute predicted value of the target attribute of the current frame from the reference frame; the first generation submodule is used for generating a residual error of the target attribute of the current frame according to the attribute value of the target attribute of the current frame and the attribute prediction value of the reference frame on the target attribute of the current frame; and the first transformation submodule is used for transforming the residual error of the target attribute of the current frame based on the generalized Laplace matrix.
Optionally, the first obtaining sub-module is configured to obtain the attribute prediction value of the target attribute of the current frame from the reference frame, where $\hat{x}_t$ denotes the attribute prediction value of the target attribute of the current frame from the reference frame, $x_{t-1}$ denotes the attribute value of the target attribute of the reference frame, and $L_g$ denotes the generalized Laplacian matrix.
Optionally, the first generating sub-module is configured to use a difference between the attribute value of the target attribute of the current frame and the attribute prediction value of the target attribute of the current frame with respect to the reference frame as the residual error.
Optionally, the first transformation submodule includes:
a first obtaining unit, configured to obtain a transformation matrix by using the generalized laplacian matrix; and the first transformation unit is used for transforming the residual error of the target attribute of the current frame by using the transformation matrix.
Optionally, the first obtaining unit is configured to solve the eigendecomposition of the generalized Laplacian matrix to obtain the transformation matrix.
Optionally, the first transforming unit is configured to obtain the transform result as $\theta = \Phi\,\delta$, where $\theta$ denotes the transform result, $\Phi$ denotes the transformation matrix, and $\delta$ denotes the residual of the target attribute of the current frame.
The apparatus provided in the embodiment of the present application may implement the method embodiment, and the implementation principle and the technical effect are similar, which are not described herein again.
The embodiment of the application also provides a decoding device. Referring to fig. 6, fig. 6 is a structural diagram of a decoding apparatus according to an embodiment of the present application. Because the principle of the decoding apparatus for solving the problem is similar to the decoding method in the embodiment of the present application, the implementation of the decoding apparatus can refer to the implementation of the method, and repeated details are not repeated.
As shown in fig. 6, the decoding apparatus 600 includes:
a first obtaining module 601, configured to obtain a coded code stream; a first transform module 602, configured to perform an inverse graph Fourier transform based on Euclidean distance weights on the coded code stream to obtain a transform result; a first decoding module 603, configured to obtain a decoded code stream based on the transform result; wherein the coded code stream is obtained by the encoding device by coding the result of performing inter-frame prediction and graph Fourier residual transform on sub-point clouds.
Optionally, the first transform module includes: a first processing submodule, configured to perform inverse quantization on the coded code stream; and a first transform submodule, configured to perform the inverse graph Fourier transform based on Euclidean distance weights on the inverse-quantized coded code stream to obtain the transform result.
Optionally, the first transform submodule is configured to perform the inverse graph Fourier transform based on Euclidean distance weights on the inverse-quantized coded code stream, where $\hat{\delta}$ denotes the inverse-transformed residual value, $\Phi$ denotes the transformation matrix, $\hat{\theta}$ denotes the quantized residual value of the target attribute of the current frame, and $\varepsilon$ denotes the inverse quantization coefficient.
The apparatus provided in the embodiment of the present application may implement the method embodiment, and the implementation principle and the technical effect are similar, which are not described herein again.
It should be noted that the division of the unit in the embodiment of the present application is schematic, and is only a logic function division, and there may be another division manner in actual implementation. In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented as a software functional unit and sold or used as a stand-alone product, may be stored in a processor readable storage medium. Based on such understanding, the technical solution of the present application may be substantially implemented or contributed by the prior art, or all or part of the technical solution may be embodied in a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) or a processor (processor) to execute all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
An embodiment of the present application further provides an electronic device, including: a memory, a processor and a program stored on the memory and executable on the processor, the processor implementing the steps in the encoding method or the decoding method as described above when executing the program.
The embodiment of the present application further provides a readable storage medium, where a program is stored on the readable storage medium, and when the program is executed by a processor, the program implements each process of the above encoding or decoding method embodiment, and can achieve the same technical effect, and in order to avoid repetition, the detailed description is omitted here. The readable storage medium may be any available medium or data storage device that can be accessed by a processor, including but not limited to magnetic memory (e.g., floppy disk, hard disk, magnetic tape, magneto-optical disk (MO), etc.), optical memory (e.g., CD, DVD, BD, HVD, etc.), and semiconductor memory (e.g., ROM, EPROM, EEPROM, nonvolatile memory (NAND FLASH), Solid State Disk (SSD)), etc.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. With such an understanding, the technical solutions of the present application may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the methods according to the embodiments of the present application.
While the present embodiments have been described with reference to the accompanying drawings, it is to be understood that the invention is not limited to the precise embodiments described above, which are meant to be illustrative and not restrictive, and that various changes may be made therein by those skilled in the art without departing from the spirit and scope of the invention as defined by the appended claims.
Claims (20)
1. An encoding method applied to an encoding device, the method comprising:
clustering point cloud data to be processed of a current frame to obtain a plurality of sub-point clouds;
generating a generalized Laplace matrix for any target sub-point cloud in the plurality of sub-point clouds according to Euclidean distances between a plurality of point pairs in the target sub-point cloud and Euclidean distances between target points in the target sub-point cloud and corresponding points of the target points;
performing inter-frame prediction and graph Fourier residual transformation on the target sub-point cloud by using the generalized Laplace matrix;
quantizing and coding the transformed multiple sub-point clouds respectively to obtain a coded code stream;
and the corresponding point is positioned in a reference point cloud of the target sub-point cloud, and the reference point cloud is positioned in a reference frame of the current frame.
2. The method of claim 1, wherein clustering point cloud data to be processed of a current frame to obtain a plurality of sub-point clouds comprises:
voxelizing the point cloud data to be processed to obtain point cloud voxels;
and clustering the voxelized point cloud data to obtain the plurality of sub-point clouds.
3. The method of claim 1, wherein generating a generalized Laplace matrix according to Euclidean distances between a plurality of point pairs in the target sub-point cloud and Euclidean distances between target points in the target sub-point cloud and corresponding points of the target points comprises:
obtaining a weight matrix according to Euclidean distances between a plurality of point pairs in the target sub-point cloud;
obtaining a Laplace matrix according to the degree matrix and the weight matrix;
generating a diagonal matrix;
and obtaining the generalized Laplace matrix according to the diagonal matrix and the Laplace matrix.
4. The method of claim 3,
obtaining a Laplace matrix according to the degree matrix and the weight matrix, wherein the obtaining of the Laplace matrix comprises the following steps:
using the difference between the degree matrix and the weight matrix as a Laplace matrix;
obtaining the generalized Laplace matrix according to the diagonal matrix and the Laplace matrix, including:
using a sum of the diagonal matrix and the laplacian matrix as the generalized laplacian matrix;
wherein the diagonal elements of the degree matrix are $d_i = \sum_j W_{ij}$, where $d_i$ denotes the $i$-th diagonal element of the degree matrix and $W_{ij}$ denotes the weight corresponding to the edge from the $i$-th point to the $j$-th point in the target sub-point cloud; $1 \le i \le M$ and $1 \le j \le M$, where $i$, $j$, and $M$ are integers and $M$ is the total number of points included in the target sub-point cloud;
and the diagonal matrix is generated according to the Euclidean distances between target points in the target sub-point cloud and the corresponding points of the target points.
5. The method of claim 3, wherein obtaining a weight matrix from Euclidean distances between pairs of points in the target sub-point cloud comprises:
calculating the Euclidean distance between the ith point and the jth point in the target sub-point cloud;
calculating weights according to the Euclidean distances, and forming the weight matrix from the weights;
wherein $W_{ij}$ denotes the weight corresponding to the edge from the $i$-th point to the $j$-th point in the target sub-point cloud, $d(i,j)$ denotes the Euclidean distance from the $i$-th point to the $j$-th point, and $\sigma$ is a non-zero constant serving as a tuning parameter; $1 \le i \le M$ and $1 \le j \le M$, where $i$, $j$, and $M$ are integers and $M$ is the total number of points included in the target sub-point cloud.
6. The method of claim 3, wherein generating the diagonal matrix comprises:
determining a reference point cloud of the target sub-point cloud in the reference frame;
generating the diagonal matrix based on Euclidean distances between each point in the target sub-point cloud and the corresponding point of each point in the reference point cloud;
wherein the $i$-th diagonal element of the diagonal matrix is the reciprocal of the Euclidean distance between the $i$-th point and point $p$, and point $p$ is the corresponding point of the $i$-th point in the reference point cloud.
7. The method of claim 6, wherein determining the reference point cloud of the target sub-point cloud in the reference frame comprises:
determining the reference point cloud of the target sub-point cloud in the reference frame by using an iterative closest point algorithm.
8. The method of claim 1, wherein using the generalized Laplace matrix to respectively perform inter-frame prediction and graph Fourier residual transform on the plurality of sub-point clouds comprises:
obtaining an attribute prediction value of the target attribute of the current frame from the reference frame;
generating a residual error of the target attribute of the current frame according to the attribute value of the target attribute of the current frame and the attribute prediction value of the reference frame to the target attribute of the current frame;
transforming a residual of a target attribute of the current frame based on the generalized Laplace matrix.
9. The method of claim 8, wherein obtaining the attribute prediction value of the target attribute of the current frame from the reference frame comprises:
obtaining the attribute prediction value of the target attribute of the current frame from the reference frame according to the attribute value of the target attribute of the reference frame and the generalized Laplace matrix.
10. The method according to claim 8, wherein generating the residual of the target attribute of the current frame according to the attribute value of the target attribute of the current frame and the attribute prediction value of the target attribute of the current frame from the reference frame comprises:
using the difference between the attribute value of the target attribute of the current frame and the attribute prediction value of the target attribute of the current frame from the reference frame as the residual.
11. The method of claim 8, wherein transforming the residual of the target property of the current frame based on the generalized Laplacian matrix comprises:
obtaining a transformation matrix by utilizing the generalized Laplace matrix;
and transforming the residual error of the target attribute of the current frame by using the transformation matrix.
13. The method of claim 11, wherein transforming the residual of the target attribute of the current frame using the transformation matrix comprises:
obtaining the transform result by applying the transformation matrix to the residual of the target attribute of the current frame.
14. A decoding method applied to a decoding device, the method comprising:
acquiring a coded code stream;
performing an inverse graph Fourier transform based on Euclidean distance weights on the coded code stream to obtain a transform result;
obtaining a decoded code stream based on the transform result;
wherein the coded code stream is obtained by coding a result of performing inter-frame prediction and graph Fourier residual transform on sub-point clouds by using a generalized Laplacian matrix.
15. The method according to claim 14, wherein performing the inverse graph Fourier transform based on Euclidean distance weights on the coded code stream to obtain a transform result comprises:
performing inverse quantization on the coded code stream;
and performing the inverse graph Fourier transform based on Euclidean distance weights on the inverse-quantized coded code stream to obtain the transform result.
16. The method according to claim 15, wherein performing the inverse graph Fourier transform based on Euclidean distance weights on the inverse-quantized coded code stream comprises:
performing the inverse graph Fourier transform based on Euclidean distance weights on the inverse-quantized coded code stream by using the transformation matrix, the quantized residual value of the target attribute of the current frame, and the inverse quantization coefficient.
17. An encoding apparatus, comprising:
the first acquisition module is used for clustering point cloud data to be processed of a current frame to obtain a plurality of sub-point clouds;
the first generation module is used for generating a generalized Laplace matrix for any target sub-point cloud in the plurality of sub-point clouds according to Euclidean distances between a plurality of point pairs in the target sub-point cloud and Euclidean distances between target points in the target sub-point cloud and corresponding points of the target points;
the first transformation module is used for performing inter-frame prediction and graph Fourier residual transform on the target sub-point cloud by using the generalized Laplace matrix;
the first coding module is used for quantizing and coding the transformed sub-point clouds respectively to obtain a coded code stream;
and the corresponding point is positioned in a reference point cloud of the target sub-point cloud, and the reference point cloud is positioned in a reference frame of the current frame.
18. A decoding apparatus, comprising:
the first acquisition module is used for acquiring a coded code stream;
the first transformation module is used for performing an inverse graph Fourier transform based on Euclidean distance weights on the coded code stream to obtain a transform result;
the first decoding module is used for obtaining a decoded code stream based on the transform result;
wherein the coded code stream is obtained by the encoding device by coding a result of performing inter-frame prediction and graph Fourier residual transform on sub-point clouds.
19. An electronic device, comprising: a memory, a processor, and a program stored on the memory and executable on the processor; characterized in that
the processor is configured to read the program in the memory to implement the steps in the encoding method according to any one of claims 1 to 13, or the steps in the decoding method according to any one of claims 14 to 16.
20. A readable storage medium for storing a program, wherein the program, when executed by a processor, implements the steps in the encoding method of any one of claims 1 to 13; or implementing the steps in the decoding method of any of claims 14 to 16.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111160289.XA CN113766229B (en) | 2021-09-30 | 2021-09-30 | Encoding method, decoding method, device, equipment and readable storage medium |
PCT/CN2022/123245 WO2023051783A1 (en) | 2021-09-30 | 2022-09-30 | Encoding method, decoding method, apparatus, device, and readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111160289.XA CN113766229B (en) | 2021-09-30 | 2021-09-30 | Encoding method, decoding method, device, equipment and readable storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113766229A true CN113766229A (en) | 2021-12-07 |
CN113766229B CN113766229B (en) | 2023-04-28 |
Family
ID=78798550
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111160289.XA Active CN113766229B (en) | 2021-09-30 | 2021-09-30 | Encoding method, decoding method, device, equipment and readable storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN113766229B (en) |
WO (1) | WO2023051783A1 (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114785998A (en) * | 2022-06-20 | 2022-07-22 | 北京大学深圳研究生院 | Point cloud compression method and device, electronic equipment and storage medium |
WO2023051783A1 (en) * | 2021-09-30 | 2023-04-06 | 咪咕文化科技有限公司 | Encoding method, decoding method, apparatus, device, and readable storage medium |
WO2023173238A1 (en) * | 2022-03-12 | 2023-09-21 | Oppo广东移动通信有限公司 | Encoding method, decoding method, code stream, encoder, decoder, and storage medium |
CN116797625A (en) * | 2023-07-20 | 2023-09-22 | 无锡埃姆维工业控制设备有限公司 | Monocular three-dimensional workpiece pose estimation method |
WO2023245981A1 (en) * | 2022-06-20 | 2023-12-28 | 北京大学深圳研究生院 | Point cloud compression method and apparatus, and electronic device and storage medium |
WO2024077911A1 (en) * | 2022-10-13 | 2024-04-18 | Beijing Bytedance Network Technology Co., Ltd. | Method, apparatus, and medium for point cloud coding |
CN118433395A (en) * | 2024-07-02 | 2024-08-02 | 北京数原数字化城市研究中心 | Encoding method and device and electronic equipment |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108171761A (en) * | 2017-12-13 | 2018-06-15 | 北京大学 | A kind of point cloud inner frame coding method and device that transformation is schemed based on Fourier |
WO2020197086A1 (en) * | 2019-03-25 | 2020-10-01 | 엘지전자 주식회사 | Point cloud data transmission device, point cloud data transmission method, point cloud data reception device, and point cloud data reception method |
CN112385238A (en) * | 2019-07-10 | 2021-02-19 | 深圳市大疆创新科技有限公司 | Data encoding method, data decoding method, equipment and storage medium |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110418135B (en) * | 2019-08-05 | 2022-05-27 | 北京大学深圳研究生院 | Point cloud intra-frame prediction method and device based on neighbor weight optimization |
CN110572655B (en) * | 2019-09-30 | 2023-01-10 | 北京大学深圳研究生院 | Method and equipment for encoding and decoding point cloud attribute based on neighbor weight parameter selection and transmission |
CN113766229B (en) * | 2021-09-30 | 2023-04-28 | 咪咕文化科技有限公司 | Encoding method, decoding method, device, equipment and readable storage medium |
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108171761A (en) * | 2017-12-13 | 2018-06-15 | 北京大学 | A kind of point cloud inner frame coding method and device that transformation is schemed based on Fourier |
WO2020197086A1 (en) * | 2019-03-25 | 2020-10-01 | 엘지전자 주식회사 | Point cloud data transmission device, point cloud data transmission method, point cloud data reception device, and point cloud data reception method |
CN112385238A (en) * | 2019-07-10 | 2021-02-19 | 深圳市大疆创新科技有限公司 | Data encoding method, data decoding method, equipment and storage medium |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023051783A1 (en) * | 2021-09-30 | 2023-04-06 | 咪咕文化科技有限公司 | Encoding method, decoding method, apparatus, device, and readable storage medium |
WO2023173238A1 (en) * | 2022-03-12 | 2023-09-21 | Oppo广东移动通信有限公司 | Encoding method, decoding method, code stream, encoder, decoder, and storage medium |
CN114785998A (en) * | 2022-06-20 | 2022-07-22 | 北京大学深圳研究生院 | Point cloud compression method and device, electronic equipment and storage medium |
WO2023245981A1 (en) * | 2022-06-20 | 2023-12-28 | 北京大学深圳研究生院 | Point cloud compression method and apparatus, and electronic device and storage medium |
WO2024077911A1 (en) * | 2022-10-13 | 2024-04-18 | Beijing Bytedance Network Technology Co., Ltd. | Method, apparatus, and medium for point cloud coding |
CN116797625A (en) * | 2023-07-20 | 2023-09-22 | 无锡埃姆维工业控制设备有限公司 | Monocular three-dimensional workpiece pose estimation method |
CN116797625B (en) * | 2023-07-20 | 2024-04-19 | 无锡埃姆维工业控制设备有限公司 | Monocular three-dimensional workpiece pose estimation method |
CN118433395A (en) * | 2024-07-02 | 2024-08-02 | 北京数原数字化城市研究中心 | Encoding method and device and electronic equipment |
CN118433395B (en) * | 2024-07-02 | 2024-09-03 | 北京数原数字化城市研究中心 | Encoding method and device and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
CN113766229B (en) | 2023-04-28 |
WO2023051783A1 (en) | 2023-04-06 |
Legal Events
Date | Code | Title | Description
---|---|---|---
| PB01 | Publication |
| SE01 | Entry into force of request for substantive examination |
| GR01 | Patent grant |