In a point cloud encoder/decoder framework, such as Geometry-based Point Cloud Compression (G-PCC) coding frameworks of the Audio Video coding Standard (AVS) and the Motion Picture Expert Group (MPEG), geometric information of a point cloud and attribute information corresponding to each point are coded separately. First, coordinate conversion is performed on the geometric information so as to include the whole point cloud in a bounding box. Then, quantization is performed. This quantization step mainly implements scaling. Rounding in quantization makes the geometric information corresponding to a part of points the same, and whether to remove duplicate points is determined based on a parameter. The process of quantization and removal of the duplicate points is also referred to as a voxelization process. Next, octree partitioning is performed on the bounding box. In an octree-based geometric information coding framework, the bounding box is equally partitioned into eight sub-cubes. A nonempty sub-cube (including a point in the point cloud) continues to be equally partitioned into eight parts, and the partitioning is performed until a leaf node obtained by partitioning is a 1×1×1 unit cube. Points in the leaf node are coded to generate a binary bitstream. In the present point cloud coding/decoding standard, the geometric partitioning order includes a breadth first traversal order. Specifically, when geometric octree partitioning is performed, nodes of the current layer are partitioned first, nodes of the next layer continue to be partitioned when the nodes in the current layer are all partitioned, and partitioning is stopped when a leaf node obtained by partitioning is a 1×1×1 unit cube.
However, in the existing point cloud coding/decoding standard, when geometric octree partitioning is performed, occupation information of each child node corresponds to an independent context respectively. As a result, the spatial correlation is not used fully, and the point cloud coding/decoding efficiency is reduced.
The embodiments of the disclosure relate to video image processing technologies, and provide point cloud coding and decoding methods, an encoder, a decoder, and a storage medium, which may implement coding/decoding by full use of a geometric spatial correlation between nodes, thereby improving the geometric coding/decoding performance.
In a first aspect, the embodiments of the disclosure provide a cloud point coding method, which may be applied to an encoder and include the following operations.
A bounding box of a point cloud to be coded is determined.
Coordinate information of a current node is acquired in case that octree partitioning is performed on the bounding box of the point cloud to be coded based on a Morton code order.
A first neighbor pattern of the current node is determined based on the coordinate information of the current node.
An entropy coding parameter of the current node is determined based on the first neighbor pattern of the current node.
Entropy coding is performed on occupation information of the current node based on the entropy coding parameter.
In a second aspect, the disclosure provides a point cloud decoding method, which may be applied to a decoder and include the following operations.
A bitstream is parsed to determine a bounding box of a point cloud to be decoded.
Coordinate information of a current node is acquired in case that the bounding box of the point cloud to be decoded is partitioned based on a Morton code order.
A first neighbor pattern of the current node is determined based on the coordinate information of the current node.
An entropy decoding parameter of the current node is determined based on the first neighbor pattern of the current node.
Entropy decoding is performed on occupation information of the current node based on the entropy decoding parameter.
In a third aspect, the embodiments of the disclosure provide an encoder, which may include a first determination unit, a first acquisition unit, a first processing unit, a first prediction unit, and a coding unit.
The first determination unit may be configured to determine a bounding box of a point cloud to be coded.
The first acquisition unit may be configured to acquire coordinate information of a current node in case that octree partitioning is performed on the bounding box of the point cloud to be coded based on a Morton code order.
The first processing unit may be configured to determine a first neighbor pattern of the current node based on the coordinate information of the current node.
The first prediction unit may be configured to determine an entropy coding parameter of the current node based on the first neighbor pattern of the current node.
The coding unit may be configured to perform entropy coding on occupation information of the current node based on the entropy coding parameter.
In a fourth aspect, the embodiments of the disclosure provide a decoder, which may include a parsing unit, a second acquisition unit, a second processing unit, a second prediction unit, and a decoding unit.
The parsing unit may be configured to parse a bitstream to determine a bounding box of a point cloud to be decoded.
The second acquisition unit may be configured to acquire coordinate information of a current node in case that the bounding box of the point cloud to be decoded is partitioned based on a Morton code order.
The second processing unit may be configured to determine a first neighbor pattern of the current node based on the coordinate information of the current node.
The second prediction unit may be configured to determine an entropy decoding parameter of the current node based on the first neighbor pattern of the current node.
The decoding unit may be configured to perform entropy decoding on occupation information of the current node based on the entropy decoding parameter.
In a fifth aspect, the embodiments of the disclosure also provide an encoder, which may include a first memory and a first processor. The first memory may store a computer program capable of running on the first processor. The first processor may execute the program to implement the point cloud coding method for an encoder.
In a sixth aspect, the embodiments of the disclosure provide a decoder, which includes a second memory and a second processor. The second memory stores a computer program capable of running on the second processor. The second processor may execute the program to implement the point cloud decoding method for a decoder.
In a seventh aspect, the embodiments of the disclosure provide a storage medium, on which a computer program is stored. The computer program may be executed by a first processor to implement the point cloud coding method for an encoder. Alternatively, the computer program may be executed by a second processor to implement the point cloud decoding method for a decoder.
The embodiments of the disclosure provide point cloud coding and decoding methods, an encoder, a decoder, and a storage medium. The point cloud coding method includes the following operations. A bounding box of a point cloud to be coded is determined; coordinate information of a current node is acquired in case that octree partitioning is performed on the bounding box of the point cloud to be coded based on a Morton code order; a first neighbor pattern of the current node is determined based on the coordinate information of the current node; an entropy coding parameter of the current node is determined based on the first neighbor pattern of the current node; and entropy coding is performed on occupation information of the current node based on the entropy coding parameter. In this manner, an entropy coding parameter of a child node in the current node is predicted by using the neighbor pattern of the current node, namely a position distribution of occupied neighbor nodes of the current node, and entropy coding is performed on occupation information of the child node by using the predicted entropy coding parameter, so that a geometric spatial correlation between neighbor nodes may be used well for coding/decoding, thereby improving the geometric coding/decoding performance.
In order to make the characteristics and technical contents of the embodiments of the disclosure understood in more detail, the implementation of the embodiments of the disclosure will be described below in combination with the drawings in detail. The appended drawings are only for description as references and not intended to limit the embodiments of the disclosure.
The geometric information is reconstructed in an attribute coding process. At present, attribute coding is performed mainly for color and reflectivity information. First, it is determined whether to perform color space conversion. If it is determined to perform color space conversion, the color information is converted from a Red Green Blue (RGB) color space to a YUV color space. Then, the reconstructed point cloud is recolored by use of the original point cloud so as to achieve a correspondence between the uncoded attribute information and the reconstructed geometric information. When the color information is coded, backward difference prediction is performed immediately after the point cloud is ordered based on a Morton code. Finally, a prediction residual is quantized and coded to generate an attribute bitstream.
A high-level syntax element, a geometric bitstream, and an attribute bitstream form a binary bitstream of a three-dimensional image. An encoder sends the binary bitstream to a decoder.
Point cloud coding and decoding methods provided in the embodiments of the disclosure are mainly used for the entropy coding part and entropy decoding part enclosed by the dashed boxes in
Based on the background introduced above, the point cloud coding method and point cloud decoding method provided in the embodiments of the disclosure will be introduced respectively below.
An embodiment of the disclosure provides a cloud point coding method. The method is applied to an encoder, and as shown in
At S101, a bounding box of a point cloud to be coded is determined.
In the embodiment of the disclosure, when a three-dimensional image model is coded, point cloud data of the three-dimensional image model to be coded in a space is acquired. The point cloud data includes geometric information and attribute information of the three-dimensional image model. When the three-dimensional image model is coded, geometric information of the point cloud and attribute information corresponding to each point are coded separately. The geometric information is coded by the point cloud coding method provided in the embodiment of the disclosure.
Specifically, when the geometric information is coded, coordinate conversion is performed on geometric positions to include the whole point cloud in a bounding box.
Further, the bounding box is quantized. This quantization step mainly implements scaling. Rounding in quantization makes the geometric information corresponding to a part of points the same. Whether to remove duplicate points is determined based on a parameter. The process of quantization and removal of the duplicate points is also referred to as the voxelization process.
When the original point cloud is quantized, multiple different points may be quantized into duplicate points. When there are multiple duplicate points, if the parameter indicates to perform duplicate point removal, the multiple duplicate points are processed as one point. If the parameter indicates not to perform duplicate point removal, the multiple points with the same geometric coordinate are still processed as multiple points. Next, octree partitioning is performed on the processed bounding box.
At S102, coordinate information of a current node is acquired in case that octree partitioning is performed on the bounding box of the point cloud to be coded based on a Morton code order.
In the embodiment of the disclosure, when octree partitioning is performed on the bounding box of the point cloud, Morton codes of the point cloud are calculated first. Then, a geometric octree is constructed from a root node N00 (the zeroth layer) according to the Morton codes and a Morton code order. The bounding box is equally partitioned into eight sub-cubes. A nonempty sub-cube (including a point in the point cloud) continues to be equally partitioned into eight parts, and partitioning is stopped when a leaf node obtained by partitioning is a 1×1×1 unit cube.
The Morton code order may be a breadth first traversal order or a depth first traversal order. The breadth first traversal order is as follows: when octree partitioning is performed on the geometric information, nodes of the current layer are partitioned first, nodes of the next layer continue to be partitioned when the nodes in the current layer are all partitioned, and partitioning is stopped when a leaf node obtained by partitioning is a 1×1×1 unit cube. The depth first traversal order is as follows: when octree partitioning is performed on the geometric information, a first node of the current layer is continuously partitioned first, partitioning of the current node is stopped when a leaf node obtained by partitioning is a 1×1×1 unit cube, subsequent nodes of the current layer are partitioned in this order, and partitioning is stopped when the nodes in the current layer are all partitioned. That is, according to the point cloud coding method provided in the embodiment of the disclosure, an entropy coding parameter of a child node may be predicted by use of a neighbor pattern of the current node in the breadth first traversal order or the depth first traversal order.
It is assumed that a geometric position of a point in the cloud point is represented by a three-dimensional Cartesian coordinate (X, Y, Z). Each coordinate value is represented by N bits, and a coordinate (Xk, Yk, Zk) of the k-th point may be represented as follows:
X
k=(xN-1kxN-2k . . . x1kx0k)
Y
k=(yN-1kyN-2k . . . y1ky0k)
Z
k=(zN-1kzN-2k . . . z1kz0k)
k-th is a kth point in the point cloud. A Morton code corresponding to the k-th point may be represented as follows:
M
k=(xN-1kyN-1kzN-1k,xN-2kyN-2kzN-2k, . . . x1ky1kz1k,x0ky0kz0k)
Every three bits are represented by an octal number mnk=(xnkynkznk), n=0, 1, . . . , N−1, and then the Morton code corresponding to the k-th point may be represented as follows:
M
k=(mN-1kmN-2k . . . m1km0k)
A specific partitioning process is as follows.
At 1, all points are partitioned into eight child nodes first according to the zeroth octal number of the Morton code mN-1k as follows.
All points corresponding to mN-1k=0 are partitioned into the zeroth child node N01.
All points corresponding to mN-1k=1 are partitioned into the first child node N11.
All points corresponding to mN-1k=7 are partitioned into the seventh child node N71.
Then, nodes of the first layer of the octree include the eight nodes.
At 2, eight bits B00=(b0b1b2b3b4b5b6b7) represents whether the eight child nodes of the root node N00 are occupied. If Nk1, k=0, 1, . . . , 7 includes at least one point in the point cloud, a corresponding bit is bk=1. If the child node does not include any point, the corresponding bit is bk=0.
At 3, an occupied node Nln1 in the first layer is further partitioned into eight child nodes according to the first octal number mN-2k of the Morton code of the geometric position. Occupation information of the child node of the occupied node is represented by eight bits Bln1 (in represents a sequence number of the occupied node, n=0, 1, . . . , N1−1, and N1 represents the number of occupied nodes of the first layer).
At 4, an occupied node Nlnt in a t-th layer (t=2, 3, . . . , N−2) is further partitioned into eight child nodes according to a t-th octal number MN-1-tk of the Morton code of the geometric position. Occupation information of the child node of the occupied node is represented by eight bits Blnt (ln represents a sequence number of the occupied node, n=0, 1, . . . , Nt−1, and Nt represents the number of occupied nodes of the t-th layer).
At 5, all nodes of layer t=N−1 become leaf nodes. If the encoder allows duplicate points according to a configuration parameter, it is necessary to record the number of duplicate points on each occupied leaf node in a bitstream. If the encoder does not allow any duplicate point according to the configuration parameter, the duplicate points on each occupied leaf node are processed as one point.
In the embodiment of the disclosure, the encoder, when performing octree partitioning on the bounding box of the point cloud, may determine coordinate information of a current node to be coded according to coordinate information of points in the current node.
At S103, a first neighbor pattern of the current node is determined based on the coordinate information of the current node.
Neighbor nodes in the space are highly correlated. Therefore, entropy coding parameters of eight child nodes in the current node may be predicted according to occupation information of coded neighbor nodes of the current node, and entropy coding is performed on the child nodes by use of the entropy coding parameters matched with the child nodes. As such, a spatial correlation may be used fully.
In some embodiments of the disclosure, the encoder determines a neighbor node practically through the following process. The encoder may determine coordinate information of the neighbor node of the current node based on a three-dimensional Morton code, a preset shift parameter, and the coordinate information of the current node, so as to further determine occupation information of the neighbor node.
Specifically, the operation that the first neighbor pattern of the current node is determined based on the coordinate information of the current node includes the following operations. Coordinate information of each of neighbor nodes corresponding to the current node is determined based on the coordinate information of the current node and a first neighbor range of the current node. Occupation information of the neighbor node is determined based on the coordinate information of the neighbor node. The first neighbor pattern of the current node is determined based on the occupation information of the neighbor nodes and preset first neighbor node weight values.
The first neighbor range defines neighbor nodes of the current node. For example, the first neighbor range includes at least one of neighbor nodes co-planar with the current node, neighbor nodes co-linear with the current node, or neighbor nodes co-dot with the current node. When the bounding box is equally partitioned into eight sub-cubes, there are 6 neighbor nodes co-planar with the current node, 12 neighbor nodes co-linear with the current node, 8 neighbor nodes co-dot with the current node, 18 neighbor nodes co-planar and co-linear with the current node, and 26 neighbor nodes co-planar, co-linear, and co-dot with the current node.
The occupation information is used to represent whether there is a point cloud distributed in the neighbor node, and may be represented by a binary number. For example, 0 represents that the neighbor node is empty and has no point cloud distributed therein, and 1 represents that the neighbor node is nonempty and has a point cloud distributed therein.
The first neighbor pattern represents a position distribution of occupied neighbor nodes within the first neighbor range of the current node. The first neighbor node weight values define correlations between the current node and neighbor nodes at different positions. If the weight value is greater, the correlation with the current node is higher. If the weight value is smaller, the correlation with the current node is lower.
At S104, an entropy coding parameter of the current node is determined based on the first neighbor pattern of the current node.
In the embodiment of the disclosure, the first neighbor pattern represents a position distribution of occupied neighbor nodes within the first neighbor range of the current node. When entropy coding is performed on the current node, entropy coding parameters of eight child nodes in the current node may be predicted according to occupation information of one or more coded neighbor nodes. That is, corresponding contexts are selected for different nodes according to correlations between the nodes, thereby fully using the spatial correlation.
In some embodiments of the disclosure, the operation that the entropy coding parameter of the current node is determined based on the first neighbor pattern of the current node includes the following operations. Neighbor information of a child node in the current node is determined based on the first neighbor pattern of the current node. An entropy coding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node.
That is, when a context of a child node is determined, neighbor information of the child node is determined first according to the first neighbor pattern of the current node (which may also be referred to as a parent node), and then a corresponding entropy coding parameter is selected for the child node according to the neighbor information of the child node.
In some embodiments of the disclosure, the entropy coding parameter is a context index of the child node. That is, in the disclosure, when entropy coding is performed on the current node, a context index corresponding to each child node in the current node is predicted according to the first neighbor pattern of the current node, a context corresponding to the child node is determined according to the context index, and entropy coding is performed on occupation information of the child node in the context corresponding to the child node. Therefore, the spatial correlation between nodes may be used fully, and the coding performance may be improved.
It is to be noted that, in the related art, occupation information of the eight child nodes in the current node corresponds to respective independent contexts. That is, a space occupation code of the node includes eight bits B00=(b0b1b2b3b4b5b6b7) that represents occupation conditions of the eight child nodes of this node respectively. Entropy coding is performed on each bit by use of the independent context. In this method, only the eight contexts may be used for separate coding when entropy coding is performed on the current node, and the spatial correlation is not used fully, reducing the point cloud coding efficiency. However, in the disclosure, multiple contexts may be set for different child nodes by taking the occupation information of the neighbor node into account, so that a more suitable context is selected for entropy coding of each child node. Therefore, the coding performance may be improved.
In some embodiments of the disclosure, the neighbor information is a number of occupied neighbor nodes of the child node or a second neighbor pattern of the child node. The second neighbor pattern represents a position distribution of occupied neighbor nodes within a second neighbor range of the child node.
Specifically, when the neighbor information is the number of occupied neighbor nodes, the operation that neighbor information of the child node in the current node is determined based on the first neighbor pattern of the current node includes the following operations.
The number of occupied neighbor nodes of the child node corresponding to the first neighbor pattern of the current node is acquired from a preset first mapping table. The first mapping table includes a mapping relationship between at least one first neighbor pattern of the current node and a number of occupied neighbor nodes of the child node of the current node.
Alternatively, occupation information of the neighbor nodes corresponding to the current node is determined based on the first neighbor pattern of the current node, and the number of occupied neighbor nodes of the child node is determined based on the occupation information of the neighbor nodes corresponding to the current node and the second neighbor range of the child node.
The second neighbor range includes at least one of neighbor nodes of the current node co-planar with the child node, neighbor nodes of the current node co-linear with the child node, or neighbor nodes of the current node co-dot with the child node. When the bounding box is equally partitioned into eight sub-cubes, there are three neighbor nodes of the current node (i.e., the parent node) co-planar with the child node, three neighbor nodes of the parent node co-linear with the child node, one neighbor node co-dot with the current node, six neighbor nodes of the parent node co-planar and co-linear with the child node, and seven neighbor nodes of the parent node co-planar, co-linear, and co-dot with the child node.
In some embodiments of the disclosure, the second neighbor range of the child node is a subset of the first neighbor range of the current node.
That is, when the number of occupied neighbor nodes of the child node is determined, a first mapping table may be preset, and numbers of occupied neighbor nodes of the eight child nodes are indexed according to the first neighbor pattern of the parent node. The first mapping table may be a two-dimensional table neighborPatternToChildNeighborCount[64][8] shown in Table 1, in which numbers of occupied neighbor nodes of the eight child nodes corresponding to 64 different neighbor patterns are stored. It is to be noted that the neighbor nodes herein are six neighbor nodes co-planar with the current node within the range. That is, there are 64 neighbor patterns, and if the neighbor range changes, the neighbor pattern and the correspondence also change. Detailed correspondences refer to Table 1, and will not be elaborated herein.
Looking up the first mapping table may reduce the calculation amount and improve the coding efficiency. However, in practical applications, the number of occupied neighbor nodes may also be calculated directly. That is, the occupation information of the neighbor nodes is determined according to the first neighbor pattern, and the number of occupied neighbor nodes of the child node is determined based on the occupation information of the neighbor nodes and the second neighbor range of the child node. In this manner, it is unnecessary to preestablish and store the first mapping table, and the number of occupied neighbor nodes of the child node may be determined directly according to the first neighbor pattern. Therefore, if a processor is powerful enough in calculation, the influence of the calculation process on the processing performance may be ignored, the early table creation process is eliminated, and storage space is saved.
In practical applications, the first mapping table may be created by the following method. The numbers of occupied neighbor nodes of the eight child nodes of the current node in different first neighbor patterns are obtained in advance through the above-mentioned calculation process, thereby creating the first mapping table.
Assuming that the first neighbor pattern of the current node is 27, it is obtained by looking up Table 1 or by calculation that the number of occupied neighbor nodes of the first child node b0 is 2, the number of occupied neighbor nodes of the second child node b1 is 1, the number of occupied neighbor nodes of the third child node b2 is 3, the number of occupied neighbor nodes of the fourth child node b3 is 2, the number of occupied neighbor nodes of the fifth child node b4 is 2, the number of occupied neighbor nodes of the sixth node b5 is 1, the number of occupied neighbor nodes of the seventh child node b6 is 3, and the number of occupied neighbor nodes of the eighth child node b7 is 2.
In some embodiments of the disclosure, the entropy coding parameter of the current node is a context index of the child node in the current node. The operation that the entropy coding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node includes that: the context index of the child node is determined based on the number of occupied neighbor nodes of the child node in the current node. For example, the number of occupied neighbor nodes of the child node is determined as the context index of the child node. If the number of occupied neighbor nodes of each child node in
In some embodiments of the disclosure, the entropy coding parameter of the current node is a context index of the child node in the current node. The operation that the entropy coding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node includes the following operations. A context index of a first child node in the current node is determined based on neighbor information of the first child node. A context index of a target child node in the current node is determined based on neighbor information of the target child node and occupation information of a coded child node in the current node. The target child node is a different node from the first child node.
Exemplarily, a method for determining the context index of the child node specifically includes the following operations. A context index of the first child node is determined based on a number of occupied neighbor nodes of the first child node in the current node. A context index of the target child node is determined based on a number of occupied neighbor nodes of the target child node in the current node and occupation information of a coded child node in the current node. The target child node is a different node from the first child node.
That is, when entropy coding is performed on the current node, the first child node is coded first. Therefore, a context index of the first child node may be determined only by means of the neighbor pattern of the current node.
The target child nodes are the second child node to the eighth child node. When entropy coding is performed on the second child node to the eighth child node, the occupation information of the coded child node has been coded, so the coded child node may also be regarded as a neighbor node of the uncoded child node. Therefore, context prediction is performed on the child node to be coded (i.e., the target child node) according to the number of occupied neighbor nodes of the child node to be coded and the occupation information of the coded child node to determine the context index of the child node to be coded. As such, richer contexts may be provided, and the coding performance of the target child node may further be improved.
For example, a method for determining the context index of the first child node specifically includes the following operations. The neighbor information of the first child node is determined as the context index of the first child node. The occupation information of the coded child node in the current node and the neighbor information of the target child node form a binary index, and the binary index is determined as the context index of the target child node.
The corresponding context may be indexed directly by using the binary index. Alternatively, the binary index may be converted into a unitary index to index the corresponding context. For example, the context index of the target child node is obtained by taking the occupation information of the coded child node in the current node as a high bit and the number of occupied neighbor nodes of the target child node as low bits. Alternatively, the context index of the first child node is calculated by taking the second neighbor pattern of the target child node as a high bit and the occupation information of the coded child node as low bits.
That is, the context index of the first child node is the number of occupied neighbor nodes. For example, when the context index of the second child node is calculated, if the number of occupied neighbor nodes is 3, the context index is 3. For a non-first child node, the occupation information of the coded child node and the number of occupied neighbor nodes are both represented by binary numbers, and the context index of the child node is obtained by taking the occupation information as a high bit and the number of occupied neighbor nodes as low bits. For example, if the number of occupied neighbor nodes is 3, converted to a binary number 0011, and the occupation information of the coded child node is 1, the context index 0111 is obtained by merging, corresponding to a decimal number 7.
Table 2 shows the number of contexts of each child node in the current node. The number of occupied neighbors of three neighbors co-planar with each child node in the current node is obtained by use of the neighbor pattern of the current node, and is stored in chilNeighborCount[8]. The number of occupied neighbors of the three neighbors co-planar with each child node is valued to 0 to 3. The context index of b0 is chilNeighborCount[0], which is valued to 0 to 3, namely there are four contexts. The context index of b1 is (b0, chilNeighborCount[1]) with eight values, namely there are eight contexts. In a similar manner, there are allocated totally 1,020 contexts.
At S105, entropy coding is performed on occupation information of the current node based on the entropy coding parameter.
The entropy coding parameter is used to index an entropy encoder, and entropy coding is performed on the occupation information of the child node by use of the entropy encoder indexed according to the entropy coding parameter. The entropy encoder is constructed according to different contexts of the child node. In the embodiment of the disclosure, the context of the child node is equivalent to the entropy encoder of the child node.
Specifically, the entropy coding parameter is the context index of the child node. The operation that entropy coding is performed on occupation information of the current node based on the entropy coding parameter includes the following operations. A context corresponding to the child node in the current node is determined based on the context index of the child node in the current node. Entropy coding is performed on occupation information of the child node in the context corresponding to the child node. After entropy coding is performed on the occupation information of the child node, coded occupation information is written into a geometric bitstream, such that a decoder decodes the occupation information of the child node in the current node from the geometric bitstream.
Table 3 shows eight different context sets corresponding to eight different child nodes. Each context set includes multiple contexts. A target context in the corresponding context set used for practical coding of each child node is determined according to the context index calculated above. That is, the target context corresponding to each child node in Table 3 is determined according to the corresponding context index in Table 2. Entropy coding is performed on the occupation information of the child node by use of the target context to obtain the geometric bitstream.
Specifically, in case that the neighbor information is the second neighbor pattern of the child node, the operation that neighbor information of each child node in the current node is determined based on the first neighbor pattern of the current node includes the following operations.
The second neighbor pattern of the child node corresponding to the first neighbor pattern of the current node is acquired from a preset second mapping table. The second mapping table includes a mapping relationship between at least one first neighbor pattern of the current node and second neighbor patterns of the child nodes of the current node.
Alternatively, the occupation information of the neighbor nodes corresponding to the current node is determined based on the first neighbor pattern of the current node, and the second neighbor pattern of the child node is determined based on the occupation information of the neighbor nodes corresponding to the current node, and the second neighbor range of the child node, and preset second neighbor node weight values.
The second neighbor range includes at least one of neighbor nodes of the current node co-planar with the child node, neighbor nodes of the current node co-linear with the child node, or neighbor nodes of the current node co-dot with the child node. When the bounding box is equally partitioned into eight sub-cubes, there are three neighbor nodes of the parent node co-planar with the child node, three neighbor nodes of the parent node co-linear with the child node, one neighbor node co-dot with the current node, six neighbor nodes of the parent node co-planar and co-linear with the child node, and seven neighbor nodes of the parent node co-planar, co-linear, and co-dot with the child node.
That is, in some embodiments of the disclosure, a correlation between a neighbor node of the child node and the child node is also considered. A weight value is set for the neighbor node of the child node to represent the correlation between the neighbor node and the child node. If the weight value is greater, the correlation with the current node is higher. If the weight value is smaller, the correlation with the current node is lower.
In some embodiments of the disclosure, the second neighbor range of the child node is a subset of the first neighbor range of the current node.
That is, when the second neighbor pattern of the child node is determined, a second mapping table may be preset, and second neighbor patterns of the eight child nodes are indexed according to the first neighbor pattern of the parent node. If there are 64 neighbor patterns for the current node, the second mapping table stores second neighbor patterns of the eight child nodes corresponding to the 64 first neighbor patterns respectively. For example, the neighbor nodes are six neighbor nodes co-planar with the current node. That is, there are 64 neighbor patterns, and if the neighbor range changes, the neighbor pattern also changes. Detailed correspondences will not be elaborated herein.
Looking up the second mapping table may reduce the calculation amount and improve the coding efficiency. However, in practical applications, the second neighbor pattern may also be calculated directly. That is, the occupation information of the neighbor nodes is determined according to the second neighbor pattern, and the second neighbor pattern of the child node is determined based on the occupation information of the neighbor nodes, the second neighbor range of the child node, and the second neighbor node weight values. In this manner, it is unnecessary to preestablish and store the second mapping table, and the second neighbor pattern of the child node may be determined directly according to the first neighbor pattern. Therefore, if a processor is powerful enough in calculation, the influence of the calculation process on the processing performance may be ignored, the early table creation process is eliminated, and storage space is saved.
In practical applications, the second mapping table may be created by the following method. The second neighbor patterns of the eight child nodes of the current node in different first neighbor patterns are obtained in advance through the above-mentioned calculation process, thereby creating the second mapping table.
In some embodiments of the disclosure, the entropy coding parameter of the current node is a context index of the child node in the current node. The operation that an entropy coding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node includes that: the context index of the child node is determined based on the second neighbor pattern of the child node in the current node. For example, the second neighbor pattern of the child node is determined as the context index of the child node. In
In some embodiments of the disclosure, the entropy coding parameter of the current node is a context index of the child node in the current node. The operation that an entropy coding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node includes the following operations. A context index of a first child node in the current node is determined based on neighbor information of the first child node. A context index of a target child node in the current node is determined based on neighbor information of the target child node and occupation information of a coded child node in the current node. The target child node is a different node from the first child node.
Exemplarily, a method for determining the context index of the child node specifically includes the following operations. A context index of the first child node is determined based on a second neighbor pattern of the first child node in the current node. A context index of a target child node is determined based on a second neighbor pattern of the target child node in the current node and occupation information of a coded child node in the current node. The target child node is a different node from the first child node.
That is, when entropy coding is performed on the current node, the first child node is coded first. Therefore, a context index of the first child node may be determined only by means of the neighbor pattern of the current node.
The target child nodes are the second child node to the eighth child node. When entropy coding is performed on the second child node to the eighth child node, the occupation information of the coded child node has been coded, so the coded child node may also be regarded as a neighbor node of the uncoded child node. Therefore, context prediction is performed on the child node to be coded (i.e., the target child node) according to the second neighbor pattern of the child node to be coded and the occupation information of the coded child node to determine the context index of the child node to be coded. As such, richer contexts may be provided, and the coding performance of the target child node may further be improved.
For example, a method for determining the context index of the first child node specifically includes the following operations. The neighbor information of the first child node is determined as the context index of the first child node. The occupation information of the coded child node in the current node and the neighbor information of the target child node form a binary index, and the binary index is determined as the context index of the target child node.
The corresponding context may be indexed directly by using the binary index. Alternatively, the binary index may be converted into a unitary index to index the corresponding context. For example, the context index of the first child node is calculated by taking the occupation information of the coded child node in the current node as a high bit and the second neighbor pattern of the target child node as low bits. Alternatively, the context index of the first child node is calculated by taking the second neighbor pattern of the target child node as a high bit and the occupation information of the coded child node as low bits.
That is, the context index of the first child node is the occupied second neighbor pattern. For example, when the context index of the second child node is calculated, if the second neighbor pattern is 7, the context index is 7. For a non-first child node, the occupation information of the coded child node and the second neighbor pattern are both represented by binary numbers, and the context index of the child node is obtained by taking the occupation information as a high bit and the second neighbor pattern as low bits. For example, if the second neighbor pattern is 7, converted to a binary number 1111, and the occupation information of the coded child node is 1, the context index 11111 is obtained by merging, corresponding to a decimal number 15.
Table 4 shows the number of contexts of each child node in the current node. The second neighbor pattern of each child node in the current node is obtained by use of the neighbor pattern of the current node, and is stored in chilNeighborMode[8]. The second neighbor pattern of each child node is valued to 0 to 7. The context index of b0 is chilNeighborMode[0], valued to 0 to 7, namely there are eight contexts. The context index of b1 is (b0, chilNeighborMode[1]) with 16 values, namely there are 16 contexts. In a similar manner, there are allocated totally 2,040 contexts. Compared with considering the number of occupied neighbors, considering the second neighbor pattern obtains more contexts, which further improves the coding performance.
Table 5 shows eight different context sets corresponding to eight different child nodes. Each context set includes multiple contexts. A target context in the corresponding context set used for practical coding of each child node is determined according to the context index calculated above. That is, the target context corresponding to each child node in Table 5 is determined according to the corresponding context index in Table 4. Entropy coding is performed on the occupation information of the child node by use of the target context to obtain the geometric bitstream.
Table 6 shows performance comparison results of the point cloud coding method of the disclosure and PCEMv1. Experimental results show that, by using the algorithm described in the disclosure, the coding performance may be improved. As shown in Table 6, the point-to-point error (BD-rate, D1) and point-to-plane error (BD-rate, D2) of the reconstructed point cloud are improved. Here, the Peak Signal-to-Noise Ratio (PSNR) is an objective criterion of image evaluation, and if the PSNR is greater, the quality of the image is higher. The BD-rate is a parameter for evaluating the performance A minus BD-rate indicates that the performance is improved. Based on this, if an absolute value of the BD-rate is greater, the gain of the performance is higher.
In the embodiment of the disclosure, there are also provided other four solutions for determining the context index of each child node under different neighbor ranges according to different neighbor information of each child node.
In the first solution, the neighbor information is the number of occupied neighbor nodes, and contexts, totally seven, are defined by use of an occupation number of six neighbors, co-planar and co-linear with the child node, in the neighbor nodes of the current node. The first neighbor range includes neighbor nodes co-planar and co-linear with the current node, and the second neighbor range includes neighbor nodes of the current node co-planar and co-linear with the child node.
In the second solution, based on the first solution, the neighbor information is the second neighbor pattern of the child node, and contexts, totally 64, are defined by use of occupation patterns of six neighbors co-planar and co-linear with the child node in the neighbor nodes of the current node. Here, the first neighbor range includes neighbor nodes co-planar and co-linear with the current node, and the second neighbor range includes neighbor nodes of the current node co-planar and co-linear with the child node.
In the third solution, the neighbor information is the number of occupied neighbor nodes, and contexts, totally eight, are defined by use of an occupation number of seven neighbors co-planar, co-linear, and co-dot with the child node in the neighbor nodes of the current node. Here, the first neighbor range includes neighbor nodes co-planar, co-linear, and co-dot with the current node, and the second neighbor range includes neighbor nodes of the parent node co-planar, co-linear, and co-dot with the current node.
In the fourth solution, based on the third solution, the neighbor information is the second neighbor pattern of the child node, and contexts, totally 128, are defined by use of occupation patterns of seven neighbors co-planar, co-linear, and co-dot with the child node in the neighbor nodes of the current node. Here, the first neighbor range includes neighbor nodes co-planar, co-linear, and co-dot with the current node, and the second neighbor range includes neighbor nodes of the current node co-planar, co-linear, and co-dot with the child node.
With the above technical solution, the entropy coding parameter of the child node in the current node is predicted by using the neighbor pattern of the current node, namely the position distribution of the occupied neighbor nodes of the current node, and entropy coding is performed on the occupation information of the child node by using the predicted entropy coding parameter, so that the geometric spatial correlation between neighbor nodes may be used well, thereby improving the geometric coding performance.
At S201, a bitstream is parsed to determine a bounding box of a point cloud to be decoded.
In practical applications, a binary bitstream received by the decoder includes a high-level syntax element, a geometric bitstream, and an attribute bitstream. The decoder decodes the geometric bitstream and attribute bitstream in the binary bitstream separately. When the geometric bitstream is decoded, geometric information of a point cloud is obtained by entropy decoding-octree reconstruction-inverse coordinate quantization-inverse coordination translation. When the attribute bitstream is decoded, attribute information of the point cloud is obtained by entropy decoding-inverse quantization-attribute reconstruction-inverse space conversion. A three-dimensional image model of point cloud data is restored based on the geometric information and the attribute information. The point cloud decoding method provided in the embodiment of the disclosure is mainly used for an entropy decoding part.
Specifically, the decoder parses the bitstream to obtain descriptive information of a bounding box of the point cloud. Further, the bounding box is quantized. This quantization step mainly implements scaling. Rounding in quantization makes the geometric information of a part of points to be the same, and it is determined whether to remove duplicate points based on a parameter. The process of quantization and removal of the duplicate points is also referred to as a voxelization process.
When the original point cloud is quantized, multiple different points may be quantized into duplicate points. When there are multiple duplicate points, if the parameter indicates to perform duplicate point removal, the multiple duplicate points are processed as one point. If the parameter indicates not to perform duplicate point removal, the multiple points with the same geometric coordinate are still processed as multiple points. Next, octree partitioning is performed on the processed bounding box.
At S202, coordinate information of a current node is acquired in case that the bounding box of the point cloud to be decoded is partitioned based on a Morton code order.
In the embodiment of the disclosure, the Morton code order may be a breadth first traversal order or a depth first traversal order. The decoder partitions the bounding box of the point cloud to be decoded in the same manner as the encoder, and elaborations are omitted herein.
At S203, a first neighbor pattern of the current node is determined based on the coordinate information of the current node.
In some embodiments of the disclosure, the operation at S203 may specifically include the following operations. Coordinate information of each of neighbor nodes corresponding to the current node is determined based on the coordinate information of the current node and a first neighbor range of the current node. Occupation information of the neighbor node is determined based on the coordinate information of the neighbor node. The first neighbor pattern of the current node is determined based on the occupation information of the neighbor nodes and preset first neighbor node weight values.
The first neighbor range defines neighbor nodes of the current node. For example, the first neighbor range includes at least one of neighbor nodes co-planar with the current node, neighbor nodes co-linear with the current node, or neighbor nodes co-dot with the current node. When the bounding box is equally partitioned into eight sub-cubes, there are 6 neighbor nodes co-planar with the current node, 12 neighbor nodes co-linear with the current node, 8 neighbor nodes co-dot with the current node, 18 neighbor nodes co-planar and co-linear with the current node, and 26 neighbor nodes co-planar, co-linear, and co-dot with the current node.
The occupation information is used to represent whether there is a point cloud distributed in the neighbor node, and may be represented by a binary number. For example, 0 represents that the neighbor node is empty and has no point cloud distributed therein, and 1 represents that the neighbor node is nonempty and has a point cloud distributed therein.
The first neighbor pattern represents a position distribution of occupied neighbor nodes within the first neighbor range of the current node. The first neighbor node weight values define correlations between the current node and neighbor nodes at different positions. If the weight value is greater, the correlation with the current node is higher. If the weight value is smaller, the correlation with the current node is lower.
At S204, an entropy decoding parameter of the current node is determined based on the first neighbor pattern of the current node.
In some embodiments of the disclosure, the operation that the entropy decoding parameter of the current node is determined based on the first neighbor pattern of the current node includes the following operations. Neighbor information of a child node in the current node is determined based on the first neighbor pattern of the current node. An entropy decoding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node.
That is, when a context of the child node is determined, neighbor information of the child node is determined first according to the first neighbor pattern of the current node (which may also be referred to as a parent node), and then a corresponding entropy decoding parameter is selected for the child node according to the neighbor information of the child node.
In some embodiments of the disclosure, the entropy decoding parameter is a context index of the child node. That is, in the disclosure, when entropy decoding is performed on the current node, a context index corresponding to each child node in the current node is predicted according to the first neighbor pattern of the current node, a context corresponding to the child node is determined according to the context index, and entropy decoding is performed on occupation information of the child node in the context corresponding to the child node. Therefore, the spatial correlation between nodes may be used fully, and the coding performance may be improved.
In some embodiments of the disclosure, the neighbor information is a number of occupied neighbor nodes of the child node or a second neighbor pattern of the child node. The second neighbor pattern represents a position distribution of occupied neighbor nodes within a second neighbor range of the child node.
Specifically, when the neighbor information is the number of occupied neighbor nodes of the child node, the operation that neighbor information of the child node in the current node is determined based on the first neighbor pattern of the current node includes the following operations.
The number of occupied neighbor nodes of the child node corresponding to the first neighbor pattern of the current node is acquired from a preset first mapping table. The first mapping table includes a mapping relationship between at least one first neighbor pattern of the current node and a number of occupied neighbor nodes of the child node of the current node.
Alternatively, occupation information of the neighbor nodes corresponding to the current node is determined based on the first neighbor pattern of the current node, and the number of occupied neighbor nodes of the child node is determined based on the occupation information of the neighbor nodes corresponding to the current node and the second neighbor range of the child node.
The second neighbor range includes at least one of neighbor nodes of the current node co-planar with the child node, neighbor nodes of the current node co-linear with the child node, or neighbor nodes of the current node co-dot with the child node. When the bounding box is equally partitioned into eight sub-cubes, there are three neighbor nodes of the current node (i.e., the parent node) co-planar with the child node, three neighbor nodes of the parent node co-linear with the child node, one neighbor node co-dot with the current node, six neighbor nodes of the parent node co-planar and co-linear with the child node, and seven neighbor nodes of the parent node co-planar, co-linear, and co-dot with the child node.
Specifically, when the neighbor information is the second neighbor pattern of the child node, the operation that neighbor information of each child node in the current node is determined based on the first neighbor pattern of the current node includes the following operations.
The second neighbor pattern of the child node corresponding to the first neighbor pattern of the current node is acquired from a preset second mapping table. The second mapping table includes a mapping relationship between at least one first neighbor pattern of the current node and second neighbor patterns of the child nodes of the current node.
Alternatively, occupation information of the neighbor nodes corresponding to the current node is determined based on the first neighbor pattern of the current node, and the second neighbor pattern of the child node is determined based on the occupation information of the neighbor nodes corresponding to the current node, the second neighbor range of the child node, and preset second neighbor node weight values of the child node.
The second neighbor range includes at least one of neighbor nodes of the current node co-planar with the child node, neighbor nodes of the current node co-linear with the child node, or neighbor nodes of the current node co-dot with the child node. When the bounding box is equally partitioned into eight sub-cubes, there are three neighbor nodes of the parent node co-planar with the child node, three neighbor nodes of the parent node co-linear with the child node, one neighbor node co-dot with the current node, six neighbor nodes of the parent node co-planar and co-linear with the child node, and seven neighbor nodes of the parent node co-planar, co-linear, and co-dot with the child node.
In some embodiments of the disclosure, the second neighbor range of the child node is a subset of the first neighbor range of the current node.
In some embodiments of the disclosure, the entropy decoding parameter of the current node is a context index of the child node in the current node. The operation that the entropy decoding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node includes that: the context index of the child node is determined based on the number of occupied neighbor nodes of the child node in the current node. For example, the number of occupied neighbor nodes of the child node is determined as the context index of the child node. If the number of occupied neighbor nodes of each child node in
In some embodiments of the disclosure, the entropy decoding parameter of the current node is a context index of the child node in the current node. The operation that the entropy decoding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node includes the following operations. A context index of a first child node in the current node is determined based on neighbor information of the first child node. A context index of a target child node in the current node is determined based on neighbor information of the target child node and occupation information of a decoded child node in the current node. The target child node is a different node from the first child node.
That is, when entropy decoding is performed on the current node, the first child node is decoded first. Therefore, a context index of the first child node may be determined only by means of the neighbor pattern of the current node.
The target child nodes are the second child node to the eighth child node. When entropy decoding is performed on the second child node to the eighth child node, the occupation information of the decoded child node has been decoded, so the decoded child node may also be regarded as a neighbor node of the un-decoded child node. Therefore, context prediction is performed on the child node to be decoded (i.e., the target child node) according to the number of occupied neighbor nodes of the child node to be decoded and the occupation information of the decoded child node to determine the context index of the child node to be decoded. As such, richer contexts may be provided, and the decoding performance of the target child node may further be improved.
For example, a method for determining the context index of the first child node specifically includes the following operations. The neighbor information of the first child node is determined as the context index of the first child node. The occupation information of the decoded child node in the current node and the neighbor information of the target child node form a binary index, and the binary index is determined as the context index of the target child node.
The corresponding context may be indexed directly by using the binary index. Alternatively, the binary index may be converted into a unitary index to index the corresponding context. For example, the context index of the target child node is obtained by taking the occupation information of the decoded child node in the current node as a high bit and the number of occupied neighbor nodes of the target child node as low bits. Alternatively, the context index of the first child node is calculated by taking the second neighbor pattern of the target child node as a high bit and the occupation information of the decoded child node as low bits.
In some other embodiments, the neighbor information is the second neighbor pattern of the child node.
The operation that neighbor information of each child node in the current node is determined based on the first neighbor pattern of the current node includes the following operations.
The second neighbor pattern of the child node corresponding to the first neighbor pattern of the current node is acquired from a preset second mapping table. The second mapping table includes a mapping relationship between at least one first neighbor pattern of the current node and second neighbor patterns of the child nodes of the current node.
Alternatively, occupation information of neighbor nodes corresponding to the current node is determined based on the first neighbor pattern of the current node, and the second neighbor pattern of the child node is determined based on the occupation information of the neighbor nodes corresponding to the current node, the second neighbor range of the child node, and preset second neighbor node weight values of the child node.
The second neighbor range includes at least one of neighbor nodes of the current node co-planar with the child node, neighbor nodes of the current node co-linear with the child node, or neighbor nodes of the current node co-dot with the child node. When the bounding box is equally partitioned into eight sub-cubes, there are three neighbor nodes of the current node co-planar with the child node, three neighbor nodes of the parent node co-linear with the child node, one neighbor node co-dot with the current node, six neighbor nodes of the parent node co-planar and co-linear with the child node, and seven neighbor nodes of the parent node co-planar, co-linear, and co-dot with the child node.
That is, in some embodiments of the disclosure, a correlation between a neighbor node of the child node and the child node is also considered. A weight value is set for the neighbor node of the child node to represent the correlation between the neighbor node and the child node. If the weight value is greater, the correlation with the current node is higher. If the weight value is smaller, the correlation with the current node is lower.
In some embodiments of the disclosure, the second neighbor range of the child node is a subset of the first neighbor range of the parent node.
In some embodiments of the disclosure, the entropy decoding parameter of the current node is a context index of the child node in the current node. The operation that the entropy decoding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node includes that: the context index of the child node is determined based on the second neighbor pattern of the child node in the current node. For example, the second neighbor pattern of the child node is determined as the context index of the child node. In
In some embodiments of the disclosure, the entropy decoding parameter of the current node is a context index of the child node in the current node. The operation that the entropy decoding parameter of the child node in the current node is determined based on the neighbor information of the child node in the current node includes the following operations. A context index of a first child node in the current node is determined based on neighbor information of the first child node. A context index of a target child node in the current node is determined based on neighbor information of the target child node and occupation information of a decoded child node in the current node. The target child node is a different node from the first child node.
That is, when entropy decoding is performed on the current node, the first child node is decoded first. Therefore, a context index of the first child node may be determined only by means of the neighbor pattern of the current node.
The target child nodes are the second child node to the eighth child node. When entropy decoding is performed on the second child node to the eighth child node, the occupation information of the decoded child node has been decoded, so the decoded child node may also be regarded as a neighbor node of the un-decoded child node. Therefore, context prediction is performed on the child node to be decoded (i.e., the target child node) according to the second neighbor pattern of the child node to be decoded and the occupation information of the decoded child node to determine the context index of the child node to be decoded. As such, richer contexts may be provided, and the decoding performance of the target child node may further be improved.
For example, a method for determining the context index of the first child node specifically includes the following operations. The neighbor information of the first child node is determined as the context index of the first child node. The occupation information of the decoded child node in the current node and the neighbor information of the target child node form a binary index, and the binary index is determined as the context index of the target child node.
The corresponding context may be indexed directly by using the binary index. Alternatively, the binary index may be converted into a unitary index to index the corresponding context. For example, the context index of the first child node is calculated by taking the occupation information of the decoded child node in the current node as a high bit and the second neighbor pattern of the target child node as low bits. Alternatively, the context index of the target child node is calculated by taking the second neighbor pattern of the target child node as a high bit and the occupation information of the decoded child node as low bits.
Similarly, at the decoder, there are also other four solutions for determining the context index of each child node under different neighbor ranges according to different neighbor information of each child node. The four solutions are the same as those of the encoder, and will not be elaborated herein.
At S205, entropy decoding is performed on occupation information of the current node based on the entropy decoding parameter.
With the above technical solution, the entropy decoding parameter of the child node in the current node is predicted by using the neighbor pattern of the current node, namely the position distribution of the occupied neighbor nodes of the current node, and entropy decoding is performed on the occupation information of the child node by using the predicted entropy decoding parameter, so that the geometric spatial correlation between neighbor nodes may be used well, thereby improving the geometric decoding performance.
An embodiment of the disclosure provides an encoder.
The first determination unit 501 is configured to determine a bounding box of a point cloud to be coded.
The first acquisition unit 502 is configured to acquire coordinate information of a current node in case that octree partitioning is performed on the bounding box of the point cloud to be coded based on a Morton code order.
The first processing unit 503 is configured to determine a first neighbor pattern of the current node based on the coordinate information of the current node.
The first prediction unit 504 is configured to determine an entropy coding parameter of the current node based on the first neighbor pattern of the current node.
The coding unit 505 is configured to perform entropy coding on occupation information of the current node based on the entropy coding parameter.
In some embodiments of the disclosure, the first processing unit 503 is configured to determine coordinate information of each of neighbor nodes corresponding to the current node based on the coordinate information of the current node and a first neighbor range of the current node, determine occupation information of the neighbor node based on the coordinate information of the neighbor node, and determine the first neighbor pattern of the current node based on the occupation information of the neighbor nodes and preset first neighbor node weight values.
In some embodiments of the disclosure, the first neighbor range includes at least one of neighbor nodes co-planar with the current node, neighbor nodes co-linear with the current node, or neighbor nodes co-dot with the current node.
In some embodiments of the disclosure, the first neighbor pattern represents a position distribution of occupied neighbor nodes within the first neighbor range of the current node.
In some embodiments of the disclosure, the first prediction unit 504 is configured to determine neighbor information of a child node in the current node based on the first neighbor pattern of the current node, and determine an entropy coding parameter of the child node in the current node based on the neighbor information of the child node in the current node.
In some embodiments of the disclosure, the neighbor information is a number of occupied neighbor nodes of the child node or a second neighbor pattern of the child node. The second neighbor pattern represents a position distribution of occupied neighbor nodes within a second neighbor range of the child node.
In some embodiments of the disclosure, in case that the neighbor information is the number of occupied neighbor nodes of the child node, the first prediction unit 504 is configured to acquire the number of occupied neighbor nodes of the child node corresponding to the first neighbor pattern of the current node from a preset first mapping table. The first mapping table includes a mapping relationship between at least one first neighbor pattern of the current node and a number of occupied neighbor nodes of the child node of the current node.
Alternatively, the first prediction unit 504 is configured to determine occupation information of neighbor nodes corresponding to the current node based on the first neighbor pattern of the current node, and determine the number of occupied neighbor nodes of the child node based on the occupation information of the neighbor nodes corresponding to the current node and the second neighbor range of the child node.
In some embodiments of the disclosure, in case that the neighbor information is the second neighbor pattern of the child node, the first prediction unit 504 is configured to acquire the second neighbor pattern of the child node corresponding to the first neighbor pattern of the current node from a preset second mapping table. The second mapping table includes a mapping relationship between at least one first neighbor pattern of the current node and second neighbor patterns of the child nodes of the current node.
Alternatively, the first prediction unit 504 is configured to determine occupation information of the neighbor nodes corresponding to the current node based on the first neighbor pattern of the current node, and determine the second neighbor pattern of the child node based on the occupation information of the neighbor nodes corresponding to the current node, the second neighbor range of the child node, and preset second neighbor node weight values of the child node.
In some embodiments of the disclosure, the second neighbor range includes at least one of neighbor nodes of the current node co-planar with the child node, neighbor nodes of the current node co-linear with the child node, or neighbor nodes of the current node co-dot with the child node.
In some embodiments of the disclosure, the entropy coding parameter of the current node is a context index of the child node in the current node. The first prediction unit 504 is configured to determine a context index of a first child node in the current node based on neighbor information of the first child node, and determine a context index of a target child node in the current node based on neighbor information of the target child node and occupation information of a coded child node in the current node. The target child node is a different node from the first child node.
In some embodiments of the disclosure, the first prediction unit 504 is configured to determine the neighbor information of the first child node as the context index of the first child node.
In some embodiments of the disclosure, the first prediction unit 504 is configured to form a binary index by using the occupation information of the coded child node in the current node and the neighbor information of the target child node, and determine the binary index as the context index of the target child node; or obtain the context index of the target child node by taking the occupation information of the coded child node in the current node as a high bit and the neighbor information of the target child node as low bits.
In some embodiments of the disclosure, the coding unit 505 is configured to determine a context corresponding to the child node in the current node based on the context index of the child node in the current node, and perform entropy coding on occupation information of the child node in the context corresponding to the child node.
In some embodiments of the disclosure, the Morton code order is a breadth first traversal order or a depth first traversal order.
In practical applications, the embodiment of the disclosure also provides an encoder.
With the above technical solution, the encoder, when performing entropy coding, predicts the entropy coding parameter of the child node in the current node by use of the neighbor pattern of the current node, namely the position distribution of the occupied neighbor nodes of the current node, and performs entropy coding on the occupation information of the child node by use of the predicted entropy coding parameter, so that the geometric spatial correlation between neighbor nodes may be used well, thereby improving the geometric coding performance.
An embodiment of the disclosure provides a decoder.
The parsing unit 601 is configured to parse a bitstream to determine a bounding box of a point cloud to be decoded.
The second acquisition unit 602 is configured to acquire coordinate information of a current node in case that the bounding box of the point cloud to be decoded is partitioned based on a Morton code order.
The second processing unit 603 is configured to determine a first neighbor pattern of the current node based on the coordinate information of the current node.
The second prediction unit 604 is configured to determine an entropy decoding parameter of the current node based on the first neighbor pattern of the current node.
The decoding unit 605 is configured to perform entropy decoding on occupation information of the current node based on the entropy decoding parameter.
In some embodiments of the disclosure, the second processing unit 603 is configured to determine coordinate information of each of neighbor nodes corresponding to the current node based on the coordinate information of the current node and a first neighbor range of the current node, determine occupation information of the neighbor node based on the coordinate information of the neighbor node, and determine the first neighbor pattern of the current node based on the occupation information of the neighbor nodes and preset first neighbor node weight values.
In some embodiments of the disclosure, the first neighbor range includes at least one of neighbor nodes co-planar with the current node, neighbor nodes co-linear with the current node, or neighbor nodes co-dot with the current node.
In some embodiments of the disclosure, the first neighbor pattern represents a position distribution of occupied neighbor nodes within the first neighbor range of the current node.
In some embodiments of the disclosure, the second prediction unit 604 is configured to determine neighbor information of a child node in the current node based on the first neighbor pattern of the current node, and determine an entropy decoding parameter of the child node in the current node based on the neighbor information of the child node in the current node.
In some embodiments of the disclosure, the neighbor information is a number of occupied neighbor nodes of the child node or a second neighbor pattern of the child node. The second neighbor pattern represents a position distribution of occupied neighbor nodes within a second neighbor range of the child node.
In some embodiments of the disclosure, in case that the neighbor information is the number of occupied neighbor nodes of the child node, the second prediction unit 604 is configured to acquire the number of occupied neighbor nodes of the child node corresponding to the first neighbor pattern of the current node from a preset first mapping table. The first mapping table includes a mapping relationship between at least one first neighbor pattern of the current node and a number of occupied neighbor nodes of the child node of the current node.
Alternatively, the second prediction unit 604 is configured to determine occupation information of the neighbor nodes corresponding to the current node based on the first neighbor pattern of the current node, and determine the number of occupied neighbor nodes of the child node based on the occupation information of the neighbor nodes corresponding to the current node and the second neighbor range of the child node.
In some embodiments of the disclosure, in case that the neighbor information is the second neighbor pattern of the child node, the second prediction unit 604 is configured to acquire the second neighbor pattern of the child node corresponding to the first neighbor pattern of the current node from a preset second mapping table. The second mapping table includes a mapping relationship between at least one first neighbor pattern of the current node and second neighbor patterns of the child nodes of the current node.
Alternatively, the second prediction unit 604 is configured to determine occupation information of the neighbor nodes corresponding to the current node based on the first neighbor pattern of the current node, and determine the second neighbor pattern of the child node based on the occupation information of the neighbor nodes corresponding to the current node, the second neighbor range of the child node, and preset second neighbor node weight values of the child node.
In some embodiments of the disclosure, the second neighbor range includes at least one of neighbor nodes of the current node co-planar with the child node, neighbor nodes of the current node co-linear with the child node, or neighbor nodes of the current node co-dot with the child node.
In some embodiments of the disclosure, the entropy decoding parameter of the current node is a context index of the child node in the current node. The second prediction unit 604 is configured to determine a context index of a first child node in the current node based on neighbor information of the first child node, and determine a context index of a target child node in the current node based on neighbor information of the target child node and occupation information of a decoded child node in the current node. The target child node is a different node from the first child node.
In some embodiments of the disclosure, the second prediction unit 604 is configured to determine the neighbor information of the first child node as the context index of the first child node.
In some embodiments of the disclosure, the second prediction unit 604 is configured to form a binary index by using the occupation information of the decoded child node in the current node and the neighbor information of the target child node, and determine the binary index as the context index of the target child node; or obtain the context index of the target child node by taking the occupation information of the decoded child node in the current node as a high bit and the neighbor information of the target child node as low bits.
In some embodiments of the disclosure, the decoding unit 605 is configured to determine a context corresponding to the child node in the current node based on the context index of the child node in the current node, and perform entropy decoding on occupation information of the child node in the context corresponding to the child node to obtain the occupation information of the child node.
In some embodiments of the disclosure, the Morton code order is a breadth first traversal order or a depth first traversal order.
In practical applications, the embodiment of the disclosure also provides a decoder.
It can be understood that the decoder, when performing entropy decoding, predicts the entropy decoding parameter of the child node in the current node by use of the neighbor pattern of the current node, namely the position distribution of the occupied neighbor nodes of the current node, and performs entropy decoding on the occupation information of the child node by use of the predicted entropy decoding parameter, so that the geometric spatial correlation between neighbor nodes may be used well, thereby improving the geometric decoding performance.
In addition, each function module in the embodiments may be integrated into a processing unit, each unit may also physically exist independently, and two or more than two units may also be integrated into a unit. The integrated unit may be implemented in a hardware form, or in form of a software function module.
Correspondingly, an embodiment of the disclosure provides a storage medium, on which a computer program is stored. The computer program, when executed by a first processor, implements the point cloud coding method for an encoder. Alternatively, the computer program, when executed by a second processor, implements the point cloud decoding method for a decoder.
It is to be pointed out that the above descriptions about the storage medium and device embodiments are similar to those about the method embodiments and beneficial effects similar to those of the method embodiments are achieved. Technical details undisclosed in the storage medium and device embodiments of the disclosure are understood with reference to the descriptions about the method embodiments of the disclosure.
The above is only the implementation mode of the disclosure and not intended to limit the scope of protection of the disclosure. Any variations or replacements apparent to those skilled in the art within the technical scope disclosed in the disclosure shall fall within the scope of protection of the disclosure. Therefore, the scope of protection of the disclosure shall be subject to the scope of protection of the claims.
In the embodiments of the disclosure, point cloud coding and decoding methods, an encoder, a decoder, and a storage medium are provided. The methods include the following operations. A bounding box of a point cloud to be coded/decoded is determined; coordinate information of a current node is acquired in case that octree partitioning is performed on the bounding box of the point cloud to be coded/decoded based on a Morton code order; a first neighbor pattern of the current node is determined based on the coordinate information of the current node; an entropy coding parameter/entropy decoding parameter of the current node is determined based on the first neighbor pattern of the current node; and entropy coding/entropy decoding is performed on occupation information of the current node based on the entropy coding parameter/entropy decoding parameter. In this manner, an entropy coding parameter/entropy decoding parameter of a child node in the current node is predicted by use of the neighbor pattern of the current node, namely a position distribution of occupied neighbor nodes of the current node, and entropy coding/entropy decoding is performed on occupation information of the child node by use of the predicted entropy coding parameter/entropy decoding parameter, so that the geometric spatial correlation between neighbor nodes may be used well for coding/decoding, thereby improving the geometric coding/decoding performance.
This is a continuation of International Application No. PCT/CN2020/091106 filed on May 19, 2020, the contents of which are hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2020/091106 | May 2020 | US |
Child | 17981661 | US |