Classification method of data pattern and classification system of data pattern

Description

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a 371 U.S. National Phase of International Application No. PCT/JP2020/016772, filed on Apr. 16, 2020. The entire disclosure of the above application is incorporated herein by reference.

TECHNICAL FIELD

The present invention relates to a data-pattern classification method and a classification system, by which the candidates of precise rules are filtered from information in a database if the type of information inputted in a column of the database is not unified in the column.

BACKGROUND ART

An ordinary database (hereinafter referred to as “DB”) contains information about a piece of data, the information being inputted to a dedicated column. In columns such as a remarks column where items to be inputted are not definitely determined, various types of information may be inputted. When a DB including such a remarks column is combined with another DB, information to be shared between the DBs in the remarks column may interfere with proper combination.

Such DBs will be described below using specific examples.

TABLE 1

City, town and
House
Telephone

ID
School Name
Prefecture
village
Number
Number
Position
Classification
Remarks

1
A University
Tokyo
Minato-ward
1-11-11
5678
S
Address
1-11-11, Minato-ward,

Tokyo

2
B High
Aichi
Nagoya
2-22
6789
S
Address
2-22 Nagoya-city

School

3
C Technical
Osaka
Osska
33-33-3
1234
S
Telephone
7890-1234

College

Table 1 indicates an example of a DB where character strings regularly overlap in remarks columns. In the remarks columns, one of (1) address and (2) telephone number is inputted. In the case of (1) address, information in columns “prefecture,” “city, town and village,” and “house number” is shared. However, “prefecture” is not inputted in the remarks column of “ID:2” data. In this way, the expressions of addresses are not unified in the columns.

TABLE 2

Com-
Area
Maintenance

ID
Address
pletion
(m²)
Company
Remarks

1
0-0-0, Minato-
2000/1/1
1000
□ □, Inc.
School

ward, Tokyo

2
x-x, Nagoya-
1995/1/1
1200
XX main-
Prefecture

city, Aichi

tenance
high school

Table 2 indicates an example of a DB about buildings.

When the DB in table 1 and the DB in table 2 are combined, an address is a common item serving as a trigger. However, table 1 does not have any column corresponding to an address, and part of the data contains telephone numbers, so that a school name or a telephone number in table 1 and building information in table 2 cannot be associated with each other.

To address this problem, a conventional technique is proposed, in which rules are preregistered to determine which one of the rules is abided by an inputted character string (e.g., PTLs 1 and 2).

CITATION LIST
Patent Literatures

- [PTL 1] Japanese Patent Application Publication No. 2016-136341
- [PTL 2] Japanese Patent Application Publication No. 2014-219833

SUMMARY OF THE INVENTION
Technical Problem

In table 1, types of information (addresses or telephone numbers) are inputted in the remarks columns. An input to each of the remarks columns indicates one type of information. Although address information is inputted in the remarks columns of “ID:1” and “ID:2” in table 1, an address in the remarks column of “ID:1” is inputted from a prefecture name, whereas an address in “ID:2” is inputted from the name of a city, town or village. In this way, the expressions of information are not unified. This is because a human error has occurred or an input style for addresses is not specified.

If information to be inputted to a remarks column is not specified, the related arts described in PTL 1 and PTL 2 cannot cope with such information.

The present invention has been devised in view of such circumstances. An object of the present invention is to provide a data-pattern classification method and a classification system, by which columns containing information to be shared between databases can be easily found when the databases are combined.

Means for Solving the Problem

An aspect of the present invention is a data-pattern classification method for a database containing character strings regularly overlapping between the input values of mixed columns and information inputted in another specific column, the mixed column including types of information, the method including: extracting means for extracting a branch column for changing the type of information to be inputted to the mixed column, the branch column being extracted by one of a heuristic first technique and a second technique, the first technique using timing to change the pattern of the mixed column in the database including the mixed column, the pattern referring to a column where a character string overlaps the input value of the mixed column, the second technique using a statistical technique of a likelihood test; and classification to obtain the number of types of information stored in the mixed columns, by grouping, according to information indicated by the patterns, the patterns obtained from the mixed columns based on the extracted branch column.

Effects of the Invention

According to an aspect of the present invention, columns containing information to be shared between databases can be easily found when the databases are combined.

DESCRIPTION OF EMBODIMENTS

An embodiment of the present invention applied to a data-pattern classification method will be described below.

[Outline]

The outline of the present embodiment will be described below.

Columns in which types of information are inputted as in a remarks column in Table 1 are referred to as mixed columns in the present specification. A DB in the present embodiment is provided for character strings regularly overlapping between the input values of mixed columns and information inputted in another specific column. A combination of the input value of the mixed column and a column containing a character string overlap is referred to as a pattern.

If input values with identical patterns are simply handled as a group indicating the same information so that the input values of the remarks columns are grouped according to patterns obtained from the mixed columns, groups may be excessively generated.

In the present embodiment, patterns obtained from the mixed columns are grouped according to information indicated by the patterns. In the example of table 1, different patterns are obtained from the remarks columns of ID:1 and ID:2 but are classified as the same group because address information is inputted in both of the columns.

In the present embodiment, it is assumed that a branch column is present. Specifically, the branch column indicates a column for changing information about the input value of the remarks column according to an inputted character string. For example, in a DB containing a mixed column, like the classification column, if a character string X (e.g., “address”) is inputted in a branch column as in a “classification” column, an input is made in the remarks column under a rule A (information about an address is inputted to the remarks column), whereas if another character string Y (e.g., “telephone”) is inputted in the branch column, an input is made in the remarks column under a rule B (information about a telephone number is inputted to the remarks column). Thus, the type of an input value in the branch column is identical to the type of information in the mixed column.

For finding a branch column, two methods are used:

- A) A heuristic method using the timing of a pattern change of the mixed column
- B) A method using a statistical technique of a likelihood test

The branch column is found as described above, so that patterns are grouped and the number of types of information stored in the mixed columns is determined. When the mixed columns are grouped, the mixed columns are classified by the type of information, facilitating a search for columns containing information to be shared between DBs when the DBs are combined.

Additionally, in combination with a technique of extracting a representative pattern from patterns belonging to one group, the order of superiority is determined among the patterns. This can unify input values according to the best pattern.

[Method Steps]

The specific steps of a data-pattern classification method will be described below.

It is assumed that a known column corresponds to a mixed column. Pieces of data stored in a DB is denoted as d_i(i∈{1, 2, . . . , n}) (n: the total number of pieces of stored data), and D denotes the data sets of mixed columns that are not blank.

For d_i, information about i-th data is listed except for mixed columns.

For example, in the DB of table 1, d_i=[“A university”, “Tokyo,” “Minato Ward,” “1-11-1,” and “5678” ] is obtained.

At this point, a_jdenotes a mixed column corresponding to data d_jthat is an element of a set D.

Hereinafter, for character strings str1 and str2, a set of common characters is denoted as str1∩str2.

<Step 0> (Preprocessing: Pattern Extraction)

From among inputted information other than mixed columns, indexes including character strings shared with a_jare all extracted. Specifically, an index x is found such that a_j∩d_j[x]≠φ(x≠y) is established. φ indicates an empty set.

However, a search for the index x needs to satisfy all the following conditions i) to iii):

- i) A common character string is as long as or longer than a half of the length of a character string d_j[x].

If numbers or symbols (e.g., :, ;, −, +, @) are shared, the common character string requires a length of 2 or more, or d_j[x] needs to include characters other than numbers and symbols.

- ii) the common character string is identical to a character string obtained by deleting the first character from the character string d_j[x] or deleting the last character from the character string. For example, in the case of d_j[x]=“Fukushima Koriyama,” the condition ii) is satisfied when a character string shared with d_j[y] is “Fukushima” or “Koriyama,” whereas “Fukuyama” does not satisfy the condition ii).

iii) Common character strings should not overlap one another among the elements of different indexes. For example, in the case of a_j=“Fukushima Koriyama,” the condition iii) is satisfied when d_j[x]=“Fukushima” and d_j[z]=“Koriyama” are obtained, whereas the condition iii) is not satisfied when d_j[x]=“Fukushima” and d_j[z]=“Fukushima Koriyama” are obtained.

If multiple indexes are extracted, a combination thereof is referred to as a pattern as described above. Patterns are extracted for all d_j∈D. The set of the patterns is denoted as P, and the i-th pattern is denoted as p_i∈P. An index is stored for each of the patterns. For example, p_i=[1, 4, 6] indicates a common part shared with character strings inputted in the first, fourth, sixth columns. By performing such preprocessing, the following processing steps for grouping can be efficiently performed.

First, technique A), the heuristic method will be described below.

The step 0 is performed to extract patterns.

The total number of the extracted patterns is denoted as |P|. Columns are selected according to the following two conditions:

- (1) The number of types of character strings inputted to the columns is |P| or less.
- (2) The columns do not contain any elements of the extracted patterns.
  
  <Step 3A>

For each of the selected columns, the following value s is calculated: s:=(error_c/c)+(error_f/f) (when vertically adjacent pieces of data are compared with each other, c:the number of times the input pattern of the mixed column changes, error_c:the number of times the input of the column is not changed even when the input pattern of the mixed column changes, f:the number of times the input pattern of the mixed column is kept, error_f:the number of times the input of the column changes even when the input pattern of the mixed column does not change.)

As the conditions of the branch column, the pattern of the mixed column may change concurrently with a change of an input character string, whereas the pattern may be unchanged when the character string does not change. Thus, the number of times the conditions are not satisfied and the ratio are indicated.

Columns are selected to obtain minimum s. The selected columns are denoted as h_j(j∈{1, 2, . . . }), and a set of h_jis denoted as H. At this point, the pattern of the mixed column is the data of p_j, and a character string with the highest frequency of occurrence is denoted as m_ifrom among character strings inputted to h_j.

However, it is necessary to satisfy the condition “the number of times m_ioccurs exceeds a half of the number of data pieces in the pattern p_i.” If the condition is not satisfied, h_jis not assumed to be the branch column and is excluded from the set H.

If m_iis determined for each pattern p_i(i∈{1, 2, . . . , |P|}), the column h_jis assumed to serve as the branch column.

If the number of types of character strings inputted in the branch column h_jis smaller than the total number |P| of patterns, the patterns are grouped. Specifically, patterns with the same m_iare grouped as patterns indicating identical information. A representative pattern is selected from the grouped patterns.

A technique B), a statistical method according to a likelihood will be described below.

The step 0 is performed to extract patterns.

The total number of the extracted patterns is denoted as |P|. Columns are selected according to the following two conditions:

- (1) The number of types of character strings inputted to the columns is |P| or less.
- (2) The columns do not contain any elements of the extracted patterns.
  
  <Step 3B>

A set of the selected columns is denoted as H. For each column h_jof the set H, the pattern inputted in the mixed column is the data of p_i, and a character string with the highest frequency of occurrence is selected from among character strings inputted to h_jand is denoted as m_ji. However, in the case of multiple candidates for m_ji, that is, multiple character strings having a mode, h_jis not assumed to be the branch column and is excluded from the set H.

Regarding the column h_jof the set H, a numeric value corresponding to each x_klof a two-dimensional tabulation in Table 3 is obtained, for each of p_iand m_ji, from the data. x_kl: the number of pieces of data in which m_jiis inputted in a pattern p_k.

However, in the presence of k and 1 (k≠1, k≠i, l≠i) satisfying m_jk=m_jiin the pattern p_i,

- x_ik=x_il=M/2 (M is the total number of pieces of data in which m_jk(or m_jl) is inputted) is inputted to obtain equalization.

TABLE 3

Pattern p_i
Pattern P_j+1
Total

. . .

m_j(i+1) input
x_ii
x_(i+1)i

\sum_{k} x_{ki}

m_(j+i) input
x_i(i+1)
x_(i+1)(i+1)

\sum_{k} x_{k (i + 1)}

. . .

Total

\sum_{l} x_{i l}

\sum_{l} x_{(i + 1) l}

\sum_{l} \sum_{k} x_{kl}

By using a chi[χ]-square test for the data, a hypothesis “Compliance with each pattern p_iand the input of m_jiaccidentally occur at the same time” is tested. An expected frequency is indicated in table 4.

TABLE 4

Pattern p_i

custom character

. . .

m_jiinput

y_{ii} = \frac{\sum_{k} x_{ki} \sum_{l} x_{il}}{\sum_{l} \sum_{k} x_{kl}}

\sum_{k} x_{ki}

. . .

Total

\sum_{l} x_{il}

\sum_{l} \sum_{k} x_{kl}

By the expected frequency table in table 4, a test statistic χ_i²is determined according to the following expression:

$\begin{matrix} χ_{i}^{2} = \sum_{l} \sum_{k} \frac{{(x_{k l} - y_{k l})}^{2}}{y_{k l}} & [Math . 1] \end{matrix}$

The foregoing operation is performed on each h_i.

The calculated test statistic χ_i²is converted into a p value at a degree (|P|−1)²of freedom of a test amount. If the p value is larger than a significance level α, the hypothesis is rejected and h_iis outputted as the branch column. In the presence of multiple h_i, h_iwith a maximump value is used as the branch column.

Example

A specific example based on the steps of the data-pattern classification method will be described below.

TABLE 5

School

City, town
House
Telephone

ID
name
Prefecture
and village
number
number
Position
Classification
Remarks

1
A
Tokyo
Minato-ward
1-11-11
5678
S
Address
1-11-11, Minato-ward,

University

Tokyo

2
B High
Aichi
Nagoya
2-22
6789
S
Address
2-22 Nagoya-city

school

3
C Technical
Osaka
Osaka
33-33-3
1234
S
Telephone
7890-1234

College

4
D
Kanagawa
Yokohama-
4-4-4
890
S
Address
4-4-4, Yokohama-city,

University

city

Kanagawa

5
E
Mie
Yokohama-
5-55-5
4567
S
Address
5-55-5, Mstsuzaka-city,

university

city

Mie

6
F High
Hyogo
Kobe-
66-6
9012
R
Address
66-6, Kobe-city,

school

city

Hyogo

7
G High
Saitama
Kawagoe-
7-77-7
234
R
Telephone
049-012-234

school

city

8
H Junior
Gifu
Gifu-city
888
123
R
Telephone
058-90-123

high

school

9
I Junior
Kyoto
Kyoto-city
9-99
3456
R
Address
8-99,

high

Kyoto-city, Kyoto

school

10
J
Chiba
Ichikawa-
00-000
789
R
Address
00-000, Chiba

technical

city

school

Table 5 indicates an example of a DB including input rules. In the remarks columns, one of (1) address and (2) telephone number is inputted. In the case of (1) address, information in columns “prefecture,” “city, town and village,” and “house number” is shared.

Table 6 indicates extracted combinations of the columns including character strings shared with the remarks columns.

TABLE 6

ID
Common column

1
Prefecture, city, town, village, house number

2
City, town and village, house number

3
Telephone number

4
Prefecture, city, town, village, house number

5
Prefecture, city, town, village, house number

6
Prefecture. city, town, village, house number

7
Telephone number

8
Telephone number

9
Prefecture, city, town, village, house number

10
Prefecture, house number

In table 6, “ID” is not a common item because the common character corresponds to one digit of a number, which does not satisfy the condition i). Table 6 indicates four types of patterns as follows:

- p₁=[prefecture, city, town and village, house number]
- p₂=[telephone number]
- p₃=[city, town and village, house number]
- p₄=[prefecture, city, house number]
  
  Technique A) Heuristic Method
  
  <Step 1A>

The step 0 is performed to extract patterns.

Columns are selected so as to satisfy the following two conditions:

- (1) The number of types of character strings inputted in the columns is four or less.
- (2) The columns are not any elements of the extracted patterns.

The columns “position” and “classification” satisfy these conditions. The columns “ID” and “school name” each contain ten or more types of character strings and thus do not satisfy the condition (1).

For the column “position,” the following is determined:

- c=6 (the pattern changes when the ID changes from 1 to 2, from 2 to 3, from 3 to 4, from 6 to 7, from 8 to 9, and from 9 to 10) error_c=6 (the input character does not change at any one of the times)
- f=3 (the pattern does not change when the ID changes from 4 to 5, from 5 to 6, and from 7 to 8)
- error_f=1 (the pattern changes when the ID changes from 5 to 6)

Thus, s:=(error_c/c)+(error_f/f)=1+(1/3)≈1.333 is obtained.

For the column “classification,” the following is determined:

- c=6 (the pattern changes when the ID changes from 1 to 2, from 2 to 3, from 3 to 4, from 6 to 7, from 8 to 9, and from 9 to 10) error_c=2 (the input character does not change when the ID changes from 1 to 2 and from 9 to 10)
- f=3 (the pattern does not change when the ID changes from 4 to 5, from 5 to 6, and from 7 to 8)
- error_f=0 (the pattern does not change at the foregoing timing)

Thus, s:=(error_c/c)+(error_f/f)=(2/6)+0≈0.333 is obtained.

The value of s is small in the column “classification.” At this point, the pattern of the remarks column is the data of p_i. From among character strings inputted to the column “classification,” the following modes are obtained:

- p₁:m₁=address (five times in five pieces of data)
- p₂:m₂=telephone number (three times in three pieces of data)
- p₃:m₃=address (one time in a piece of data)
- p₄:m₄=address (one time in a piece of data)

Thus, four types of patterns are classified into p₁, p₂, p₃, and p₄.

The number of types of character strings inputted in the column “classification” is two, and thus the patterns are grouped into two types. In this case, the pattern that occurs most frequently in the data is regarded as a representative pattern, and the other patterns are regarded as patterns generated by erroneous inputs.

In the pattern of m_i=“address,” the occurrence frequency in the data is maximized in five times of the pattern p₁. Hence, the pattern p₁serves as a representative pattern and is classified as a group assumed to be indicative of identical information.

A technique B), statistical method according to likelihood

The step 0 is performed to extract patterns.

Columns are selected so as to satisfy the following two conditions:

- (1) The number of types of character strings inputted in the columns is four or less.
- (2) The columns are not any elements of the extracted patterns.

h₁:“position” and h₂:“classification” are set. For the pattern p_i(i=1, 2, 3, 4), m_ji(j=1, 2) is set as indicated in table 7.

TABLE 7

p₁
p₂
p₃
p₄

h₁
m₁₁= S
m₁₂= R
m₁₃= R
m₁₄= S

h₂
m₂₁= Address
m₂₂= Telephone
m₂₃= Address
m₂₄= Address

For each column h_j, two-dimensional tabulations are created as indicated in table 8 and table 9.

TABLE 8

h₁

p₁
p₂
p₃
p₄
Total

m₁₁
3
1/2
0
0
7/2

m₁₂
2
2
0
0
4

m₁₃
0
0
1
0
1

m₁₄
0
1/2
0
1
3/2

Total
5
3
1
1
10

In table 8, 1/2 is inputted at m₁₁:p₂and m₁₄:p₂because m₁₁and m₁₄are identical to each other with respect to a piece of data in which a character string S is inputted in the pattern p₂.

TABLE 9

h₂

p₁
p₂
p₃
p₄
Total

m₂₁
5
0
0
0
5

m₂₂
0
3
0
0
3

m₂₃
0
0
1
0
1

m₂₄
0
0
0
1
1

Total
5
3
1
1
10

For each h_j, table 10 and table 11 of expected frequencies are created.

TABLE 10

h₁

p₁
p₂
p₃
p₄
Total

m₁₁
7/4
21/20
7/20
7/20
7/2

m₁₂
2
6/5
2/5
2/5
4

m₁₃
1/2
3/10
1/10
1/10
1

m₁₄
3/4
9/20
3/20
3/20
3/2

Total
5
3
1
1
10

TABLE 11

h₂

p₁
p₂
p₃
p₄
Total

m₂₁
5/2
3/2
1/2
1/2
5

m₂₂
3/2
9/10
3/10
3/10
3

m₂₃
1/2
3/10
1/10
1/10
1

m₂₄
1/2
3/10
1/10
1/10
1

Total
5
3
1
1
10

As indicated by table 8 and table 10, a chi-square test amount χ₁²with respect to h₁is χ₁²≈18.23

As indicated by table 9 and table 11, a chi-square test amount χ₂²with respect to h₂is χ₂²=31

In this case, a significance level α=0.01 is determined. Since an upper significance probability of 0.01 has a value of 21.7 at 9 degrees of freedom, a hypothesis is established at h₁but is rejected at h₂, and h₂serves as the branch column.

In the case of a small amount of sample data as in the present example, a Fisher test is more suitable than a chi-square test. In an actual application of the technique, however, it is assumed that the number of data pieces is sufficiently large and a chi-square test is used.

The number of types of character strings inputted in the column “classification” of h₂is two, and thus the patterns are grouped into two types. In this case, the pattern that occurs most frequently in the data is regarded as a representative pattern, and the other patterns are regarded as patterns generated by erroneous inputs.

Specifically, in the pattern of m_2i=“address,” the occurrence frequency in the data is maximized in five times of the pattern p₁. Hence, the pattern p₁serves as a representative pattern, and other patterns p₃and p₄are classified as a group assumed to be indicative of the same information as the pattern p₁.

By using the technique A) heuristic method and the technique B) statistical method according to likelihood, patterns obtained from the mixed columns can be grouped according to information indicated by the patterns.

[Effects of Embodiment]

According to the present embodiment specifically described above, columns containing information to be shared between databases can be easily found when the databases are combined.

The method of the present invention can be also implemented by a computer system in which a plurality of databases are combined. A program for implementing the method steps can be recorded in a recording medium or can be provided via a network.

Furthermore, the invention of the present application is not limited to the foregoing embodiment and can be changed in various ways within the scope of the invention in the implementation of the invention. Moreover, the embodiment includes inventions at various stages. Various inventions may be extracted with a proper combination of disclosed constituent components. For example, even if some of the constituent components are deleted from all the constituent components described in the embodiment, a configuration may be extracted as the invention without the deleted constituent components as long as the invention can solve the problem described in the technical problem and obtain the effects described in the effects of the invention.

Claims

1. A data-pattern classification method for a database containing character strings regularly overlapping between input values of mixed columns and information inputted in another specific column, the mixed column including types of information, the method comprising:extracting a branch column for changing a type of information to be inputted to the mixed column, the mixed column including two or more different types of information, the branch column being extracted by one of a heuristic first technique and a second technique, the first technique using timing to change a pattern of the mixed column in the database including the mixed column, the pattern referring to a column where a character string overlaps the input value of the mixed column, the second technique using a statistical technique of a likelihood test; andclassifying to obtain the number of types of information stored in the mixed columns, by grouping, according to information indicated by the patterns, the patterns obtained from the mixed columns based on the extracted branch column,wherein the branch column indicates a column for changing information about the input value of a remarks column according to an inputted character string,wherein if a first character string is input in the branch column, making an input in the remarks column under a first rule; andwherein if a second character string is input in the branch column, making an input in the remarks column under a second rule.
2. The data-pattern classification method according to claim 1, further comprising, prior to the extraction, preprocessing for extracting all indexes of columns including character strings shared with the mixed columns.
3. The data-pattern classification method according to claim 1, wherein in the classification, a representative pattern is selected from the patterns belonging to one group, an order of superiority is determined among the patterns, and the patterns are grouped with the input values unified according to the representative pattern.
4. A data-pattern classification system for a database containing character strings regularly overlapping between input values of mixed columns and information inputted in another specific column, the mixed column including types of information, the system comprising:one or more processors; andmemory including code that, when executed by the one or more processors, perform to:extract a branch column for changing a type of information to be inputted to the mixed column, the mixed column including two or more different types of information, the branch column being extracted by one of a heuristic first technique and a second technique, the first technique using timing to change a pattern of the mixed column in the database including the mixed column, the pattern referring to a column where a character string overlaps the input value of the mixed column, the second technique using a statistical technique of a likelihood test; andclassify means for obtaining the number of types of information stored in the mixed columns, by grouping, according to information indicated by the patterns, the patterns obtained from the mixed columns based on the extracted branch column;wherein the branch column indicates a column for changing information about the input value of a remarks column according to an inputted character string,if a first character string is input in the branch column, make an input in the remarks column under a first rule; andif a second character string is input in the branch column, make an input in the remarks column under a second rule.
5. The data-pattern classification system according to claim 4 wherein the code, when executed by the one or more processors, further performs to, prior to the extraction, preprocess for extracting all indexes of columns including character strings shared with the mixed columns.
6. The data-pattern classification system according to claim 4, wherein in the classification, a representative pattern is selected from the patterns belonging to one group, an order of superiority is determined among the patterns, and the patterns are grouped with the input values unified according to the representative pattern.

PCT Information

Filing Document	Filing Date	Country	Kind
PCT/JP2020/016772	4/16/2020	WO

Publishing Document	Publishing Date	Country	Kind
WO2021/210142	10/21/2021	WO	A

US Referenced Citations (5)

Number	Name	Date	Kind
20120101975	Khosravy	Apr 2012	A1
20120124037	Lee	May 2012	A1
20130322743	Matsunaga	Dec 2013	A1
20180011830	Iida	Jan 2018	A1
20210334280	Medvedev	Oct 2021	A1

Foreign Referenced Citations (2)

Number	Date	Country
2014219833	Nov 2014	JP
2016136341	Jul 2016	JP

Related Publications (1)

	Number	Date	Country
	20230259589 A1	Aug 2023	US

Classification method of data pattern and classification system of data pattern

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

Field of Search

US

International Classifications

Term Extension

Abstract

Description

Claims

PCT Information

US Referenced Citations (5)

Foreign Referenced Citations (2)

Related Publications (1)