The embodiments relate to an image processing learning program, an image processing program, an information processing apparatus, and an image processing system.
As a conventional technique, there has been proposed an image processing learning program for clustering a data set beforehand and performing learning of super resolution (see, for example, Non Patent Literature 1).
In single image super resolution for restoring a single high-resolution image from a single low-resolution image, the image processing learning program disclosed in Non Patent Literature 1 prepares a plurality of low-resolution images as a data set, clusters the data set beforehand with k-means clustering to divide the data set into classification domains, and prepares a convolutional neural network (CNN) models as many as the number of classification domains and performs learning using the distance between an image input to the CNN models and a cluster center to obtain super-resolution models. The image processing learning program performs, about the trained CNN models, which are the super-resolution models, inference using the distance between the input image and the cluster center.
However, with the image processing learning program of Non Patent Literature 1 described above, the data set is clustered beforehand. Therefore, although efficiency of learning is improved, since the clustering is sometimes performed based on feature values such as a color, light and shade, and the like of an image, there is a problem in that the clustering does not always link to improvement of accuracy of super resolution.
Therefore, an object of one of embodiments is to provide an image processing training program for clustering, without requiring labeling in advance, a data set used for training of image processing and performing the training of the image processing models such that accuracy of the image processing for classification domains is improved, a trained image processing program, and an information processing apparatus and an image processing system.
An aspect of embodiments provide, in order to achieve the object, an image processing learning program, an image processing program, an information processing apparatus, and an image processing system explained below.
An aspect of embodiments is a non-transitory computer-readable medium having stored thereon instructions, that when executed by a processor, causes the processor to process the following steps (S-1)-(S-8). The step (S-1) includes preparing a plurality of target images. The step (S-2) includes preparing a plurality of training images, where each of the plurality of training images is prepared by lowering a resolution of a corresponding target image of the plurality of target images. The step (S-3) includes, for each of the plurality of training images, training and updating a first super-resolution model by executing the following substeps a) to d): a) inputting a training image of the plurality of training images into the first super-resolution model and generating a higher-resolution training image, b) comparing the higher-resolution training image with a corresponding target image of the plurality of target images, c) calculating a difference between the higher-resolution training image and the corresponding target image, and d) updating the first super-resolution model through a feedback of the calculated difference. The step (S-4) includes, for each of the plurality of training images, training a second super-resolution model in a same manner as training the first super-resolution model by executing the substeps a) to d) in the step (S-3), and updating the second super-resolution model. The step (S-5) includes labeling each of the plurality of training images based on the differences obtained from the updated first super-resolution model and the updated second super-resolution model through the substep d) in the step (S-3), and classifying each of the plurality of training images according to each label representing a preference of either the updated first super-resolution model or the updated second super-resolution model. The step (S-6) includes, using each of the plurality of training images that are clustered in a largest cluster, based on the classification of the plurality of training images, that includes a greatest number of the plurality of training images having a commonly preferred updated super-resolution model, training a super-resolution model-K in a same manner as training the first super-resolution model by executing the substeps a) to d) in the step (S-3), and thereby updating the super-resolution model-K, wherein K is an arbitrary number in a sequence. The step (S-7) includes updating the label of each of the training images of the plurality of training images in the largest cluster based on differences obtained from the updated super-resolution model-K and the commonly preferred updated super-resolution model, and re-classifying the training images of the plurality of training images in the largest cluster into sub-clusters according to each of the updated label representing a preference of either the updated super-resolution model-K or the commonly preferred updated super-resolution model. The step (S-8) includes repeating the steps (S-6)-(S-7) to generate sub-clusters until a predetermined condition is satisfied.
Another aspect of embodiments is a non-transitory computer-readable medium having stored thereon instructions, that when executed by a processor, causes the processor to process the following steps (S-1)-(S-8). The step (S-1) includes preparing a plurality of target images. The step (S-2) includes preparing a plurality of training images, wherein each of the plurality of training images is prepared by lowering a resolution of a corresponding target image of the plurality of target images. The step (S-3) includes, for each of the plurality of training images, training and updating a first super-resolution model by executing the following substeps a) to d): a) inputting the training image in the first super-resolution model and generate a higher-resolution training image, b) comparing the higher-resolution training image with the corresponding target image of the plurality of target images, c) calculating a difference between the higher-resolution training image and the corresponding target image, and d) updating the first super-resolution model through a feedback of the calculated difference, wherein the calculated difference is recorded as resolution accuracy of the first-resolution model to the corresponding training image. The step (S-4) includes, for each of the plurality of training images, training a second super-resolution model in a same manner as training the first super-resolution model by executing the substeps a) to d) in the step (S-3), and updating the second super-resolution model. The step (S-5) includes determining which one of updated super-resolution models preferably resolved a greatest number of the plurality of training images. The step (S-6) includes, using each of the plurality of training images of the greatest number of the plurality of training images resolved by the preferred updated super-resolution model, train a super-resolution model-K in a same manner as training the first super-resolution model by executing the substeps a) to d) in the step (S-3), and updating the super-resolution model-K, wherein K is an arbitrary number in a sequence. The step (S-7) includes, using each of all the plurality of training images, training all of the updated super-resolution models including the updated super-resolution model-K, in a same manner as training the first super-resolution model by executing the substeps a) to d) in (S-3). The step (S-8) includes repeating the steps (S-6)-(S-7) to update the resolution accuracy of each of the updated super-resolution models corresponding to each of the plurality of training images, until a predetermined condition is satisfied.
Yet another aspect of embodiments is a method for processing images that includes the following steps (S-1)-(S-8), by one or more computing devices. The step (S-1) includes preparing a plurality of target images. The step (S-2) includes preparing a plurality of training images, wherein each of the plurality of training images is prepared by lowering a resolution of a corresponding target image of the plurality of target images. The step (S-3) includes, for each of the training images, training and updating a first super-resolution model by executing the following substeps a) to d): a) inputting the training image in the first super-resolution model and generate a higher-resolution training image, b) comparing the higher-resolution training image with a corresponding target image of the plurality of target images, c) calculating a difference between the higher-resolution training image and the corresponding target image, and d) updating the first super-resolution model through a feedback of the calculated difference. The step (S-4) includes, for each of the plurality of training images, training a second super-resolution model in a same manner as training the first super-resolution model by executing the substeps a) to d) in the step (S-3), and updating the second super-resolution model. The step (S-5) includes labeling each of the plurality of training images based on the differences obtained from the updated first super-resolution model and the updated second super-resolution model through the substep d) in the step (S-3), and classifying each of the plurality of training images according to each label representing a preference of either the updated first super-resolution model or the updated second super-resolution model. The step (S-6) includes, using each of the plurality of training images that are clustered in a largest cluster, based on the classification of the plurality of training images, that includes a greatest number of the plurality of training images having a commonly preferred updated super-resolution model, training a super-resolution model-K in a same manner as training the first super-resolution model by executing the substeps a) to d) in the step (S-3), and updating the super-resolution model-K, wherein K is an arbitrary number in a sequence. The step (S-7) includes updating the label of each of the training images of the plurality of training images in the largest cluster based on differences obtained from the updated super-resolution model-K and the commonly preferred updated super-resolution model, and re-classifying the training images of the plurality of training images in the largest cluster into sub-clusters according to each of the updated label representing a preference of either the updated super-resolution model-K or the commonly preferred updated super-resolution model. The step (S-8) includes repeating the steps (S-6)-(S-7) to generate sub-clusters.
Various embodiments of the present invention may be described with reference to flowcharts and block diagrams whose elements may represent (1) steps of processes in which operations are performed or (2) sections of apparatuses responsible for performing operations. Certain steps and sections may be implemented by dedicated circuitry, programmable circuitry, and/or processors supplied with computer-readable instructions stores on computer-readable media.
A super-resolution system 5 as an example of this image processing system is configured by communicably connecting a terminal 1 functioning as an information processing apparatus and a Web server 2 to each other by a network 3.
The terminal 1 is an information processing apparatus of a portable type such as a notebook personal computer (PC), a smartphone, or a tablet terminal and includes, in a main body, electronic components such as a central processing unit (CPU) having a function of processing information, a graphics processing unit (GPU), and a flash memory. Note that the terminal 1 is not limited to the information processing apparatus of the portable type and may be a PC of a stationary type.
The Web server 2 is a server-type information processing apparatus and operates according to a request of the terminal 1. The Web server 2 includes, in a main body, electronic components such as a CPU having a function of processing information and a flash memory.
The network 3 is a communication network capable of performing high-speed communication and is, for example, a wired or wireless communication network such as the Internet or a local area network (LAN).
As an example, the terminal 1 transmits a request to the Web server 2 for browsing a Web page. In response to the request, the Web server 2 transmits, to the terminal 1, Web page information 20 forming a Web page including an image for distribution 200 to be displayed on the Web page. The terminal receives the Web page information 20 and the image for distribution 200 and classifies the image for distribution 200, which is an input image, into a category. As an example of image processing, the terminal 1 converts the image for distribution 200 into a high-resolution (super-resolution) image using a super-resolution model suitable for the category and displays a display image 130 on a display unit (13, see
The terminal 1 is configured from a CPU, a GPU, or the like and includes a control unit 10 that controls units and executes various programs, a storing unit 11 that is configured from a storage medium such as a flash memory and stores information, a communication unit 12 that communicates with the outside via the network 3, a display unit 13 that is configured from a liquid crystal display (LCD) or the like and displays characters and images, and an operation unit 14 that is configured from a touch panel, a keyboard, switches, and the like, which can be touched and operated, arranged on the display unit 13 and receives operation by a user.
The control unit 10 executes a Web browser program 110 explained below to function as Web-page-information receiving means 100, Web-page-display control means 103, and the like. The control unit 10 executes a super-resolution program 111 functioning as an image processing program explained below to function as an image classifying model 101, a plurality of super-resolution models 1020, 1021, . . . , and the like. The control unit 10 executes a super-resolution learning program 114 functioning as an image processing training program explained below to function as training means 104 for training the image classifying model 101, the plurality of super-resolution models 1020, 1021, . . . , and the like.
The Web-page-information receiving means 100 receives the Web page information 20 including the image for distribution 200 from the Web server 2 via the communication unit 12 and stores the Web page information 20 in the storing unit 11 as Web page information 112. Note that the storage of the Web page information 112 may be temporary.
The trained image classifying model 101 classifies the image for distribution 200 received by the Web-page-information receiving means 100 into a category and selects super-resolution models suitable for the category of the image for distribution 200 among the plurality of trained super-resolution models 1020, 1021, . . . . Note that the image classifying model 101 is trained, for example, by using a CNN (Convolutional Neural Network) but may be trained with logistic regression, a support vector machine, a decision tree, a random forest, Stochastic Gradient Descent (SGD), Kernel density estimation, a k-nearest neighbors algorithm, perceptron, or the like.
The plurality of trained super-resolution models 1020, 1021, . . . functioning as image processing models are super-resolution models specialized for super resolution of images in respective different categories. The plurality of trained super-resolution models 1020, 1021, . . . super-resolve the image for distribution 200 serving as an input image classified by the trained image classifying model 101, generate high-resolution super-resolution image information 113 serving as an output image, and store the super-resolution image information 113 in the storing unit 11. Note that the super-resolution models 1020, 1021, . . . are trained, for example, by using the CNN but may be trained with an equivalent algorithm.
The Web-page-display control means 103 displays, based on the Web page information 112, the display image 130 of the Web page on the display unit 13 instead of the image for distribution 200 using the super-resolution image information 113.
The training model 104 causes the untrained image classifying model 101 and the plurality of untrained super-resolution models 1020, 1021, . . . to learn. Details of training methods for learning are explained below. Note that the training model 104 and the super-resolution learning program 114 are not essential components for the terminal 1 and are generally executed and stored by different apparatuses and are included in the configuration for convenience of explanation. That is, the training model 104 and the super-resolution learning program 114 only have to be executed by the different apparatuses. The trained image classifying model 101, the plurality of trained super-resolution models 1020, 1021, . . . , and the super-resolution program 111 as a result of training in the different apparatuses only have to be included in the terminal 1.
The storing unit 11 stores the Web browser program 110 for causing the control unit 10 to operate as the means 100 and 103 explained above, the super-resolution program 111 for causing the control unit 10 to operate as the models 101, 1020, 1021, . . . explained above, the Web page information 112, the super-resolution image information 113, the super-resolution learning program 114 for causing the control unit 10 to operate as the training model 104 explained above, and the like.
Next, actions of this embodiment are divided into (1) a super-resolution operation and (2) a training operation, and are explained respectively. In the “(1) super-resolution operation”, the operation of executing the super-resolution program 111 trained by the “(2) training operation” and super-resolving of the image for distribution 200 is explained. In the “(2) learning operation”, the operation for executing the super-resolution learning program 114 to cause the image classifying model 101 and the plurality of super-resolution models 1020, 1021, . . . to learn is explained.
First, the Web-page-information receiving means 100 of the terminal 1 receives the Web page information 20 including the image for distribution 200 from the Web server 2 via the communication unit 12 and stores the Web page information 20 in the storing unit 11 as the Web page information 112 (S10).
Subsequently, the trained image classifying model 101 of the terminal 1 extracts the image for distribution 200 from the Web page information 20 received by the Web-page-information receiving means 100 (S11).
Subsequently, the trained image classifying model 101 extracts, from the extracted image for distribution 200, a plurality of patches 2001, 2002, 2003, . . . as partial regions. The trained image classifying model 101 performs patch processing of the plurality of patches 2001, 2002, 2003, . . . and obtains outputs for the plurality of patches 2001, 2002, 2003, . . . . The trained image classifying model 101 operates based on the super-resolution program 111 serving as a training result, classifies the image for distribution 200 into a category from a value obtained by averaging the outputs for the plurality of patches 2001, 2002, 2003, . . . (S12) and selects, among the plurality of trained super-resolution models 1020, 1021, . . . , for instance, the trained super-resolution model 1021 corresponding to a category of a classification result and most suitable for super resolution of the image for distribution 200 (S13).
Subsequently, the trained super-resolution model 1021 selected by the trained image classifying model 101 super-resolves the image for distribution 200 (S14), generates high-resolution super-resolution image information 113, and stores the high-resolution super-resolution image information 113 in the storing unit 11.
Subsequently, the Web-page-display control means 103 of the terminal 1 displays, based on the Web page information 112, the display image 130 of the Web page on the display unit 13 using the super-resolution image information 113 instead of the image for distribution 200 (S15).
First, as illustrated in
The super-resolution model 1020 super-resolves a j-th low-resolution image for learning 500lj of the low-resolution images for learning 500l0 to 500l7 and obtains a super-resolution image 500sr0j. Subsequently, the training model 104 compares the super-resolution image 500sr0j with a j-th original image 500hj of original images 500h0 to 500h7 serving as target images having higher resolution than the low-resolution images for learning 500l0 to 500l7 prepared in advance and calculates differences. As the difference, for example, a mean squared error (MSE) or a mean absolute error (MAE) is used. The differences may be calculated by using a CNN that has been trained to calculate difference. The training model 104 feeds back the differences and train the super-resolution model 1020 about the entire low-resolution images for learning 500l0 to 500l7 such that the differences decrease. In the following explanation, the difference being small is referred to as “accuracy of super resolution is high”.
Subsequently, as illustrated in
The super-resolution model 1021 super-resolves the j-th low-resolution image for learning 500lj of the low-resolution images for learning 500l0 to 500l7 and obtains a super-resolution image 500sr1j. Subsequently, the training model 104 compares the super-resolution image 500sr1j with the j-th original image 500hj of the high-resolution original images 500h0 to 500h7 of the low-resolution images for learning 500l0 to 500l7 prepared in advance and calculates differences. The training model 104 feeds back the differences and trains the super-resolution model 1021 with the entire low-resolution images for learning 500l0 to 500l7 such that the differences decrease.
Note that the training model 104 may copy the trained super-resolution model 1020 as the super-resolution model 1021 and reduce a time required for training and cost of processing.
Subsequently, as illustrated in
The super-resolution model 1020 and the super-resolution model 1021 super-resolve the j-th low-resolution image for learning 500lj of the low-resolution images for learning 500l0 to 500l7 and obtain the super-resolution image 500sr0j and the super-resolution image 500sr1j. Subsequently, the training model 104 compares the super-resolution image 500sr0j and the super-resolution image 500sr1j with the high-resolution original image 500hj and calculates differences. The training model 104 gives, to the low-resolution image for learning 500lj, a classification label (0 or 1) of the super-resolution model 1020 or the super-resolution model 1021 that outputs the super-resolution image 500sr0j or the super-resolution image 500sr1j having the smaller difference and clusters the group 50 and feeds back the j-th low-resolution image for learning 500lj to the super-resolution model 1020 or the super-resolution model 1021 having the smaller difference and cause the super-resolution model 1020 or the super-resolution model 1021 to learn the j-th low-resolution image for learning 500lj. Note that, when the differences coincide about the super-resolution model 1020 and the super-resolution model 1021, the training model 104 selects one of the super-resolution model 1020 and the super-resolution model 1021, gives the classification label (0 or 1) to the low-resolution image for learning 500lj, and clusters the group 50 and feeds back the j-th low-resolution image for learning 500lj to the selected super-resolution model 1020 or super-resolution model 1021 and causes the super-resolution model 1020 or the super-resolution model 1021 to learn the j-th low-resolution image for learning 500lj. The super-resolution model to which the low-resolution image for learning 500lj is fed back to cause the super-resolution model to learn the low-resolution image for learning 500lj does not always need to be one of the super-resolution model 1020 and the super-resolution model 1021. The super-resolution model 1020 and the super-resolution model 1021 may be weighted based on accuracies thereof and caused to learn. That is, weight for the feedback and the learning may be set large for either of the super-resolution model 1020 or the super-resolution model 1021 whichever having the smaller difference, and the weight for the feedback and the learning may be set small for either of the super-resolution model 1020 or the super-resolution model 1021 whichever having the larger difference.
As a result of the clustering, as illustrated in
If the domain is divided (S26; Yes), the training model 104 of the terminal 1 executes steps S23 to S25 about the next untrained super-resolution model (S27; No, S28).
Subsequently, as illustrated in
Subsequently, as illustrated in
The super-resolution model 1020 and the super-resolution model 1022 super-resolve the j-th low-resolution image for learning 500lj of the low-resolution images for learning 500l0 to 500l4, respectively, and obtain the super-resolution image 500sr0j and a super-resolution image 500sr2j. Subsequently, the training model 104 compares the super-resolution image 500sr0j and the super-resolution image 500sr1j with the high-resolution original image 500hj, respectively, and calculates differences. The training model 104 gives a classification label (0 or 2) of the super-resolution model 1020 or the super-resolution model 1022 having the smaller difference to the low-resolution image for learning 500lj and clusters the group 500 and feeds back the j-th low-resolution image for learning 500lj to the super-resolution model 1020 or the super-resolution model 1022 having the smaller difference and causes the super-resolution model 1020 or the super-resolution model 1022 to learn the j-th low-resolution image for learning 500lj. The super-resolution model to which the low-resolution image for learning 500lj is fed back to cause the super-resolution model to learn the low-resolution image for learning 500lj does not always need to be one of the super-resolution model 1020 and the super-resolution model 1022. The super-resolution model 1020 and the super-resolution model 1022 may be weighted based on accuracies thereof and caused to learn.
As explained above, the super resolution is performed by the k-th super-resolution model corresponding to the largest classification domain and the single i-th super-resolution model. The largest classification domain is divided based on the accuracy of the super resolution and the training of the super-resolution model is performed based on the accuracy of the super resolution. However, the super resolution may be performed by the k-th super-resolution model corresponding to the largest classification domain and a plurality of i-th, i+1-th, i+2-th, . . . super-resolution models, where the largest classification domain may be divided based on the accuracy of the super resolution, and the training of the super-resolution models may be performed based on the accuracy of the super resolution.
As a result of the clustering, as illustrated in
When finishing executing steps S23 to S25 about all the prepared super-resolution models (S27; Yes), the training model 104 of the terminal 1 ends the operation. Even when the operation is not executed about all the prepared super-resolution models, if the domain is not divided any more (S26; No), the training model 104 of the terminal 1 ends the operation and stops using the super-resolution model.
When all the steps end, the learning of all the super-resolution models 1020, 1021, . . . is completed, and the classification domain of the group 50 is divided, the training model 104 learns the image classifying model 101 about the low-resolution image for learning 500lj to which the classification label of the group 50 is given. Note that, as in the case illustrated in
According to the first embodiment explained above, for super-resolving a single image, when the plurality of super-resolution models 1020, 1021, . . . 102k, . . . 102i are trained, super-resolution model 102k corresponding to a classification domain 50k having the largest amount of data in the data set (the group 50) and super-resolution model 102i that is to be trained anew using the data in the classification domain 50k are caused to compete and train. A label of the super-resolution model 102k or 102i having high accuracy of super resolution of an image included in the classification domain 50k is given to the data set (the group 50) and the data set (the group 50) is clustered. The super-resolution model 102k or 102i having a result with high accuracy is caused to learn with the image and set as the super-resolution model 102k or 102i optimized for a divided classification domain 50k and 50i. Therefore, it is possible to cluster the data set (the group 50) used for the training of the super resolution without necessity of labeling the data set (the group 50) in advance. It is possible to efficiently perform the optimization of the classification domains 50k and 50i and the super-resolution model 102k and 102i. Since the data set can be spontaneously clustered by the training of the super-resolution model, it is possible to prepare a data set for training of the image classifying model 101 without requiring labeling in advance. It is possible to efficiently train the image classifying model 101.
By preparing the plurality of super-resolution models 1020, 1021, . . . and specialized according to a category of an image, it is possible to improve accuracy as a whole and the respective super-resolution models 1020, 1021, . . . can be formed as light-weight models. By causing the trained plurality of super-resolution models 1020, 1021, . . . and image classifying model 101 to function in the terminal 1, it is possible to reduce the data volume of the image for distribution 200 and reduce a communication volume of the network 3.
A second embodiment is different from the first embodiment in that a classification label is not given in clustering in a training operation. Note that, since a configuration and a super-resolution operation are the same as those in the first embodiment, explanation about the configuration and the super-resolution operation is omitted.
First, the training model 104 of the terminal 1 trains, with the super-resolution model 1020 and the super-resolution model 1021, which are untrained zero-th and first super-resolution model, the entire low-resolution images for learning 500l0 to 500l7 included in the entire group 50, which is a learning target, (S30). Note that, since a training method is the same as the training method in the first embodiment, explanation about the training method is omitted.
Subsequently, the training model 104 of the terminal 1 sets a variable l=2 (S31) and, as illustrated in an upper part of
Subsequently, as illustrated in an upper part of
Subsequently, as illustrated in an upper part of
In this state, as illustrated in a lower part of
As illustrated in a lower part of
In this way, steps S32 to S34 are executed about all untrained models (S35, S36).
When all the steps explained above end and the training of all the super-resolution models 1020, 1021, . . . is completed, the training model 104 trains the image classifying model 101 about the group 50 using finally obtained accuracy information 101a1.
According to the second embodiment explained above, in super-resolving a single image, when the plurality of super-resolution models 1020, 1021, . . . are caused to learn, a super-resolution model having high accuracy is quantified and the super-resolution model 102k corresponding to the classification domain 50k in the data set (the group 50) used for the training of the super resolution and the super-resolution model 102i to be trained anew are caused to compete and learn. Therefore, it is unnecessary to label, in advance, the data set (the group 50) used for the training of the super resolution. Labeling during the training is unnecessary and clustering is possible. It is possible to efficiently perform the optimization of the super-resolution model 102k and 102i.
Note that the embodiments are not limited to the embodiments explained above. Various modifications of the embodiments are possible in a range not departing from the gist of the present invention.
In the embodiments, the example is explained in which the Web page information 20 including the image for distribution 200 is distributed from the Web server 2 via the network 3 and the image for distribution 200 is super-resolved in the terminal 1. However, a low-resolution image only has to be distributed and super-resolved in the terminal 1. It goes without saying that it is unnecessary to include the low-resolution image in the Web page information 20 and distributed. That is, the super-resolution program 111 for causing the image classifying model 101 and the super-resolution models 1020, 1021, . . . to operate can be combined with not only the Web browser but also any application program included in the terminal 1.
Note that the group 50 of images used for training and the image for distribution 200 may be different from each other or may be the same. When the group 50 and the image for distribution 200 are different, it is possible to create the super-resolution models 1020, 1021, . . . , which are general models, from the group 50. When the group 50 and the image for distribution 200 are the same, it is possible to create the super-resolution models 1020, 1021, . . . optimum for the image for distribution 200.
In the embodiments, the super resolution is explained as the example of the image processing. However, as other examples, the embodiments are also applicable to training about image processing such as noise removal from an image, removal of a blur, and sharpening. Content of the image processing is not particularly limited. About the image processing trained using the training method, content of the image processing is not limited to the super resolution either.
In the embodiments explained above, the functions of the models 100 to 104 of the control unit 10 are realized by the program. However, all or a part of the models may be realized by hardware such as an ASIC. The program used in the embodiments can also be stored in a recording medium such as a CD-ROM and provided. Replacement, deletion, addition, and the like of the steps explained in the embodiments are possible in a range not changing the gist of the present invention.
According to an aspect of embodiments, it is possible to perform the training of the image processing such that accuracy of the image processing for the classification domain of input images is improved.
According to an aspect of embodiments, it is possible to complete the training when all of the plurality of image processing models are trained or when the classification label of the training image of the classification domain having the largest number of training images of the cluster is only i-th or k-th.
According to an aspect of embodiments, it is possible to cluster, without requiring labeling in advance, the data set used for the training of the image processing.
According to an aspect of embodiments, it is possible to classify the image that is subjected to the image processing to any one of the predetermined plurality of categories, and subject the image to the image processing with the image processing model associated with the category of the classification result.
According to an aspect of embodiments, it is possible to extract the plurality of partial regions included in the image that is subjected to the image processing, calculate feature values of the plurality of partial regions in the image, and average the calculated feature values to classify the image for the image processing.
According to an aspect of embodiments, it is possible to perform image processing optimized for the image distributed by the server apparatus.
There are provided an image processing learning program for clustering, without requiring labeling in advance, a data set used for learning of image processing and performing the learning of the image processing such that accuracy of the image processing for classification domains is improved, an image processing program trained by the program, and an information processing apparatus and an image processing system.
Number | Date | Country | Kind |
---|---|---|---|
2019-047434 | Mar 2019 | JP | national |
The present application is a bypass continuation application based on and claims the benefit of priority from the prior Japanese patent application No. 2019-047434 filed on Mar. 14, 2019, and PCT Application No. PCT/JP2020/004451 filed Feb. 6, 2020, the entire contents of which are incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/JP2020/004451 | Feb 2020 | US |
Child | 17371112 | US |