This application claims the benefit of Japanese Patent Application No. 2018-158048, filed on Aug. 27, 2018 and Japanese Patent Application No. 2019-122644 filed on Jul. 1, 2019, the entire disclosures of which are incorporated by reference herein.
The present disclosure relates generally to a similar image display control apparatus, a similar-image display control system, a similar image display control method, a display control apparatus, a display control system, a display control method, and a recording medium.
In dermatology, diagnosing skin disease is a very difficult task that requires expertise. Recently, techniques are being developed for image-capturing a disease-affected area and analyzing the captured image with use of a computer. Such techniques involve compiling a database of a large volume of disease cases, performing a similar image search using a captured image of a disease-affected area of a patient as a query image, and then diagnosing the disease-affected area of the patient based on similar disease cases.
As an example of an apparatus that displays similar images, for example, Unexamined Japanese Patent Application Kokai Publication No. 2010-250529 describes an image searching apparatus and the like that extracts similar images that are similar to a query image from a database of registered images, arranges the extracted similar images on the periphery of the query result, and presents, to display means, a search result in which the query image and the similar images are connectedly displayed.
In order to support the diagnosis, techniques for determining whether a disease-affected area is benign or malignant are also being developed. For example, in “Nevisense—a breakthrough in non-invasive detection of melanoma”, [online], [Searched Jun. 14, 2019] on the Internet (URL: https://scibase.com/the-nevisense-product/), a diagnostic support apparatus that visually provides a benign/malignant skin disease ratio using single-axis information is described.
A similar image display control apparatus of the present disclosure includes a processor configured to
acquire similar images obtained as a result of a similar image search with respect to a query image,
set categories into which the acquired similar images are to be classified,
determine, in a space having a prescribed number of dimensions that is no less than two, coordinates indicating a position of each category region in accordance with attributes of types equal in number to the number of dimensions, the category region being a region indicating one of the set categories,
classify the acquired similar images into the set categories, and
place the similar images, classified into each of the categories, within the category region positioned in a position as indicated by the determined coordinates and cause a display to display the placed similar images.
Also, a display control apparatus of the present disclosure includes a processor configured to
acquire a malignant index representing a possibility that an attribute of a disease of a diagnosis target area is malignant and a first disease attribute index representing a possibility that an attribute of the disease of the diagnosis target area is a prescribed first disease attribute, and
cause the acquired malignant index and the acquired first disease attribute index to be displayed in association with each other on a display.
A more complete understanding of this application can be obtained when the following detailed description is considered in conjunction with the following drawings, in which:
A similar image display apparatus and the like according to embodiments of the present disclosure are described below with reference to the accompanying drawings. Throughout the drawings, components that are the same or equivalent are assigned the same reference signs.
A similar image display apparatus 100 according to Embodiment 1 of the present disclosure collects, for each prescribed category, search images obtained as a result of a similar image search with respect to a query image, and arranges the search images within the categories based on the degree of similarity with the query image. A relationship between similar images can be displayed in a manner that is easy to understand by arranging and displaying, in an n-dimensional space defined by a prescribed axis or axes, categories into which the similar images are collected and arranged. The manner in which such a display is performed is described below.
The similar image display apparatus 100 according to Embodiment 1, as illustrated in
The controller 10 includes, for example a central processing unit (CPU), and executes programs stored in the storage 20 to achieve the functions of individual components (similar image acquirer 11, category setter 12, position determiner 13, classifier 14, and image display controller 15), which are described further below.
The storage 20 includes a read-only memory (ROM), a random access memory (RAM), and the like, and stores programs to be executed by the CPU of the controller 10 and necessary data.
The inputter 31 is a device used by a user of the similar image display apparatus 100 to input instructions directed at the similar image display apparatus 100 and input query images. Examples of the inputter 31 include a keyboard, mouse, touch panel, camera, and the like. The controller 10 acquires instructions and query images from the user via the inputter 31. Any device can be used as the inputter 31 as long as the controller 10 can acquire instructions or query images from the user. Moreover, the controller 10 may acquire query images via the communicator 33. The term query image refers to image data to be inputted when conducting a search for similar images that are to be displayed on the similar image display apparatus 100. The similar image display apparatus 100 presents, to the user, images that are similar to the query image in an easy to understand manner.
The outputter 32 is a device used by the controller 10 to present similar images to the user. Examples of such devices include a display, an interface for a display, and the like. The similar image display apparatus 100 may include the outputter 32 as a display, and may display a search result or the like on an external display connected via the outputter 32. The similar image display apparatus 100 without the display (similar image display apparatus 100 in which the outputter 32 is an interface for the display) is also referred to as the similar image display control apparatus.
The communicator 33 is a device (network interface, for example) for transmitting and receiving data to and from another external device (server storing a database of image data, or a similar image searching device, for example). The controller 10 can acquire query images and images similar to the query image via the communicator 33.
Next, the function of the controller 10 is described. The controller 10 achieves the functions of a similar image acquirer 11, the category setter 12, the position determiner 13, the classifier 14, and the image display controller 15.
The similar image acquirer 11 acquires data (image data of similar images and a degree of similarity between these images and the image query) obtained as a result of the similar image search with respect to the query image. Specifically, the similar image acquirer 11 acquires data of images that have a degree of similarity that is greater than or equal to a prescribed threshold in the similar image search and also acquires the degree of similarity. The similar image acquirer 11 may acquire data of similar images obtained as a result of the search by the controller 10 for images that are similar to the query image, and for example, may cause an external similar image searching device to search, via the communicator 33, for images that are similar to the query image, and may also acquire data of the similar images searched by the similar image searching device. Also, the image data is appended with their own corresponding information such as the disease names associated in one-to-one correspondence to the images as tag information.
The category setter 12 sets a category group (plurality of categories) into which images acquired by the similar image acquirer 11 are classified. In a case where the target is image data of skin, this category group is, “disease name” (pigmented nevus, melanoma, basal cell carcinoma, or the like), “outer shape” (round, star-shaped, elliptical, or the like), “color” (red, black, brown, or the like), “size”, “internal structure”, “nevus (pigmented spot) state” (mesh pattern, globular pattern, cobblestone pattern, homogenous pattern, parallel pattern, starburst pattern, multi-component pattern, unspecific pattern) or the like. For example, in a case in which the category group is the disease name, one category for each of the specific disease names: pigmented nevus, melanoma, and basal cell carcinoma is created. Information of the category group (plurality of categories) into which images are classified is stored in advance in the storage 20. The category setter 12 sets the category group (plurality of categories) into which image data is classified, based on the information of the category group stored in the storage 20.
The position determiner 13 determines a position where a region indicating each category included in the category group (plurality of categories) set by the category setter 12 is displayed as coordinates in an n-dimensional space based on n-types of attributes (n being an integer greater than or equal to one). More specifically, each attribute of n-types of attributes is associated in one-to-one correspondence with a coordinate axis of n-axes defining the coordinates of the n-dimensional space, and the coordinates indicating the position where the individual regions (category region), each indicating a category, is to be displayed is based on attribute values of individual attributes, each individual attribute corresponding to a particular coordinate axis of the coordinate axes.
A case in which the disease name is set as the category group by the category setter 12 and the position determiner 13 determines positions within a two-dimensional space of the category group (disease name) with two types of attributes “benign/malignant” and “melanocytic/non-melanocytic” is considered as an example. In this case, the position determiner 13, for example as illustrated in
If, as a specific example, the following five disease names: pigmented nevus, melanoma, seborrheic keratosis, hematoma/hemangiomas, and basal cell carcinoma are considered, the attributes for the diseases are as follows: “benign, melanocytic” for pigmented nevus, “malignant, melanocytic” for melanoma, “benign, non-melanocytic” for seborrheic keratosis, “benign, non-melanocytic” for hematoma/hemangiomas, and “malignant, non-melanocytic” for basal cell carcinoma. Therefore, the position determiner 13, as illustrated in
The position determiner 13 may adjust the display positions of the categories as necessary so that the positions where different categories are displayed have different coordinates. For example, in the example illustrated in
Information of the n-types of attributes that is used for determining the coordinate axes in a space, information of the attributes for each of the categories, and placement information for the individual attributes, for the position determiner 13 to determine the display positions for each of the categories, is stored in advance in the storage 20. The position determiner 13 determines the coordinates, in the n-dimensional space, of the positions where the category group (plurality of categories) is displayed, based on the information of the n-types of attributes, information of the attributes for each of the categories, and placement information for the individual attributes. In the example illustrated in
The classifier 14 classifies image data acquired by the similar image acquirer 11 into one of the categories of the category group (plurality of categories) set by the category setter 12. The classifier 14 can classify image data by use of tag information that is appended to the image data (for example, the disease name is appended as tag information to each image).
The image display controller 15 places, based on the degree of similarity with the query image, the image data, which is classified into each of the categories by the classifier 14, inside the regions of the respective categories whose coordinates were determined in the n-dimensional space by the position determiner 13. The image display controller 15 displays the image data accordingly via the outputter 32. The image display controller 15, for example as illustrated in
The functional configuration of the similar image display apparatus 100 is described above. Details of the similar image display processing performed by the similar image display apparatus 100 are described next with reference to
First, the controller 10 of the similar image display apparatus 100 acquires a query image (step S101). For example, when the user inputs the query image into the similar image display apparatus 100 via the inputter 31 (drags and drops the query image into a prescribed region on the screen, for example), the controller 10 acquires the query image.
Next, the similar image acquirer 11 acquires similar images obtained as a result of a similar image search with respect to the query image (step S102). Specifically, similar images that have a degree of similarity with the query image that are greater than or equal with a prescribed threshold are acquired. At such time, the similar image acquirer 11 acquires the similar image together with the degree of similarity that the similar image has with the query image. Step S102 is also referred to as the similar image acquisition step. The processing of the similar image search may be performed by an external similar image searching device instead of the similar image display apparatus 100. In such a case, the controller 10 transmits the query image acquired in step S101 to the external similar image searching device via the communicator 33 and the similar image acquirer 11 acquires the result of the similar image search performed by the external similar image searching device.
Then, the classifier 14 classifies the similar images acquired by the similar image acquirer 11 into categories set by the category setter 12, based on tag information appended to each similar image (step S103). Step S103 is also referred to as the classification step.
Next, the image display controller 15 places the similar images, which are classified into each of the categories in step S103, in the regions of the categories whose positions were determined by the position determiner 13 and displays these similar images via the outputter 32 (step S104). Specifically, as illustrated in
Also, in step S104, the image display controller 15 displays, in the region of each category, a circle of a size in accordance with the number of similar images that are classified into the particular category. The displaying of these circles makes it easy to intuitively grasp the scale of each category. Also, the greater the degree of similarity is between an image at the center (the similar image having the greatest degree of similarity with the query image for a particular category) and the query image, the thicker the width of the circumferential line of the circle is displayed by image display controller 15. The thickening of the width of the circumferential line of the circle in such a manner makes it easy for the user to intuitively grasp placement location of similar images that are most similar to the query image. Also, thickness of the circumferential line of this circle does not need to be displayed at a prescribed thickness that is in accordance with the degree of similarity between the image at the center of the circle and the query image. The image display controller 15, for example, may display a prescribed thickness of the circumferential line of the circle in accordance with a degree of similarity between a prescribed image and the query image. Here the prescribed image is, for example, an image in a particular category having an n-th (n being an integer that is no less than 1 and no greater than the number of similar images classified in that particular category) greatest degree of similarity with the query image, a lowest degree of similarity, or a middlemost image when arranged in order of degree of similarity. Also, displaying at the prescribed thickness means, for example, that the thickness is displayed thicker the greater the degree of similarity, and displayed thinner the lower the degree of similarity. In order to ensure that the user can easily compare each similar image with the query image, the image display controller 15, in step S104, also performs processing for displaying the query image 300 in the center portion of the display screen as illustrated in
Next, the controller 10 determines whether or not a similar image displayed in step S104 is selected (clicked by the user, for example) via the inputter 31 (step S105). If no similar image is selected (NO in step S105), processing advances to step S108.
If a similar image is selected (YES in step S105), the image display controller 15 displays the image selected in step S105 and the query image in an enlarged manner so that these images can be compared (step S106). For example, in a case in which the image (image that is most similar to the query image among the similar images that are classified into pigmented nevus) at the center of the pigmented nevus in
Then, the image display controller 15 displays an image in accordance with a user operation (Step S107). For example, when a drag operation is performed on the query image 51 or the comparison target image 52, the image display controller 15 moves the image parallel to the dragging direction. When a mouse wheel rotation is performed on the query image 51 or the comparison target image 52, the image display controller 15 enlarges or reduces the size the image of the image. When the query image 51 is doubled-clicked, the image display controller 15 displays the query image display screen illustrated in
Next, the controller 10 determines whether or not an instruction was given to end the similar image display processing (step S108). If no instruction was given to end the similar image display processing (NO in step S108), the controller 10 returns processing to step S107. If an instruction is given to end the similar image display processing (YES in step S108), the similar image display processing is ended. For example, if an instruction to end the similar image display processing is given by the user via the inputter 31, the similar image display processing is ended.
As described above, since the similar image display apparatus 100 can classify images into categories and then place and display the similar images, for each category, in descending order of degree of similarity with the query image, the similar image display apparatus 100 can display the relationship between similar images in a manner that is easier to understand.
For example, in a case in which images of skin diseases are to be displayed, although melanoma, basal cell carcinoma, and solar keratosis are all malignant diseases, the degree of malignancy (the effect on the human body) greatly differs depending on the skin disease. Therefore, the malignancy information (attribute values of attributes) such as “malignancy 10, melanocytic” for melanoma, “malignancy 8, non-melanocytic” for basal cell carcinoma, and “malignancy 3, non-melanocytic” for solar keratosis are stored in the storage 20 as information of attributes for the individual categories, and upon determination by the position determiner 13 of the position of the categories according to malignancy such that, for example, the categories of greater malignancy are displayed as categories in circles towards the top portion of the screen, the user can confirm both the similar images placed in the individual categories and the malignancy of the individual categories. Also by determining the position of other attributes based likewise on attribute values of the other attributes, the user is able to confirm the similar images in accordance with the attribute values of the attribute. These are merely introduced as examples and are not necessarily medically correct examples. A doctor or the like may make changes as appropriate to the display positions in accordance with the way of thinking or circumstances of a user of the similar image display apparatus 100.
In aforementioned Embodiment 1, in the similar image display processing, displaying is performed as indicated in
In the similar image display apparatus 100 of Modified Example 1, the image display controller 15 performs processing as follows in step S104 of the similar image display processing (
In step S104 of the similar image display processing (
In aforementioned Embodiment 1, although the image display controller 15 collectively places the similar image search results into the individual categories in a concentric circular manner, this is not limiting. Alternative examples of placement include: radially, elliptically shaped, square shaped, and the like. For example, when square-shaped placement is employed, as illustrated in
In aforementioned Embodiment 1, an example is described in which the number of attributes used by the position determiner 13 for determining the positions is two types, these two attributes correspond with two axes (X-axis and Y-axis) of a two-dimensional space, and the position determiner 13 determines the coordinates in the two-dimensional space of the display positions for the similar image search results. However, this example is not limiting. For example, the number of attributes used for determining the positions may be one type and the individual categories may be placed on a linear line (one-dimensional space). In such a case, although the categories are placed on a linear line, since the similar images within the category are placed in a concentric circular manner, placement is ultimately performed in a two-dimensional space.
Alternatively, the number of attributes used for determining the positions may be three types and the individual categories may be placed in a three-dimensional space. In such a case, although the categories and similar images are placed in a three-dimensional space, since the outputter 32 outputs these as a projection onto a two-dimensional space, these can be displayed on a conventional display. Also, in a case in which n of the n-types of attributes used for determining positions is greater than or equal to four, the individual categories may be placed in a virtual n-dimensional space, and ultimately projected onto a two-dimensional space. The types of attributes are not limited to the aforementioned “benign/malignant” and “melanocytic/non-melanocytic” and may also include: “endothelial/non-endothelial”, “metastatic/non-metastatic”, “ductal/non-ductal”, “viral/non-viral”, “size (diameter of a circumscribed ellipses of the disease-affected area, for example)”, “ellipticity (ellipticity of the circumscribed ellipses of the disease-affected area, for example)”, “lesion surface area (surface area of the disease-affected area)”, “contour length (contour length of the outer portion of the disease-affected area)”, “depth of tumor (determined by color (black if shallow, and shifting from brown to gray and finally to pale steel color as the tumor depth deepens)”, “color of disease-affected area (arranged on a color-based axis corresponding to depth of tumor”, “use of a value of a shape (for example, a moment (obtained by performing a moment calculation for a coordinates value of a lesion region, a coordinates value of a contour of lesion region, pixel value of a lesion region, and the like)”, “time (for example, through prolonged observation of size, a time variation of measurement values of size, can be viewed, by taking measurement values of size where time is placed on the horizontal axis and size is placed on the vertical axis)”, and the like.
Also, in aforementioned Embodiment 1, although a case is described using skin diseases as an example, the present disclosure is not limited to the field of dermatology. The present disclosure can be widely applied to fields involving the display of similar images. For example, the present disclosure can also be applied to similar searching of images of flowers, similar searching of microscope pictures of bacteria, and the like.
Also, in aforementioned Embodiment 1, although the similar image display processing is performed by the controller 10, the controller 10 may receive, via the communicator 33, a result processed using an external server, and output the received result to the outputter 32.
Also, the aforementioned Embodiment 1 and Modified Examples 1 and 2 may be combined together as appropriate. For example, by combining Modified Example 1 and Modified Example 2 together, the similar images can be displayed as square shapes for each category and the drawing of connection lines and backgrounds of the square shapes can be performed. As such, the benefits of both Modified Example 1 and Modified Example 2 can be attained. For example, in such a case, the width of the connection line to the similar image (similar image that is most similar to the query image for a particular category) in the upper left corner in the square of the individual categories can be thickened in accordance with the degree of similarity between the particular image and the query image, and the background of a particular square shape can be drawn as darkly shaded in the upper left corner and as more lightly shaded to the lower right.
A display control apparatus 101 according to Embodiment 2 of the present disclosure associates each attribute (“benign/malignant” and “melanocytic/non-melanocytic”, for example) of a disease of a diagnosis target area that is shown in a query image with a particular coordinate axis of the coordinate axes, and displays an index representing the possibility that a disease relates to each attribute as a plot in a space having a number of dimensions equal to the number of attributes of a disease. By displaying in this manner, the display control apparatus 101 makes it easy to grasp the attribute information of a disease of a diagnosis target area. In Embodiment 2, although an example is given in which the disease of the diagnosis target area is a skin disease of a person, as the diagnosis target area (disease), there are many other different types of areas (diseases) that can be diagnosed based on captured images, including the uterus of a person (cervical cancer), the oral cavity of a person (oral cancer), skin (skin cancer) of an animal (cat) and oral cavity of an animal (oral cancer), and the like.
The display control apparatus 101 according to Embodiment 2, as illustrated in
The controller 10 includes, for example, a CPU, and executes programs stored in the storage 20 to achieve the functions of individual components (index acquirer 16, risk acquirer 17, and display controller 18), which are described further below.
The storage 20 includes the ROM, the RAM, and the like, and stores programs to be executed by the CPU of the controller 10 and necessary data.
The inputter 31 is a device used by a user of the similar image display apparatus 101 to input instructions directed at the similar image display apparatus 101 and input query images. Examples of the inputter 31 include a keyboard, a mouse, a touch panel, a camera, and the like. The controller 10 acquires instructions and query images from the user via the inputter 31. Any device can be used as the inputter 31 as long as the controller 10 can acquire instructions or query images from the user. Moreover, the controller 10 may acquire query images via the communicator 33. The term query image refers to image data of images taken of a diagnosis target area by use of a dermatoscope, for example. The display control apparatus 101 presents, in a manner that is easy to understand by the user, attribute information of a disease of the diagnosis target area that is shown in the query image.
The outputter 32 is a device (a display, interface for the display, or the like) used by the controller 10 to present attribute information of a disease to the user in an easy to understand manner. The display control apparatus 101 may include the outputter 32 as a display, and may display the attribute information or the like on an external display connected via the outputter 32.
The communicator 33 is a device (network interface, for example) for transmitting and receiving data to and from another external device (server storing a database of image data or an image identification device). The controller 10 can acquire image identification results and the like by the image identification device via the communicator 33.
Next, the function of the controller 10 is described. The controller 10 achieves the functions of an index acquirer 16, a risk acquirer 17, and a display controller 18).
The index acquirer 16 uses an identifier to obtain a probability (possibility) of a disease of a diagnosis target area shown in a query image being related to a particular attribute of attributes, and acquires the obtained probability as an index of the particular attribute. This identifier includes, for example, a convolutional neural network, and is trained by use of prescribed image data that is for training in advance. The index acquirer 16 may include such kind of an identifier that is already trained and may cause an external image identification device that includes an image identifier that is already trained to identify a query image, via the communicator 33, and then, the index acquirer 16 may acquire a probability (possibility) of a disease of a diagnosis target area relating to a particular attribute of attributes attained from the identification result as the index of the particular attribute. The index acquired by the index acquirer 16 is not limited to probability. The index acquirer 16 may acquire a more conventional score (conceivably, a score (not necessarily equal to the probability value) being greater in value the greater the possibility, or conversely, a score being greater in value the lower the possibility) as the index.
Here, it is assumed that the index acquirer 16 includes a disease identifier that outputs individual probabilities (hereinafter referred to as “disease-applicable probabilities”) of the disease of the diagnosis target area shown in the query image being one of four particular diseases (melanoma, basal cell carcinoma, pigmented nevus, and seborrheic keratosis). Also, the disease-applicable probabilities obtained by inputting the query image into this disease identifier are assumed to be, for example, 89.0% for melanoma, 4.4% for basal cell carcinoma, 6.4% for pigmented nevus, and 0.2% for seborrheic keratosis. The attributes of these diseases are: “benign/non-melanocytic” for pigmented nevus, “malignant/melanocytic” for melanoma, “benign/non-melanocytic” for seborrheic keratosis, and “malignant/non-melanocytic” for basal cell carcinoma.
In this example, the probability of the attribute of the disease of the diagnosis target area being “malignant” is calculated as 89.0%+4.4%=93.4% and the probability of the attribute of the disease of the diagnosis target area being “benign” is calculated as 6.4%+0.2%=6.6%. Also, the probability of the attribute of the disease of the diagnosis target area being “melanocytic” is calculated as 89.0%+6.4%=95.4% and the probability of the attribute of the disease of the diagnosis target area being “non-melanocytic” is calculated as 4.4%+0.2%=4.6%. The index acquirer 16 acquires the individual probabilities of the attribute of the disease of the diagnosis target area being one of the particular attributes calculated in the aforementioned manner, as indexes representing the individual possibilities of the attribute of the disease of the target area being one of the particular attributes. In particular, the probability of the attribute of the disease of the diagnosis target area being “malignant” and the probability of the attribute of the disease of the diagnosis target area being “benign” are also respectively referred to as the malignant index and the benign index. Likewise, the probability of the attribute of the disease of the diagnosis target area being a prescribed disease attribute such as “melanocytic” or “non-melanocytic” is also referred to as the disease attribute index. Also, in a case in which multiple disease attributes are to be referred to in a distinguishable manner, first, second, and the like are appended to the attribute. For example, among the attributes of the disease of the diagnosis target area, “melanocytic” is the first disease attribute and “non-melanocytic” is the second disease attribute, the probability of “melanocytic” attribute of the disease of the diagnosis target area is referred to as the first disease attribute index, whereas probability of the attribute “non-melanocytic” of the disease of the diagnosis target area is referred to as the second disease attribute index.
The index acquirer 16 does not necessarily use the disease identifier that acquires the disease-applicable probabilities of the diagnosis target area. The index acquirer 16, for example, in place of the disease identifier, alternatively may use an identifier that outputs the probability (malignant index) of the disease of the diagnosis target area being “malignant” or may use an identifier that outputs the probability (disease attribute index) of the attribute of the disease of the diagnosis target area being a prescribed disease attribute such as “melanocytic”.
The risk acquirer 17 acquires a risk index indicating whether or not the risk of the disease is high in a case in which the attribute of the disease is malignant and the attribute of the disease is a prescribed disease attribute. Here, although it is conceivable that there are, as risks, overlook risks (risk of erroneous determination being (no malignant detection) made by the identifier or prognostic risks (neglected risks), the risk acquirer 17 may distinguish between these risks and handle them as separate risk indexes or may handle these values comprehensively as a single risk index. For example, the controller 10 obtains a risk index of overlook risk by, for example, using image data (trial disease case data) other than the training data used for training the disease identifier and/or obtains a risk index of prognostic risk by, for example, using data regarding prognostic risk from a specialist and the like and stores in advance the risk indexes into the storage 20. Also, a risk index may be obtained in advance by using, for example, an external server. The acquirer 17 acquires the risk index obtained in advance by the controller 10 or the external server, for example. In the present embodiment, this risk index is an index indicating the extent of overlook risk of the particular disease when the attribute of the disease is malignant, based on the malignant index of the particular disease, and is pre-generated by risk boundary line generation processing which is described further below.
For example, identifying a melanocytic malignant disease is more difficult than identifying a non-melanocytic malignant disease, and thus the overlook risk for the melanocytic malignant disease is greater, even though the probability of “malignant” (malignant index) is the same for both. In the present embodiment, since a malignant index that is greater than the risk index indicates that overlook risk is high, the risk index for the disease attribute “melanocytic” is lower in value than the risk index for the disease attribute “non-melanocytic”. Therefore, when the disease attribute is “melanocytic”, a risk index of a value that is lower than that of when the disease attribute is “non-melanocytic” is acquired by the risk acquirer 17.
Through the display control processing which is described further below, the display controller 18 causes the display to display multiple indexes, which are acquired by the index acquirer 16, in association with each other. For example, regarding the diagnosis target area shown in the query image, when the index acquirer 16 acquires 93.4% as the index of “malignant” and acquires the 95.4% as the index of “melanocytic”, the display controller 18 causes the displays to display a point 206 as the score corresponding to (95.4% and 93.4%) as illustrated in
In
Also, the display controller 18 causes the risk index acquired by the risk acquirer 17 and the indexes acquired by the index acquirer 16 to be displayed in association with one another on the display. As an example of this display, the display controller 18 displays, as a risk boundary line 207 indicated by the dotted line in
The functional configuration of the display control apparatus 101 is described above. Details of the display control processing performed by the display control apparatus 101 are described with reference to
First, the display controller 18 displays the coordinate axes onto the display (step S201). The coordinate axes that are displayed here are coordinate axes that are based on attributes instructed in advance by the user. For example, in the example illustrated in
Next, the controller 10 of the display controller 101 acquires the query image (step S202). For example, when the user inputs (drags and drops query image into prescribed region of screen of display, for example) the query image into the display control apparatus 101 via the inputter 31, the controller 10 acquires the query image.
Next, the index acquirer 16 inputs the query image into the identifier and acquires the individual attributes (step S203). Step S203 is also referred to as the acquisition step. Then, the display controller 18 displays, on the coordinate axes displayed on the display, the point 206 at the coordinates represented by the index acquired by the index acquirer 16 (step S204). Step S204 is also referred to as the display control step.
Next, the display controller 18 displays, on the display, the risk boundary line 207, stored in the storage 20, having been generated in advance during the risk boundary line generation processing which is described further below (step S205). The display control processing ends upon completion of step S205.
Next, the risk boundary line generation processing is described with reference to
First, the controller 10 acquires trial disease case data (not yet used for training the disease identifier) from the storage 20 or via the communicator 33 (step S301). Next, the index acquirer 16 inputs the trial disease case image data into the disease identifier and acquirers the attribute indexes corresponding to the individual coordinate axes (step S302). In the example illustrated in
Next, regarding the indexes acquired in step S302, the controller 10 classifies the malignant index into the individual segments of the disease attribute index (step S303). Here, for example, if the values of the disease attribute index are from 0% to 100% and the individual segments have a width equal to 10%, then the individual segments of the disease attribute index are ten in number with the disease attribute index values from 0% to below 10% being in segment 1, the disease attribute index values from 10% to below 20% being in segment 2, . . . , and the disease attribute index values from 90% to 100% being in segment 10. For example, if the indexes acquired in step S302 are malignant index 35% and disease attribute index 55%, the controller 10 classifies the malignant index 35% into segment 6.
Next, the controller 10 determines whether or not the malignant indexes classified in step S303 are classified into every segment (all of the segments from segment 1 to segment 10 in the aforementioned example), with every segment having no less than a prescribed number (20, for example) (step S304). If there are any segments with a number of classified malignant indexes less than the prescribed number (NO in step S304), processing returns to step S301 where index classification is repeated using new trial disease case data.
If every segment has a number of classified malignant indexes that is greater than the prescribed number (YES in step S304), the controller 10 calculates, for every segment, a malignant determination threshold of a malignant index at which the sensitivity of the malignant disease is a prescribed sensitivity (95% for example) (in the case where the sensitivity is 95%, for example, a threshold at which 95% are determined as being malignant diseases once a certain number of test disease cases for a malignant disease is identified) (step S305). The lower this threshold is, the easier it is to determine that the attribute of the disease is malignant, and thus sensitivity increases and specificity (accuracy percentage of benign disease cases) decreases.
Then, the controller 10 sets a line, such as a spline curve, linking the malignant determination thresholds of the individual segments together as the risk boundary line and saves the coordinates of the risk boundary line into the storage 20 (step S306). The risk boundary line generation processing ends upon completion of this step. When the points displayed in step S204 of the display control processing (
The aforementioned risk boundary line generation processing merely represents a single example. The following modified examples are also conceivable.
As described above, the display control apparatus 101, in response to an input query image, can display attribute information of the diagnosis target area shown in the query image in a manner that is easy to understand by use of the coordinates of the point 206 as illustrated in
Similar to that in the similar image display apparatus according to Embodiment 1, in the display control apparatus 101 according to Embodiment 2, the following attributes: “endothelial/non-endothelial”, “metastatic/non-metastatic”, “ductal/non-ductal”, “viral/non-viral”, “size of the disease-affected area”, “color of the disease-affected area”, “time (for example, through prolonged observation of size, a time variation of measurement values of size, can be viewed, for example, by taking measurement values of, for example, size where time is placed on the horizontal axis and size is placed on the vertical axis)”, and the like may be used in place of at least one of “benign/malignant” or “melanocytic/non-melanocytic”. Among these attributes, since melanocytic is considered to be the attribute with the highest prognostic risk, in the example illustrated in
In aforementioned Embodiment 2, “benign/malignant” is assigned to the vertical axis and “melanocytic/non-melanocytic” is assigned to the horizontal axis, both as attributes, and the point 206 is displayed on the two-dimensional space. However, the attributes used may be three types and the point 206 may be placed in a three-dimensional space. In such a case, this projection onto a two-dimensional space may be outputted to the outputter 32. Also, in a case in which n of the n-types of attributes used is greater than or equal to four, the point 206 may be placed in a virtual n-dimensional space, and ultimately projected onto a two-dimensional space and outputted to the outputter 32.
A display control apparatus 102 according to Embodiment 3 of the present disclosure displays an attribute of a disease of a diagnosis target area that is shown in a query image together with a probability of the disease of the diagnosis target area being a prescribed disease by using a tree structure including the query image as the root node. By displaying in this manner, the display control apparatus 102 makes it easier to grasp the attribute information of the disease of the diagnosis target area.
The display control apparatus 102 according to Embodiment 3, as illustrated in
The controller 10 includes, for example, a CPU, and executes programs stored in the storage 20 to achieve the functions of individual components (index acquirer 16, position determiner 13, disease risk acquirer 19, and display controller 18), which are described further below.
The index acquirer 16 uses an identifier that identifies a disease among a prescribed number of diseases to obtain a probability (possibility) of a disease of a diagnosis target area shown in a query image being related to a particular attribute of attributes, and acquires the obtained probability as an index of the particular attribute. This identifier includes, for example, a convolutional neural network, and is trained by use of image data that is for training and is prescribed in advance. The index acquirer 16 may include such kind of an identifier that is already trained and may cause an external image identification device that includes an image identifier that is already trained to identify a query image, via the communicator 33, and then the index acquirer 16 may acquire a probability (possibility) of a disease of a diagnosis target area relating to a particular attribute of attributes attained from the identification result as the index of the particular attribute.
Here, the index acquirer 16, as described in the description of the index acquirer 16 according to Embodiment 2, includes a disease identifier that outputs disease-applicable probabilities regarding four diseases (melanoma, basal cell carcinoma, pigmented nevus, and seborrheic keratosis). Also, the disease-applicable probabilities obtained by inputting the query image into this disease identifier are assumed to be, for example, 89.0% for melanoma, 4.4% for basal cell carcinoma, 6.4% for pigmented nevus, and 0.2% for seborrheic keratosis.
In this example, as described in the description of the index acquirer 16 according to Embodiment 2, regarding the indexes representing the individual possibilities of the attribute of the disease of the diagnosis target area being one of the particular attributes, the malignant index is 93.4%, the benign index is 6.6%, the disease attribute index for “melanocytic” is 95.4%, and the disease attribute index for “non-melanocytic” is 4.6%. Also, the index acquirer 16 according to Embodiment 3 also acquires, as the disease indexes, the probabilities that are outputted by the disease identifier. In this example, the disease index for melanoma is 89.0%, the disease index for basal cell carcinoma is 4.4%, the disease index for pigmented nevus is 6.4%, and the disease index for seborrheic keratosis is 0.2%.
The position determiner 13 determines positions where information regarding the individual diseases (categories) for the number of disease indexes that are acquired by the index acquirer 16 is to be displayed, as coordinates in an n-dimensional space based on n-types of attributes (n being an integer greater than or equal to one). More specifically, each attribute of n-types of attributes is associated in one-to-one correspondence with a coordinate axis of n-axes defining the coordinates of the n-dimensional space, and the coordinates indicating the positions where information regarding the individual diseases is to be displayed are determined based on indexes representing the possibilities that the individual diseases relate to a particular attribute corresponding to a particular coordinate axis of the coordinate axes.
For example, in a case in which the following two types of attributes: “benign/malignant” and “melanocytic/non-melanocytic” are utilized as the aforementioned attributes of n-type of attributes, the position determiner 13 determines the coordinates in a two-dimensional space where the information regarding the individual disease corresponding to the disease indexes acquired by the index acquirer 16 is to be displayed. The position determiner 13, for example as illustrated in
If, as a specific example, the following four disease names: pigmented nevus, melanoma, seborrheic keratosis, and basal cell carcinoma are considered, the attributes for the diseases are as follows: “benign, melanocytic” for pigmented nevus, “malignant, melanocytic” for melanoma, “benign, non-melanocytic” for seborrheic keratosis, and “malignant, non-melanocytic” for basal cell carcinoma. Therefore, the position determiner 13, as illustrated in
The position determiner 13 may adjust the display positions of the information regarding the diseases as necessary so that the positions where information regarding different diseases is displayed each have different coordinates. Although not displayed in
Information of the n-types of attributes that is used for determining the coordinate axes in a space, information of attributes of the respective diseases, and placement information for the individual attributes, for the position determiner 13 to determine the display positions of information regarding the individual diseases, is stored in advance in the storage 20. The position determiner 13 determines the coordinates in the n-dimensional space of positions where information regarding the individual diseases is to be displayed, based on the information of the n-types of attributes, information of the attributes of the respective diseases, and the placement information for the individual attributes that are stored in the storage 20. In the example illustrated in
For each disease, a disease risk acquirer 19 acquires a risk index indicating whether the risk for that particular disease is high or not. Here, although the risk of disease includes prognostic risk (neglected risk in a case when a disease is neglected) or overlook risk (erroneous determination risk where the disease identifier makes a determination that a malignant disease is not a malignant diseases), the disease risk acquirer 19 may distinguish between these risks and handle them as separate risk indexes or may handle these values comprehensively as a single risk index. For example, for melanoma there is greater prognostic risk and overlook risk than that of basal cell carcinoma. Therefore, the disease risk acquirer 19 may acquire, for example, 10% as a risk index of melanoma and 80% as a risk index of basal cell carcinoma. This is an example in which if the disease of the diagnosis target area is melanoma, the risk is high even though the probability (disease index) is 10%, whereas if the disease of the diagnosis target area is basal cell carcinoma, unless the probability (disease index) is greater than or equal to 80%, the risk is not regarded as high, for example. The values of the risk indexes for these individual diseases may be values that are set in advance by a doctor or the like on a per-disease basis. Similar to the processing in the risk boundary line generation processing (
Through the display control processing which is described further below, the display controller 18 causes the display to display multiple indexes, which are acquired by the index acquirer 16, in association with one another, as a tree structure, as illustrated in
The functional configuration of the display control apparatus 102 is described above. Details of the display control processing performed by the display control apparatus 102 are described next with reference to
First, the controller 10 of the display control apparatus 102 acquires a query image (step S401). For example, when the user inputs the query image into the display control apparatus 102 via the inputter 31 (drags and drops the query image into a prescribed region on the screen, for example), the controller 10 acquires the query image.
Next, the display controller 18, as illustrated in
Next, the index acquirer 16 inputs the query image into the disease identifier and acquires the disease indexes of the individual diseases (step S403). Then, the display controller 18, as illustrated in
Then, the display controller 18 displays risk circles indicating the size of the risk indexes of the individual disease acquired by the disease risk acquirer 19 are displayed at positions such that the middle of the risk circles coincide with the middle of the corresponding probability circles of the individual diseases (step S405). For example, in
In the example illustrated in
Next, the display controller 18 displays, as illustrated in
In the displaying of the tree structure in step S406, if the malignant index is larger than the benign index based on the indexes acquired by the index acquirer 16, the display controller 18 displays a malignant node 432 more largely than a benign node 431, as illustrated in
Although not illustrated in
Also, in
As described above, the display control apparatus 102, in response to the inputted query image, can make it easy to grasp the attribute information of the disease of the diagnosis target area, by indicating the probability of the disease of the diagnosis target area shown in the query image being a prescribed disease by adjusting the sizes of the probability circles, and by displaying the probability circles in a tree structure based on the attributes of the individual diseases, as illustrated in
Similar to that in the aforementioned embodiments, in the display control apparatus 102 according to Embodiment 3, the following attributes: “endothelial/non-endothelial”, “metastatic/non-metastatic”, “ductal/non-ductal”, “viral/non-viral”, “size of the disease-affected area”, “color of the disease-affected area”, “time (for example, through prolonged observation of size, a time variation of measurement values of size, can be viewed, for example, by taking measurement values of, for example, size where time is placed on the horizontal axis and size is placed on the vertical axis)”, and the like may be used in place of at least one of “benign/malignant” or “melanocytic/non-melanocytic”. Among these attributes, since melanocytic is considered to be the attribute with the highest prognostic risk, in the example illustrated in
Also, instead of displaying of the malignant node 432, the benign node 431, melanocytic nodes 433, 434, the non-melanocytic nodes 435, 436, and the tree structure including, for example, the connections lines 421, 422, 423, 424 that respectively extend from the these individual nodes to the probability circles 411, 412, 413, 414 of the disease corresponding to the individual attributes, the display controller 18 may alternatively display (i) only the probability circles 411, 412, 413, 414, (ii) only the probability circles 411, 412, 413, 414 together with the risk circles 415, 416, 417, 418, or (iii) only these circles together with a portion of the nodes and connections lines that make up the tree structure.
Also, in aforementioned Embodiment 3, “benign/malignant” is assigned to the vertical axis and “melanocytic/non-melanocytic” is assigned to the horizontal axis, both as attributes, and the tree structure is displayed on a two-dimensional space. However, the attributes used may be three types and the tree structure may be placed in a three-dimensional space. In such a case, this projection onto a two-dimensional space may be outputted to the outputter 32. Also, in a case in which n of the n-types of attributes used is greater than or equal to four, the tree structure may be placed in a virtual n-dimensional space, and ultimately projected onto a two-dimensional space and outputted to the outputter 32.
A display control apparatus 103 according to Embodiment 4 of the present disclosure displays images that are similar to the query image on the periphery of the individual probability circles in addition to displaying the tree structure of the display control apparatus 102 according to Embodiment 3. By performing the displaying in such a manner, the display control apparatus 103 makes it easy to grasp the attribute information of the disease of the diagnosis target area and display the relationship between the similar images in a manner that is easier to understand.
The display control apparatus 103 according to Embodiment 4, as illustrated in
The controller 10 includes, for example, a CPU, and executes programs stored in the storage 20 to achieve the functions of individual components (index acquirer 16, position determiner 13, disease risk acquirer 19, similar image acquirer 11, classifier 14, and display controller 18), which are described further below.
The index acquirer 16, the position determiner 13, and the disease risk acquirer 19 are similar to the index acquirer 16, the position determiner 13, and the disease risk acquirer 19 included in the display control apparatus 102 according to Embodiment 3, and thus descriptions for these similar components are omitted.
The similar image acquirer 11, similar to the similar image acquirer according to Embodiment 1, acquires data (image data of similar images and a degree of similarity between the images and the image query) obtained as a result of the similar image search with respect to the query image. Specifically, the similar image acquirer 11 acquires data of images that have a degree of similarity that is greater than or equal to a prescribed threshold in the similar image search and also acquires the degree of similarity. The similar image acquirer 11 may acquire data of similar images obtained as a result of the search by the controller 10 for images that are similar to the query image, and for example, may cause an external similar image searching device to search, via the communicator 33, for images that are similar to the query image, and may also acquire data of the similar images searched by the similar image searching device. Also, the image data is appended with their own corresponding information such as the disease names associated in one-to-one correspondence to the images as tag information.
The classifier 14 classifies image data acquired by the similar image acquirer 11 into a disease identified by a disease identifier that is used by the index acquirer 16. The classifier 14 can classify image data to any disease by use of tag information that is appended to the image data (for example, the disease name is appended as tag information to each image data).
Through the display control processing that is described further below, the display controller 18 performs processing to display data of the similar images acquired by the similar image acquirer 11 on the periphery of the probability circles corresponding to the diseases classified by the classifier 14 as illustrated in
The functional configuration of the display control apparatus 103 is described above. Details of the display control processing performed by the display control apparatus 103 are described next with reference to
When the tree structure is displayed in the processing performed up to step S406, the similar image acquirer 11 next acquires similar images obtained as a result of the similar image search with respect to the query image (step S407). Specifically, similar images that have a degree of similarity with the query image that are greater than or equal with a prescribed threshold are acquired. As such time, the similar image acquirer 11 acquires the similar image together with the degree of similarity the similar image has with the query image.
Then, the classifier 14 classifies, based on the tag information (disease name) appended to the individual similar images, the similar images acquired by the similar image acquirer 11 into the diseases identified by disease identifier that is used by the index acquirer 16 (step S408).
Then, the display controller 18 places and displays the similar images acquired in step S407 by the similar image acquirer 11 on the periphery of the probability circles (or within the probability circles), corresponding to the disease classified in step S408 by the classifier 14 (step S409), on the display. The display control processing ends upon completion of step S409.
Regarding the displaying of the similar images by the display controller 18 in step S409, as illustrated in
Also, although the individual similar images are displayed as being surrounded by a small circle, the width of the line of the small circle gets, thicker the greater the degree of similarity between the particular similar image and the query image. For example, in the example illustrated in
As described above, the display control apparatus 103, in response to the inputted query image, can make it easy to grasp the attribute information of the disease of the diagnosis target area, by indicating the probability of the disease of the diagnosis target area shown in the query image being a prescribed disease by adjusting the sizes of the probability circles, and by displaying the probability circles using a tree structure based on the attributes of the individual diseases, as illustrated in
Similar to that in the display control apparatus 102 according to Embodiment 3, in the display control apparatus 102 according to Embodiment 4, various attributes can be used. Also, the display controller 18 may alternatively display only a portion of the probability circles 411, 412, 413, 414, the risk circles 415, 416, 417, 418, the query image 400, similar images, the nodes and connections lines that make up the tree structure. Also, n of the n-types of attributes for the tree structure is not limited to two (tree structure in a two-dimensional space). The tree structure may be placed on an n-dimensional space and then ultimately outputted to the outputter 32 as a projection on a two-dimensional space.
Also, although the aforementioned Embodiments 2, 3, and 4 use skin disease as an example, the present disclosure is not limited to the field of dermatology. The present disclosure can be widely applied to fields involving the identification of images with use of an identifier. For example, the present disclosure can also be applied to the identification of types of flowers by using images of flowers, and the identification of bacteria by using microscope pictures of bacteria. Also, any approach may be used to achieve these identifiers. For example, a deep neural network (DNN) such as a convolutional neural network (CNN) may be used to achieve these identifiers or alternatively a support vector machine (SVM), logistic regression, or the like may be used to achieve the identifiers.
Also, although the controller 10 performed the display control processing in the aforementioned Embodiments 2, 3, and 4, the controller 10 may receive, via the communicator 33, a result from causing an external server to perform processing equivalent to the display control processing and output the result to the outputter 32.
Also, the aforementioned embodiments and modified examples may be combined together as appropriate. Although Embodiment 4 can be regarded as an embodiment in which a portion of Embodiment 1 is combined together with Embodiment 3, conversely, a portion of Embodiment 3 may be combined together with Embodiment 1. In such a case, the individual category circles illustrated in
The individual functions of the similar image display apparatus 100 and the display control apparatuses 101, 102, 103 may also be executed by a computer such an ordinary personal computer (PC). Specifically, in the aforementioned embodiments, the program for the similar image display processing that is performed by the similar image display apparatus 100 and the program for the display control processing that is performed by the display control apparatuses 101, 102, 103 are described as being stored in advance in the ROM of the storage 20. However, the program may be stored in, and distributed through, a non-transitory computer-readable recording medium such as a flexible disk, a compact disc read-only memory (CD-ROM), a digital versatile disc (DVD), a magneto-optical disc (MO), a memory stick, or a universal serial bus (USB), and may be installed into a computer to enable the computer to achieve the above-described individual functions.
The foregoing describes some example embodiments for explanatory purposes. Although the foregoing discussion has presented specific embodiments, persons skilled in the art will recognize that changes may be made in form and detail without departing from the broader spirit and scope of the invention. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense. This detailed description, therefore, is not to be taken in a limiting sense, and the scope of the invention is defined only by the included claims, along with the full range of equivalents to which such claims are entitled.
Number | Date | Country | Kind |
---|---|---|---|
2018-158048 | Aug 2018 | JP | national |
2019-122644 | Jul 2019 | JP | national |