The present embodiments relate to radiology reports. Medical reports, such as a radiology report, are primarily a written communication between radiologists, medical professionals, and patients. The reports often contain complex anatomical, technical, and medical information. A comprehensive explanation and deep understanding of the contents of the report has significant value in disease diagnosis, prognosis, and treatment. However, radiology reports can be hard to read, especially for those without a medical background. Patients with limited medical knowledge may benefit from simpler explanation of the report, but such simple explanation may require time and communications with a physician. Medical professionals may not comprehensively understand the report drafted by someone with a different medical expertise or may not desire to again review the entire report to efficiently extract desired information.
By way of introduction, the preferred embodiments described below include methods, systems, instructions, and computer readable media for obtaining information from a radiology report based on machine learning. Rather than a natural language processing system designed to answer a question based on a large corpus of generic information, deep learning is used to train a machine-learnt network to contribute to extraction of a patient-specific answer from a patient-specific radiology report. Natural language questions, including questions with less informed terminology from a patient and questions with more informed terminology from a physician, are answered using evidence from the patient-specific radiology report even where that report and/or questions are not used in training the machine-learnt network.
In a first aspect, a method is provided for obtaining information from a radiology report based on machine learning. A question about a patient is received from interface hardware. The question and a radiology report from medical imaging of the patient are input into a natural language processing system. The natural language processing system includes a deep machine-learnt network having been trained to contribute to extraction of an answer from patient-specific input. An answer to the question is determined in response to the inputting and using the deep machine-learnt network. The answer is specific to the radiology report for the patient and is output.
In a second aspect, a system is provided for obtaining information from a radiology report based on machine learning. An interface is configured to receive a question about a radiology report of a patient. A medical records database has stored therein the radiology report of the patient. A processor is configured to analyze the question for information, identify parts of the radiology report, retrieve evidence related to the question from the parts, and generate an answer to the question based on the evidence. The configuration for at least one of the analyze, identify, retrieve, or generate includes a deep training-based machine-learnt network. A display is configured to output the answer.
In a third aspect, a method is provided for obtaining information from a radiology report based on machine learning. Evidence is retrieved in response to a natural language question about a patient from a radiology report for the patient by, at least in part, a natural language processing deep-learnt network. An answer to the natural language question is generated from the evidence retrieved from the radiology report for the patient.
The present invention is defined by the following claims, and nothing in this section should be taken as a limitation on those claims. Further aspects and advantages of the invention are discussed below in conjunction with the preferred embodiments and may be later claimed independently or in combination.
The components and the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention. Moreover, in the figures, like reference numerals designate corresponding parts throughout the different views.
Question answering (QA) systems generally use a combination of techniques from computational linguistics, information retrieval, and knowledge representation for finding answers. QA systems have been studied and have shown success in various research areas. For example, the IBM Watson QA system defeated two Jeopardy champions. The EAGLi QA system is a search engine for MEDLINE with terminology-powered navigation and knowledge extraction skills. The Facebook research DrQA system is a reading comprehension system for Wikipedia to answer open-domain questions. START is a web-based QA system to answer various questions about geography, arts, history, and other generic subjects. Most existing QA systems are at a research level and use a large database of generic information.
For a patient's radiology report, a QA system answers questions specific to that single source. Artificial intelligence-based querying of knowledge from radiology reports may assist patients and medical professionals in understanding or efficiently using a radiology report for a specific patient. The patient-specific QA system is based on natural language processing (NLP) and deep learning techniques. Given a natural language question together with a document (i.e., radiology report), the QA system can understand the question and document, automatically retrieve the corresponding information as the evidence, and generate the answer.
Question-answering-based interaction automatically answers questions posed by humans in natural language. Various audiences are supplied with just the right information specific to a radiology report of a given patient. Clinical language is not easy to understand for most patients. The QA system provides an easy way to convey the radiology report to the patient. The patient may use their own vocabulary to ask any questions, which the QA system answers based on the patient's radiology report. The radiology report QA system may pull answers from an unstructured document (i.e., free-form report). The QA system is a flexible analysis tool for users with various purposes, such as medical professionals, financial staff, insurance companies, and/or patients.
For analysis over multiple patients (e.g., physician and/or hospital performance study), the same question or questions are asked for many radiology reports. Key information is automatically retrieved from the large volume of data. The key information may be used to generate labels at various anatomical sites for corresponding images for the radiology reports. Compared with manual annotation, this auto-annotation mechanism may improve the efficiency in development of big image data analysis. The use of machine-learnt networks allows the computer to more quickly provide an answer over a broader range of questions and vocabulary. The arrangement of such networks and sequence allows a machine to more accurately provide an answer specific to a patient's radiology report.
The QA system is not limited to radiology reports. Other patient-specific text data, such as a medical record, family history, lab test report, and so on, may be used for answering a question.
The method is implemented by the system of
Additional, different, or fewer acts may be provided. For example, the question is generated rather than received in act 10, such as where a computer generates a series of questions for analysis. In another example, acts 15-19 represent one example for determining an answer to a question about a given radiology report. Less than all and/or additional acts may be used to determine the answer.
Acts 15-18 are shown as using deep-learnt networks. Different deep learnt networks or classifiers are used for each act, but a common network or a network formed from other networks may be used for implementing two or more acts. In other embodiments, a deep-learnt network is not used for one or more of acts 15-18 and/or is used for act 19.
The acts are performed in the order shown (e.g., top to bottom or numerical) or other orders. For example, acts 15 and 16 are performed in a reverse order or simultaneously.
In act 10, interface hardware receives a question about a patient. The interface hardware is a communications interface, such as a user input device, port, bus, or memory. The question is received by the interface hardware, such as receiving a question typed on a keyboard at a universal serial bus (USB) connector of a computer. A microphone may be used to receive the question.
The question is input as text or converted to text. The question is a natural language question, such as using phrasing, abbreviations, terms, and/or structure in a common language, such as English. The question is not a pre-prepared question, such as the user selecting a question from a list (e.g., from a table or drop-down menu). In alternative embodiments, the question is selected from a list.
The question is patient-specific. Rather than being generic to many patients, the question is about a particular patient. The question is directed to or for a given patient. The same question may be asked for other patients. For example, the question is whether there is an indication of stenosis of a right coronary artery for the patient. In another example, the question is “what is the level of calcification of plaque?” In other embodiments, the question is not about a specific patient, but instead is a question that is asked for many patients.
The question is received from the patient. The phrasing of the question may be different due to being from a patient, such as whether there is an indication of a heart problem rather than whether there is an indication of stenosis of a right coronary artery. Any vocabulary and phrasing may be used. Rather than having to contact a physician, the patient or a relative may ask a question, and the QA system provides an answer based on the patient's radiology report.
In other embodiments, the question is received from a treating physician. A surgeon, anesthesiologist, primary care physician (internal medicine), nurse, or other medical professional asks the question. The phrasing may be appropriate for the physician or medical professional. Since the medical professional may have different knowledge than the radiologist that prepared the report and/or to avoid having to review the entire report again to find an answer, the QA system answers the question from the medical professional based on the report for the patient being treated.
In another embodiment, the questions are posed by an algorithm. A computer generates one or more questions. Computer-generated questions may be used for machine training or for analysis of answers. For example, data analytics are performed for radiology reports for a physician and/or hospital. Key or embedded knowledge is automatically retrieved in the large volume of radiology reports. The question is loaded from memory or otherwise generated, and the answers are used for radiology research and/or to improve practices in reporting.
In act 12, the interface, memory, and/or processor inputs the question and a radiology report into the QA system. The patient-specific question and the radiology report for that patient are input to the natural language processing system for an answer.
The natural language processing system includes a deep machine-learnt network. Other natural language processing tools may also be included. Computational linguistics (e.g., linguistic rules), information retrieval, and/or knowledge representation may be part of the natural language processing system.
The deep machine-learnt network contributes to any part of the natural language processing system. The network contributes to extraction of an answer from the patient-specific input (e.g., the radiology report for the patient and/or question about the patient). Any contribution may be provided, such as for analyzing the question, parsing the report, extracting candidate phrases or sentences from the report, retrieving evidence from the report and/or extracted information, and/or generating an answer from retrieved evidence. One network may be trained to perform all these acts. A separate network may be trained for each of the acts. Some acts may not use any machine-learnt network.
The deep machine-learnt network is any now known or later developed network. In general, deep learning uses a neural network with raw data or data without conversion to other features as input and a ground truth. The relationship of the values of the input raw data to the ground truth is learned. For example, deep learning (e.g., deep structured learning, hierarchical learning, or deep machine learning) models high-level abstractions in data by using multiple processing layers with structures composed of multiple non-linear transformations, where the input data features are not engineered explicitly. Deep learning may be used without including specific linguistic rules. A deep neural network processes the input via multiple layers of feature extraction to produce features used to output. The deep learning provides the features used to generate the output. In alternative embodiments, other machine learning is used, such as a Bayesian network or support vector machine.
For natural language processing, the deep machine-learnt network may be a recursive neural network and/or a long short-term memory network. Any architecture may be used. Other deep learnt, sparse auto-encoding models may be trained and applied. The machine training is unsupervised in learning the features to use and how to classify given an input sample (i.e., feature vector). The combination of information in the raw data indicative of the ground truth is learned, and an output given input combination of information is learned.
For training, the training data includes many samples. The samples are based on the contribution. The samples include appropriate input information and corresponding ground truth or output to be learned. For example, many different questions in the form of ASCII text are provided as samples of the raw data to be input. The ground truth is the question information, such as content (e.g., category of subject matter, object, or subject), type of question (Yes/No, how, what, where . . . ), anatomy, and/or anatomy location. The deep learning learns to provide the question information from an input question. In another example, many samples of radiology reports are used with the content of the reports in a structural template as the ground truth. The deep learning learns to provide the structural template populated with the content of the reports. As another example, many samples of question information and report sentences are provided with particular ones of the report sentences having content relative to the question information as ground truth. As yet another example, many samples of question information are provided as raw data inputs and corresponding text from medical reports or templates matching the question information are provided as the ground truth. The deep learning learns to provide key knowledge (e.g., evidence) from the text. In yet another example, many samples of extracted evidence and corresponding question information are provided with a correct answer as the ground truth. As a final example, many samples of questions and radiology reports are provided with the correct answer as the ground truth. The network is trained to provide the answer from the question and report without intervening processing. Other inputs and ground truths (i.e., outputs) may be used.
The trained network is stored in a memory. The trained artificial intelligence (i.e., machine-learnt network) is stored. The result of the training is a matrix or other model. The matrix represents the learned knowledge through machine training using deep learning. Other machine-learnt network representations may be used, such as a hierarchy of matrices or other non-linear models.
Any memory may be used. The memory used for the training data may be used. For application, the memory may be in other devices. For example, the trained model is stored in a memory of a server. The server uses the trained model to output to clients. As another example, multiple copies of the trained model are provided to different physicians, medical scanners, and/or workstations for use by different physicians.
Once trained, the machine-learnt network is applied by a machine, such as a computer, processor, or server. The machine uses input raw data for a patient (i.e., radiology report, question, and/or information derived therefrom) and the machine-learnt network to generate an output, such as question information, a parse of the report, evidence, and/or an answer.
The only inputs to the QA system are the question and the radiology report for the patient. Rather than answering the question based on a corpus that includes reports for other patients, only the radiology report or reports for the specific patient are input. The QA system may include learned knowledge from the deep learning, but that knowledge is not input during application to answer a question. The deep learnt knowledge is applied to answer the question. In an alternative embodiment, other information is input to answer the question, such as other information from a medical record of the patient and/or information generic to patients (e.g., a medical atlas).
For application for a specific patient, the input question is different than any question used to train the deep machine-learnt network. At least one word or order of words in the question is different. Similarly, the input radiology report for the patient is different than any radiology report used to train the deep machine-learnt network. At least the text, image, or organization is different. By deep learning by the machine, the machine-learnt network is trained to predict an output based on the input even if the input is different than any used in training. In alternative embodiments, the question and/or radiology report input during application is the same as used for training. Rather than looking up the output in a database, the machine-learnt network outputs an answer based on the training.
In act 14, the computer (e.g., processor) determines an answer to the input question. In response to the input of the question and the radiology report, an answer to the question is derived from the radiology report. Only the information from the radiology report for the specific patient is used to derive the answer to the question. A patient-specific answer is determined as specific to the radiology report for that patient. Alternatively, other information is also used, such as atlas information.
The answer is determined by the QA system with natural language processing including the deep machine-learnt network. The deep machine-learnt network is part of the natural language processing, such as to retrieve evidence in response to a natural language question about a patient from a radiology report for the patient and/or forming an answer from the evidence. The deep machine-learnt network is a natural language processing deep-learnt network. Other natural language processing may be applied, such as linguistic rules, information retrieval, knowledge representation, text/sentence negation, sentiment analysis, and/or logical reasoning.
The natural language processing, in general, performs multiple acts to derive the answer to the question from the radiology report. Question information is derived from the question using natural language processing. The radiology report is reformatted, parsed, used to populate a template, and/or separated using natural language processing. Evidence and/or candidate sentences or phrases relevant to the question are identified using natural language processing. The evidence and/or candidates are analyzed using natural language processing to provide an answer.
Additional, different, or fewer acts may be provided. Acts 15-19 show one example combination of acts using natural language processing.
The deep machine-learnt network or networks are used in one or more of the acts. Machine-learnt networks may be used in all or only a sub-set of the acts.
In act 15, question information is derived from the question. Any question information may be derived. The content (e.g., subject, object, and/or category of subject matter), type of question (e.g., Yes/no, compound, who, what, where, how, why, when, . . . ), anatomy, anatomy location, or other question information is extracted from the question. For example, the question is input as “is there a stenosis in the RCA?” The content is identified as coronary arteries; the type of question is Yes/No; a sign/measure/symptom is stenosis; and the anatomy/organ/segment is right coronary artery. Other medical related categorization and/or content may be derived, such as association with a branch or group of terms.
Natural language processing is used to extract the question information. Some or all the question information may be derived by a deep machine-learnt network. The question is analyzed to determine the question information.
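As an illustration only, the question information of act 15 may be represented as a small record. The following Python sketch uses hand-written keyword tables in place of a deep machine-learnt classifier; the names `QuestionInfo`, `analyze_question`, `ANATOMY_TERMS`, `FINDING_TERMS`, and `YES_NO_OPENERS` are hypothetical and are not part of the described embodiments.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical lookup tables standing in for a trained classifier.
ANATOMY_TERMS = {
    "rca": "right coronary artery",
    "right coronary artery": "right coronary artery",
    "lad": "left anterior descending artery",
}
FINDING_TERMS = ("stenosis", "calcification", "plaque")
YES_NO_OPENERS = ("is", "are", "does", "do", "was", "were", "has", "have")


@dataclass
class QuestionInfo:
    question_type: str        # e.g., "yes/no", "what", "where"
    finding: Optional[str]    # sign/measure/symptom, if recognized
    anatomy: Optional[str]    # anatomy/organ/segment, if recognized


def analyze_question(question: str) -> QuestionInfo:
    """Derive question information with keyword rules (assumption: a
    deployed system would use a machine-learnt network instead)."""
    # Lower-case and replace punctuation with spaces so "RCA?" becomes "rca".
    words = re.sub(r"[^a-z0-9]+", " ", question.lower()).split()
    first = words[0] if words else ""
    if first in YES_NO_OPENERS:
        question_type = "yes/no"
    else:
        question_type = first or "unknown"
    # Pad with spaces so that only whole words or phrases match.
    padded = " " + " ".join(words) + " "
    finding = next((f for f in FINDING_TERMS if f" {f} " in padded), None)
    anatomy = next(
        (full for key, full in ANATOMY_TERMS.items() if f" {key} " in padded),
        None,
    )
    return QuestionInfo(question_type, finding, anatomy)
```

For the example question “is there a stenosis in the RCA?”, this sketch yields a Yes/No question type, stenosis as the finding, and right coronary artery as the anatomy.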
In act 16, radiology report information is derived from the radiology report. The radiology report is analyzed to understand the document. Sections or types of information may be identified. The content, anatomy, and/or other information about each part may be derived. Different parts of the radiology report are separated, such as identifying sentences, clauses, or phrases. In one embodiment, the radiology report is split into sentences, but other separations (e.g., by section or grammar-based) may be used.
In one embodiment, the radiology report is parsed. The parsing locates different types of information and places the information into a template radiology report. The template includes a hierarchal standardized structure, which is populated from the radiology report based on natural language processing. In another embodiment, grammatical rules or searching for punctuation and/or capitalization is used to extract the parts of the radiology report. Other derivations may be performed.
Natural language processing is used to derive the radiology report information. For example, computational linguistics or knowledge representation are used to extract. Some or all the radiology report information may be derived by a deep machine-learnt network.
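The punctuation- and capitalization-based separation of act 16 may be sketched as follows. This is a minimal example under the assumption that sentences end in a period, exclamation mark, or question mark and that the next sentence begins with a capital letter; the function name `split_report` is hypothetical, and abbreviations such as "Dr. Smith" would be split incorrectly by this simple rule.

```python
import re
from typing import List


def split_report(report: str) -> List[str]:
    """Split a free-form report into sentences.

    A boundary is placed after '.', '!' or '?' when it is followed by
    whitespace and then an upper-case letter.
    """
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", report.strip())
    # Drop empty fragments, e.g., when the report itself is empty.
    return [part.strip() for part in parts if part.strip()]
```

A grammar-based or machine-learnt separator may replace this rule without changing the downstream acts, since each receives only the list of sentences or phrases.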
In act 17, candidate sentences or phrases are extracted from the radiology report and/or from the radiology report information from act 16. The information or parts of the radiology report corresponding to the question information are extracted. For example, the sentences are compared to the question, and the candidate sentences that relate to the question are extracted (e.g., any sentence with content for right coronary artery is identified). Less than all the sentences are extracted as candidate sentences. As another example, the parts of the hierarchal template structure corresponding to the question are extracted.
Natural language processing is used to extract from the radiology report information associated with or providing evidence for the question. A deep machine-learnt network may be applied to extract. The radiology report information that relates to the question information is output as a match.
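One simple way to picture the candidate extraction of act 17 is a term-overlap score between the question information and each sentence. The Python sketch below is an assumption made for illustration and is a stand-in for, not an implementation of, a deep machine-learnt matching network; `extract_candidates` and its parameters are hypothetical names.

```python
from typing import List, Sequence, Tuple


def extract_candidates(
    sentences: Sequence[str], query_terms: Sequence[str]
) -> List[Tuple[str, int]]:
    """Return (sentence, score) pairs for sentences that mention at least
    one query term, ordered from highest to lowest score."""
    scored = []
    for sentence in sentences:
        lowered = sentence.lower()
        # Score is the number of query terms found in the sentence.
        score = sum(1 for term in query_terms if term.lower() in lowered)
        if score > 0:
            scored.append((sentence, score))
    # sorted() is stable, so ties keep their order from the report.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Sentences with no overlap are dropped, so less than all the sentences are passed on as candidates.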
In act 18, evidence is extracted from the radiology report information and/or extracted candidates therefrom. Any evidence that may contribute to the answer is extracted. The candidate sentences and/or radiology report information may be parsed or analyzed with respect to the question information. For example, a candidate sentence is “Right Coronary: No evidence of stenosis.” The extracted evidence is a negative with respect to stenosis and the right coronary artery.
Natural language processing is used to extract the evidence. A deep machine-learnt network may be applied to extract the evidence. The evidence in the radiology report information and/or candidates that indicate a possible answer to the question is retrieved by the natural language processing.
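For the example candidate sentence above, the evidence extraction of act 18 may be approximated by a cue-based negation check. The sketch below is an assumption for illustration: the cue list `NEGATION_CUES`, the `Evidence` record, and `extract_evidence` are hypothetical, and the check treats any cue appearing before the finding in the same sentence as negating it, which is cruder than a trained network or a dedicated clinical negation tool.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical negation cues; not an exhaustive clinical list.
NEGATION_CUES = ("no", "not", "without", "absent", "negative for")


@dataclass
class Evidence:
    finding: str
    present: bool   # False when the finding is negated in the sentence
    sentence: str


def extract_evidence(sentence: str, finding: str) -> Optional[Evidence]:
    """Return evidence about `finding` from one candidate sentence, or
    None when the sentence does not mention the finding."""
    lowered = sentence.lower()
    index = lowered.find(finding.lower())
    if index < 0:
        return None
    # Only text before the finding is searched for negation cues.
    prefix = lowered[:index]
    negated = any(
        re.search(r"\b" + re.escape(cue) + r"\b", prefix)
        for cue in NEGATION_CUES
    )
    return Evidence(finding=finding, present=not negated, sentence=sentence)
```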
In act 19, an answer is generated from the evidence, candidates, and/or radiology report information. The evidence retrieved from the radiology report is analyzed with respect to the question information to generate the answer. For example, the evidence is combined to determine the answer.
Natural language processing is used to generate the answer. The evidence is combined, such as probabilistically, to determine a most likely or probable answer. Rules or fuzzy logic may be used. A deep machine-learnt network may be used to generate the answer, such as outputting possible answers and corresponding probabilities given input evidence and question information. The possible answer with the greatest probability is selected. Alternatively, the network may output a single answer with or without probability information.
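The probabilistic combination may be pictured with a simple weighted vote. In the sketch below, each piece of evidence contributes a possible answer and a non-negative weight, the weights are normalized into probabilities, and the answer with the greatest probability is selected. The normalization scheme and the names `combine_evidence` and `select_answer` are assumptions made for illustration; a deep machine-learnt network may produce the probabilities directly.

```python
from collections import defaultdict
from typing import Dict, Iterable, Optional, Tuple


def combine_evidence(votes: Iterable[Tuple[str, float]]) -> Dict[str, float]:
    """Sum the weights per possible answer and normalize to probabilities."""
    totals: Dict[str, float] = defaultdict(float)
    for answer, weight in votes:
        totals[answer] += weight
    norm = sum(totals.values())
    if norm == 0:
        return {}  # no usable evidence
    return {answer: weight / norm for answer, weight in totals.items()}


def select_answer(
    votes: Iterable[Tuple[str, float]]
) -> Optional[Tuple[str, float]]:
    """Return the most probable answer and its probability, or None."""
    probabilities = combine_evidence(votes)
    if not probabilities:
        return None
    best = max(probabilities, key=probabilities.get)
    return best, probabilities[best]
```

With two pieces of evidence supporting "no" at weight 0.75 each and one supporting "yes" at weight 0.5, the combined probabilities are 0.75 for "no" and 0.25 for "yes", so "no" is selected.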
In act 20, the computer (e.g., processor) outputs the answer. The answer provided by the determination is output. The output is to a display device, such as a response to input to the question on a user interface. Alternatively, the output is to a report, such as adding the question and answer to the radiology report and/or the medical data record of the patient.
The use of a deep machine-learnt network may make the response time of the computer to provide an answer more rapid. Due to the application of the network rather than a hand-programmed series of calculations, the computer operates more quickly to provide an answer.
The output answer may assist a patient or family in understanding their radiology report. Rather than having to search for meanings of medical terms, simple language questions may be posed and simple language answers output. Arranging for consultation with a physician may be avoided.
The output answer may assist a physician. Reviewing the entire radiology report again may be avoided by asking a question and receiving the answer. Where the radiologist uses different expressions or terminology, the physician may receive an answer that avoids confusion, assisting in diagnosis, prognosis, and/or treatment.
A question with rich content and a radiology report are input in act 12. In act 15, question information (content, type, and anatomy location) is extracted from the question by a deep machine-learnt classifier. The input question is analyzed to understand the content, type, location, etc. In act 16, the radiology report is parsed into a template by another deep machine-learnt classifier. The template has a hierarchal standardized structure.
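A hierarchal template of this kind may be pictured as a nested mapping from section to anatomy to findings text. The Python sketch below assumes, purely for illustration, a report laid out with section headings on their own lines (e.g., "FINDINGS:") and entries of the form "Anatomy: text"; the functions `parse_to_template` and `select_text` are hypothetical, and a rule-based parse stands in for the deep machine-learnt classifier.

```python
from typing import Dict, List

Template = Dict[str, Dict[str, List[str]]]


def parse_to_template(report: str) -> Template:
    """Populate a section -> anatomy -> findings template from a report."""
    template: Template = {}
    section = "general"  # used until the first section heading is seen
    for raw_line in report.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        head, sep, body = line.partition(":")
        if sep and not body.strip():
            # A line such as "FINDINGS:" opens a new section.
            section = head.strip().lower()
            template.setdefault(section, {})
            continue
        if sep:
            key, text = head.strip().lower(), body.strip()
        else:
            # No "Anatomy:" prefix, so file the line under a generic key.
            key, text = "unspecified", line
        template.setdefault(section, {}).setdefault(key, []).append(text)
    return template


def select_text(template: Template, anatomy: str) -> List[str]:
    """Retrieve all text filed under the given anatomy in any section."""
    key = anatomy.lower()
    return [
        text
        for entries in template.values()
        for text in entries.get(key, [])
    ]
```

Text selected from the template in this way may then serve as the evidence from which the answer is generated.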
A question and a radiology report are input in act 12. In act 15, question information (content, type, anatomy location, and measurements) is extracted from the question by a natural language processing deep machine-learnt classifier. The input question is analyzed to understand the content, type, location, etc. In act 16, the radiology report is separated into sentences or phrases. Grammatical rules, punctuation rules, or other hand-programmed rules are used to split the radiology report into sentences or phrases. In act 17, candidates of the sentences or phrases are identified by another natural language processing deep machine-learnt classifier. The question and report sentences are fed into a QA matching model (e.g., a pre-trained deep learning model). The candidate sentences containing answers based on the question information are automatically detected by the model as the evidence. Natural language processing tools, such as cTAKES, may be used to extract the evidence. In act 18, parts of the candidates are selected. The candidate sentences are analyzed using natural language processing tools and/or a deep machine-learnt network to extract evidence (e.g., key knowledge) from the candidate sentences. Specific information from the statements that relates to the question is identified. In act 19, the answer is generated from the selected text. The question is matched against the evidence to pick up the right answer. The selected evidence correlates to an answer to the question.
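The sequence of acts 15-19 in this embodiment may be summarized as a chain of five stages. The Python sketch below is a toy illustration under stated assumptions: every stage is a hand-written rule standing in for the natural language processing or deep machine-learnt component described above, only Yes/No questions about stenosis of the right coronary artery are handled, and all function names and the `NO_ANSWER` string are hypothetical.

```python
import re
from typing import Dict, List

NO_ANSWER = "No answer found in the report."


def analyze(question: str) -> Dict[str, str]:
    """Act 15 stand-in: derive question type, finding, and anatomy."""
    text = question.lower()
    first = (text.split() or [""])[0]
    has_rca = re.search(r"\brca\b|right coronary", text) is not None
    return {
        "type": "yes/no" if first in ("is", "are", "does") else "other",
        "finding": "stenosis" if "stenosis" in text else "",
        "anatomy": "right coronary" if has_rca else "",
    }


def split(report: str) -> List[str]:
    """Act 16: separate the report into sentences by a punctuation rule."""
    return [s for s in re.split(r"(?<=[.!?])\s+", report.strip()) if s]


def match(info: Dict[str, str], sentences: List[str]) -> List[str]:
    """Act 17 stand-in: keep sentences that mention the anatomy."""
    anatomy = info["anatomy"]
    return [s for s in sentences if anatomy and anatomy in s.lower()]


def extract(info: Dict[str, str], candidates: List[str]) -> List[bool]:
    """Act 18 stand-in: True if the finding is asserted, False if negated."""
    evidence = []
    for sentence in candidates:
        lowered = sentence.lower()
        index = lowered.find(info["finding"]) if info["finding"] else -1
        if index >= 0:
            negated = re.search(r"\b(no|not|without)\b", lowered[:index])
            evidence.append(negated is None)
    return evidence


def generate(info: Dict[str, str], evidence: List[bool]) -> str:
    """Act 19 stand-in: answer Yes if any evidence asserts the finding."""
    if info["type"] != "yes/no" or not evidence:
        return NO_ANSWER
    return "Yes" if any(evidence) else "No"


def answer_question(question: str, report: str) -> str:
    info = analyze(question)              # act 15
    sentences = split(report)             # act 16
    candidates = match(info, sentences)   # act 17
    evidence = extract(info, candidates)  # act 18
    return generate(info, evidence)       # act 19
```

Because each stage only consumes the output of the previous one, any single stage may be replaced by a machine-learnt network without changing the others.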
The system implements the method of
The system includes an interface 70, a medical records database 72 with a radiology report 74, a processor 76 for applying a natural language processing, deep machine-learnt network 78, and a display 80. Additional, different, or fewer components may be provided. For example, a medical imager is provided for scanning the patient to generate the radiology report 74. In another example, a network or network connection is provided, such as for networking with a medical imaging network or data archival system or interconnecting the processor 76 and the database 72.
The interface 70, medical records database 72, processor 76, and/or display 80 are part of a medical imager, server, workstation, or computer. In one embodiment, the interface 70, medical records database 72, processor 76, and/or display 80 are a personal computer, such as a desktop or laptop. In yet other embodiments, the medical records database 72 is part of a separate computer from the processor 76.
The interface 70 is a communications port, such as an ethernet card. In other embodiments, the interface 70 is a user interface, such as a user input device (e.g., keyboard, mouse, trackpad, touchscreen, and/or roller ball). The interface 70 may be a bus, chip, or other hardware for receiving the radiology report and/or question.
The interface 70 is configured to receive a question about a radiology report of a patient. The configuration is provided by software, hardware, and/or firmware. For example, the interface 70 is configured by an operating system to receive user entry of a question via audio or text.
The medical records database 72 is a random access memory, system memory, cache memory, hard drive, optical media, magnetic media, flash drive, buffer, database, combinations thereof, or other now known or later developed memory device for storing the radiology report 74, the deep machine-learnt network 78, and/or data generated by natural language processing (e.g., template, sentences, candidates, evidence, text, probabilities, and/or answer). The medical records database 72 is part of the computer associated with the processor 76 or is a separate or remote database for access over a computer network.
The medical records database 72 or other memory is alternatively or additionally a non-transitory computer readable storage medium storing data representing instructions executable by the programmed processor 76 for learning or applying the machine-learnt model, and/or for natural language processing. The instructions for implementing the processes, methods, and/or techniques discussed herein are provided on non-transitory computer-readable storage media or memories, such as a cache, buffer, RAM, removable media, hard drive, or other computer readable storage media. Non-transitory computer readable storage media include various types of volatile and nonvolatile storage media. The functions, acts, or tasks illustrated in the figures or described herein are executed in response to one or more sets of instructions stored in or on computer readable storage media. The functions, acts, or tasks are independent of the particular type of instruction set, storage media, processor, or processing strategy and may be performed by software, hardware, integrated circuits, firmware, micro code, and the like, operating alone or in combination. Likewise, processing strategies may include multiprocessing, multitasking, parallel processing, and the like.
In one embodiment, the instructions are stored on a removable media device for reading by local or remote systems. In other embodiments, the instructions are stored in a remote location for transfer through a computer network or over telephone lines. In yet other embodiments, the instructions are stored within a given computer, CPU, GPU, or system.
The processor 76 is a general processor, control processor, digital signal processor, application specific integrated circuit, field programmable gate array, or other hardware processor for natural language processing in a QA system. The processor 76 is part of a computer, workstation, server, or other device configured to apply machine learning and/or to apply a machine-learnt network 78. The processor 76 is configured by software, hardware, and/or firmware. For learning, the processor 76 is configured by one or more machine learning algorithms. For applying a learnt network 78, the processor 76 is configured, in part, by a learnt matrix or matrices, table, or other neural network representation associating input data to output data.
The processor 76 applies the machine-learnt network 78 and/or other natural language processing. The machine-learnt network 78, as implemented by the processor 76, contributes to determining an answer to a question from a radiology report. The processor 76 is configured to analyze the question for information, identify parts of the radiology report, retrieve evidence related to the question from the parts, and generate an answer to the question based on the evidence. The configuration for at least one of the analyze, identify, retrieve, or generate includes natural language processing and/or application of a natural language processing, deep training-based machine-learnt network.
In one embodiment, the processor 76 is configured to: analyze by extraction of the information as content, type, and anatomy location by a first deep machine-learnt classifier of the deep-training based machine-learnt network; identify the parts by a parse of the radiology report into a template by a second deep machine-learnt classifier of the deep-training based machine-learnt network; retrieve the evidence as a selection of text from the template based on the content, type, and/or anatomy location; and/or generate the answer from the selected text.
In another embodiment, the processor 76 is configured to: extract content, type, and anatomy location from the question with a first deep machine-learnt classifier of the deep-training based machine-learnt network; separate the radiology report into sentences or phrases; identify candidates of the sentences or phrases by a second deep machine-learnt classifier of the deep-training based machine-learnt network; select parts of the sentences or phrases; and/or generate the answer from the selected parts.
The display 80 is a monitor, LCD, projector, plasma display, CRT, printer, or other now known or later developed device for displaying the answer, question, evidence, and/or radiology report. The display 80 receives the answer from the processor 76, the radiology report from the processor 76 or database 72, and the question from the interface 70 or processor 76. The processor 76 formats the data into an image and stores the image in a buffer, configuring the display 80. The display 80 uses the image in the buffer to generate an image for viewing. The image includes graphics, alphanumeric text, anatomical scan, and/or other information for communicating the answer.
While the invention has been described above by reference to various embodiments, it should be understood that many changes and modifications can be made without departing from the scope of the invention. It is therefore intended that the foregoing detailed description be regarded as illustrative rather than limiting, and that it be understood that it is the following claims, including all equivalents, that are intended to define the spirit and scope of this invention.