[Not Applicable]
[Not Applicable]
[Not Applicable]
Embodiments of the present method and system relate generally to electronic data collection and display in a healthcare setting. Particularly, certain embodiments relate to providing automated methods and systems for updating medical reference materials.
Many traditional medical textbooks have been converted to electronic formats, particularly in the field of radiology. Users of such electronic textbooks use computers to navigate the textbook contents. One advantage that these electronic textbooks offer over conventional texts is the ability for users to link to large databases of images or other data that can enhance the learning experience. However, the reference materials linked to electronic texts tend to contain static content. That is, there is typically no mechanism for users of the electronic texts or educators using such texts to add content to the databases. In the medical profession, a tremendous amount of learning is empirical or based on actual cases and the lessons gathered from the diagnosis and treatment of specific physiological conditions. Thus, there is a need for electronic texts to have their reference collections updated to reflect the empirical learning common to the medical profession.
Moreover, collections of reference exams are useful not only for the education of new clinicians and the continuing education of existing clinicians, but also for decision support in the clinic. Clinical decision support refers to using a knowledge base and a mechanism for drawing inferences based on a set of expert rules in order to guide diagnosis.
As with traditional texts, both the main body of the electronic text and any linked reference material are categorized by anatomy, pathology, or some other relevant indexing system. Thus, there exists a classification system inherent in the electronic texts that may allow for integration of new data into the main body or reference materials of an electronic text. There is a need for a convenient way to take advantage of this inherent classification system to update reference materials using clinically relevant data.
As clinics, hospitals, and other healthcare facilities have come to rely more and more on computers over the last several decades, much of the data useful for updating electronic texts exists in electronic formats. In particular, healthcare facilities employ certain types of digital diagnostic imaging modalities, such as computed tomography, magnetic resonance imaging, ultrasound imaging, and X-ray imaging. The images gathered on these systems are stored in electronic formats, as are the orders used to generate the images and the clinical reports that result from clinical analysis of the images.
Manipulation of these electronic data sets, such as clinical reports and clinical images is known. One method used for manipulating large clinical data sets is natural language processing. Natural language processing converts computer-readable text, typically in a narrative format, into an often predefined, structured form. This structured form can be used for further analysis of the data. For example, Hripcsak et al. used natural language processing to structure over 800,000 clinical reports and compare the findings in the reports. (G. Hripcsak, J. H. Austin, P. O. Alderson, C. Friedman, Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports. Radiology. July 2002; 224(1):157-63). Other uses of natural language processing in a radiology setting include A. A. Bui, R. K. Taira, S. El-Saden, A. Dordoni, D. R. Aberle, Automated medical problem list generation: towards a patient timeline. Medinfo. 2004; 11(Pt 1):587-91 and K. J. Dreyer, M. K. Kalra, M. M. Maher, A. M. Hurier, B. A. Asfaw, T. Schultz, E. F. Halpern, J. H. Thrall, Application of recently developed computer algorithm for automatic classification of unstructured radiology reports: validation study. Radiology. February 2005; 234(2):323-9.
What is needed is a system and method for applying classification methods in real time to medical data. Such real time classification could take advantage of the common electronic formats of clinical data and reference materials to provide an automated way for updating medical reference collections.
Certain embodiments of the present invention include a method for automated collection of medical reference materials. Certain embodiments of the method comprise the steps of tagging exam data, processing the exam data to extract categorizing information, categorizing the exam data, and storing the exam data in a reference collection.
Certain embodiments of the present invention include an automated system for updating reference materials in a healthcare setting. Certain embodiments of the automated system comprise a collection of medical reference materials connected to a network, an exam database connected to the network, and a workstation connected to the network for evaluating data stored in the exam database. The collection of medical reference materials may have a set of reference exams. The data evaluation on the workstation may comprise tagging data for categorization.
The foregoing summary, as well as the following detailed description of certain embodiments of the present invention, will be better understood when read in conjunction with the appended drawings. For the purpose of illustrating the invention, certain embodiments are shown in the drawings. It should be understood, however, that the present invention is not limited to the arrangements and instrumentalities shown in the attached drawings.
Referring to
Still referring to
In the event that a clinician activates a tag in exam data 110 and the query answer is “YES,” exam data 110 proceeds to extraction step 130 according to one embodiment of the present invention. Extraction step 130 parses exam data 110 and extracts information that matches a set of predefined rules or categories. Parsing exam data 110 may involve a natural language processing routine according to one embodiment of the present invention.
Natural language processing enables extraction step 130 to scan the text-based data of exam data 110 and parse out key semantics according to one embodiment of the present invention. Key semantics may include the clinical finding that identifies the pathology of interest in the exam. Each exam procedure may then be associated with a preset list of pathologies that may be used as attributes to describe the exam. The natural language processing of the report could determine whether each pathology attribute is true (present) or false (not present). Such a detailed list of attributes would allow for much more specific image retrievals. Thus, extracting step 130 is useful for a method of automated collection and categorization of exam data 110 for updating medical reference materials in that extraction step 130 extracts key information for categorizing exam data 110, according to one embodiment of the present invention.
According to one embodiment of the present invention, extracting step 130 may extract data from an image or series of images. In such a case where the data is extracted from an image, extracting step 130 preferably examines the data fields associated with the image, such as the Digital Imaging and Communications in Medicine (DICOM) information commonly used with radiology images. The DICOM vocabulary is typically more limited that the narrative vocabulary used in a clinical report. Thus, a natural language processing routine may not be needed to extract data from the DICOM data fields associated with an image. The limited vocabulary of the DICOM fields may be parsed to extract DICOM terms commonly known to overlap with reference categories in medical reference collections. Similarly, exam orders may be parsed for HL-7 protocol terms, for example, as exam orders typically are formatted in the HL-7 protocol.
Referring to
An alternative way of categorizing extracted data would be to associate a set of attributes gathered from the findings in an exam report that would be relevant to a type of exam (e.g. the MR Brain example above). Each type of exam will have a unique set of possible associated findings (e.g. a C-T scan of the chest will have a different set of findings than a MR scan of the brain).
In one embodiment of the method of the present invention, extraction step 130 provides semantics to comparison step 140 in a specific grammatical form for comparison with reference categories 145. For example, extraction step 130 may provide the noun “fiber” to comparison step 140 in the event the term “fibrous” was identified in exam data 110 during extraction step 130. Or, extraction step 130 may provide multiple grammatical formats for a given term, such as “fiber,” “fibers,” “fibril,” “fibrils,” “fibrous” and “fibrillar.” Multiple grammatical formats serve at least the purpose of providing multiple points of comparison to reference categories 145. That is, reference categories 145 may have grammatical formats different than the specific grammatical format of the semantics being extracted in extraction step 130.
Further, reference categories 145 may span a number of individual medical references in a collection, according to one embodiment of the present invention. For example, reference categories 145 may include categorizing terms from an electronic radiology text, an electronic oncology text, and an electronic physiology text. In such an example, a given categorizing term may have slight variations from one text to another. Thus, providing multiple grammatical formats for extracted semantics may facilitate categorization in multiple references.
In one embodiment of the present invention, comparison step 140 may perform grammatical formatting to facilitate categorization. Or, both extraction step 130 and comparison step 140 may perform grammatical formatting to facilitate categorization. In any event, comparison step 140 performs the function of filtering through the extracted semantics to provide a list of semantics that overlap with reference categories 145 according to one embodiment of the present invention. Comparison step 140 may provide a list of multiple overlaps within a single reference collection or across multiple collections.
Referring again to
Referring to
According to one embodiment of the present invention, output step 160 may remove certain patient demographic information from the categorized data in order to preserve patient confidentiality. Since the data may be linked to a reference collection for educational purposes, certain patient demographic data, such as age and gender, may be useful for furthering the educational purpose of the reference collection. However, other patient demographic information that may be part of exam data 110 is potentially unnecessary for educational purposes, such as, for example, the patient's name or Social Security number.
The technical effects of certain embodiments of the present method are tagging exam data, processing the exam data to extract categorizing information, categorizing the exam data, and storing the exam data in a reference collection.
The steps described above are illustrated in
According to one embodiment of the present invention, workstation 220 provides a user interface that enables a clinician to interact with exam data such as exam order 222, exam image 224 and exam report 226. A clinician may create and/or edit exam order 222 and exam report 226 using workstation 220 and may view and edit exam image 224 using workstation 220. Workstation 220 is connected to image archive 230 and data archive 240 to facilitate access to stored data as well as storage of created or edited data.
In addition to viewing and manipulating exam data on workstation 220, a clinician may activate a tag on exam data using workstation 220 according to one embodiment of the present invention. A clinician may activate a tag to identify exam data for automated characterization for addition to a reference collection. In the event a tag is activated, exam order 222, exam image 224, and exam report 226 may all be processed for categorization and storage in a reference collection.
Exam image 224 may be stored in image archive 230, according to one embodiment of the invention. If exam image 224 has been added to a reference collection according to one method of the present invention, then exam image 224 may also be stored in reference collection 250. Alternately, reference collection 250 may contain a link to exam image 224. In such a case where reference collection 250 contains a link to exam image 224, if a user of reference collection 250 would like to view exam image 224, then reference collection 250 can cause exam image 224 to be retrieved from image archive 230.
Similarly, exam report 226 and exam order 222 may be stored in data archive 240, according to one embodiment of the invention. If exam report 226 and/or exam order 222 has been added to a reference collection according to one method of the present invention, then exam report 226 and/or exam order 222 may also be stored in reference collection 250. Reference collection 250 may contain a link to exam report 226 and/or exam order 222.
Referring to
In one example of an embodiment of the present invention, a radiologist uses a PACS workstation to retrieve a series of images related to a magnetic resonance (MR) scan of a patient's brain. Upon examining the image series, the radiologist records the following notes in the findings section of a clinical report: “Increased T2 and FLAIR signal in the periventricular white matter and central pons, consistent with chronic small vessel ischemic change. No hemorrhage, no mass, no midline shift, no hydrocephalus, no signal abnormality on diffusion weighted images, no brain parenchymal signal abnormality on conventional images, no abnormal extra axial fluid collection, no bone lesion, paranasal sinuses are clear.” The radiologist decides that this series of images is a particularly clear example of a certain pathology and tags the image by marking a field in the display of PACS workstation. Now that the image is marked, it is processed using natural language processing to yield the following text string: “chronic small vessel ischemic change.” The images and the report are then linked to the Neurovascular category of an appropriate radiology text and a neurology text.
Embodiments of the present invention provide systems and methods for automated categorization of clinical data for addition of such data to medical reference collections. Certain embodiments take advantage of common electronic formats of clinical data and medical reference materials to provide a system and method for updating the medical reference materials. Certain embodiments take advantage of developments in data processing, such as for example natural language processing, to provide a real-time classification system and method.
While the invention has been described with reference to certain embodiments, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted without departing from the scope of the invention. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the invention without departing from its scope. Therefore, it is intended that the invention not be limited to the particular embodiment disclosed, but that the invention will include all embodiments falling within the scope of the appended claims.