System and method for identifying disease-influencing genes

Abstract
The present invention describes a system and method of using individuals' behavioral and physiologic information to identify disease-influencing genes.
Description
FIELD OF THE INVENTION

This invention relates generally to the fields of genomics, bioinformatics, and drug development. More specifically, it relates to a database containing phenotypic and environmental data on groups of individuals for use in conjunction with gene sequences to identify disease-influencing genes and substances.


BACKGROUND OF THE INVENTION

The physical makeup of an individual is determined by his or her genes. Genes are comprised of DNA, which in turn consists of four nucleotides known as adenine(A), thymine(T), cytosine(C), and guanine(G). A particular series of nucleotides is known as a gene sequence. Each gene sequence codes for a protein. A defective or mutant gene sequence will not produce a working protein. The protein may not perform its purpose, the protein may carry out a different purpose than intended, too much protein may be made, too little protein may be made, or the protein may not be made at all. If the protein is essential to one or more functions of the body, disease will result.


Mutant gene sequences are either inherited or acquired. An inherited gene sequence is received from an individual's parents, while an acquired gene sequence results from an event in the individual's lifetime which changes the original gene sequence.


A classic example of an inherited mutant gene sequence is the sickle cell anemia gene. Sickle cell anemia is caused by the substitution of a single nucleotide (A to T) in the gene sequence of an individual. This single substitution results in the substitution of a single amino acid (glutamic acid to valine) in the resulting hemoglobin protein. The mutant hemoglobin protein produces crescent-shaped or sickled red blood cells in affected individuals, causing a decrease in the amount of oxygen that can be transported throughout the body. The lack of oxygen often results in kidney and heart failure, paralysis, and rheumatism, which are common symptoms of anemic individuals.


An example of an acquired mutant gene sequence is malignant melanoma, or skin cancer. Cancer results when normal cells in an individual's body either lose or gain certain functions, resulting in the unchecked growth of non-normal cells. These non-normal cells often form tumors and spread throughout the body, disrupting normal cell functions. A cancer such as malignant melanoma is caused when the original gene sequence in epidermal cells is changed or mutated by an environmental factor, such as UV radiation. Our cells contain repair mechanisms to fix such problems, but over time the gene sequences in epidermal cells acquire more and more mutations. Mutant proteins are then produced and cellular functions are disrupted. The individual then has skin cancer.


Although an individual's environment generally precipitates the development of cancer, many individuals have been found to have a predisposition to cancer. These individuals have gene sequences which are more likely to become mutated over a shorter period of time. Examples of such gene sequences are the BRCA1 and BRCA2 genes. Women carrying these gene sequences have a higher probability of developing breast and ovarian cancer than women who carry normal gene sequences. Thus, although the affected women's original gene sequences may not be mutated, they are more likely to become mutated due to their sequence or location on a chromosome.


Another factor that should be considered when discussing genetic diseases is whether they are monogenic or polygenic in nature. Sickle cell anemia and cystic fibrosis are examples of monogenic diseases, as they are caused by a single gene sequence. Most types of cancer, asthma, and diabetes are examples of polygenic diseases, as they are caused by a variety of genes. Polygenic diseases are also more likely to be influenced by an individual's environment. Not surprisingly, polygenic diseases are more difficult to diagnose and treat. Thus, the use of gene sequences in developing new drugs is dependent the monogenic or polygenic nature of genetic diseases.


Typically, individuals with diseases caused by inherited or acquired gene sequences have only their symptoms treated. Diabetes patients receive insulin shots to regulate their blood glucose levels, asthma patients use inhalers to allow normal respiratory functions, and cancer patients undergo chemotherapy and radiation therapy to remove cancerous tumors. Although these treatments are often able to alleviate or eliminate the symptoms, they are unable to remove the genetic bases of the diseases.


The genetic bases of many diseases were discovered in the 1940's by scientists such as Beadle and Tatum, who discovered that each gene codes for a protein. Researchers then rationalized that study of the relevant gene sequences could lead to effective drug treatments for genetic diseases. The technology was inadequate, however, until the 1970-80's, when Boyer and Cohen cloned DNA; Maxam, Gilbert, and Sanger figured out how to sequence DNA; and Mullis developed the polymerase chain reaction (PCR) technique to quickly amplify DNA sequences. Using genetics to find drug candidates soon became a practical option.


Before these techniques became available, the pharmaceutical industry's main method of finding new drugs was trial and error. Compounds that were found to mimic the body's natural compounds were tested in vitro, in animal models, and in clinical trials to see if they had a desirable effect in treating disease. This method is still used and has resulted in many well-known drugs, but it is expensive and time-consuming.


With the advent of improved genetic techniques, however, the pharmaceutical industry has begun concentrating on genetics as the most effective route to new drug discovery. Genomics companies can typically be classified into one of two groups.


The first group concentrates on gene sequencing in order to find both drug targets and drug candidates, usually in the form of proteins expressed by the gene sequences. Gene sequencing can either be in the form of random discovery, whereby genes are sequenced without regard to their functions, or in the form of targeted discovery, whereby a certain region of the genome which is tentatively associated with a disease is sequenced. In random discovery gene sequencing, potentially useful gene sequences are identified and assayed to determine if they can be used in drug development. One problem with random discovery gene sequencing is that the majority of the human genome contains introns, or gene sequences which do not code for proteins. One way to circumvent this problem is to sequence complementary DNA (cDNA) instead. cDNA is produced from messenger RNA (mRNA). mRNA, in turn, is transcribed from DNA and processed by certain enzymes which remove the introns. cDNA sequences thus code for un-interrupted proteins.


Targeted discovery gene sequencing is typically used with positional cloning, comparative gene expression, and functional cloning techniques, which are described in the next group.


The second group of genomics companies takes a more epidemiological approach by first researching families or groups of individuals having a similar disease, and then isolating the relevant genes. In this method, also known as positional cloning, blood samples are taken from the individuals and analyzed. The blood samples contain DNA, which is studied to identify certain regions of the genome which appear to be associated with the disease. Linking a region of the genome with a disease is known as linkage analysis or genetic linkage mapping. Once a region of the genome has been identified, it is sequenced via targeted discovery gene sequencing.


The second group of genomics companies also uses comparative gene expression to discover disease gene sequences. In comparative gene expression, mRNA from both healthy and diseased tissue is isolated. The mRNA is then used to produce cDNA, which is sequenced using targeted discovery gene sequencing. The gene sequences from both the healthy and diseased tissue are then compared. In addition, the identification of genes associated with disease can be made by studying the level of expression of genes in both the healthy and diseased tissue.


Another similar technique is functional cloning. Mutant or non-functional proteins in metabolic pathways are studied and identified. The proteins are sequenced using targeted discovery gene sequencing and these sequences are used to figure out the corresponding DNA gene sequences. Once the disease gene sequences have been identified, they can be used in drug development.


Genomics companies in the first group include Incyte Pharmaceuticals (Palo Alto, Calif.). Incyte uses random discovery gene sequencing to produce its LifeSeq™ and LifeSeq FL™ databases. These databases contain the sequences of hundreds of human genes. These databases are licensed to drug development companies who use the sequences to produce new drugs. Databases covering animals (ZooSeq™), plants (PhytoSeq™), and bacteria and fungi (PathoSeq™) are also available. Incyte has also developed bioinformatics software, which provides sequence analysis and data management for their databases. In addition, Incyte offers cDNA libraries of the gene sequences in their databases, which can be directly used in drug development.


Human Genome Sciences (Rockville, Md.) also concentrates on random discovery gene sequencing, and has sequenced an estimated 90% of the 100,000 genes in the human body. In addition to collaborating with drug development companies who use their gene sequences, HGS also has its own drug discovery and development division. A number of therapeutic proteins which appear effective in animal models are under study.


Hyseq, Inc. (Sunnyvale, Calif.) has its HyX Platform which is capable of processing and sequencing millions of blood and DNA samples. The HyX Platform includes DNA arrays of samples and probes, software-driven modules, industrial robots for screening DNA probes against DNA samples, and bioinformatic software to analyze the genetic information. Through the use of its HyX Platform, HyX believes it can carry out a variety of techniques, such as gene identification, gene expression level determination, gene interaction studies (for polygenic diseases), and genetic mapping.


Affymetrix, Inc. (Santa Clara, Calif.) has a GeneChip system consisting of disposable DNA probe arrays containing gene sequences on a chip, instruments to process the probe arrays, and software to analyze and manage the genetic information in the probe. The GeneChip system thus allows pharmaceutical and biotechnology companies to collect gene sequences and apply them to drug development.


On the other hand, the pharmaceutical industry has a number of genomics companies who first identify the genes which are likely to cause disease. After the genes are identified, they are sequenced and the gene sequences are used in drug development. Likewise, proteins implicated in disease can be identified and sequenced. The sequences can be used to discover the gene sequences, which are then used in drug development.


Myriad Genetics, Inc. (Salt Lake City, Utah) targets families with a history of genetic disease and collects their genetic material in order to identify hereditary disease-causing genes. Myriad is able to identify these genes by using positional cloning and protein interaction studies in combination with targeted discovery gene sequencing. Using these techniques, Myriad has been able to locate and identify eight disease-related gene sequences, including BRCA1 and BRCA2. These gene sequences are used by Myriad's pharmaceutical partners to develop new therapeutics.


Another genomics company which uses disease inheritance patterns together with gene sequencing is Sequana (La Jolla, Calif.). Sequana uses DNA collection of individuals with inherited diseases, genotyping and linkage analysis, physical mapping, and gene sequencing to find disease gene sequences. Sequana also has a proprietary bioinformatics system which includes data mining tools to automatically sort and organize much of its data. Like Myriad, Sequana has a number of alliances with drug development companies which license Sequana's gene sequences.


Millennium Pharmaceuticals, Inc. (Cambridge, Mass.) employs a broader range of technologies than Myriad and Sequana. In addition to positional cloning and targeted discovery gene sequencing, Millennium uses a number of other non-genetic techniques. cDNA libraries are prepared from mouse tissues and expressed using rapid expression of differential gene expression (RARE) technology. Different patterns of cDNA gene expression allow researchers to identify possible disease targets. Millennium also uses functional cloning techniques in order to identify the gene sequences of interesting proteins. Once a potentially useful gene sequence has been identified, biological assays and bioinformatics are used as additional analyses.


Genome Therapeutics Corporation (Waltham, Mass.) uses a combination of positional cloning techniques and targeted discovery gene sequencing, as well as random discovery gene sequencing to isolate and identify disease gene sequences. In addition, Genome Therapeutics also has pathogen programs, which sequence pathogen genomes. As many non-genetic human diseases result from infection by pathogens, Genome Therapeutics hopes to eliminate pathogens by developing drugs and vaccines using the pathogens' genomes.


Gene Logic, Inc. (Columbia, Md.) has an accelerated drug discovery system which emphasizes its restriction enzyme analysis of differentially expressed sequences (READS) technology. READS is similar in nature to comparative gene expression technology. In READS, normal and diseased tissues are compared in order to identify gene expression differences between the two. Genes which appear to be important in the diseased tissue are then analyzed. Restriction enzymes, which cut gene sequences at specific sites, are used to produce gene fragments. The gene fragments from the normal and diseased tissues will differ and can be compared. Gene Logic also has a Flow-thru Chip and genomic databases, which it licenses to drug development companies.


Progenitor (Columbus, Ohio) focuses on developmental biology. Growing cells and tissues are analyzed for their level of expression of certain genes. Study of growing cells and tissues may help discover treatments for diseases characterized by abnormal cell growth, such as cancer and osteoporosis. Progenitor also uses bioinformatics, gene mapping, and gene sequencing to isolate, identify, and sequence relevant gene sequences.


OncorMed, Inc. (Gaithersburg, Md.) has focused on the development of medical services using genetic information. Oncormed offers a number of tests for hereditary diseases such as breast and colon cancers and malignant melanoma. The medical services include measurements of replication error rates in tumors, molecular profiling of tumor suppresser genes, and gene sequencing. In addition, OncorMed has a genomics repository containing known cancer gene sequences.


U.S. Pat. No. 5,642,936 issued to Evans and assigned to OncorMed describes a method for identifying human hereditary disease patterns. According to the method, data is collected on individuals having a history of disease within their families. Factors related to each disease are given weights, and the weights for each individual are summed. If the sum is above a certain predetermined threshold value, the individual is deemed to have a hereditary risk for the disease. Records from a number of individuals having a hereditary risk for a disease are collected to form a database.


The methods used by the above companies all focus on the genetic aspect of hereditary disease. Gene sequencing and positional cloning represent the two approaches generally taken. However, very little emphasis is put on the environmental aspect of hereditary disease. An individual's environment is defined as his or her physical surroundings, geographical location, diet, lifestyle, etc. For many diseases which are genetic in origin, such as most cancers, an individual's environment plays a large role in determining whether or not the individual eventually develops the disease. Some individuals who have disease gene sequences develop diseases, while others who carry the exact same disease gene sequences do not. One purpose of collecting environmental data about individuals whose gene sequences are studied is to effectively rule out any non-genetic causes of disease. Another purpose is to discover if any individuals who are carrying disease gene sequences but who do not develop the disease have other compensatory gene sequences or factors which enable them to live disease-free.


To a certain extent, the second group of genomics companies do take into account a small amount of environmental data when they select individuals whose DNA they use for positional cloning analyses. The environmental data is usually in the form of a questionnaire or survey. However, the data is typically limited in scope to lifestyle questions, and is used only to help narrow the search for the specific disease gene in question.


In addition, most genomics companies are reluctant to share their data on individuals' with others, even those genomics companies which are studying the same gene sequences. As a result, each genomics company must gather its own data on individuals having a certain disease. For example, Sequana sent its own researcher to the island of Tristan de Cunha to study hereditary asthma, while Myriad is located in Salt Lake City to take advantage of the detailed family trees of the Mormons. For genomics companies searching for gene sequences, gathering environmental data on individuals is often an expensive, time-consuming, but necessary step. Genomics companies could potentially spend more of their time and money on actual disease gene isolation if they were able to obtain necessary environmental data from another source.


Another problem lies in the fact that when genomics companies do gather environmental data on the individuals whose gene sequences are studied, the environmental data represents only a small time frame of an individual's life. Few genomics companies continually collect data over a long period of time, and as a result, are not able to definitively rule out certain environmental factors which may affect disease progression. In addition, such data collections are unlikely to provide leads for factors which may prohibit the formation of disease.


OBJECTS AND ADVANTAGES OF THE INVENTION

Accordingly, it is a primary object of the present invention to provide a system and method for creating a database of information about individuals' environments over a period of time. Another object of the present invention is to provide a database containing information about individuals' environments which can be used with existing genomics databases. A further object of the present invention is to provide a method of using environmental information about an individual in conjunction with the individual's genotype to find disease-influencing genes or substances. It is another object of the present invention to use the disease-influencing genes or substances to find drug candidates or drug targets.


SUMMARY OF THE INVENTION

These objects and advantages are attained by a system and method for identifying a disease-influencing gene or protein. The method includes the step of selecting individuals having a risk factor for a certain disease. Each of the individuals is provided with a remotely programmable apparatus having a user interface for communicating queries to the individuals and for receiving responses. Each apparatus also includes a communication device, such as a modem, for communicating with a server through a communication network.


Queries relating to the individuals' environment are entered into the server and transmitted from the server to each individual's remote apparatus. After the individuals' have responded to the queries, the responses are sent back to the server and organized into a database. Data mining software is then used to distinguish the individuals into groups based on their environmental profiles. After a period of time, each group is then further divided into categories based on their disease progression. The genomes of all the individuals are then sequenced. Data mining techniques are used to find gene differences between the categories.


According to a second method of the invention, the individuals are first separated into groups according to their disease progressions. Data mining techniques are then used to further distinguish each group into categories based on the individuals' environmental profiles. The genomes of all the individuals are then sequenced, and data mining techniques are used to find gene differences between the categories.


A third embodiment of the invention provides a method for identifying disease-influencing substances. The method includes the step of selecting individuals having a risk factor for a certain disease. Each of the individuals is provided with a remotely programmable apparatus having a user interface for communicating queries to the individuals and for receiving responses. Each apparatus also includes a communication device, such as a modem, for communicating with a server through a communication network.


Queries relating to the individuals' environment are entered into the server and transmitted from the server to each individual's remote apparatus. After the individuals' have responded to the queries, the responses are sent back to the server and organized into a database. The genomes of all the individuals are then sequenced. The individuals are placed into groups based on their gene sequences. Each group is then separated into categories based on the individuals' disease progression. Data mining techniques are then used to find a disease-influencing substance between the categories of individuals by using the individuals environmental profiles.


The disease-influencing gene or substance isolated using these methods is preferably used to develop drug candidates or drug targets. Additionally, the isolation of the disease-influencing gene is preferably used to identify a corresponding disease-influencing protein, which can also be used to develop drug candidates or drug targets.


The present invention also provides a database and data processing system for storing and analyzing environmental information about individuals. The database and data processing system comprise a server for storing queries and the individuals' responses to the queries. The system also includes at least one remotely programmable apparatuses having a user interface for communicating queries to the individuals and for receiving the responses. Each apparatus also includes a communication device, such as a modem, for communicating with the server through a communication network.


The system also includes genotyping means in communication with the server for determining the individuals' gene sequences and a data mining software program accessible to the server for analyzing the individuals' gene sequences and environmental profiles. In particular, the data mining program includes: means for analyzing the responses in order to group the individuals having a similar behavioral and environmental profile, a similar disease progression, and a similar genotype; means for analyzing the responses in order to group the individuals having a similar disease progression; means for analyzing the responses in order to group the individuals having a similar genotype; and means for identifying a disease-influencing gene or substance. Alternatively, the database can be used with other genomics or bioinformatics databases and systems if the information is to be manipulated in different ways.





DESCRIPTION OF THE FIGURES


FIG. 1 is a block diagram of a networked system according to a preferred embodiment of the invention.



FIG. 2 is a block diagram illustrating the interaction of the components of the system of FIG. 1.



FIG. 3 is a perspective view of a remotely programmable apparatus of the system of FIG. 1.



FIG. 4 is a block diagram illustrating the components of the apparatus of FIG. 3.



FIG. 5 is a script entry screen according to the preferred embodiment of the invention.



FIG. 6A is a listing of a sample script program according to the preferred embodiment of the invention.



FIG. 6B is a continuation of the listing of FIG. 6A.



FIG. 7 is a script assignment screen according to the preferred embodiment of the invention.



FIG. 8 is a sample query appearing on a display of the apparatus of FIG. 3.



FIG. 9 is a sample prompt appearing on the display of the apparatus of FIG. 3.



FIG. 10 is a sample report displayed on a workstation of the system of FIG. 1.



FIG. 11 is a flow chart illustrating the steps included in a monitoring application executed by the server of FIG. 1 according to the preferred embodiment of the invention.



FIG. 12 is a flow chart illustrating the steps included in the script program of FIGS. 6A-6B.



FIG. 13 is a sample data table of the present invention.



FIG. 14 is a sample completed data table of the present invention.



FIG. 15 is a flow chart illustrating a first method for identifying a gene according to the present invention.



FIG. 16 is a block diagram illustrating the method of FIG. 15.



FIG. 17 is a flow chart illustrating a second method for identifying a gene according to the present invention.



FIG. 18 is a block diagram illustrating the method of FIG. 17.



FIG. 19 is a flow chart illustrating a third method according to the present invention.



FIG. 20 is a block diagram illustrating the method of FIG. 19.





DETAILED DESCRIPTION

The invention presents a system and method for creating a database containing environmental information about an individual to be used in conjunction with the individual's gene sequences to find new drug targets and drug candidates. In a preferred embodiment of the invention, remote monitors are used to collect the environmental information. It is to be understood that environmental information includes all non-genetic information about an individual, such as disease progression, diet, lifestyle, and geographical location.


A preferred embodiment of the invention is illustrated in FIGS. 1-16. Referring to FIG. 1, a networked system includes a server 50 and a workstation 52 connected to server 50 through a communication network 58. Server 50 is also connected to a patient profile database 54 which stores environmental information about the individuals. Server 50 is further connected to a genotyping system 56 which is capable of sequencing individuals' genomes. Patient profile database 54 and genotyping system 56 are connected to server 50 through communication network 58.


Server 50 and patient profile database 54 are preferably world wide web servers. Server 50 and database 54 may comprise single stand-alone computers or multiple computers distributed throughout a network. Workstation 52 is preferably a personal computer, remote terminal, or web TV unit. Workstation 52 functions as a remote interface for entering in server 50 messages and queries to be communicated to the individuals.


Genotyping system 56 can be a laboratory capable of sequencing individuals' genomes, a gene sequencing chip such as the GeneChip by Affymetrix, or any other suitable genotyping system. Genotyping system 56 should be capable of transmitting information about the individuals' genomes to server 50. Communication network 58 connects workstation 52, patient profile database 54, and genotyping system 56 to server 50. Communication network 58 can be any suitable communication network, such as a telephone cable, the Internet, or cellular or wireless communication. Such communication networks are well known in the art.


The system also includes remotely programmable apparatuses 60 for monitoring individuals. Preferably, each remote apparatus 60 is used to monitor a respective one of the individuals. Alternatively, a multi-user apparatus may be used to monitor a plurality of individuals. Each remote apparatus is designed to interact with an individual in accordance with script programs received from server 50.


Each remote apparatus is in communication with server 50 through communication network 58, which is preferably the Internet. Alternatively, each remote apparatus may be placed in communication with the server via telephone cable, cellular communication, wireless communication, etc. For clarity of illustration, only two remote apparatuses are shown in FIG. 1. It is to be understood that the system may include any number of remote apparatuses for monitoring any number of individuals.


In the preferred embodiment, each individual to be monitored is also provided with a monitoring device 64. Monitoring device 64 is designed to produce measurements of a physiological condition of the individual, record the measurements, and transmit the measurements to the individual's remote apparatus 60 through a standard connection cable 62. Examples of suitable monitoring devices include blood glucose meters, respiratory flow meters, blood pressure cuffs, electronic weight scales, and pulse rate monitors. Such monitoring devices are well known in the art.


The specific type of monitoring device provided to each individual is dependent upon the individual's disease. For example, diabetes patients are provided with blood glucose meters for measuring blood glucose concentrations, asthma patients are provided with respiratory flow meters for measuring peak flow rates, obesity patients are provided with weight scales, etc.



FIG. 2 shows server 50, workstation 52, and remote apparatus 60 in greater detail. Server 50 includes a database 66 for storing script programs 68. The script programs 68 are executed by each remote apparatus 60 to communicate queries and messages to an individual, receive responses 70 to the queries, collect monitoring device measurements 72, and transmit responses 70 and measurements 72 to server 50. Database 66 is designed to store the responses 70 and measurements 72. Database 66 further includes a look-up table 74. Table 74 contains a list of the individuals to be monitored, and for each individual, a unique individual identification code and a respective pointer to script program 68 assigned to the individual. Each remote apparatus 60 is designed to execute the assigned script program which it receives from server 50.



FIGS. 3-4 show the structure of remote apparatus 60 according to the preferred embodiment. Referring to FIG. 3, remote apparatus 60 includes a housing 90. Housing 90 is preferably sufficiently compact to enable the remote apparatus to be hand-held and carried by an individual. Remote apparatus 60 also includes a user interface for communicating queries to the individual and for receiving responses to the queries.


In the preferred embodiment, the user interface includes a display 92 and four user input buttons 98A, 98B, 98C, and 98D. Display 92 displays queries and prompts to the individual, and is preferably a liquid crystal display (LCD). The user input buttons 98A, 98B, 98C, and 98D are for entering responses to the queries and prompts. The user input buttons are preferably momentary contact push buttons. Although the user interface of the preferred embodiment includes a display and input buttons, it will be apparent to one skilled in the art of electronic devices that any suitable user interface may be used in remote apparatus 60. For example, the user input buttons may be replaced by switches, keys, a touch sensitive display screen, or any other data input device. Alternatively, the display and input buttons may be replaced by a speech synthesis/speech recognition interface.


Three monitoring device jacks 96A, 96B, and 96C are located on a surface of housing 90. Device jacks 96A, 96B, and 96C are for connecting remote apparatus 60 to a number of monitoring devices, such as blood glucose meters, respiratory flow meters, or blood pressure cuffs, through standard connection cables (not shown). Remote apparatus 60 also includes a modem jack 94 for connecting remote apparatus 60 to a telephone jack through a standard connection cord (not shown). Remote apparatus 60 further includes a visual indicator, such as a light emitting diode (LED) 100. LED 100 is for visually notifying the individual that he or she has unanswered queries stored in remote apparatus 60.



FIG. 4 is a schematic block diagram illustrating the components of remote apparatus 60 in greater detail. Remote apparatus 60 includes a microprocessor 102 and a memory 108 connected to microprocessor 102. Memory 108 is preferably a non-volatile memory, such as a serial EEPROM. Memory 108 stores script programs received from the server, measurements received from monitoring device 64, responses to queries, and the individual's unique identification code. Microprocessor 102 also includes built-in read-only memory (ROM) which stores firmware for controlling the operation of remote apparatus 60. The firmware includes a script interpreter used by microprocessor 102 to execute the script programs. The script interpreter interprets script commands which are executed by microprocessor 102. Specific techniques for interpreting and executing script programs in this manner are well known in the art.


Microprocessor 102 is preferably connected to memory 108 using a standard two-wire I2C interface. Microprocessor 102 is also connected to user input buttons 98A, 98B, 98C, and 98D, LED 100, a clock 112, and a display driver 110. Clock 112 indicates the current date and time to microprocessor 102. For clarity of illustration, clock 112 is shown as a separate component, but is preferably built into microprocessor 102. Display driver 110 operates under the control of microprocessor 102 to display information on display 92. Microprocessor 102 is preferably a PIC 16C65 processor which includes a universal asynchronous receiver transmitter (UART) 104. UART 104 is for communicating with a modem 114 and a device interface 118. A CMOS switch 116 under the control of microprocessor 102 alternately connects modem 114 and interface 118 to UART 116.


Modem 114 is connected to a telephone jack 119 through modem jack 94. Modem 114 is for exchanging data with the server through the communication network. The data includes script programs which are received from the server as well as responses to queries, device measurements, script identification codes, and the individual's unique identification code which modem 114 transmits to server 50. Modem 114 is preferably a complete 28.8 K modem commercially available from Cermetek, although any suitable modem may be used.


Device interface 118 is connected to device jacks 96A, 96B, and 96C. Device interface 118 is for interfacing with a number of monitoring devices, such as blood glucose meters, respiratory flow meters, blood pressure cuffs, weight scales, or pulse rate monitors, through device jacks 96A, 96B, and 96C. Device interface 118 operates under the control of microprocessor 102 to collect measurements 72 from monitoring devices 64 and to output measurements 72 to microprocessor 102 for storage in memory 108. In the preferred embodiment, interface 118 is a standard RS232 interface. For simplicity of illustration, only one device interface 118 is shown in FIG. 4. However, in alternative embodiments, remote apparatus 60 may include multiple device interfaces to accommodate monitoring devices which have different connection standards.


Referring again to FIG. 2, server 50 includes a monitoring application 76. Monitoring application 76 is a controlling software application executed by server 50 to perform the various functions described below. Monitoring application 76 includes a script generator 78, a script assignor 80, and a report generator 82. Script generator 78 is designed to generate script programs 68 from script information entered through workstation 52. The script information is entered through a script entry screen 84. In the preferred embodiment, script entry screen 84 is implemented as a web page on the server 50. Workstation 52 includes a web browser for accessing the web page to enter the script information.



FIG. 5 illustrates script entry screen 84 as it appears on workstation 52. Script entry screen 84 includes a script name field 120 for specifying the name of script program to be generated. Screen 84 also includes entry fields 122 for entering a set of queries to be answered by an individual. Each entry field 122 has corresponding response choice fields 124 for entering response choices for the query. Screen 84 further includes check boxes 126 for selecting a desired monitoring device type from which to collect measurements, such as a blood glucose meter, respiratory flow meter, or blood pressure cuff.


Screen 84 additionally includes a connection time field 128 for specifying a prescribed connection time at which each remote apparatus executing the script program is to establish a subsequent communication link to the server. The connection time is preferably selected to be the time at which communication rates are the lowest, such as 3:00 AM. Screen 84 also includes a CREATE SCRIPT button 130 for instructing the script generator to generate a script program from the information entered in screen 84. Screen 84 further includes a CANCEL button 132 for canceling the information entered.


In the preferred embodiment, each script program created by the script generator 82 conforms to the standard file format used on UNIX systems. In the standard file format, each command is listed in the upper case and followed by a colon. Every line in the script program is terminated by a linefeed character {LF}, and only one command is placed on each line. The last character in the script program is a UNIX end of file character {EOF}. TABLE 1 shows an exemplary listing of script commands used in the preferred embodiment of the invention.









TABLE 1







SCRIPT COMMANDS








Command
Description





CLS: {LF}
Clear the display.


ZAP: {LF}
Erase from memory the last set of query responses



recorded.


LED: b{LF}
Turn the LED on or off, where b is a binary digit of



0 or 1. An argument of 1 turns on the LED, and an



argument of 0 turns off the LED.


DISPLAY:
Display the text following the DISPLAY command.


{chars}{LF}



INPUT:
Record a button press. The m's represent a button


mmmm{LF}
mask pattern for each of the four input buttons. Each



m contains an “X” for disallowed buttons or an “O”



for allowed buttons. For example, INPUT: OXOX{LF}



allows the user to press either button #1 or #3.


WAIT: {LF}
Wait for any one button to be pressed, then continue



executing the script program.


COLLECT:
Collect measurements from the monitoring device


device{LF}
specified in the COLLECT command. The user is



preferably prompted to connect the specified monitor-



ing device to the apparatus and press a button to



continue.


NUMBER:
Assign a script identification code to the script program.


aaaa{LF}
The script identification code from the most recently



executed NUMBER statement is subsequently transmit-



ted to the server along with the query responses and



device measurements. The script identification code



identifies to the server which script program was most



recently executed by the remote apparatus.


DELAY: t{LF}
Wait until time t specified in the DELAY command,



usually the prescribed connection time.


CONNECT:
Perform a connection routine to establish a communi-


{LF}
cation link to the server, transmit the patient identifi-



cation code, query responses, device measurements,



and script identification code to the server, and receive



and store a new script program. When the server in-



structs the apparatus to disconnect, the script interpreter



is restarted, allowing the new script program to execute.









The script commands illustrated in TABLE 1 are representative of the preferred embodiment and are not intended to limit the scope of the invention. After consideration of the ensuing description, it will be apparent to one skilled in the art many other suitable scripting languages and sets of script commands may be used to implement the system and method of the invention.


Script generator 78 preferably stores a script program template which it uses to create each script program. To generate a script program, script generator 78 inserts into the template the script information entered in script entry screen 84. For example, FIGS. 6A-6B illustrate a sample script program created by the script generator from the script information shown in FIG. 5.


The script program includes display commands to display the queries and response choices entered in fields 122 and 124, respectively. The script program also includes input commands to receive responses to the queries. The script program further includes a collect command to collect device measurements from the monitoring device specified in check boxes 126. The script program also includes commands to establish a subsequent communication link to the server at the connection time specified in field 128. The steps included in the sample script program are also shown in the flow chart of FIGS. 12A-12B and will be discussed in the operation section below.


Referring again to FIG. 2, script assignor 80 is for assigning the script programs 68 to the individuals. The script programs are assigned in accordance with script assignment information entered through workstation 52. The script assignment information is entered through a script assignment screen 86, which is preferably implemented as a web page on server 50.



FIG. 7 shows a sample script assignment screen 86 as it appears on the workstation. Screen 86 includes check boxes 134 for selecting the script program to be assigned and check boxes 136 for selecting the individuals to whom the script program is to be assigned. Screen 86 also includes an ASSIGN SCRIPT button 140 for entering the assignments. When button 140 is pressed, the script assignor creates and stores for each individual selected in check boxes 136 a respective pointer to the script program selected in check boxes 134. Each pointer is stored in the look-up table 74 of database 66. Screen 86 further includes an ADD SCRIPT button 138 for accessing the script entry screen and a DELETE SCRIPT button 142 for a deleting script program.


Referring again to FIG. 2, report generator 82 is designed to generate a report 88 from the responses 70 and device measurements 72 received in server 50. Report 88 is displayed on workstation 52. FIG. 10 shows a sample patient report 88 produced by report generator 82 for a selected individual. Report 88 includes a graph 146 of the device measurements received from the individual, as well as a listing of the query responses received from the individual. Specific techniques for writing a report generator program to display data in this manner are well known in the software art.


The operation of the preferred embodiment is illustrated in FIGS. 1-12. FIG. 11 is a flow chart illustrating steps included in the monitoring application executed by server 50. In step 202, the server determines if new script information has been entered through script entry screen 84. If new script information has not been entered, the server proceeds to step 206. If new script information has been entered, the server proceeds to step 204.


As shown in FIG. 5, the script information includes a set of queries, and for each of the queries, corresponding responses choices. The script information also includes a selected monitoring device type from which to collect measurements. The script information further includes a prescribed connection time for each remote apparatus to establish a subsequent communication link to the server. The script information is generally entered in the server by a healthcare provider, such as the individuals' physician or case manager. Of course, any person desiring to communicate with the individual may also be granted access to the server to create and assign script programs. Further, it is to be understood that the system may include any number of workstations for entering script generation and script assignment information into the server.


In step 204, script generator 78 generates a script program from the information entered in screen 84. The script program is stored in database 66. Steps 202 and 204 are preferably repeated to generate multiple script programs, e.g. a script program for diabetes patients, a script program for asthma patients, etc. Each script program corresponds to a respective one of the sets of queries entered through script entry screen 84. Following step 204, the server proceeds to step 206.


In step 206, the server determines if new script assignment information has been entered through script assignment screen 86. If new script assignment information has not been entered, the server proceeds to step 210. If new script assignment information has been entered, the server proceeds to step 208. As shown in FIG. 7, script programs are assigned to each individual by selecting a script program through check boxes 134, selecting the individuals to whom selected the script program is to be assigned through check boxes 136, and pressing the ASSIGN SCRIPT button 140. When button 140 is pressed, script assignor 86 creates for each individual selected in check boxes 136 a respective pointer to the script program selected in check boxes 134. In step 208, each pointer is stored in look-up table 74 of database 66. Following step 208, the server proceeds to step 210.


In step 210, the server determines if any of the remote apparatuses are remotely connected to the server. Each individual to be monitored is preferably provided with his or her own remote apparatus which has the individual's unique identification code stored therein. Each individual is thus uniquely associated with a respective one of the remote apparatuses. If none of remote apparatuses are connected, the server proceeds to step 220.


If a remote apparatus is connected, the server receives from the apparatus the individual's unique identification code in step 212. In step 214, the server receives from the apparatus the query responses, device measurements, and script identification code recorded during execution of a previously assigned script program. The script identification code identifies to the server which script program was executed by the remote apparatus to record the query responses and device measurements. The responses, device measurements, and script identification code are stored in database 66.


In step 216, the server uses the individual's unique identification code to retrieve from look-up table 74 the pointer to the script program assigned to the individual. The server then retrieves the assigned script program from the database 66. In step 218, the server transmits the assigned script program to the individual's remote apparatus through the communication network 58. Following step 218, the server proceeds to step 220.


In step 220, the server determines if a report request has been received from workstation 52. If no report request has been received, the server returns to step 202. If a report request has been received for a selected individual, the server retrieves from database 66 the query responses and measurements last received from the individual, step 222. In step 224, the server generates and displays the report 88 on workstation 52.


As shown in FIG. 10, the report includes the query responses and device measurements last received from the individual. Following step 224, the server returns to step 202.



FIG. 12 illustrates the steps included in a sample script program executed by the remote apparatus. Before the script program is received, the remote apparatus is initially programmed with the individual's unique identification code and the script interpreter used by microprocessor 102 to execute script programs. The initial programming may be achieved during manufacture or during an initial connection to the server. Following initial programming, the remote apparatus receives from the server the script program assigned to the individual associated with the apparatus. The script program is received by modem 114 through a first communication link to the server and stored in memory 108.


In step 302, microprocessor 102 assigns a script identification code to the script program and stores the script identification code in memory 108. The script identification code is subsequently transmitted to the server along with query responses and device measurements to identify to the server which script program was most recently executed by the remote apparatus. In step 304, microprocessor 102 lights LED 100 to notify the individual that he or she has unanswered queries stored in the remote apparatus. LED 100 preferably remains lit until the queries are answered by the individual. In step 306, microprocessor 102 erases from memory 108 the last set of query responses recorded.


In step 308, microprocessor 102 prompts the individual by displaying on display 92 “ANSWER QUERIES NOW? PRESS ANY BUTTON TO START”. In step 310, microprocessor 102 waits until a reply to the prompt is received from the individual. When a reply is received, microprocessor 102 proceeds to step 312. In step 312, microprocessor 102 executes successive display and input commands to display the queries and response choices on display 92 and to receive responses to the queries.



FIG. 8 illustrate a sample query and its corresponding response choices as they appear on display 92. The response choices are preferably positioned on display 92 such that each response choice is located proximate a respective one of the user input buttons 98A, 98B, 98C, and 98D. In the preferred embodiment, each response choice is displayed immediately above a respective user input button. The individual presses the button corresponding to his or her response, and microprocessor 102 stores the response in memory 108.


In steps 314 to 318, microprocessor 102 executes commands to collect device measurements from a selected monitoring device specified in the script program. In step 314, microprocessor 102 prompts the individual to connect the selected device to one of the device jacks 96A, 96B, or 96C. A sample prompt is shown in FIG. 9. In step 316, microprocessor 102 waits until a reply to the prompt is received from the individual. When a reply is received, microprocessor 102 proceeds to step 318. Microprocessor 102 also connects UART 104 to device interface 118 through CMOS switch 116. In step 318, microprocessor 102 collects device measurements from the selected device through device interface 118. The device measurements are stored in memory 108.


In step 320, microprocessor 102 prompts the individual to connect remote apparatus 60 to telephone jack 119 so that the apparatus may connect to the server at the prescribed connection time. In step 322, microprocessor 102 waits until a reply to the prompt is received from the individual. When a reply is received, microprocessor 102 turns off LED 100 in step 324. In step 326, microprocessor 102 waits until it is time to connect to the server. Microprocessor 102 compares the connection time specified in the script program to the current time output by clock 112. When it is time to connect, microprocessor 102 connects UART 104 to modem 114 through CMOS switch 116.


In step 328, microprocessor 102 establishes a subsequent communication link between remote apparatus 60 and server 50 through modem 114 and communication network 58. If the connection fails for any reason, microprocessor 102 repeats step 328 to get a successful connection. In step 330, microprocessor 102 transmits the query responses, device measurements, script identification code, and the individual's unique identification code stored in memory 108 to the server. In step 332, microprocessor 102 receives through modem 114 a newly assigned script program from the server. The new script program is stored in memory 108 for subsequent execution by microprocessor 102. Following step 332, the script program ends.


After the individual's information has been collected via remote apparatus 60 and the script programs, the data is mined to distinguish patterns. Data mining programs are well known in the art and can be easily adapted to this system. In the preferred embodiment, the data mining program includes a data table 150, as shown in FIG. 13. Data table 150 is stored on the server and has an individual identification number field 151, name fields 152, value fields 154 corresponding to the name fields, and explanation fields 156 corresponding to the name fields and value fields. The data type is entered into name fields 152, the possible numerical values corresponding to the data type are entered into value fields 154, and brief explanations of the data types and corresponding values are entered into explanation fields 156.


The individuals' device measurements and responses to the queries are entered into data table 150 in the form of numerical values in value fields 154. The individual's identification number is entered into individual identification number field 151. An example of data table 150 in which the individuals' information has been entered is shown in FIG. 14. Once data table 150 contains all the necessary information, the data mining program then compares the information.



FIG. 15 is a flowchart illustrating a first method of the present invention carried out by the server using the data mining techniques described above. In step 400, individuals having a risk factor for a disease are selected. In step 402, these individuals are queried about their behavior and environment using the script programs and remote apparatuses previously described. The responses to the queries and any device measurements are received and stored by the server. Collection of the responses and device measurements can occur over any period of time, thus allowing for more accurate data.


After the server receives the responses and measurements, a database comprising the individuals' behavioral and environmental profiles is created in step 404. In step 406, data mining techniques are used to group individuals having similar behavioral and environmental profiles. In step 408, the server determines if it is necessary to further group the individuals in order to produce smaller groups. Steps 406 and 408 can be repeated as often as necessary.


In step 410, each group of individuals is categorized using data mining techniques. The individuals are categorized according to their disease progressions. For example, a group of individuals can be categorized into those that have a severe disease phenotype, those that have a moderate disease phenotype, and those that have a mild disease phenotype. In step 412, the server determines if it is necessary to further categorize the individuals. Steps 410 and 412 can be repeated as often as necessary.


In step 414, the genomes of all the individuals are sequenced by genotyping system 56. The genotypes of all the individuals are transmitted to server 50. In step 416, data mining techniques are used to compare the genotypes of the individuals between the categories. For example, if those individuals who have a severe disease phenotype and are overweight have a certain gene sequence, while those individuals who have a mild disease phenotype and are overweight do not, it is likely the gene sequence is responsible for the severe disease phenotype. If a gene sequence is found, it is further identified in step 418. Methods of isolating and identifying gene sequences are well known in the field.



FIG. 16 is a block diagram illustrating an example of the first method of the present invention as described in FIG. 15. First individuals having a risk factor for a certain disease, such as non-insulin dependent diabetes mellitus (NIDDM), are selected, as indicated at block 422. Behavioral and environmental information from each individual is collected using the script programs and remote apparatuses. Using data mining techniques, the individuals are then grouped into overweight individuals 424 and non-overweight individuals 426. Using data mining techniques, the individuals are then categorized into overweight individuals having severe NIDDM 428, overweight individuals having mild NIDDM 430, non-overweight individuals having mild NIDDM 432, and non-overweight individuals having severe NIDDM 434.


The individuals' genotype information is then taken, as indicated at block 436, to determine the individuals' gene sequences. For example, overweight individuals with severe NIDDM have gene sequence A, overweight individuals with mild NIDDM have gene sequence B, non-overweight individuals with mild NIDDM have gene sequence B, and non-overweight individuals with severe NIDDM have gene sequence A. Data mining techniques are then used to analyze the information and come to a conclusion. In this example, data mining would conclude that the severe NIDDM phenotype is likely related to gene sequence A, not the individual's weight.



FIG. 17 shows a flowchart illustrating a second method of the present invention carried out by the server using the data mining techniques described above. In step 500, individuals having a risk factor for a disease are selected. In step 502, these individuals are queried about their behavior and environment using the script programs and remote apparatuses previously described. The responses to the queries and any device measurements are received and stored by the server.


After the server receives the responses and measurements from the remote apparatuses, a database comprising the individuals' behavioral and environmental profiles is created in step 504. In step 506, data mining techniques are used to group together individuals having similar disease progressions. For example, a group of individuals can be grouped into those that have a severe disease phenotype, those that have a moderate disease phenotype, and those that have a mild disease phenotype. In step 508, the server determines if it is necessary to further group the individuals in order to produce smaller groups. Steps 506 and 508 can be repeated as often as necessary.


In step 510, each group of individuals created in steps 506 and 508 is categorized using data mining techniques according to the behavioral and environmental profiles of the individuals. In step 512, the server determines if it is necessary to further group the individuals in order to produce smaller groups. Steps 510 and 512 can be repeated as often as necessary.


In step 514, the genomes of all the individuals are sequenced by genotyping system 56. The genotypes of all the individuals are transmitted to the server. In step 516, data mining techniques are used to compare the genotypes of the individuals between the categories. For example, if those individuals who have a severe disease phenotype and are overweight have a certain gene sequence, while those individuals who have a mild disease and are also overweight phenotype do not, it is likely the gene sequence, not weight, is responsible for the severe disease phenotype. If a gene sequence is found, it is further identified in step 518. Specific techniques of isolating and identifying gene sequences are well known in the field.



FIG. 18 is a block diagram illustrating an example of the second method of the present invention as described in FIG. 17. First individuals having a risk factor for a certain disease, such as NIDDM, are chosen, as indicated at block 522. Behavioral and environmental information from each individual is collected using the remote apparatuses and script programs. Using data mining techniques, the individuals are then grouped into those exhibiting severe NIDDM 524 and those exhibiting mild NIDDM 526. Using data mining techniques, the individuals are then categorized into overweight individuals having severe NIDDM 528, non-overweight individuals having severe NIDDM 530, non-overweight individuals having mild NIDDM 532, and overweight individuals having mild NIDDM 534.


The individuals' genotype information is then taken, as indicated at block 536, to determine the individuals' gene sequences. For example, individuals with severe NIDDM who are overweight have gene sequence A, individuals with severe NIDDM who are non-overweight have gene sequence A, individuals with mild NIDDM who are non-overweight have gene sequence B, and individuals with severe NIDDM who are overweight have gene sequence B. Data mining techniques are then used to analyze the information and come to a conclusion. In this example, data mining would conclude that the severe NIDDM phenotype is likely related to gene sequence A, not the individual's weight.



FIG. 19 shows a flowchart illustrating a preferred method carried out by server 50 to identify a disease-identifying substance. In step 600, individuals having a risk factor for a disease are selected. In step 602, these individuals are queried about their behavior and environment using the script programs and remote apparatuses previously described. The responses to the queries and any device measurements are received and stored by the server.


After the server receives the responses and measurements from the remote apparatuses, a database comprising the individuals' behavioral and environmental profiles is created in step 604. In step 606, the genomes of all the individuals are sequenced, and the genotypes of all the individuals are transmitted to the server. In step 608, individuals having the same or close genotypes are grouped together. In step 610, data mining techniques are used to categorize together individuals having similar disease progressions. In step 612, the server determines if it is necessary to further categorize the individuals in order to produce smaller groups. Steps 610 and 612 can be repeated as often as necessary.


In step 614, data mining techniques are used to find a disease-influencing substance between the categories of individuals by using the individuals behavioral and environmental profiles. For example, if those individuals who have a severe disease phenotype are overweight, while those individuals who have a mild disease phenotype are not, it is likely weight is responsible for the severe disease phenotype. If such a disease-influencing substance is found, it is identified in step 618. If no disease-influencing substance is found, the process is preferably repeated.



FIG. 20 is a block diagram illustrating an example of the method described in FIG. 19. First, individuals having a risk factor for a certain disease, such as NIDDM, are chosen, as indicated at block 620. Behavioral and environmental information from each individual is collected using the remote apparatuses and script programs. The individuals' genotype information is then taken, as indicated at block 622, to determine the individuals' gene sequences. The individuals are then grouped according to their gene sequences. For example, one group may have gene sequence A, as indicated at block 624, while another group may have gene sequence B, as indicated at block 626. Using data mining techniques, the individuals are then categorized into individuals with gene sequence A having severe NIDDM 628, individuals with gene sequence A having mild NIDDM 630, individuals with gene sequence B having mild NIDDM 632, and individuals with gene sequence B having severe NIDDM 634.


Data mining techniques are further used to analyze the categories of individuals and their behavioral and environmental profiles. For example, overweight individuals 638 with severe NIDDM have gene sequence A, non-overweight individuals 640 with mild NIDDM have gene sequence A, overweight individuals 642 with mild NIDDM have gene sequence B, and non-overweight individuals 644 with severe NIDDM have gene sequence B. Data mining techniques are then used to analyze the information and come to a conclusion. In this example, data mining would conclude that the severe NIDDM phenotype is likely related to gene sequence A, not the individual's weight.


SUMMARY, RAMIFICATIONS, AND SCOPE

Although the above description contains many specificities, these should not be construed as limitations on the scope of the invention but merely as illustrations of some of the presently preferred embodiments. Many other embodiments of the invention are possible. For example, the scripting language and script commands shown are representative of the preferred embodiment. It will be apparent to one skilled in the art that many other scripting languages and specific script commands may be used to implement the invention.


Moreover, the invention is not limited to the specific applications described. The system and method of the invention have many other applications. For example, pharmaceutical manufacturers may apply the system in clinical trials to analyze new drug data.


Therefore, the scope of the invention should be determined by the appended claims and their legal equivalents.

Claims
  • 1. A method for generating groups of individuals useful in researching influence of a disease on said individuals, comprising: selecting individuals having a risk factor for a disease;providing to each individual a communications apparatus;transmitting a computer program containing queries and predefined response choices to said communications apparatus, wherein said computer program when executed causes said communications apparatus to present said queries and predefined response choices to each individual via a display of said communications apparatus and collect responses to said queries, including at least one of the predefined response choices presented on the display of the communications apparatus, from each individual via user input buttons of the communications apparatus;receiving said responses to the queries from the individuals through the communications apparatus, said responses communicating information about the individuals;storing the responses of each individual in a database;defining a plurality of groups by categorizing the individuals having similar profiles based on the responses, wherein categorizing the individuals into groups includes one or more phenotypic classifications;after defining said groups, receiving genotype information representative of individuals in each of said groups;comparing said genotype information between said groups; andgenerating a report for presentation on a display that represents a subset of said genotype information associated with each of said groups, wherein differences in said genotype information between said groups is expressed in terms of phenotypic classifications.
  • 2. The method of claim 1, wherein the queries are inserted into said computer program with a script generator and assigned to an individual using a script assignor.
  • 3. The method of claim 1, wherein categorizing the individuals into groups includes one of the phenotypic classifications from the set of behavioral, environmental, and disease progression.
  • 4. The method of claim 1, wherein the communications apparatus is connectable with a monitoring device configured to acquire physiologic data.
  • 5. The method of claim 4, wherein the monitoring device includes one of the set consisting of a blood glucose meter, a respiratory flow meter, a blood pressure cuff, a weight scale, and a pulse rate monitor.
  • 6. A system for generating groups of individuals useful in researching influence of a disease on said individuals, comprising: a communications apparatus operable by an individual; anda communication network in signal communication with the communications apparatus and a server, a workstation configured to send scripted queries and predefined response choices, a genotyping system configured to provide genotype information representative of the individual, and a patient profile system configured to receive responses from the individual and genotype information analyses via the communications network and the server, wherein the server transmits a computer program containing the scripted queries and predefined response choices to the communications apparatus, the computer program when executed causes the communications apparatus to present the scripted queries and predefined response choices to the individual via a display of said communications apparatus and collect responses to the queries containing information about the individual and at least one of the predefined response choices presented on the display of said communications apparatus,whereby the genotype information is compared based upon groups formed by categorizing individuals having a risk factor for a disease using the responses to the scripted queries in the patient profile system to identify one or more individuals having similar profiles, wherein categorizing the individuals into groups includes one or more phenotypic classifications, and differences in said genotype information between said groups is expressed in terms of phenotypic classifications.
  • 7. The system of claim 6, wherein the responses from the individual are used to categorize the individual into one or more groups and the one or more groups are compared with the genotype information of the individual to categorize said genotype information according to disease progression.
  • 8. The system of claim 7, wherein the disease progression includes non-insulin dependent diabetes.
  • 9. A system for identifying groups of individuals useful in researching influence of disease on said individuals, comprising: at least one communications apparatus in signal communication with a monitoring device configured to measure physiologic and environmental conditions, the communications apparatus and monitoring device being operable by at least one individual; anda communication network in signal communications with each communications apparatus and a server, a workstation configured to send scripted queries and predefined response choices, a genotyping system configured to provide genotype information representative of the at least one individual, and a patient profile system configured to receive responses and measurements from the at least one individual and genotype information analyses via the communications network and the server, wherein the server transmits a computer program containing the scripted queries and predefined response choices to the communications apparatus the computer program when executed causes the communications apparatus to present the scripted queries and predefined response choices to the individual via a display of the communications apparatus and collect responses to the scripted queries containing information about the individual and at least one of the predefined response choices presented on the display of said communications apparatus,whereby the genotype information representative of the at least one individual is compared based upon groups formed by categorizing individuals having a risk factor for a disease using the responses and measurements to the scripted queries in the patient profile system to identify one or more individuals having similar profiles, wherein categorizing the individuals into groups includes one or more phenotypic classifications, and differences in said genotype information between said groups is expressed in terms of phenotypic classifications.
  • 10. The system of claim 9, wherein the monitoring device includes one of the set consisting of a blood glucose meter, a respiratory flow meter, a blood pressure cuff, a weight scale, and a pulse rate monitor.
  • 11. The system of claim 9, wherein the responses and measurements from each individual are used to categorized each individual with one or more groups and the groups are compared with the genotype information representative of each individual to categorize the genotype information according to disease progression of each individual in the one or more groups based on the responses and measurements sent by each individual.
  • 12. The system of claim 11, wherein the disease progression includes non-insulin dependent diabetes.
  • 13. The system of claim 6, wherein categorizing the individuals into groups includes one of the phenotypic classifications from the set of behavioral, environmental, and disease progression.
  • 14. The system of claim 9, wherein categorizing the individuals into groups includes one of the phenotypic classifications from the set of behavioral, environmental, and disease progression.
  • 15. The method of claim 1, wherein the responses to the queries from the individuals communicate environmental information about the individuals.
  • 16. The method of claim 15, wherein the environmental information comprises one or more of non-genetic information about an individual, information about disease progression, information about diet, information about lifestyle, and information about geographical location.
  • 17. The method of claim 1, wherein the scripted queries contained in said computer program are related to one or both of behavior and environment of each individual.
RELATED APPLICATION INFORMATION

This application is a continuation of application Ser. No. 09/041,809 filed Mar. 13, 1998, now abandoned, which is a continuation-in-part of application Ser. No. 08/946,341 filed Oct. 7, 1997, now U.S. Pat. No. 5,997,476, which is a continuation-in-part of application Ser. No. 08/847,009 filed Apr. 30, 1997, now U.S. Pat. No. 5,897,493, which claims priority to provisional application Ser. No. 60/041,746 filed on Mar. 28, 1997 and provisional application Ser. No. 60/041,751 filed on Mar. 28, 1997. This application is also a continuation-in-part of application Ser. No. 09/378,188 filed on Aug. 20, 1999, now abandoned, which is a continuation of application Ser. No. 08/850,840 filed on May 3, 1997 now U.S. Pat. No. 5,985,559, which is a continuation of application Ser. No. 08/847,009 filed Apr. 30, 1997, now U.S. Pat. No. 5,897,493, which claims priority to provisional application Ser. No. 60/041,746 filed on Mar. 28, 1997 and provisional application Ser. No. 60/041,751 filed on Mar. 28, 1997. All applications, except to the extent, if any, such incorporation would constitute new matter, are herein incorporated by reference in their entirety as if fully set forth herein.

US Referenced Citations (388)
Number Name Date Kind
3426150 Tygart Feb 1969 A
3566365 Rawson et al. Feb 1971 A
3566370 Worthington, Jr. et al. Feb 1971 A
3581072 Nymeyer May 1971 A
3768014 Smith Oct 1973 A
3811116 Takeuchi et al. May 1974 A
3883235 Lynn et al. May 1975 A
3910257 Fletcher et al. Oct 1975 A
3920005 Gombrich et al. Nov 1975 A
3996928 Marx Dec 1976 A
4004577 Sarnoff Jan 1977 A
4051522 Healy et al. Sep 1977 A
4060915 Conway Dec 1977 A
4130881 Haessler et al. Dec 1978 A
4150284 Trenkler et al. Apr 1979 A
4151407 McBride et al. Apr 1979 A
4151831 Lester May 1979 A
4173971 Karz Nov 1979 A
4216462 McGrath et al. Aug 1980 A
4227526 Goss Oct 1980 A
4253521 Savage Mar 1981 A
4259548 Fahey et al. Mar 1981 A
4270547 Steffen et al. Jun 1981 A
4296756 Dunning et al. Oct 1981 A
4347568 Giguere et al. Aug 1982 A
4347851 Jundanian Sep 1982 A
4360345 Hon Nov 1982 A
4412287 Braddock, III Oct 1983 A
4417306 Citron et al. Nov 1983 A
4422081 Woods Dec 1983 A
4428733 Kumar-Misir Jan 1984 A
4449536 Weaver May 1984 A
4465077 Schneider Aug 1984 A
4473884 Behl Sep 1984 A
4518361 Conway May 1985 A
4519398 Lisiecki et al. May 1985 A
4531527 Reinhold, Jr. et al. Jul 1985 A
4546436 Schneider et al. Oct 1985 A
4566461 Lubell et al. Jan 1986 A
4576578 Parker et al. Mar 1986 A
4592546 Fascenda et al. Jun 1986 A
4627445 Garcia Dec 1986 A
4674652 Aten et al. Jun 1987 A
4686624 Blum et al. Aug 1987 A
4694490 Harvey et al. Sep 1987 A
4695954 Rose et al. Sep 1987 A
4712562 Ohayon et al. Dec 1987 A
4722349 Baumberg Feb 1988 A
4729381 Harada et al. Mar 1988 A
4730253 Gordon Mar 1988 A
4731726 Allen, III Mar 1988 A
4738451 Logg Apr 1988 A
4768229 Benjamin et al. Aug 1988 A
4779199 Yoneda et al. Oct 1988 A
4782511 Nemec et al. Nov 1988 A
4789928 Fujisaki Dec 1988 A
4796639 Snow et al. Jan 1989 A
4799156 Shavit et al. Jan 1989 A
4799199 Scales, III et al. Jan 1989 A
4803625 Fu et al. Feb 1989 A
4835372 Gombrich et al. May 1989 A
4838275 Lee Jun 1989 A
4846797 Howson et al. Jul 1989 A
4853521 Claeys et al. Aug 1989 A
4858354 Gettler Aug 1989 A
4858617 Sanders Aug 1989 A
4890621 Hakky Jan 1990 A
4894777 Negishi et al. Jan 1990 A
4897869 Takahashi Jan 1990 A
4899839 Dessertine et al. Feb 1990 A
4903201 Wagner Feb 1990 A
4907973 Hon Mar 1990 A
4916441 Gombrich Apr 1990 A
4931934 Snyder Jun 1990 A
4933873 Kaufman et al. Jun 1990 A
4933876 Markoff et al. Jun 1990 A
4950246 Muller Aug 1990 A
4950264 Osborn, III Aug 1990 A
4953552 DeMarzo Sep 1990 A
4958632 Duggan Sep 1990 A
4958641 Digby et al. Sep 1990 A
4967756 Hewitt Nov 1990 A
4977899 Digby et al. Dec 1990 A
4978303 Lampbell Dec 1990 A
4978335 Arthur, III Dec 1990 A
4979509 Hakky Dec 1990 A
5007429 Treatch et al. Apr 1991 A
5009645 Silver et al. Apr 1991 A
5016172 Dessertine May 1991 A
5019974 Beckers May 1991 A
5024225 Fang Jun 1991 A
5025374 Roizen et al. Jun 1991 A
5034807 Von Kohorn Jul 1991 A
5035625 Munson et al. Jul 1991 A
5036462 Kaufman et al. Jul 1991 A
5049487 Phillips et al. Sep 1991 A
5050612 Matsumura Sep 1991 A
5056059 Tivig et al. Oct 1991 A
5059394 Phillips et al. Oct 1991 A
5065315 Garcia Nov 1991 A
5068536 Rosenthal Nov 1991 A
5074317 Bondell et al. Dec 1991 A
5077476 Rosenthal Dec 1991 A
5077665 Silverman et al. Dec 1991 A
5095798 Okada et al. Mar 1992 A
5104380 Holman et al. Apr 1992 A
5109414 Harvey et al. Apr 1992 A
5109974 Beer et al. May 1992 A
5111396 Mills et al. May 1992 A
5111817 Clark et al. May 1992 A
5111818 Suzuki et al. May 1992 A
5120230 Clark et al. Jun 1992 A
5120421 Glass et al. Jun 1992 A
5128552 Fang et al. Jul 1992 A
5128752 Von Kohorn Jul 1992 A
5134391 Okada Jul 1992 A
5142358 Jason Aug 1992 A
5142484 Kaufman et al. Aug 1992 A
5143378 Joel Sep 1992 A
5171977 Morrison Dec 1992 A
5176502 Sanderson et al. Jan 1993 A
5182707 Cooper et al. Jan 1993 A
5204670 Stinton Apr 1993 A
5219322 Weathers Jun 1993 A
5222020 Takeda Jun 1993 A
5226895 Harris Jul 1993 A
5227874 Von Kohorn Jul 1993 A
5228450 Sellers Jul 1993 A
5230629 Buschke Jul 1993 A
5231990 Gauglitz Aug 1993 A
5243515 Lee Sep 1993 A
5249044 Von Kohorn Sep 1993 A
5251126 Kahn et al. Oct 1993 A
5261401 Baker et al. Nov 1993 A
5262943 Thibado et al. Nov 1993 A
5265888 Yamamoto et al. Nov 1993 A
5266179 Nankai et al. Nov 1993 A
5275159 Griebel Jan 1994 A
5282950 Dietze et al. Feb 1994 A
5295491 Gevins Mar 1994 A
5299121 Brill et al. Mar 1994 A
5301105 Cummings, Jr. Apr 1994 A
5304112 Mrklas et al. Apr 1994 A
5304468 Phillips et al. Apr 1994 A
5307263 Brown Apr 1994 A
5309919 Snell et al. May 1994 A
5321009 Baeder et al. Jun 1994 A
5325288 Satou Jun 1994 A
5329459 Kaufman et al. Jul 1994 A
5329608 Bocchieri et al. Jul 1994 A
5331549 Crawford, Jr. Jul 1994 A
5333981 Pronovost et al. Aug 1994 A
5335338 Proesel Aug 1994 A
5339821 Fujimoto Aug 1994 A
5341291 Roizen et al. Aug 1994 A
5343239 Lappington et al. Aug 1994 A
5344324 O'Donnell et al. Sep 1994 A
5357427 Langen et al. Oct 1994 A
5366896 Margrey et al. Nov 1994 A
5368562 Blomquist et al. Nov 1994 A
5371687 Holmes, II et al. Dec 1994 A
5375604 Kelly et al. Dec 1994 A
5377100 Pope et al. Dec 1994 A
5390238 Kirk et al. Feb 1995 A
5399821 Inagaki et al. Mar 1995 A
5410471 Alyfuku et al. Apr 1995 A
5410474 Fox Apr 1995 A
5429140 Burdea et al. Jul 1995 A
5431690 Schaldach et al. Jul 1995 A
5431691 Snell et al. Jul 1995 A
5434611 Tamura Jul 1995 A
5438607 Przygoda, Jr. et al. Aug 1995 A
5438983 Falcone Aug 1995 A
5441047 David et al. Aug 1995 A
5449334 Kingsbury Sep 1995 A
5454721 Kuch Oct 1995 A
5454722 Holland et al. Oct 1995 A
5456606 McIntyre Oct 1995 A
5456692 Smith, Jr. et al. Oct 1995 A
5458123 Unger Oct 1995 A
5467269 Flaten Nov 1995 A
5471039 Irwin, Jr. et al. Nov 1995 A
5471382 Tallman et al. Nov 1995 A
5483276 Brooks et al. Jan 1996 A
5488412 Majeti et al. Jan 1996 A
5488423 Walkingshaw et al. Jan 1996 A
5501231 Kaish Mar 1996 A
5502636 Clarke Mar 1996 A
5502726 Fischer Mar 1996 A
5504519 Remillard Apr 1996 A
5517405 McAndrew et al. May 1996 A
5518001 Snell May 1996 A
5519058 Gonick et al. May 1996 A
5519433 Lappington et al. May 1996 A
5523232 Sechler Jun 1996 A
5536249 Castellano et al. Jul 1996 A
5542420 Goldman et al. Aug 1996 A
5544649 David et al. Aug 1996 A
5546943 Gould Aug 1996 A
5549117 Tacklind et al. Aug 1996 A
5550575 West et al. Aug 1996 A
5553609 Chen et al. Sep 1996 A
5558638 Evers et al. Sep 1996 A
5564429 Bornn et al. Oct 1996 A
5569212 Brown Oct 1996 A
5572421 Altman et al. Nov 1996 A
5572646 Kawai et al. Nov 1996 A
5574828 Hayward et al. Nov 1996 A
5576952 Stutman et al. Nov 1996 A
5583758 McIlroy et al. Dec 1996 A
5590648 Mitchell et al. Jan 1997 A
5593349 Miguel et al. Jan 1997 A
5593390 Castellano et al. Jan 1997 A
5594637 Eisenberg et al. Jan 1997 A
5596994 Bro Jan 1997 A
5597307 Redford et al. Jan 1997 A
5601435 Quy Feb 1997 A
5613495 Mills et al. Mar 1997 A
5619991 Sloane Apr 1997 A
5624265 Redford et al. Apr 1997 A
5628309 Brown May 1997 A
5629981 Nerlikar May 1997 A
5631844 Margrey et al. May 1997 A
5633910 Cohen May 1997 A
5635532 Samid Jun 1997 A
5640569 Miller et al. Jun 1997 A
5640953 Bishop et al. Jun 1997 A
5642731 Kehr Jul 1997 A
5642936 Evans Jul 1997 A
5651363 Kaufman et al. Jul 1997 A
5651775 Walker et al. Jul 1997 A
5659691 Durward et al. Aug 1997 A
5666487 Goodman et al. Sep 1997 A
5670711 Detournay et al. Sep 1997 A
5675635 Vos et al. Oct 1997 A
5678562 Sellers Oct 1997 A
5678571 Brown Oct 1997 A
5679075 Forrest et al. Oct 1997 A
5680590 Parti Oct 1997 A
5680866 Kangas et al. Oct 1997 A
5687322 Deaton et al. Nov 1997 A
5687717 Halpern et al. Nov 1997 A
5687734 Dempsey et al. Nov 1997 A
5689652 Lupien et al. Nov 1997 A
5692906 Corder Dec 1997 A
5704364 Saltzstein et al. Jan 1998 A
5704366 Tacklind et al. Jan 1998 A
5704902 Vandenbelt et al. Jan 1998 A
5704922 Brown Jan 1998 A
5710178 Samid Jan 1998 A
5710918 Lagarde et al. Jan 1998 A
5711297 Iliff Jan 1998 A
5714319 Joutel et al. Feb 1998 A
5715451 Marlin Feb 1998 A
5715823 Wood et al. Feb 1998 A
5717739 Dyer et al. Feb 1998 A
5717913 Driscoll Feb 1998 A
5720733 Brown Feb 1998 A
5722418 Bro Mar 1998 A
5727153 Powell Mar 1998 A
5730124 Yamauchi Mar 1998 A
5730654 Brown Mar 1998 A
5732696 Rapoport et al. Mar 1998 A
5732709 Tacklind et al. Mar 1998 A
5734413 Lappington et al. Mar 1998 A
5749083 Koda et al. May 1998 A
5752234 Withers May 1998 A
5754740 Fukuoka et al. May 1998 A
5760771 Blonder et al. Jun 1998 A
5772585 Lavin et al. Jun 1998 A
5778882 Raymond et al. Jul 1998 A
5782814 Brown et al. Jul 1998 A
5785650 Akasaka et al. Jul 1998 A
5787295 Nakao Jul 1998 A
5791342 Woodard Aug 1998 A
5792117 Brown Aug 1998 A
5793969 Kamentsky et al. Aug 1998 A
5794219 Brown Aug 1998 A
5794251 Watanabe et al. Aug 1998 A
5796393 MacNaughton Aug 1998 A
5799318 Cardinal et al. Aug 1998 A
5800458 Wingrove Sep 1998 A
5802494 Kuno Sep 1998 A
5802534 Hatayama et al. Sep 1998 A
5806057 Gormley et al. Sep 1998 A
5810747 Brudny et al. Sep 1998 A
5819735 Mansfield et al. Oct 1998 A
5822544 Chaco et al. Oct 1998 A
5822715 Worthington et al. Oct 1998 A
5825283 Camhi Oct 1998 A
5827180 Goodman Oct 1998 A
5828943 Brown Oct 1998 A
5832448 Brown Nov 1998 A
5835896 Fisher et al. Nov 1998 A
5840020 Heinonen et al. Nov 1998 A
5842976 Williamson Dec 1998 A
5868669 Iliff Feb 1999 A
5868683 Protopapas et al. Feb 1999 A
5875432 Sehr Feb 1999 A
5879163 Brown et al. Mar 1999 A
5882338 Gray Mar 1999 A
5887133 Brown et al. Mar 1999 A
5893077 Griffin Apr 1999 A
5893098 Peters et al. Apr 1999 A
5897493 Brown Apr 1999 A
5899855 Brown May 1999 A
5911687 Sato et al. Jun 1999 A
5913310 Brown Jun 1999 A
5918603 Brown Jul 1999 A
5920477 Hofbert et al. Jul 1999 A
5933136 Brown Aug 1999 A
5935060 Iliff Aug 1999 A
5940801 Brown Aug 1999 A
5941829 Saltzstein et al. Aug 1999 A
5945651 Chorosinski et al. Aug 1999 A
5950630 Portwood et al. Sep 1999 A
5951300 Brown Sep 1999 A
5954641 Kehr et al. Sep 1999 A
5956501 Brown Sep 1999 A
5960403 Brown Sep 1999 A
5961446 Beller et al. Oct 1999 A
5966526 Yokoi Oct 1999 A
5971855 Ng Oct 1999 A
5971922 Arita et al. Oct 1999 A
5983003 Lection et al. Nov 1999 A
5983217 Khosravi-Sichani et al. Nov 1999 A
5987471 Bodine et al. Nov 1999 A
5995969 Lee et al. Nov 1999 A
5997476 Brown Dec 1999 A
5997502 Reilly et al. Dec 1999 A
6001065 DeVito Dec 1999 A
6022315 Iliff Feb 2000 A
6022615 Rettenbacher Feb 2000 A
6023686 Brown Feb 2000 A
6024281 Shepley Feb 2000 A
6029138 Khorasani et al. Feb 2000 A
6032119 Brown et al. Feb 2000 A
6035328 Soukal Mar 2000 A
6046761 Echerer Apr 2000 A
6049794 Jacobs et al. Apr 2000 A
6050940 Braun et al. Apr 2000 A
6055314 Spies et al. Apr 2000 A
6055487 Margery et al. Apr 2000 A
6055506 Frasca, Jr. Apr 2000 A
6057758 Dempsey et al. May 2000 A
6068615 Brown et al. May 2000 A
6095985 Raymond et al. Aug 2000 A
6101478 Brown Aug 2000 A
6108635 Herren et al. Aug 2000 A
6110148 Brown et al. Aug 2000 A
6113578 Brown Sep 2000 A
6138145 Kawanaka Oct 2000 A
6144837 Quy Nov 2000 A
6151586 Brown Nov 2000 A
6161095 Brown Dec 2000 A
6167362 Brown et al. Dec 2000 A
6167386 Brown Dec 2000 A
6168563 Brown Jan 2001 B1
6177940 Bond et al. Jan 2001 B1
6186145 Brown Feb 2001 B1
6189029 Fuerst Feb 2001 B1
D439242 Brown et al. Mar 2001 S
6210272 Brown Apr 2001 B1
6221012 Maschke et al. Apr 2001 B1
6233539 Brown May 2001 B1
6240393 Brown May 2001 B1
6248065 Brown Jun 2001 B1
6260022 Brown Jul 2001 B1
6270455 Brown Aug 2001 B1
6270456 Iliff Aug 2001 B1
6334778 Brown Jan 2002 B1
6352523 Brown et al. Mar 2002 B1
6368273 Brown Apr 2002 B1
6370513 Kolawa et al. Apr 2002 B1
6375469 Brown Apr 2002 B1
6379301 Worthington et al. Apr 2002 B1
6381577 Brown Apr 2002 B1
6436036 Miller-Kovach et al. Aug 2002 B1
6513532 Mault et al. Feb 2003 B2
6678669 Lapointe et al. Jan 2004 B2
20020019748 Brown Feb 2002 A1
20030068649 Doberstein et al. Apr 2003 A1
20040106855 Brown Jun 2004 A1
20040107116 Brown Jun 2004 A1
20040117207 Brown Jun 2004 A1
20040117208 Brown Jun 2004 A1
20040117209 Brown Jun 2004 A1
20040117210 Brown Jun 2004 A1
Foreign Referenced Citations (40)
Number Date Country
0286456 Oct 1988 EP
0320749 Jun 1989 EP
370599 May 1990 EP
0461910 Dec 1991 EP
508912 Oct 1992 EP
526166 Feb 1993 EP
0558975 Sep 1993 EP
0653718 May 1995 EP
676709 Oct 1995 EP
680727 Nov 1995 EP
761160 Mar 1997 EP
08131551 Dec 1997 EP
0251520 Jan 1998 EP
2218831 Nov 1989 GB
2225637 Jun 1990 GB
54005785 Jan 1979 JP
54146633 Nov 1979 JP
62226278 Oct 1987 JP
5155024 Jun 1993 JP
5266002 Oct 1993 JP
1995407095963 Apr 1995 JP
WO-8501667 Apr 1985 WO
WO-9000367 Jan 1990 WO
WO-9109374 Jun 1991 WO
WO-9301489 Jan 1993 WO
WO-9302622 Feb 1993 WO
WO-9416774 Aug 1994 WO
WO-9509386 Apr 1995 WO
WO-9520199 Jul 1995 WO
WO-9522131 Aug 1995 WO
WO-9529447 Nov 1995 WO
WO-9607908 Mar 1996 WO
WO-9625877 Aug 1996 WO
WO-9636923 Nov 1996 WO
WO-9708605 Mar 1997 WO
WO-9712544 Apr 1997 WO
WO-9737738 Oct 1997 WO
WO-9816895 Apr 1998 WO
WO-9831275 Jul 1998 WO
WO-9839933 Sep 1998 WO
Provisional Applications (2)
Number Date Country
60041746 Mar 1997 US
60041751 Mar 1997 US
Continuations (3)
Number Date Country
Parent 09041809 Mar 1998 US
Child 09496893 US
Parent 08850840 May 1997 US
Child 09378188 US
Parent 08847009 Apr 1997 US
Child 08850840 US
Continuation in Parts (4)
Number Date Country
Parent 08946341 Oct 1997 US
Child 09041809 US
Parent 08847009 Apr 1997 US
Child 08946341 US
Parent 09496893 US
Child 08946341 US
Parent 09378188 Aug 1999 US
Child 09496893 US