The present invention relates to a system for feature extraction for formal concept analysis (FCA) and, more particularly, to a system for feature extraction for FCA using Parzen windows.
Many forms of information can be described as a set of objects, each with a set of attributes and/or values. In these cases, any hierarchical structure remains implicit. Often the set of objects can be related to two or more completely different domains of attributes and/or values. Formal Concept Analysis (FCA) is a principled way of deriving a partial order on a set of objects, each defined by a set of attributes. It is a technique in data and knowledge processing that has applications in data visualization, data mining, information retrieval, and knowledge management (see the List of Incorporated Literature References, Literature Reference No. 2). The principle with which it organizes data is a partial order induced by an inclusion relation between object's attributes. Additionally, FCA admits rule mining from structured data.
FCA is widely applied for data analysis. FCA relies on binary features in order to construct lattices. There are techniques for converting scalar data to a binarized format, but they often result in the creation of too many attributes to be efficiently used in lattice construction. Feature selection on scalar data is typically done by scaling or creating uniform bins. Existing methods of selecting features from scalar data in FCA suffer from blind selection policies which yield too many and, typically, not useful features. This is problematic due to the exponentially increasing computational time required for lattice construction based on features.
Thus, a continuing need exists for reducing the number of features in FCA down to the most useful to allow for smaller lattices to be constructed without diminishing the powers of FCA.
The present invention relates to a system for feature extraction for formal concept analysis (FCA) and, more particularly, to a system for feature extraction for FCA using Parzen windows. The system comprises one or more processors and a non-transitory computer-readable medium having executable instructions encoded thereon such that when executed, the one or more processors perform multiple operations. The system separates a set of data points having features into a set of known object classes. For each known object class, the data points are convolved with a Gaussian function, resulting in a class distribution curve for each known object class. For each class distribution curve, intervals of data values on which the class distribution curve is maximum with respect to all other class distribution curves are identified. The intervals are ranked with respect to a predetermined confidence threshold value. The ranking of the intervals are used to select which features to extract from the set of data points in FCA lattice construction, and the selected features are extracted from the set of data points.
In another aspect, the selected features are used to interpret neural data.
In another aspect, the selected features are applied to functional magnetic resonance imaging (fMRI) responses to classify a thought process of a human.
In another aspect, the system generates a binary array comprising ones and zeroes, having ones on intervals of data on which the class distribution curve is maximum, and zeroes elsewhere.
In another aspect, for each known object class, a binary class curve indicating for which interval a performance of the known object class exceeds all other known object classes is generated.
In another aspect, the set of data points comprises data from a neural sensor.
In another aspect, the predetermined confidence threshold value is used to eliminate intervals having a low confidence value.
In another aspect, the ranking of the intervals is determined by taking a ratio of an area under each class distribution curve along each interval to a sum of the areas under all the other class distribution curves along each interval.
In another aspect, the present invention also comprises a method for causing a processor to perform the operations described herein.
Finally, in yet another aspect, the present invention also comprises a computer program product comprising computer-readable instructions stored on a non-transitory computer-readable medium that are executable by a computer having a processor for causing the processor to perform the operations described herein.
The file of this patent or patent application publication contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
The objects, features and advantages of the present invention will be apparent from the following detailed descriptions of the various aspects of the invention in conjunction with reference to the following drawings, where:
The present invention relates to a system for feature extraction for formal concept analysis (FCA) and, more particularly, to a system for feature extraction for FCA using Parzen windows. The following description is presented to enable one of ordinary skill in the art to make and use the invention and to incorporate it in the context of particular applications. The application discussed is for analyzing brain activity in response to different stimuli using FCA by constructing a lattice using the feature extraction method in this invention. Various modifications, as well as a variety of uses in different applications will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to a wide range of aspects. Thus, the present invention is not intended to be limited to the aspects presented, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
In the following detailed description, numerous specific details are set forth in order to provide a more thorough understanding of the present invention. However, it will be apparent to one skilled in the art that the present invention may be practiced without necessarily being limited to these specific details. In other instances, well-known structures and devices are shown in block diagram form, rather than in detail, in order to avoid obscuring the present invention.
The reader's attention is directed to all papers and documents which are filed concurrently with this specification and which are open to public inspection with this specification, and the contents of all such papers and documents are incorporated herein by reference. All the features disclosed in this specification, (including any accompanying claims, abstract, and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features.
Furthermore, any element in a claim that does not explicitly state “means for” performing a specified function, or “step for” performing a specific function, is not to be interpreted as a “means” or “step” clause as specified in 35 U.S.C. Section 112, Paragraph 6. In particular, the use of “step of” or “act of” in the claims herein is not intended to invoke the provisions of 35 U.S.C. 112, Paragraph 6.
Please note, if used, the labels left, right, front, back, top, bottom, forward, reverse, clockwise and counter-clockwise have been used for convenience purposes only and are not intended to imply any particular fixed direction. Instead, they are used to reflect relative locations and/or directions between various portions of an object. As such, as the present invention is changed, the above labels may change their orientation.
Before describing the invention in detail, first a list of cited literature references used in the description is provided. Next, a description of various principal aspects of the present invention is provided. Following that is an introduction that provides an overview of the present invention. Finally, specific details of the present invention are provided to give an understanding of the specific aspects.
The following references are cited and incorporated throughout this application. For clarity and convenience, the references are listed herein as a central resource for the reader. The following references are hereby incorporated by reference as though fully included herein. The references are cited in the application by referring to the corresponding literature reference number, as follows:
Various embodiments have three “principal” aspects. The first is a system for Parzen window feature selection for formal concept analysis (FCA). The system is typically in the form of a computer system operating software or in the form of a “hard-coded” instruction set. This system may be incorporated into a wide variety of devices that provide different functionalities, such as a robot or other device. The second principal aspect is a method, typically in the form of software, operated using a data processing system (computer). The third principal aspect is a computer program product. The computer program product generally represents computer-readable instructions stored on a non-transitory computer-readable medium such as an optical storage device, e.g., a compact disc (CD) or digital versatile disc (DVD), or a magnetic storage device such as a floppy disk or magnetic tape. Other, non-limiting examples of computer-readable media include hard disks, read-only memory (ROM), and flash-type memories. These aspects will be described in more detail below.
A block diagram depicting an example of a system (i.e., computer system 100) of the present invention is provided in
The computer system 100 may include an address/data bus 102 that is configured to communicate information. Additionally, one or more data processing units, such as a processor 104 (or processors), are coupled with the address/data bus 102. The processor 104 is configured to process information and instructions. In an aspect, the processor 104 is a microprocessor. Alternatively, the processor 104 may be a different type of processor such as a parallel processor, or a field programmable gate array.
The computer system 100 is configured to utilize one or more data storage units. The computer system 100 may include a volatile memory unit 106 (e.g., random access memory (“RAM”), static RAM, dynamic RAM, etc.) coupled with the address/data bus 102, wherein a volatile memory unit 106 is configured to store information and instructions for the processor 104. The computer system 100 further may include a non-volatile memory unit 108 (e.g., read-only memory (“ROM”), programmable ROM (“PROM”), erasable programmable ROM (“EPROM”), electrically erasable programmable ROM “EEPROM”), flash memory, etc.) coupled with the address/data bus 102, wherein the non-volatile memory unit 108 is configured to store static information and instructions for the processor 104. Alternatively, the computer system 100 may execute instructions retrieved from an online data storage unit such as in “Cloud” computing. In an aspect, the computer system 100 also may include one or more interfaces, such as an interface 110, coupled with the address/data bus 102. The one or more interfaces are configured to enable the computer system 100 to interface with other electronic devices and computer systems. The communication interfaces implemented by the one or more interfaces may include wireline (e.g., serial cables, modems, network adaptors, etc.) and/or wireless (e.g., wireless modems, wireless network adaptors, etc.) communication technology.
In one aspect, the computer system 100 may include one or more of an input device 112 coupled with the address/data bus 102, wherein the input device 112 is configured to communicate information and command selections to the processor 100. In accordance with one aspect, the input device 112 includes an alphanumeric input device, such as a keyboard, that may include alphanumeric and/or function keys. Alternatively, or in addition, the input device 112 may include an input device other than an alphanumeric input device. For example, the input device 112 may include one or more sensors such as a camera for video or still images, a microphone, or a neural sensor. Other example input devices 112 may include an accelerometer, a GPS sensor, or a gyroscope.
In an aspect, the computer system 100 further may include one or more optional computer usable data storage devices, such as a storage device 116, coupled with the address/data bus 102. The storage device 116 is configured to store information and/or computer executable instructions. In one aspect, the storage device 116 is a storage device such as a magnetic or optical disk drive (e.g., hard disk drive (“HDD”), floppy diskette, compact disk read only memory (“CD-ROM”), digital versatile disk (“DVD”)). Pursuant to one aspect, a display device 118 is coupled with the address/data bus 102, wherein the display device 118 is configured to display video and/or graphics. In an aspect, the display device 118 may include a cathode ray tube (“CRT”), liquid crystal display (“LCD”), field emission display (“FED”), plasma display, or any other display device suitable for displaying video and/or graphic images and alphanumeric characters recognizable to a user.
The computer system 100 presented herein is an example computing environment in accordance with an aspect. However, the non-limiting example of the computer system 100 is not strictly limited to being a computer system. For example, an aspect provides that the computer system 100 represents a type of data processing analysis that may be used in accordance with various aspects described herein. Moreover, other computing systems may also be implemented. Indeed, the spirit and scope of the present technology is not limited to any single data processing environment. Thus, in an aspect, one or more operations of various aspects of the present technology are controlled or implemented using computer-executable instructions, such as program modules, being executed by a computer. In one implementation, such program modules include routines, programs, objects, components and/or data structures that are configured to perform particular tasks or implement particular abstract data types. In addition, an aspect provides that one or more aspects of the present technology are implemented by utilizing one or more distributed computing environments, such as where tasks are performed by remote processing devices that are linked through a communications network, or such as where various program modules are located in both local and remote computer-storage media including memory-storage devices.
An illustrative diagram of a computer program product (i.e., storage device) embodying the present invention is depicted in
Formal concept analysis (FCA) is a principled way of deriving a concept hierarchy or formal ontology from a collection of objects and their properties or attributes. It is a creation of a partial order of the objects based on an ordering relation defined by set inclusion of attributes. Formally, a context=(G, M, I) consists of two sets G and M and a relation I, called the incidence relation, between them. The elements of G are called the objects, and the elements of M are called the attributes (see Literature Reference No. 4). If an object gεG has the attribute mεM, then write glm or (g, m)εI. A context can be represented by a cross table, or context table, which is a rectangular table where the rows are headed by objects and the columns are headed by attributes, an example of which is illustrated in
A formal concept can now be defined. A formal concept of the context (G, M, I) is a pair (A, B) with A⊂G, B⊂M, A′=B, and B′=A. A is called the extent, and B is called the intent of the concept (A, B). (G, M, I) denotes the set of all concepts of the context (G, M, I). A concept is represented within a context table by a maximal contiguous block of “X”'s after arbitrary rearrangement of rows and columns, as shown in
tεT(At,Bt)=(∩tεTAt,(∪tεTBt)″) and
tεT(At,Bt)=((∪tεTAt)″,∩tεTBt).
Referring to
(3.1) Example of a Context and Concept Lattice
A concept lattice is a mathematical object represented by (G, M, I) as described above. A concept lattice can be visualized by a Hasse diagram, a directed acyclic graph where the nodes represent concepts and lines represent the inclusion relationship between the nodes. In the case of formal concept analysis, the Hasse diagram has a single top node representing all objects (given by G), and a single bottom node representing all attributes (given by M). All the nodes in between represent the various concepts comprised of some subset of objects and attributes. A line between two nodes represents the order information. The node above is considered greater than the node below. In a Hasse diagram, a node in with attribute set m and object set g has the following properties:
Thus, the ordering of the nodes within the lattice n>k implies that the extent of n is contained in the extent of k and, equivalently, the intent of n is contained in the intent of k. The upset of a node n consists of all of its ancestor nodes within the lattice. The downset of n consists of all its children nodes within the lattice.
(4) Specific Details of the Invention
In the system according to some embodiments of the present invention, feature selection is performed on scalar data from BOLD (blood oxygenation level dependent) responses measured using fMRI (functional magnetic resonance imaging). fMRI is a functional neuroimaging procedure using MRI technology that measures brain activity by detecting changes associated with blood flow. This technique relies on the fact that cerebral blood flow and neuronal activation are coupled. When an area of the brain is in use, blood flow to that region also increases. fMRI typically provides a dataset that can consist of samples of brain activity (inferred from the BOLD signal) from 20 k-100 k (where k represents “thousand”) voxels, in response to stimuli. Feature selection from this high dimensional scalar data is performed to extract signal from noise in the voxel responses. The selected features can then be further analyzed using methods, such as FCA, to understand their structure and contribution to activity in response to stimuli (referred to as object class below), and further be used to decode brain activity back into stimulus dimensions.
(4.1) Feature Selection
A Parzen window density estimation is used in determining appropriate bins for the scalar data values (see Literature Reference No. 3 for a description of Parzen window density estimation). The method according to some embodiments of the present invention consists of separating the data points into the separate known object classes. For each class, the data points are convolved with a Gaussian function. The resulting curves are called class distribution curves, which are depicted in
Formally, the algorithm ParzenFeatureSelection is as follows. Let Gauss(μ) be a Gaussian with mean μ and standard deviation σ. Co is used for the class curve of the object o, and the resulting bins is bo. The corresponding confidence value is Co. The outputs are bins, which is a list of the beginning value and ending value of the intervals, and confs, which is the list of confidence levels for each interval.
This ranking in confidence can be done in a variety of ways, a non-limiting example of which is described below. The ranks are established by taking the ratio of the area under the class distribution curve along the interval to the sum of the areas of all the other class distribution curves along the interval. In our application, an fMRI experiment measures brain activity as voxel values in response to different stimulus classes (e.g., class A and B) repeatedly to produce multiple measurement samples. For example, if the input data voxel value achieves 3.7 for 10 different samples, and 7 of the samples are associated with an element of class A, and 3 of the samples are other classes, then if the value 3.7 is observed in another sample, one can be 70% confident it is an instance of class A. A predetermined threshold value is used to throw out intervals with low confidence values. Other methods for confidence level computation may prove useful depending on the statistics of the data (number of samples, distribution of sample values). The following are non-limiting examples of confidence level computation:
(4.2) Experimental Studies
Studies were performed on two sets of data for classification. The first is the Iris data set available in the University of California, Irvine (UCI) machine learning repository (see Literature Reference No. 7 for the Iris data set). In this problem, the goal is to classify the iris type based on ‘sepal length’, ‘sepal width’, ‘petal length’, and ‘petal width’. The second data set was comprised of fMRI BOLD responses.
(4.2.1) Iris
Classification of the Iris data set was performed using the algorithm described in U.S. Non-Provisional application Ser. No. 14/807,083, which is hereby incorporated by reference as though fully set forth herein. Using the present invention, it was possible to classify the data set with much smaller lattices compared to previous techniques, such as uniform binning of the data, making the classification much quicker.
Additionally, a study was performed to see if the classification accuracy could be boosted while still maintaining a small lattice structure. The results are depicted in
(4.2.2) Functional Magnetic Resonance Imaging (fMRI) Blood-Oxygen-Level Dependent (BOLD) Responses
(4.2.2.1) Voxel Binning
fMRI BOLD responses are used to represent a level of neural activity within the brain in a non-invasive way. Various stimuli (e.g., spoken words, written words, images) are presented, representing semantic or conceptual input. During the presentation of stimuli, the brain's responses are recorded. A baseline of null activity is subtracted out and the difference between this neutral brain state and the brain's state in response to the stimuli is extracted.
The set of stimuli (whether individual words of sentences, spoken words, images, etc.) represent the objects of formal concept analysis (FCA), and the extracted fMRI BOLD responses for the voxels within the brain represent the attributes of the objects. FCA classification (described in U.S. Non-Provisional application Ser. No. 14/807,083) can then be applied to the fMRI BOLD responses in an effort to classify the thought process of a human. To these ends, feature extraction via the Parzen window binning algorithm of the present invention is employed.
The invention described herein has multiple applications. For instance, as described above, FCA classification is instrumental to the classification of fMRI BOLD responses to presented stimuli. Further, the method according to some embodiments of the present invention can be used to classify inefficiencies within a production line or a circuit design, since many such inefficiencies are dependency based, resulting from the hidden structures within the production process.
This is a Continuation-in-Part application of U.S. Non-Provisional application Ser. No. 14/489,313, filed in the United States on Sep. 17, 2014, entitled, “Mapping Across Domains to Extract Conceptual Knowledge Representation from Neural Systems,” which is a Non-Provisional patent application of U.S. Provisional Application No. 62/028,083, filed in the United States on Jul. 23, 2014, entitled, “Mapping Across Domains to Extract Conceptual Knowledge Representation from Neural Systems,” which are incorporated herein by reference in their entirety. This is ALSO a Continuation-in-Part application of U.S. Non-Provisional application Ser. No. 14/807,083, filed in the United States on Jul. 23, 2015, entitled, “A General Formal Concept Analysis (FCA) Framework for Classification, which is a Continuation-in-Part application of U.S. Non-Provisional application Ser. No. 14/489,313, filed in the United States on Sep. 17, 2014, entitled, “Mapping Across Domains to Extract Conceptual Knowledge Representation from Neural Systems,” which is incorporated herein by reference in their entirety. U.S. Non-Provisional application Ser. No. 14/807,083 is ALSO a Non-Provisional application of U.S. Provisional Application No. 62/028,171, filed in the United States on Jul. 23, 2014, entitled, “A General Formal Concept Analysis (FCA) Framework for Classification,” which is incorporated herein by reference in its entirety. This is ALSO a Non-Provisional patent application of U.S. Provisional Application No. 62/195,876, filed in the United States on Jul. 23, 2015, entitled, “A Parzen Window Feature Selection Algorithm for Formal Concept Analysis (FCA),” which is incorporated herein by reference in its entirety.
This invention was made with government support under U.S. Government Contract Number FA8650-13-C7356. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
62028083 | Jul 2014 | US | |
62028171 | Jul 2014 | US | |
62195876 | Jul 2015 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 14489313 | Sep 2014 | US |
Child | 15150998 | US | |
Parent | 14807083 | Jul 2015 | US |
Child | 14489313 | US | |
Parent | 14489313 | Sep 2014 | US |
Child | 14807083 | US |