The present invention overcomes the limitations in the status quo by utilizing both distributed hardware and software nodes to automate the building of training set data or verifying an AI algorithm has properly annotated the desired subject in the data.
Specifically, hardware consisting of A) a positioning system, such as a Global Positioning System (GPS) receiver or some other relative positioning system for cases where GPS signals cannot be received (such as Ultrawide Band sensors used indoors), B) a data storage method such as a SD or microSD card with the test subject's time-tagged metadata (and other test data as needed) written on the data storage device, and C) a transmitter (such as radio, Bluetooth, Ethernet, or some other method) for broadcasting this metadata to a receiving unit. The hardware node is co-located with the test subject(s).
Specifically, software capable of receiving the time-tagged data (both hardware node and sensor) for aggregation, storage, and subsequent analysis. The software receives the time-tagged data from the external sensor, such as a video camera in an Unmanned Aerial Vehicle, and the data broadcast from the hardware node. Keeping with the Unmanned Aerial Vehicle example, the software then associates the known subject position in every video frame with the known metadata. Since the subject position is known (as it is broadcast from the hardware nodes), the software can draw a proper bounding box around the subject and digitally associate the known metadata in every video frame. This process occurs at the rate of input sensor data and input hardware node data.
The same system can work in reverse, i.e. the system can be used to verify an AI algorithm is properly classifying the sensor data by comparing the known subject metadata against the AI algorithm output.
In the case where verification of the subject's identity is necessary, for example, in dense target environments where subjects are positioned with separation distances close to or less than the error in geolocation, a positive target feedback can be employed. An example of positive feedback is the subject encoding a message and transmitting this information over the broadcast mechanism.
The same system can be utilized for multiple sensors whose aggregation constitutes a single or network of discrete lines or geo-boundary polynomials. In this way, a boundary or area may be defined and/or identified.
Some embodiments of the present invention are illustrated as an example and are not limited by the figures of the accompanying drawings, in which like references may indicate similar elements and in which:
The present invention automates common procedures required during the development and analysis of Artificial Intelligence (AI) algorithms.
Specifically, the invention automates two crucial steps: the generation of properly tagged training data sets and the verification/validation of AI algorithm performance.
The invention relates to both hardware devices and software algorithms that build automated training data sets and evaluate AI algorithms for efficacy.
Artificial Intelligence (AI) is a rapidly growing field that has permeated into many areas of both civilian and military technology. In general, AI algorithms “learn” via statistical pattern recognition by “training” against known data sets.
AI algorithms perceive information from a sensor, such as numerical data (for example radio wave signals collected by an antenna, which could be used for geolocating radio emitter positions), video (for example video imagery from an Unmanned Aerial Vehicle), or any number of other methods. In all cases, the sensor feeds data into the AI algorithm and the AI algorithm must accuracy classify the signal context. Typically, a specific test signal or object must be extracted from data in order to be deemed a successful algorithm.
Some types of AI algorithms require large amounts of properly formatted training data sets in order to accurately classify incident signals, be it numerical, video, or some other type. These data sets contain a digital association of metadata that defines the state of the subject data a priori. By digitally associating this metadata with every training data set, an AI algorithm can “learn” how to classify information. The type of training data can be purely numerical (such as geolocating radio emitter positions), pictorial (such as video imagery from an Unmanned Aerial Vehicle, or any number of other methods.
An example of the problem in modern AI development occurs in many types of automated recognition of subjects from imagery. In order to build the training data set, a researcher must first train the AI algorithm by manually drawing a bounding box around the desired object (a tree, cat, truck, etc.) and then manually indicate the test object with associated metadata (a red truck with chrome wheels, for example). This occurs for every image frame that will be used for training AI algorithms, which can be quite large and time consuming. The aforementioned example is the status quo in building AI training data sets and imposes large time and money costs on the development cycle.
In order to circumvent the enormous costs of training against actual data, many researchers build their own training data sets using synthetic data. However, given that the AI algorithm will be expected to perform against real world data, training data sets composed of real-world data are preferred.
Even if the training data set is manually generated and used to build the AI algorithm, the automated analysis of algorithm efficacy is manually completed. A human must manually verify the results of the AI output with a large amount of new data against which the algorithm is expected to perform adequately. The AI algorithm must then be scored for accuracy. This is a time-consuming process and imposes large time and money costs on the development cycle but is crucial for building trust between humans and AI algorithms.
Therefore, a need exists to A) automate both the collection, injection of metadata, and building of training data sets and B) automate the analysis and scoring of AI algorithms during test.
A new method for automating procedures required by AI algorithms is are discussed herein. Specifically, the invention relates to procedures for generating and assessing data required or generated by AI algorithms without manual human intervention.
The present invention will be described by referencing the appended figures representing preferred embodiments.
We make mention of the state of an object in this disclosure. Here state refers to the defining characteristics that make the object unique at a point in time.
A specific example of the present invention is shown in
The test item 17 position 19 and metadata 20 are passed to a hardware device 18 capable of transmitting all state information of the test item 17 to the receiver 23 located on System C. The video sensor state, position 15, and recorded data is passed through controller 14 to System C software 21 for processing and storage 21.
The present invention permits the test item 17 to cue the sensor 13 such that the sensor 13 readily and continuously records the test item 17 for a long period of time. In the case of
If the goal is to build training data sets, the known metadata 8 of the test item 5 is digitally associated 45 with the sensor 1 data stream. The raw sensor data 1 and associated metadata are packetized 43 for transmission 44 using a communication protocol (LAN network, radio, etc.) to a storage device.
If the goal is to evaluate an existing AI algorithm, the AI algorithm data 45 is evaluated against the known recorded data 40 and a statistical error analysis may be performed 47. The results of this analysis are then packetized 46 and transmitted 47 using a communication protocol (LAN network, radio, etc.) to a storage device.
In some instantiations, it may prove necessary to have positive feedback of the subject. This would occur, for example, in dense target environments where subjects are positioned with separation distances close to or less than the error in geolocation. In this case, the system may improperly tag the wrong subject. An example of positive feedback is the subject encoding a message and transmitting this information over the broadcast mechanism. In the example of a UAV observing a target object, the target object could flash a light such that the UAV would observe this event, confirm the correct target object is being tracked, and maintain a positive lock on the desired test subject using traditional computer vision tracking methods.
The same system can be utilized for multiple sensors whose aggregation constitutes a single or network of discrete lines or geo-boundary polynomials. In this way, a boundary or area may be defined and/or identified. In the example of a UAV, the UAV could continuously monitor a restricted area or geo-boundary defined by sensors located on four stationary or moving objects, such as buoys or vessels, and begin tracking a target object once the target enters the restricted area.
It will thus be seen that the objects set forth above, among those made apparent from the preceding description, are efficiently attained and, because certain changes may be made in carrying out the above method and in the construction(s) set forth without departing from the spirit and scope of the invention, it is intended that all matter contained in the above description and shown in the accompanying drawings shall be interpreted as illustrative and not in a limiting sense.
It is also to be understood that the following claims are intended to cover all of the generic and specific features of the invention herein described and all statements of the scope of the invention which, as a matter of language, might be said to fall there between.
The present application claims priority to U.S. Provisional Patent Application No. 62/908,820 filed Oct. 1, 2019, the contents of which is hereby incorporated by reference.
Number | Name | Date | Kind |
---|---|---|---|
6256352 | Chang | Jul 2001 | B1 |
7085637 | Breed et al. | Aug 2006 | B2 |
7202776 | Breed | Apr 2007 | B2 |
7295925 | Breed et al. | Nov 2007 | B2 |
7418346 | Breed et al. | Aug 2008 | B2 |
7629899 | Breed | Dec 2009 | B2 |
7688375 | Ho | Mar 2010 | B2 |
7770847 | Severson | Aug 2010 | B1 |
9103671 | Breed et al. | Aug 2015 | B1 |
10627396 | Bornhop | Apr 2020 | B2 |
10739149 | Chase et al. | Aug 2020 | B2 |
20120176237 | Tabe | Jul 2012 | A1 |
20150326570 | Publicover | Nov 2015 | A1 |
20160127641 | Gove | May 2016 | A1 |
20180204111 | Zadeh | Jul 2018 | A1 |
20190141467 | Breed | May 2019 | A1 |
Number | Date | Country |
---|---|---|
3032812 | Feb 2018 | CA |
103946732 | Jul 2014 | CN |
106600581 | Apr 2017 | CN |
105083331 | May 2017 | CN |
107077782 | Aug 2017 | CN |
104208865 | Jun 2018 | CN |
109843371 | Jun 2019 | CN |
209085657 | Jul 2019 | CN |
110568456 | Dec 2019 | CN |
110800273 | Feb 2020 | CN |
112074790 | Dec 2020 | CN |
112183712 | Jan 2021 | CN |
112183735 | Jan 2021 | CN |
200919210 | May 2009 | TW |
Number | Date | Country | |
---|---|---|---|
20210097436 A1 | Apr 2021 | US |
Number | Date | Country | |
---|---|---|---|
62908820 | Oct 2019 | US |