The agriculture industry comprises a large portion of the world's economy. In addition, as the population of the world increases annually, more food must be produced by existing agricultural assets. In order to increase yields on existing plots of farm land, producers require a clear understanding of plant and soil conditions. However, as a single farm may encompass hundreds of acres, it is difficult to access the conditions of the farm land.
Currently, farmers rely on their observations of their land along with prior experience to determine the requirements to increase the yield of their farm land. These observations may include identifying locations of weeds, identifying plant illnesses and determining levels of crop damage. However, considering the large number of acres in the average farm, these observations are not a reliable method to increase yields. Therefore, a need exists for system that will allow a farmer to better understand the conditions of their farm land.
Systems, methods, features, and advantages of the present invention will be or will become apparent to one with skill in the art upon examination of the following figures and detailed description. It is intended that all such additional systems, methods, features, and advantages be included within this description, be within the scope of the invention, and be protected by the accompanying claims.
One embodiment of the present disclosure includes a row detection system including an image gathering unit that gathers a high-altitude image having multiple channels, an image analysis unit that segments the high-altitude image into a plurality of equally size tiles and determines an index value based on at least one channel of the image, where the image analysis unit identifies rows of objects in each image.
In another embodiment, the image analysis unit separates each tile into a first channel image and a second channel image.
In another embodiment, the image analysis unit calculates a frequency spectrum from the first channel image and the second channel image.
In another embodiment, the image analysis unit applies a mask to the first channel image and the second channel image.
In another embodiment, the image analysis unit calculates a maximum energy and average energy for the masked first channel image and the masked second channel image.
In another embodiment, the image analysis unit assigns a confidence score for the first channel image and the second channel image based on the calculated maximum energy and average energy of the first channel image and the second channel image.
In another embodiment, the image analysis unit determines the slope, offset and distance for each row in a tile having a low confidence level and each adjacent tile having a high confidence score.
In another embodiment, the image analysis unit selects tiles adjacent to the low confidence tile with the adjacent tiles having a high confidence level
In another embodiment, the image analysis unit calculates an average inter-row distance for each row identified in the low and high confidence tiles.
In another embodiment, the image analysis unit calculates a maximum row angle and a minimum row angles for each row in the low and high confidence tiles, and the image analysis unit creates a parallel line for each row from the high confidence tile into the low confidence tile using the inter-row distance and row angles.
Another embodiment includes a row detection system having a memory and a processor, with a method of identifying rows of objects in an image is performed in the memory, the method including the steps of gathering a high-altitude image having multiple channels via a an image gathering unit, segmenting the high-altitude image into a plurality of equally size tiles via the image gathering unit, and determining an index value based on at least one channel of the image via the image gathering unit, where the image analysis unit identifies rows of objects in each image.
Another embodiment includes the step of separating each tile into a first channel image and a second channel image.
Another embodiment includes the step of the step of calculating a frequency spectrum from the first channel image and the second channel image.
Another embodiment includes the step of the step of applying a mask to the first channel image and the second channel image.
Another embodiment includes the step of the step of calculating a maximum energy and average energy for the masked first channel image and the masked second channel image.
Another embodiment includes the step of the step of assigning a confidence score for the first channel image and the second channel image based on the calculated maximum energy and average energy of the first channel image and the second channel image.
Another embodiment includes the step of determining the slope, offset and distance for each row in a tile having a low confidence level and each adjacent tile having a high confidence score.
Another embodiment includes the step of selecting tiles adjacent to the low confidence tile with the adjacent tiles having a high confidence level
Another embodiment includes the step of calculating an average inter-row distance for each row identified in the low and high confidence tiles.
Another embodiment includes the step of calculating a maximum row angle and a minimum row angles for each row in the low and high confidence tiles, and the image analysis unit creates a parallel line for each row from the high confidence tile into the low confidence tile using the inter-row distance and row angles.
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate an implementation of the present invention and, together with the description, serve to explain the advantages and principles of the invention. In the drawings:
Referring now to the drawings which depict different embodiments consistent with the present invention, wherever possible, the same reference numbers will be used throughout the drawings and the following description to refer to the same or like parts.
The row identification system 100 gathers medium to low resolution images gathered from an aircraft flying above 1,500 feet. Each image is then partitioned into equally sized tiles. Each tile is analyzed to identify objects within the tile. Adjacent tiles are then compared to identify similar objects in adjacent tiles. When the system 100 identifies an object that is inconsistent with adjacent objects, the system 100 identifies the area in the image containing the inconsistent object as an area requiring further statistical analysis. By comparing object areas to adjacent object areas to identify similar objects and dissimilar objects, the processing of large images covering multiple acres can be performed using less processing resources resulting in more images being processed and fewer images being gathered to analyze multiple acres of land.
The image gathering unit 110 and image analysis unit 112 may be embodied by one or more servers. Alternatively, each of the row detection unit 114 and image generation unit 116 may be implemented using any combination of hardware and software, whether as incorporated in a single device or as a functionally distributed across multiple platforms and devices.
In one embodiment, the network 108 is a cellular network, a TCP/IP network, or any other suitable network topology. In another embodiment, the row identification device may be servers, workstations, network appliances or any other suitable data storage devices. In another embodiment, the communication devices 104 and 106 may be any combination of cellular phones, telephones, personal data assistants, or any other suitable communication devices. In one embodiment, the network 102 may be any private or public communication network known to one skilled in the art such as a local area network (“LAN”), wide area network (“WAN”), peer-to-peer network, cellular network or any suitable network, using standard communication protocols. The network 108 may include hardwired as well as wireless branches. The image gathering unit 112 may be a digital camera.
In one embodiment, the network 108 may be any private or public communication network known to one skilled in the art such as a Local Area Network (“LAN”), Wide Area Network (“WAN”), Peer-to-Peer Network, Cellular network or any suitable network, using standard communication protocols. The network 108 may include hardwired as well as wireless branches.
In step 508, the NDVI image and NIR images are separated non-overlapping tiles of equally sized pixels. In one embodiment, each tile is 256×256 pixels.
Planted vegetation seen in aerial images occurs in regular patterns often as parallel equidistant rows.
Where w is the width of the tile.
In step 604 the general peak frequency is calculated as the median value of all the highest FFT peaks extracted in each of the NDVI tiles. In step 606, the general row spacing between the planted vegetation rows is computed using Equation 1 above for the general peak frequency.
In step 608, a binary mask of the same size is calculated with the NDVI tile (256×256 pixel in one implementation). The binary mask contains non-zero values only in the circle corresponding to the general peak frequency. In FFT domain this mask will select only peaks that correspond to vegetation rows separated by general row spacing described above.
c
NDVI=max(M∘FNDVI)/mean(M∘FNDVI),
where ∘ denotes element-wise multiplication.
In each masked frequency spectrum NIR tile the maximum and average energies are calculated using the following equation:
c
NIR=max(M∘FNIR)/mean(M∘FNIR).
In step 710, a confidence score is calculate using the following equation:
c=max(cNDVI,cNIR/1.5)
In step 712, the rows are determined using the confidence score if the confidence score is above a predetermined threshold. The tiles without vegetation have higher of energy in low frequencies, while the tiles containing vegetation rows have high energy peaks in high frequencies.
b_i=b_0+i*d_row
where i is the ith row in the tile. For each combination b_0 and alpha a new set of lines are created and a mask image is formed with 1 indicating points belonging to a line and 0 for points not belonging to a line. In step 720, a refined confidence score is calculated by determining the NVDI value for each of the new lines created. The lines with the highest confidence scores are identified as the rows in the low confidence tiles.
q_k=min_i_d(p_k,line_i)
Each pixel p_k is assigned to a line line_i for which the distance above is minimized. If the distribution is unimodal, fit the line parameters using the all samples. In one embodiment, the parameters of i_th line (b_i and alpha_i) are calculated using the set of pixels for which q_k=i. If the distribution is bimodal, the number of samples associated with each modality is determined. In step 736, the overall fit score is determined using the following equation:
Q=\sum_i\sum_k d(p_k,line_i)/number of all vegetation pixels
In step 738, If Q is less than a threshold value, the process ends. If Q is greater than a threshold value, the process returns to step 732.
A binary mask of the row planted vegetation for the entire image may be created. The mask is generated only for tiles for which the confidence score is above a predetermined threshold. The binary mask may have a non-zero value for pixels corresponding to lines (parallel equidistant with the given slope and spacing).
NDVI intensities of the pixels on vegetation lines are grouped into three categories. First, mean m and standard deviation s of the NDVI intensities of the pixels on vegetation lines are calculated. The pixels having intensity less than m−3 s are grouped into category R. The pixels having intensity between m−3 s and m−1.5 s and are grouped into category Y. The remaining pixels on the vegetation lines are grouped into category. The coverage score c is calculated using the following formula:
While various embodiments of the present invention have been described, it will be apparent to those of skill in the art that many more embodiments and implementations are possible that are within the scope of this invention. Accordingly, the present invention is not to be restricted except in light of the attached claims and their equivalents.
This application is a non-provisional patent application that claims the benefit of and the priority from U.S. Provisional Patent Application No. 62/616,153, filed Jan. 11, 2018, titled ROW DETECTION SYSTEM.
Number | Date | Country | |
---|---|---|---|
62616153 | Jan 2018 | US |