The present invention relates to a system for and method of processing laser scan samples comprising laser scan samples relating to building façades. One field of application is the detection of building façade texture elements from mobile mapping system data that includes images from cameras and output data from laser scanners.
Systems for detection of texture elements from data obtained by a mobile mapping system (MMS) have been known for some years. Such an MMS is provided with one or more cameras that take pictures of the environment as controlled by a processor on board the MMS. An MMS may take the form of a car that is driven along roads of interest while the processor controls the camera(s) to take pictures of building façades. Nowadays, in an off-line process, the locations of texture elements, like windows and doors, in the façade pictures are identified. The texture elements as present in these façade pictures are then substituted by standard texture elements as stored in a library that is accessible by the processor. Since these standard texture elements are used to construct the façades, the memory space required to store façades with their texture elements is much less than is necessary for storing all the original pictures with their original textures. Further reference is made to unpublished PCT/EP2005/055317.
To build a textured 3D façade model, a good method for identification of such texture elements in the raw picture of the façade is required. In the prior art, there exist methods to analyze and decompose texture from pictures. However, there is a rather high chance that incorrect objects, such as trees, cars, people, and other obstacles in front of a façade, are identified as being texture elements. Also, like all highly advanced image-based technologies, the methods require heavy computational power. Moreover, the prior art process requires a lot of human interaction to obtain good quality for the textural representation of the façade. The speed of manual extraction may be only 1 km/h, on average, which, in total, would bring a cost of thousands of man hours for an average sized city.
It is an object of the invention to provide a system and method to detect a texture element in a façade as well as its location that requires much less man power than in the prior art.
To that effect, the present invention provides a system as defined in claim 1.
The system according to the invention provides the user with more automatic detection of texture elements in façades than was possible in the prior art. Thus, an enormous amount of labour in the sense of manpower as well as money can be saved. And a realistic building façade can be constructed from these texture elements, for example in a 3D model of the buildings along a street.
In an embodiment, the invention relates to a method as defined in the independent method claim.
In a further embodiment, the invention relates to a computer program product comprising instructions and data arranged to instruct a processor to perform such a method.
Moreover, the invention relates to a data carrier comprising such a computer program product.
The invention will be explained in detail with reference to some drawings that are intended to illustrate the invention but not to limit its scope which is defined by the annexed claims and its equivalent embodiments.
In the drawings:
FIGS. 7a, 7b, and 7c, respectively, show the façade of the last example of
FIGS. 9a and 9b, respectively, show middle floors of a scanned façade and an averaged image of
FIGS. 10a-10d show how masks can be derived from the images of
FIGS. 11a and 11b show how façade wall data is derived with the aid of a mask;
FIGS. 15a, 15b, and 15c show an original picture of a façade, a façade picture as resulting from the invention and some stored texture elements, respectively.
The car 1 is provided with a plurality of wheels 2. Moreover, the car 1 is provided with a high accuracy position determination device. As shown in
The system as shown in
It is a general desire to provide as accurate as possible location and orientation measurement from the 3 measurement units: GPS, IMU and DMI. These location and orientation data are measured while the camera(s) 9(i) take pictures and the laser scanner(s) 3(j) take laser samples. Both the pictures and the laser samples are stored for later use in a suitable memory of the μP in association with corresponding location and orientation data of the car 1 at the time these pictures and laser samples were taken.
The pictures and laser samples include information as to building block façades. In an embodiment, the laser scanner(s) 3(j) are arranged to produce an output of at least 50 Hz with a resolution of 1 deg or finer, in order to produce a dense enough output for the method. A laser scanner such as MODEL LMS291-S05 produced by SICK is capable of producing such output.
The microprocessor in the car 1 and memory 9 may be implemented as a computer arrangement. An example of such a computer arrangement is shown in
In
The processor 11 is connected to a plurality of memory components, including a hard disk 12, Read Only Memory (ROM) 13, Electrically Erasable Programmable Read Only Memory (EEPROM) 14, and Random Access Memory (RAM) 15. Not all of these memory types need necessarily be provided. Moreover, these memory components need not be located physically close to the processor 11 but may be located remote from the processor 11.
The processor 11 is also connected to means for inputting instructions, data etc. by a user, like a keyboard 16, and a mouse 17. Other input means, such as a touch screen, a track ball and/or a voice converter, known to persons skilled in the art may be provided too.
A reading unit 19 connected to the processor 11 is provided. The reading unit 19 is arranged to read data from and possibly write data on a data carrier like a floppy disk 20 or a CDROM 21. Other data carriers may be tapes, DVD, CD-R, DVD-R, memory sticks etc. as is known to persons skilled in the art.
The processor 11 is also connected to a printer 23 for printing output data on paper, as well as to a display 18, for instance, a monitor or LCD (Liquid Crystal Display) screen, or any other type of display known to persons skilled in the art.
The processor 11 may be connected to a loudspeaker 29.
The processor 11 may be connected to a communication network 27, for instance, the Public Switched Telephone Network (PSTN), a Local Area Network (LAN), a Wide Area Network (WAN), the Internet etc. by means of I/O means 25. The processor 11 may be arranged to communicate with other communication arrangements through the network 27.
The data carrier 20, 21 may comprise a computer program product in the form of data and instructions arranged to provide the processor with the capacity to perform a method in accordance with the invention. However, such computer program product may, alternatively, be downloaded via the telecommunication network 27.
The processor 11 may be implemented as a stand-alone system, or as a plurality of parallel operating processors each arranged to carry out subtasks of a larger computer program, or as one or more main processors with several sub-processors. Parts of the functionality of the invention may even be carried out by remote processors communicating with processor 11 through the network 27.
It is observed that when applied in the car 1 the computer arrangement does not need to have all components shown in
For post-processing the pictures and scans as taken by the camera(s) 9(i) and the laser scanner(s) 3(j) a similar arrangement as the one shown in
In the present invention, façade textures are decomposed by using both the pictures taken by the camera(s) 9(i) and the laser scans taken by the laser scanner(s) 3(j). The method uses a unique combination of techniques from both the field of image processing and laser scanning technology.
These actions will now be explained in detail below.
A. Action 42: Extraction of Façade Points in Laser Scan
The laser scanner(s) 3(j) are, in an embodiment, 2D laser scanner(s). A 2D laser scanner 3(j) provides a triplet of data comprising time of measurement, angle of measurement, and distance to the nearest solid object that is visible at this angle from the laser scanner 3(j). A good method for finding façade points in the laser scan is to use a histogram analysis.
In
The peak on histogram 63 indicates the presence of a flat solid surface parallel to the car heading. The approximate distance between the car 1 and the façade 65 can be determined by any available method. Alternatively, GPS (or other) data indicating the trajectory traveled by the car 1 and data showing locations of footprints of buildings can be compared and, thus, render such approximate distance data between the car 1 and the façade 65. By analysing the histogram data within a certain area about this approximate distance, the local maximal peak within this area is identified as being the base of a façade 65. All laser scan samples that are within a perpendicular distance of, for instance, 0.5 m from this local maximal peak are considered as architectural detail of the façade 65 and marked as “façade points”. All other points are considered as “ghosts” and are marked so. It is observed that the distance of 0.5 m is only given as an example. Other distances may be used, if required.
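By way of illustration only, the histogram analysis described above may be sketched as follows. The function name `mark_facade_points`, the bin size of 0.1 m and the search window of 2 m about the approximate distance are assumptions made for this sketch and are not taken from the description; only the 0.5 m tolerance is the example value given above.

```python
import math
from collections import Counter


def mark_facade_points(distances, approx_distance, search_window=2.0,
                       bin_size=0.1, tolerance=0.5):
    """Label perpendicular distances (metres) as 'facade' or 'ghost'.

    Returns (peak_distance, labels); peak_distance is None when no sample
    falls inside the search window about approx_distance.
    """
    # Histogram of perpendicular distances, keyed by integer bin index.
    histogram = Counter(math.floor(d / bin_size) for d in distances)

    # Only bins whose centre lies near the approximate facade distance
    # are candidates for the local maximal peak.
    candidates = {
        b: n for b, n in histogram.items()
        if abs((b + 0.5) * bin_size - approx_distance) <= search_window
    }
    if not candidates:
        return None, ["ghost"] * len(distances)

    # The fullest candidate bin is taken as the base of the facade.
    peak_bin = max(candidates, key=candidates.get)
    peak_distance = (peak_bin + 0.5) * bin_size

    # Samples within the tolerance of the peak are facade points.
    labels = ["facade" if abs(d - peak_distance) <= tolerance else "ghost"
              for d in distances]
    return peak_distance, labels
```

In this sketch the peak is reported as the centre of the fullest bin, so a coarser bin size trades positional accuracy of the façade base for robustness against sparse data.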
From points marked as “façade” a depth map perpendicular to the vehicle direction is created and stored as an image. This depth-map contains all laser scan samples within, for instance, 0.5 m from the local maximal peak.
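One possible way of forming such a depth map is sketched below. It assumes that each façade point is available as a hypothetical tuple (position along the driving direction, height above ground, perpendicular distance), all in metres, and that cells without samples are stored as `None`; the cell size and the function name `build_depth_map` are likewise assumptions.

```python
import math


def build_depth_map(facade_points, cell_size=0.1):
    """Rasterise (x, z, depth) facade points into a grid of mean depths.

    Row 0 of the returned grid is the lowest height band and column 0 the
    smallest along-track position; empty cells hold None.
    """
    if not facade_points:
        return []

    # Collect all depth samples that fall into the same grid cell.
    cells = {}
    for x, z, depth in facade_points:
        key = (math.floor(z / cell_size), math.floor(x / cell_size))
        cells.setdefault(key, []).append(depth)

    min_row = min(r for r, _ in cells)
    max_row = max(r for r, _ in cells)
    min_col = min(c for _, c in cells)
    max_col = max(c for _, c in cells)

    grid = [[None] * (max_col - min_col + 1)
            for _ in range(max_row - min_row + 1)]
    for (r, c), depths in cells.items():
        # Store the mean perpendicular distance of the samples in the cell.
        grid[r - min_row][c - min_col] = sum(depths) / len(depths)
    return grid
```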
Examples of detected façade textures in
Below, the last façade of
B. Action 44: Floor Height Size Determination Based on Fourier Analysis of Extracted Points of Laser Scan
The next action in the process is the computation of the number of floors in a building as well as a height of a single floor.
On the image generated in action 42, variance filtering is applied with a window size of 3×3. The variance filtering is applied for a plurality of horizontal lines in the entire area of the image. Each horizontal line is at a different height above ground. For every horizontal line, an average value of this variance over the complete line is calculated. This average data is stored in a table in memory. Thus, the table comprises distance variance data averaged for each horizontal line (height), where “distance” refers to a perpendicular distance from the car 1. A Fast Fourier Transform (FFT) is applied to this height-dependent average variance data in the table to find frequency characteristics of the height component of the image. Such a transformation allows one to find a repeating pattern in the average changes over the height of the building. There is a very low, zero frequency representing the constant background of the building's distance to the car. The next lowest frequency peak in the FFT output data will be caused by patterns of windows and other architectural elements outside the plane of the façade itself and therefore will represent the basic floor pattern of the building. The highest peak in the FFT output data will correspond to the frequency of appearance of windows and other floor associated architectural elements in the façade. From the FFT output data, a size (height) of a floor can be computed, as will be explained now in greater detail with reference to
FIGS. 7a, 7b, and 7c, respectively, show the façade of the last example of
Thus,
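The computation of action 44 may, purely as an example, be sketched as follows. The sketch assumes a complete depth image (a list of rows without missing cells), uses a plain discrete Fourier transform in place of an FFT for brevity, and simply takes the strongest non-zero frequency as the floor frequency, which is a simplification of the peak analysis described above. The function names and the `row_height` parameter (metres per image row) are assumptions.

```python
import cmath


def row_variance_profile(image):
    """Mean 3x3 local variance for each horizontal line of a depth image."""
    rows, cols = len(image), len(image[0])
    profile = []
    for r in range(rows):
        total = 0.0
        for c in range(cols):
            # 3x3 window around (r, c), clipped at the image border.
            window = [image[rr][cc]
                      for rr in range(max(0, r - 1), min(rows, r + 2))
                      for cc in range(max(0, c - 1), min(cols, c + 2))]
            mean = sum(window) / len(window)
            total += sum((v - mean) ** 2 for v in window) / len(window)
        profile.append(total / cols)
    return profile


def estimate_floor_height(profile, row_height):
    """Floor height (metres) from the strongest non-zero frequency."""
    n = len(profile)
    mean = sum(profile) / n
    centred = [v - mean for v in profile]  # suppress the zero frequency

    best_k, best_mag = None, 0.0
    for k in range(1, n // 2 + 1):
        # k-th coefficient of the discrete Fourier transform.
        coeff = sum(v * cmath.exp(-2j * cmath.pi * k * i / n)
                    for i, v in enumerate(centred))
        if abs(coeff) > best_mag:
            best_k, best_mag = k, abs(coeff)

    if best_k is None:
        return None  # flat profile: no repeating pattern found
    # k cycles over n rows correspond to a period of n / k rows.
    return n * row_height / best_k
```

For a synthetic façade with identical floors of ten image rows and a row height of 0.3 m, this sketch returns a floor height of about 3 m.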
C. Action 46: Floor Size Averaging
Action 44 has rendered the floor size (height). In action 46, the floors are averaged. In order to ease the overall process and make it more accurate, only “middle floors” are taken into account. A “middle floor” is defined as a floor with a minimal and maximal height that occurs more than once in the façade of the building. One can also say that “middle floors” are vertically repeatable in the façade. For instance, the lower floor of most buildings is not a “middle floor”: its height is in most cases not within a certain minimal and maximal height as is the case for higher floors (in most buildings the lower floor is higher than other floors). Similarly, the upper floor is, in most cases, not a “middle floor” since its height cannot be established very well with the method above, because there is no floor above the upper floor with windows that can be identified and used.
FIGS. 9a and 9b further clarify this.
By averaging all middle floors, the resolution of the façade data as captured by the laser scanner 3(j) may become about 3 times higher (for instance, it may increase from 10 cm to 3 cm in the given example). In its original resolution, one pixel of the laser output corresponds to 20 pixels of the image as produced by the cameras. That is why the process of increasing resolution is very important when the laser scanner has a lower resolution than the camera (which is typical of currently available equipment). In that manner, a higher resolution mask can be obtained, which later on will be applied to the image.
In a test of a large number of buildings in a European city, it was observed that over ninety percent of these buildings had at least two such similar middle floors. So, the property of high similarity between two or more floors in a building can be used to virtually increase the laser resolution. This is especially important for higher buildings, since laser resolution decreases with building height due to the angular nature of laser measurements. Moreover, averaging allows reconstructing places where trees were in front of part of the façade. This is because trees and other obstacles are filtered out in action 42, and as such these points are not taken into the average. Action 46 delivers a “floor pattern”.
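The averaging of middle floors may be illustrated by the following minimal sketch. It assumes that the depth image has already been resampled to a common grid in which every floor spans the same number of rows, and that cells filtered out in action 42 are stored as `None`. The sketch only shows how obstructed cells are filled from other floors; the gain in resolution described above, which relies on laser samples of different floors falling at different positions within a floor, is not modelled. The function name `average_floors` and its parameters are assumptions.

```python
def average_floors(depth_image, rows_per_floor, middle_floors):
    """Cell-wise average of the given middle floors, ignoring None cells.

    depth_image is a list of rows, floor f occupying rows
    f * rows_per_floor up to (f + 1) * rows_per_floor - 1.
    """
    cols = len(depth_image[0])
    pattern = []
    for r in range(rows_per_floor):
        row = []
        for c in range(cols):
            # Same relative cell taken from every middle floor.
            samples = [depth_image[f * rows_per_floor + r][c]
                       for f in middle_floors]
            valid = [s for s in samples if s is not None]
            # A cell stays empty only if it is obstructed on every floor.
            row.append(sum(valid) / len(valid) if valid else None)
        pattern.append(row)
    return pattern
```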
D. Action 48: Extraction of Median Based Library Elements from Laser Scan
a shows the same laser scan as
E. Action 50: Creating a Multi Floor Mask
Then, the image is separated into two parts or masks. The first part contains all samples associated with the mode value and is used as a mask indicating the wall of the façade. The second part contains all samples associated with non-mode values and is used as a mask for deriving library elements from the pictures taken by the camera(s) 9(i). Note that both masks contain samples where each sample contains location information.
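As a non-limiting illustration, the separation into a wall mask and a library element mask may be sketched as follows. The sketch assumes that the mode is taken over depth values quantised with a step of, for instance, 0.05 m, and that the masks are stored as boolean grids whose row and column indices carry the location information; the quantisation step and the function name `split_masks` are assumptions.

```python
from collections import Counter


def split_masks(floor_pattern, step=0.05):
    """Split a depth pattern into a wall mask and a library element mask.

    Returns (mode_depth, wall_mask, element_mask). Cells holding None
    belong to neither mask.
    """
    # Quantise depths so that nearly equal values share one histogram bin.
    quantised = [[None if v is None else round(v / step) for v in row]
                 for row in floor_pattern]
    counts = Counter(q for row in quantised for q in row if q is not None)
    if not counts:
        raise ValueError("floor pattern contains no depth samples")

    # The most frequent depth is taken to be the plane of the wall.
    mode = counts.most_common(1)[0][0]
    wall_mask = [[q == mode for q in row] for row in quantised]
    element_mask = [[q is not None and q != mode for q in row]
                    for row in quantised]
    return mode * step, wall_mask, element_mask
```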
to a person skilled in the art, such a camera (or cameras) and such a laser scanner (or laser scanners) may alternatively be located on an airborne vehicle. The laser scanner may be part of a LIDAR (Light Detection And Ranging or Laser Imaging Detection And Ranging) system.
It is observed that the method as explained above can be performed by processor 11 as instructed by a suitable software program stored in one of the memories 12-15 or stored elsewhere. The present invention relates to the method and to the computer arrangement with such a processor, but also to a computer program product comprising suitable instructions and data for the described method, as well as a data carrier, like a CD, DVD, etc. comprising such a computer program.
Number | Date | Country | Kind |
---|---|---|---|
PCT/NL2006/050259 | Oct 2006 | WO | international |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/NL2006/050265 | 10/20/2006 | WO | 00 | 11/30/2009 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2008/044914 | 4/17/2008 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20060152522 | Strassenburg-Kleciak et al. | Jul 2006 | A1 |
20060188143 | Strassenburg-Kleciak et al. | Aug 2006 | A1 |
20080111815 | Graves et al. | May 2008 | A1 |
Number | Date | Country |
---|---|---|
2005251035 | Sep 2005 | JP |
2005532631 | Oct 2005 | JP |
WO 2007045272 | Apr 2007 | WO |
Entry |
---|
XP005225768—Madhavan et al: “A computer vision based approach for 3D building modelling of airborne laser scanner DSM data” Computers Environment and Urban Systems, New York, NY, US, vol. 30 No. 1, Jan. 2006. |
Number | Date | Country | |
---|---|---|---|
20100104141 A1 | Apr 2010 | US |