There are various needs for understanding the shape and size of cavity surfaces, such as body cavities. For example, hearing aids, hearing protection, custom head phones, and wearable computing devices may require impressions of a patient's ear canal. To construct an impression of an ear canal, audiologists may inject a silicone material into a patient's ear canal, wait for the material to harden, and then provide the mold to manufacturers who use the resulting silicone impression to create a custom fitting in-ear device. As may be appreciated, the process is slow, expensive, and unpleasant for the patient as well as a medical professional performing the procedure.
Computer vision and photogrammetry generally relates to acquiring and analyzing images in order to produce data by electronically understanding an image using various algorithmic methods. For example, computer vision may be employed in event detection, object recognition, motion estimation, and various other tasks.
Many aspects of the present disclosure can be better understood with reference to the following drawings. The components in the drawings are not necessarily to scale, with emphasis instead being placed upon clearly illustrating the principles of the disclosure. Moreover, in the drawings, like reference numerals designate corresponding parts throughout the several views.
The present disclosure relates to a mobile scanning device configured to scan and generate images and reconstructions of surfaces. Advancements in computer vision permit imaging devices, such as conventional cameras, to be employed as sensors useful in determining locations, shapes, and appearances of objects in a three-dimensional space. For example, a position and an orientation of an object in a three-dimensional space may be determined relative to a certain world coordinate system utilizing digital images captured via image capturing devices. As may be appreciated, the position and orientation of the object in the three-dimensional space may be beneficial in generating additional data about the object, or about other objects, in the same three-dimensional space.
For example, scanning devices may be used in various industries to scan objects to generate data pertaining to the objects being scanned. A scanning device may employ an imaging device, such as a camera, to determine information about the object being scanned, such as the size, shape, or structure of the object, the distance of the object from the scanning device, etc.
As a non-limiting example, a scanning device may include an otoscanner configured to visually inspect or scan the ear canal of a human or animal. An otoscanner may comprise one or more cameras that may be beneficial in generating data about the ear canal subject of the scan, such as the size, shape, or structure of the ear canal. This data may be used in generating three-dimensional reconstructions of the ear canal that may be useful in customizing in-ear devices, for example but not limited to, hearing aids or wearable computing devices.
Determining the size, shape, or structure of an object subject to a scan, may require information about a position of the object relative to the scanning device conducting the scan. For example, during a scan, a distance of an otoscanner from an ear canal may be beneficial in determining the shape of the ear canal. An estimated position of the scanning device relative to the object being scanned (i.e., the pose estimate) may be generated using various methods, as will be described in greater detail below.
According to one embodiment, determining an accurate pose estimate for a scanning device (e.g., an otoscanner) may comprise employing one or more fiducial markers to be imaged via one or more imaging devices in data communication with the scanning device. By being imaged via the imaging devices, the fiducial marker may act as a point of reference or as a measure in estimating a pose (or position) of the scanning device. A fiducial marker may comprise, for example, a circle-of-dots fiducial marker comprising a plurality of machine-identifiable regions (also known as “blobs”), as will be described in greater detail below. In other embodiments, the tracking targets may be naturally occurring features surrounding and/or within the cavity to be scanned.
As a scanning device is performing a scan of an object, the one or more imaging devices may generate one or more digital images. The digital images may be analyzed for the presence of at least a portion of the one or more circle-of-dots fiducial markers. Subsequently, an identified portion of the one or more circle-of-dots fiducial markers may be analyzed and used in determining a relatively accurate pose estimate for the scanning device. The pose estimate may be used in generating three-dimensional reconstructions of an ear canal, as will be described in greater detail below.
In the following discussion, a general description of the system and its components is provided, followed by a discussion of the operation of the same.
With reference to
The hand grip 106 may be configured such that the length is long enough to accommodate large hands and the diameter is small enough to provide enough comfort for smaller hands. A trigger 121, located within the hand grip 106, may perform various functions such as initiating a scan of a surface, controlling a user interface rendered in the display, and/or otherwise modifying the function of the scanning device 100.
The scanning device 100 may further comprise a cord 124 that may be employed to communicate data signals to external computing devices and/or to power the scanning device 100. As may be appreciated, the cord 124 may be detachably attached to facilitate the mobility of the scanning device 100 when held in a hand via the hand grip 106. According to various embodiments of the present disclosure, the scanning device 100 may not comprise a cord 124, thus acting as a wireless and mobile device capable of wireless communication.
The probe 109 mounted onto the scanning device 100 may be configured to guide light received at a proximal end of the probe 109 to a distal end of the probe 109 and may be employed in the scanning of a surface cavity, such as an ear canal, by placing the probe 109 near or within the surface cavity. During a scan, the probe 109 may be configured to project a 360-degree ring onto the cavity surface and capture reflections from the projected ring to reconstruct the image, size, and shape of the cavity surface. In addition, the scanning device 100 may be configured to capture video images of the cavity surface by projecting video illuminating light onto the cavity surface and capturing video images of the cavity surface.
The fan light element 112 mounted onto the scanning device 100 may be configured to emit light in a fan line for scanning an outer surface. The fan light element 112 comprises a fan light source projecting light onto a single element lens to collimate the light and generate a fan line for scanning the outer surface. By using triangulation of the reflections captured when projected onto a surface, the imaging sensor within the scanning device 100 may reconstruct the scanned surface.
Referring next to
Turning now to
In the examples of
Further, the display screen 118 is coupled for data communication to the imaging devices 115 (not shown). The display screen 118 may be configured to display and/or render images of the scanned surface. The displayed images may include digital images or video of the cavity captured by the probe 109 and the fan light element 112 (not shown) as the probe 109 is moved within the cavity. The displayed images may also include real-time constructions of three-dimensional images corresponding to the scanned cavity. The display screen 118 may be configured, either separately or simultaneously, to display the video images and the three-dimensional images, as will be discussed in greater detail below.
According to various embodiments of the present disclosure, the imaging devices 115 of
Moving on to
Referring next to
In the second portion 303b, a real-time three-dimensional reconstruction of the object being scanned may be rendered, providing the operator of the scanning device 100 with an estimate regarding what portion of the surface cavity has been scanned. For example, the three-dimensional reconstruction may be non-existent as a scan of a surface cavity is initiated by the operator. As the operator progresses in conducting a scan of the surface cavity, a three-dimensional reconstruction of the surface cavity may be generated portion-by-portion, progressing into a complete reconstruction of the surface cavity at the completion of the scan. In the non-limiting example of
A three-dimensional reconstruction of an ear canal 309 may be generated via one or more processors internal to the scanning device 100, external to the scanning device 100, or a combination thereof. Generating the three-dimensional reconstruction of the object subject to the scan may require information related to the pose of the scanning device 100. The three-dimensional reconstruction of the ear canal 309 may further comprise, for example, a probe model 310 emulating a position of the probe 109 relative to the surface cavity being scanned by the scanning device. Determining the information that may be used in the three-dimensional reconstruction of the object subject to the scan and the probe model 310 will be discussed in greater detail below.
A notification area 312 may provide the operator of the scanning device with notifications, whether assisting the operator with conducting a scan or warning the operator of potential harm to the object being scanned. Measurements 315 may be rendered in the display to assist the operator in conducting scans of surface cavities at certain distances and/or depths. A bar 318 may provide the operator with an indication of which depths have been thoroughly scanned as opposed to which depths or distances remain to be scanned. One or more buttons 321 may be rendered at various locations of the user interface permitting the operator to initiate a scan of an object and/or manipulate the user interface presented on the display screen 118 or other display in data communication with the scanning device 100. According to one embodiment, the display screen 118 comprises a touch-screen display and the operator may engage button 321 to pause and/or resume an ongoing scan.
Although portion 303a and portion 303b are shown simultaneously in a side-by-side arrangement, other embodiments may be employed without deviating from the scope of the user interface. For example, portion 303a may be rendered in the display screen 118 on the scanning device 100 and portion 303b may be located on a display external to the scanning device 100, and vice versa.
Turning now to
According to various embodiments of the present disclosure, a circle-of-dots 406 may comprise, for example, a combination of uniformly or variably distributed large dots and a small dots that, when detected, represent a binary number. For example, in the event seven dots in a circle-of-dots 406 are detected in a digital image, the sequence of seven dots may be analyzed to identify (a) the size of the dots and (b) a number or other identifier corresponding to the arrangement of the dots. Detection of a plurality of dots in a digital image may be employed using known region- or blob-detection techniques, as may be appreciated.
As a non-limiting example, a sequence of seven dots comprising small-small-large-small-large-large-large may represent an identifier represented as a binary number of 0-0-1-0-1-1-1 (or, alternatively, 1-1-0-1-0-0-0). The detection of this arrangement of seven dots, represented by the corresponding binary number, may be indicative of a pose of the scanning device 100 relative to the fiducial marker 403. For example, a lookup table may be used to map the binary number to a pose estimate, providing at least an initial estimated pose that may be refined and/or supplemented using information inferred via one or more camera models, as will be discussed in greater detail below. Although the example described above employs a binary operation using a combination of small dots and large dots to form a circle-of-dots 406, variable size dots (having, for example, β sizes) may be employed using variable base numeral systems (for example, a base-β numeral system).
The arrangement of dots in the second circle-of-dots 406b may be the same as the first circle-of-dots 406a, or may vary. If the second circle-of-dots 406b comprises the same arrangement of dots as the first circle-of-dots 406a, then the second circle-of-dots 406b may be used independently or collectively (with the first circle-of-dots 406a) to determine an identifier indicative of the pose of the scanning device 100. Similarly, the second circle-of-dots 406b may be used to determine an error of the pose estimate determined via the first circle-of-dots 406a, or vice versa.
Accordingly, a fiducial marker 403 may be placed relative to the object being scanned to facilitate in accurate pose estimation of the scanning device 100. In the non-limiting example of
In other embodiments, a fiducial marker may not be needed, as the tracking targets may be naturally occurring features surrounding and/or within the cavity to be scanned detectable by employing various computer vision techniques. For example, assuming that a person's ear is being scanned by the scanning device 100, the tracking targets may include, hair, folds of the ear, skin tone changes, freckles, moles, and/or any other naturally occurring feature on the person's head relative to the ear.
Moving on to
Referring next to
Initially, a scanning device 100 may be calibrated using the imaging devices 115 to capture calibration images of a calibration object whose geometric properties are known. By employing the camera model of
In the camera model of
A world coordinate system 609 with principal point O may be defined separately from the camera coordinate system as XO, YO, ZO. According to various embodiments, the world coordinate system 609 may be defined at a base location of the probe 109 of the scanning device 100, however, it is understood that various locations of the scanning device 100 may be used as the base of the world coordinate system 609. Motion between the camera coordinate system and the world coordinate system 609 is defined by a rotation R, a translation t, a tilt φ. A principal point p is defined as the origin of a normalized image coordinate system (x, y) and a pixel image coordinate system is defined as (u, v), wherein α is
in a conventional orthogonal pixel coordinate axes. The mapping of a three-dimensional point X to the digital image m is represented via:
Further, the camera model of
r(θ)=1+k2θ3+k3θ5+k4θ7+ . . . (eq. 3)
As eq. 3 shows a polynomial with four terms up to the seventh power of θ, the polynomial of eq. 3 provides enough degrees of freedom (e.g., six degrees of freedom) for a relatively accurate representation of various projection curves that may be produced by a lens of an imaging device 115. Other polynomial equations with lower or higher orders or other combinations of orders may be used.
Turning now to
The placement of two imaging devices 115 permits computations of positions using epipolar geometry. For example, when the first imaging device 115a and the second imaging device 115b view a three-dimensional scene from their respective positions (different from the other imaging device 115), there are geometric relations between the three-dimensional points and their projections on two-dimensional images that lead to constraints between the image points. These geometric relations may be modeled via the camera model of
By determining the internal parameters and external parameters for each imaging device 115 via the camera model of
In addition, the placement of the two imaging device 115 in the scanning device 100 may be beneficial in implementing computer stereo vision. For example, both imaging devices 115 can capture digital images of the same scene; however, they are separated by a distance 709. A processor in data communication with the imaging devices 115 may compare the images by shifting the two images together over the top of each other to find the portions that match to generate a disparity used to calculate a distance between the scanning device 100 and the object of the picture. However, implementing the camera model of
Moving on to
Referring next to
Beginning with 903, a digital image comprising data corresponding to at least a portion of fiducial marker 403 (
As the digital image will be analyzed using one or more region- or blob-detection techniques, it may be beneficial to prepare a digital image for blob-detection. In 906, the digital image accessed in 903 may be pre-processed according to predefined parameters (e.g., internal and external parameters, discussed above). Pre-processing a digital image according to predefined parameters may comprise, for example, applying filters and/or modifying chroma, luminescence, and/or other features of the digital image. In addition, pre-processing may further comprise, for example, removing speckles or extraneous artifacts from the digital image, removing partial dots from the digital image, etc.
As discussed above, in 909, blob detection may be employed to identify: (a) the presence of a fiducial marker in the digital image; and (b) if the fiducial marker is present in the digital image, identify dots in a circle-of-dots 406 (or other arrangement), as depicted in
For example, a sequence of seven dots comprising small-small-large-small-large-large-large may represent a binary number of 0-0-1-0-1-1-1 (or, alternatively, 1-1-0-1-0-0-0). The detection of this sequence of seven dots, represented by the binary number, is indicative of a pose of the scanning device 100 relative to the fiducial marker 403. According to one embodiment, a lookup table may be used to map the binary number to a pose estimate, providing at least an initial pose estimate that may be refined and/or supplemented using information inferred via one or more camera models, as will be discussed in 912. According to various embodiments, the initial pose estimate may provide enough information to determine six degrees of freedom of the scanning device 100. As more dots are identified, a more approximate identifier may be determined indicating a more approximate pose estimate of the scanning device 100.
Next, in 912, world and image points may be computed to refine and/or supplement the information determined from the fiducial marker 403. According to one embodiment, the camera model of
In 915, the world and image points may be used in an initial pose of the scanning device 100 (i.e., the pose estimate). For example, an identifier determined from at least a portion of an identifier identified in a digital image may be indicative of a pose estimate of the scanning device. Similarly, after a determination of the external parameters and internal parameters for one or more imaging devices 115 has been determined via a camera model, a pose estimate of the scanning device 100 may be determined relative to a world coordinate system 609 (
In 918, the pose estimate may be refined. For example, a second digital image of the fiducial marker 403 comprising one or more circle-of-dots 406 captured via the imaging devices 115, if detected, may be used in refining and/or error checking the computed pose estimate, as shown in 921. In 924, an output of the pose of the scanning device 100 may be transmitted and/or accessed by other components in data communication with the scanning device 100. For example, the pose estimate may be requested from a requesting service such as a service configured to generate a three-dimensional reconstruction of an object being scanned using the scanning device 100. The pose estimate may provide information beneficial in the three-dimensional reconstruction of the object, such as the distance of the scanning device 100 relative to a surface cavity being scanned by the scanning device 100.
With reference to
Stored in the memory 1006 are both data and several components that are executable by the processor 1003. In particular, a pose estimate application 900 is stored in the memory 1006 and executable by the processor 1003, as well as other applications. Also stored in the memory 1006 may be a data store 1012 and other data. In addition, an operating system may be stored in the memory 1006 and executable by the processor 1003.
It is understood that there may be other applications that are stored in the memory 1006 and are executable by the processor 1003 as can be appreciated. Where any component discussed herein is implemented in the form of software, any one of a number of programming languages may be employed such as, for example, C, C++, C#, Objective C, Java®, JavaScript®, Perl, PHP, Visual Basic®, Python®, Ruby, Flash®, or other programming languages.
A number of software components are stored in the memory 1006 and are executable by the processor 1003. In this respect, the term “executable” means a program file that is in a form that can ultimately be run by the processor 1003. Examples of executable programs may be, for example, a compiled program that can be translated into machine code in a format that can be loaded into a random access portion of the memory 1006 and run by the processor 1003, source code that may be expressed in proper format such as object code that is capable of being loaded into a random access portion of the memory 1006 and executed by the processor 1003, or source code that may be interpreted by another executable program to generate instructions in a random access portion of the memory 1006 to be executed by the processor 1003, etc. An executable program may be stored in any portion or component of the memory 1006 including, for example, random access memory (RAM), read-only memory (ROM), hard drive, solid-state drive, USB flash drive, memory card, optical disc such as compact disc (CD) or digital versatile disc (DVD), floppy disk, magnetic tape, or other memory components.
The memory 1006 is defined herein as including both volatile and nonvolatile memory and data storage components. Volatile components are those that do not retain data values upon loss of power. Nonvolatile components are those that retain data upon a loss of power. Thus, the memory 1006 may comprise, for example, random access memory (RAM), read-only memory (ROM), hard disk drives, solid-state drives, USB flash drives, memory cards accessed via a memory card reader, floppy disks accessed via an associated floppy disk drive, optical discs accessed via an optical disc drive, magnetic tapes accessed via an appropriate tape drive, and/or other memory components, or a combination of any two or more of these memory components. In addition, the RAM may comprise, for example, static random access memory (SRAM), dynamic random access memory (DRAM), or magnetic random access memory (MRAM) and other such devices. The ROM may comprise, for example, a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), or other like memory device.
Also, the processor 1003 may represent multiple processors 1003 and/or multiple processor cores and the memory 1006 may represent multiple memories 1006 that operate in parallel processing circuits, respectively. In such a case, the local interface 1009 may be an appropriate network that facilitates communication between any two of the multiple processors 1003, between any processor 1003 and any of the memories 1006, or between any two of the memories 1006, etc. The local interface 1009 may comprise additional systems designed to coordinate this communication, including, for example, performing load balancing. The processor 1003 may be of electrical or of some other available construction.
Although the pose estimate application 900, and other various systems described herein may be embodied in software or code executed by general purpose hardware as discussed above, as an alternative the same may also be embodied in dedicated hardware or a combination of software/general purpose hardware and dedicated hardware. If embodied in dedicated hardware, each can be implemented as a circuit or state machine that employs any one of or a combination of a number of technologies. These technologies may include, but are not limited to, discrete logic circuits having logic gates for implementing various logic functions upon an application of one or more data signals, application specific integrated circuits (ASICs) having appropriate logic gates, field-programmable gate arrays (FPGAs), or other components, etc. Such technologies are generally well known by those skilled in the art and, consequently, are not described in detail herein.
The flowchart of
Although the flowchart of
Also, any logic or application described herein, including the pose estimate application 900, that comprises software or code can be embodied in any non-transitory computer-readable medium for use by or in connection with an instruction execution system such as, for example, a processor 1003 in a computer system or other system. In this sense, the logic may comprise, for example, statements including instructions and declarations that can be fetched from the computer-readable medium and executed by the instruction execution system. In the context of the present disclosure, a “computer-readable medium” can be any medium that can contain, store, or maintain the logic or application described herein for use by or in connection with the instruction execution system.
The computer-readable medium can comprise any one of many physical media such as, for example, magnetic, optical, or semiconductor media. More specific examples of a suitable computer-readable medium would include, but are not limited to, magnetic tapes, magnetic floppy diskettes, magnetic hard drives, memory cards, solid-state drives, USB flash drives, or optical discs. Also, the computer-readable medium may be a random access memory (RAM) including, for example, static random access memory (SRAM) and dynamic random access memory (DRAM), or magnetic random access memory (MRAM). In addition, the computer-readable medium may be a read-only memory (ROM), a programmable read-only memory (PROM), an erasable programmable read-only memory (EPROM), an electrically erasable programmable read-only memory (EEPROM), or other type of memory device.
Further, any logic or application described herein, including the pose estimate application 900, may be implemented and structured in a variety of ways. For example, one or more applications described may be implemented as modules or components of a single application. Further, one or more applications described herein may be executed in shared or separate computing devices or a combination thereof. For example, a plurality of the applications described herein may execute in the same scanning device 100, or in multiple computing devices in a common computing environment. Additionally, it is understood that terms such as “application,” “service,” “system,” “engine,” “module,” and so on may be interchangeable and are not intended to be limiting.
Disjunctive language such as the phrase “at least one of X, Y, or Z,” unless specifically stated otherwise, is otherwise understood with the context as used in general to present that an item, term, etc., may be either X, Y, or Z, or any combination thereof (e.g., X, Y, and/or Z). Thus, such disjunctive language is not generally intended to, and should not, imply that certain embodiments require at least one of X, at least one of Y, or at least one of Z to each be present.
It should be emphasized that the above-described embodiments of the present disclosure are merely possible examples of implementations set forth for a clear understanding of the principles of the disclosure. Many variations and modifications may be made to the above-described embodiment(s) without departing substantially from the spirit and principles of the disclosure. All such modifications and variations are intended to be included herein within the scope of this disclosure and protected by the following claims.
This application is related to U.S. patent application Ser. No. ______, filed on Oct. ______, 2013 (Attorney Docket No. 52105-1010) and entitled “Tubular Light Guide,” U.S. patent application Ser. No. ______, filed on Oct. ______, 2013 (Attorney Docket No. 52105-1020) and entitled “Tapered Optical Guide,” U.S. patent application Ser. No. ______, filed on Oct. ______, 2013 (Attorney Docket No. 52105-1030) and entitled “Display for Three-Dimensional Imaging,” U.S. patent application Ser. No. ______, filed on Oct. ______, 2013 (Attorney Docket No. 52105-1040) and entitled “Fan Light Element,” U.S. patent application Ser. No. ______, filed on Oct. ______, 2013 (Attorney Docket No. 52105-1060) and entitled “Integrated Tracking with Fiducial-based Modeling,” U.S. patent application Ser. No. ______, filed on Oct. ______, 2013 (Attorney Docket No. 52105-1070) and entitled “Integrated Calibration Cradle,” and U.S. patent application Ser. No. ______, filed on Oct. ______, 2013 (Attorney Docket No. 52105-1080) and entitled “Calibration of 3D Scanning Device,” all of which are hereby incorporated by reference in their entirety.