Heterogeneous image processing system

Information

  • Patent Grant
  • 8326092
  • Patent Number
    8,326,092
  • Date Filed
    Monday, April 23, 2007
    17 years ago
  • Date Issued
    Tuesday, December 4, 2012
    12 years ago
Abstract
The present invention relates to machine vision computing environments, and more specifically relates to a system and method for selectively accelerating the execution of image processing applications using a hybrid computing system. To this extent, a hybrid system is generally defined as one that is multi-platform, and potentially distributed via a network or other connection. The invention provides a machine vision system and method for executing image processing applications on a hybrid image processing system referred to herein as an image co-processor that comprises (among other things) a plurality of special purpose engines (SPEs) that work to process multiple images in an accelerated fashion.
Description
CROSS-REFERENCE TO RELATED APPLICATION

This application is related in some aspects to commonly owned U.S. patent application Ser. No. 11/738,711 entitled “HETEROGENEOUS IMAGE PROCESSING SYSTEM”, filed concurrently herewith, the entire contents of which are herein incorporated by reference.


FIELD OF THE INVENTION

In general, the present invention relates to image processing/inspection. Specifically, the present invention relates to a heterogeneous image processing system that provides accelerated image processing as compared to previous approaches.


BACKGROUND OF THE INVENTION

Current image processing/inspection systems have limited processing power. Specifically, current systems perform all image processing functions within a single, general-purpose system. The processor used in current image processing/inspection systems is not powerful enough to handle the image processing demands, data rates, and algorithms for much of the current generation of (e.g., manufacturing inspection systems), let alone the next generation of systems. Next-generation manufacturing systems have a need for a fast image processing system in order to complete image inspection within required times. As the size of the inspection area and the amount of gray scale data double, the data per one scan area increases dramatically. Therefore, the image inspection processing time is drastically increased. Thus, the current inspection system(s) will not adequately handle the requirements for future manufacturing systems.


Although, image processing functions are sometimes offloaded to another system, this other system also uses a general purpose processor that fails to actually perform any image processing acceleration. In addition, image processing functions in current systems are tied to a specific processor and platform, making it difficult to offload and accelerate specific functions at a fine-grained level. An example of this is shown in FIG. 1. Specifically, as shown, each image recording device (e.g., camera) 10A-N is linked with a separate and distinct (general purpose) processor 12A-N, that itself is linked/integrated with a separate and distinct node 14A-N (e.g., PC). Such an embodiment fails to provide the accelerated image processing needed by current and emerging image generations.


Whereas the development of a new inspection system will increase cost and development time, it is desirable to use reusable system components without impacting system performance. In view of the foregoing, there exists a need for an approach that solves at least one of the above-referenced deficiencies of the current art.


SUMMARY OF THE INVENTION

In general, the present invention relates to machine vision computing environments, and more specifically relates to a system and method for accelerating the execution of image processing applications using a hybrid computing system. To this extent, a hybrid system is generally defined as one that is multi-platform, and potentially distributed via a network or other connection. The invention provides a machine vision system and method for executing image processing applications on a hybrid image processing system referred to herein as an image co-processor that comprises (among other things) a plurality of special purpose engines (SPEs) that work collectively to process multiple images in an accelerated fashion. Moreover, implementations of the invention provide a machine vision system and method for distributing and managing the execution of image processing applications at a fine-grained level via a switch-connected hybrid system. This method allows one system to be used to manage and control the system functions, and one or more other systems to execute image processing applications. The invention allows the management and control system components to be reused, and the image processing components to be used as an image processing accelerator or image co-processor. The system components can be run using different operating systems, such as Windows (Windows and related terms are trademarks of Microsoft Corp. in the United States and/or other countries), Linux (Linux and related terms are trademarks of Linus Torvalds in the United States and/or other countries), Macintosh (Macintosh and related terms are trademarks of Apple Inc. in the United States and/or foreign countries), etc.


Along these lines, the present invention improves upon existing systems by providing a hybrid system comprising a management processor; a general purpose processor, a set (e.g., one or more) of special purpose engines/accelerators; a closely connected network, a staging storage device; and an archive storage device.


The present invention further provides a multi-platform hybrid system that reuses uses the existing infrastructure including the software assets for the image processing/inspection management and control system, and offloads image processing workloads to a special-purpose accelerator


Further provided is a method for processing image(s), comprising: receiving a request for execution of an image co-processor of an application; and analyzing switches and routing at least a portion of library functions of the image(s)


Still yet, an image processing program product stored on a computer readable medium is provided that furnishes: a set (e.g., one or more) of libraries with selective offload of functions, algorithms, and optimizations to other systems; scalable processing system design(s) that increase and decrease processors when needed; multiple hybrid platform system design(s); and optimizations encapsulated in pluggable modules.


A first aspect of the present invention provides a heterogeneous image processing system, comprising: an image co-processor comprising a plurality of special purpose engines (SPEs), the plurality of SPEs being configured to: receive a plurality of images; and process the plurality of images to determine associated image data.


A second aspect of the present invention provides a heterogeneous image processing method, comprising: receiving a plurality of images in an image co-processor, the image co-processor comprising a plurality of special purpose engines (SPEs); processing the plurality of images with the plurality of SPEs to determine image data associated with the plurality of images; and providing at least one of the image data or the plurality of images to a control processor to determine control information associated with the plurality of images.


A third aspect of the present invention provides a program product stored on computer readable medium for heterogeneously processing an image, the computer readable medium comprising program code for causing a computer system to: receive a plurality of images in an image co-processor, the image co-processor comprising a plurality of special purpose engines (SPEs); and process the plurality of images with the plurality of SPEs to determine image data associated with the plurality of images.


A fourth aspect of the present invention provides a method for deploying a system for heterogeneously processing an image, comprising: providing a computer infrastructure being operable to: receive a plurality of images in an image co-processor, the image co-processor comprising a plurality of special purpose engines (SPEs); and process the plurality of images with the plurality of SPEs to determine image data associated with the plurality of images.


A fifth aspect of the present invention provides computer software embodied in a propagated signal for heterogeneously processing an image, the computer software comprising instructions for causing a computer system to: receive a plurality of images in an image co-processor, the image co-processor comprising a plurality of special purpose engines (SPEs); and process the plurality of images with the plurality of SPEs to determine image data associated with the plurality of images.


A sixth aspect of the present invention provides a data processing system for heterogeneously processing an image, comprising a memory medium comprising instructions; a bus coupled to the memory medium; and a processor coupled to the bus that when executing the instruction causes the data processing system to: receive a plurality of images in an image co-processor, the image co-processor comprising a plurality of special purpose engines (SPEs); and process the plurality of images with the plurality of SPEs to determine image data associated with the plurality of images.


A seventh aspect of the present invention provides computer-implemented business method for heterogeneously processing an image, comprising: receiving a plurality of images in an image co-processor, the image co-processor comprising a plurality of special purpose engines (SPEs); and processing the plurality of images with the plurality of SPEs to determine image data associated with the plurality of images.


An eighth aspect of the present invention provides a heterogeneous image processing system, comprising: an image co-processor comprising: a set of input/output (I/O) processors; a set of multi-core processors (MCPs) coupled to the set of I/O processors; a set of external data representation (XDR) modules coupled to the set of MCPs; and a set of express peripheral component interconnects (PCIs) coupled the set of I/O processors, at lease one of the set of express PCIs receiving image data from a PC system that is external to the image co-processor.





BRIEF DESCRIPTION OF THE DRAWINGS

These and other features of this invention will be more readily understood from the following detailed description of the various aspects of the invention taken in conjunction with the accompanying drawings in which:



FIG. 1 shows a system for processing images according to the related art.



FIG. 2 shows a heterogeneous, hybrid image processing/inspection system according to the present invention.



FIG. 3 shows an approach for integrating the system of FIG. 2 with an existing system according to the present invention.



FIG. 4 shows an approach for a heterogeneous, hybrid image processing/inspection system having parallel processors according to the present invention.



FIG. 5 shows a flow diagram for an illustrative software process according to the present invention.





The drawings are not necessarily to scale. The drawings are merely schematic representations, not intended to portray specific parameters of the invention. The drawings are intended to depict only typical embodiments of the invention, and therefore should not be considered as limiting the scope of the invention. In the drawings, like numbering represents like elements.


DETAILED DESCRIPTION OF THE INVENTION

As indicated above, the present invention relates to machine vision computing environments, and more specifically relates to a system and method for selectively accelerating the execution of image processing applications using a hybrid computing system. To this extent, a hybrid system is generally defined as one that is multi-platform, and potentially distributed via a network or other connection. The invention provides a machine vision system and method for executing image processing applications on a hybrid image processing system referred to herein as an image co-processor (also referred to herein as a “cell”) that comprises (among other things) a plurality of special purpose engines (SPEs) that work to process multiple images in an accelerated fashion. Moreover, implementations of the invention provide a machine vision system and method for distributing and managing the execution of image processing applications at a fine-grained level via a switch-connected hybrid system. This method allows one system to be used to manage and control the system functions, and one or more other systems to execute image processing applications. The invention allows the management and control system components to be reused, and the image processing components to be used as an image processing accelerator or image co-processor. The system components can be run using different operating systems, such as Windows (Windows and related terms are trademarks of Microsoft Corp. in the United States and/or other countries), Linux (Linux and related terms are trademarks of Linus Torvalds in the United States and/or other countries), Macintosh (Macintosh and related terms are trademarks of Apple Computer Inc. in the United States and/or other countries), etc.


Referring now to FIG. 2, a heterogeneous image processing/inspection system according to the present invention is shown. As depicted, image recordation mechanisms (e.g., cameras) 58A-N record images and are attached to an image co-processor 52 (via one more image frame acquisition mechanisms 56 for image processing). The image co-processor 52 in turn is connected to a control processor 50. These components are connected in a single, monolithic, tightly integrated system. All image processing is done completely within the single system. Each component can only be used with a limited set of other components. Each component, and thus the entire system, can only run a single operating system. The current image inspection system can be used in a manufacturing line to detect defects in items such as LCD panels or semiconductor wafers. The system performs one or more scans to detect defect points. Image analysis is conducted on a magnified version of each defect point. A single Field of View (FOV) has multiple inspection threads, which run different algorithms, potentially at the same time for different areas in an image. One algorithm may take significantly longer to run than others. Image processing software libraries are used to implement the algorithms. Large amounts of image and log data need to be moved, processed, and stored during inspection, requiring high I/O speeds and bandwidth.


This new design approach is a processing/inspection system based on hybrid, reusable components/systems 54A-N that are combined with special purpose engines/accelerators. Image processing applications use algorithms that often have specialized functions that can benefit from special purpose processors. These special purpose processors can be used as accelerators to speed up image processing algorithms in a fine-grained, selective fashion that takes advantage of the strengths of both general purpose and special purpose processors. Thus, the present invention, combines image recording mechanisms/devices 58A-N such as cameras with a special purpose processor for image processing as well as a general processor 50 for determining control information.


In a typical embodiment, images are received by hybrid systems 54A-N of image co-processor 52, which process the images to determine image data. This image data (and optionally the images themselves) are then communicated to control processor 50 and staging storage unit 60. Control processor then processes the image data to determine control information. The images, image data, and/or control information can then be stored in archive storage unit 62.


Referring now to FIG. 3, a more detailed diagram of the system of the present invention as well as its integration with existing system is shown in greater detail. As depicted, image grabber 56 acquires images from one or more image recordation mechanisms 58A and passes the image(s) to an input/output (I/O) processor 62. As depicted I/O processor 62 generally includes a set of express peripheral component interconnects (PCIs) 68A-N, a pure load balancer (PLB) 64 coupled to the set of express PCIs 68A-N; and a network interface 66 (e.g., GbE) coupled to the PLB 64 for interfacing with at least one legacy application 32 in IA-based PC 14A. An I/O buffer 60 is also shown coupled to the I/O processor 62.


Further shown within image co-processor 52 is a power processing element (PPE) 76, an element interconnect bus (EIB) 74 coupled to the PPE, and a set (e.g., one or more) but typically a plurality of special purpose engines (SPEs) 54A-N. SPEs 54A-N share the load involved with processing image(s) into image data. The division of work among SPEs 54A-N was not previously performed, and hence, previous system are not suitable for current day and future image technology. As further shown, SPEs 54A-N feed image data, image processing routines, arithmetic/statistical operations, inspect processes, etc. to main memory 70 (which could be realized as staging storage unit 60 of FIG. 2.). Image co-processor 52 will leverage legacy application in IA-based PC 14A to have general purpose or control processor 24 process the image or image data to determine control information.


As further depicted, IA-based PC system 14A of the related art obtains an image from image recordation mechanism 10A via image grabber 20, and passes the image to a general purpose image processor 24 for processing (e.g., utilizing image buffer 22). This sparsely processed image data is then passed to bridge chip 26, IA CPU 30, and (DDR) main memory 28. As can be seen, the previous system utilizes only a single general-purpose processor to process the image. Whereas, the present invention utilizes an image co-processor having a plurality of SPEs 54A-N as well as general purpose control processor 24 of IA-based PC system 14A. This is accomplished by communicated through legacy application(s) 32 in IA-based PC system 14A. Thus, the present invention not only provides improved and accelerated image processing, but it does so by utilizing existing infrastructure. It should be noted that the heterogeneous image processing system of the present invention is operable with multiple different computing platforms (e.g., Windows, Linux, etc.).



FIG. 4 shows another embodiment of the present invention. Specifically, FIG. 4 depicts the convergence of a host machine with PCI express to accommodate in multiple image grabbers 20A-N in IA-based PC system 14A, as well as parallel multi-core processors (MCPs) 84A-N within in image co-processor 52 (MCPs 84A-N can be media communications processors, which are a type of processor specifically designed for the creation and distribution of digital media/content.). Specifically, image recordation mechanisms 10A-N will capture images and feed the same to separate image grabbers 20A-N, which utilize existing O/S image grabber driver images and an existing O/S library for image grabbers 20A-N. Then, using PCI express host interface board (PCIe HIB 34), the image is communicated to bridge chip 26 for processing and placement in temporary/staging storage unit 60. As further shown, bridge chip 26 is coupled to (DDR) main memory 28 and IA CPU 30. Switch 64 couples IA-based PC system 14A to image co-processor 52 via PCIe HIBs 34 and 68A. As shown, image co-processor includes parallel PCIe HIBs 68A-N, I/O processors 62A-N, MCPs 84A-N, and external data representation (XDR) modules 82A-N. A gigabit Ethernet (GbE) network interface and switch 80 can then be used to couple image co-processor 52 and IA-based PC system 14A to control processor 50, and archive storage 62. As further shown in FIG. 4, library functions are written for MCPs 84A-N and off-loaded by (e.g., x86) O/S to image co-processor 52 at runtime.


Along these lines, the library developed for the present invention typically has at least one of the following features (among others):


(1) It is typically structured as a reusable library with many useful functions and algorithms available.


(2) It is useable cross-platform (Windows, Linux, Mac).


(3) It is optimized for a specific processor architecture, but optimizations are encapsulated in a separate library. Architecture supports plugging in of other optimized libraries, such as one for cell.


(4) It includes wrappers for scripting languages such as Python, and graphical user interfaces (GUIs) to make rapid prototyping easier.


Referring now to FIG. 5, a method flow diagram of an illustrative software implementation of the present invention is shown. As depicted, in step S1, the process is started with the OpenCV open source library. In step S2, OpenCV running on x86 is obtained with a frame grabber and a camera (or other image recording device). In a typical embodiment, no cell-specific optimizations are present at this point. In step S3, underlying pieces of OpenCV are offloaded to the cell (image co-processor 52) incrementally as required by necessary algorithms. In this step, information about what algorithms and APIs are required could be needed. In step S4, the offloaded pieces are optimized, with the process repeating to step S3.


As can be seen, the present invention improves upon existing systems by providing a hybrid system comprising a management processor; a general purpose processor, a set (e.g., one or more) of special purpose engines/accelerators; a closely connected network, a staging storage device; and an archive storage device.


The present invention further provides a multi-platform hybrid system that reuses uses the existing infrastructure including the software assets for the image processing/inspection management and control system, and offloads image processing workloads to a special-purpose accelerator


Further provided is a method for processing image(s), comprising: receiving a request for execution of an image co-processor of an application; and analyzing switches and routing at least a portion of library functions of the image(s)


Still yet, an image processing program product stored on a computer readable medium is provided that furnishes: a set (e.g., one or more) of libraries with selective offload of functions, algorithms, and optimizations to other systems; scalable processing system design(s) that increase and decrease processors when needed; multiple hybrid platform system design(s); and optimizations encapsulated in pluggable modules.


It should be understood that the present invention could be deployed within a computer infrastructure. This is intended to demonstrate, among other things, that the present invention could be implemented within a network environment (e.g., the Internet, a wide area network (WAN), a local area network (LAN), a virtual private network (VPN), etc.), or on a stand-alone computer system. In the case of the former, communication throughout the network can occur via any combination of various types of communications links. For example, the communication links can comprise addressable connections that may utilize any combination of wired and/or wireless transmission methods. Where communications occur via the Internet, connectivity could be provided by conventional TCP/IP sockets-based protocol, and an Internet service provider could be used to establish connectivity to the Internet. Still yet, the computer infrastructure is intended to demonstrate that some or all of the components of such an implementation could be deployed, managed, serviced, etc. by a service provider who offers to implement, deploy, and/or perform the functions of the present invention for others.


Where hardware is provided, it is understood that such any computers utilized will include standard elements such as a processing unit, a memory medium, a bus, and input/output (I/O) interfaces. Further, such computer systems can be in communication with external I/O devices/resources. In general, processing units execute computer program code, such as the software and functionality described above, which is stored in memory. While executing computer program code, the processing unit can read and/or write data to/from memory, I/O interfaces, etc. The bus provides a communication link between each of the components in a computer. External devices can comprise any device (e.g., keyboard, pointing device, display, etc.) that enable a user to interact with the computer system and/or any devices (e.g., network card, modem, etc.) that enable the computer to communicate with one or more other computing devices.


The hardware used to implement the present invention can comprise any specific purpose computing article of manufacture comprising hardware and/or computer program code for performing specific functions, any computing article of manufacture that comprises a combination of specific purpose and general purpose hardware/software, or the like. In each case, the program code and hardware can be created using standard programming and engineering techniques, respectively. Moreover, the processing unit therein may comprise a single processing unit, or be distributed across one or more processing units in one or more locations, e.g., on a client and server. Similarly, the memory medium can comprise any combination of various types of data storage and/or transmission media that reside at one or more physical locations. Further, the I/O interfaces can comprise any system for exchanging information with one or more external device. Still further, it is understood that one or more additional components (e.g., system software, math co-processing unit, etc.) can be included in the hardware.


While shown and described herein as a heterogeneous image processing system and method, it is understood that the invention further provides various alternative embodiments. For example, in one embodiment, the invention provides a computer-readable/useable medium that includes computer program code to enable a computer infrastructure to heterogeneously process images. To this extent, the computer-readable/useable medium includes program code that implements the process(es) of the invention. It is understood that the terms computer-readable medium or computer useable medium comprises one or more of any type of physical embodiment of the program code. In particular, the computer-readable/useable medium can comprise program code embodied on one or more portable storage articles of manufacture (e.g., a compact disc, a magnetic disk, a tape, etc.), on one or more data storage portions of a computing device (e.g., a fixed disk, a read-only memory, a random access memory, a cache memory, etc.), and/or as a data signal (e.g., a propagated signal) traveling over a network (e.g., during a wired/wireless electronic distribution of the program code).


In another embodiment, the invention provides a business method that performs the process of the invention on a subscription, advertising, and/or fee basis. That is, a service provider, such as a Solution Integrator, could offer to heterogeneously process images. In this case, the service provider can create, maintain, support, etc., a computer infrastructure, such as computer infrastructure that performs the process of the invention for one or more customers. In return, the service provider can receive payment from the customer(s) under a subscription and/or fee agreement and/or the service provider can receive payment from the sale of advertising content to one or more third parties.


In still another embodiment, the invention provides a computer-implemented method for heterogeneously processing images. In this case, a computer infrastructure, such as a computer infrastructure can be provided and one or more systems for performing the process of the invention can be obtained (e.g., created, purchased, used, modified, etc.) and deployed to the computer infrastructure. To this extent, the deployment of a system can comprise one or more of: (1) installing program code on a computing device from a computer-readable medium; (2) adding one or more computing devices to the computer infrastructure; and (3) incorporating and/or modifying one or more existing systems of the computer infrastructure to enable the computer infrastructure to perform the process of the invention.


As used herein, it is understood that the terms “program code” and “computer program code” are synonymous and mean any expression, in any language, code or notation, of a set of instructions intended to cause a computing device having an information processing capability to perform a particular function either directly or after either or both of the following: (a) conversion to another language, code or notation; and/or (b) reproduction in a different material form. To this extent, program code can be embodied as one or more of: an application/software program, component software/a library of functions, an operating system, a basic I/O system/driver for a particular computing and/or I/O device, and the like.


A data processing system suitable for storing and/or executing program code can be provided hereunder and can include at least one processor communicatively coupled, directly or indirectly, to memory element(s) through a system bus. The memory elements can include, but are not limited to, local memory employed during actual execution of the program code, bulk storage, and cache memories that provide temporary storage of at least some program code in order to reduce the number of times code must be retrieved from bulk storage during execution. Input/output or I/O devices (including, but not limited to, keyboards, displays, pointing devices, etc.) can be coupled to the system either directly or through intervening I/O controllers.


Network adapters also may be coupled to the system to enable the data processing system to become coupled to other data processing systems, remote printers, storage devices, and/or the like, through any combination of intervening private or public networks. Illustrative network adapters include, but are not limited to, modems, cable modems and Ethernet cards.


The foregoing description of various aspects of the invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and obviously, many modifications and variations are possible. Such modifications and variations that may be apparent to a person skilled in the art are intended to be included within the scope of the invention as defined by the accompanying claims.

Claims
  • 1. A heterogeneous image processing system, comprising: an image co-processor comprising a plurality of distinct special purpose engines (SPEs), the image co-processor being configured to:receive a plurality of images collectively and substantially simultaneously recorded by a plurality of image recordation mechanisms, each image recordation mechanism providing at least one of the plurality of images for acquisition by one of a plurality of image grabbers, each image grabber sending at least one of the plurality of images to one of a plurality of peripheral component interconnects, each of the plurality of peripheral component interconnects sending at least one of the plurality of images to a pure load balancer;distribute distinct portions of the plurality of images across the plurality of SPEs, wherein the distinct portions are different views of an object recorded at substantially a same time by at least two of the plurality of image recordation mechanisms;collectively and substantially simultaneously process the distinct portions of the plurality of images at the plurality of SPEs to determine associated image data; andreceive a selective off-load of at least one library function at a run time, wherein the at least one library function is written for a plurality of parallel multi-core processors, wherein the at least one library function supports the processing of the distinct portions of the plurality of images at the plurality of SPEs, and wherein the at least one library function is one of at least one algorithm, an optimization for a processor architecture, and a wrapper for a scripting language.
  • 2. The heterogeneous image processing system of claim 1, further comprising a control processor being configured to receive the image data from the image co-processor, and to determine control information for the plurality of images.
  • 3. The heterogeneous image processing system of claim 2, further comprising: a staging storage unit for storing the image data for use by the control processor; andan archive storage unit for storing at least one of the image data or the control information.
  • 4. The heterogeneous image processing system of claim 1, further comprising an input/output (I/O) processor, the I/O processor comprising: a set of express peripheral component interconnects (PCIs);a pure load balancer (PLB) coupled to the set of express PCIs; anda network interface coupled to the PLB for interfacing with at least one legacy application.
  • 5. The heterogeneous image processing system of claim 4, the I/O processor receiving the plurality of images from an image grabber, and providing the plurality of images to the image co-processor.
  • 6. The heterogeneous image processing system of claim 1, the image co-processor further comprising: a power processing element (PPE); andan element interconnect bus (EIB) coupled to the PPE and the plurality of SPEs.
  • 7. The heterogeneous image processing system of claim 1, the SPEs comprising accelerators that increase a rate of processing of the plurality of images.
  • 8. The heterogeneous image processing system of claim 1, the heterogeneous image processing system being operable with multiple different computing platforms.
  • 9. The heterogeneous image processing system of claim 1, wherein the different views comprise different angles of view.
  • 10. The heterogeneous image processing system of claim 1, wherein the different views comprise different perspectives of view.
  • 11. A heterogeneous image processing method, comprising: receiving a plurality of images in an image co-processor, the image co-processor comprising a plurality of special purpose engines (SPEs), wherein the plurality of images include images collectively and substantially simultaneously recorded by a plurality of image recordation mechanisms, each image recordation mechanism providing at least one of the plurality of images for acquisition by one of a plurality of image grabbers, each image grabber sending at least one of the plurality of images to one of a plurality of peripheral component interconnects, each of the plurality of peripheral component interconnects sending at least one of the plurality of images to a pure load balancer;distributing distinct portions of the plurality of images across the plurality of SPEs, wherein the distinct portions are different views of an object recorded at substantially a same time by at least two of the plurality of image recordation mechanisms;processing the distinct portions of the plurality of images collectively and substantially simultaneously with the plurality of SPEs to determine image data associated with the distinct portions of the plurality of images;providing at least one of the image data or the distinct portions of the plurality of images to a control processor to determine control information associated with the distinct portions of the plurality of images; andreceiving a selective off-load of at least one library function at a run time, wherein the at least one library function is written for a plurality of parallel multi-core processors, wherein the at least one library function supports the processing of the distinct portions of the plurality of images at the plurality of SPEs, and wherein the at least one library function is one of at least one algorithm, an optimization for a processor architecture, and a wrapper for a scripting language.
  • 12. The heterogeneous image processing method of claim 11, further comprising: determining, the control information with the control processor.
  • 13. The heterogeneous image processing method of claim 12, further comprising: storing the image data in a staging storage unit for processing by the control processor to determine control information; andstoring at least one of the image data or the control information in an archive storage unit.
  • 14. The heterogeneous image processing method of claim 11, further comprising: receiving the plurality of images in an input/output (I/O) processor, the I/O processor comprising:a set of express peripheral component interconnects (PCIs);a pure load balancer (PLB) coupled to the set of express PCIs; anda network interface coupled to the PLB for interfacing with at least one legacy application.
  • 15. The heterogeneous image processing method of claim 14, the I/O processor receiving the plurality of images from an image grabber, and providing the plurality of images to the plurality of SPEs.
  • 16. The heterogeneous image processing method of claim 11, the image co-processor further comprising: a power processing element (PPE); andan element interconnect bus (EIB) coupled to the PPE and the plurality of SPEs.
  • 17. The heterogeneous image processing method of claim 11, the plurality of SPEs comprising accelerators that increase a rate of processing of the plurality of images.
  • 18. The heterogeneous image processing method of claim 11, further comprising receiving the plurality of images in an image grabber from at least one image recording devices.
  • 19. A program product stored on computer readable storage medium for heterogeneously processing an image, the computer readable storage medium comprising program code for causing a computer system to: receive a plurality of images in an image co-processor, the image co-processor comprising a plurality of special purpose engines (SPEs), wherein the plurality of images include images collectively and substantially simultaneously recorded by a plurality of image recordation mechanisms, each image recordation mechanism providing at least one of the plurality of images for acquisition by one of a plurality of image grabbers, each image grabber sending at least one of the plurality of images to one of a plurality of peripheral component interconnects, each of the plurality of peripheral component interconnects sending at least one of the plurality of images to a pure load balancer;distribute distinct portions of the plurality of images across the plurality of SPEs, wherein the distinct portions are different views of an object recorded at substantially a same time by at least two of the plurality of image recordation mechanisms;process the distinct portions of the plurality of images collectively and substantially simultaneously with the plurality of SPEs to determine image data associated with the plurality of images;provide at least one of the image data or the distinct portions of the plurality of images to a control processor to determine control information associated with the distinct portions of the plurality of images; andreceive a selective off-load of at least one library function at a run time, wherein the at least one library function is written for a plurality of parallel multi-core processors, wherein the at least one library function supports the processing of the distinct portions of the plurality of images at the plurality of SPEs, and wherein the at least one library function is one of at least one algorithm, an optimization for a processor architecture, and a wrapper for a scripting language.
  • 20. The program product of claim 19, the computer readable storage medium further comprising program code for causing the computer system to: determine control information for the plurality of images with the control processor.
  • 21. The program product of claim 20, the computer readable storage medium further comprising program code for causing the computer system to: store the image data in a staging storage unit for processing by the control processor; andstore the at least one of the image data or the plurality of images in an archive storage unit.
  • 22. The program product of claim 19, the computer readable storage medium further comprising program code for causing the computer system to: receive the plurality of images in an input/output (I/O) processor, the I/O processor comprising:a set of express peripheral component interconnects (PCIs);a pure load balancer (PLB) coupled to the set of express PCIs; anda network interface coupled to the PLB for interfacing with at least one legacy application.
  • 23. The program product of claim 22, the I/O processor receiving the plurality of images from an image grabber, and providing the plurality of images to the plurality of SPEs.
  • 24. The program product of claim 19, the image co-processor further comprising: a power processing element (PPE); andan element interconnect bus (EIB) coupled to the PPE and the plurality of SPEs.
  • 25. The program product of claim 19, the SPEs comprising accelerators that increase a rate of processing of the plurality of images.
  • 26. The program product of claim 19, further comprising receiving the plurality of images in an image grabber from at least one image recording devices.
  • 27. A method for deploying a system for heterogeneously processing an image, comprising: providing a computer infrastructure being operable to:receive a plurality of images in an image co-processor, the image co-processor comprising a plurality of special purpose engines (SPEs), wherein the plurality of images include images collectively and substantially simultaneously recorded by a plurality of image recordation mechanisms, each image recordation mechanism providing at least one of the plurality of images for acquisition by one of a plurality of image grabbers, each image grabber sending at least one of the plurality of images to one of a plurality of peripheral component interconnects, each of the plurality of peripheral component interconnects sending at least one of the plurality of images to a pure load balancer;distribute distinct portions of the plurality of images across the plurality of SPEs, wherein the distinct portions are different views of an object recorded at substantially a same time by at least two of the plurality of image recordation mechanisms;process the distinct portions of the plurality of images collectively and substantially simultaneously with the plurality of SPEs to determine image data associated with the distinct portions of the plurality of images; andreceive a selective off-load of at least one library function at a run time, wherein the at least one library function is written for a plurality of parallel multi-core processors, wherein the at least one library function supports the processing of the distinct portions of the plurality of images at the plurality of SPEs, and wherein the at least one library function is one of at least one algorithm, an optimization for a processor architecture, and a wrapper for a scripting language.
US Referenced Citations (127)
Number Name Date Kind
4517593 Keller et al. May 1985 A
4893188 Murakami et al. Jan 1990 A
5136662 Maruyama et al. Aug 1992 A
5506999 Skillman et al. Apr 1996 A
5621811 Roder Apr 1997 A
5659630 Forslund Aug 1997 A
5721883 Katsuo et al. Feb 1998 A
5809078 Tani et al. Sep 1998 A
5956081 Katz et al. Sep 1999 A
6023637 Liu et al. Feb 2000 A
6025854 Hnz et al. Feb 2000 A
6081659 Garza et al. Jun 2000 A
6166373 Mao Dec 2000 A
6215898 Woodfill et al. Apr 2001 B1
6404902 Takano et al. Jun 2002 B1
6456737 Woodfill et al. Sep 2002 B1
6487619 Takagi Nov 2002 B1
6549992 Armangau et al. Apr 2003 B1
6567622 Phillips May 2003 B2
6647415 Olarig et al. Nov 2003 B1
6661931 Kawada Dec 2003 B1
6671397 Mahon et al. Dec 2003 B1
6744931 Komiya et al. Jun 2004 B2
6825943 Barry et al. Nov 2004 B1
6829378 DiFilippo et al. Dec 2004 B2
6898634 Collins et al. May 2005 B2
6898670 Nahum May 2005 B2
6950394 Chou et al. Sep 2005 B1
7000145 Werner et al. Feb 2006 B2
7016996 Schober Mar 2006 B1
7043745 Nygren et al. May 2006 B2
7065618 Ghemawat et al. Jun 2006 B1
7076569 Bailey et al. Jul 2006 B1
7095882 Akahori Aug 2006 B2
7102777 Haraguchi Sep 2006 B2
7106895 Goldberg et al. Sep 2006 B1
7142725 Komiya et al. Nov 2006 B2
7171036 Liu et al. Jan 2007 B1
7225324 Huppenthal et al. May 2007 B2
7243116 Suzuki et al. Jul 2007 B2
7299322 Hosouchi et al. Nov 2007 B2
7327889 Imai et al. Feb 2008 B1
7430622 Owen Sep 2008 B1
7480441 Klausberger et al. Jan 2009 B2
7523148 Suzuki et al. Apr 2009 B2
7602394 Seki et al. Oct 2009 B2
7605818 Nagao et al. Oct 2009 B2
7743087 Anderson et al. Jun 2010 B1
7801895 Hepper et al. Sep 2010 B2
8078837 Kajihara Dec 2011 B2
8094157 Le Grand Jan 2012 B1
20020002636 Vange et al. Jan 2002 A1
20020129216 Collins Sep 2002 A1
20020164059 DiFilippo et al. Nov 2002 A1
20020198371 Wang Dec 2002 A1
20030023154 Nitz Jan 2003 A1
20030053118 Muramoto et al. Mar 2003 A1
20030092980 Nitz May 2003 A1
20030113034 Komiya et al. Jun 2003 A1
20040024810 Choubey et al. Feb 2004 A1
20040062265 Poledna Apr 2004 A1
20040062454 Komiya et al. Apr 2004 A1
20040091243 Theriault et al. May 2004 A1
20040122790 Walker et al. Jun 2004 A1
20040143631 Banerjee et al. Jul 2004 A1
20040153751 Marshal et al. Aug 2004 A1
20040156546 Kloth Aug 2004 A1
20040170313 Nakano et al. Sep 2004 A1
20040186371 Toda Sep 2004 A1
20040217956 Besl et al. Nov 2004 A1
20040228515 Okabe et al. Nov 2004 A1
20040233036 Sefton Nov 2004 A1
20040252467 Dobbs et al. Dec 2004 A1
20050013960 Ozeki et al. Jan 2005 A1
20050022038 Kaushik et al. Jan 2005 A1
20050044132 Campbell et al. Feb 2005 A1
20050063575 Ma et al. Mar 2005 A1
20050080928 Beverly et al. Apr 2005 A1
20050083338 Yun et al. Apr 2005 A1
20050084137 Kim et al. Apr 2005 A1
20050093990 Aoyama May 2005 A1
20050113960 Karau et al. May 2005 A1
20050219253 Piazza et al. Oct 2005 A1
20050259866 Jacobs et al. Nov 2005 A1
20050263678 Arakawa Dec 2005 A1
20060013473 Woodfill et al. Jan 2006 A1
20060117238 DeVries et al. Jun 2006 A1
20060135117 Laumen et al. Jun 2006 A1
20060149798 Yamagami Jul 2006 A1
20060171452 Waehner Aug 2006 A1
20060184296 Voeller et al. Aug 2006 A1
20060190627 Wu et al. Aug 2006 A1
20060235863 Khan Oct 2006 A1
20060239194 Chapell Oct 2006 A1
20060250514 Inoue et al. Nov 2006 A1
20060268357 Vook et al. Nov 2006 A1
20060269119 Goldberg et al. Nov 2006 A1
20060274971 Kumazawa et al. Dec 2006 A1
20060279750 Ha Dec 2006 A1
20070126744 Tsutsumi Jun 2007 A1
20070159642 Choi Jul 2007 A1
20070245097 Gschwind et al. Oct 2007 A1
20070250519 Fineberg et al. Oct 2007 A1
20080013862 Isaka et al. Jan 2008 A1
20080036780 Liang et al. Feb 2008 A1
20080063387 Yahata et al. Mar 2008 A1
20080092744 Storbo et al. Apr 2008 A1
20080129740 Itagaki et al. Jun 2008 A1
20080140771 Vass et al. Jun 2008 A1
20080144880 DeLuca Jun 2008 A1
20080147781 Hopmann et al. Jun 2008 A1
20080177964 Takahashi et al. Jul 2008 A1
20080259086 Doi et al. Oct 2008 A1
20080260297 Chung et al. Oct 2008 A1
20080263154 Van Datta Oct 2008 A1
20080270979 McCool et al. Oct 2008 A1
20090003542 Ramanathan et al. Jan 2009 A1
20090052542 Romanovskiy et al. Feb 2009 A1
20090066706 Yasue et al. Mar 2009 A1
20090074052 Fukuhara et al. Mar 2009 A1
20090083263 Felch et al. Mar 2009 A1
20090089462 Strutt Apr 2009 A1
20090150555 Kim et al. Jun 2009 A1
20090150556 Kim et al. Jun 2009 A1
20090187654 Raja et al. Jul 2009 A1
20090265396 Ram et al. Oct 2009 A1
20100060651 Gala Mar 2010 A1
Foreign Referenced Citations (3)
Number Date Country
1345120 Sep 2003 EP
0068884 Apr 2000 WO
0068884 Nov 2000 WO
Related Publications (1)
Number Date Country
20080260297 A1 Oct 2008 US