Apparatus and method for selectively mapping proper boot image to processors of heterogeneous computer systems

Information

  • Patent Application
  • 20050060531
  • Publication Number
    20050060531
  • Date Filed
    September 15, 2003
    21 years ago
  • Date Published
    March 17, 2005
    19 years ago
Abstract
A machine-readable identification register is provided on each cell of a cellular computer system. The identification register is read during system startup to identify a processor type, which may include an instruction set architecture (ISA), associated with the cell. The processor type information is used to ensure that a compatible boot image is provided to processors of the cell. In another embodiment, the system management subsystem has a version selection flag. When the version selection flag is in a first state, the compatible boot image provided to processors of the cell is a current boot image; with the selection flag in a second state the compatible boot image provided to processors of the cell is an older edition of the boot image.
Description
FIELD

The apparatus and method relate to the field of computer systems and computer system firmware. In particular, they relate to apparatus and methods for providing boot firmware to processors of computer systems.


BACKGROUND

Modern, high performance, computer systems typically have multiple processors. It is known that some computer systems have primary processors of multiple instruction set types, multiple processor systems having primary processors of multiple instruction set architectures (ISAs) are known herein as heterogeneous computer systems.


Heterogeneous computer systems offer advantages in that they may run application code written for a variety of processor types and operating systems.


In addition to primary processors, upon which operating system and user programs run, there are typically additional embedded processors of additional types. Embedded processors are typically provided for control of specific hardware devices, such as disk drives, in the system. In a computer system, embedded processors, hereinafter system management processors, may also perform system management functions as monitoring of primary processor voltages and temperatures, control of cooling and power supply subsystems, as well as boot-time configuration of various system components.


Machine-language operating system code, including low level system code and BIOS (basic input-output system) code, is ISA specific. For example, machine-level code for a PA8800 will not run correctly on an Intel Itanium processor. In a heterogeneous computer system, each low-level operating system code module typically exists in a separately-compiled module for each primary processor type.


A family of high performance heterogeneous computer systems from Hewlett-Packard can be configured to use primary processors of two or more ISA types, including the Intel Itanium and PA8800 instruction set architectures.


In this family of computer systems, a field replaceable “cell” has several primary processor circuits of the same type, together with memory, circuitry for communicating with other cells over a backplane bus, input output (I/O) bus interface circuitry, JTAG (Joint Test Action Group) scan circuitry, and other circuitry. There may be one or more additional embedded processors in each cell to perform system management functions.


One or more cells, which may, but need not, be of the same type, are installed into a backplane. A heterogeneous computer system is formed when cells having two or more types of processors are inserted into the backplane.


This family of computer systems supports simultaneous execution of multiple operating systems, including multiprocessor variants of Windows-NT, Unix, VMS, and Linux. Multiple instances of each system are also supported. Each operating system instance operates in a partition of the computer system.


At system boot time, a group of processors of a particular type are assigned to operate in each partition. These processors may belong to more than one cell, but must all be of the same ISA. As the operating system instance running in the partition boots, or initializes; processors of the partition become aware of each other and appropriate task routing and assignment datastructures are built in system memory. The process of processors becoming aware of each other and building task routing and assignment datastructures in system memory is known herein as a Rendezvous of the processors.


It is known that nonvolatile memory circuits having board identification and timing information may be designed into modules of a computer system. Many Synchronous Dynamic Random Access Memory (SDRAM) modules contain serial memory devices having interface timing information recorded therein. Information in these memory devices is used to configure memory interface circuitry of the computer system such that the system will properly communicate with those memory modules actually installed in the system. The Peripheral Component Interconnect (PCI) bus specification provides for machine-readable identification registers within each peripheral device attached to a PCI bus, information read from these identification registers is typically used by an operating system to allocate bus address space and to determine appropriate drivers for each peripheral device.


Some prior heterogeneous computer systems have assigned processors to partitions according to the physical location of the processors in the system. In these systems, processors on cells installed in particular slots of the backplane are assigned to one partition, those in other slots are assigned to a second partition. Should cells be moved in the backplane, assignment of processors to partitions based on physical location may result in incompatible processors being assigned to a partition.


It is desirable to assign processors to system partitions in a simple, reliable, way. It is desirable to assign processors to partitions in a manner that ensures that each partition includes only compatible processors. Previously filed, U.S. patent application <HP 1476> describes a system having a type register on each cell of the system, whereby processor type may be identified and individual processors assigned to partitions having compatible processors.


Processors of a computer system typically are provided with firmware executable at system boot time. Firmware executable at system boot typically includes boot loader functions for loading operating system software, as well as basic input/output system (BIOS) functions. In a heterogeneous cellular computer system, firmware executable at system boot includes these boot loader and BIOS functions, as well as firmware for performing rendezvous.


As with other operating system modules, it is necessary that firmware presented to each processor at boot time be compatible with the processor to which it is presented. Further, it is desirable that there be a mechanism to permit upgrade of firmware, while allowing rapid return to a previous firmware edition should upgraded firmware prove buggy.


SUMMARY

A machine-readable identification register is provided on each cell of a computer system. In an embodiment, this identification register is incorporated into processor integrated circuits of a cell. The identification register is read during system startup to identify a processor type, which may include an instruction set architecture (ISA), associated with the cell. The processor type is used by a system management subsystem to ensure that a compatible boot image is provided to processors of the cell.


In an alternative embodiment, the identification register is part of a field programmable gate array (FPGA) installed on each cell of the computer system.


In another embodiment, the identification register is located within a serially addressable nonvolatile memory.


In another embodiment, the system management subsystem has a version selection flag. When the version selection flag is in a first state, the compatible boot image provided to processors of the cell is a current boot image; with the selection flag in a second state the compatible boot image provided to processors of the cell is an older edition of the boot image.




BRIEF DESCRIPTION OF THE FIGURES


FIG. 1 is an exemplary block diagram of a heterogeneous computing system.



FIG. 2 is an exemplary block diagram of a cell for a heterogeneous computing system.



FIG. 2A, is an exemplary block diagram of an alternative embodiment of the cell for a heterogeneous computing system.



FIG. 3 is an exemplary flowchart of actions taken during system boot to select an appropriate boot image and present it to the processors.



FIG. 4 is an exemplary block diagram illustrating organization of boot EEPROM contents.



FIG. 5 is an exemplary block flowchart illustrating a method of updating a boot EEPROM of a cell of a cellular, heterogeneous, computer system.




DETAILED DESCRIPTION OF THE EMBODIMENTS

A heterogeneous computer system 100 (FIG. 1) has a system management processor 102, and two or more processor cells 104, 106, and 108. Processor cells 104, 106, and 108 are of two or more ISA types. FIG. 2 is a detailed block diagram of a cell, such as cells 104, 106, and 108 of FIG. 1. With reference to FIG. 1 and FIG. 2, processor cells 104, 106, and 108 each have one or more primary processors 202. Processor cells 104 embody primary processors of a first ISA type, while processor cells 106 embody processors of a second ISA type.


In a particular embodiment, processor cells 104 embody processors 202 of the PA8800 type, while processor cells 106 embody processors 202 of the Intel Itanium type. Additional processor cells 108 may exist in the system 100, having additional types of processors, including in an embodiment processors of other Intel ISA types. In an embodiment, each cell has four primary processors, in another embodiment each cell has sixteen processors.


Cells of the system 100 are interconnected through high-speed interconnect 110. High-speed interconnect 110 provides for communications between cells. Some cells of the system 100 may also be coupled to I/O (Input/Output) interconnect 112. I/O interconnect 112 provides a path for communication between cells of the system, such as cell 108, and I/O devices 114. I/O devices 114 may include disk drives and network interface devices, as well as other peripherals.


Cells 104, 106, and 108 of the system 100 are connected to management processor 102 over a management interconnect 116. Management processor 102 is also coupled to control power supplies and fans 118.


Memory 204 of each cell is accessible from processors 202 of that cell, and from high speed interconnect 110. High speed interconnect interface device 206 of each cell is also capable of providing data from a bootable electrically-erasable programmable read-only memory (EEPROM) 207. Boot EEPROM 207 contains one or more binary images of processor boot code.


In one embodiment, primary processors 202 of each cell each are large integrated circuits each having multiple CPUs (Central Processor Units) together with multiple levels of cache memory. In one version of this embodiment, each processor integrated circuit 202 has four CPUs. It is anticipated that the number of effective CPUs per processor 202 may be greater than four, especially where a multithreaded processor design is employed.


Each cell also has a small management subprocessor 210, which in one embodiment is a microcontroller of the Intel 80251 type. It is anticipated that management subprocessor 210 may be a microcontroller of the Intel 8096, Motorola 6811 or 6805 type, or of another type as known in the art. Management subprocessor 210 is adapted for communication over management interconnect 116. In a particular embodiment, management subprocessor 210 controls cell-level cooling devices 212, and is capable of monitoring temperatures of the cell's primary processors 202.


In a particular embodiment, management subprocessor 210 communicates to cooling devices 212 and other devices (not shown) through an FPGA (Field Programmable Gate Array) 214. In this embodiment, FPGA 214 includes a cell type register 216. In an alternative embodiment, cell type register 218 is incorporated into firmware code of management subprocessor 210. In another embodiment, the cell type register 254 is incorporated into an integrated circuit or module together with one or more primary processors 202 of the cell.


In yet another alternative embodiment, as illustrated in FIG. 2A. boot EEPROM is coupled to high speed interconnect interface 206 through the FPGA 214.


With reference also to FIGS. 3 and 4, at system boot time, management processor 210 of the cell reads 302 the cell type register 216, 218 or 254. Management processor 102 uses the cell type information to determine 304 a processor type of the processors of each cell. Management processor 210 then reads 306 boot-image information 402 from the boot EEPROM 207, and determines an appropriate boot image 404 of one or more boot images 404, 406 in boot EEPROM 207. A boot image is appropriate only if the boot image contains machine readable code compiled to execute on processors of the same processor type as processors of the cell. Unless the version flag 122 is set (see below), an appropriate boot image is the boot image having the most recent version number or compilation date of all boot images for that processor type that are present in boot EEPROM 207. The management processor 210 then sets up 308 mapping hardware in high speed interconnect interface 206, or in the embodiment of FIG. 2A, in the FPGA 214, to map the appropriate boot image 404 into boot address space of the processors 202. Finally, the processor 202 is allowed 310 to boot from the appropriate boot image 404 of the boot EEPROM 207.


In an embodiment, the appropriate boot image 404 is determined to be the boot image having the most recent version number or compilation date of all boot images for that processor type that are present and verified as valid in boot EEPROM 207. Verification that a boot image is valid is performed through a Cyclic Redundancy Check (CRC) on that image. In this embodiment, should the most recent boot image for the processor type be invalid, a prior valid image is used for boot unless no such prior image exists in boot EEPROM 207.


In the event that no appropriate boot image is found in the EEPROM 207, as would happen if all boot images in the EEPROM are for a processor type incompatible with processors present in the cell, boot of that cell is inhibited.


In an embodiment, boot-image information 402 is consolidated in the form of a boot-image information table within boot EEPROM 207. In an alternative embodiment, boot-image information 402, 408 has the form of a separate record associated with each boot image 404, 406. In either embodiment, the boot image information 402 includes a processor type or types for which the image is an appropriate boot image and version information.


In an alternative embodiment, system management processor 102 (FIG. 1) has version flags 122 associated with each processor type. When a version flag associated with a particular processor type of version flags 122 is set, management subprocessors 210 (FIG. 2 or 2A) of cells having that processor type configure mapping hardware, whether located in FPGA 214 or high speed interconnect interface 206, to provide an older firmware image 406, where such an older firmware image appropriate for that processor type exists, to processors 202 at system boot time. This alternative embodiment allows recovery when buggy firmware updates are installed on the system, since recently functional firmware is retained as older firmware image 406.


It is occasionally necessary to update firmware of the computer system. When an update is required, as illustrated in FIG. 5, the EEPROM is checked 502 for sufficient empty space for a new, updated, boot image.


Occasionally, sufficient space may exist, more often, there will be insufficient space for the new image. Images present in the EEPROM are therefore examined to find 504 a discardable boot image. An image is discardable if it is incompatible with processors of the type present in the cell as indicated by the cell type register, or, if there is no such image, the oldest compatible image is discardable. The most recent compatible image is retained and becomes accessible through the version flag. The selected, discardable, image is erased 506 from the EEPROM, the new image is written 508 to the EEPROM, and image information is written 510 to the EEPROM.


While the forgoing has been particularly shown and described with reference to particular embodiments thereof, it will be understood by those skilled in the art that various other changes in the form and details may be made without departing from the spirit and hereof It is to be understood that various changes may be made in adapting the description to different embodiments without departing from the broader concepts disclosed herein and comprehended by the claims that follow.

Claims
  • 1. A computer system having a plurality of processors within a cell, the cell comprising: a processor type register, a management subprocessor, an EEPROM, and mapping hardware coupling the plurality of processors to the EEPROM; wherein at system boot the management subprocessor reads the processor type register to determine an appropriate boot image of a plurality of boot images recorded within the EEPROM, and configures the mapping hardware to map the appropriate boot image into boot address space of processors of the cell.
  • 2. A method of providing firmware to a processor of a cell of a cellular computer system comprising the steps: reading a processor type register; determining a processor instruction set architecture from information read from the processor type register; selecting a compatible boot image from a plurality of boot images, the plurality of boot images contained within an EEPROM of the cell, where each boot image has associated boot-image information; and configuring mapping hardware to map the appropriate boot image of the EEPROM into boot address space of the processor.
  • 3. The method of claim 2 wherein the steps of reading a processor type register and selecting a compatible boot image are performed by a management coprocessor of the cell.
  • 4. The method of claim 2 wherein the boot-image information comprises version information, and where the step of selecting a compatible boot image selects a most recent version unless a version flag is set.
  • 5. The method of claim 2 wherein the boot-image information comprises version information, and where the step of selecting a compatible boot image selects a most recent version unless a condition exists selected from the group consisting of a version flag being set and the most recent version determined invalid.
  • 6. A method of updating firmware on a computer system, the computer system comprising at least one processor, the method comprising the steps: determining a discardable boot image, selected from the group consisting of a boot image incompatible with the processor and a least-recent boot image compatible with the processor; erasing the discardable boot image from an EEPROM of the computer system; and writing a new boot image to the EEPROM.
  • 7. The method of claim 6 wherein the processor is a processor on a cell of a cellular computer system, the cell comprises the processor coupled through mapping hardware to the EEPROM, and wherein a recent boot image compatible with the processor is retained in the EEPROM when the discardable boot image is erased from the EEPROM.
  • 8. The method of claim 7 wherein the computer system further comprises a management processor coupled to a processor type register, and wherein the management processor reads the processor type register at system powerup to determine an appropriate boot image for the processor and configures the mapping hardware to present said appropriate boot image to the processor.
  • 9. Apparatus for providing firmware to a processor of a cell of a cellular computer system comprising: means for reading a processor type register of the cell; means for selecting an appropriate boot image from a plurality of boot images, the plurality of boot images contained within an EEPROM of the cell, where each boot image has associated boot-image information further comprising version information; and means for mapping the appropriate boot image into boot address space of the processor.
RELATED APPLICATIONS

The material of this application relates to copending application U.S. patent application Ser. No. 10/400,856, filed Mar. 27, 2003, Docket Number 100201476-1, which is incorporated herein.