Graphic processing apparatus and method

Information

  • Patent Grant
  • 6704019
  • Patent Number
    6,704,019
  • Date Filed
    Wednesday, April 17, 2002
    22 years ago
  • Date Issued
    Tuesday, March 9, 2004
    20 years ago
Abstract
A graphic processing apparatus for generating, displaying or printing characters and graphic data. A successive column access is used in which a row address is designated for access to a memory and data in different column addresses within the designated same row address are successively accessed and buffer means for buffering a series of data between an access by a processor and an access to a memory is provided. A program and image information for display are stored in a main memory. A frame buffer and the main memory are integrally configured simply and small in size.
Description




BACKGROUND OF THE INVENTION




The present invention relates to a graphic processing apparatus for generating, displaying or printing characters and graphic data, and more particularly to a graphic processing apparatus including a frame buffer provided integrally in a main memory to store display pixel data and a graphic processing method of generating characters and graphic data.




A graphic processing apparatus for generating and displaying characters and graphic data includes a frame buffer for storing data corresponding to pixels on a display screen. In order to display a stable picture on the display screen, it is necessary to read data from the frame buffer successively repeatedly in synchronism with the raster scanning of a display unit. In order to realize the display function, the frame buffer is generally used as a memory independently of a main memory.




Japanese Patent Unexamined Publication JP-A-59-131979 discloses, as a special purpose memory suitable for the frame buffer, a graphic dual-port memory having a serial output port in addition to a random access port. The use of the memory can make short the display access and hence improve the drawing performance (graphic generation). However, thereafter, even if the integration density of the memory is increased, a constant number of memories are required to obtain a fixed display output. Accordingly, there is a problem that the high integration is not utilized in a lower capacity area of the memory effectively.




Further, there are Japanese Patent Unexamined Publications JP-A-63-91787 and JP-A-1-265348 as relevant references.




Heretofore, there is a problem that the access method which is complicated and low in a speed must be utilized in order to cope with various applications. Further, there is a problem that the high integration of the graphic dual-port memory capable of being used as the high-speed frame buffer is not utilized for a small capacity memory.




SUMMARY OF THE INVENTION




It is an object of the present invention to provide a graphic processing apparatus and method in which a frame buffer and a main memory are configured integrally and a high-speed and standard dynamic memory with simple configuration is used to effectively utilize the integration degree of the memory.




Further, it is another object of the present invention to provide a graphic processing apparatus in which a standard dynamic RAM can be used to make high-speed drawing with a high-integrated and small configuration.




In the present invention, successive column access for a memory is used in which a row address is designated and then data in different column addresses within the designated row address are successively accessed, and there is provided means for buffering a series of data between the access by a processor and the access to the memory. Display graphic information is also stored in a main memory in addition to a program and data.




Further, in order to achieve the high-speed drawing, there is provided a graphic processor which performs the successive column access for a dynamic memory. The number of times of the column accesses is varied and there is provided buffer means for temporarily storing data obtained by the column accesses.




The buffer means can absorb deviation in a timing between the access from the processor and the memory access having a higher throughput than that of the access from the processor and utilize empty memory access as display access.




Further, the graphic processor allows to vary the number of times of the column accesses and performs the graphic processing for raster data having any length effectively.




According to the present invention, since the frame buffer and the main memory can be configured integrally, it can be configured simply and small regardless of the high-speed operation. For example, 32 to 64 16-Mbit memory elements are used to operate a processor having a performance of 100 MIPS or more effectively and 1280×1024 pixels can display 16 million colors (8 bits for each of R, G and B).




Further, according to the present invention, system bus access (access by operation processing means) and display access (access by display control means) can be controlled preferentially in accordance with its priority order and waste of the memory access can be eliminated.











BRIEF DESCRIPTION OF THE DRAWINGS





FIG. 1

is a block diagram showing a system configuration of an embodiment of the present invention;





FIG. 2

is a diagram illustrating bus transfer speed;





FIG. 3

is a diagram illustrating a system bus;





FIG. 4

is a diagram illustrating a memory bus;





FIG. 5

is a block diagram showing an internal configuration of a memory controller of

FIG. 1

;





FIG. 6

illustrates memory space;





FIG. 7

is a diagram illustrating a priority control mechanism in a display controller of

FIG. 5

;





FIG. 8

illustrates memory access operation;





FIG. 9

is a block diagram showing an internal configuration of a rendering processor of

FIG. 5

; and





FIG. 10

illustrates operation in the drawing process.











DESCRIPTION OF THE PREFERRED EMBODIMENTS




Embodiments of the present invention are now described in detail with reference to the accompanying drawings.





FIG. 1

shows an embodiment of the present invention, which comprises one to a plurality of CPU modules


1


, one to a plurality of memory modules


2


, a high-speed I/O unit


3


, a bus adapter


4


, a DAC (D/A converter)


5


and a CRT


6


. The CPU modules


1


, the memory modules


2


, the high-speed I/O unit


3


and the bus adapter


4


are connected each other through a high-speed system bus having a width of 64 bits. The CPU module


1


includes a CPU


10


for performing operational process and a large-capacity secondary cache


11


connected to the CPU externally. The CPU


10


is of one-chip LSI and includes a primary cache


100


, a floating-point arithmetic mechanism (not shown) and a memory control unit (not shown). A CPU having a performance of 100 MIPS (the performance index representing that millions instructions per second can be executed) is used in the embodiment. The memory module


2


includes a memory controller


20


and a main memory


21


including frame buffer function. The memory controller


20


includes a raster buffer


2000


for buffering data and controls the access to the main memory and output of display data. Description will be made by way of example on the assumption that the main memory


21


has 32 16-Mbit (four 4-Mbit memory devices) memory chips. The high-speed I/O unit


3


connected directly to the high-speed system bus is a high-speed network controller, a high-speed disk controller or the like. The bus adapter


4


is to connect the high-speed system bus to a low-speed I/O bus. Connected to the low-speed I/O bus are a printer, a key board, a mouse, an external memory such as a disk and a controller for a network or the like. The DAC


5


includes a color pallet for converting a color code and a D/A converter for converting a digital color video signal to an analog video signal. The CRT


6


is of a raster scan type display unit. In the embodiment, a display unit capable of displaying various colors with any resolution can be employed. However, it is assumed that a display unit having 1280×1024 pixels is used in the embodiment unless otherwise specified. Further, in the embodiment, the CRT


6


is connected as an output unit of characters and graphic data, while it is needless to say that the present invention can be applied to other output unit (for example, a printer) similarly.





FIG. 2

shows a relation between a bus width and a speed of a memory and a bus transfer speed. Heretofore, a cycle time of the random access to a dynamic memory generally requires about 200 to 400 ns. For example, if the memory cycle is 200 ns and the bus width is 32 bits, the bus transfer speed of 20 MB/S is obtained. On the other hand, when 256 colors are displayed simultaneously on a display unit having, for example, 1280×1024 pixels, the throughput of at least about 80 MB/S is required for reading of data to be displayed. More particularly, in a conventional general memory design, the throughput necessary for the display is much larger than that of the accesses of a processor to a memory, and this is a reason that the special purpose buffer memory is required. Accordingly, as in the embodiment of

FIG. 1

, when the performance of the processor exceeds 100 MIPS, the performance capable of supplying programs and data from the memory with the throughput of several hundreds MB/S is required in order to operate the processor effectively. That is, a higher throughput for the accesses by the processor is necessarily required as compared with the memory access for the display. This means the possibility that part of the accesses by the processor can be assigned to the display. For example, if the memory cycle is 25 ns, there can be attained the transfer rate of 320 MB/S with the bus width of 64 bits and the transfer rate of 640 MB/S with the bus width of 128 bits.





FIG. 3

illustrates operation of the high-speed system bus. In order to increase the performance, m blocks are transferred as a unit. When an application of an address to an A/D (address/data) bus is represented by a falling edge of an AS (address strobe) from a bus master, the transfer cycle is started. When a response is returned by a falling edge of AA (address acknowledge) from a slave side, data is transferred. The transfer of data is controlled by control signals of DS (data strobe) and DA (data acknowledge). Both of falling edges and rising edges of the DS and DA are significant, and the DS is used to inform existence of data and the DA is used to inform a response thereto. In this manner, the high-speed operation is attained by transferring data in a block. For example, when one transfer cycle of data is 25 ns, the transfer rate of 320 MB/S is obtained in a peak with the system bus of 64 bits.





FIG. 4

illustrates the high-speed memory access method. A read cycle by a page mode is shown by way of example. When a RA (row address) on address lines is supplied in a memory device in response to a falling edge of a RAS (row address strobe), reading of one line from a memory cell is started. Then, a CA (column address) on the address lines is supplied in the memory device in response to a falling edge of a CAS (column address strobe) and data designated by the CA of the one-line data designated by the RA is produced on a data line. Thereafter, the CA is changed and data for a different column address in the same row is successively read in response to the falling edge of the CAS. The page mode can make access only by the application of the CA and accordingly the considerably high-speed operation can be attained as compared with the conventional random access in which both of RA and CA are supplied every time. Consequently, when the cycle time of the page mode is, for example, 25 ns, the transfer rate of maximum 640 MB/S is attained with the data bus width of 128 bits.




Referring now to

FIG. 5

, an internal configuration of the memory controller


20


according to the present invention is described. The memory controller


20


includes a system bus interface


200


, a rendering processor


202


, a display controller


204


, a memory bus interface


206


and a memory control unit (or address converter)


208


. The system bus interface


200


serves to interface with the system bus, and includes a system raster buffer


200


and an access counter


2001


. The system raster buffer


200


is to temporarily store data transferred between the system bus and the memory.




In the embodiment, the system raster buffer


2000


has a capacity of 256 bytes and the transfer of data is made within the range of the memory capacity by the number of times designated by the access counter. Particularly, in the embodiment, since the throughput of the memory bus is higher than that of the system bus, the system raster buffer


2000


is indispensable as means for absorbing a difference in a speed between the memory bus and the system bus.




The access counter


2001


is to control the input and output operation of the system raster buffer


2000


in response to the access of the system bus or the memory bus, and an optimum value determined by a line size of the cache is previously set in the access counter


2001


as the number of times of the block transfer (maximum 32 times) to the system bus. Part of the system raster buffer


2000


and the access counter


2001


may be configured by an FIFO (First-In First-Out).




The rendering processor


202


is to control drawing and has the basic drawing function such as generation of straight lines, generation of painting data of horizontal lines and control of bit block transfer. Further, the rendering processor


202


has the Z comparison function for shading process for effecting interpolation for smooth brightness and elimination of shaded portion.




The display controller


204


is to control display for the display unit, and includes a display raster buffer A


2040


and a display raster buffer B


2041


. The display controller


204


has a programmable function capable of coping with various display units having different resolutions of display screens and different numbers of colors to be displayed and performs generation of a synchronous signal and reading of display data in accordance with a display unit. Each of the display raster buffers A and B


2040


and


2041


has a memory capacity of 6 Kbytes and can store 2048 pixels each including eight bit-data for each of R, G and B. More particularly, since each of two buffers can store the display data for one raster, one of the buffers is used for display during the display period of one raster and the other is used to store the next raster data.




In the embodiment, each of the buffers stores data for one raster of the display unit, while the data for one raster may be divided into a partial data unit of the raster if the capacity of the buffers is small. In this case, however, since the reading access of the display data is concentrated within the display period and is not made during the retrace period, there is a problem that the load balance to the system bus is different in the display period and the retrace period. In other words, if data for one raster can be stored in the display raster buffer, the reading access of the display data can be dispersed into the period including the horizontal retrace period and the display period. The function of the display raster buffers can be replaced by FIFO.




The memory bus interface


206


serves to interface with the memory and performs multiplexing of the address for the dynamic RAM and generation of memory control signals. The memory control unit (or address converter)


208


performs the same address conversion as that included in the CPU


10


and converts a virtual address supplied from the rendering processor


202


and the display controller


204


into a physical address. The physical address supplied from the system bus is sent to the memory bus as it is. If the memory control unit


208


is not provided, the rendering processor


202


and the display controller


204


uses the physical address to control the address. Further, since there is a problem of the synchronization with the display screen, the display address is used as the physical address and the rendering processor


202


may control a logical address.





FIG. 6

illustrates a logical image of the main memory


21


. In the embodiment, the main memory


21


is a single memory space as a hardware and accordingly can be treated logically freely by a software so that various flexible configurations can be formed. FIG.


6


(


a


) shows an image of the memory when accessed by the CPU and the memory has a linear space having a data width of 64 bits. A memory area for 32 Kbytes of the memory has the same row address and can be accessed by the page mode. FIG.


6


(


b


) illustrates an image as a frame buffer with 8 bits/pixel used for display of 256 colors. A width thereof is composed of 2048 pixels in FIG.


6


(


b


), while other configuration with different width may be utilized. In this example, the page mode access can be effective in the area of a width of 2048 pixels by a depth of 16 pixels. FIG.


6


(


c


) illustrates an example of a frame buffer of 32 bits/pixels with 8 bits for each of R, G, B and α (coefficient data used for semitransparent expression). In this case, the page mode access can be effective in the area of a width of 2048 pixels by a depth of 4 pixels. Various configurations may be available in addition to the above configurations. For example, a Z plane having a Z value of 32 bits is treated by the same image as FIG.


6


(


c


). Further, the main memory


21


can store the above various data mixedly and can be applied to various uses flexibly.





FIG. 7

shows a configuration of a priority control mechanism in the display controller


204


, which includes a load counter


2042


, a display counter


2043


, and a priority control circuit


2044


. The load counter


2042


is to control operation of one of the display raster buffers A


2040


and B


2041


being used for reading the display data. The display counter


2043


is to control progress of the other of the display raster buffers being used for the display. Since the display raster buffer operates in synchronism with the display, the reading of the display data must be finished until the processing of the raster buffer being used for the display is completed. Accordingly, the priority control circuit


2044


judges whether a remaining time is sufficient for reading of a remaining display data or not. More particularly, a remaining time comparator


2045


compares a time required to read the remaining display data with the remaining operation time of the display raster buffer being used for the display on the basis of information of the load counter


2042


and the display counter


2043


. When there is a sufficient time for reading, the priority of access for the display is reduced, while when the remaining time is short, the remaining time comparator


2045


produces a priority control signal for increasing the priority of the access for the display.





FIG. 8

shows a flow of operation of each portion. The priority order for the normal access is {circle around (1)} the system bus, {circle around (2)} the rendering processor and {circle around (3)} the display access, while the priority order in the condition of the display priority is {circle around (1)} the display access, {circle around (2)} the system bus and {circle around (3)} the rendering process. “S” in the memory access represents the access by the system bus, “R” represents the access by the rendering processor, and “D” represents the display access. Boxes in the memory access represent a series of page mode accesses. The reading access from the system bus reads data from the memory through the system raster buffer


2000


. Since the memory access is faster than the system bus, data is temporarily stored in the system raster buffer


2000


and then is successively outputted to the system bus. The writing access from the system bus is stored in the memory after storing in the system raster buffer


2000


. The display raster buffers A


2040


and B


2041


are used for display alternately in synchronism with the horizontal scanning. Since the access by the rendering processor


202


has a lower priority order than the system bus, a waiting state occurs when the system bus is accessed and used. When the access of the system bus and the access by the rendering processor are concentrated, the display access is driven after one raster time, and when the remaining time is short in this state, the priority is changed to the display priority so that the display access is forcedly implemented. In this case, there is a case where a wait occurs in the access of the system bus.





FIG. 9

shows a configuration of the rendering processor


202


, which includes a DDA circuit


2020


, a Z raster buffer


2021


, a Z comparator


2022


, a source raster buffer


2023


, a pattern raster buffer


2024


, a destination raster buffer


2025


and a raster arithmetic unit


2026


. The DDA circuit


2020


performs generation of coordinates upon generation of straight lines, calculation of brightness for R, G and B in the interpolation of the brightness and calculation by interpolation of Z value, and produces an address for each pixel. The Z raster buffer


2021


stores data for a series of raster (a plurality of pixels continuing horizontally or its plurality of groups) of Z value read from the memory and stores Z value corresponding to any designated length of raster. The Z comparator


2022


compares data of the Z raster buffer


2021


with the Z value interpolated by the DDA circuit and its comparison result is stored in the pattern raster buffer


2024


. The source raster buffer


2023


stores any length of raster data of a source of the bit block transfer (BITBLT) operation. In the case of generation of straight lines, the buffer stores the drawing line information and the drawing information. In the case of shading operation, the buffer stores a brightness value produced by the DDA circuit


2020


. The pattern raster buffer


2024


stores raster data of a pattern for the BITBLT operation and mask data produced from the Z comparator upon elimination of shading plane. The destination raster buffer


2025


temporarily stores read data of destination in the BITBLT operation. The raster arithmetic unit


2026


executes the BITBLT operation such as color operation and various logic operation in accordance with a predetermined operation mode. The embodiment is characterized in that the memory is accessed in the page mode in a raster block unit at a high speed by providing the rater buffers storing any raster data so that a large amount of data is processed at a high speed.





FIG. 10

shows procedures of the memory accesses for some examples of processes. In

FIG. 10

, one box represents a series of accesses for the raster block capable of being accessed by the page mode. During the actual memory access, when the system bus access or the display access is produced, the waiting state occurs and it is divided into a plurality of page mode access groups and implemented. In the two-operand BITBLT of FIG.


10


(


a


), the writing of the destination is executed subsequently to the reading of the source and destination. FIG.


10


(


b


) is different from FIG.


10


(


a


) in that the reading of the pattern raster data is added. FIG.


10


(


c


) shows generation of straight lines, and the writing is executed for each unit of a series of raster blocks. For example, in the memory configuration shown in FIG.


6


(


b


), generation of a horizontal line can be executed by a single writing of raster block. In the case of a vertical straight line, the raster block is different every 16 pixels and accordingly the writing can be effected in its unit. However, it is needless to say that when the raster buffer has a smaller capacity than 32 KB the access unit is restricted by its capacity size. When the capacity of the raster buffer is small, the efficiency of the memory access can be increased correspondingly if a method of storing a column address together with the raster block can be adapted or an oblong or square shape can be selected as the shape of the raster block. FIG.


10


(


d


) shows shading, and writing of the destination data is made after reading of the Z raster. When operation for the destination data is specified, there is a case where reading of the destination is made before writing of the destination.




As described above, in the embodiment, the high-speed drawing can be executed by the rendering processor shown in

FIG. 9

in combination with the page mode access. The embodiment is configured by using the standard dynamic memory, while it is not necessarily required that the configuration of

FIG. 9

is combined with the dynamic memory and it may be combined with an image dual-port memory, for example. With the configuration of the embodiment, the memory can be small and when the image dual-port memory is used, reduction of the performance by the display access is small. Further, in the embodiment, the page access mode has been described by way of example, while it is needless to say that the similar description can be made if other similar mode (nibble mode or static column mode) is used.



Claims
  • 1. A data processing system comprising:a processor; a memory to store a command to be executed by said processor and data to be displayed, said memory having a single output port for outputting said data or said command therethrough; a display to display said data read out of said memory; a display controller to control display of data on said display; and a memory controller connected to said memory for controlling priority among data and commands and for outputting through said display controller data to be displayed on said display; wherein, while said memory controller reads said command out of said memory, said memory controller outputs said data to be displayed through said display controller.
  • 2. A data processing system according to claim 1, wherein said memory controller is connected to said processor.
  • 3. A data processing system according to claim 2, further comprising a buffer for buffering said data to be displayed.
Priority Claims (1)
Number Date Country Kind
02-197929 Jul 1990 JP
CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of Ser. No. 09/593,496 filed Jun. 14, 2000 and issued Apr. 23, 2002 as U.S. Pat. No. 6,377,267, which is a continuation of Ser. No. 09/327,355 filed Jun. 8, 1999 and issued Apr. 24, 2001 as U.S. Pat. No. 6,222,563, which is a continuation of Ser. No. 08/739,457 filed Oct. 29, 1996 and issued Aug. 17, 1999 as U.S. Pat. No. 5,940,087, which is a continuation of Ser. No. 08/358,988 filed Dec. 19, 1994 and issued Jan. 6, 1998 as U.S. Pat. No. 5,706,034, which is a continuation of Ser. No. 07/735,947 filed Jul. 25, 1991, abandoned.

US Referenced Citations (13)
Number Name Date Kind
4462028 Ryan et al. Jul 1984 A
4562435 McDonough Dec 1985 A
4594587 Chandler et al. Jun 1986 A
4736200 Ounuma Apr 1988 A
4737780 Ishii Apr 1988 A
4757312 Asai et al. Jul 1988 A
4779223 Asai et al. Oct 1988 A
4803475 Matsueda Feb 1989 A
4812998 Maeda Mar 1989 A
4845661 Shimada Jul 1989 A
4953101 Kelleher et al. Aug 1990 A
5072420 Conley et al. Dec 1991 A
5107251 Frank et al. Apr 1992 A
Foreign Referenced Citations (3)
Number Date Country
59131979 Jul 1984 JP
6391787 Apr 1988 JP
1265348 Oct 1989 JP
Non-Patent Literature Citations (2)
Entry
Yao, “Unified Memory Architecture Cuts PC Cost”, Microprocessor Report, vol. 9, No. 8, pp. 5-9, Jun. 19, 1995.
Nikkei Electronics, No. 653, pp.16-17, Jan. 15, 1996.
Continuations (5)
Number Date Country
Parent 09/593496 Jun 2000 US
Child 10/123136 US
Parent 09/327355 Jun 1999 US
Child 09/593496 US
Parent 08/739457 Oct 1996 US
Child 09/327355 US
Parent 08/358988 Dec 1994 US
Child 08/739457 US
Parent 07/735947 Jul 1991 US
Child 08/358988 US