Graphics controller integrated circuit without memory interface

Information

  • Patent Grant
  • 6356497
  • Patent Number
    6,356,497
  • Date Filed
    Tuesday, December 21, 1999
    24 years ago
  • Date Issued
    Tuesday, March 12, 2002
    22 years ago
Abstract
A CMOS integrated circuit that comprises a graphics controller system that consists of a graphics engine and video memory together with some interface blocks, a PCMCIA host adapter, an infrared interface for generating video images on a LCD or CRT display unit, and a video stream interface for receiving video signals. Since the video memory is integrated on the same integrated circuit as the graphics controller, no package pins are required for the memory interface. The pins thus saved are used to provide access to an on-chip PCMCIA host adapter. The internal memory interface is 128 bits wide. Simultaneous performance improvement and power dissipation reduction is achieved because of the wide memory interface and the elimination of the large parasitic capacitances associated with a package pin connection.
Description




BACKGROUND OF THE INVENTION




This invention is related to graphics controller systems and, more particularly, to graphics controller systems with low power dissipation.




As shown in

FIG. 1

, a typical graphics controller system has a graphics controller integrated circuit


10


, which has a graphics engine


12


for manipulating video data, and a CPU interface


13


, display interface


14


and video memory interface


15


. The graphics controller integrated circuit


10


receives video image data from a CPU (central processing unit) through the CPU interface


13


, and after processing the data, stores that information through the video memory interface


15


in a separate video memory


11


, also called the video frame buffer. The graphics controller


10


also makes sure that the image data is regularly retrieved from the video memory (through the interface


15


) and fed to a display unit through the display interface


14


with a frequency which satisfies the refresh requirements of the display. In some more advanced graphics controller systems, video image data may also be received from other sources, such as a device with a PCMCIA (Personal Computer Memory Card International Association) connector.




The video memory interface


15


of the graphics controller integrated circuit


10


has ports dedicated to interface with the video memory


11


. The number of ports required for this interface


15


is the sum of the address, data and control signals required to access the video memory


11


. The memory


11


has a size which is a function of the video frame buffer required to support the display resolution. While dynamic random access memory (DRAM) is most commonly used for the video frame buffer, some high performance systems use VRAMs (DRAMs with serial data ports added). A typical VGA (Video Graphics Adapter standard) display in an IBM-compatible mobile computer, often called a notebook computer, with an LCD (liquid crystal display) panel uses a single 256K×16 DRAM integrated circuit as a video frame buffer. A typical SVGA (Super VGA standard) system uses two such DRAMs organized as 256K×32.




Wider data paths between the video memory and the graphics controller allow greater bandwidth for data transfer. However, the wider data paths also increase the pin count of the graphics controller package and the package count of the DRAMs with the accompanying increased manufacturing complexity and costs. A 16-bit data path requires one DRAM package and approximately 30 signal lines to handle the memory address, data, and control signals, while a 32-bit data path requires two DRAM packages and 50 signal lines. Power dissipation is increased as more signal lines are added since each signal line has a parasitic capacitance associated with the package I/O pin, as well as with the conducting trace on the motherboard of the mobile computer system. Therefore, an increase in graphics performance is accompanied by an increase in power dissipation, pin count and package count.




The present invention solves or substantially mitigates these problems with a high performance graphics controller system having low power dissipation, and low pin and package counts.




SUMMARY OF THE INVENTION




According to the invention, there is provided a graphics controller system with increased performance simultaneously with a reduction in power dissipation, point count and package count. Previously external video memory is integrated with the graphics controller system to eliminate the memory interface. The reduction in pin count is used to add the pins associated with a PCMCIA host adapter and thus allow the integration of that function on the same chip, so as to further reduce the package count on the mother board.




The present invention also provides for particular arrangements for logic circuits and output buffer circuits so that large amounts of logic circuitry sufficient to perform graphics controller system functions may be integrated with the large amounts of memory sufficient to act as a high performance video memory. Furthermore, the present invention provides for a wide bus between the integrated video memory and the functional blocks of the graphics controller system. The present invention has circuits in these blocks for manipulating the video data from the wide bus so that data transfer remains compatible to the various operational VGA modes.











BRIEF DESCRIPTION OF THE DRAWINGS





FIG. 1

illustrates the general organization of a graphics controller system in the prior art;





FIG. 2

illustrates the general organization of a graphics controller integrated circuit according to the present invention;





FIG. 3A

is a circuit diagram of a CMOS logic inverter as connected in a prior art integrated circuit manufactured according to a logic process;





FIG. 3B

is a circuit diagram of a CMOS logic inverter as connected in a prior art integrated circuit manufactured according to a DRAM process;





FIG. 4

is a circuit diagram of a CMOS logic inverter as connected according to the present invention in an integrated circuit manufactured according to a DRAM process;





FIG. 5

is a cross-sectional diagram of a P-channel transistor in an N-type well, which form most of the analog circuits in the graphics controller portion of the integrated circuit shown in

FIG. 2

;





FIG. 6A

is a circuit diagram of a CMOS driver stage of an output buffer found in the prior art;





FIG. 6B

is a circuit diagram of a CMOS driver stage of an output buffer according to the present invention;





FIG. 7

is a block diagram detailing the organization of a graphics controller system according to the present invention;





FIG. 8

illustrates the organization of the output signals of the Bus Read Latch of the graphics controller system of

FIG. 7

; and





FIGS. 9A-9D

illustrate the organization of the multiplexer at the output of the Bus Read Latch of the graphics controller system of FIG.


7


.











DESCRIPTION OF THE PREFERRED EMBODIMENT(S)




In accordance with the present invention, the graphics controller functions are integrated on the same integrated circuit substrate as the video memory, as shown in

FIG. 2. A

single integrated circuit


20


has a portion of its substrate for an advanced graphics engine


22


, the circuitry which handles the graphics controller functions and manipulates the video data. The integrated circuit


20


also has another portion of the substrate for a video memory


21


, in the form of a DRAM. A 128-bit wide data interface


25


connects the graphics engine


22


and the DRAM


21


. The DRAM has 7.3 megabits, organized as 56K×128 bits. In every memory cycle, 128 bits are accessed which can then be multiplexed down to the required number of bits for communication to the CPU through a CPU interface


23


or to the display through a display interface


24


.




The integrated circuit


20


also has other interfaces, such as an infrared interface


26


for wireless transmission of data between the mobile computer and another device, a PCMCIA host adaptor interface


27


for connections to devices, such as modems, hard disks, etc., which are designed to meet the PCMCIA specification, and a video stream interface


28


for receiving video signals from a variety of sources, such as television or VCR signals. The video stream interface


28


is adapted to the VAFC (


V


ESA


A


dvanced


F


eature


C


onnector) standards being promulgated by the Video Electronics Standards Association (VESA).




Integrating a large block of DRAM (approximately 7 megabits) on the same substrate with a large block of logic (approximately 40K to 50K of logic gates) required for the graphics engine


22


and various interfaces is not simply a matter of placing DRAM and logic circuits on the same integrated circuit substrate. The optimum technologies for a DRAM and for a logic circuit are electrically incompatible. Hence various steps, described below, must be taken to ensure that the performance of the DRAM circuits and the logic circuits are fully maintained and not compromised.




An integrated circuit process used for building logic gates uses one of the external supply voltages (V


DD


or V


SS


depending upon the substrate type) as the voltage to bias the substrate. On the other hand, an integrated circuit process used for building a DRAM uses an internally generated substrate bias voltage which is different from the external supply voltages. This is done primarily to lower the junction capacitance of the memory cell bit lines of the DRAM, as well as to improve the memory cell refresh time.




For example, in commonly used CMOS technology today, the substrate material is P-type silicon. A logic process uses the externally supplied V


SS


, or ground, voltage to connect to the substrate. The V


SS


line is also used in the circuit areas to provide the ground path for current flow in the pulldown, N-channel transistors in the logic circuits.

FIG. 3A

illustrates this point with a representative logic gate, an inverter having a pull-down, N-channel transistor


31


and a pull-up, P-channel transistor


32


. The PMOS transistor


32


has its source connected to a metal line


42


at the positive supply voltage, V


DD


, and the NMOS transistor


31


has its source connected to a metal line


41


at V


SS


. The P-type substrate in which the NMOS transistor is placed is also connected to the V


SS


line


41


. In

FIGS. 3A

(and


3


B and


4


), node


30


represents a metal-to-N+ source contact and node


40


represents a metal-to-P+ substrate contact.




Thus the same V


SS


metal tracks in the logic integrated circuit serves two functions: 1) as a substrate tap every 25-50 microns across the substrate surface, and 2) as a source terminal of the N-channel transistors of the logic circuits. The substrate taps are necessary to protect the circuits from going into a latch-up condition during operation, since each logic gate has both N-type and P-type transistors.




In a CMOS DRAM, however, the typical DRAM array is built with only N-type transistors and capacitors, and a major goal is to minimize the parasitic capacitance of the memory bit line. Since the N-type bit line junction areas contribute to a majority of the bit line capacitance and since the junction capacitance is greatly reduced by a reverse junction bias voltage, the P-type substrate is typically biased at −1.5 volts, termed V


BB


. This voltage is generated from an on-chip charge pump and thus has a limited capacity and a high output impedance. This results in the substrate voltage being relatively “noisy” due to the precharging and discharging of the large junction areas associated with the memory array, that are capacitively coupled to the substrate.




In the DRAM, the small amount of on-chip logic which handles the memory address decoding and the data read and write functions typically uses the V


SS


metal tracks only to connect to the source terminal of the N-channel, pulldown transistors and not as a substrate tap. As shown by a representative logic gate, an inverter, in

FIG. 3B

, a pull-down, N-channel transistor


33


has its source connected to a V


SS


metal line


43


. The P-type substrate of the NMOS transistor


33


is at V


BB


.




In fact, DRAMs typically do not have any substrate taps in the middle of the circuitry. The substrate taps are only made at the edges of the die. Since most of the logic blocks in a DRAM consist of a few cells, repeated many times, which together form a small portion of the total chip area, large P-to-N diffusion spacings, typically 25 microns, can be maintained in the logic cells to avoid latch-up. In contrast, in a logic circuit, which has many different cell types connected in a relatively random manner, the cell size is very important as it determines the total chip area. The P-to-N diffusion spacings are minimized, typically 5 microns, which requires the use of substrate taps in every cell to avoid latch-up.




To combine a significant amount of random logic to a significant amount of DRAM in a single integrated circuit requires that this problem be overcome. The present invention combines the random logic, the graphics engine


12


and interfaces, with the DRAM


11


, in an integrated circuit


20


manufactured in accordance with a DRAM process. The logic circuits of the integrated circuit are redesigned to decouple the V


SS


line connected to the source terminals of the N-channel, pulldown transistors from the P-substrate tap. As shown in

FIG. 4

, the source of the N-channel, pulldown transistor


35


of a representational logic circuit is connected to a V


SS


line


45


(at 0 volts), while the substrate is tapped by a V


BB


line


47


(at −1.5 volts). The P-to-N diffusion spacings are then minimized without deleterious consequences.




Additionally, the graphics engine


22


of the integrated circuit


20


has analog circuits. In an exemplary analog circuit, a low-pass filter is often used and an RC circuit is required. Heretofore, the capacitor of the RC circuit has been typically formed by an NMOS transistor with its gate forming one terminal and the shorted source/drains forming the other terminal of the capacitor. Since the body bias of this transistor is the noisy substrate voltage generated from the on-chip charge pump required for the DRAM


21


, some of the noise couples inevitably into the low-pass filter circuit. To avoid this problem, the analog circuits according to the present invention are designed with mostly P-channel transistors within independent N-wells which are connected to the positive and relatively quiet-reference voltage, V


DD


, as shown in FIG.


5


. The N-wells isolate the terminals


37


and


38


of the capacitors and the rest of the analog circuits from the “noisy” substrate voltage.




On the periphery of the integrated circuit are buffer circuits for transferring signals to and from the external world. Problems arise with the DRAM technology in the driver stage circuit of each output buffer. Shown in

FIG. 6A

is a representative output driver stage found in the prior art. Basically an inverter, the driver stage has two transistors, an N-channel, pulldown, driver transistor


50


having its source connected to a voltage supply bus


52


at an external negative supply voltage, V


SSQ


, and a P-channel, pullup, driver transistor


51


having its source connected to a voltage supply bus


53


at an external positive supply voltage, V


DDQ


. The words, “negative” and “positive,” refer to one supply voltage relative to the other supply voltage and the “Q” in the subscript indicates that the supply voltages are not necessarily the same as the supply voltages, V


DD


and V


SS


, in the other parts of the integrated circuit. This allows the interior portions of the integrated circuit to operate at voltage levels, 0-+3.3V, while the peripheral output driver circuits operate at different levels, 0-+5.0V, for example. Furthermore, the separation of supply voltages provides for some insulation from noise between the interior and peripheral portions of the integrated circuit. The drains of the two transistors


50


and


51


are connected together and to an output terminal


54


. The gates of the transistors


50


and


51


are also connected together to an input terminal


57


connected to the rest of the buffer circuit (not shown).




During the switching of output signals, the output signal voltages invariably tend to overshoot the V


DDQ


and V


SSQ


supply voltages due to an impedance mismatch between the driver transistors and the external load. When the output signal voltage, DATA OUT, is being driven high in response to an internal signal, dataout*, going low at the input terminal


57


, for instance, the prior art design results in a parasitic diode


55


, marked by dotted lines, becoming forward-biased when the overshoot exceeds 0.6 volts. The diode


55


is formed by the junction of the drain of the P-channel, pullup transistor


50


and the N-well holding the transistor, which is also connected to V


DDQ


This forward-biasing action results in the injection of positive charges, or holes, into the substrate which works against the on-chip substrate bias voltage generator. The amount of hole injection is a function of the severity of the overshoot, the number of output buffers, and the frequency of switching. If the substrate bias voltage generator is overwhelmed by excessive hole injection, functional failures or soft errors occur in the on-chip DRAM.




To avoid this problem, a new output driver circuit, shown in

FIG. 6B

, is used. In the drawing, the same reference numerals are used where the operation or connection of the referenced element remains unchanged from FIG.


6


A. Different reference numerals are used if the operation or connection of the element is different. In the new, representative driver circuit, for each bank of buffer circuits at particular V


DDQ


and V


SSQ


voltages, the N-wells in which the P-channel driver transistors


51


of the bank are located are raised to a voltage higher than the V


DDQ


voltage for the bank. An on-chip bias generator is connected to the V


DDQ


bus


53


for a reference to generate a voltage at NV


DD


, 1 volt higher than V


DDQ


. The bias generator (not shown) is connected to a bus


59


at NV


DD


, which is connected to the N-wells of each of the P-channel transistors


51


. If several P-channel transistors


51


are in a single N-well, the bus


59


is connected to the N-well in a series of taps, one located near each transistor


51


. This arrangement lowers the possibility of latch-up.




For each bank of output buffers on the same integrated circuit, a different NV


DD


bias generator, referencing the V


DDQ


supply for that bank, is used to generate the higher voltage. This allows the N-well(s) of an output buffer bank which is driven from a +3.3V supply to be biased at +4.3V, while an output buffer bank driven by a +5.0V supply, has its N-well(s) biased at +6.0V.




As shown in

FIG. 6B

, the parasitic diode


58


formed by the drain of the transistor


51


and the N-well holding the transistor


51


now has an extra 1 volt of N-well bias. The diode


58


does not become forward-biased unless the overshoot exceeds 1.6V. This extra margin of safety results in a greatly diminished level of hole injection into the substrate and thus prevents the occurrence of soft errors or functional failures in the on-chip DRAM.




Integrating the video memory, the DRAM


21


, with a graphics controller results in significant power savings compared to present graphics controller systems with external DRAM. Capacitance in present graphics controller systems is comprised of the capacitance of the I/O pins of the DRAM packages and the controller package plus capacitance of the traces on the motherboard which carries and connects the DRAM and controller packages. The present invention has a roughly twenty-fold reduction of the video memory address, data and control bus capacitance. This results in an equivalent power savings since most of these lines are continuously switching at high frequencies.




Another source of power savings is the 128-bit wide memory word which can be transferred between the graphics engine


22


and the video memory


21


. The controller, i.e., the graphics engine


22


, has 128 bits of data available after one DRAM read cycle. In comparison, the graphics controller system in the prior art requires four or eight read cycles, depending upon a 32-bit or 16-bit wide DRAM organization, respectively. Since a fixed amount of power is consumed every DRAM cycle, the present invention has a savings of three-fourths to seven-eighths of the prior art power dissipation.




Furthermore, the present invention uses memory very efficiently. The capacity of the video memory


21


is not required to fall on high order binary boundaries, such as combinations of DRAM integrated circuits forming a memory of 32K×128 bits, or 64K×128 bits, the next larger size. In the present invention, the addition of memory blocks, with a typical size of 256K(2


18


) bits each, achieves the required capacity for the video memory


11


. Memory capacity can be customized for a particular application. For example, a 1024×768×8 display requirement requires a video memory of 6.4 megabits, which can be built with 24 memory blocks. With external DRAMs, a video memory of 8 megabits is required since the standard DRAM package has 4 megabits. The video memory


21


of the integrated circuit


20


can be organized to be of any width, depth or capacity and need not follow the multiplexed addressing architecture associated with standard DRAMs.




With the ability to incorporate large amounts of logic and memory in a single integrated circuit, the present invention provides for a video memory and logic for graphics control operations in one integrated circuit.

FIG. 7

illustrates the organization of the advanced graphics controller system of

FIG. 2

in greater detail. As stated previously, the memory


21


is organized from dynamic RAM memory cells arranged 56K×128 bits wide. Stated differently in terms of memory blocks, the memory


21


has the storage capacity of 2


18


×28 bits. The 128-bit interface


25


in

FIG. 2

is realized by a 128-bit wide bus


61


, organized as sixteen 8-bit bytes, so that a Write operation can be performed at the byte level into the memory


21


. Connected to the bus


61


is a Graphics User Interface (GUI) Acceleration block


62


, part of the Graphics Engine


22


of FIG.


2


. The CPU Interface


23


of

FIG. 2

is realized by a Host Bus Interface block


63


in

FIG. 7

, and the Display Interface


24


is by a CRT Display block


64


and an LCD Display Interface block


65


. All the blocks


62


-


65


are indicated by dotted lines in FIG.


7


. Not shown in

FIG. 7

are the Infrared Interface


26


, PCMCIA Host Adaptor


27


and the Video Source Interface


28


. The circuits for the Interfaces


26


and


28


, and Adaptor


27


are presently found in separate printed circuit boards in personal computer systems and may be integrated onto the single integrated circuit with the techniques described previously.




The GUI acceleration block


62


has a 128-bit wide register


70


, which receives data from the bus


61


. The register


70


splits its contents into two parts, 64-bits apiece, to a 64-bit BIT BLock Transfer (BITBLT) operation unit


71


for performing the operations. The output of the unit


71


is fed into an assemble register


72


on a 64-bit wide path. After a BITBLT operation, the register


72


builds up a 128-bit word for transfer back to the bus


61


. This organization represents the best compromise between performance and space on the integrated circuit; the transfer rate is maximized between the memory


21


and the operation unit


71


, yet an optimum size of 64 bits for the unit


71


is maintained. A 128-bit BITBLT operation unit occupies a very large amount of integrated circuit space, while a unit for 32 bits slows BITBLT operations too much.




The host bus interface block


63


lies between the bus of the host, i.e., the CPU of the computer system, and the bus


61


. The interface block


63


provides a bidirectional data path between the 128-bit bus


61


and a 32-bit bus of the host. The block


63


has a Bus Read Latch


73


which holds a 128-bit wide word from the bus


61


. The output of the latch


73


is connected to the input of a multiplexer


74


, which selects 32 bits from the 128-bit latch


73


for the host bus. For host Read operations, the selected 32 bits should contain four consecutive bytes in the host bus address space. Depending upon the VGA-compatible format and other extended storage formats in use, these four bytes may be scattered among the 16 bytes, 16×8 bits equals 128 bits, of data stored in the latch


73


.




The output of the latch


73


is illustrated in

FIG. 8

with each of the bytes labeled 0-F. To implement 4-byte access properly for all VGA-compatible and additional extended storage formats, the multiplexer


74


is implemented as four separate single byte multiplexers


74


A-


74


D, one for each byte read back upon the host bus. Each of the 8-to-1 multiplexers


74


A-


74


D are illustrated in

FIGS. 9A-9D

respectively together with the particular input bytes from the latch


73


. Each of the multiplexers


74


A-


74


D respectively generates bytes 0-3 for the host bus.




The logic to generate the control signals, selda(


2


:


0


), seldb(


2


:


0


), seldc(


2


:


0


), and seldc(


2


:


0


), for the multiplexers


74


A-


74


D is listed in VHDL code in Appendices


1


and


2


A-


2


D. These control signals are derived from VGA standard control bits in programmable control registers which determine the storage format in use, and additional internal state information in the memory controller state machine. The present invention uses the following standard control bits:




SR


4


[


3


]=Chain-


4






GR


5


[


3


]=Read Mode




GR


5


[


4


]=Odd/Even




GR


4


[


0


]=Read Map Select[


0


]




GR


4


[


1


]=Read Map Select[


1


]




GR


6


[


0


]=APA/Text*, Graphics Mode




and an extended mode control bit:




PACPIX=Packed Pixel Format




As indicated in Appendix


1


, these control bits are used to generate control signals, tmp_pack, pack, rdplanar and wrplanar, which are used ultimately in generating the selda(


2


:


0


), seldb(


2


:


0


), seldc(


2


:


0


), and seldc(


2


:


0


) control signals. From these control signals, other control signals are generated for each of the multiplexers


74


A-


74


D. For example, the control logic and signals for the first multiplexer


74


A are illustrated in FIG.


2


A. The control signals in Appendix


1


generate control signals sela(


0


)-sela(


7


). These eight control signals then generate the three selda control signals. The end of Appendix


2


A shows how the multiplexer


74


A responds to the control signals by selecting one of input eight bytes, mrd_dta(X DOWNTO Y), as the output byte, cpul


0


.




Appendices


2


B-


2


D similarly illustrate the details of the control signals and operation of the multiplexers


74


B-


74


D respectively.




With reference to

FIG. 7

once again, the block


63


also has a FIFO (First-In, First Out) register


77


which has its input terminals connected to the 32-bit host bus for transfer of data from the host bus. The output terminals of the FIFO


77


, also 32 bits wide, are connected to a graphics controller unit


76


, which also has its input terminals connected to the output terminals of the multiplexer


74


. The graphics controller unit


76


manipulates data bits and can selectively load a single, or multiple, byte to the bus


61


through four drivers


75


in response to commands from the CPU (not shown). For VGA compatibility, the 32 bits sent from the Bus Read Latch


73


to the graphics controller unit


76


during a memory write operation are always be the four bytes from the four VGA planes containing the last byte read from the memory module


60


on the host bus. The multiplexer


74


is set properly in a VGA compatible mode upon completion of each Read operation. This setting for the multiplexer


74


is again dependent upon VGA or extended mode and is derived from logic equations similar to those shown in appendices


1


and


2


A-


2


D. The four drivers


75


reverse the data selection operation performed by the multiplexer


74


. The multiplexer


74


selects four bytes out of the 16 bytes read from the memory


60


to forward to the host bus. The drivers


75


position the four bytes from the host bus in a write operation properly in the 16 byte slots on the memory bus


61


so that the data is stored properly for subsequent retrieval. Control of these drivers


75


is derived from the memory controller control states and control register bits that define the VGA-compatible storage format, or other additional extended storage format, in use.




The CRT display block


64


provides a data path between the memory


21


and the CRT display which is compatible with the VGA standard. The block


64


has a Data Rotate unit


80


, which receives 128 bits from the bus


61


. The unit


80


is connected over four 32-bit paths to the input terminals of a CRT FIFO register


81


which has a capacity of


4


words, each word 128 bits wide. Stated differently, the FIFO register


81


is 128 bits wide and four stages deep, and can be filled in four memory fetches. The output terminals of the FIFO register


81


are connected to a VGA Display Path unit


82


over a 32-bit wide path. All VGA compatible graphics controllers for notebook computers today are based on a 16-bit or 32-bit memory bus to an external video memory buffer. The present 128-bit bus architecture, in comparison, allows improved performance while reducing power consumption. However, to achieve VGA compatibility and improve performance, byte swapping is required in transferring data from the memory bus


61


to the FIFO register


81


. This swapping is implemented in the Data Rotate unit


80


. From the 128 bits of the FIFO register


81


,


32


bits are selected and sent to the VGA Display Path unit


82


. Appendix


3


specifies the control signals and implementation in terms of the control register bits which define the VGA storage format or extended mode storage format in use, as well as the memory controller control states.




The control signals are:




fontcy is a signal derived from the internal state machine and indirectly from the previously identified control signal, GR


6


[


0


]; a control signal, such as fontcy, is found in present VGA compatible controllers to determine a font or ASCII fetch operation in text mode.




swap


0


, swap


1


are the 0 and 1-order bits of the CRT address counters found in VGA compatible controllers; these signals are derived from the Chain-


4


and Odd/Even control signals mentioned previously.




rscntb


0


, rscntb


1


are the 0 and 1-order bits of the 5-bit row scan counter found in VGA compatible controllers; the row scan counter is used for tracking the rows of a character in text mode.




lword is the Chain-


4


, or SR


4


[


3


], control signal identified previously.




crsr_dtct is the cursor detect signal in VGA-compatible controllers; and




TEXT, apa are the true and inverted of the GR


6


[


0


] control signal previously identified.




These control signals are used to generate further signals, memcl_dta and memc


2


_dta. Stated generally, these two signals are either the crsr_dtct signal in text mode, or bit


96


(or bit


36


respectively) of the data bits from the 128-bit word on the bus


61


in graphics mode. The signals, mema_dta, are basically the four 32-bit words of data formed from the 128-bit word on the bus


61


. The words for the bit locations,


127


-


96


and


63


-


32


, are modified so that the bits


96


and


32


are either crsr_dtct in text mode or respectively data bits


96


and


32


from the bus


61


. Finally, swapa and swapb are the control signals to the multiplexers in the Data Rotate unit


80


. It should be noted that the symbol, “&”, represents a concatenation of signals.




Appendix


4


illustrates the operation of the Data Rotate unit


80


, which receives the mema_dta signals as input and transmits crt_fin signals as output to the FIFO register


81


. The first VGA (32-bit) word, crt_fin(


31


DOWNTO


0


), may be filled by any one of the four incoming 32-bit words from the bus


61


, depending upon the state of control signals swapa. Similarly, the third VGA (32-bit) word, crt_fin(


95


DOWNTO


64


), may be filled by any one of the four incoming 32-bit words from the bus


61


, depending upon the state of control signals swapb. The second and fourth VGA words, crt_fin(


63


DOWNTO


32


) and crt_fin(


127


DOWNTO


96


), are respectively filled by the third and first incoming 32-bit words from the bus


61


.




The CRT FIFO


81


then selectively feeds 32-bit words into a VGA Display Path unit


82


and a Color Palette RAM


83


. The RAM


83


is, in turn, connected to a digital-to-analog converter (DAC)


84


. The RAM


83


feeds 18 bits of data, 6 bits for each component color, to the DAC


84


. The DAC


84


generates the analog signals for a CRT color display.




The RAM


83


also feeds data into the LCD-Display Interface block


65


which is organized for dual scan LCD panel displays. The general operation of the block


65


is that a Shader unit


96


receives the data from the RAM


83


. The unit


96


generates the grayscale values for the LCD pixels. In passing, it should be noted that the word, grayscale, implies intensity for a color LCD display. These values are sent to a Formatter unit


92


which, as the name implies, formats the grayscale values for the integrated circuit(s) which drive the electrodes of the LCD display. The Shader unit


96


also sends its grayscale values through several buffer units


95


,


94


,


93


,


90


and


91


(and along the bus


61


) before being formatted and transmitted by the Formatter unit


92


for a dual scan operation. Dual scan LCD panels are commonly used today in notebook computers and the buffer units of the block


65


provide for the memory by which, in alternating operation, the display in one LCD panel is updated by the Shader unit


96


while the display in the second panel is maintained from memory.




Therefore, while the description above provides a full and complete disclosure of the preferred embodiments of the present invention, various modifications, alternate constructions and equivalents may be employed without departing from the true scope and spirit of the invention. For example, while the present invention has been described in terms of an integrated circuit with a memory capacity of some 7.3 megabits and some 40-50K logic gates, one could use the present invention to build an integrated circuit of reduced size. An integrated circuit having a memory capacity of 2 megabits, the capacity of basic VGA video memory in graphics cards, with 30K logic gates, the approximate amount of logic in present graphics controller integrated circuits, still realizes the advantages of the present invention. Costs, power dissipation and occupied space are reduced, and performance is enhanced, for instance. The present invention, therefore, should be limited only by the metes and bounds of the appended claims.



Claims
  • 1. A graphics controller integrated circuit for connection to a CPU and a display, said graphics controller integrated circuit formed on a substrate and comprisinga video memory holding video data; a graphics engine including a Bit Block Transfer operation unit performing operations upon said video data up to a predetermined number of bits at a time responsive to instructions from said CPU; and a data interface connected to said video memory and to said graphics engine, said data interface being a multiple of said predetermined number of bits wide.
  • 2. The graphics controller integrated circuit of claim 1 further comprising a first register connected between said data interface and said Bit Block Transfer operation unit, said first register receiving a full number of said data interface bits at a time from said data interface and sending said predetermined number of bits at a time to said Bit Block Transfer operation unit.
  • 3. The graphics controller integrated circuit of claim 2 further comprising a second register connected between said data interface and said Bit Block Transfer operation unit, said second register receiving said predetermined number of bits at a time from said Bit Block Transfer operation unit and sending said full number of said data interface bits at a time to said data interface.
  • 4. The graphics controller integrated circuit of claim 2 wherein said first register receives 128 bits at a time from said data interface and sends 64 bits at a time to said Bit Block Transfer operation unit.
  • 5. The graphics controller integrated circuit of claim 1 further comprising a latch connected to said data interface, said latch receiving said full number of said data interface bits at a time from said data interface, and a multiplexer connected to said latch and a bus connected to said CPU, said bus carrying said predetermined number of bits at a time, said multiplexer selecting said predetermined number of bits from said latch for said bus.
  • 6. The graphics controller integrated circuit of claim 5 wherein said multiplexer controllably selects bytes from said latch to form said predetermined number of bits for said bus.
  • 7. The graphics controller integrated circuit of claim 6 further comprising a first FIFO register and a graphics controller unit, said first FIFO register connected to said bus and to said graphics controller unit, said first FIFO register receiving said predetermined number of bits at a time from said bus and sending said predetermined number of bits at a time to said graphics controller unit, said graphics controller unit connected to said data interface and controllably selectively sending one or more bytes to said data interface in response to said CPU.
  • 8. The graphics controller integrated circuit of claim 7 wherein said graphics controller unit is further connected to said multiplexer and receiving said predetermined number of bits at a time from said multiplexer.
  • 9. The graphics controller integrated circuit of claim 6 wherein said latch receives 128 bits at a time from said data interface and said bus carries 32 bits at time.
  • 10. The graphics controller integrated circuit of claim 1 further comprising a DAC, a color palette RAM, a second FIFO register and a data rotate unit, said data rotate unit connected to said data interface and to said second FIFO register, said second FIFO register connected to said color palette RAM, said color palette RAM connected to said DAC, and said DAC connected to a display so that a path is formed to move display data from said data interface to said display.
  • 11. The graphics controller integrated circuit of claim 10 wherein said data rotate unit receives said full number of said data interface bits at a time from said data interface, controllably swaps bytes of said data interface bits, and sends said byte-swapped data interface bits at a time to said second FIFO register.
  • 12. The graphics controller integrated circuit of claim 11 wherein said second FIFO register sends a subset of said byte-swapped data interface bits to said color palette RAM.
  • 13. The graphics controller integrated circuit of claim 11 further comprising an LCD formatter unit, an LCD shader unit, a color palette RAM, a second FIFO register and a data rotate unit, said data rotate unit connected to said data interface and to said second FIFO register, said second FIFO register connected to said color palette RAM, said color palette RAM connected to said LCD shader unit, and said LCD shader unit connected to said LCD formatter unit, said LCD formatter unit connected to an LCD display so that a path is formed to move display data from said data interface to said LCD display.
  • 14. The graphics controller integrated circuit of claim 13 further comprising a plurality of buffers connected to said data interface, said LCD formatter unit and said LCD shader unit, said buffers interconnected such that memory is provided for dual scan operation of said LCD display.
CROSS-REFERENCES TO RELATED APPLICATIONS

This application is a continuation of and claims the benefit of U.S. application Ser. No. 08/883,538, filed Jun. 26, 1997, now U.S. Pat. No. 6,041,010, which was a File Wrapper Continuation of U.S. application Ser. No. 08/581,086, filed Dec. 29, 1995, abandoned, which was a Rule 60 Division of U.S. application Ser. No. 08/262,412, filed Jun. 20, 1994, abandoned, the disclosures of which are incorporated herein by reference.

US Referenced Citations (43)
Number Name Date Kind
4191956 Groothuis Mar 1980 A
4228528 Cenker Oct 1980 A
4303986 Lans Dec 1981 A
4691295 Erwin Sep 1987 A
4740782 Aoki et al. Apr 1988 A
4812836 Kurakake et al. Mar 1989 A
4816815 Yoshiba Mar 1989 A
4951232 Hannah Aug 1990 A
4956708 Itagaki Sep 1990 A
5027212 Marlton et al. Jun 1991 A
5031092 Edwards et al. Jul 1991 A
5083047 Horie Jan 1992 A
5083294 Okajima Jan 1992 A
5144223 Gillingham Sep 1992 A
5170154 Mantopoulos Dec 1992 A
5198708 Gillingham Mar 1993 A
5202962 Matsuo et al. Apr 1993 A
5258843 Truong Nov 1993 A
5267201 Foss et al. Nov 1993 A
5283761 Gillingham Feb 1994 A
5293540 Trani et al. Mar 1994 A
5297148 Harari et al. Mar 1994 A
5303334 Snyder et al. Apr 1994 A
5305283 Shimokura et al. Apr 1994 A
5319388 Mattison et al. Jun 1994 A
5321806 Meinerth et al. Jun 1994 A
5323343 Ogoh et al. Jun 1994 A
5353402 Lau Oct 1994 A
5363500 Takeda Nov 1994 A
5386573 Okamoto Jan 1995 A
5392393 Deering Feb 1995 A
5396586 Van Aken Mar 1995 A
5406523 Foss et al. Apr 1995 A
5414662 Foss et al. May 1995 A
5434969 Heilveil et al. Jul 1995 A
5442748 Chang et al. Aug 1995 A
5448733 Satoh et al. Sep 1995 A
5469401 Gillingham Nov 1995 A
5473573 Rao Dec 1995 A
5537128 Keene et al. Jul 1996 A
5694143 Fielder et al. Dec 1997 A
5712664 Reddy Jan 1998 A
5790839 Luk et al. Aug 1998 A
Non-Patent Literature Citations (5)
Entry
Sato et al., “A Wafer-Scale-Level System Integrated LSI Containing Eleven 4-Mb DRAM's, six 64-kb SRAM's, and an 18K-Gate Array,” IEEE J. of Solid-State Circuits (1992) 27:1608-1613.
Fujita et al., “IMAP: Integrated Memory Array Processor—Toward a GIPS Order SIMD Processing LSI,” IEICE Trans. Electron. (1993) E76-C:1144-1150.
IBM Technical Disclosure Bulletin, 35(1A):45-46, Jun. 1992.
NEC Research and Development, vol. 33, No. 4, Oct. 1992, Tokyo, JP, pp. 585-594, “Integrated Memory Array Processor.”
IEEE Journal of Solid-State Circuits, vol. 25, No. 1, Feb. 1990, N.Y., pp. 30-35, “A 50 MHz 8 Mbit Video Ram with a Column Direction Drive Sense Amplifier.”
Continuations (2)
Number Date Country
Parent 08/883538 Jun 1997 US
Child 09/467942 US
Parent 08/581086 Dec 1995 US
Child 08/883538 US