The present invention relates generally to intra-chip high-speed serial communication for integrated circuit semiconductor devices. Specifically, the communication system may provide one or more of high-speed, high-bandwidth, low-power, and transparent intra-chip communication by utilizing Time Division Multiplexing (TDM) of signals over one or more of coaxial or Wafer Level Package (WLP) waveguides.
Serial links, so-called Serializer/Deserializers (SerDes), have already begun to replace parallel communication systems for inter-chip communications. Examples include Serial Advanced Technology Attachment (SATA) which replaces Parallel ATA (PATA), Peripheral Component Interconnect Express (PCI-e) which replaces parallel PCI (PCI-X), and Gigabit Ethernet (GbE) which replaces Fast Ethernet (100Base-T Ethernet).
There are several advantages of SerDes over parallel communication systems. SerDes utilize low-loss LC transmission lines that can yield a 10× improvement in signal rise times versus the RC interconnects utilized by parallel communication systems. SerDes is more power efficient than parallel communication systems for transmission of multi-gigabit-per-second (multi-Gbps) data signals over distances greater than several millimeters on chip at deep submicron process technology nodes. Finally, SerDes provides embedded clock information in the transmitted data stream, overcoming many of the clock skew and clock jitter limitations of multi-Gbps parallel communication systems.
Consequently, there has been much work to implement serial communication systems for intra-chip communication. These systems entail multiple SerDes communication links over standard on-chip metal-layer interconnect technology (for example Dally, “Interconnect-Centric Computing,” 2007). However, on-chip metal layer interconnect technologies suffer from high line losses that must be compensated for or managed by the communication system, resulting in one or more of unfavorable power consumption, circuit complexity, circuit gate count, and circuit area. Other on-chip communication systems have implemented standard data communication over Wafer Level Package (WLP) interconnect technology, but have not utilized SerDes methodologies (for example Carchon, “Wafer-Level Packaging Technology for Extended Global Wiring and Inductors,” 2003). WLP has similar line-loss impairments as on-chip interconnects, but to a lesser extent, due to the utilization of thicker conductor and insulator layers deposited on the wafer. The present invention implements an intra-chip communication system that shares the benefits of inter-chip SerDes systems by mitigating the drawbacks of WLP or metal-layer communication systems. Such intra-chip communication system utilizes a TDM scheduling methodology to serialize and transceive intra-chip digital data over coaxial interconnects. Such intra-chip communication system does not preclude the use of WLP interconnects; in fact, an alternate embodiment is to use WLP interconnects in place of, or along with, coaxial interconnects.
A coaxial interconnect is a transmission line comprising a central conductor surrounded by a conductive outer shield, with an insulator or gap disposed therebetween. Such total enclosure of the central conductor by the outer shield prevents electromagnetic fields from escaping the vicinity of the coaxial interconnect—similar to micro-strip, strip-line, or co-planar waveguide technologies—which results in lower crosstalk and more constant impedance. A commercially available coaxial interconnect technology is ‘microCoax,’ developed by Bridgewave Communications, Inc. of Santa Clara, Calif. (Pasternak, et. al. U.S. Pat. No. 7,520,054-B2). It has been designed and used for extremely high frequency inter-chip signal communication in the 30 to 300 GHz range, called Millimeter Wave (MMW) communication. An advantage of microCoax is that it is amenable to low-cost chip wire-bonding manufacturing techniques. microCoax, and other coaxial interconnect technologies, were neither explicitly designed for nor have been previously used for transceiving high-speed serial intra-chip signals, primarily due to the lack of a suitable, scalable, and practical intra-chip communication system, which the present invention addresses and solves.
It is well known that multi-Gbps single-system-clock-cycle (single-cycle) cross-chip communication—distances greater than a few millimeters—has become impossible. This necessitates multi-cycle Finite State Machine (FSM) design methodologies for digital logic or implementation of high-speed intra-chip communication systems capable of transceiving signals across chip within a single system clock cycle. Because multi-cycle FSMs are much more complicated than single-cycle FSMs to implement, verify, and test, and due to well established Electronic Computer-Aided Design (ECAD) models for single-cycle FSM design, single-cycle FSMs are preferred for digital logic.
Accordingly, what is desired, and has not heretofore been developed, is an intra-chip communication system that utilizes coaxial interconnects to achieve high-speed signal communication over distances greater than several millimeters. It is further desired that the intra-chip communication system be capable of transmitting and receiving such signals over such distances within a single system clock cycle, thereby being amenable to standard single-cycle FSM design methodologies. The intra-chip communication system utilizes TDM to aggregate multiple low-speed signals onto one or more high-speed coaxial interconnects using SerDes methodologies described in the following description of the invention.
It is an object of the present invention to provide a high-speed serial intra-chip communication link with low-loss transmission lines.
It is an object of the present invention to provide an intra-chip serial communication link that may utilize at least one wire-bonded coaxial transmission line interconnect.
It is a further object of the present invention to provide an intra-chip serial communication link that may utilize at least one WLP transmission line interconnect.
It is an object of the present invention to provide an intra-chip serial communication link utilizing at least one coaxial transmission line interconnect that employs TDM.
It is a further object of the present invention to provide a low-power intra-chip communication link.
It is a further object of the present invention to provide an intra-chip serial communication link capable of achieving cross-chip signal transmission and reception within a single system clock cycle.
It is a further object of the present invention to provide an intra-chip serial communication link capable of achieving signal transmission and reception within multiple system clock cycles.
It is an object of the present invention to provide an intra-chip communication system comprising at least one intra-chip communication link.
a is a perspective view of an exemplary chip showing several coaxial interconnects.
b is a cross-sectional view of a portion of a single coaxial interconnect adjoining a chip.
a is an exemplary circuit diagram of an intra-chip TDM communication link of the present invention in which circuit elements may process local signals prior to serialization and after deserialization.
b is an exemplary circuit diagram of an intra-chip TDM communication link of the present invention in which no circuit elements are present to process local signals prior to serialization and after deserialization.
a is an exemplary prior art FSM circuit diagram.
b is an exemplary FSM circuit diagram with combinational logic partitioned.
a is an exemplary circuit diagram of an intra-chip TDM communication link implementing single-cycle operation via combinational logic partitioning.
b is an exemplary waveform diagram showing the TDM scheduling methodology of the single-cycle intra-chip TDM communication link of
a is an exemplary circuit diagram of an intra-chip TDM communication link of the present invention in which circuit elements may process local signals prior to serialization and after deserialization.
b is an exemplary circuit diagram of an intra-chip TDM communication link of the present invention in which no circuit elements are present to process local signals prior to serialization and after deserialization.
Heretofore and henceforth the term ‘chip’ is used to describe an integrated circuit device, including but not limited to a Field Programmable Gate Array (FPGA), Application Specific Integrated Circuit (ASIC), a single- or multi-core microprocessor, a Standard Product (SP), or an Application Specific Standard Product (ASSP), od a System-on-Chip (SoC).
Heretofore and henceforth the term ‘micro-coax’ is used to describe a transmission line comprising a central conductor surrounded by a conductive outer shield with an insulator or gap therebetween (a coaxial interconnect).
Heretofore and henceforth the terms ‘circuit,’, ‘circuit element,’ ‘circuit block,’ ‘logic,’ ‘logic element,’ ‘logic block,’ and ‘logic cloud’ may be used interchangeably.
Heretofore and henceforth the term ‘amount of combinational logic’ is used to describe any appropriate quantifiable parameter of a circuit including, but not limited to, number of logic gates, logic depth, or logic complexity; the terms ‘small,’ ‘smaller,’ ‘large,’ and ‘larger’ describe such relative amounts of combinational logic.
Heretofore and henceforth the term ‘finite state machine’ and its abbreviation ‘FSM’ are used to describe any synchronous digital circuit.
Heretofore and henceforth the terms ‘digital data,’ ‘data stream,’ ‘digital signal,’ ‘signal,’ ‘bits,’ and ‘bit stream’ may be used interchangeably.
Heretofore and henceforth the terms ‘interconnect’ and ‘transmission line’ may be used interchangeably.
Heretofore and henceforth the terms ‘multiplexer’ and ‘mux’ are used to describe any so-called serializer circuits known in the art that map m input signals to n≦m output signals; the terms ‘demultiplexer’ and ‘demux’ are used to describe any so-called deserializer circuits known in the art that map n input signals to m≧n output signals.
a shows an exemplary chip 1 with several low-loss transmission line interconnects 10, each end of which is wire-bonded thereto by means of a wire bond pad 20. Wire bonds and wire bonding technology is well know in the art and is therefore not discussed herein.
a shows an exemplary circuit diagram of an intra-chip communication link utilizing a low-loss transmission line interconnect 10. The local data signal lines 104 may carry signals from some local region of a chip 1 to the transmit logic block 31. Also input to the transmit logic block 31 may be a transmit system clock line 100. Output from the transmit logic block 31 are parallel data signal lines 102.
The transmit logic block 31 may alternatively comprise combinational and sequential logic which may implement one or more synchronous FSMs. Otherwise the transmit logic block 31 may be absent, as shown by the alternate embodiment in
Input to the multiplexer 30 are the parallel data signal lines 102 and transmit serial clock line 101. Output from the multiplexer 30 is the low-loss transmission line interconnect 10. The multiplexer 30 performs a parallel-to-serial conversion of transmit parallel data signals 102s carried on the parallel data signal lines 102, outputting the resulting high-speed serial data signal 10s to the low-loss transmission line interconnect 10. Serialization is accomplished by strobing the transmit parallel data signals 102s, shown as d1 through d8 in
The high-speed serial data signal 10s may be, but is not limited to, single-ended, differential, current-mode, voltage-mode, non-return-to-zero (NRZ), return-to-zero (RZ), duobinary, Pulse Amplitude Modulated n (PAM-n), Manchester, differential Manchester, and so forth. Furthermore, although all exemplary circuit diagrams have been drawn with only one low-loss transmission line interconnect 10 between transmit and receive circuits of a single communication link, multiple low-loss transmission line interconnects 10 may be utilized for such single communication link. For example, two low-loss transmission line interconnects 10 may be used for differential signaling of high-speed serial data signal 10s, which is within the scope of the present invention. Similarly, a single low-loss transmission line interconnect 10 may be used for single-ended signaling of high-speed serial data signal.
The transmit serial clock period 101p may be some suitable fraction of the transmit system clock period 100p. This is the fastest clock in the system. The serialization factor—the number of parallel signals that may be serialized by the multiplexer 30 per unit time interval—may be a function of the ratio of the transmit system clock period 100p to the transmit serial clock period 101p. Exemplary
As shown in
Input to the receive logic block 41 are the parallel data signal lines 202. Also input to the receive logic block 41 may be a receive system clock line 200. Output from the receive logic block 41 are local data signal lines 204.
The receive logic block 41 may alternatively comprise combinational and sequential logic which may implement one or more synchronous FSMs. Otherwise the receive logic block 41 may be absent, as shown by the alternate embodiment in
The TDM communication system of the present invention may operate plesiochronously or mesochronously; there need not be any clock tree that distributes system clock signals or serial clock signals between transmit and receive circuits. The transmit system clock signal 100s corresponds to the system clock signal of the local clock domain of the transmit end of the TDM communication link; the receive system clock signal 200s corresponds to the system clock signal of the local clock domain of the receive end of the TDM communication link. The transmit serial clock signal 101s may be derived from the transmit system clock signal 100s. The receive serial clock signal 201s may be recovered from the high-speed serial data signal 10s using any suitable clock recovery methodology, including but not limited to bang-bang or Mueller-Müller, which are well known in the art and are therefore not discussed herein. Such clock recovery circuitry may be a functional portion of the demultiplexer 40. Finally, the receive system clock signal 200s may be derived from the receive serial clock signal 201s. Clock signal derivation may be accomplished by any suitable means, including but not limited to Phase-Locked Loops (PLLs) and Delay-Locked Loops (DLLs), which are well known in the art and are therefore not discussed herein.
The exemplary intra-chip TDM communication systems shown in
a shows an exemplary circuit diagram of a typical prior art single-cycle FSM; between transmit bistables 32 and receive bistables 42 may be combinational logic, depicted as combinational logic 50a, combinational logic 50b, and combinational logic 50c.
a shows the exemplary circuit diagram of
The logic delays shown in
Single-cycle operation may not always be desired. For example, it may be advantageous to utilize multiple system clock cycles to transfer large amounts of data to or from memory. The exemplary intra-chip TDM communication system shown in
a shows a transmit control signal line 105 that may be input to the transmit logic block 31, which may carry a transmit control signal 105s. Alternatively, such transmit control signal 105s may be generated within the transmit logic block 31 via combinational and sequential logic that may be present therein. The transmit control signal 105s may control, re-sequence, or synchronize m transmit parallel data signals 102s, shown as d1 through dm in
Input to the receive logic block 41 may be a receive control signal line 105, which may carry a receive control signal 105s. Alternatively, such receive control signal 105s may be generated within the receive logic block 41 via combinational and sequential logic that may be present therein. The receive control signal 105s may control or synchronize m receive parallel data signals 202s, such receive parallel data signals 202s may be alternatively transformed and subsequently distributed to some local region of the chip 1. The transmit logic block 31 and receive logic block 41 may alternatively comprise additional combinational and sequential logic which may implement one or more synchronous FSMs.
One specific embodiment of the intra-chip communication system is the microCoax; many variations or modifications would be apparent that do not depart from the spirit and scope of the invention. For example, an alternate intra-chip communication link embodiment may utilize WLP interconnects in place of micro-coax interconnects throughout. Furthermore, a first alternate intra-chip communication system may utilize WLP communication links throughout; a second alternate intra-chip communication system may utilize both micro-coax and WLP communication links. Still further, carbon nanotubes and superconducting interconnects may be utilized in place of or in combination with micro-coax interconnects. Such deviations from the exemplary descriptions and illustrations provided are within the scope of the present invention, which is defined by the appended claims.
This application claims the benefit of U.S. provisional patent application No. 61/464,186, entitled “Intra-chip Micro-coaxial TDM Communication Link and System,” filed Feb. 26, 2011 by applicant William Loh.
Number | Name | Date | Kind |
---|---|---|---|
6272130 | Panahi et al. | Aug 2001 | B1 |
7520054 | Pasternak et al. | Apr 2009 | B2 |
7860398 | Tatum et al. | Dec 2010 | B2 |
8464145 | Grivna et al. | Jun 2013 | B2 |
8538259 | Kai | Sep 2013 | B2 |
Number | Date | Country | |
---|---|---|---|
20120224613 A1 | Sep 2012 | US |
Number | Date | Country | |
---|---|---|---|
61464186 | Mar 2011 | US |