The field of invention pertains to the computing sciences generally, and, more specifically, to a processing core having a shared front end unit.
The memory controller 104 reads/writes data and instructions from/to system memory 108. The I/O hub 105 manages communication between the processor and “I/O” devices (e.g., non volatile storage devices and/or network interfaces). Port 106 stems from the interconnection network 102 to link multiple processors so that systems having more than N cores can be realized. Graphics processor 107 performs graphics computations. Power management circuitry (not shown) manages the performance and power states of the processor as a whole (“package level”) as well as aspects of the performance and power states of the individual units within the processor such as the individual cores 101_1 to 101_N, graphics processor 107, etc. Other functional blocks of significance (e.g., phase locked loop (PLL) circuitry) are not depicted in
In order to avoid the unnecessary delay of an instruction that does not have any dependencies on earlier “in flight” instructions, many modern instruction execution pipelines have enhanced data fetch and write back stages to effect “out-of-order” execution. Here, the respective data fetch stage 202 of pipelines 250, 260 is enhanced to include data dependency logic 205 to recognize when an instruction does not have a dependency on an earlier in flight instruction, and, permit its issuance to the instruction execution stage 203 “ahead of”, e.g., an earlier instruction whose data has not yet been fetched.
Moreover, the write-back stage 204 is enhanced to include a re-order buffer 206 that re-orders the results of out-of-order executed instructions into their correct order, and, delays their retirement to the physical register file until a correctly ordered consecutive sequence of instruction execution results have retired.
The enhanced instruction execution pipeline is also observed to include instruction speculation logic 207 within the instruction fetch stage 201. The speculation logic 207 guesses at what conditional branch direction or jump the instruction sequence will take and begins to fetch the instruction sequence that flows from that direction or jump. The speculative instructions are then processed by the remaining stages of the execution pipeline.
The present invention is illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements and in which:
The number of logic transistors manufactured on a semiconductor chip can be viewed as the semiconductor chip's fixed resource for processing information. A characteristic of the processor and processing core architecture discussed above with respect to
The dedication of logic circuitry to the speed-up of a currently active threads is achieved, however, at the expense of the total number of threads that the processor can simultaneously process at any instant of time. Said another way, if the logic circuitry units of a processor were emphasized differently, the processor might be able to simultaneously process more threads than the processor of
As observed in
A particular thread is assigned to a particular processing unit, and, each processing unit, as described in more detail below, is multi-threaded (i.e., can simultaneously and/or concurrently process more than one thread). Thus, if each processing unit can simultaneously/concurrently execute up to M hardware threads and there are N processing units, the processing core can simultaneously/concurrently execute up to M*N hardware threads. Here, the product M*N may be greater than the typical number of hardware threads that can simultaneously executed in a typical processing core (e.g., greater than 8 or 16 at current densities).
Referring to the shared front end unit 301, the shared front end unit contains program control logic circuitry 311 to identify and fetch appropriate “next” instructions for each thread. Here, the program control logic circuitry 311 includes an instruction pointer 312_1 to 312 _M*N for each thread and instruction fetch circuitry 313. Note that
The individual hardware threads may be serviced by the instruction fetch circuitry 313 on a time-sliced basis (e.g., a fair round robin approach). Further still, the instruction fetch circuitry 313 may be parallelized into similar/same blocks that fetch instructions for different hardware threads in parallel (e.g., each parallel block of instruction fetch circuitry services a different subset of instruction pointers).
Because, however, the individual hardware threads may be processed slower than a traditional processor (e.g., because per thread latency reduction circuitry has not been instantiated in favor of more processing units as described above), it is conceivable that some implementations may not require parallel instruction fetch capability, or, at least include less than N parallel instruction fetch channels (e.g., N/2 parallel instruction fetch blocks). Accordingly, in any of these cases, certain components of the front end unit 301 are shared by at least two of the processing units 302_1 to 302_N.
In a further embodiment, the program control logic circuitry 311 also includes an instruction translation look-aside buffer (ITLB) circuit 316 for each hardware thread. As is understood in the art, an ITLB translates the instruction addresses received from program memory 315 into actual addresses in physical memory where the instructions actually reside.
After an instruction has been fetched it is decoded by an instruction decoder 317. In an embodiment there is an instruction decoder for each processing unit (i.e., there are N decoders). Again, e.g., where the number of processing units N has been increased at the expense of executing threads with lower latency, there may be more than one processing unit per instruction decoder. Conceivably there may even be one decoder for all the processing units.
An instruction typically specifies: i) an operation to be performed in the form of an “opcode”; ii) the location where the input operands for the operation can be found (register and/or memory space); and, iii) the location where the resultant of the operation is to be stored (register and/or memory space). In an embodiment, the instruction decoder 317 decodes an instruction not only by breaking the instruction down into its opcode and input operand/resultant storage locations, but also, converting the opcode into a sequence of micro-instructions.
As is understood in the art, micro-instructions are akin to a small software program (microcode) that an execution unit will execute in order to perform the functionality of an instruction. Thus, an instruction opcode is converted to the microcode that corresponds to the functional operation of the instruction. Typically, the opcode is entered as a look-up parameter into a circuit 318 configured to behave like a look-up table (e.g., a read only memory (ROM) configured as a look-up table). The look-up table circuit 318 responds to the input opcode with the microcode for the opcode's instruction. Thus, in an embodiment, there is a ROM for each processing unit in the processing core (or, again, there is more than one processing unit per micro-code ROM because the per-thread latency of the processing units has been diluted compared to a traditional processor).
The microcode for a decoded instruction is then dispatched along with the decoded instruction's register/memory addresses of its input operands and resultants to the processing unit that has been assigned to the hardware thread that the decoded instruction is a component of. Note that the respective micro-code for two different instructions of two different hardware threads running on two different processing units may be simultaneously dispatched to their respective processing units.
In an embodiment, as discussed above, each processing unit 302_1 to 302_N can simultaneously and/or concurrently execute more than one hardware thread. For instance, each processing unit may have X sets of execution units (where X=1 or greater), where, each set of execution units is capable of supporting an entire instruction set such as an entire x86 instruction set. Alternatively or in combination, each processing unit can concurrently (as opposed to simultaneously) execute multiple software threads. Here, concurrent execution, as opposed to simultaneous execution, corresponds to the execution of multiple software threads within a period of time by alternating processing resources amongst the software threads supported by the processing unit (e.g., servicing each of the software threads in an round robin fashion resources). Thus, in an embodiment, over a window of time, a single processing unit may concurrently execute multiple software threads by switching the software threads and their associated state information in/out of the processing unit as hardware threads of the processing unit.
As observed in
Each processing unit includes register space 321 coupled to its internal functional unit set(s) 303 for keeping the operand/resultant data of the thread(s) the functional unit set(s) 303 are responsible for executing. If a single functional unit set is to concurrently execute multiple hardware threads, the register space 321 for the functional unit set 303 may be partitioned such that there is one register set partition for each hardware thread the functional unit set 303 is to concurrently execute. As such, the functional unit set 303 “operates out of” a specific register partition for each unique hardware thread that the functional unit set is concurrently executing.
As observed in
In an embodiment, each functional unit set 303 includes: i) an integer functional unit cluster that contains functional units for executing integer mathematical/logic instructions; ii) a floating point functional unit cluster containing functional units for executing floating point mathematical/logic instructions; iii) a SIMD functional unit cluster that contains functional units for executing SIMD mathematical/logic instructions; and, iv) a memory access functional unit cluster containing functional units for performing data memory accesses (for integer and/or floating point and/or SIMD operands and/or results). The memory access functional unit cluster may contain one or more data TLBs to perform virtual to physical address translation for its respective threads.
Micro-code for a particular instruction issues from its respective microcode buffer 320 to the appropriate functional unit along with the operand data that was fetched for the instruction by the fetch circuitry associated with the register allocation logic 322. Results of the execution of the functional units are written back to the register space 321 associated with the execution units.
In a further embodiment, each processing unit contains a data cache 329 that is coupled to the functional units of the memory access cluster. The functional units of the memory access cluster are also coupled to system memory 323 so that they can fetch data from memory. Notably, each register file partition described above may be further partitioned into separate integer, floating point and SIMD register space that is coupled to the corresponding functional unit cluster.
According to one scenario, operating system and/or virtual machine monitor (VMM) software assigns specific software threads to a specific processing unit. The shared front end logic 301 and/or operating system/VMM is able to dynamically assign a software thread to a particular processing unit or functional unit set to activate the thread as a hardware thread. In various embodiments, each processing unit includes “context switching” logic (not shown) so that each processing unit can be assigned more software threads than it can simultaneously or concurrently support as hardware threads. That is, the number of software threads assigned to the processing unit can exceed the number of “active” hardware threads the processing unit is capable of presently executing (either simultaneously or concurrently) as evidenced by the presence of context information of a thread within the register space of the processing unit.
Here, for instance, when a software thread becomes actived as a hardware thread, its context information (e.g., the values of its various operands and control information) is located within the register space 321 that is coupled to the functional unit set 303 that is executing the thread's instructions. If a decision is made to transition the thread from an active to inactive state, the context information of the thread is read out of this register space 321 and stored elsewhere (e.g., system memory 323). With the register space of the thread now being “freed up”, the context information of another “inactive” software thread whose context information resides, e.g., in system memory 232, can be written into the register space 321. As a consequence, the other thread converts from “inactive” to “active” and its instructions are executed as a hardware thread going forward.
As discussed above, the “room” for the logic circuitry to entertain a large number of hardware threads may come at the expense of maximizing the latency of any particular thread. As such, any of the mechanisms and associated logic circuitry for “speeding-up” a hardware thread's execution may not be present in the shared front end or processing unit circuitry. Such eliminated blocks may include any one or more of: 1) speculation logic (e.g., branch prediction logic); 2) out-of-order execution logic (e.g., register renaming logic and/or a re-order buffer and/or data dependency logic); 3) superscalar logic to dynamically effect parallel instruction issuance for a single hardware thread.
A multi-core processor built with multiple instances of the processing core architecture of
In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made thereto without departing from the broader spirit and scope of the invention as set forth in the appended claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.
Number | Name | Date | Kind |
---|---|---|---|
4943915 | Wilhelm et al. | Jul 1990 | A |
4982402 | Beaven et al. | Jan 1991 | A |
5276798 | Peaslee et al. | Jan 1994 | A |
5329615 | Peaslee et al. | Jul 1994 | A |
5371849 | Peaslee et al. | Dec 1994 | A |
5423025 | Goldman et al. | Jun 1995 | A |
5430841 | Tannenbaum et al. | Jul 1995 | A |
5550988 | Sarangdhar et al. | Aug 1996 | A |
5649230 | Lentz | Jul 1997 | A |
5890010 | Nishigami | Mar 1999 | A |
6061711 | Song et al. | May 2000 | A |
6081849 | Born et al. | Jun 2000 | A |
6105127 | Kimura et al. | Aug 2000 | A |
6148326 | Born et al. | Nov 2000 | A |
6247040 | Born et al. | Jun 2001 | B1 |
6275497 | Varma et al. | Aug 2001 | B1 |
6331857 | Hussain et al. | Dec 2001 | B1 |
6341324 | Caulk, Jr. et al. | Jan 2002 | B1 |
6397240 | Fernando | May 2002 | B1 |
6725416 | Dadurian | Apr 2004 | B2 |
6742104 | Chauvel et al. | May 2004 | B2 |
6779085 | Chauvel | Aug 2004 | B2 |
6944746 | So | Sep 2005 | B2 |
6952214 | Naegle et al. | Oct 2005 | B2 |
6957315 | Chauvel | Oct 2005 | B2 |
7065625 | Alderson | Jun 2006 | B2 |
7079147 | Wichman et al. | Jul 2006 | B2 |
7082508 | Khan et al. | Jul 2006 | B2 |
7200741 | Mine | Apr 2007 | B1 |
7209996 | Kohn et al. | Apr 2007 | B2 |
7234042 | Wilson | Jun 2007 | B1 |
7302627 | Mimar | Nov 2007 | B1 |
7370243 | Grohoski et al. | May 2008 | B1 |
7480838 | Wilkerson et al. | Jan 2009 | B1 |
7545381 | Huang et al. | Jun 2009 | B2 |
7583268 | Huang et al. | Sep 2009 | B2 |
7598958 | Kelleher | Oct 2009 | B1 |
7676649 | Rapp et al. | Mar 2010 | B2 |
7746350 | Danilak | Jun 2010 | B1 |
7765388 | Barrett | Jul 2010 | B2 |
7793080 | Shen | Sep 2010 | B2 |
7865675 | Paver et al. | Jan 2011 | B2 |
7930519 | Frank | Apr 2011 | B2 |
8020039 | Reid et al. | Sep 2011 | B2 |
8055872 | Biles et al. | Nov 2011 | B2 |
8063907 | Lippincott | Nov 2011 | B2 |
8082426 | Paltashev et al. | Dec 2011 | B2 |
8140823 | Codrescu et al. | Mar 2012 | B2 |
8141102 | Aho et al. | Mar 2012 | B2 |
8190863 | Fossum et al. | May 2012 | B2 |
8212824 | Allen et al. | Jul 2012 | B1 |
8230442 | Aho | Jul 2012 | B2 |
8281185 | Nussbaum et al. | Oct 2012 | B2 |
8345052 | Diard | Jan 2013 | B1 |
8424018 | Aho | Apr 2013 | B2 |
8683175 | Ekanadham et al. | Mar 2014 | B2 |
8776084 | Aho | Jul 2014 | B2 |
8780123 | Crow et al. | Jul 2014 | B2 |
8959311 | Akkar et al. | Feb 2015 | B2 |
9003102 | Lassa | Apr 2015 | B2 |
9003166 | Sinha et al. | Apr 2015 | B2 |
9015443 | Aho | Apr 2015 | B2 |
9053025 | Ben-Kiki | Jun 2015 | B2 |
9086813 | Zeng et al. | Jul 2015 | B2 |
9189261 | Masood | Nov 2015 | B2 |
9275491 | Bolz et al. | Mar 2016 | B2 |
9361116 | Ben-Kiki et al. | Jun 2016 | B2 |
9396020 | Ginzburg et al. | Jul 2016 | B2 |
9417873 | Ben-Kiki et al. | Aug 2016 | B2 |
9703603 | Roy et al. | Jul 2017 | B1 |
20010042210 | Blaker et al. | Nov 2001 | A1 |
20020004904 | Blaker et al. | Jan 2002 | A1 |
20020069327 | Chauvel | Jun 2002 | A1 |
20020178349 | Shibayama et al. | Nov 2002 | A1 |
20030028751 | McDonald et al. | Feb 2003 | A1 |
20030093648 | Moyer | May 2003 | A1 |
20030126416 | Marr et al. | Jul 2003 | A1 |
20030135718 | Dewitt et al. | Jul 2003 | A1 |
20030135719 | Dewitt et al. | Jul 2003 | A1 |
20030135789 | Dewitt et al. | Jul 2003 | A1 |
20030212874 | Alderson | Nov 2003 | A1 |
20040055003 | Sundaram et al. | Mar 2004 | A1 |
20040073836 | Shimada | Apr 2004 | A1 |
20040111594 | Feiste et al. | Jun 2004 | A1 |
20040160446 | Gosalia et al. | Aug 2004 | A1 |
20040215444 | Patel et al. | Oct 2004 | A1 |
20040227763 | Wichman et al. | Nov 2004 | A1 |
20040268071 | Khan et al. | Dec 2004 | A1 |
20050149937 | Pilkington | Jul 2005 | A1 |
20050166038 | Wang et al. | Jul 2005 | A1 |
20050257186 | Zilbershlag | Nov 2005 | A1 |
20060050077 | D'Amora et al. | Mar 2006 | A1 |
20060095721 | Biles et al. | May 2006 | A1 |
20060095807 | Grochowski et al. | May 2006 | A1 |
20060200802 | Mott et al. | Sep 2006 | A1 |
20060288193 | Hsu | Dec 2006 | A1 |
20070050594 | Augsburg et al. | Mar 2007 | A1 |
20070103476 | Huang et al. | May 2007 | A1 |
20070226464 | Chaudhry et al. | Sep 2007 | A1 |
20080005546 | Wang et al. | Jan 2008 | A1 |
20080052532 | Akkar et al. | Feb 2008 | A1 |
20080104425 | Gunther et al. | May 2008 | A1 |
20080222383 | Spracklen et al. | Sep 2008 | A1 |
20090019264 | Correale et al. | Jan 2009 | A1 |
20090024836 | Shen et al. | Jan 2009 | A1 |
20090141034 | Pryor et al. | Jun 2009 | A1 |
20090144519 | Codrescu et al. | Jun 2009 | A1 |
20090150620 | Paver et al. | Jun 2009 | A1 |
20090150722 | Reid et al. | Jun 2009 | A1 |
20090198966 | Gschwind et al. | Aug 2009 | A1 |
20090216958 | Biles et al. | Aug 2009 | A1 |
20090254907 | Neary | Oct 2009 | A1 |
20090259996 | Grover et al. | Oct 2009 | A1 |
20090309884 | Lippincott et al. | Dec 2009 | A1 |
20100058356 | Aho et al. | Mar 2010 | A1 |
20100153686 | Frank | Jun 2010 | A1 |
20100274972 | Babayan et al. | Oct 2010 | A1 |
20100332901 | Nussbaum et al. | Dec 2010 | A1 |
20110040924 | Selinger | Feb 2011 | A1 |
20110047533 | Gschwind | Feb 2011 | A1 |
20110072234 | Chinya et al. | Mar 2011 | A1 |
20110093637 | Gupta et al. | Apr 2011 | A1 |
20110145778 | Chen | Jun 2011 | A1 |
20110271059 | Aho et al. | Nov 2011 | A1 |
20120023314 | Crum | Jan 2012 | A1 |
20120036339 | Frazier et al. | Feb 2012 | A1 |
20120124588 | Sinha et al. | May 2012 | A1 |
20120131309 | Johnson et al. | May 2012 | A1 |
20120139926 | Clohset et al. | Jun 2012 | A1 |
20120155777 | Schweiger et al. | Jun 2012 | A1 |
20120159090 | Andrews et al. | Jun 2012 | A1 |
20120166777 | McLellan et al. | Jun 2012 | A1 |
20120239904 | Ekanadham et al. | Sep 2012 | A1 |
20120311360 | Balasubramanian et al. | Dec 2012 | A1 |
20120331310 | Burns et al. | Dec 2012 | A1 |
20130054871 | Lassa | Feb 2013 | A1 |
20130159630 | Lichmanov | Jun 2013 | A1 |
20130167154 | Peng et al. | Jun 2013 | A1 |
20130179884 | Masood | Jul 2013 | A1 |
20130205119 | Rajwar et al. | Aug 2013 | A1 |
20130332937 | Gaster | Dec 2013 | A1 |
20140025822 | Guha et al. | Jan 2014 | A1 |
20140176569 | Meixner | Jun 2014 | A1 |
20140189317 | Ben-Kiki et al. | Jul 2014 | A1 |
20140189333 | Ben-Kiki | Jul 2014 | A1 |
20140282580 | Zeng et al. | Sep 2014 | A1 |
20140331236 | Mitra et al. | Nov 2014 | A1 |
20140344815 | Ginzburg et al. | Nov 2014 | A1 |
20150317161 | Murphy | Nov 2015 | A1 |
20160246597 | Ben-Kiki et al. | Aug 2016 | A1 |
20160335090 | Weissmann et al. | Nov 2016 | A1 |
20160342419 | Ben-Kiki et al. | Nov 2016 | A1 |
20170017491 | Ben-Kiki et al. | Jan 2017 | A1 |
20170017492 | Ben-Kiki et al. | Jan 2017 | A1 |
20170109281 | Weissmann et al. | Apr 2017 | A1 |
20170109294 | Weissmann et al. | Apr 2017 | A1 |
20170153984 | Weissmann et al. | Jun 2017 | A1 |
Number | Date | Country |
---|---|---|
1164704 | Nov 1997 | CN |
1608246 | Apr 2005 | CN |
1981280 | Jun 2007 | CN |
101083525 | Dec 2007 | CN |
101452423 | Jun 2009 | CN |
101667138 | Mar 2010 | CN |
101855614 | Oct 2010 | CN |
102270166 | Dec 2011 | CN |
102314671 | Jan 2012 | CN |
102567556 | Jul 2012 | CN |
102741806 | Oct 2012 | CN |
102741826 | Oct 2012 | CN |
2013147887 | Oct 2013 | WO |
2014105128 | Jul 2014 | WO |
2014105152 | Jul 2014 | WO |
Entry |
---|
PCT/US2013/048694 Notification of Transmittal of the International Search Report and the Written Opinion of the International Searching Authority, mailed Sep. 30, 2013, 9 pages. |
PCT/US2013/048694 Notification Concerning Transmittal of International Preliminary Report on Patentability, mailed Jul. 9, 2015, 6 pages. |
Office action from U.S. Appl. No. 13/730,055, mailed Jul. 27, 2015, 11 pages. |
Notice of Allowance from U.S. Appl. No. 13/730,055, mailed Feb. 25, 2016, 5 pages. |
Notice of Allowance from U.S. Appl. No. 13/730,055, mailed Apr. 15, 2016, 8 pages. |
PCT/US2013/046166 Notification of Transmittal of the International Search Report and the Written Opinion of the International Searching Authority, mailed Sep. 2, 2013, 9 pages. |
PCT/US2013/046166 Notification Concerning Transmittal of International Preliminary Report on Patentability, mailed Jul. 9, 2015, 6 pages. |
Final Office Action from U.S. Appl. No. 13/729,915, mailed Dec. 2, 2015, 24 pages. |
Final Office Action from U.S. Appl. No. 13/730,143, mailed May 4, 2016, 18 pages. |
Final Office Action from U.S. Appl. No. 13/730,971, mailed Jul. 29, 2015, 14 pages. |
IBM Technical Disclosure Bulletin NN84102905 “Multiprocessor System to Improve Context Switching,” Oct. 1984, 4 pages. |
International Preliminary Report on Patentability for Application No. PCT/US2013/048339, mailed Jun. 30, 2015, 6 pages. |
International Preliminary Report on Patentability for International Application No. PCT/US2013/046863, mailed Jul. 9, 2015, 6 pages. |
International Preliminary Report on Patentability for International Application No. PCT/US2013/046911, mailed Jul. 9, 2015, 7 pages. |
International Preliminary Report on Patentability for International Application No. PCT/US2013/047387, mailed Jun. 30, 2015, 5 pages. |
International Search Report and Written Opinion for Application No. PCT/US2013/046863, mailed Aug. 28, 2013, 9 pages. |
International Search Report and Written Opinion for Application No. PCT/US2013/046911, mailed Sep. 27, 2013, 10 pages. |
International Search Report and Written Opinion for Application No. PCT/US2013/047387, mailed Sep. 2, 2013, 9 pages. |
International Search Report and Written Opinion for Application No. PCT/US2013/048339, mailed Nov. 1, 2013, 10 pages. |
Jesshope C., et al., “Making Multi-cores Mainstream—from Security to Scalability,” Parallel computing: From multicores and GPU's to petascale, 2010, 16 pages. |
Lankamp M., “Developing a Reference Implementation for a Microgrid of Microthreaded Microprocessors,” Diss. Master's thesis, University of Amsterdam, Amsterdam, the Netherlands, 2007, 57 pages. |
Li E., et al., “Accelerating Video-Mining Applications Using Many Small, General-Purpose Cores,” IEEE, 2008, 2 pages. |
Li T., et al., “Operating system support for overlapping-ISA heterogeneous multi-core architectures,” HPCA-16 2010 The Sixteenth International Symposium on High-Performance Computer Architecture. IEEE, 2010, 12 pages. |
Lyons M., et al., “PowerPC storage model and AIX programming,” Published: 2005, 15 pages. |
Navada S.S., “A Unified View of Core Selection and Application Steering in Heterogeneous Chip Multiprocessors”, Abstract, Chapters 1, 3, and 6 Jun. 15, 2012, 114 pgs. |
Non-Final Office Action from U.S. Appl. No. 13/729,915, mailed Jul. 23, 2015, 22 pages. |
Non-Final Office Action from U.S. Appl. No. 13/729,931, mailed Oct. 3, 2014, 13 pages. |
Non-Final Office Action from U.S. Appl. No. 13/730,143, mailed Aug. 11, 2015, 13 pages. |
Non-Final Office Action from U.S. Appl. No. 13/730,971, mailed Jan. 6, 2015, 13 pages. |
Non-Final Office Action from U.S. Appl. No. 13/730,971, mailed Mar. 29, 2016, 15 pages. |
Non-final Office Action from U.S. Appl. No. 15/145,748, mailed Oct. 3, 2016, 22 pages. |
Non-Final Office Action from U.S. Appl. No. 15/226,875, mailed Oct. 4, 2016, 25 pages. |
Non-Final Office Action from U.S. Appl. No. 15/281,944, mailed Nov. 25, 2016, 23 pages. |
Non-Final Office Action from U.S. Appl. No. 15/282,082, mailed Nov. 28, 2016, 26 pages. |
Notice of Allowance for foreign counterpart Korea Application No. 10-2015-7012995, mailed Jul. 20, 2016, 3 pages. |
Notice of Allowance for foreign counterpart Korean Application No. 10-2015-7012861, mailed Mar. 29, 2016, 3 pages. |
Notice of Allowance from U.S. Appl. No. 13/729,915, mailed Feb. 17, 2016, 10 pages. |
Notice of Allowance from U.S. Appl. No. 13/729,931, mailed Feb. 2, 2015, 5 pages. |
Notice of Allowance from U.S. Appl. No. 13/730,971, mailed Oct. 7, 2016, 17 pages. |
Notice of Preliminary Rejection for foreign counterpart Korean Application No. 10-2015-7012995, mailed Feb. 22, 2016, 12 pages. |
Office action for foreign counterpart Korean Application No. 10-2015-7012861, mailed Dec. 11, 2015, 5 pages. |
Office Action from Foreign Counterpart Chinese Patent Application No. 201380059874.8, mailed Sep. 5, 2016, 17 pages. |
Office Action from foreign counterpart Korean Patent Application No. 1020167029367, mailed Oct. 27, 2016, 6 pages. |
Raphael Poss et al., “Apple-CORE: Microgrids of SVP cores Flexible, general-purpose, fine-grained hardware concurrency management,” Published: Sep. 5, 2012. 8 pgs. |
Sykora J., et al., “Microthreading as a Novel Method for Close Coupling of Custom Hardware Accelerators to SVP Processors,” Published: 2011, 8 pgs. |
Taylor I.L., “64-bit PowerPC ELF Application Binary Interface Supplement 1.9”, Chapters 1 and 3-3.2.5 Published: 2004. |
Thiago Madeira's blog, Architectures and ABIs detailed , Published: Jan. 2012, 15 pages. |
Unifying Software and Hardware of Multithreaded Reconfigurable Applications Within Operating System Processes by Miljan Vuletic pp. i, iii, vii-ix, 1-44, 61-99, and 125-136 Published: 2006. |
Vo H., “Hardware Support for Irregular Control Flow in Vector Processor,” Published: May 7, 2012, 10 pages. |
Choi J., et al., “Impact of Cache Architecture and Interface on Performance and Area of FPGA-Based Processor/Parallel-Accelerator Systems,” IEEE 20th International Symposium on Field-Programmable Custom Computing Machines, 2012, pp. 17-24. |
Decision to Grant a Patent from foreign counterpart Korean Patent Application No. 10-2016-7029367, dated Apr. 26, 2017, 4 pages. |
Final Office Action from U.S. Appl. No. 13/730,143, dated Aug. 7, 2017, 16 pages. |
Final Office Action from U.S. Appl. No. 15/145,748, dated Mar. 29, 2017, 41 pages. |
Final Office Action from U.S. Appl. No. 15/226,875, dated May 17, 2017, 36 pages. |
Final Office Action from U.S. Appl. No. 15/281,944, dated May 10, 2017, 42 pages. |
Final Office Action from U.S. Appl. No. 15/282,082, dated Apr. 24, 2017, 38 pages. |
Grant of Patent from foreign counterpart Korean Patent Application No. 10-2016-7029366, dated Jul. 31, 2017, 4 pages. |
Gschwind M., “Optimizing Data Sharing and Address Translation for the Cell BE Heterogeneous Chip Multiprocessor,” IEEE, 2008, pp. 478-485. |
Intel 915G/915GV/91 OGL Express Chipset Graphics and Memory Controller Hub (GMCH)—White Paper' Sep. 2004 by Intel, 9 pages. |
Non-Final Office Action from U.S. Appl. No. 13/730,143, dated Apr. 17, 2017, 36 pages. |
Non-Final Office Action from U.S. Appl. No. 15/282,082 dated Aug. 2, 2017, 32 pages. |
Notice of Allowance from U.S. Appl. No. 13/730,971, dated Jul. 28, 2017, 21 pages. |
Notice of Preliminary Rejection from foreign counterpart Korean Patent Application No. KR1020167029366, dated Jan. 19, 2017, 13 pages. |
Office Action from foreign counterpart China Patent Application No. 201380059874.8, dated May 16, 2017, 3 pages. |
Office Action from foreign counterpart Chinese Patent Application No. 201380059888, dated Feb. 7, 2017, 18 pages. |
First Office Action and Search Report from foreign counterpart China Patent Application No. 201380059899.8, dated Sep. 1, 2017, 10 pages. |
First Office Action and Search Report from foreign counterpart Chinese Patent Application No. 201380059921.9, dated Sep. 20, 2017, 10 pages. (Translation available only for office action). |
Non-final Office Action from U.S. Appl. No. 15/145,748, dated Sep. 22, 2017, 30 pages. |
Non-Final Office Action from U.S. Appl. No. 15/226,875, dated Sep. 12, 2017, 19 pages. |
Notice on Grant of Patent Right for Invention from foreign counterpart Chinese Patent Application No. 201380059874.8, dated Sep. 12, 2017, 4 pages. |
Notice of Allowance from U.S. Appl. No. 15/281,944, dated Dec. 8, 2017, 28 pages. |
Notice of Allowance from U.S. Appl. No. 15/282,082, dated Dec. 20, 2017, 14 pages. |
Second Office Action from foreign counterpart Chinese Patent Application No. 201380059888, dated Oct. 16, 2017, 15 pages. |
Second Office Action from foreign counterpart Chinese Patent Application No. 201380059921.9, dated Mar. 20, 2018, 7 pages. |
First Office Action and Search Report from foreign counterpart Chinese Patent Application No. 201611088838.6, dated Apr. 28, 2018, 24 pages. |
Notice of Allowance from U.S. Appl. No. 15/145,748, dated Jun. 6, 2018, 15 pages. |
Notice of Allowance from U.S. Appl. No. 13/730,971, dated Mar. 27, 2018, 27 pages. |
Notice of Allowance from U.S. Appl. No. 15/145,748, dated Mar. 12, 2018, 14 pages. |
Notice of Allowance from U.S. Appl. No. 15/281,944, dated Mar. 26, 2018, 18 pages. |
Notice of Allowance from U.S. Appl. No. 15/282,082, dated Mar. 30, 2018, 14 pages. |
Notice on Grant of Patent Right for Invention from foreign counterpart Chinese Patent Application No. 201380059899.8, dated May 4, 2018, 4 pages. |
Request for Amendment from foreign counterpart Korean Patent Application No. KR1020157012861, dated May 27, 2015, 5 pages. |
Abandonment from U.S. Appl. No. 13/730,143, dated Apr. 24, 2018, 3 pages. |
Corrected Notice of Allowability from U.S. Appl. No. 15/145,748, dated Jul. 11, 2018, 9 pages. |
Final Office Action from U.S. Appl. No. 15/226,875, dated Jun. 29, 2018, 26 pages. |
Notice of Allowance from U.S. Appl. No. 15/281,944, dated Jun. 27, 2018, 9 pages. |
Third Office Action from foreign counterpart Chinese Patent Application No. 201380059888.0, dated Jul. 3, 2018, 17 pages. |
Number | Date | Country | |
---|---|---|---|
20140189300 A1 | Jul 2014 | US |