Inter-virtual machine time profiling of I/O transactions

Description

BACKGROUND OF THE INVENTION

Virtual machines allow multiple operating systems to be run simultaneously on the same computer hardware. This allows the sharing of the underlying physical machine resources (e.g., memory, I/O, etc.) between multiple operating systems (or instances of the same operating system). Virtual machines facilitate application provisioning, maintenance, high availability, and disaster recovery. The software layer providing the virtualization is typically called a virtual machine monitor or hypervisor. A hypervisor may run on bare hardware, or on top of an operating system.

SUMMARY OF THE INVENTION

An embodiment of the invention may therefore comprise a method of providing a common timing reference value, comprising: in response to a timer hardware interrupt processed by a first virtual machine, writing a timer value to a shared memory location, the timer value based on a kernel timing parameter maintained by an operating system of said first virtual machine; and, reading, by a second virtual machine, said shared timer value from said shared memory location.

An embodiment of the invention may therefore further comprise a method of profiling the timing of an I/O request, comprising: reading, by a first virtual machine, a shared memory location containing a first timer value written by a second virtual machine; embedding said first timer value in an I/O request sent to a hypervisor, said I/O request causing an event to be processed by a second virtual machine; and, writing, by said second virtual machine, a second timer value in response to a hardware timer interrupt.

An embodiment of the invention may therefore further comprise a computer readable medium having instructions stored thereon for profiling an I/O transaction that, when executed by a computer, at least instruct the computer to: store a plurality of kernel timing parameter values maintained by a first virtual machine into a shared memory location; read, by a second virtual machine, a first of said plurality of kernel timing parameters associated with said I/O transaction; read, by a hypervisor, a second of said plurality of kernel timing parameters associated with said I/O transaction; and, read, by said first virtual machine, a third of said plurality of kernel timing parameters associated with said I/O transaction.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of multiple virtual machines running on a computer.

FIG. 2 is a flow diagram of providing a common timing reference.

FIG. 3 is a diagram of I/O delays associated with multiple virtual machines.

FIG. 4 is a flowchart of a method of providing a common timing reference value.

FIG. 5 is a flowchart of a method of profiling the timing of an I/O request.

FIG. 6 is a block diagram of a computer system.

DETAILED DESCRIPTION OF THE EMBODIMENTS

FIG. 1 is a block diagram of multiple virtual machines running on a computer. In FIG. 1, virtual machine system 100 comprises software running on computer 101. Computer 101 includes shared memory 110. Shared memory 110 stores a timer value 112. Software running on computer 100 comprises operating system #1 (OS #1) 120, operating system #2 (OS #2) 130, and hypervisor 140. OS #1120 includes I/O driver 122. OS #2130 includes I/O driver 132 and timer interrupt service routine (timer ISR) 134. Shared memory 110 may be an area of computer 101's memory which is accessible from both OS #1120 and OS #2130.

Hypervisor 140 is operatively coupled to OS #1120 and OS #2130. OS #1 driver 122 and OS #2 driver 132 are operatively coupled to receive timer value 112 from shared memory 112. Because OS #1 driver 122 and OS #2 driver 132 are part of OS #1120 and OS #2130, respectively, OS #1120 and OS #2130 are also operatively coupled to receive (or read) timer value 112 from shared memory 110. Timer ISR 134 is operatively coupled to send (or write) a timer value 112 to shared memory 110. Because OS #2 driver 132 is part of OS #2130, OS #2130 is also operatively coupled to send (or write) timer value 112 to shared memory 110.

In an embodiment, OS #1120 and OS #2130 are running as virtual machines under the supervision of hypervisor 140. OS #1120 and OS #2130 may be any guest operating systems compatible with hypervisor 140. For example, OS #1120 and/or OS #2130 may be Windows, Apple, UNIX, Linux, or FreeBSD based operating systems. In an embodiment, OS #2 driver 132 may implement RAID functionality.

OS #2130 may be configured to respond to I/O requests sent by OS #1120 via hypervisor 140. Computer 101 generates hardware timer interrupts that are processed by timer ISR 134 of OS #2130. When timer ISR 134 processes a hardware timer interrupt, it may from time-to-time (or each time) write a new timer value 112 to shared memory 110. This timer value may be based on a kernel timing parameter maintained by OS #2130.

In an embodiment, OS #1120 (or OS #1 driver 122) may read shared memory 110 to obtain timer value 112. OS #1120 may read timer value 112 before it dispatches an I/O request to hypervisor 140. OS #1120 may also read timer value 112 after it receives a response from OS #2130 associated with the I/O request. By comparing or subtrancting the second timer value 112 with a previous timer value 112, OS #1120 may determine an elapsed time. This elapsed time may correspond a processing time for the I/O request. Likewise, hypervisor 140, OS #2130, or any application, driver, or debug routine running on computer 101 may read timer value 112 in order to time or profile the processing of I/O requests.

FIG. 2 is a flow diagram of providing a common timing reference. In FIG. 2, OS #2130 periodically writes a timer value 112 to shared memory 110. When OS #1120 is ready to make an I/O request (e.g., by generating an I/O request event), OS #1120 reads a 1^sttimer value from shared memory 110. OS #1120 may store the 1^sttimer value. OS #1120, and OS #1 driver 122 in particular, then process the I/O request and generates a hypervisor I/O request event. During this process, OS #2130 writes a 2^ndtimer value to shared memory 110. Before sending the hypervisor I/O request, OS #2130 may read the 2^ndtimer value from shared memory 110. OS #1120 may store the 2^ndtimer value before sending the I/O request event to hypervisor 140. As an alternative, OS #1120 may embed the 2^ndtimer value in the I/O request event.

At some point after OS #1120 read the 2^ndtimer value, OS #2130 may write a 3^rdtimer value to shared memory 110. In response to the I/O request event, hypervisor 140 may read the 3^rdtimer value from shared memory 110. Hypervisor 140 may store this 3^rdtimer value. Also in response to the I/O request event, hypervisor 140 may send an I/O request event to OS #2130. Hypervisor 140 may embed the 3^rdtimer value in the I/O request.

At some point after hypervisor 140 read the 3^rdtimer value, OS #2130 may write a 4^thtimer value to shared memory 110. In response to the I/O request event, OS #2130 may read the 4^thtimer value from shared memory 110. OS #2130 may store this 4^thtimer value. Also in response to the I/O request event, OS #2130 may send an I/O completion event to hypervisor 140. OS #2130 may embed the 4^thtimer value in the I/O completion event.

At some point after OS #2130 read the 4^thtimer value, OS #2130 may write a 5^thtimer value to shared memory 110. In response to the I/O completion event, hypervisor 140 may read the 5^thtimer value from shared memory 110. Hypervisor 140 may store this 5^thtimer value. Also in response to the I/O completion event, hypervisor 140 may send an I/O completion event to OS #1120. Hypervisor 140 may embed the 5^thtimer value in the I/O completion event.

At some point after hypervisor 140 read the 5^thtimer value, OS #2130 may write a 6^thtimer value to shared memory 110. In response to the I/O completion event, OS #1120 may read the 6^thtimer value from shared memory 110. OS #1120, OS #2130, and/or hypervisor 140 may compare (or subtract) any of the 1^st-6^thtimer values with any of the other timer values to determine an elapsed time (or delay) associated with the processing etc. of the I/O request event and/or the I/O completion event. This information can be used to profile execution times and/or performance of OS #1120, OS #1 driver 122, OS #2130, OS #2 driver 132, and/or hypervisor 140.

FIG. 3 is a diagram of I/O delays associated with multiple virtual machines. FIG. 3 illustrates the delays T1-T5 shown in FIG. 2. T1 is a first delay associated with OS #1120 and OS #1 driver 122 processing an I/O request and generating a hypervisor I/O request event. T1 may correspond to the difference between the 1^sttime value and the 2^ndtime value discussed previously. T2 is a second delay associated with hypervisor 140 receiving the hypervisor I/O request event and generating an I/O request event for OS #2130. T2 may correspond to the difference between the 2^ndtime value and the 3^rdtime value.

T3 is a third delay associated with OS #2130, and OS #2 driver 132 in particular, processing the I/O request, performing the requested actions, and generating a hypervisor completion event. T3 may correspond to the difference between the 3^rdtime value and the 4^thtime value. T4 is a fourth delay associated with hypervisor 140 receiving the I/O completion event and generating an I/O completion event for OS #1120. T4 may correspond to the difference between the 4^thtime value and the 5^thtime value. T5 is a fifth delay associated with OS #1120, and OS #1 driver 122 in particular, processing the hypervisor I/O completion event (interrupt) and completing I/O processing. T5 may correspond to the difference between the 5^thtime value and the 6^thtime value.

FIG. 4 is a flowchart of a method of providing a common timing reference value. The step illustrated in FIG. 4 may be performed by one or more elements of virtual machine system 100. A hardware time interrupt is received by a first virtual machine (402). For example, OS #2130 timer ISR 134 may be called in response to a hardware timer interrupt. A timer value is written to a shared memory location (404). The timer value may be based on a kernel timing parameter. For example, timer ISR 134 may write a kernel timing parameter, such as ticks, to shared memory 110. A second virtual machine reads the timer value from the shared memory location (406). For example, OS #1120 may read the timer value stored by OS #2130's timer ISR 134 from shared memory 110. The timer value is compared with a previous timer value to determine and elapsed time (408). For example, an application running on OS #1120, or OS #2130, may compare timer values stored by OS #2130's timer ISR 134 to each other (or subtract them) in order to determine an elapsed time. Examples of elapsed times that may be determined from the values read from shared memory 110, which represent delays across multiple virtual machines and the hypervisor 140, are delays T1-T5 discussed previously.

FIG. 5 is a flowchart of a method of profiling the timing of an I/O request. The steps illustrated in FIG. 5 may be performed by one or more elements of virtual machine system 100. A first virtual machine reads a shared memory location containing a first timer value written by a second virtual machine (502). For example, OS #1120 may read timer value 112 from shared memory 110 after it was written by timer ISR 134 of OS #2130. The first timer value may be optionally embedded in an I/O request sent to a hypervisor (504). For example, the timer value 112 read by OS #1120 may be embedded in a field of an I/O request OS #1120 sends to hypervisor 140.

A second virtual machine processes an event associated with the I/O request (506). For example, OS #2130 may process an interrupt or I/O event from hypervisor 140 that hypervisor 140 generated in response to the I/O request sent in block 504. In response to a hardware timer interrupt, the second virtual machine writes a second timer value to the shared memory location (508). For example, timer ISR 134 of OS #2130 may be called in response to a hardware timer interrupt from computer 101. After being called, timer ISR 134 may write a new timer value 112 to shared memory 110. This new timer value may be based on a kernel timing parameter.

A third timer value written to the shared memory location by the second virtual machine may be read (510). For example, OS #1120, OS #2130, hypervisor 140, or some other application running on computer 101 may read timer value 112 from shared memory 110. As timer value 112 is constantly being updated by OS #2130's timer ISR 134, the third value may be different that the first and second timer values, above.

The systems, software, operating systems, hypervisors, and functions described above may be implemented with or executed by one or more computer systems. The methods described above may be stored on a computer readable medium. Many of the elements of virtual machine system 100 may be, comprise, or include computers systems. This includes, but is not limited to computer 101.

FIG. 6 illustrates a block diagram of a computer system. Computer system 600 includes communication interface 620, processing system 630, storage system 640, and user interface 660. Processing system 630 is operatively coupled to storage system 640. Storage system 640 stores software 650 and data 670. Processing system 630 is operatively coupled to communication interface 620 and user interface 660. Computer system 600 may comprise a programmed general-purpose computer. Computer system 600 may include a microprocessor. Computer system 600 may comprise programmable or special purpose circuitry. Computer system 600 may be distributed among multiple devices, processors, storage, and/or interfaces that together comprise elements 620-670.

Communication interface 620 may comprise a network interface, modem, port, bus, link, transceiver, or other communication device. Communication interface 620 may be distributed among multiple communication devices. Processing system 630 may comprise a microprocessor, microcontroller, logic circuit, or other processing device. Processing system 630 may be distributed among multiple processing devices. User interface 660 may comprise a keyboard, mouse, voice recognition interface, microphone and speakers, graphical display, touch screen, or other type of user interface device. User interface 660 may be distributed among multiple interface devices. Storage system 640 may comprise a disk, tape, integrated circuit, RAM, ROM, network storage, server, or other memory function. Storage system 640 may be a computer readable medium. Storage system 640 may be distributed among multiple memory devices.

Processing system 630 retrieves and executes software 650 from storage system 640. Processing system may retrieve and store data 670. Processing system may also retrieve and store data via communication interface 620. Processing system 650 may create or modify software 650 or data 670 to achieve a tangible result. Processing system may control communication interface 620 or user interface 670 to achieve a tangible result. Processing system may retrieve and execute remotely stored software via communication interface 620.

Software 650 and remotely stored software may comprise an operating system, utilities, drivers, networking software, and other software typically executed by a computer system. Software 650 may comprise an application program, applet, firmware, or other form of machine-readable processing instructions typically executed by a computer system. When executed by processing system 630, software 650 or remotely stored software may direct computer system 600 to operate as described herein.

The foregoing description of the invention has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed, and other modifications and variations may be possible in light of the above teachings. The embodiment was chosen and described in order to best explain the principles of the invention and its practical application to thereby enable others skilled in the art to best utilize the invention in various embodiments and various modifications as are suited to the particular use contemplated. It is intended that the appended claims be construed to include other alternative embodiments of the invention except insofar as limited by the prior art.

Claims

1. A method of providing a common timing reference value, comprising: in response to a timer hardware interrupt processed by a first virtual machine, writing a timer value to a shared memory location, said timer value based on a kernel timing parameter maintained by an operating system of said first virtual machine;reading, by a second virtual machine, said shared timer value from said shared memory location; and,comparing, by said second virtual machine, said shared timer value from said shared memory location with a previous value of said shared timer value from said shared memory location to determine an elapsed time, said previous value of said shared timer value is provided to said second virtual machine as part of an I/O request and said elapsed time corresponding to a processing time for the I/O request.
2. The method of claim 1, wherein said previous value of said shared timer value is provided to said second virtual machine as part of a header of the I/O request.
3. The method of claim 1, wherein said kernel timing parameter is a kernel ticks value.
4. A method of profiling the timing of an I/O request, comprising: reading, by a first virtual machine, a shared memory location containing a first timer value written by a second virtual machine;embedding said first timer value in an I/O request sent to a hypervisor, said I/O request causing an event to be processed by a second virtual machine;writing, by said second virtual machine, a second timer value in response to a hardware timer interrupt; and,comparing, by said second virtual machine, said first timer value from said I/O request with said second timer value from said shared memory location to determine an elapsed time, said elapsed time corresponding to a processing time for the I/O request.
5. The method of claim 4, further comprising: reading, by a said hypervisor, said shared memory location containing a third timer value written by said second virtual machine.
6. The method of claim 5, further comprising: in response to receiving a response to said I/O request from said second virtual machine, reading, by said first virtual machine, said shared memory location containing a third timer value written by said second virtual machine.
7. A non-transitory computer readable medium having instructions stored thereon for profiling an I/O transaction that, when executed by a computer, at least instruct the computer to: store a plurality of kemel timing parameter values maintained by a first virtual machine into a shared memory location;read, by a second virtual machine, a first of said plurality of kernel timing parameters associated with said I/O transaction;read, by a hypervisor, a second of said plurality of kernel timing parameters associated with said I/O transaction;read, by said first virtual machine, a third of said plurality of kernel timing parameters associated with said I/O transaction; and,determine a delay associated with said hypervisor based on said first of said plurality of kernel timing parameters and said second of said plurality of kernel timing parameters, said delay associated with said hypervisor and corresponding to a processing time by said hypervisor for the I/O request.
8. The computer readable medium of claim 7, wherein the computer is further instructed to: read, by said first virtual machine, a fourth of said plurality of kernel timing parameters associated with a response to said I/O transaction.
9. The computer readable medium of claim 8, wherein the computer is further instructed to: read, by said hypervisor, a fifth of said plurality of kernel timing parameters associated with said response to said I/O transaction.
10. The computer readable medium of claim 9, wherein the computer is further instructed to: read, by said second virtual machine, a sixth of said plurality of kernel timing parameters associated with said response to said I/O transaction.
11. The computer readable medium of claim 7, wherein the computer is further instructed to: determine a delay associated with said hypervisor based on said first of said plurality of kernel timing parameters and said third of said plurality of kernel timing parameters.
12. The computer readable medium of claim 9, wherein the computer is further instructed to: determine a delay associated with said first virtual machine based on said fourth of said plurality of kernel timing parameters and said fifth of said plurality of kernel timing parameters.

US Referenced Citations (35)

Number	Name	Date	Kind
7299468	Casey et al.	Nov 2007	B2
7328437	Donovan et al.	Feb 2008	B2
7668177	Trapp et al.	Feb 2010	B1
7784053	Casey et al.	Aug 2010	B2
7831977	Shultz et al.	Nov 2010	B2
7840962	Neiger et al.	Nov 2010	B2
7895597	Hartikainen	Feb 2011	B2
7913009	Vega et al.	Mar 2011	B2
7917677	Johnson et al.	Mar 2011	B2
7917903	Lumb et al.	Mar 2011	B2
8146078	Bennett et al.	Mar 2012	B2
8151265	Ben-Yehuda et al.	Apr 2012	B2
8209681	Turner et al.	Jun 2012	B1
20020161961	Hardin et al.	Oct 2002	A1
20030101440	Hardin et al.	May 2003	A1
20040194095	Lumb et al.	Sep 2004	A1
20040268348	Waki et al.	Dec 2004	A1
20070079022	Carlson et al.	Apr 2007	A1
20080104589	McCrory et al.	May 2008	A1
20080222632	Ueno et al.	Sep 2008	A1
20090319256	Chow et al.	Dec 2009	A1
20090320009	Chow et al.	Dec 2009	A1
20100077394	Wang et al.	Mar 2010	A1
20100082321	Cherkasova et al.	Apr 2010	A1
20100082855	Accapadi et al.	Apr 2010	A1
20100082995	Dees et al.	Apr 2010	A1
20100162242	Grouzdev	Jun 2010	A1
20100235557	Guo et al.	Sep 2010	A1
20100299673	Shultz et al.	Nov 2010	A1
20110113208	Jouppi et al.	May 2011	A1
20110131335	Spaltro et al.	Jun 2011	A1
20110197191	Malloy et al.	Aug 2011	A1
20120042061	Ayala et al.	Feb 2012	A1
20120084780	Pasternak	Apr 2012	A1
20120087319	Raleigh et al.	Apr 2012	A1

Non-Patent Literature Citations (1)

Entry
Understanding the Linux Kernel, Bovet and Cesati, O'Reilly Media; Third Edition edition (Nov. 2005).

Related Publications (1)

	Number	Date	Country
	20120096205 A1	Apr 2012	US

Inter-virtual machine time profiling of I/O transactions

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

US Classifications

Field of Search

US

International Classifications

Term Extension

Abstract

Description

Claims

US Referenced Citations (35)

Non-Patent Literature Citations (1)

Related Publications (1)