This application claims priority under 35 U.S.C. §119 to Korean Patent Application No. 10-2014-0185988, filed Dec. 22, 2014, in the Korean Intellectual Property Office, the entire content of which is incorporated herein by reference.
The present disclosure relates to an input/output method in a virtual machine environment.
An input/output (I/O) operation generated by a guest operating system (OS), which is driven by a virtual machine (VM), may be sent to a host OS in an interrupt manner, and the host OS may perform the input/output operation using various input/output devices. When the VM emulates an input/output event generated by the guest OS in an interrupt manner and sends the input/output event to the host OS, it can increase overhead due to context switching and result in resource waste and performance degradation of a computing device in which this virtual environment has been established. To overcome these and other shortcomings, it may be beneficial to provide an improved input/output processing method for the guest OS and the VM in a high-speed storage environment (e.g., a solid state drive (SSD) environment) that requires low latency.
In one exemplary embodiment, the present disclosure is directed to a method, comprising: performing, by a virtual machine (VM) executing on a computing device, request polling (RP) to detect an input/output event generated by a first operating system (OS), wherein the first OS is driven by the VM; sending, by the VM, an input/output request message to a second OS when the input/output event is detected through the RP, wherein the VM is executed on the second OS; performing, by the VM, response waiting polling (RWP) to detect an input/output completion event generated by the second OS; and sending, by the VM, an input/output response message to the first OS when the input/output completion event is detected through the RWP, wherein the RP and the RWP are performed by multiple threads executed on the VM.
In another exemplary embodiment, the present disclosure is directed to a method, comprising: receiving, by a virtual machine (VM) executing on a computing device, an input/output event notification generated by a first OS using a request polling (RP) thread that is executed on the VM, wherein the first OS is driven by the VM; sending, by the VM to a second OS, an input/output request message that is based on the input/output event notification, wherein the VM is executed on the second OS; receiving, by the VM, an input/output completion event notification generated by the second OS using a response waiting polling (RWP) thread that is executed on the VM; and sending, by the VM to the first OS, an input/output response notification in response to a completion polling (CP) thread that is executed on the first OS, wherein the input/output response notification is based on the input/output completion event notification.
In another exemplary embodiment, the present disclosure is directed to a method, comprising: polling, by a virtual machine (VM) executing on a computing device, a first operating system (OS) to detect an input/output event generated by the first OS, wherein the first OS is driven by the VM; sending, when the input/output event is detected by the VM, an input/output request message to a second OS, wherein the VM is executed on the second OS; polling, by the VM, the second OS to detect an input/output completion event generated by the second OS; generating, by the VM, an input/output completion notification based on the input/output completion event; and sending, to the first OS, the input/output completion notification.
The above and other aspects and features of the present disclosure will become more apparent by describing in detail exemplary embodiments thereof with reference to the attached drawings, in which:
Embodiments will be described in detail with reference to the accompanying drawings. The inventive concept and features, however, may be embodied in various different forms, and should not be construed as being limited only to the illustrated embodiments. Accordingly, known processes, elements, and techniques are not described with respect to some of the disclosed embodiments. Unless otherwise noted, like reference numerals denote like elements throughout the attached drawings and written description, and thus descriptions will not be repeated. In the drawings, the sizes and relative sizes of layers and regions may be exaggerated for clarity.
It will be understood that, although the terms “first”, “second”, “third”, etc., may be used herein to describe various elements, components, regions, layers and/or sections, these elements, components, regions, layers and/or sections should not be limited by these terms. Unless indicated otherwise, these terms are only used to distinguish one element, component, region, layer or section from another element, component, region, layer or section. Thus, a first element, component, region, layer or section discussed below could be termed a second element, component, region, layer or section without departing from the teachings of the disclosure.
Spatially relative terms, such as “beneath”, “below”, “lower”, “under”, “above”, “upper” and the like, may be used herein for ease of description to describe one element or feature's relationship to another element(s) or feature(s) as illustrated in the figures. It will be understood that the spatially relative terms are intended to encompass different orientations of the device in use or operation in addition to the orientation depicted in the figures. For example, if the device in the figures is turned over, elements described as “below”, “beneath”, or “under” other elements or features would then be oriented “above” the other elements or features. Thus, the exemplary terms “below” and “under” can encompass both an orientation of above and below. The device may be otherwise oriented (rotated 90 degrees or at other orientations) and the spatially relative descriptors used herein interpreted accordingly. In addition, it will also be understood that when a layer is referred to as being “between” two layers, it can be the only layer between the two layers, or one or more intervening layers may also be present.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting. As used herein, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items. Also, the term “exemplary” is intended to refer to an example or illustration.
It will be understood that when an element or layer is referred to as being “on”, “connected to”, “coupled to”, or “adjacent to” another element or layer, it can be directly on, connected, coupled, or adjacent to the other element or layer, or intervening elements or layers may be present. In contrast, when an element is referred to as being “directly on,” “directly connected to”, “directly coupled to”, or “immediately adjacent to” another element or layer, there are no intervening elements or layers present.
Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and/or the present specification and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
Referring to
The guest OS 100 may be an OS installed on a VM and driven by the VM. In certain embodiments, the guest OS 100 may be different from the host OS 300. For example, the guest OS 100 may be an OS having a different platform from the host OS 300, an OS incompatible with the host OS 300, or an OS incapable of directly controlling the hardware 320 that is controlled by the host OS 300.
In some embodiments, for example, the guest OS 100 may be an OS such as OS X® of Apple Inc., WINDOWS® of Microsoft Corporation, UNIX®, LINUX®, or a mobile specific OS, such as, for example, ANDROID™ of Google. However, the guest OS 100 is not limited to these examples. In some embodiments, the guest OS 100 may be not only an OS but also a specific application, program or process that can be executed on a VM.
In certain embodiments, the VM 200 operates in the host OS 300 and implements a computing device or environment as software that emulates a computing device or environment. For example, the VM 200 provides to the guest OS 100 an environment in which it executes. In some embodiments, the VM 200 may provide a complete execution environment of an OS (i.e., the guest OS 100) by supporting emulation of the entire computing device. In some other embodiments, the VM 200 may support emulation for executing a specific application, program or process.
In some embodiments, the VM 200 may be, but is not limited to, a kernel-based virtual machine (KVM). The VM 200 may be implemented in the computing device or environment at one or more levels such as an application level, a kernel level, etc.
The host OS 300 may be an OS that directly controls the computing device or environment. For example, the host OS 300 may directly control the file system 310, the hardware 320, etc. of the computing device or environment. For example, as illustrated in
In some embodiments, for example, the host OS 300 may be an OS such as OS X® of Apple Inc., WINDOWS® of Microsoft Corporation, UNIX®, LINUX®, or a mobile specific OS such as ANDROID™ of Google. However, the host OS 300 is not limited to these examples.
Referring to
When the input/output event is detected by the VM 200 through the RP 210, the VM 200 may generate and transmit an input/output request message to the host OS 300. For example, when VM 200 detects an input/output event for storing the data in a memory or storage device by using the RP 210, the VM 200 may send an input/output request message to the host OS 300, requesting that host OS 300 store the data in the memory or storage device.
Referring to
When the input/output completion event is detected by the VM 200 through the RWP 220, the VM 200 may generate and send an input/output response message to the guest OS 100. For example, when the VM 200 detects, using the RWP 220, the input/output completion event that indicates completion of the data storage operation, the VM 200 may send an input/output response message to the guest OS 100 to notify a user or a specific application, program or process of the completion of an input/output operation and the result of the input/output operation. In some embodiments, to notify a user or a specific application, program or process, the guest OS 100 may, for example, generate a message (audio, visual, etc.) for output on an interface device (e.g., display unit, printer, speakers, lights, etc.); prepare and transmit notification messages to other applications, programs or processes; change a status associated with the guest OS 100 and/or applications, programs or processes; etc.
In some embodiments, the RP 210 and the RWP 220 may be performed or executed sequentially, concurrently, or in parallel. In some embodiments, one or more threads performing the RWP 220 may be executed in parallel with one or more threads performing the RP 210. In some embodiments, when threads are executing in parallel, they may be executing at the same instant; when threads are executing concurrently, they may be executing such that one thread starts before another thread ends; and when threads are executing sequentially, they may be executing without any overlap in their respective executions.
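By way of illustration only, the round trip described above (RP, request message, RWP, response message) may be sketched in a single thread as follows. This is a minimal sketch under stated assumptions and not the disclosed implementation: `SharedRing`, `fake_host_os`, `handle_one_io`, the `max_spins` bound, and the dictionary message layout are hypothetical names introduced for the example, and the host OS is replaced by an in-process stand-in.

```python
from collections import deque


class SharedRing:
    """Hypothetical shared queue that one side fills and the other side polls."""

    def __init__(self):
        self._items = deque()

    def post(self, item):
        self._items.append(item)

    def poll(self):
        """Return the next item, or None if nothing is pending (never blocks)."""
        return self._items.popleft() if self._items else None


def spin_poll(ring, max_spins=1000):
    """Check the ring repeatedly instead of waiting for an interrupt."""
    for _ in range(max_spins):
        item = ring.poll()
        if item is not None:
            return item
    return None


def fake_host_os(request, host_ring, storage):
    """Stand-in for the host OS: store the data, then post a completion event."""
    storage[request["block"]] = request["data"]
    host_ring.post({"id": request["id"], "status": "ok"})


def handle_one_io(guest_ring, host_ring, storage):
    # Request polling (RP): detect an I/O event generated by the guest OS.
    event = spin_poll(guest_ring)
    if event is None:
        return None
    # Send an I/O request message to the host OS.
    request = {"id": event["id"], "block": event["block"], "data": event["data"]}
    fake_host_os(request, host_ring, storage)
    # Response waiting polling (RWP): detect the host's completion event.
    completion = spin_poll(host_ring)
    if completion is None:
        return None
    # I/O response message that would be sent back to the guest OS.
    return {"id": completion["id"], "result": completion["status"]}


guest_ring, host_ring, storage = SharedRing(), SharedRing(), {}
guest_ring.post({"id": 1, "block": 7, "data": b"hello"})
response = handle_one_io(guest_ring, host_ring, storage)
print(response)
```

In this sketch both polls run back to back in one thread, which corresponds to the sequential case; the multi-threaded cases would place the two polling loops on different threads.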
As illustrated in the example of
Specifically,
In some embodiments, the RP 400, the RWP 410, the RP 402, and the RWP 412 may be performed in a pipeline manner as illustrated in
In some embodiments, the threads executing or performing the RP 400, the RWP 410, the RP 402, and the RWP 412 may be reusable. For example, the RP 400 of the first input/output event may be performed by a first thread. When an input/output event generated by the guest OS 100 is detected through the RP 400, the VM 200 may send an input/output request to the host OS 300 using the first thread and then generate a second thread for performing the RP 402. After the generation of the second thread, the RWP 410 may be performed by the first thread.
As another example embodiment, the RWP 410 may be performed by a third thread. When an input/output completion event generated by the host OS 300 is detected through the RWP 410, the VM 200 may send an input/output response message to the guest OS 100 using the third thread and then generate a fourth thread for performing the RWP 412. Here, the RP following the generation of the fourth thread (not illustrated) may be performed by the third thread.
Referring to
During the RP 402 performed by the second thread, the first thread may send an input/output completion notification corresponding to an input/output completion event generated by the host OS 300 to the VM 200 (the RWP 410). In this way, the input/output method according to the current embodiment may be performed in a pipeline polling manner using multiple threads.
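The thread handoff described above, in which a thread that detects an event through the RP sends the request, generates the next thread for the following RP, and then performs the RWP for its own request, may be sketched as follows. This is a hedged illustration rather than the disclosed implementation: `worker`, `poll`, `host_submit`, the 10 ms simulated host latency, and the polling interval are all assumptions made for the example.

```python
import threading
import time

guest_events = []   # events posted by the simulated guest OS
completions = {}    # request id -> status, filled by the simulated host OS
responses = []      # (request id, status, name of the thread that delivered it)
workers = []        # every polling thread that was generated
lock = threading.Lock()


def host_submit(request):
    """Stand-in for the host OS: complete the request after a short delay."""
    def complete():
        with lock:
            completions[request["id"]] = "ok"
    threading.Timer(0.01, complete).start()


def poll(fetch, interval=0.001, timeout=2.0):
    """Call fetch() repeatedly until it yields a value or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        value = fetch()
        if value is not None:
            return value
        time.sleep(interval)
    return None


def take_guest_event():
    with lock:
        return guest_events.pop(0) if guest_events else None


def take_completion(request_id):
    with lock:
        return completions.pop(request_id, None)


def worker(remaining):
    event = poll(take_guest_event)                 # RP for this thread's event
    if event is None:
        return
    host_submit({"id": event["id"]})               # request message to the host
    if remaining > 1:                              # generate the thread for the next RP
        nxt = threading.Thread(target=worker, args=(remaining - 1,))
        with lock:
            workers.append(nxt)
        nxt.start()
    status = poll(lambda: take_completion(event["id"]))   # RWP on the same thread
    with lock:
        responses.append((event["id"], status, threading.current_thread().name))


guest_events.extend({"id": i} for i in (1, 2, 3))
first = threading.Thread(target=worker, args=(3,))
workers.append(first)
first.start()

# Join every generated thread; a finished thread has already registered its successor.
index = 0
while True:
    with lock:
        if index >= len(workers):
            break
        thread = workers[index]
    thread.join()
    index += 1

print(sorted(responses))
```

While one thread is waiting in its RWP, the thread it generated is already polling for the next guest event, which is the overlap that gives the pipeline its shape.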
Referring to
Referring to
As illustrated in the example of
Specifically,
In some embodiments, the RP 400, the RWP 410, the CP 500, the RP 402, the RWP 412, the CP 502, the RP 404, the RWP 414 and the CP 504 may be performed in a pipeline manner as illustrated in
Referring to
While the second thread is performing the RWP 412, a third thread may send an input/output response event generated by the VM 200 to the guest OS 100 (the CP 500).
The input/output response event may be a status change and/or notification that is generated by the VM 200 and detected or received by the CP 500. In this way, the input/output method according to some embodiments may be performed in a pipeline polling manner using multiple threads.
Referring to
In some embodiments, at least two of the RP thread 610, the RWP thread 620, and the CP thread 630 may be executed serially, concurrently, and/or in parallel. And, in some embodiments, the RP 610, the RWP 620, and the CP 630 may be performed in a pipeline manner by the RP thread 610, the RWP thread 620, and the CP thread 630, respectively.
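As one possible illustration of the arrangement above, the following sketch dedicates one thread to each polling role and connects the stages with queues. It is a minimal sketch under assumptions, not the disclosed implementation: the queue names, `poll_loop`, `fake_host`, the polling interval, and the five-request workload are hypothetical, and the host OS is replaced by a function that completes each request immediately.

```python
import queue
import threading
import time

submissions = queue.Queue()   # guest OS -> VM: I/O events
completions = queue.Queue()   # host OS -> VM: completion events
responses = queue.Queue()     # VM -> guest OS: response events
delivered = []                # request ids seen by the guest-side CP thread
stop = threading.Event()


def poll_loop(source, handle):
    """Poll a queue without blocking and hand each item to the stage handler."""
    while not stop.is_set():
        try:
            item = source.get_nowait()
        except queue.Empty:
            time.sleep(0.0005)
            continue
        handle(item)


def fake_host(request):
    """Stand-in for the host OS: complete every request immediately."""
    completions.put({"id": request["id"], "status": "ok"})


def rp_stage(event):          # RP thread: forward the guest event to the host
    fake_host({"id": event["id"], "op": event["op"]})


def rwp_stage(completion):    # RWP thread: turn a completion into a response
    responses.put({"id": completion["id"], "result": completion["status"]})


def cp_stage(response):       # CP thread (guest side): consume the response
    delivered.append(response["id"])


stages = [("rp", submissions, rp_stage),
          ("rwp", completions, rwp_stage),
          ("cp", responses, cp_stage)]
threads = [threading.Thread(target=poll_loop, args=(src, fn), name=name)
           for name, src, fn in stages]
for t in threads:
    t.start()

for i in range(5):
    submissions.put({"id": i, "op": "write"})

# Wait (bounded) until the CP stage has seen every response, then stop the pollers.
deadline = time.monotonic() + 2.0
while len(delivered) < 5 and time.monotonic() < deadline:
    time.sleep(0.001)
stop.set()
for t in threads:
    t.join()

print(delivered)
```

Because each stage has a single consumer and the queues are first-in, first-out, requests leave the pipeline in the order they entered it, while different requests can occupy different stages at the same time.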
Referring to
Referring to
In some embodiments, performing the RWP 220 may include performing the RP 210 in parallel or concurrently with the RWP 220, such that the RWP 220 is performed using the first thread and the RP 210 is performed using a second thread that is different from the first thread. In some embodiments, sending the input/output completion message corresponding to the input/output completion event detected through the RWP 220 to the guest OS 100 may include sending an input/output response message to the guest OS 100 based on the input/output completion event detected through the RWP 220.
Referring to
In some embodiments, performing the RWP 220 may include performing the RP 210 and the CP 110 serially, concurrently, or in parallel with the RWP 220 using the second thread and the third thread while performing the RWP 220 using the first thread.
According to various embodiments, a VM emulates an input/output event generated by a guest OS using a polling method and sends the input/output event to a host OS. Therefore, overhead due to context switching can be prevented. Accordingly, the disclosed embodiments may prevent resource waste and performance degradation of a computing device in which a virtual environment and a high-speed storage environment (e.g., a solid state drive (SSD) environment) that requires low latency have been established.
The disclosed embodiments have been described with reference to the attached drawings, but it may be understood by one of ordinary skill in the art that the disclosed embodiments may be performed in other forms without changing the technical concept or essential features. Further, the above-described embodiments are merely examples and are not intended to limit or otherwise restrict the scope of the claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2014-0185988 | Dec 2014 | KR | national |
Number | Date | Country |
---|---|---|
20160179725 A1 | Jun 2016 | US |