In a typical virtual desktop infrastructure (VDI) architecture, displays and input devices are local, and applications execute remotely in a server. A user's desktop is typically hosted in a datacenter or cloud, and the user remotely interacts with her desktop via a variety of endpoint devices, including desktops, laptops, thin clients, smart phones, and tablets. There are many other instances where users may interact with a computer system remotely.
The foregoing and other features of the present disclosure will become more fully apparent from the following description and appended claims, taken in conjunction with the accompanying drawings. Understanding that these drawings depict only several embodiments in accordance with the disclosure and are therefore not to be considered limiting of its scope, the disclosure will be described with additional specificity and detail through use of the accompanying drawings.
In the drawings:
User experience is a key consideration when organizations decide on software for deploying remote desktops. An important way to measure user experience is to determine the display quality visually seen by the users. Thus what is needed is a mechanism to measure display quality in a remote desktop environment.
In accordance with examples of the present disclosure, methods are provided to measure the display quality of a remote desktop that an end user sees on a client. The methods compare the screen the end user sees at the client side against what the end user should see on the screen at the server side. Two indicators of display quality are provided: (a) a relative frame rate and (b) an overall image quality score.
The relative frame rate compares a frame rate at the client side against a baseline frame rate at the server side. To compare the frame rates, a timestamp video is played at the server side and the screens at the server side and the client side are captured. A frame rate on each side is determined as the number of captured frames with unique timestamps divided by the duration of the capture, which is calculated from the timestamps on the first and the last captured frames.
The overall image quality score compares each frame captured at the client side against a baseline frame captured at the server side to determine frame image quality scores, and then the overall image quality is calculated from the frame image quality scores. For each frame image quality score, the frames are compared on a block basis where the frame image quality score is a weighted average of the block similarity scores and bad blocks are given additional weight. The overall image quality score is a weighted average of the frame image quality scores for the client frames and bad frames where bad blocks propagate through are given additional weight.
VM 106-n includes a guest operating system (OS) 116 and a benchmark server application 118. Client 108-n (e.g., “second computer”) includes an OS 120, a desktop viewer application 122, and a benchmark client application 124. Desktop viewer application 122 displays the remote desktop of VM 106-n on client 108-n. Benchmark server application 118 and benchmark client application 124 work together to benchmark the user experience of the VDI in system 100.
In block 202, benchmark server application 118 plays a timestamp video on the remote desktop on VM 106-n at the server side. The timestamp video is played back at a fast rate, such as 100 fps. The timestamp video is made up of individual timestamps that are small in size, such as four-by-four pixels. Each timestamp has black and white pixels where a black pixel represents a zero (0) bit and a white pixel represents a one (1) bit. FIG. 3 shows a timestamp 300 in examples of the present disclosure.
In block 204, benchmark server application 118 captures the screen on VM 106-n at the server side for a predetermined amount of time, such as one to several minutes. In one example, benchmark server application 118 captures the screen at a rate, such as 200 fps, faster than the playback rate of the timestamp video. Benchmark server application 118 reads the timestamp on each captured frame. If multiple frames have the same timestamp, benchmark server application 118 only retains one, such as the first one. Benchmark server application 118 then counts the number of captured frames with unique timestamps. Block 204 may be followed by block 206.
In block 206, benchmark server application 118 calculates the frame rate on VM server 106-n at the server side. The frame rate is calculated as follows:
where “SUM of unique frames” is the count from block 204, and the “Time span of screen capture” is equal to the difference in last and the first timestamps divided by the timestamp playback rate (e.g., 100 fps). Block 206 may be followed by block 208.
In block 208, benchmark client application 124 captures the screen at client 108-n on the client side for a predetermined amount of time, such as one to several minutes. In one example, benchmark client application 124 captures the screen at a rate, such as 50 fps, faster than the rate of the remote display protocol, such as 30 fps. Benchmark client application 124 may save the captured frames in a shared folder on VM 106-n that can be accessed by benchmark server application 118. Block 208 may be followed by block 210.
In block 210, benchmark server application 118 calculates the frame rate at client 108-n on the client side. First, benchmark server application 118 retrieves the captured frames from the shared folder. As in block 204, if multiple frames have the same timestamp, benchmark server application 118 only retains one, such as the first one. Benchmark server application 118 then counts the number of captured frames with unique timestamps.
The timestamps captured at client 108-n on the client side may be distorted by the remote display protocol as it tries to relay the remote desktop while conserving network bandwidth. To correctly read the value of the timestamps, the brightness of each pixel in a timestamp is determined and then each pixel is binarized as a black pixel or a white pixel by comparing its brightness against a brightness threshold. In one example, the brightness of each pixel is determined as follows:
Brightness=0.299R+0.587G+0.144B, (2)
where R, G, B are the color values of the pixel. In one example a pixel is considered white if its brightness is greater than 128, or otherwise the pixel is considered black.
The frame rate at client 108-n at the client side is also calculated with equation (1) described above. Block 210 may be followed by block 212.
In block 212, benchmark server application 118 calculates the relative frame rate as follows:
In block 402, benchmark server application 118 plays a timestamp video on the remote desktop on VM 106-n at the server side. As in block 202 (
In block 404, benchmark server application 118 and benchmark client application 124 synchronize with each other in preparation to capture screen on both sides at the same time. Benchmark server application 118 and benchmark client application 124 may synchronize with each other through a virtual channel between them. Block 404 may be followed by block 406.
In block 406, benchmark server application 118 captures the screen on VM 106-n at the server side for a predetermined amount of time, such as one to several minutes. Similar to block 204 (
At the same time, benchmark client application 124 captures the screen at client 108-n on the client side for a predetermined amount of time, such as one to several minutes. Similar to block 208 (
In block 408, benchmark server application 118 determines, for each captured frame at client 108-n on the client side (hereafter “client frame” or “second computer frame”), a corresponding captured frame at server 106-n on the server side (hereafter “server frame” or “first computer frame”) based on timestamps on the frames. The corresponding server frame is a baseline for determining frame image quality. The details of block 408 are described later in reference to
In block 410, benchmark server application 118 calculates a frame image quality score for each client frame compared to its baseline server frame. Instead of comparing the entire frame, benchmark server application 118 may compare a small area (e.g., a comparison area of 300 pixels by 300 pixels) and such a comparison area may be randomly moved from frame to frame. The details of block 410 are described later in reference to
In block 412, benchmark server application 118 calculates an overall frame image quality score for the entire series of client frames based on the individual frame image quality scores determined in block 410. In particular, the overall image quality score considers the aggregate effect of “bad blocks” that propagate through the frames over time. The details of block 412 are described later in reference to
As introduced above, in block 408, benchmark server application 118 determines a baseline server frame for each client frame based on the timestamps on the frames. However, due to network latency, the timestamps on the client frames and the server frames may not be completely synchronized.
In block 602, benchmark server application 118 finds the first frame (hereafter “target frame”) in the server frames that has the same timestamp as a client frame. Block 602 may be followed by block 604.
In block 604, benchmark server application 118 determines if the target frame is the same as the previous server frame and the next server frame in the sequence of captured server frames. For example, benchmark server application 118 calculates an image quality score using a method described later in reference blocks 702 to 710 (
In block 606, benchmark server application 118 sets the target frame as the baseline server frame for the client frame.
In block 608, benchmark server application 118 determines if the target frame is not the same as the previous server frame and the target frame is not the same as the next server frame in the sequence of captured server frames. If so, block 608 may be followed by block 610. Otherwise the target frame is equal to one of the previous server frame and the next server frame, and block 608 may be followed by block 612.
In block 610, benchmark server application 118 calculates three frame image quality scores by comparing the client frame against the target frame, the previous server frame, and the next server frame. A method for calculating a frame image quality score is described later in reference to
In block 612, benchmark server application 118 calculates two frame image quality scores by comparing the client frame against the previous server frame and the next server frame. A method for calculating a frame image quality score is described later in reference to
In block 702, benchmark server application 118 divides a client frame and its baseline server frame into blocks (e.g., 16 by 16 pixels) and then compares the corresponding blocks. By dividing the frames into blocks, a shift in a single pixel will have more weight than if it is compared as part of the entire frame.
The corresponding blocks are compared as follows:
where “BlockSimilarity” is a block similarity score between two corresponding blocks, “PSNR” is the peak signal-to-noise ratio score between two corresponding blocks, and “SSIM” is the structural similarity score between two corresponding blocks. PSNR and SSIM are well-known algorithms for determining image quality or similarity between two images. Equation (3) provides that if the PSNR score is less than or equal to 18 and the SSIM score is greater than or equal to 50%, then the block similarly score is set equal to the PSNR score multiplied by 50% and divided by 18. Otherwise the block similarity score is set equal to the SSIM score. Block 702 may be followed by block 704.
In block 704, benchmark server application 118 finds each bad block in the client frame and then counts the number of bad blocks in a surrounding neighborhood. In one example, a bad block is a block that has a block similar score of less than 50%. In one example, the neighborhood is set to have a radius of 1 to 2% of the comparison area (e.g., a radius of 2 blocks for a 300 by 300 pixel comparison area).
In block 706, benchmark server application 118 determines a “MaxBadBlockDensityRatio” parameter that represents how densely bad blocks are concentrated, which is visually distracting to the end user. The MaxBadBlockDensityRatio is calculated as follows:
where “DensityMaxCount” is the largest number of bad blocks in a neighborhood that was counted in block 704. Block 706 may be followed by block 708.
In block 708, benchmark server application 118 calculates the frame image quality score using a weighted average of the block similarity scores of the client frame as follows:
where “BS” is a particular block similarity score and “A” is the number of blocks that has a corresponding BS value. As can be seen in equation (5), bad blocks having similar scores of 50% or less are given more weight. Furthermore, the frame image quality score is further reduced by the MaxBadBlockDensityRatio parameter. Block 708 may be followed by block 710.
In block 710, benchmark server application 118 finds the noisiest block in the client frame (i.e., the block with the lowest block similarity score). Benchmark server application 118 also finds the noisiest blocks in the prior client frame and the next client frame. Block 710 may be followed by block 712.
In block 712, benchmark server application 118 determines if the noisiest blocks match by calculating two PSNR scores that compare the noisiest block in the client frame against the noisiest blocks in the prior client frame and the next client frame. Block 712 may be followed by block 714.
In block 714, benchmark server application 118 determines if the two PSNR scores are both greater than a threshold, such as 18. If so, it means a bad block is propagating through the captured frames, which is visually distracting to the end user. If the two PSNR scores are both greater than the threshold, then block 714 may be followed by block 716. Otherwise block 714 may be followed by block 718.
In block 716, benchmark server application 118 increments a “ContinuousBadness” parameter by 1. ContinuousBadness is a variable that represents the number of continuous frames through which bad blocks propagate. At the beginning of method 700, ContinuousBadness is initialized to 0. Block 716 may be followed by block 718.
In block 718, benchmark server application 118 calculates an overall image quality score as follows:
where “FIQS” is a particular FrameImageQualtiyScore, and “A” is the number of frames that has a corresponding FIQS value. As can be seen in equation (6), the overall image quality score is a weighted average of the frame image quality scores. The overall image quality score is further reduced by the ContinuousBadness parameter.
In addition to remote desktops running on VMs, the benchmark methods and applications in the present disclosure may also be applied to systems with remote desktops running on physical machines.
From the foregoing, it will be appreciated that various embodiments of the present disclosure have been described herein for purposes of illustration, and that various modifications may be made without departing from the scope and spirit of the present disclosure. Accordingly, the various embodiments disclosed herein are not intended to be limiting, with the true scope and spirit being indicated by the following claims.
This application is related to U.S. patent application No. ______ (Attorney Docket No. B127.02), entitled “Frame Image Quality as Display Quality Benchmark for Remote Desktop,” which is concurrently filed, commonly owned, and incorporated by reference in its entirety.