This disclosure relates generally to graphic computer software systems and, more specifically, to a system and method for creating computer graphics and artwork with a vision system.
Graphic software applications provide users with tools for creating drawings for presentation on a display such as a computer monitor or tablet. One such class of applications includes painting software, in which computer-generated images simulate the look of handmade drawings or paintings. Graphic software applications such as painting software can provide users with a variety of drawing tools, such as brush libraries, chalk, ink, and pencils, to name a few. In addition, the graphic software application can provide a ‘virtual canvas’ on which to apply the drawing or painting. The virtual canvas can include a variety of simulated textures.
To create or modify a drawing, the user selects an available input device and opens a drawing file within the graphic software application. Traditional input devices include a mouse, keyboard, or pressure-sensitive tablet. The user can select and apply a wide variety of media to the drawing, such as selecting a brush from a brush library and applying colors from a color panel, or from a palette mixed by the user. Media can also be modified using an optional gradient, pattern, or clone. The user then creates the graphic using a ‘start stroke’ command and a ‘finish stroke’ command. In one example, contact between a stylus and a pressure-sensitive tablet display starts the brushstroke, and lifting the stylus off the tablet display finishes the brushstroke. The resulting rendering of any brushstroke depends on, for example, the selected brush category (or drawing tool); the brush variant selected within the brush category; the selected brush controls, such as brush size, opacity, and the amount of color penetrating the paper texture; the paper texture; the selected color, gradient, or pattern; and the selected brush method.
As the popularity of graphic software applications flourish, new groups of drawing tools, palettes, media, and styles are introduced with every software release. As the choices available to the user increase, so does the complexity of the user interface menu. Graphical user interfaces (GUIs) have evolved to assist the user in the complicated selection processes. However, with the ever-increasing number of choices available, even navigating the GUIs has become time-consuming, and may require a significant learning curve to master. In addition, the GUIs can occupy a significant portion of the display screen, thereby decreasing the size of the virtual canvas.
In one aspect of the invention, a method for displaying visual feedback in a graphics application program executing on a computer is disclosed. The method includes a step of connecting a vision system to the computer, wherein the vision system is adapted to monitor a visual space. The method further includes the steps of detecting, by the vision system, a tracking object in the visual space, executing, by the computer, a graphics application program, and outputting, by the vision system to the computer, spatial coordinate data representative of the location of the tracking object within the visual space. The method further includes the steps of mapping a horizontal portion and a vertical portion of the spatial coordinate data to a display connected to the computer, and rendering, by the computer on the display, a visual feedback attribute for a tool within the graphics application program by mapping the spatial coordinate data to a property of the tool.
In another aspect of the invention, a graphic computer software system is disclosed. The system includes a computer comprising one or more processors, one or more computer-readable memories, and one or more computer-readable tangible storage devices. The system further includes program instructions stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories. The system further includes a display connected to the computer, a tracking object, and a vision system connected to the computer. The vision system includes one or more image sensors adapted to capture the location of the tracking object within a visual space. The vision system is adapted to output to the computer spatial coordinate data representative of the location of the tracking object within the visual space. The computer program instructions include program instructions to execute a graphics application program and output to the display, program instructions to map at least a horizontal and vertical portion of the spatial coordinate data of the tracking object as input to a graphics engine of the graphics application program, and program instructions to render a visual feedback attribute for a tool within the graphics application program by mapping the spatial coordinate data to a property of the tool.
The features described herein can be better understood with reference to the drawings described below. The drawings are not necessarily to scale, emphasis instead generally being placed upon illustrating the principles of the invention. In the drawings, like numerals are used to indicate like parts throughout the various views.
According to various embodiments of the present invention, a graphic computer software system provides a solution to the problems noted above. The graphic computer software system includes a vision system as an input device to track the motion of an object in the vision system's field of view. The output of the vision system is translated to a format compatible with the input to a graphics application program. The object's motion can be used to create brushstrokes, control drawing tools and attributes, and control a palette, for example. As a result, the user experience is more natural and intuitive, and does not require a long learning curve to master.
As will be appreciated by one skilled in the art, the present disclosure may be embodied as a system, method or computer program product. Accordingly, the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, the present disclosure may take the form of a computer program product embodied in one or more computer-readable medium(s) having computer-readable program code embodied thereon.
Any combination of one or more computer-readable medium(s) may be utilized. The computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. A computer-readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer-readable storage medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer-readable storage medium may be any tangible medium that can contain or store a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer-readable signal medium may include a propagated data signal with computer-readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer-readable signal medium may be any computer-readable medium that is not a computer-readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Note that the computer-usable or computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via, for instance, optical scanning of the paper or other medium, then compiled, interpreted, or otherwise processed in a suitable manner, if necessary, and then stored in a computer memory. In the context of this document, a computer-usable or computer-readable medium may be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. The computer-usable medium may include a propagated data signal with the computer-usable program code embodied therewith, either in baseband or as part of a carrier wave. The computer usable program code may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc.
Program code embodied on a computer-readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as PHP, Javascript, Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
The present invention is described below with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions.
These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer program instructions may also be stored in a computer-readable medium that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable medium produce an article of manufacture including instruction means which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
With reference now to the figures, and in particular, with reference to
Computer 12 includes a processor (or CPU) 14 that is coupled to a system bus 15. Processor 14 may utilize one or more processors, each of which has one or more processor cores. A video adapter 16, which drives/supports a display 18, is also coupled to system bus 15. System bus 15 is coupled via a bus bridge 20 to an input/output (I/O) bus 22. An I/O interface 24 is coupled to (I/O) bus 22. I/O interface 24 affords communication with various I/O devices, including a keyboard 26, a mouse 28, a media tray 30 (which may include storage devices such as CD-ROM drives, multi-media interfaces, etc.), a printer 32, and external USB port(s) 34. While the format of the ports connected to I/O interface 24 may be any known to those skilled in the art of computer architecture, in a preferred embodiment some or all of these ports are universal serial bus (USB) ports.
As depicted, computer 12 is able to communicate with a software deploying server 36 and central service server 38 via network 40 using a network interface 42. Network 40 may be an external network such as the Internet, or an internal network such as an Ethernet or a virtual private network (VPN).
A storage media interface 44 is also coupled to system bus 15. The storage media interface 44 interfaces with a computer readable storage media 46, such as a hard drive. In a preferred embodiment, storage media 46 populates a computer readable memory 48, which is also coupled to system bus 14. Memory 48 is defined as a lowest level of volatile memory in computer 12. This volatile memory includes additional higher levels of volatile memory (not shown), including, but not limited to, cache memory, registers and buffers. Data that populates memory 48 includes computer 12's operating system (OS) 50 and application programs 52.
Operating system 50 includes a shell 54, for providing transparent user access to resources such as application programs 52. Generally, shell 54 is a program that provides an interpreter and an interface between the user and the operating system. More specifically, shell 54 executes commands that are entered into a command line user interface or from a file. Thus, shell 54, also called a command processor, is generally the highest level of the operating system software hierarchy and serves as a command interpreter. The shell 54 provides a system prompt, interprets commands entered by keyboard, mouse, or other user input media, and sends the interpreted command(s) to the appropriate lower levels of the operating system (e.g., a kernel 56) for processing. Note that while shell 54 is a text-based, line-oriented user interface, the present disclosure will equally well support other user interface modes, such as graphical, voice, gestural, etc.
As depicted, operating system (OS) 50 also includes kernel 56, which includes lower levels of functionality for OS 50, including providing essential services required by other parts of OS 50 and application programs 52, including memory management, process and task management, disk management, and mouse and keyboard management.
Application programs 52 include a renderer, shown in exemplary manner as a browser 58. Browser 58 includes program modules and instructions enabling a world wide web (WWW) client (i.e., computer 12) to send and receive network messages to the Internet using hypertext transfer protocol (HTTP) messaging, thus enabling communication with software deploying server 36 and other described computer systems.
The hardware elements depicted in computer 12 are not intended to be exhaustive, but rather are representative to highlight components useful by the present disclosure. For instance, computer 12 may include alternate memory storage devices such as magnetic cassettes (tape), magnetic disks (floppies), optical disks (CD-ROM and DVD-ROM), and the like. These and other variations are intended to be within the spirit and scope of the present disclosure.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In one embodiment of the invention, application programs 52 in computer 12's memory (as well as software deploying server 36's system memory) may include a graphics application program 60, such as a digital art program that simulates the appearance and behavior of traditional media associated with drawing, painting, and printmaking.
Turning now to
The visual space 66 is a three-dimensional area in the field of view of the image sensors 64. In one embodiment, the visual space 66 is limited to a small area to provide more accurate tracking and prevent noise (e.g., other objects) from being detected by the system. In one example, the visual space 66 is approximately 0.23 m3 (8 cu.ft.), or roughly equivalent to a 61 cm cube. As shown, the vision system 62 is positioned directly in front of the computer display 18, the image sensors 64 pointing vertically upwards. In this manner, a user may position themselves in front of the display 18 and draw or paint as if the display were a canvas on an easel.
In other embodiments of the present invention, the vision system 62 could be positioned on its side such that the image sensors 64 point horizontally. In this configuration, the vision system 62 can detect a tracking object 68 such as a hand, and the hand could be manipulating the mouse 28 or other input device. The vision system 62 could detect and track movements related to operation of the mouse 28, such as movement in an X-Y plane, right-click, left-click, etc. It should be noted that a mouse need not be physically present—the user's hand could simulate the movement of a mouse (or other input device such as the keyboard 26), and the vision system 62 could track the movements accordingly.
The tracking object 68 may be any object that can be detected, calibrated, and tracked by the vision system 62. In the example wherein the vision system is a Leap Motion controller, exemplary tracking objects 68 include one hand, two hands, one or more fingers, a stylus, painting tools, or a combination of any of those listed. Exemplary painting tools can include brushes, sponges, chalk, and the like.
The vision system 62 may include as part of its operating software a calibration routine 70 in order that the vision system recognizes each tracking object 68. For example, the vision system 62 may install program instructions including a detection process in the application programs 52 portion of memory 48. The detection process can be adapted to learn and store profiles 70 (
As shown in
Traditional graphics application programs utilize a mouse or pressure-sensitive tablet as an input device to indicate position on the virtual canvas, and where to begin and end brushstrokes. In the case of a mouse as an input device, the movement of the mouse on a flat surface will generate planar coordinates that are fed to the graphics engine of the software application, and the planar coordinates are translated to the computer display or virtual canvas. Brushstrokes can be created by positioning the mouse cursor to a desired location on the virtual canvas and using mouse clicks to indicate start brushstroke and stop brushstroke commands. In the case of a tablet as an input device, the movement of a stylus on the flat plane of the tablet display will generate similar planar coordinates. In some tablets, application of pressure on the flat display can be used to indicate a start brushstroke command, and lifting the stylus can indicate a stop brushstroke command. In either case, the usefulness of the input device is limited to generating planar coordinates and simple binary commands such as start and stop.
In contrast, the spatial coordinate data 72 of the vision system 62 can be adapted to provide coordinate input to the graphics application program 60 in three dimensions, as opposed to only two. The three dimensional data stream, the directional vector information, and additional information such as the width, length, size, velocity, shape and geometry of the tracking object can be used to enhance the capabilities of the graphics application program 60 to provide a more natural user experience.
In one embodiment of the present invention, the (x, y) portion of the position data from the spatial coordinate data 72 can be mapped to (x′, y′) input data for a painting application program 60. As the user moves the tracking object 68 within the visual space 66, the (x, y) coordinates are mapped and fed to the graphics engine of the software application, then ‘drawn’ on the virtual canvas. The mapping step involves a conversion from the particular coordinate output format of the vision system to a coordinate input format for the painting application program 60. In one embodiment using the Leap Motion controller, the mapping involves a two-dimensional coordinate transformation to scale the (x, y) coordinates of the visual space 66 to the (x′, y′) plane of the virtual canvas.
The (z) portion of the spatial coordinate data 72 can be captured to utilize specific features of the graphics application program 60. In this manner, the (x, y) coordinates could be utilized for a position database and the (z) coordinates could be utilized for another, separate database. In one example, depth coordinate data can provide start brushstroke and stop brushstroke commands as the tracking object 68 moves through the depth of visual space 66. The tracking object 68 may be a finger or a paint brush, and the graphics application program 60 may be a digital paint studio. The user may prepare to apply brush strokes to the virtual canvas by inserting the finger or brush into the visual space 66, at which time spatial coordinate data 72 begins streaming to the computer 12 for mapping, and the tracking object appears on the display 18. The brushstroke start and stop commands may be initiated via keyboard 26 or by holding down the left-click button of the mouse 28. In one embodiment of the invention, the user moves the tracking object 68 in the z-axis to a predetermined point, at which time the start brushstroke command is initiated. When the user pulls the tracking object 68 back in the z-axis past the predetermined point, the stop brushstroke command is initiated and the tracking object “lifts” off the virtual canvas.
In another embodiment of the invention, a portion of the visual space can be calibrated to enhance the operability with a particular graphics application program. Turning to
Furthermore, the scale of the zones can be non-linear. Thus, the mapping of the (z) coordinate data in the spatial coordinate data 72 is not a scalar, it may be mapped according to a quadratic equation, for example. This can be useful when it is desired that the rate of depth change accelerates as the distance increases from the central position.
Continuing with the example set forth above, wherein the tracking object 68 is a finger or a paint brush, and the graphics application program 60 may be a digital paint studio, the user may prepare to apply brush strokes to the virtual canvas by inserting the finger or brush into the visual space 66, at which time spatial coordinate data 72 begins streaming to the computer 12 for mapping, and the tracking object appears on the display 18.
As the user approaches the virtual canvas 76, the tracking object passes into zone Z1 and the object may be displayed on the screen. As the tracking object passes Z0, which may signify the virtual canvas, a start brushstroke command is initiated and the finger or brush “touches” the virtual canvas and begins the painting or drawing stroke. When the user completes the brushstroke, the tracking object 68 can be moved in the z-axis towards the user, and upon passing Z0 the stop brushstroke command is initiated and the tracking object “lifts” off the virtual canvas.
In another embodiment of the invention, the depth or position on the z-axis can be mapped to any of the brush's behaviors or characteristics. In one example, zone Z2 can be configured to apply “pressure” on the tracking object 68 while painting or drawing. That is, once past Z0, further movement of the tracking object into the second zone Z2 can signify the pressure with which the brush is pressing against the canvas; light or heavy. Graphically, the pressure is realized on the virtual canvas by converting the darkness of the paint particles. A light pressure or small depth into zone Z2 results in a light or faint brushstroke, and a heavy pressure or greater depth into zone Z2 results in a dark brushstroke.
In some applications, the transformation from movement in the vision system to movement on the display is linear. That is, a one-to-one relationship exists wherein the amount the object is moving is the same amount of pixels that are displayed. However, certain aspects of the present invention can apply a filter of sorts to the output data to accelerate or decelerate the movements to make the user experience more comfortable.
In yet another embodiment of the invention, non-linear scaling can be utilized in mapping the z-axis to provide more realistic painting or drawing effects. For example, in zone Z2, a non-linear coordinate transformation could result in the tracking object appearing to go to full pressure slowly, which is more realistic than linear pressure with depth. Conversely, in zone Z1, a non-linear coordinate transformation could result in the tracking object appearing to lift off the virtual canvas very quickly. These non-linear mapping techniques could be applied to different lengths of zones Z1 and Z2 to heighten the effect. For example, zone Z1 could occupy about one-third of the calibrated depth 74, and zone Z2 could occupy the remaining two-thirds. The non-linear transformation would result in the zone Z1 action appearing very quickly, and the zone Z2 action appearing very slowly.
The benefit to using non-linear coordinate transformation is that the amount of movement in the z-axis can be controlled to make actions appear faster or slower. Thus, the action of a brush lifting up could be very quick, allowing the user to lift up only a small amount to start a new stroke.
In the illustrated embodiments, and
Referring to
Note that the rendered image on the canvas 94 of the application window 80 is not required to look like the tracking object 68. Instead, it can be any image, such as one familiar to users of a previous release of the graphics application program 60.
As the user approaches the virtual canvas 94, the tracking object 68 passes through zone Z1 and breaks through the control plane Z0, which may signify the virtual canvas. A start brushstroke command is initiated and the tracking object 68 “touches” the virtual canvas and begins the painting or drawing stroke.
The inventors have determined that feedback is helpful to alert the user when the tracking object 68 either reaches or is about to reach the control plane 78 Z0. The feedback alleviates the potential problem of the user touching the canvas “blind,” with no point of reference. In one embodiment, then, the feedback may be tactile, such as haptic technology that applies a force, vibration, or motion to the tracking object 68 when it reaches the control plane 78.
In another embodiment, the feedback may be visual to provide a visual indication on the display 18 when the tracking object 68 reaches the control plane 78. In one example, the rendering of the tracking object 68 (or other representative graphic image) on the canvas 94 changes opacity as the object nears the control plane 78. The change in opacity can be linear, so the object on the display 18 appears darker in proportion to the distance traveled through zone Z1. For example, assuming zone Z1 is 30 cm in depth, the location of the tracking object 68 on the z-axis can be mapped to a linear equation defining the opacity attribute:
OPACITY(%)=3.33z+100 (1)
Thus, when the tracking object 68 first enters zone Z1 (e.g., z=−30 cm), the opacity value equals zero. At a mid-point of the zone Z1, the opacity value is 50%. The opacity value linearly increases until, at z=0=Z0, the opacity value equals 100%. Referring to
In another example, the opacity attribute can be non-linear, so that the object on the display 18 appears darker only as it approaches very near to the control plane Z0. For example, assuming zone Z1 is 30 cm in depth, the location of the tracking object 68 on the z-axis can be mapped to a quadratic equation defining the opacity attribute:
OPACITY(%)=0.14z2+7.42z+100 (2)
In this example, the opacity value at the mid-point of the zone Z1 is approximately 20%, which is barely visible, and the opacity does not reach 50% until approximately z=−7 cm, or about three-quarters through the depth of the zone. Again referring to
When the tracking object 68 passes through the control plane 78, the spatial coordinate data 72 being sent to the computer 12 is no longer sent to a preview mechanism and instead passes to the drawing or painting engine until the finish brushstroke command is received. Visual feedback can be applied to attributes of the rendered image on the display 18 (e.g., image 68 in
One advantage of mapping the depth portion of the spatial coordinate data 72 to a feedback attribute is that the extent of the depth within any zone can be discerned visually. For example, in zone Z1 the color could follow the color wheel 90 and gradually change form yellow to orange to red, for example, as the tracking object 68 nears a control plane 78. Upon entering the next zone Z2, which may be the brushstroke, the color could change to light blue and gradually change to blue to purple as the tracking object 68 nears a second control plane 78. The feedback attribute here could be the simulated pressure by the tracking object 68 on the virtual canvas 94, for example. Thus, when establishing control planes 78 and depth zones between each control plane, various visual feedback attributes can be applied to inform the user not only which zone they are in, but how far in each zone. Exemplary embodiments could include a first zone with a tool selection, a second zone with a color selection, a third zone with a preview or hover mechanism, and a fourth zone for painting.
In another embodiment, the feedback attribute can provide a visual indication of not only the extent of the depth within any zone, but also the functionality, contents, or use of the preceding or following zones. In the example above, the system included a first zone with a tool selection, a second zone with a color selection, and a third zone with a preview or hover mechanism. One example of visual feedback is that the opacity of the tool selection in the first zone may vary depending upon the depth in the z-axis. A different visual feedback could indicate when the tracking object is approaching the second zone with a color selection. For example, when the tracking object is in the first zone and approaching the second, a small preview ghost image of the color selection user interface (UI) could begin to appear. The preview ghost image is realized by varying the opacity of the UI with location along the z-axis. Similarly, as the tracking object backs out of the second plane towards the first, a ghost image of the tool selection UI could appear using the technique just described. In this manner, the visual feedback provides a preview of what is to come if the tracking object is moved in and out of the various control planes.
While the present invention has been described with reference to a number of specific embodiments, it will be understood that the true spirit and scope of the invention should be determined only with respect to claims that can be supported by the present specification. Further, while in numerous cases herein wherein systems and apparatuses and methods are described as having a certain number of elements it will be understood that such systems, apparatuses and methods can be practiced with fewer than the mentioned certain number of elements. Also, while a number of particular embodiments have been described, it will be understood that features and aspects that have been described with reference to each particular embodiment can be used with each remaining particularly described embodiment.