This invention relates to expression languages used in database technologies.
Many software products require the user to define rules, queries or conditions using an expression language. A few examples include: reporting tools that may allow their users to define database queries specifying the contents of a report; ETL (Extract Transform Load) tools, such as the DataStage tool, provided by International Business Machines (IBM) Corporation of Armonk, N.Y., that allow users to enter expressions to filter or transform a data set; and data profiling tools, such as the AuditStage, also provided by IBM, that allow users to define business rules describing constraints that the data should verify in order to fulfill certain data quality requirements.
A typical problem is that the expression languages used in products of this type often have a rich and complex syntax. The reason for the rich and complex syntax is that the expression languages must be flexible enough to cover all features that a user may need. Examples of expression languages include SQL (Structured Query Language), MDX (Multi Dimensional Expression), and various types of proprietary languages.
Often, the users who must define the expressions are not always skilled in programming languages. Users of reporting or data profiling tools are typically business users and not programmers. Even for programmers, the task of defining expressions can be tedious and error prone, since the product cannot always rely on a standardized language and a proprietary syntax must be introduced, which the user must learn.
Finally, sometimes the definition of a rule or a query may be defined in different steps by different users. For example, a business user may first define a high-level definition of the expression in plain English, and then, in a second step, a programmer may implement the technical details of the expression.
In all of these cases it may be challenging to find a comfortable method allowing users to define expressions using the full possibilities of the expression language, without requesting the users to learn a complex syntax or acquire programming skills.
Different solutions have been developed over the last few years in attempts to solve the above problems and make the software easier to use for business users. One common solution is to use a free-form text field, with some content assistance. That is, the user must still have some basic knowledge of the syntax of the expression language, but while he types the expression, an editor makes suggestions about the content. Features like code completion, syntax coloring, pop-up hints, error highlighting, and so on, are common in these programming tools.
Code templates can be also used to generate a “skeleton” of the expression and provide a starting point for the user. While this method is of great use for a programmer and does not set any limits for the complexity of the expression, the method is not very suitable for a pure business user with little programming language experience. The content assistance only provides hints, but does not require that the user respect the language constraints or prevent the user from making errors. Furthermore, once an error is contained in an expression, it may be difficult for the user to find the error. The user also still has to face the syntax of the language.
Another common solution is to use a table or tree form editor instead of a free form text editor. With such an editor, the user is no longer free to type any kind of expression, like in the above free form editor. Instead, the editor constraints the edits. A table editor can make an assumption about the format of the expression and force the user to enter each part of the expression in a different cell of a table, by choosing a possible value from a list of possibilities. For example an expression like: “IF age<18 THEN status=‘child’” may be edited in a table containing 6 columns where the user enters successively “age”, “<”, “18”, “status”, “‘child’”. A tree editor may allow the expression to be edited by adding nodes to a tree. Both techniques prevent the user from syntax errors, but significantly reduce the usability or flexibility of the editor.
A table based editor makes the assumption that an expression always contains the same number of elements in the same order. It is not easy to define an editor that would allow entering of a simple expression, such as “IF age<18 THEN status=‘child’”, but also a more complex expression, such as “IF ((col1+col2)/2)>50 and abs(col3)>10 then rtrim(col4)=‘abc’”. The tree editor may allow editing rules with different degrees of complexity, but is not intuitive for the user, as the tokenization of an expression in a tree of elements and sub-elements is not a natural way for the user to enter or read an expression.
Another approach is to use a specialized graphical editor. For instance some SQL query builders are based on a graphical canvas, where the user can drag and drop tables, check/uncheck a list of columns and connect the objects graphically to define join. Generally, these editors are specialized for a specific type of expressions (for example, such an editor may allow editing a SQL query, but not an update statement). Furthermore, graphical editors often have to rely on free form editors or table editors to define elements of their query (for example, a join condition). Another drawback is that their usage must be learned and is completely different from a text based solution. Skilled users who are familiar with the syntax of the expression languages often have difficulties using such tools.
In general, in one aspect, the invention provides methods and apparatus, including computer program products, implementing and using techniques for providing a visual editor allowing graphical editing of expressions in an expression language. A graphical user interface is displayed. A first user input of an expression is received. The expression is defined in a logical or textual form, and each component of the expression is represented by a graphical element on the graphical user interface. A syntax of the first user input is verified and an alert is provided to the user in response to detecting a syntax error or an inconsistency of the first user input when verifying the syntax.
Advantageous implementations can include one or more of the following features. The expression language can be a structured query language, a multi dimensional expression language, or a scripting language. A second user input can be received that converts one or more expressions defined in a textual form into to expressions defined in a logical form, and verifying a syntax can include verifying the syntax after receiving the second user input. The expression can be displayed on the graphical user interface as a root element and one or more sub-elements, wherein each root element or sub-element includes rules defining what further sub-elements can be added to the root element or sub-elements, respectively, and where each sub-element can be defined in a logical or textual form. The layout of the displayed expression can be changed on the graphical user interface in response to detecting an increased or decreased complexity of the expression.
A focus indicator can be displayed, which indicates to the user what graphical element is currently being edited by the user. One or more of the graphical elements can be displayed in an expanded state or a collapsed state, where the expanded state provides further details about the component represented by the graphical element compared to the collapsed state. In response to selecting a graphical element to be edited, a list of available graphical elements that can be added as sub-elements to the selected graphical element can be displayed. A validation icon can be displayed on the graphical user interface. The validation icon initiates a validation of the expression in response to being selected. A palette can be displayed on the graphical user interface. The palette contains representations of available graphical elements that can be added as sub-elements to the selected graphical element.
The invention can be implemented to include one or more of the following advantages. The advantages of a tree editor, such as hierarchical definition of the expression in a constraint way which prevents the user from making syntax errors, are combined with the advantages of a free form editor, such as natural representation of the expression in a linear way, which is easiest to understand for a typical user. As a result, the graphical editor in accordance with various embodiments of the invention allows editing of expressions independently from the complexity of the expression. The graphical editor works in a constrained way and prevents users from entering syntax errors. The graphical editor is intuitive to use and understand, and is therefore suitable both for a business user entering a high level definition, and for a skilled programmer familiar with the syntax, and allows a top-down editing approach.
Furthermore, the various embodiments of the graphical editor allow different levels of definition of the rule. For example, in a first pass, a user can provide a high-level definition of the overall expression only, and leave the implementation details for another user, or for a later time. In a second pass, another user may describe the expression further by editing the overall implementation of the expression, but the user can still also describe some sub-elements of the expression at a high level only and leave the exact implementation details (for example an exact binding) for later. It can be up to the user to decide to what extent the rules should be specified. When creating blocks, the user does not have to enter parentheses, which is typically a process that is prone to errors. Since the graphical editor automatically adds sub-elements when needed, less input work from the users is required.
The details of one or more embodiments of the invention are set forth in the accompanying drawings and the description below. Other features and advantages of the invention will be apparent from the description and drawings, and from the claims.
Like reference symbols in the various drawings indicate like elements.
Overview
The various embodiments of the invention described herein pertain to a graphical editor for editing expression languages. In particular, the various embodiments of the invention relate to a graphical editor, which displays an edited expression as a recursive list of graphical elements, as will be described in further detail below. Each element of a rule is represented by a geometric shape. This allows the user to enter a high level definition of the element, similar to a conventional editor, and to specify implementation details for the element by adding other elements that are specified the same way. Thus each graphical element can recursively contain other figures specifying sub-element of a current element of an expression.
When defining a new expression, the user typically starts with a root element that represents the whole expression. The user can provide a high-level definition of the expression in plain text in this element. When the implementation details of the expression are defined, the user can add sub-elements which are represented as other graphical elements contained in the root figure. Each sub-element behaves like the root, that is, the user can enter high-level definitions and specify them further by adding further sub-elements. The list and order of the sub-elements, which can be added to an element, is controlled by the element itself. As a result, the user can only add elements that are allowed in the current context. In various embodiments of the invention, the layout of the graphical elements is updated automatically when the complexity of an expression increases, in order to offer the best view of the whole expression as a whole.
As was discussed above, the various embodiments of the graphical editor allow different levels of definition of the rule. For example, in a first pass, a user can provide a high-level definition of the overall expression only, and leave the implementation details for another user, or for a later time. In a second pass, the other user may describe the expression further by editing the overall implementation of the expression, but the other user can also describe some sub-elements of the expression at a high level only and leave the exact implementation details (for example an exact binding) for later. It can be up to the user to decide to what extent the rules should be specified.
In contrast to existing expression language editors, the various embodiments of this graphical editor allow editing of expressions irrespective of the complexity of the expression. The graphical editor works in a constrained way and prevents users from entering syntax errors. The graphical editor is intuitive to use and can be easily understood both by business users entering high level definitions, and for skilled programmers familiar with the syntax. Furthermore, the graphical editor allows a top down editing approach.
Various embodiments of the invention will now be described by way of example, and with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions can be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
These computer program instructions can also be stored in a computer-readable medium that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable medium produce an article of manufacture including instruction means which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer program instructions can also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide processes for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
The user then presses ‘enter’ indicating that he has finished with the high level definition and wants to move on to the details. This is further illustrated in
The input caret (306) in the detail areas allows the user to select blocks of logic to add to the area. When the user presses enter, a list (400) of possible blocks that can be added at that particular spot is displayed, as illustrated in
In addition to typing expressions, the user can be presented with a ‘palette’ (600) of logic blocks that the user can add to the rule he is creating, as shown in
Next, the user continues to fill in high-level definitions and details of the implementation, as shown in
Adding an operator with the same precedence level appends the operator (1300) to the end of a selected rule logic block (1302).
Adding an operator with a different precedence level creates a new nested rule block.
A rule block can be reorganized at the same depth by selecting the block, as shown in
A rule block can be reorganized at different depths by selecting the block, as shown in
It should be noted that since the definition of the rule is being fundamentally changed here, the nesting will be automatically changed, where appropriate. In this example, the user wants to ‘or’ to two blocks, which will then push an ‘and’ block (which is at a different precedence level) to a different nesting depth. Just as was discussed above, the same operation can be accomplished with a drag-and-drop operation, that is, the block is selected, cut, and then dragged to where the user would like to place the block, as shown in
A “return type,” such as a Boolean, string, number, and so on, which is used for completeness and validation.
A “depth,” which indicates how far down the hierarchical tree the expression is located.
One or more “children,” which is a collection of the children of the expression. The subclasses of the expression element (3004) determine how many (if any) children the element has. The child count is fixed by the subclass, and children can never be added or moved, but can be refined.
A “Parent,” which is the parent element in the expression tree.
An “IsComplete( )” function, which determines whether the expression element (3004) is complete or not, based on whether there are any placeholder children, if the children return types are compatible, and so on. It is up to the subclass to do its own computations as to whether the children's return types are incompatible.
A “Refine( )” function, which makes it possible to swap a current expression element (3004) with a new expression. As will be discussed in further detail below, an expression typically starts out with a sole Placeholder Expression. The user then chooses an actual expression to use, such as an “if-then” expression, an “and” expression, an “or” expression, and so on, which changes the placeholder element into an actual expression.
As mentioned, children of expressions are never added or removed, they are only ever refined and in doing so additional sub-expressions get added/removed.
It should be noted that in this implementation children are never added or removed to specific expressions, that is, children will never be added to or removed from an if-then expression. The expression will only ever have two children. Thus, refining a placeholder to an if-then expression always results in two additional sub-expressions being added.
The flowcharts and block diagrams in the figures referred to above illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the invention. In this regard, each block in the flowchart or block diagrams can represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block can occur out of the order noted in the figures. For example, two blocks shown in succession can, in fact, be executed substantially concurrently, or the blocks can sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
As will be appreciated by one skilled in the art, various embodiments of the invention can include a system, method or computer program product. Accordingly, the invention can take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, the invention can take the form of a computer program product embodied in any tangible medium of expression having computer-usable program code embodied in the medium.
Any combination of one or more computer usable or computer readable medium(s) can be used. The computer-usable or computer-readable medium can be, for example but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium. More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a transmission media such as those supporting the Internet or an intranet, or a magnetic storage device. Note that the computer-usable or computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via, for instance, optical scanning of the paper or other medium, then compiled, interpreted, or otherwise processed in a suitable manner, if necessary, and then stored in a computer memory. In the context of this document, a computer-usable or computer-readable medium can be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. The computer-usable medium can include a propagated data signal with the computer-usable program code embodied therewith, either in baseband or as part of a carrier wave. The computer usable program code can be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, and so on.
Computer program code for carrying out operations of the invention can be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code can execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer can be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection can be made to an external computer (for example, through the Internet using an Internet Service Provider).
A number of implementations of the invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. For example, the visual representations can vary compared to what was described above, such as using different shapes or colors. The size and shape of various icons and graphical user interface elements can also vary. The means of user interaction with the graphical editor can also vary, such as using drag-and-drop operations, shortcuts, pop-up menus, and so on, as is familiar to those of ordinary skill in the art. Accordingly, other embodiments are within the scope of the following claims.
Number | Name | Date | Kind |
---|---|---|---|
5187788 | Marmelstein | Feb 1993 | A |
5267346 | Maruyama | Nov 1993 | A |
5351200 | Impink, Jr. | Sep 1994 | A |
5504853 | Schuur et al. | Apr 1996 | A |
5546507 | Staub | Aug 1996 | A |
5553304 | Lipner et al. | Sep 1996 | A |
5603021 | Spencer et al. | Feb 1997 | A |
5652899 | Mays et al. | Jul 1997 | A |
5701400 | Amado | Dec 1997 | A |
5706205 | Masuda et al. | Jan 1998 | A |
5790863 | Simonyi | Aug 1998 | A |
5828318 | Cesar | Oct 1998 | A |
5913215 | Rubinstein | Jun 1999 | A |
5929423 | Boersma | Jul 1999 | A |
6026233 | Shulman et al. | Feb 2000 | A |
6083278 | Olson et al. | Jul 2000 | A |
6171109 | Ohsuga | Jan 2001 | B1 |
6233545 | Datig | May 2001 | B1 |
6243614 | Anderson | Jun 2001 | B1 |
6243857 | Logan et al. | Jun 2001 | B1 |
6243859 | Chen-Kuang | Jun 2001 | B1 |
6269475 | Farrell et al. | Jul 2001 | B1 |
6353452 | Hamada et al. | Mar 2002 | B1 |
6366300 | Ohara et al. | Apr 2002 | B1 |
6408430 | Gunter et al. | Jun 2002 | B2 |
6441835 | Pazel | Aug 2002 | B1 |
6489970 | Pazel | Dec 2002 | B1 |
6558431 | Lynch et al. | May 2003 | B1 |
6683624 | Pazel et al. | Jan 2004 | B1 |
6757889 | Ito | Jun 2004 | B1 |
6836878 | Cuomo et al. | Dec 2004 | B1 |
6889113 | Tasker et al. | May 2005 | B2 |
6973649 | Pazel | Dec 2005 | B1 |
7024631 | Hudson et al. | Apr 2006 | B1 |
7178112 | Ciolfi et al. | Feb 2007 | B1 |
7299419 | Evans | Nov 2007 | B2 |
7975233 | Macklem et al. | Jul 2011 | B2 |
20030035009 | Kodosky et al. | Feb 2003 | A1 |
20030061600 | Bates et al. | Mar 2003 | A1 |
20040006765 | Goldman | Jan 2004 | A1 |
20050060129 | Mosterman et al. | Mar 2005 | A1 |
20050086638 | Farn | Apr 2005 | A1 |
20050107998 | McLernon et al. | May 2005 | A1 |
20050108682 | Piehler et al. | May 2005 | A1 |
20050114318 | Dettinger et al. | May 2005 | A1 |
20050216248 | Ciolfi et al. | Sep 2005 | A1 |
20050278286 | Djugash et al. | Dec 2005 | A1 |
20050289506 | Renner | Dec 2005 | A1 |
20060064670 | Linebarger et al. | Mar 2006 | A1 |
20070006145 | Hill et al. | Jan 2007 | A1 |
20070027845 | Dettinger et al. | Feb 2007 | A1 |
20080022259 | MacKlem et al. | Jan 2008 | A1 |
20080022264 | Macklem et al. | Jan 2008 | A1 |
20080082961 | Adams et al. | Apr 2008 | A1 |
20080263512 | Dellas et al. | Oct 2008 | A1 |
20080263513 | Neumann et al. | Oct 2008 | A1 |
20080263515 | Dellas et al. | Oct 2008 | A1 |
20080270983 | Ahadian et al. | Oct 2008 | A1 |
20080295085 | Rachamadugu et al. | Nov 2008 | A1 |
20090172636 | Griffith et al. | Jul 2009 | A1 |
Entry |
---|
“Shape” , Wikipedia , Feb. 24, 2013 , <http://en.wikipedia.org/wiki/Shape> , pp. 1-4. |
Mel Tsai et al. , “EE411 Laboratory Exercise #1 using Xilinx Foundation v2.1i” , UCSC , Feb. 1, 2001 , <http://classes.soe.ucsc.edu/cmpe100/Fall02/Labs/EE411%201.htm>, pp. 1-28. |
Wei Su et al. , “MathEdit, A Browser-based Visual Mathematics Expression Editor” , Chicago Kent University , 2006 , <http://wme.cs.kent.edu/mathedit/mathedit—one.pdf> , pp. 1-10. |
Matti Karjalainen et al. , “Block Diagram Compilation and Grapghical Editing of DSP Algorithms in the QuickSig System” , IEEE , 1988 , <http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=15107> , pp. 1-4. |
Claudia-Lavinia Ignat et al. , “Grouping in Collaborative Graphical Editors” , ACM , 2004 , <http://delivery.acm.org/10.1145/1040000/1031682/p447-ignat.pdf> , pp. 1-10. |
USPTO Office Action. U.S. Appl. No. 13/403,194. Notification date: Apr. 3, 2013. |
USPTO Office Action. U.S. Appl. No. 13/403,194, dated Jan. 30, 2014. |
USPTO Office Action. U.S. Appl. No. 13/403,194, dated Jul. 24, 2014. |
USPTO Office Action. U.S. Appl. No. 13/403,194, dated May 15, 2015. |
USPTO Office Action. U.S. Appl. No. 13/403,194, dated Aug. 28, 2015. |
USPTO Office Action. U.S. Appl. No. 13/403,194, dated Jul. 22, 2013. |
Number | Date | Country | |
---|---|---|---|
20100162210 A1 | Jun 2010 | US |