The invention concerns a control device for an information display including
In addition, the invention also concerns a corresponding system, method and program product for implementation of the invention.
Distribution of information between participants in negotiations in a negotiation room is usually done by prepared overhead slide presentations (for example, PowerPoint). It is also possible to use special smart white boards, provided that the negotiation premises are equipped with such.
In certain applications, especially in the case of digital white boards, visual information can be efficiently manipulated on an information surface, on which the information is presented by using an information apparatus. However, since digital white board systems are often quite cost-intensive, their use is limited. For this reason, every negotiation room is not necessarily provided with those.
If the negotiating parties wish to distribute one another something during the negotiation, for example, electronic additional information brought about during the negotiation, this is difficult to do. According to the state of the art, distribution of information can usually be done only afterwards. The matter may hereby be dealt with in such a way, for example, that the concerned information is sent to each negotiating party hungry for information, for example, by electronic mail. If it is desired to distribute electronic information already in the negotiation situation, then each interested person must connect his device (usually a laptop computer) to the presentation apparatus system in order to transfer the information to his own device. A corresponding connection must also be made for transferring information from the personal device to the presentation apparatus for visualisation or distribution of information.
Yet another drawback in state-of-the-art information systems is that in these only one participant at a time can control the information apparatus. Hereby the other participants in the idea generation or brainstorming team become passive and can at the most just comment on the matter visualised by the presenter on the presentation apparatus.
The state of the art as regards the presentation equipment of negotiation rooms is represented by separate devices, which make possible either just visualisation of information (for example, a data projector connected to a computer) or, in addition, also editing of the information and creation of new information. Such more sophisticated devices are, for example, various integrated touch screens/pen-operated digital white boards, which are provided with embedded softwares. One example of this is the SMART Board system. Another example is the ANOTO-Digital pen and yet a third example is Ebeam. In Ebeam, a sensor is connected to a corner of a manual white board. There the cap of a container emits a signal, which is measured by the sensor. However, the device needs calibrating when put in a new place.
An example of a solution of the SMART Board type is known from the international PCT patent publication WO 02/03316 A1. Therein the corners of the information surface include cameras. The cameras are used to form image information of the information surface. From the image information, the location of the control device touching the information surface can be recognised at each time. The desired action is directed at the point of contact of the control device.
However, such digital apparatuses of the white board type are expensive special devices. This limits the possibilities of arranging them in each negotiation premises. In addition, controlling them is normally possible only by special pens or by equivalent input devices. In addition, they have been limited to serve only couple of persons at a time in the presenting/editing of information, and the persons must also be in the immediate vicinity of the white board.
One more prior art is known in which mobile camera device and control information arranged on information surface is applied. However, this “SpotCode” concept [1] isn't suitable, for example, for meeting room implementations but rather for discrete presenting systems in which the location of the controlled objects are fixed to a particular location of the information surface. Also, problems are also caused by the fact that it is difficult to determine the current orientation of the mobile camera device owing to which it is difficult to operate, for example, in a 3D space. In the SpotCode concept the rotation of the device in front of the canvas can be detected only around one axis.
The purpose of this invention is to provide a manner of controlling an information device arrangement and more particular an information display. With the invention a manner is achieved of controlling an existing presentation, of editing and/or creating new information and/or of distributing it to other users, which manner is effortless from the viewpoint of one end user or even more end users. The characteristic features of the control device according to the invention are presented in the appended claim 1, while the characteristic features of the method are presented in claim 27. The invention also concerns a corresponding system and program product, whose characteristic features are presented in the appended claims 12 and 33.
In the invention, for example, an electronic device equipped with video sensor and possible display may be surprisingly used as the control device for the information display. In the invention subsequent data transmission between control device and information display is dependent upon the origin of the received image information as determined by the control information. In consequence of this, the control device can be used for operating the information display in a surprising manner without contact. Such mobile station equipment can be mentioned as an example of a control device, for which the necessary functionalities can be arranged to control the information display in the manner according to the invention.
According to one embodiment with the intelligible device equipped with a video sensor the destined location can be pointed directly or indirectly at the information display without any contact with the information display, and the place of the concerned location can be determined. The place of the targeted location can be determined from the image information received by the video sensor. When the location has been determined, the desired actions can be directed at the concerned location. The location may be selected freely.
Such actions can be understood broadly in the context of the invention. Besides the actions for controlling and creating information, which are known from the traditional digital presentation devices, information can also be transferred from the information apparatus to the control device, for example, from this pointed location on the surface. It is also possible to transfer information from the control device towards the information apparatus, for example, to a location targeted on the information display. The invention in fact offers numerous alternative actions, which may be understood as subsequent data transmission between control device and information display, and none of them are mutually exclusive within the scope of the basic inventive idea.
In order to determine, for example, the location on the information surface, which is pointed by the device equipped with a video sensor, special control information can be presented on this same or even on some other information display or surface. The control information may form a refined and controllable, even dynamic arrangement. It will not essentially cause any harm to the interpretability of the actual information to be presented by the presentation means, if the information to be presented and the control information are presented, for example, on the same information display. The control information is arranged on the information display in such a manner that it permits the controlling of the whole wide information surface area without limiting to some special location or even the area outside of the information display.
According to one embodiment, based on the continuous streaming of image information received by the control device, this device may be used to determine data concerning the place and size of the control information in the image information. This data or the data calculated and determined from it in an established manner may be transmitted at least partly, for example, together with the size of the image information to the information apparatus for determination of the pointed location on the information display. It is also possible to transmit just the image data to the information apparatus. Using the information apparatus it is possible to carry out a final determination of the operated location. With such a determination, which is a two-step one in a certain sense, is achieved decentralisation of the functionality to the control device and, on the other hand, also a more accurate determination of the operation location.
The control device and the information apparatus may communicate with one another, for example, by applying a wireless local area network. Hereby setting information relating to the connection settings between them can also be presented on the information display. From the setting information it is possible to decode connection setting data for carrying out communication between the devices, and they are easily recognised from the image information formed by the video sensor. In this manner such an advantage is achieved, among others, that the user need not now set connection setting data in his device, for example, manually, which would in fact be difficult and troublesome. This also speeds up start-up of the system and the connection to it and, in addition, makes also possible to control the presentation by a plurality of control devices.
The invention allows several applications, with which numerous advantages are achieved in terms of information control, presentation, creation, editing, distribution and receiving. Besides the traditional 2D control, 3D control of the presented information is now also possible.
The invention relates also to a concept for using at least one video sensor in accurate 3D-positioning, with help of data patterns projected or printed on a surface or drawn to a display.
Other features characterizing the invention emerge from the appended claims, and more advantages, which can be achieved, are listed in the description.
The invention, which is not limited to the embodiments to be presented in the following, will now be presented in greater detail by referring to the appended figures, wherein
a shows a simplified flow chart of the method according to the invention,
b shows a flow chart of an example of the method according to the invention,
a-14b show some application examples of the head-mounted control devices,
The invention relates to the use of one or more devices 10, 10.1-10.8 equipped with at least one camera 52, more generally speaking a video sensor for receiving image information, and with the necessary functionalities for possible presenting and/or forming information INFO, more generally for its control. Intervals in the performance of the invention are presented in suitable places of the description referring to the flow charts shown in
Generally speaking and referring to the all
The invention provides several embodiments. First possible scenario may be, for example, that the user points by the control device 10 at a surface 13 of the information device 11-13. Dependent upon the location pointed at the user may download data from the information device 11-13 to the control device 10. The downloaded data may be, for example, a segment of a ppt slide or the whole ppt slide. Second possible scenario is, for example, that the user points by the control device 10 at a surface 13 of the information device 11-13. Dependent upon the location pointed at (by using the control information 15, 16 formed from the image data 50) user could may transmit data from the control device 10 to the information display 13. Third possible scenario is that the user points by the control device 10 at a surface 13 of the information device 11-13. Dependent upon the location pointed at, user may now re-arrange data on the surface 13 of the information device 11-13. In every embodiment is performed subsequent data transmission between the control device 10 and the information display 13. The location pointed at does not necessarily have to be within the boundary of the information display 13 but that may be somewhere else exterior to that.
Next term “origin” is defined in the context of the invention. Each point of image information 50 may correspond to a specific point of information display 13. A point of the information display 13 (or outside the information display 13, in same plane) may be called an “origin” or a “source point” of the point in image information 50.
Owing to this the “origin” means the mapping between the coordinates in the bitmap image captured by the camera 51 and the coordinates on the plane of the information display 13. So, for each point in the bitmap image, it is possible to find an origin point (i.e. a source point, or “originating point”) on the information display 13. In connection with the invention the origin of the coordinate system means the fixed point (0, 0, 0) and which meaning is generally known from the coordinate systems.
The system according to the invention comprises as its major parts at least one server 11, at least one presentation device 12 connected to the server 11 for data communication and at least one control device 10. Together, the server 11 and the presentation device 12 may also form an information device arrangement 11, 12 or they may also be separate from each other. The control device 10 and the server 11 may also be a complete solid device unit, such as, a “Smartphone” and in that embodiment there may be only an external display or projector unit 12 that is instructed by the “Smartphone”.
Instead of cost-intensive systems of the SMART board type, the invention makes it possible to use, for example, standard office equipment. One application example is an information apparatus 11 in general use and known as such and equipped with a processor, which apparatus 11 can be connected to the presentation device 12.
The server may be, for example, a computer apparatus or equivalent, such as, for example, a PC device 11. The presentation device may be, for example, a data projector 12 (video projector) or only a simple display unit. Besides the presentation device 12, the system may also include at least one projection screen 12′, wherein there is at least one information surface 13. On the information surface 13 it is possible to present at least information INFO or a reference FILE to such, which is visualised on it from the data projector 12 controlled by the server 11. The whole wide area of the information surface 13 may be used in these purposes.
On the other hand, the server 11, the presentation device 12 and the information surface 13 may also form one compact entity. The server functionality may hereby be integrated even in the presentation device 12 itself, or vice versa. The presentation device 12 may also be, for example, a display of the LCD type. Hereby it already in itself has an information surface 13 for presenting at least information INFO. According to one more embodiment, the server may even simply be a piece of mobile equipment, which can be connected to an external display or data projector. There are numerous alternatives, and of course they are not in any way limited by the basic principle of the invention. Thus, in general, it is possible to speak about “information display” independently of the number of the units 11-13 needed to implement such a display.
Owing to the camera 52 belonging to the device 10 it is possible surprisingly to carry out controlling of the information apparatus 11 by operating without contact the destined location 21 of information surface 13 by the control device 10. Hereby the device 10 or rather its camera 52 is aimed at the information surface 13 from a distance and at the desired location 21 therein. The place of the targeted location 21 in the image information 50 may be arranged in an established manner, such as, for example, in its centre.
In the context of the invention, operating of the information surface 13 can be understood as aiming at it or pointing it with the camera 52 of the control device 10. The aiming or pointing may be directed to a particular point that may be in any location on the information surface 13. Thus, the invention allows the pointing of the particular location that may be select freely from the information surface 13. From the image information 50 formed from the information surface 13 by the camera 52 by using of which the information surface 13 is continuously and directly pointed, it is possible to determine continuously a continuous stream of locations 21 on the information surface 13 pointed by the control device 10 in order to carry out control actions i.e. operations directed to the information apparatus 11. Generally speaking, this may also be called the subsequent data transmission performed between control device 10, 10.1-10.8 and information display 13, 13′. This makes control of the presentation effortless even from distances from the presentation surface 13.
Continuous pointing procedure of the information surface 13 helps tracking the movement of the location 21 of pointing in order to control the particular point of the whole wide information surface 13. This continuous pointing enables interactions such as “Drag” or “point and click” or “place”. In this sense placing gets more accurate if user can move (“drag”) the object (e.g. a photo) “live” on screen 13 instead of just “aim & shoot” (single opportunity). If the user mis-aims in “aim & shoot” case, this causes that the user needs to re-aim and re-shoot. This is not at all user-friendly.
In addition, the control device 10 includes processor 51. These are used to control and attend to the functions of device 10 in a manner known as such. Besides the basic functions of device 10 (for example, mobile telecommunication), the processor 51 may also be used for determining from the image information 50 formed by the camera 52 at least partly the location 21 on the information surface 13 pointed at each time by the control device 10 as well as an action to be directed at the location 21 (
Besides the processor 51 attending to the functions of the control device 10, the mobile equipment 10 or the control device 10 in general is also equipped with suitable communication protocol means 22. Corresponding data transmission unit 24 also belong to the equipment of the server 11 shown in the block diagram of
According to one embodiment, the data transmission unit 22, 24 may be based on wireless local area network means. One example of such is the Bluetooth communication protocol BT. Another could be WLAN (Wireless Local Area Network). Of course, also other protocols capable of local data transmission are possible.
According to one embodiment, the implementation according to the invention may be a program implementation or a suitable combination of program and hardware level functions. In the following program products 30.1, 30.2 are described by way of example, which are arranged for the mobile equipment 10 and for the server 11 to be run in these.
Firstly, the server 11 shown in
Besides presentation software of a known kind (which could be, for example, MS PowerPoint 34) or integrated together with it, the server 11, in general, a computer (or a piece of software) is here provided with one or more code means 32.3, 32.4, with which the data projector 12 can be commanded to generate and present encoded screen surface locations 15, 16.x on the information surface 13. By utilizing of these it is possible to determine the particular location 21 on the information surface 13 chosen by the control device 10 (step 901′). An example of such is shown in
More generally, it is possible in the context of the invention to speak of control information 15, 16.x to be generated and presented on the information surface 13 that covers the information surface 13 mainly but, however, sparsely without causing considerably inconvenience when performing controlling measures and viewing of the information INFO on that. The control information 15, 16.x can be merged by the information apparatus 11 with the information INFO visualised on the information surface 13 (code means 32.8). Hereby the control information 15, 16.x and the information INFO visualised by the information apparatus 11 is visualised on a common and same information surface 13 (code means 32.9). In such arrangement the information INFO presented on surface 13 may be on top of the control information 16.1, 16.2 (
On the other hand, the control information 15, 16 may be brought about on the information surface 13 also by some other device than the server 11 proper, with which a view of the presentation software 34 is brought about. The control information 15, 16.x may also be presented on some other information surface (not shown). This may be a different information surface than the one, on which the server's 11 information INFO is presented.
According to one embodiment, the screen surface locations 15, 16.x can be visually detectable control information 15, 16.x. According to another embodiment, they may also be of a type undetectable by the human eye or very weakly detectable (code means 32.11). The control information 15, 16.x may be formed by visually encoded control elements 15 the detection of which are based on the orientation (code means 32.4). They may form control element groups 16.1, 16.2 on the information surface 13. On the information surface 13 the control element groups 16.1, 16.2 may form a control pattern entity 14 arranged in an established manner. From this control information 15, 16.x arranged in the known locations of the information surface 13 it is possible to determine a continuous stream of locations 21 of the information surface 13 operated and pointed by the control device 10 when performing controlling of the information INFO and presentation software 34.
Next, reference is made to
The maximum distance between glyph groups and the size of glyph groups 16.1, 16.2 depend on 1) the area the camera 52 covers and 2) the resolution of the camera images 50 (camera properties). The area the camera 52 covers is related to the desired distance between the camera 52 and the information surface 13. The resolution of the camera images 50 and the size of glyph groups 16.1, 16.2 are related in a way that the glyph groups 16.1, 16.2 must be detectable and recognizable in the image 50 captured by the camera 52. In the embodiment implementation with low resolution camera images (160 by 120 pixels), the glyph groups 16.1, 16.2 are spanned in a way that enables pointing in the full wide information surface area 13 but the glyph groups 16.1, 16.2 take only 3.5% of the visible area. The suitable distance between camera 52 and surface 13 may be in this case when applying stream of low resolution images of mobile phone camera 52, for example, 0.5-5 m (even 10 m and even over). For example, by utilizing the digital zoom it is also possible to increase the distance. Also, the dimensions of the screen 13 affect to this distance feature. One example to estimate the distance (to be able to point location or target somewhat steady) would be around 0.3-2 times the width of the information surface 13, at least with described setup of glyphs 15, 16.1, 16.2.
In general, the covering portion which glyph groups 16.1, 16.2 take from the information surface 13 may be, for example, 1-30%. However, the bigger cover may be annoying at least if using red glyphs or other such color, for example. On the other hand, it is possible to use hiding techniques (described later in connection with
According to one experimental embodiment the glyph's 15 size may be 16×16 pixels and the gap between two glyphs 15 may be 8 pixels. If 75% of each glyph's 15 area is painted with a signal color, this version would paint half (coverage=0.75*(16/(16+8))=0.5) of the pixels of the screen 13 with signal color(s) (if the whole screen area 13 is covered with glyphs 15 by using this gap setup). This kind of arrangement may be applicable if the signal color is chosen dynamically by using a hiding technique, such as the one presented in
In this embodiment presented more detailed refGrid of size 240×192 pixels contains one glyph group of 40×40 pixels, thus area cover is (40*40)/(240*192)=0.03472 (this contains only the 2×2 glyph matrixes, not the connection pattern 17).
The glyph group assembly 14 may cover the whole large information surface 13, for example, divided at regular intervals. Each code conglomeration 16.1, 16.2 can be set to correspond to a certain location on the information surface 13. From each code group 16.1, 16.2 assisting the recognition of pointing it is hereby possible to determine very accurately that location 21 exactly on the information surface 13, at which the control device 10 is especially pointing/aiming and which is intended as the target for the desired action. This makes possible the free selection of the pointing location on the surface 13 and the whole wide area of the surface 13 may be utilized. This is important, for example, in the meeting room—embodiments in which the system according to the invention is used, for example, to create new information INFO to a precise freely selected location or to manipulate the existing information being already on the surface 13 in a sporadic location.
The server 11 may be used for setting as control elements on the information surface 13 also setting information 17 relating to connection settings 17′ (code means 32.5, step 901′). In general, the system may be configured so, that each or some of the datamatrices 17 contain connectivity details 17′ to the computing device 11 that is generating the display 12′ related to the datamatrices 17. Owing to this the system can be used for direct connectivity and direct manipulation of data displayed on screens of multiple devices. These features are important when several participants equipped with devices 10, 10.1-10.2 according to the invention want take part of the brainstorm.
If there are arranged multiple display surfaces related to multiple servers the switching between servers is arranged to be seamless. In this kind of embodiment the phone 10 may be first connected to server S(A) generating display D(A). Then, the user may want to point to display D(B) that is connected to another server S(B). The system (on the user's device 10, for example) may automatically decode the connection information 17′ from the data on display D(B) and connect automatically to this particular server S(B). Owing to this feature the pointing events are sent directly to server S(B) instead of server S(A). Thus, the connection details 17′ may also be determined dynamically from the continuous stream of connection detail information 17. Only pointing of the new surface equipped with the connection details 17 and imaging that with camera 52 is enough to take that control of this particular surface and possible server connected to it. This is very user-friendly.
On the information surface 13 datamatrix 17 relating to the connection setting information may form an assembly 17 formed by glyphs of a corresponding type. Matters relating, for example, to the session protocol and settings between the server 11 and the control device 10 may be decoded from it. Some examples of these are the Bluetooth address and port 17′ used in communication. Based on these it is possible in the mobile equipment 10, with which the information surface 13 is continuously pointed at and the camera 52 of which is used to form image information 50 of it (step 902), to conclude these connection settings, according to which a data transmission session is then set up with the server 11 (code means 31.5, step 903).
If the glyphs 15 are formed on the information surface 13 within a wavelength range visible to the human eye, they may be faded, for example, by a user at least partly, if they are not needed and/or if they interfere in some way with the presentation of information 18-20 to be performed on the surface 13.
A control element 15, 16.x fading function is hereby set on from a device providing the control elements 15, 16.x on the information surface 13, such as, for example, from server 11 (code means 32.10). In consequence of this, the elements 15, 16.x are no longer projected on the presentation surface 13 or at least not on a part of it, but only the information 18-20 to be presented by the server 11 is projected on it.
It should be noted that the control elements 15, 16.x may according to one embodiment also be of a non-visual type. The data matrixes 16.x, 17 may hereby be visualised on the information surface 13 by using, for example, such a suitable wavelength range, which is more or less or even entirely invisible to the human eye control elements (code means 32.11). However, from the image information 50 formed by the mobile equipment 10 it is possible for the equipment 10 still to recognise a whole glyph group 16.1 at each time in the camera 52 and from this to determine the location 21 or session protocol data 17′ pointed by the device 10 on the screen surface 13.
One example of such non-visual control elements 15, 16 are infrared tags invisible to the human eye, which can still however be detected by the camera 52. The presentation apparatus 12 may hereby need additional equipment, with which the infrared tags are brought about on the presentation surface 13. In addition, such an embodiment may also require calibration actions in order to align the tags in position on the screen surface 13, if a device separate from the presentation apparatus 12 proper is used to bring about the tags on the presentation surface 13.
The server 11 may also have a programme module or an equivalent functionality (code means 32.1), with which the stream of locations 21 of information surface 13 pointed by mobile equipment 10 can be determined (step 909′). The location 21 or their stream can be determined with server 11 at least partly by utilising the data x1, y1, hei, wid, gx, gy concerning the control information 15, 16.x received from control device 10 in step 908′ and determined by this from the image information 50. Hereby at least a part of the functionality concerning determination actions can also be arranged in the mobile equipment 10 in a surprising manner (code means 31.1, 31.4, 31.7).
Several advantages are achieved by such a decentralised determination arranged between the server 11 and the control devices 10, 10.1, 10.2. Firstly, it allows a reduction of the processing to be performed by the server 11, whereby it can serve more control devices 10, 10.1, 10.2 efficiently. On the other hand, by a decentralised implementation of this kind an advantage is also achieved in the accuracy of determination of the targeted location 21 or the stream of locations that is achieved by moving the control device 10, 10.1, 10.2 during control operation. If the control device 10 sends only information of which glyph groups 16.1, 16.2 are visible (determined by using of glyph data 23 arranged in the device 10, 10.1, 10.2), but not the location and size of the glyph groups 16.1, 16.2 in the viewfinder VF, the server 11 wouldn't be able to accurately calculate the location 21 or the stream of that which the user is pointing to. Now, with the determination performed also by the control device 10, where it is based on the image information 50 formed by a camera 52 having known characteristics, advantage is obtained from this small-scale determination. The decoded glyph data gx, gy and/or the data x1, y1, hei, wid produced by this determination can then be transmitted to the server 11 for final calculation of the actual operation screen location 21 (code means 31.3, 31.7). Server 11 does lookup in its array (i.e. array_generated) and continues the processing to find out the actual screen location 21. One way of implementing the determination will be considered later in this description. By transmitting to the server 11 only the decoded glyph data gx, gy and the data x1, y1, hei, wid determined continuously from the image 50 by the control device 10 is avoided the transmission of the huge amount of entire image data, which would slow the system operation considerably.
According to another embodiment the client device 10 may send the actual cropped bitmap image inside the box ((x1,y1) (wid,hei)) and the detection is performed on the server 11. However, this makes much more traffic and the client device 10 needs to “detect” the glyphs 16.1, 16.2 even if not going to the recognition process (i.e. find where the glyphs 16.1, 16.2 are in the image 50).
Yet, according to one embodiment the client device 10 may have received the array mapping glyph data 23 to locations (array_generated) and performs calculation by itself (similar with the code means 32.1 and 32.7 of server) and sends the result on screen coordinates (x,y) and relative distance to the server 11. Thus, there may be even several alternatives to perform this procedure.
The camera phone 10 can also be used for running program 31 of the kind already described in part in the foregoing and which allows an implementation according to the invention. In this program 31 there may be a programme module or an equivalent functionality, which is used for decoding by processor unit 51 the visually encoded information 15, 16.x, 17 from the image information 50 formed by camera 52 from information surface 13 concerning, for example, session details 17′ and location glyphs 15, 16.x, which are presented on the information surface 13. From the control information 15, 16.x it is possible to determine the precise location 21 of information surface 13 at each time operated by the control device 10. From the setting information 17 it is possible to form connection setting data 17′ to carry out communication between the information apparatus 11 and the control device 10 (steps 902, 903). In addition, the camera phone 10 has programme modules for connecting the data transmission session, for carrying out data transmission towards the server 11 and back from the server 11, including besides the above-mentioned data x1, y1, hei, wid, gx, gy concerning image information 50 or determined from this, also for example uploading application data, such as, for example, files FILE to server 11 and downloading application data, such as, for example, files FILE from server 11 to the mobile equipment 10 (code means 31.6). In general, the application data may be transferred from the mobile phone 10 to that particular anchored location pointed by the phone 10 and to receive from a specific location of the information surface 13 that is pointed by the phone 10. This embodiment provides a visual representation of the application specific data by using wide information surface 13 instead of small displays VF of mobile devices 10. There may also be an equivalent module in the information apparatus 11 (code means 32.6).
As was already mentioned above, it is possible in the invention to present small visually encoded elements, “glyphs” 15, on the information surface 13. With the glyphs 15 or from their location in the image information 50 formed by the camera 52 it is possible to determine that location or the stream of locations of the information surface 13, which is pointed at with the control device's 10 camera 52. With the camera 52 in the control device 10 a constant image flow is formed in an established manner in order to calculate the current information surface location 21 continuously pointed by the camera 52 (step 905).
Successive image frames 50 can be formed, for example, with a Nokia 6600 phone using about ten image frames per a second as the imaging frequency. Even at a lower imaging frequency compared with a conventional imaging frequency used, for example, in video or viewfinder imaging (for example, 15-30 image frames/second) it is possible to cope owing to the invention. When using a lower imaging frequency, the control device's 10 power consumption is reduced.
When wishing to take part with the control device 10 in the information INFO presented on the information surface 13, a connection is set up between the mobile equipment 10 and the server 11. Hereby the control device 10 is used for imaging special element patterns (glyph patterns) presented at one location of the information surface 13 (step 902). These are used for coding the details of the connection, such as, for example, a Bluetooth address and a server port 17′. The occurrence of a glyph group 17 comprising connection settings 17′ in the image information 50 is easily noted with the mobile equipment's 10 viewfinder VF. When the glyph group 17 is in the viewfinder VF in its entirety, this can be acknowledged in order to carry out the connection setting decoding. This control element group 17 is recognised from the formed image information 50 and the connection setting data 17′ is decoded from it in order to form communication connection between the information apparatus 11 and the control device 10 (step 903). Surely, the device 10 can also spontaneously recognise the concerned glyph group 17 and carry out decoding even without any confirmation from the user concerning the occurrence of a glyph group 17 on the viewfinder VF. When the connection has been once established there is no more need to take care of it, and the communication can proceed normally.
Right after creating the connection between the control device 10 and the server 11 the control device 10 may announce its identity. Owing to this everybody knows all the time who is operating the information on the surface 13 in each particular location.
After the data transmission connection between the control device 10 and the server 11 has been done as step 904, 904′, the device 10 equipped with a camera 52 may begin sampling the targeted information surface area 50, at which the control device 10 is pointed at each moment (step 905). The sampling procedure can be controlled by the user, whereby the camera 52 will not be imaging constantly.
The arrangement according to the invention can be enlarged to support more control devices 10, 10.1, 10.2 provided with a camera 52 in such a way that several representatives at a meeting can work at the same time on the screen 12′, distribute data FILE, INFO to one another and draw concept maps, for example. Each participant may download a copy of his own of all the contents INFO, FILE presented on the screen 12′ or just individual parts of the contents. Owing to this aspect the visually connected display 12′ also becomes part of the shared memory space, where the original device 11 hosting the display 12′ can provide access to other users to either keep a copy of the data shared or only view it during the visual interaction and knowledge creation.
The arrangement according to the invention can also be enlarged to support the user's gestures as an interaction method in the user interface. Gestures are generally known specific kind of interaction mechanisms, e.g. strokes, or in general, specific movement of the device 10 pointed to the screen 13 that is performed in this case with the mobile phone 10, 10.1, 10.2 and that is used for performing a previously defined action(s). One example of this is a gesture of moving the phone 10 away or towards the screen 12′ by using of which movement can be activate as a intention to copy from or paste to the visually interactive screen 12′ at/from a particular location. According to second example the user could “pull” objects from the screen 13 to download them to his device 10. The interaction may be as follows: 1) point to an object FILE on the surface 13; 2) press button on the mobile device 10; 3) drag the mobile device 10 backwards from the surface 13; 4) release button. Other examples of the gestures may be drag & drop, doubleclick, drag to “rubberband” a set of objects, etc. Also, the other 2D- or 3D-gestures are possible. In addition the current velocity of the phone 10 may also be determined in connection with the controlling. This may be inferred from the rate of change of the tags' 16.1, 16.2 positions and their sizes. Gestures may also be understood, in connection with the invention, as the operations performed by the control device.
A description now follows in somewhat greater detail and by way of example of the elements 15 presented on the screen 12′, from which that information surface 13 location is calculated, at which the mobile equipment 10 is pointed at each time. In the concerned embodiment, encoding is formed by L-shaped objects 15, which may also be called glyphs. The other shapes are also possible, like U-shape.
Now the glyphs 15 may have four different orientations. One L-shaped glyph 15 can encode two bits. For eight bits, four glyphs 15 are hereby required, which form a glyph group 16. Also, the number of control elements may vary and four is presented in this connection only as an example. In general, glyph groups may include a plurality of control elements. In the group 16.1 there may be two glyphs 15 to express the X coordinate and two glyphs 15 to express the Y coordinate. The glyphs 15 may hereby be arranged, for example, as a square assembly 16, 16.1, 16.2 shown in FIGS. 1 and 4-8. Each assembly 16, 16.1, 16.2 defines a screen location in the established manner. With the glyph groups 16, 16.1, 16.2 the screen surface 13 can be covered altogether more or less. The glyph groups 16, 16.1, 16.2 can hereby be presented as a matrix grid at regular intervals on the screen surface 13, where their distance, for example, in the horizontal and vertical directions is constant.
Each glyph group 16 in itself means a certain coordinate in the glyph group grid 14, which is provided on the screen 12′, for example, by projecting with a data projector 12. These glyph group coordinates can be named, for example, by variables gx, gy (
Referring to
The party rendering glyph groups 16, 16.1, 16.2 on the information surface 13, being a server 11 controlling a video projector 12, now knows at least some variables, which are used for determination of the targeted location 21. These variables may be called, for example, encoded rough large-scale variables. They form a rough main coordinate system, which corresponds to the entire large information surface 13 and the glyph groups 16, 16.1, 16.2 presented on it. These variables are determined, for example, at the stage where the glyph groups 16, 16.1, 16.2 are rendered on the whole wide screen part 12′ (step 901′).
The variables refGridHei and refGridWid shown in
Yet, a third variable also known by the server 11 is the height and width (sqSize) of each glyph group 16.1. In other words, sqSize means the length of the glyph group's 16.1 side, for example, between the top corner farthest to the left and the top corner farthest to the right and between the top corner farthest to the left and the bottom corner farthest to the left. In the application example shown in
Referring to
In addition, processor 51 can also recognise an action directed at the location 21 or a corresponding activation/acknowledgement command (code means 31.2), according to which the server 11 will then carry out the concerned action (code means 32.2). In its simplest form it is sufficient with a short or long push on a function keypad 35 of the device 10 in a manner known from mouse control devices. The action to be performed may depend, for example, on which tool is active at the time (is an object to be dragged or is an object to be drawn, for example).
The data x1, y1, hei, wid, gx, gy concerning the location 21 of information surface 13 operated by the control device 10 and the action protocol command directed at the concerned location 21 are then transferred to the information apparatus 11 by the control device's 10 data transmission unit 22. One should understand that performing continuous controlling in which, for example, some target object FILE is moved on the surface 13 (the aimed location of the camera 52 changes all the time), this imaging and location determining procedure is continuous. In the case of control device 10, the viewfinder image 50 can be understood as a kind of small-scale coordinate system, which merges into the large-scale glyph group coordinate system 14 rendered by the server 11 on the presentation surface 13.
The control device 10 itself may also be used for choosing functions. Hereby available function alternatives can be presented, for example, on the phone's 10 display VF. The phone's 10 keypad 35 may also be used for choosing functions. In this way it is possible to avoid, for example, the going-through of difficult menus in the presentation software 34.
The resolution of the display/camera 52 functioning as viewfinder VF is known in the camera device 10. The variables vf_wid and vf_hei can be set to correspond to this resolution value in the corresponding width and height directions. According to one embodiment, the value of variable vf_wid may be fixed to be, for example, 160, and the value of variable vf_hei may be fixed to be, for example, 120. Such values may be used, for example, in many cameras 52 provided with the Symbian operating system for a quick input.
In the case of described embodiment, the data determined by the control device 10 from the image information 50 includes information on the location x1, y1, x2, y2 and size hei, wid of the control information, that is, the glyph groups 16.1, 16.2 in the image information 50. In addition, the data also includes the decoded/recognized control information. That is the decoded byte value of the 2×2 glyph group (gx, gy), for example. This can be clarified with the code means 31.7, which can be a sub-module to code means 31.1. The determined information and the size vf_hei, vf_wid of the image information 50 can be sent to the information apparatus 11 in order to determine the location 21 of the information surface 13 operated by the control device 10 (code means 31.3). To get the size vf_hei, vf_wid of the image information 50 to server 11 one transmission from each device 10, 10.1, 10.2 is sufficient. From the viewfinder image 50 it is possible to determine the location and size of the glyph group 16.1 in the targeted image information 50. This can be understood as an action taking place in a small-scale coordinate system, wherein the exact place and zooming of the pointed location 21 are determined from the place x1, y1, x2, y2 and size hei, wid of the glyphs 15.
The location in viewfinder image 50 of the glyph group 16.1 (i.e. the VISIBLE datamatrices) detected and recognised from the image information 50 by the camera 52 and the computing device 51 attached to it in step 906 can be expressed as coordinates (x1, y1), (x2, y2). Here the coordinates (x1, y1) mean the location of the glyph group's 16.1 top left corner in the viewfinder image 50. Correspondingly, the coordinates (x2, y2) mean the location of the glyph group's 16.1 bottom right corner in the viewfinder image 50. The glyph group's 16.1 size, that is, zooming, can be expressed as height (hei) and width (wid). In the embodiment shown in
Besides the location and size of the glyph group 16.1 in the viewfinder image 50, the camera device 10 can be used for determination of the concerned glyph group's 16.1 location (gx, gy) in the glyph grid 14, that is, in the large-scale coordinate system presented by the presentation device 12. This is found out by recognising the assembly located in the glyph group square 16.1 and by searching the lookup table 23 arranged in device 10 for the coordinate corresponding with the concerned recognised assembly. The device 10 may hereby be provided with a memory MEM containing the lookup table 23 in question. By using data 23 in the lookup table decoding of the glyph group 16.1, 16.2 into coordinates (gx, gy) of the information surface 13 can be performed. Instead of the lookup table 23 located fixedly in the control device 10 it is also possible to use a lookup table, which is downloaded into the device 10 each time. This may be performed, for example, in connection with step 904. This can be an option, for example, in such a case that the manner of presenting the glyph group 16 is different in each meeting or e.g. due to circumstances depending on program suppliers. In a case according to the embodiment, the grid coordinates of the glyph group 16.1 in the viewfinder image 50 as seen on the screen 13 are gx=1 and gy=1.
When considering generally the implementation of the described embodiment the data (x1, y1, x2, y2, hei, wid) determined from the image 50 only describe the location and size of the glyphs 16.1, 16.2 that are captured by the camera 52. The described implementation decodes all the four glyphs 15 on the client device 10, 10.1, 10.2, and transmits a byte (integer 0 . . . 255) representing them to the server 11 together with (x1,y1, wid,hei) in order to determine the “coarse” part of the screen 13 that is operated. The server 11 happens to know that it uses an evenly laid out glyph grid 14. It splits the byte to two integers (gx and gy, in the range 0 . . . 15) and continues calculation with refGridWid & refGridHei. The grid coordinates (gx, gy) can be considered as a traditionally understood coordinate system if the glyph groups 16.1, 16.2 are distributed evenly. If the glyph groups 16.1, 16.2 are put on the screen 13 in an arbitrary manner, another mapping (GridNum—>on screen coordinates) is used instead of mapping grid coordinates (gx, gy) to on-screen co-ordinates (x, y).
When considering more precisely this “coarse” part determination the encoding (from 1 byte=8 bits to 4 glyphs) may go according to one embodiment as follows: first the byte is splitted to 2-bit parts (A B C D)—>here AB (concatenated) (4 bits) is gx and CD (4 bits) is gy, secondly for each part (with value 0 . . . 3) create a L-shaped glyph as follows: 0=L, 1=—|, 2=-| and 3=|-
These parts are put on-screen as follows:
The control device 10 pretty much reverses the process—from a glyph image to 2-bit parts (a,b,c,d)
The server 11 can now calculate: gx=1, gy=2 and the top-left coordinates of the glyph group are now: x=refgrid_wid*1; y=refgrid_hei*2.
As is said in above there is also a possibility to use array_generated in the server 11, which would contain mappings (glyphgroup ID—>screen location or bounding box of glyph group). Here glyphgroup ID would be the full byte.
The array_generated could also be according to another embodiment transferred to the client device 10 so that the client device 10 would be able to perform the whole of the location calculation and output the screen coordinates of the pointed location.
For calculating the location 21 pointed by the camera 52 it is sufficient that in the image information 50 formed by the camera 52 there is at least one glyph element group 16 in its entirety at each time. In group 16, two glyphs 15 can express the glyph group's 16 X coordinate and two glyphs 15 can express the glyph group's 16 Y coordinate.
According to one embodiment, the camera device 10 can send to server 11, for example, the top left (x1, y1) and also possible the bottom right (x2, y2) coordinates of the glyph group 16.1 recognised from the viewfinder image 50 (step 908). When the width and height of the camera device's 10 viewfinder VF are known (vf_wid, vf_hei) and the data corresponding to them has also been received by server 11 as step 908′, it is possible there to calculate the final aimed-at location 21 of the screen surface 13.
Programme code 32 executed by the processor CPU2 of the server 11 receives as input data x1, y1, wid, hei, gx, gy concerning the image information 50 formed from the information surface 13 by control device 10 (code means 32.7). The each control device 10, 10.1, 10.2 performing the controlling is recognized by the server 11 by using, for example, their BT address (port). This makes possible that server 11 serves multiple devices connected the same display 12′ utilising the visually interactive screen to share and manipulate the data in a collaborative manner, for example, in a negotiation situations. The on-screen coordinates of the screen location 21 pointed by the camera device 10 can be calculated by using the variable values x1, y1, wid, hei, gx, gy presented above and received from the control device 10 in the following manner by the code means 32.1 (step 909′):
x=gx*refGridWid+(sqSize*((vf_wid/2)−x1)/wid); (x==1*240+(40*((160/2)−40/15)==347)
y=gy*refGridHei+(sqSize*((vf_hei/2)−y1)/hei); (y==1*192+(40*((160/2)−20)/15)==299)
Based on this, the central location 21 in the viewfinder image 50 is directed to aim at a location (347, 299) in the coordinate system determined by the screen domain 13. A mouse pointer, object or the control action active at each time can now be aimed at this screen location 21 (step 910′). The concerned series of actions ends in step 911′ if the final acknowledgement is received or corresponding indication to terminate the concerned action. It must be understood that steps 905-910′ form a continuous loop to perform controlling of the information apparatus 11. Thus, this means that the information surface 13 is continuously pointed by the mobile phone 10 with out aiming any other objects than the information surface 13 that is operated. For this has been referred above in the description in connection with the imaging frequency of the camera 52.
The scale variable that is required for, for example, the 3D control of the information (described later) and in order to determine how far the device 10 is from the tags 16.1, 16.2 depends on the distance between camera device 10 and screen surface 13. The variable scale may control, for example, the presentation size of the information. It may be determined to be as follows in which it is inferred from the size of the tags 16.1, 16.2 on display VF and in surface 13:
scale=(wid+hei)/2.0/sqSize;
The manner presented above of determining the location 21 presented on screen surface 13 by using the camera device 10 is to be understood as one application example only. The basic embodiment presented herein does not take into account, for example, the camera device's 10 possible orientation angle in relation to the screen surface 13. In order to carry out the determination of the intended pointed location 21 in the desired manner, the control device's 10 alignment in relation to the screen surface 13 must be of the established kind in this case. In a set-up of this kind, the screen surface 13 presenting the information INFO must be located rather directly in front of the camera device 10 being mainly in the same plane. However, the device 10 may be allowed a deviation angle of, for example, some ten degrees in relation to the pointed screen surface 13, but even then it is still possible to determine the pointed location 21 (for example, 10-15 degrees is ok). With an additional code adapted to the calculation, the camera device's 10 turning and tilting can be taken into account. This may be inferred from the tags' 16.1, 16.2 orientation and, for example, the utilization of a specific kind of “encoding” for the tags. This implementation is described more precisely later in the description. Also, the devices' 10, 10.1-10.2 angular velocities may be determined. This may be inferred from the rate of change of the tags' 16.1, 16.2 orientation. It is also possible by encoding (for example, by check sums) to recover from and cope with possible recognition errors that may sometimes occur in an optical recognition of this kind. When using more advanced camera optics, resolution and zoom, it is possible to increase the control distance from the information surface 13.
The method presented above can be varied in several different ways. Instead of the generated datamatrices that spans over the canvas 13, for examples, in two dimensions, the glyphs 15 can also be located dynamically on the whole wide screen surface 13. Locating them randomly on the surface 13 is hereby possible. In this case, the glyph patterns could be encoded by one number (byte) and the lookup table 23 could be used for conversion between the glyph group's 16 number and the glyph group's 16 top-left coordinate.
It is possible in the embodiment presented above to manage even without the lookup table 23. The gx*refGridWid and gy*refGridHei formulas could hereby be used for converting the encoded glyph data into top-left screen coordinates x1, y1 of the concerned glyph group 16. The “glyph data” would hereby simply be gx*16+gy.
The colours of glyphs 15 or glyph groups 16 can also be varied. One example of this is presented in
The invention provides a means to render the application specific data to be presented visually and layered on top of the data grid 16.1, 16.2 on the visually interactive display 12′. The ways of expression of control and/or setting information 15, 16, 17 and of the information INFO presented on the screen surface 13 and/or the background of the screen surface 13 can also be compared to each other (code means 32.13). In case their difference does not meet a criterion set, for example, in regard to their colour, the expression of control and/or setting information 15, 16, that is, their colour in this case, can be changed in order to get a difference. Of course, it is also possible to change the colour of the information INFO presented on surface 13 and/or the background colour of the desktop of presentation software 34.
Such a situation may occur, for example, when the glyphs 15, 17 have been presented with, for example, a red colour. If a negotiator makes notes on the presentation surface 13 with a red colour, then the red glyphs 15, 17 will not necessarily stand out from the actual presentation information INFO shown on the screen surface 13. The similarity of the colours of control and/or setting information 15, 16, 17 and of the information INFO to be presented is hereby noticed and the colour of the control and/or setting information 15, 16, 17 is adjusted to be such that they can be better distinguished from the presented information INFO. This adjusting process may be performed automatic by the server 11.
Referring to
More generally, if in the content INFO visible to the user on screen surface 13 there is one or more signal colours characteristics of the camera sensor 52, some minor adjustment can hereby be made to the colour of the content INFO. Even a small change in the colour of content INFO is enough to bring about control information 15, which the human eye cannot distinguish or that is minimally visible to the human eyes from the content INFO/background, but which the camera sensor 52 can distinguish. The use of yellow glyph groups on a white screen background can be mentioned as another example (for example, in the case of an LCD display).
Besides presentation and editing of information INFO, the invention also makes possible a transfer of information INFO, FILE between the server 11 and the mobile equipment 10. Using control device 10 it is possible to transmit by data transmission module 22 to information apparatus 11 such information FILE, 18, which is intended for visualisation or distribution. On the other hand, the control device 10 can also be used for receiving from the information apparatus 11 such information FILE, 18, which is set for visualisation and/or distribution by this.
Like the control, creating and editing of information INFO visualised on screen surface 13, transmission and receiving actions are also made possible by such data transmission unit 22, 24 and connection settings 17′ in both devices 10, 11, which according to the invention can also be determined from the connection setting information 17 presented on the information surface 13. Owing to this, those taking part, for example, in a negotiation will immediately get in electronic form the information presented in the meeting or formed during the meeting. Owing to the invention there is no longer any need for post-deliveries of information, for example, by e-mail, which is easy to forget, as is known. Instead or besides of the application specific data received by the device 10, 10.1, 10.2 from the server 11 the information INFO presented on the visually interactive display 12′ can also be made available to other mobile devices' displays VF. In this case the information may be stored as image information captured by the mobile phone. It is also possible to store information locally for the future reference.
The programme codes 31, 32, which can be arranged in the device 10 and the device 11 according to the invention, may contain several code means 31.1-31.7, 32.1-32.13, which can be executed by processor 51, CPU2. Their function can be adapted to the application descriptions just presented above, in the connection of which reference is made to them. The code means 31.1-31.7, 32.1-32.13 may be formed by a set of processor commands, which can be executed one after the other and which are used to bring about the functionalities desired in the invention in the devices 10, 11 according to the invention.
The functions of the information apparatus complex or information device arrangement 11, 12, in general, can now be controlled not only by one user but also by several users. This allows a considerably efficient teamwork, since every participant can even from his own place take part in controlling the information device arrangement 11, 12 with his own mobile device 10, 10.1, 10.2. In addition, the presentation system 11, 12 is easily moved from the negotiation premises to other negotiation premises and it is also quickly erected. The system requires no special calibration actions, but it can be used immediately upon its start-up. The arrangement allows various interaction methods (point and click, etc.). Pointing is possible, even if the phone 10 has no separate pointing device, such as, for example, a stylus (pen) or a mouse. The phone's 10 user interface UI in itself can be used as an information input device.
Various kinds of pointing at the desktop are examples of actions allowed by the invention. A mouse pointer 21 or information INFO presented on the information surface 13 can be transferred or edited as desired. Surprisingly, the invention also allows three-dimensional information control. For example, the size of the information INFO can hereby be changed on the information surface 13. If it desired, for example, to change the size of an image 18 shown on the information surface 13, the control device 10 can hereby be moved closer to the information surface 13 (decreasing) or moved farther away from the information surface 13 (increasing). It is even possible to move information 18 and change its size at the same time. This is done by application of the variable scale as was described in the foregoing.
The invention also allows downloading and uploading of files FILE on a common screen 13, for example, over a Bluetooth connection BT. The mobile equipment's 10 user can hereby upload a file FILE (Concert_video.3gp) from the device's 10 memory MEM to server 11, which will in this case project a downloading icon of the file FILE on the screen surface 13. Users of other pieces of mobile equipment 10.1, 10.2 can for their own part download on their own devices 10.1, 10.2 the video file FILE stored in the concerned server's 11 data memory MEM′. A corresponding procedure can also be implemented for image 18 (Concert.jpg). Image 18 may be presented as such on surface 13 or also as a corresponding file reference.
The arrangement may also be used for sending text elements to the screen 13 and also generally for writing text 20. The user can also draw pre-determined shapes (for example, arrows between elements, rectangles, circles, etc.) or also with a free hand, bring about new information 19 more generally, besides editing existing information INFO. Generally speaking, at least corresponding actions are possible as in traditional virtual presentation environments. And what is there to prevent from applying the invention also in parallel with traditional white boards based on contact. This means that the concept “surface 13” is not to be understood only as two dimensional flat like surface on which the information is projected but also as a electronic structure on which the information is presented (a LCD-display, for example).
As already said before, the camera device's 10 turning and tilting can also be taken into account. This may be inferred from the tags' 16.1, 16.2 orientation and the utilization of a specific kind of “encoding” for the tags.
The basic group 16.1, 16.2 of four L-shaped glyphs in their basic orientation as already described prior would encode a grid ID (0). If the user rotates the camera 52 by 90, 180 and 270 degrees, he will get other ID's for the same glyph group (in decimals 85, 170 and 255 respectively). According to one embodiment it is possible to use an encoding by using of which it would be possible to determine the orientation of the camera device 10, 10.1-10.2.
One way to deal out this would be to encode only values 0-63 to group of four grids (in carefully chosen way so that none of the encodings is a rotation of another: e.g. from above, if it is used ID (0) it is not possible to use ID's 85, 170 and 255—limiting the use to using 0-63 as in the current encoding would do (but it would lead to one of the L's being in fixed position). Other way would be using the current group of four glyphs and put an external marker for being able to detect the rotation (e.g. a fifth glyph or a single square or something). One example of this may be a glyph group
In this alternative O may be a red filled square of size of L or a square with width and height being ½ of the width of an L). The latter alternative of these would look something like below, of course, dot location could also be different
Both ways above would allow to detect the “basic” orientation of the camera phone device 10, 10.1-10.2 (rotated 0, 90, 180 or 270 degrees). In addition, there would be possible need to handle the “in between” orientations, e.g. so that would have a 110 deg. rotated image and be able to interpret it as an image rotated by 90+20 degrees and continue the calculation respectively.
Although the invention has been described to a far extent as an application, wherein data x1, y1 determined from image information 50 and concerning control information 15, 16 is sent to a server 11, it is possible also to send image data 50 in its entirety to the server 11. Hereby all the determinations concerning the targeted location 21 are carried out on the server 11, while the control device 10 is used just to perform capturing of the image information 50.
It is also possible to perform all location determinations and calculations relating to them surprisingly in the control device 10. In such case the control device 10 may transmit to the server 11 or even directly to the data projector or display device 12 only the final on-screen coordinates y and x, the value of the variable scale and the code which indicates the action to be performed or only the control commands in order to control the data projector or display device 12 directly. In this case the server 11 may transmit the data required by the determination to the control devices 10. Thus, in general, the comparison/calculation of arrays (GENERATED/VISIBLE) can take place in different places: 1) the computing device 11 that has generated the array of generated datamatrices 2) the computing device 51 attached to the camera 52 3) in some setups 1+2 may be surprisingly the same computing device that may be, for example, the mobile phone 10.1.
In first possible case the server 11 knows location of each piece of control information, client 10 calculates and sends low-bandwidth data to server 11 to perform the full calculation. In the second possible case the locations of control information are sent to the client 10 (database 23), so the client 10 can calculate the actual pointing location 21. In the third embodiment the client 10 and server 11 parts are in the same device 10. In that case the client 10 itself controls the pointing.
The method and device 10 according to the invention can be adapted for use, for example, at group events with at least two participants. Some examples of such are negotiation events, idea generating events and brainstorming events. The invention can also be used for distributing information efficiently at mass events, such as, for example, fairs.
Hereby, for example, in the entrance hall to the fair there could be a server 11, presentation device 12 and information surface 13 according to the invention. The information surface 13 can hereby be used for presenting, for example, control and setting information according to the invention and, for example, a file downloading icon (reference) concerning content having a format of a set form. The file, which can be downloaded in the device 10, may contain, for example, details relating to the fair (for example, information on timetables and exhibitors).
Fair visitors may when entering set up a session with server 11 in the manner according to the invention and choose for downloading the concerned file in their portable device 10, which is equipped with camera 52. In this way the invention also allows an efficient distribution of information to several parties without even involving any presentation or generation of information. The information is in an electronic form in the device 10 for easy watching by visitors to the fair.
According to one more embodiment the control device 10 and the information apparatus may be the same unified device and there would only be an external display. A realistic case that is feasible even with the existing hardware would be e.g. software that has both functionalities (the camera part and the glyph generation part) running on Nokia 9500 communicator or corresponding “SmartDevice”. This device would be connected via Bluetooth or WLAN to (a module connected to) directly on the data projector 12.
In a “smart projector” embodiment to the one display device is embedded the generating functionality of the encoded location elements and also possibly the hiding functionality of the encoded location elements (possibly to previously-known locations).
The single control device 10 that functions as a server could also be connected to a smart projector as a mouse replacement without needing to have any software installed on the PC. The smart projector may be connected to a PC (without the “server” functionality). The same analogy may be applied in this connection with the “virtual” mice of some of present projector remote controls that connect as a mouse to the PC.
The
In this embodiment is used a number of cameras 52 and fixed points to determine the accurate position, for example, the pointing point and also an orientation of an control device 10.3-10.8 related to the surface(s) or more general, to at least one reference point. Thus, besides of the location 21 pointed at also the orientation of the control device 10, 10.1-10.8 may also be determined on the basis of the origin of the received image information 50 as determined by the control information 15, 16.
The cameras 52 are mounted on a handheld or a head-mounted device 10.3-10.8. The camera devices 10.3-10.8 can communicate, for example, wirelessly with a system 10.6 with a display 13, 13′. By means of this they can be used in navigating and manipulating 3D environments. The described devices 10.3-10.8 could also be used, for example, detecting and transmitting movements in virtual meetings.
The first embodiment needs only a single plane equipped with encoded location information 14. One of the cameras 52 always points to the plane with coordinate data 16.1. This is depending on the properties of the camera lenses. If the user is so close to the plane 13, 41 that the display angle is small enough, then there may be difficulties to find coordinate data. Zooming may be applied to solve problems relating to this.
In
In this device 10.3 there are cameras 52 on five sides of the up end of the device 10.3. This end may have a form of cubic, for example. To the one side is fixed the grip handle 42. The other sides include the cameras 52. The grip handle 42 may have buttons etc (not presented). Also, there is one camera 52 on the opposite end relative to the up end equipped with five cameras 52.
This embodiment uses at least one camera 52 at a time. The method described already above is partly applied also in this connection. However, that more or less assumes that the camera 52 is pointed somewhat towards the canvas 13, 13.1-13.3 and thus that requires some more. Next a pseudo-code description is presented for the implementation of 3D-pointing for finding the location and orientation of the control device 10.3-10.8.
IF current_camera sees data matrix THEN
In Step 1 the location and orientation of the camera are calculated in relation to the canvas coordinate system aka the world coordinate system. Implementation of the step 2 is somewhat trivial for the man skilled in the art, and it basically involves translation (movement) and rotation from the coordinates and orientation of the camera to the coordinates (x, y, z) and orientation (α, β, γ) of the control device. Also, step 3 is trivial and not described in this connection in great detail.
One application example to implement the steps 1 and 2 are described next. The world coordinate system may be defined as follows. Let the origin of the coordinate system (0, 0, 0) be the middle point of a flat surface 13, 13.1 that has reference points (glyphs) 16.1, 16.2 on the surface 13, 13.1, and Z axis be positive outwards the surface (i.e. to the direction of the viewer). So the surface 13, 13.1 is (x, y, 0) where x and y are variable.
The input for the algorithm is a camera image 50 with known reference elements 16.1, 16.2 whose location and orientation is known, all of them lay on a plane 13, 13.1. The outputs of the algorithm are the coordinates (x, y, z) and also the orientation (α, β, γ) of the control device in relation to the origin. The location (x, y, z) and orientation (α, β, γ) are enough.
In order to do this it is possible to apply, for example, inverse perspective transform (for Step 1 above). One possible method for this may be, for example, a homography transform. Homography means a relation between two figures, such that to any point of the one corresponds one and but one point in the other, and vise versa.
Pose estimation may be one possible procedure to compute the position of the control device relative to the surface 13, 13.1. This gives a 3×4 homography matrix that represents mapping between real world coordinates Xi and image coordinates xi. The relationship can be written as:
xi=λPXi where xi and Xi are in homographic form.
Once the matrix P has been found, it can be used to find out the location of the camera and orientation of the camera. With couple of points it is possible to find the direction vectors of the camera (=upwards and leftwards and outwards in the camera image) in the world coordinate system defined above.
Step 2 is about mapping the direction vectors and location of the current_camera to the direction vectors and location of the control device. Let's assume that an origin point and directions for the control device (to know where is “up”, “right”, “outwards” from the control-device point of view) are agreed. It is a trivial transformation from camera location+orientation to device location+orientation, basically it's just translation (=move) and rotation of known angles, and these go with simple matrix multiplications.
Some alternative control devices 10.5-10.8 are presented in
One more example is a camera glove 40 presented in
A head-mounted, standalone display is also possible. It may have a form of band/strap to be attached to the forehead. The arrangement may be like a forehead light used in, for example, night-time orienteering. External cameras 52 can also be attached to some part of spectacles instead of a separate strap.
In
The glove embodiment is presented in
Several advantages have been achieved by the embodiments presented in
For head-mounted 3D-locationing one can be found several applications easily. Some of them are, for example, CAD/52, 3D games and virtual meetings. In those the head-mounted device may detect the head movements of the user. These detected movements can be shown to other participants of the meeting. The embodiment can be also used as “See-what-I-see” 3D-system. In general, it is possible to speak about pointing by using body movements.
Additionally, the data (x1, y2, hei, wid, gx, gy) presented already in different connections above works in the basic 2D situation, but in the 3D version it is possible to calculate a mapping by rather using the corner points of a glyph group 16.1, 16.2 (or more likely multiple glyph groups). So, instead of x1, y1, wid, hei (which represents a non-rotated rectangle), it is possible to apply, for example, the XY-coordinates of the four different reference points: topleft TL, topright TR, bottomleft BL, bottomright BR of a glyph group 16.1 (plus also gx, gy). From these “stretched” glyph corner points it is possible to determine the rotation angle of the control device 10.
This information can be used surprisingly to calculate both the orientation and also the location of the device 10, 10.1-10.8 relative to the reference surface 13, 41 or any other corresponding reference target which may play such.
It should be understood that the foregoing description and the figures relating to it are only intended to illustrate the present invention. Thus, the invention is not limited only to the embodiments presented above or defined in the claims, but many such different variations and modifications of the invention are obvious to the professional in the art, which are possible within the scope of the inventive idea defined in the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
PCTFI2005050263 | Jun 2005 | FI | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/FI2006/050306 | 6/30/2006 | WO | 00 | 11/16/2007 |