This application is based on and claims priority under 35 U.S.C. 119 from Japanese Patent Application No. 2008-153181 filed Jun. 11, 2008.
The present invention relates to an information display apparatus, an information displaying method, and a computer readable medium.
According to an aspect of the present invention, an information display apparatus includes: a receiving unit that receives character sequence information arranged in a plurality of lines; an image acquisition unit that acquires a line image in which an end of an nth line and a start of an (n+1)th line of the received character sequence information are connected into a single line, n representing an integer of 1 or more; and a display unit that displays the acquired line image within a predetermined display range of a screen.
Exemplary embodiments of the present invention will be described in detail based on the following figures, wherein:
Exemplary embodiments of the present invention will be described with reference to the drawings. As illustrated in
The control section 11 is a program control device such as a CPU (Central Processing Unit), and is operated in accordance with a program stored in the storage section 12. This control section 11 acquires character sequence information to be subjected to a display process, and generates and acquires an image in which the end of an n-th line and the start of an (n+1)-th line of the character sequence information are connected into a single line. Then, the generated line image is displayed within a display range determined in advance in the display section 14. The detailed process contents of this control section 11 will be described later.
The storage section 12 is a storage device such as a RAM (Random Access Memory), and retains a program executed by the control section 11. This program may be provided on, for example, a CD-ROM (Compact Disc Read Only Memory), a DVD-ROM (Digital Versatile Disc Read Only Memory), or a portable memory device, and then stored in this storage section 12. This storage section 12 also operates as a work memory of the control section 11.
The operation section 13 is constituted by, for example, a numeric keypad, and/or arrow keys or the like for providing an instruction for vertical and lateral moving directions. This operation section 13 outputs the contents of an operation, performed by a user, to the control section 11. The display section 14 is a display device such as a liquid crystal display, for example, and displays an image of a character sequence or the like at an instructed position in accordance with an instruction inputted from the control section 11.
Next, the contents of process steps performed by the control section 11 will be described. As illustrated in
The document layout analysis section 21 acquires data of a document to be processed. As illustrated in
The document property extraction section 22 extracts information concerning the start position of each line, line height, line width, length between lines, and blank portion from character sequence information included in the character sequence portion image outputted from the document layout analysis section 21. For example, as illustrated in
From the vertical average distance and lateral average distance between the adjacent character circumscribing rectangles, the document property extraction section 22 determines that the characters are read in a direction in which the distance between the adjacent character circumscribing rectangles is shorter, and obtains line circumscribing rectangles further circumscribing a plurality of the character circumscribing rectangles in the direction in which the characters are read (
Then, the document property extraction section 22 recognizes, as line start positions (L1, L2 . . . ), positions at ends of the line circumscribing rectangles, located opposite to the direction in which the characters are read (
Further, the document property extraction section 22 recognizes the height of the line circumscribing rectangle for each line (hn), the width of the line circumscribing rectangle (wn), and the distance to the adjacent line circumscribing rectangle (ln) as the line height, the line width, and the length between the lines, respectively (S4). Furthermore, the document property extraction section 22 detects a value “wmax” indicating the maximum width among the widths of the respective line circumscribing rectangles, and obtains a difference between the value “wmax” and the width of each line circumscribing rectangle as follows: Wrest_n=wmax−wn. This “Wrest_n” serves as a value representing the width of a blank portion of an n-th line. Moreover, among the respective line circumscribing rectangles, there may be obtained a value “Lmin” of the start position of the line circumscribing rectangle in which the line start position is located most opposite to the character reading direction (e.g., the most leftward line circumscribing rectangle when the characters are read from left to right), and a blank at the line starting point “|Ln−Lmin|” and a blank at the line end “Wrest_n−|Ln+Lmin|” may further be computed as blank portion information. It should be noted that in this embodiment, “|x|” represents an absolute value of “x”.
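The quantities above may be illustrated by a minimal Python sketch, assuming left-to-right reading and line circumscribing rectangles that have already been obtained. The `LineRect` type and the function names are hypothetical and are not part of the embodiment; only the shorter-distance rule for the reading direction, the relation Wrest_n = wmax − wn, and the line-start blank |Ln − Lmin| are taken from the description.

```python
from dataclasses import dataclass


@dataclass
class LineRect:
    """Line circumscribing rectangle (hypothetical helper type)."""
    left: int    # X coordinate of the line start position (Ln)
    top: int     # Y coordinate of the upper side
    width: int   # line width (wn)
    height: int  # line height (hn)


def reading_direction(avg_lateral_gap, avg_vertical_gap):
    """Characters are read along the axis on which neighbours are closer."""
    return "horizontal" if avg_lateral_gap < avg_vertical_gap else "vertical"


def blank_widths(lines):
    """Return (Wrest_n, line-start blank) for each line rectangle."""
    w_max = max(r.width for r in lines)  # wmax
    l_min = min(r.left for r in lines)   # Lmin
    return [(w_max - r.width, abs(r.left - l_min)) for r in lines]
```

For instance, three lines of widths 100, 80, and 50 whose first line is indented by 10 pixels would yield blank widths of 0, 20, and 50, with a line-start blank of 10 for the first line only.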
The image deformation section 23 secures, in the storage section 12, a storage region for an image having at least a width of (2×W) and a height of (2×H). Further, this image deformation section 23 initially sets a variable “n”, representing a noticeable line, so that n=1. Hereinafter, for simplification of the description, the following description will be made on the supposition that the character sequences are read in the direction from left to right (in the direction of an axis X). However, for example, when the character sequences are located from a higher position to a lower position, the axis may be changed, and when the character sequences are read from right to left, the axial direction may be reversed.
For example, the image deformation section 23 places the original image, outputted from the document layout analysis section 21, within a range “(O, H−ΣPi−1)−(W, 2×H−ΣPi−1)” of the secured region (
Thus, the image deformation section 23 continuously and repeatedly arranges original images in the character reading direction, and as for the images arranged adjacently, the image deformation section 23 arranges the images so that they are deviated by a line “Pn” in the direction in which the lines are arranged. Specifically, as illustrated in
Moreover, the image deformation section 23 extracts a portion of this line-connected image, and outputs the extracted portion to the display section 14 so that the display section 14 displays the extracted portion. In other words, the image deformation section 23 receives information indicating the shape and size of a displayable range of the display section 14, and extracts, from the line-connected image, a partial image of the range equivalent to these shape and size. In one example, if the displayable range of the display section 14 is equivalent to the shape and size represented by a rectangle having the following dimensions: a width “Rw”×a height “Rh” (which will be hereinafter called an “extraction range R”), the image deformation section 23 extracts and outputs an image of this extraction range R (
The image deformation section 23 receives, from the movement control section 24, information indicating a position of this extraction range R on the line-connected image, and sets the extraction range R at a position on the line-connected image indicated by this information. Then, the image deformation section 23 extracts a partial image within the set extraction range R, and outputs the partial image to the display section 14.
Further, when the coordinate of an end side opposite to the moving direction of the extraction range R has reached the boundary of the original images repeatedly arranged, i.e., when the X-axis coordinate (where the lateral axis is defined as the X axis) of a left end side of the extraction range R has reached the width W of the original image, the image deformation section 23 moves the original image, placed within the range “(W, H−ΣPi)−(2×W, 2×H−ΣPi)”, to the range “(O, H−ΣPi)−(W, 2×H−ΣPi)”. Furthermore, the image deformation section 23 places a new original image within the range “(W, H−ΣPi+1)−(2×W, 2×H−ΣPi+1)”. Moreover, W is subtracted from the X-coordinate value of the extraction range R.
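The bookkeeping of this boundary step may be sketched as follows, assuming forward movement only. The function name and the representation of the state as a pair (X coordinate of the left end side, noticed line of the left-hand original image) are assumptions made for illustration; the actual copying of image data is omitted.

```python
def advance_extraction_range(x, n, dx, width):
    """Advance the left edge `x` of the extraction range R by `dx` pixels (dx >= 0).

    `n` is the noticed line of the left-hand original image. Each time the
    left edge reaches the image width W, the right-hand image takes the place
    of the left-hand one, so W is subtracted from x and n advances by one.
    """
    x += dx
    while x >= width:
        x -= width
        n += 1
    return x, n
```

With W = 100, moving 25 pixels from x = 90 on line 1 gives x = 15 on line 2, so the displayed content is unchanged while the coordinates stay within the secured region.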
The movement control section 24 sets the position of the extraction range R on the X axis so that the first character of the first line is displayed at a predetermined position of the extraction range R (which is a center portion, for example, and which will be hereinafter called a “gaze position”). Thereafter, the movement control section 24 moves the position of the extraction range R in a certain direction on the line-connected image with the passage of time. In the present embodiment, among pieces of character sequence information arranged in a plurality of lines, the end of the n-th line and the start of the (n+1)-th line are connected to generate a one-line character sequence image; therefore, if the extraction range R is moved in one direction (X-axis direction) along this one-line character sequence image, character sequences of respective lines are sequentially scroll-displayed.
The information display apparatus 1 of the present embodiment includes the above-described configuration, and is operated as follows. The control section 11 acquires, via a communication unit such as a network, for example, data of a document to be processed. Then, a widely known layout analysis process is performed, thereby identifying a character sequence portion (T).
The control section 11 extracts information concerning the start position of each line, line height, line width, length between lines, blank portion and the like from character sequence information included in the character sequence portion image. Further, the control section 11 continuously and repeatedly arranges the extracted character sequence portion images (original images) in the character reading direction. During this time, the adjacently arranged images are located so as to be deviated by one line in the direction in which the lines are arranged, thereby aligning the end of the n-th line and the start of the (n+1)-th line with each other in a single line.
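The effect of this staggered arrangement may be illustrated with a character grid in place of a bitmap. The sketch below assumes that every line has the same width and a pitch of exactly one row; the function name is hypothetical. Copy k of the original is placed k×W to the right and raised by k rows, so that one row of the canvas contains the lines in reading order.

```python
def line_connected(rows, copies):
    """Tile `rows` (equal-length strings, one per text line) `copies` times.

    Each copy is placed to the right of the previous one and raised by one
    line, within a canvas of twice the original height.
    """
    h, w = len(rows), len(rows[0])
    canvas = [[" "] * (w * copies) for _ in range(2 * h)]
    for k in range(copies):
        top = h - k  # copy k is raised by k lines
        for r, text in enumerate(rows):
            y = top + r
            if 0 <= y < 2 * h:
                canvas[y][k * w:(k + 1) * w] = text
    return ["".join(row) for row in canvas]
```

For the three lines "abc", "def", and "ghi" and three copies, row index 3 of the result reads "abcdefghi", i.e., the end of each line is followed directly by the start of the next line.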
Upon generation of the image (line-connected image) in which the end of the n-th line and the start of the (n+1)-th line of the character sequence information are connected into a single line, the extraction range R, having the shape and size equivalent to the displayable range of the display section 14, is set at a position including the starting point thereof (i.e., a position corresponding to the start of the first line), and this extraction range R is moved at a certain speed (p pixels per second), for example, in the direction of the lines (i.e., the direction in which the characters are read).
The control section 11 extracts a partial image within the extraction range R from the line-connected image, and allows the display section 14 to display the extracted partial image. In accordance with the movement of the extraction range R, the line-connected image is scroll-displayed; therefore, all a user has to do is to look at a specific range of the display section 14, thus making it possible to continuously read the character sequences connected into a single line regardless of line feed of the character sequence.
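The extraction and scroll steps may be sketched as follows, again on a character grid. The function names are hypothetical, and a constant speed of p pixels per second measured from a starting position x0 is assumed.

```python
def left_edge_at(t, p, x0=0):
    """X coordinate of the left end side of R after t seconds at p pixels/s."""
    return x0 + int(p * t)


def extract_range(canvas, x, y, rw, rh):
    """Return the partial image of width rw and height rh whose corner is (x, y)."""
    return [row[x:x + rw] for row in canvas[y:y + rh]]
```

Calling `extract_range` repeatedly with the value returned by `left_edge_at` for the elapsed time yields the successive partial images to be displayed.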
Further, the extraction range R is moved at a certain speed (scroll speed) in this embodiment; however, for example, a user may be allowed to adjust the moving speed. For example, while a key of the arrow keys, which is associated with the character sequence reading direction, is pressed down, the moving speed may be increased by a predetermined speed. Furthermore, while a key associated with the direction opposite to the character sequence reading direction is pressed down, the moving speed may be decreased, or the moving direction may be reversed. In the case of reversing the moving direction, the control section 11 allows the image deformation section 23 to perform the following process steps. When the coordinate of an end side opposite to the moving direction of the extraction range R has reached the boundary of the original images repeatedly arranged, i.e., when the X-axis coordinate of a right end side of the extraction range R has reached the width W of the original image, the image deformation section 23 moves the original image, placed within the range “(O, H−ΣPi)−(W, 2×H−ΣPi)”, to the range “(W, H−ΣPi)−(2×W, 2×H−ΣPi)”, and newly places an original image within the range “(O, H−ΣPi−1)−(W, 2×H−ΣPi−1)”. Moreover, W is added to the X-coordinate value of the extraction range R.
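One possible form of the key-dependent speed adjustment is sketched below. The key labels, the function name, and the choice of clamping the decreased speed at zero are assumptions for illustration; the description only states that the speed may be increased, decreased, or reversed.

```python
def next_speed(speed, key, step, reverse_on_back=False):
    """Return the scroll speed (pixels per second) for the current key state.

    key: "forward" while the reading-direction key is held, "back" while the
    opposite key is held, or None when no key is held (hypothetical labels).
    A negative return value means movement against the reading direction.
    """
    if key == "forward":
        return speed + step
    if key == "back":
        return -speed if reverse_on_back else max(0.0, speed - step)
    return speed
```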
Besides, the control section 11 may move the position of the extraction range R to the previous line or the next line in accordance with an instruction from a user, for example. Specifically, upon reception of an instruction for moving the extraction range R to the previous line, for example, the control section 11 moves the position of the extraction range R from the position thereof at the time when the instruction is received toward the direction opposite to the character sequence reading direction by a width of the original image. On the other hand, upon reception of an instruction for moving the extraction range R to the next line, for example, the control section 11 moves the position of the extraction range R from the position thereof at the time when the instruction is received toward the character sequence reading direction by a width of the original image.
It should be noted that when there are a plurality of pieces of character sequence information arranged in a plurality of lines (i.e., character sequence portions found by a layout analysis process), as already mentioned above, these pieces of information are prioritized in the predetermined order, and respective images thereof are to be processed. In this case, the control section 11 may connect the end of the final line of the i-th character sequence information (character sequence portion image) and the start of the first line of the (i+1)-th character sequence information (character sequence portion image) into a single line, thus generating a line-connected image.
Further, the control section 11 may change the rendering color between a character present in a line different from a line in which there is located a range (gaze position) such as a center portion of the extraction range R where a character being read by a user should be displayed (which will be hereinafter called a “different line character”), and a character that is included in a line in which the gaze position is located and that is present at least within a certain range from the gaze position (which will be hereinafter called a “gaze character”). In one example, the different line character may be displayed in light gray, and the gaze character may be displayed in dark gray or black.
Furthermore, even if characters are included in a line in which the gaze position is located, the control section 11 may change the rendering color between the gaze character and the character present at a position outside a certain range from the gaze position. For example, the farther away from the gaze position, the lighter the color of the displayed characters may be (
It should be noted that instead of changing the rendering color of pixels constituting characters, the rendering color of background of the gaze character may be different from that of the other portion. Further, the rendering color of pixels constituting the gaze character and that of its background may both be different from the rendering color of the other portion. Furthermore, in this embodiment, the rendering color of a character included in a line in which the gaze position is located, or the rendering color of a character present within a certain range from the gaze position is changed; however, an image other than a character included in a line in which the gaze position is located, or an image other than a character present within a certain range from the gaze position may also be displayed in rendering color different from that of the other portion. For example, an image other than a character present in a line in which the gaze position is located may be increased in chroma, and the other images may be reduced in chroma.
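The distance-dependent shading of characters in the line containing the gaze position may be sketched as follows. The linear ramp, the 0 to 255 gray scale (0 being black), and all parameter names and default values are assumptions; the description only requires that characters farther from the gaze position may be rendered lighter.

```python
def gray_level(char_x, gaze_x, near, far, dark=0, light=200):
    """Gray level for a character at char_x (0 = black, larger = lighter).

    Characters within `near` pixels of the gaze position are drawn dark,
    characters beyond `far` pixels are drawn light, and the level changes
    linearly in between.
    """
    d = abs(char_x - gaze_x)
    if d <= near:
        return dark
    if d >= far:
        return light
    return round(dark + (light - dark) * (d - near) / (far - near))
```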
(Example of Diagonal Display)
In the description made thus far, the extraction range R is a rectangle, and is set so that a pair of sides thereof is in parallel with respective associated lines included in a line-connected image, but the present invention will not be limited to this. Alternatively, in order to make it clear to a user that the next line is going to be read in the course of reading, the extraction range R is inclined by an angle θ as shown in
(Switching to Other Display Mode)
Further, the control section 11 of the information display apparatus 1 of the present embodiment may be capable of performing switching between a mode in which the line-connected image is scroll-displayed, and a mode in which the line-connected image is not scroll-displayed. In this case, in the scroll display mode, a line other than a line including the gaze position is displayed in relatively light gray, for example, as in the display shown in
Moreover, as shown in
The control section 11 may be capable of performing mutual switching among the overall display mode, non-scroll display mode, and scroll display mode.
(Case Where Character Type is Recognizable)
Further, the control section 11 uses information about the blank at the line starting point “|Ln−Lmin|” and the blank at the line end “Wrest_n−|Ln+Lmin|” for the noticeable line “n”, and may increase the scroll speed (moving speed of the extraction range R) when the gaze position of the extraction range R (e.g., the center coordinate) is located somewhere between the left end of the original image and the blank at the line starting point “|Ln−Lmin|”. Furthermore, the control section 11 may increase the scroll speed (moving speed of the extraction range R) when the gaze position of the extraction range R (e.g., the center coordinate) is located somewhere between the position located leftward from the right end of the original image by the blank at the line end “Wrest_n−|Ln+Lmin|” (i.e., the front end of the blank) and the right end of the original image.
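The blank-dependent speed change may be sketched as follows, assuming that the gaze position is expressed as an X coordinate measured from the left end of the original image containing the noticed line. The function name and the multiplication factor `boost` are hypothetical.

```python
def speed_in_blank(base, gaze_x, width, start_blank, end_blank, boost=2.0):
    """Return an increased speed while the gaze position lies in a blank.

    start_blank and end_blank are the widths of the blanks at the line
    starting point and at the line end of the noticed line; width is W.
    """
    in_start = gaze_x < start_blank
    in_end = gaze_x >= width - end_blank
    return base * boost if (in_start or in_end) else base
```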
Moreover, in the description made thus far, an image of a character sequence included in an original image is supposed to be a bitmap image; however, when code information for the character sequence included in the original image (information by which the character type can be determined as a Chinese character, a Japanese phonetic syllabary, or an alphabet) is also provided, the control section 11 may use this code information to change the scroll speed (moving speed of the extraction range R) depending on the type of the character present at the gaze position (e.g., center coordinate) of the extraction range R. For example, the scroll speed may be reduced when the character type is a Chinese character, may be increased when the character type is a Japanese phonetic syllabary, and may be further increased when the character type is an alphabet.
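When the code information is available as Unicode text, the character-type-dependent speed may be sketched as follows. The classification by code point range (CJK Unified Ideographs U+4E00 to U+9FFF, Hiragana and Katakana U+3040 to U+30FF, ASCII letters) is a simplification chosen for this sketch, and the factors in `FACTORS` are hypothetical values that only preserve the ordering given in the description.

```python
FACTORS = {"kanji": 0.8, "kana": 1.2, "alphabet": 1.5, "other": 1.0}  # assumed values


def char_type(ch):
    """Classify a single character by its Unicode code point."""
    cp = ord(ch)
    if 0x4E00 <= cp <= 0x9FFF:   # CJK Unified Ideographs
        return "kanji"
    if 0x3040 <= cp <= 0x30FF:   # Hiragana and Katakana blocks
        return "kana"
    if ch.isascii() and ch.isalpha():
        return "alphabet"
    return "other"


def scroll_speed(base, ch):
    """Scroll speed for the character currently at the gaze position."""
    return base * FACTORS[char_type(ch)]
```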
Besides, the scroll speed may be changed in accordance with the degree of difficulty in reading a Chinese character, which is determined, for example, depending on the number of strokes of the Chinese character or on the JIS (Japanese Industrial Standards) level to which the Chinese character belongs.
In addition, the scroll speed may be changed in accordance with how a character present at the gaze position (e.g., center coordinate) of the extraction range R is modified. For example, the scroll speed may be reduced when the size of the character is large or when the character is in a bold font.
Depending on whether or not a character sequence present around the gaze position (e.g., center coordinate) of the extraction range R is a character sequence included in a predetermined dictionary, the scroll speed may be changed. For example, generally used words are included in a dictionary in advance, and when the character sequence does not coincide with any of the words, the scroll speed may be reduced.
Moreover, the scroll speed may be changed in accordance with a distance between characters of a character sequence. For example, the narrower the distance between characters, the slower the scroll speed may be.
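The adjustments described for character size, bold fonts, dictionary words, and character spacing may be combined, for example, as in the following sketch. All parameter names and numeric factors are hypothetical; `size_ratio` and `gap_ratio` are assumed to be the character size and the inter-character distance divided by reference values chosen by the implementer.

```python
def adjusted_speed(base, *, in_dictionary=True, bold=False,
                   size_ratio=1.0, gap_ratio=1.0):
    """Return the scroll speed after applying the reductions described above."""
    factor = 1.0
    if not in_dictionary:
        factor *= 0.7                # word not found in the dictionary (assumed factor)
    if bold:
        factor *= 0.8                # bold font (assumed factor)
    if size_ratio > 1.0:
        factor /= size_ratio         # larger characters scroll more slowly
    factor *= min(1.0, gap_ratio)    # narrower spacing scrolls more slowly
    return base * factor
```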
(Example of Arranging More Original Images)
Although an example of arranging two original images in the reading direction has been described thus far, a larger number of original images may be arranged in the reading direction. In this case, the j-th original image from the left (which is defined as the original image with a noticeable line “i”) is to be located at the following position: ((j−1)×W, H−ΣPi)−(j×W, 2×H−ΣPi). Further, in this case, the original image that does not overlap with the extraction range R is deleted, and the position of each original image and the position of the extraction range R are moved over in the direction of the deleted original image.
(Example of Using Server)
Moreover, in the present embodiment, instead of performing the process steps of the document layout analysis section 21 in the information display apparatus 1, the process steps of the document layout analysis section 21 may be performed in an external server device or the like. In this case, the information display apparatus 1 acquires character sequence images, resulting from the process steps performed in the server device or the like and virtually arranged in a row, and performs the subsequent process steps.
Furthermore, the process steps of the document property extraction section 22 (the process steps shown in
The foregoing description of the embodiments of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Obviously, many modifications and variations will be apparent to practitioners skilled in the art. The embodiments were chosen and described in order to best explain the principles of the invention and its practical applications, thereby enabling others skilled in the art to understand the invention for various embodiments and with the various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
2008-153181 | Jun 2008 | JP | national |
Number | Date | Country | |
---|---|---|---|
20090309892 A1 | Dec 2009 | US |