The present application claims priority to and incorporates by reference the entire contents of Japanese patent application, No. 2006-304012, filed in Japan on Nov. 9, 2006 and Japanese patent application No. 2007-116070, filed in Japan on Apr. 25, 2007.
1. Field of the Invention
The present invention relates to an image processing apparatus that generates a list display screen of plural images, such as a thumbnail list, with respect to image data accumulated in an image database and to an image processing method. More specifically, the present invention relates to a suitable technology for MFPs (Multi Function Printers) such as composite machines, file servers, and image processing programs.
2. Description of the Related Art
Although there are, for example, electronic filing apparatuses that computerize paper documents using input devices such as scanners, they are mainly used for business purposes where a large amount of paper documents is handled. In recent years and continuing to the present, electronic filing has been acknowledged for its handling ability and convenience even at offices because of the price-reduction of scanners, the widespread use of MFPs including a scanner function, and legislation such as an electronic document law, resulting in computerization of paper documents. Meanwhile, image information databases have been increasingly used that make a database (hereinafter simply referred to as DB) of image data generated by computerizing paper documents and document data generated by applications of a PC or the like to collectively manage the same. For example, even if it is necessary to store the original of a paper document, image information DBs are likely to be structured because of their easiness for management and searching.
As the image information DBs, there are various ones such as a large-scale type that has installed therein a server apparatus to which a large number of users make accesses, and a personal-use type formed by structuring a DB in the PC of an individual person. For example, recent MFPs come with a function of storing image data generated by computerizing paper documents in a built-in HDD (Hard Disk Drive), and image information DBs based on the MFPs have been structured.
When browsing the images of an image information DB in which plural images are accumulated, the user searches for a target image by using an image search method. In other words, if the image name (file name) of the search target image is known, a thumbnail list display is generally used. For example, when searching for document images, the user performs keyword searches and then displays the candidate images hit (selected) by the keyword in a thumbnail list. In order to search for the target image, the user employs a method of either selecting the search target image from the thumbnail list display at the end or using only the thumbnail list display from the beginning.
The thumbnail list display is such that plural reduced images are arrayed on a screen to facilitate understanding of the contents of the images. However, since the plural images are displayed on the limited screen at a time, the resolution of an individual thumbnail is generally low. When photographic images are displayed in a thumbnail list, it is relatively easy to understand the contents of the images even if they are reduced images of a low resolution. In the case of document images mainly consisting of characters, on the other hand, it is difficult to discriminate the characters in reduced images one from another and understand the contents of the document images. Accordingly, it is necessary for the user to zoom in on an individual document image with a viewer function or the like in order to confirm the same when searching for document images, which reduces operability during searching. Particularly, in the case of a client/server system via a network, it is necessary for the user to newly transfer image data of a high resolution when displaying images with the viewer, which causes a long processing time for confirming the plural images and remarkably-reduced search efficiency.
Since it takes time to display a large number of thumbnails in the thumbnail list display, the client/server system via a network, in particular, reduces the number of displays viewable at a time and changes a screen as if a page is turned over, to thereby reduce standby time until the thumbnails are displayed. In this case, however, the number of thumbnails capable of being displayed on the screen is small, and so it is necessary to turn over a page (change a screen) many times. Additionally, since the whole picture of the images included in the thumbnail list display cannot easily be recognized, a desired image may not be found in some cases even if the thumbnail list display is viewed until the last page. As a result, the search efficiency is further reduced. As described above, if the number of thumbnails displayed in the screen (page) is increased, it takes time and reduces the search efficiency.
Meanwhile, when the thumbnail list display is generated in the image information DB, that is, every time the display screen is created, dynamic thumbnails are not created from stored original images. Generally, there is employed a method of previously holding (accumulating) images for thumbnails generated by reducing the original images and using the same. This method is excellent in processing speed. For example, when HTML (Hyper Text Markup Language) or the like is used to create the display screen of a thumbnail list in the server/client system, the server does not generally create a bitmap display screen. The server creates only a link based on the image (file) name displayed in the HTML document, and the HTML document is developed (rendered) by browser software on the side of the client to create a so-called bitmap display screen. In this case, however, it is necessary to transfer all the thumbnail images to be displayed on the display screen from the server to the client (all the thumbnail images are generally transferred even if a part protruding from the screen exists) regardless of the size of thumbnails (usually designated by the server) to be displayed on the display screen. Therefore, if the number of thumbnails displayed on a screen is increased, the amount of data to be transferred increases accordingly. Additionally, since a small amount of data is transferred many times, data transfer efficiency is reduced to thereby take time for performing screen display on the client. Generally, since the length of a packet is fixed at data transfer and different files are not put in the same packet, redundant transfer data appear in small files. If the transfer data of the small files are increased, the redundant data are prominent to thereby reduce the transfer efficiency. Generally, if the number of thumbnails to be displayed is increased in the server as well, the workload such as disk access increases.
Accordingly, there has been proposed the search method of Japanese Patent No. JP-A-2004-258838 in order to solve the above problems. In other words, target information is searched for with simple operations, namely, a map display procedure and a thumbnail detailed display procedure. According to the map display procedure, thumbnails are arranged on a two-dimensional map and displayed. Furthermore, according to the thumbnail detailed display procedure, when the user specifies points in a specific small area from among the plural small areas formed by dividing a map, a small area group centered on the specific small area is defined as an enlargement target area. Then, the thumbnails arranged in the enlargement target area are enlarged to display contents in detail.
However, the method disclosed in the above Patent Document 1 switches a display between thumbnails and a detailed display in a binary manner. Therefore, if the position of a search target image cannot be understood on the map, it is necessary to enlarge and display images one by one, which may cause an insufficient enlargement factor. Furthermore, if a large number of thumbnails to be displayed exist on the map, it is impossible to overlappingly display the thumbnails. As a result, the size of thumbnails is reduced to make the thumbnail list useless. Moreover, as described above, if the number of thumbnails displayed in a list is increased, it takes much time to display them.
An image processing apparatus and image processing method are described. In one embodiment, an image processing apparatus generates a list display screen for displaying a thumbnail, wherein the list display screen comprises: a thumbnail list view of which display magnification is changeable; a list view window for displaying at least part of the thumbnail list view; and plural of the thumbnails of which size or resolution is changeable in accordance with the display magnification.
An embodiment of the present invention has been made in view of the above problems and may provide an image processing apparatus that displays thumbnails in a list and improves operability and search efficiency when the user searches for a search target image from the list and an image processing method.
According to an embodiment of the present invention, there is provided an image processing apparatus that generates a list display screen for displaying a thumbnail. In the image processing apparatus, the list display screen comprises a thumbnail list view of which display magnification is changeable; a list view window for displaying at least part of the thumbnail list view; and a plurality of the thumbnails of which size or resolution is changeable in accordance with the display magnification.
Reference numeral 110 denotes the server apparatus that performs the image classification in accordance with the command from the client apparatus 100 and outputs the results of the image classification to the client apparatus 100. Reference numeral 111 denotes an interface (hereinafter referred to as external I/F) with the external communications path 104. Reference numeral 112 denotes registration image data to be registered in an image information DB 114. Reference numeral 113 denotes a thumbnail generation processing unit that scales the registration image data 112 to a predetermined size or smaller to generate plural thumbnail images. Reference numeral 114 denotes the image information DB that accumulates the image data of the registration image data 112 and the thumbnail image data thereof. Reference numeral 118 denotes a display screen control processing unit that generates a display screen to be displayed in the client apparatus 100 and controls the display screen in accordance with the content of screen control data 120. Reference numeral 119 denotes display screen data to be displayed on the display device 101 of the client apparatus 100. Reference numeral 120 denotes the screen control data specified and input by the client apparatus 100. In
This embodiment exemplifies a case in which the server apparatus 110 is composed of a server computer and processing such as display screen generation is implemented by software. In other words, the processing in the server apparatus 110 is implemented by an application program (not shown). The embodiments of the present invention are not limited to this. The processing may be implemented by hardware in an apparatus such as a MFP, or the configuration of
Next, a description is made of an operations outline of this embodiment. The system of the first embodiment is roughly divided into two operations. One is an operation of registering images and the other is an operation of “using the images of the DB,” i.e., the operation of searching for, browsing, and acquiring (downloading from the screen apparatus) a desired image. In order to use an image, the user first searches for a desired image, browses it by using a viewer as an application, and then downloads it into his/her PC. Furthermore, there are image search techniques such as keyword search processing and similar image search processing. In this embodiment, an operation of searching for a search target image from a thumbnail list display, which is performed after the keyword search processing and the similar image search processing, is a search processing target operation for simplicity of description. Note, however, that there is also a case in which an image is searched for only from a thumbnail list display without performing the keyword search processing and the similar image search processing.
In step S001, the user issues an instruction for registering image data and specifies the registration image data 112 to be registered from the client apparatus 100 to the server apparatus 110 through the application program 102.
In step S002, the registration image data 112 are input to the server apparatus 110 through the external communications path 104 and registered in the image information DB 114 via the external I/F 111 where an ID is added together with accompanying meta-information such as a file name. At the same time, the thumbnail generation processing unit 113 reduces the registration image data 112 to generate different sizes of “plural thumbnail images” a predetermined size or smaller and registers them in the image information DB 141 after adding the IDs to them. If the registration image data 112 are plural pages of image data, thumbnails are generated on a page-by-page basis.
In this embodiment, plural thumbnail images different in size for each registration image are generated. As a method of generating a thumbnail, for example, the length of the long side of the thumbnail is defined for each thumbnail having a different size as shown in
For example, where the image size of the input registration image data 112 is 4000 pixels long by 2000 pixels wide, seven different sizes of thumbnails Sam1 through Sam7 are generated. In this case, the length of the long side of the thumbnail is the size shown in
In the image information DB, the accompanying meta-information such as an ID and a file name can easily be registered, managed, and searched for by the use of a general-purpose RDB (relational database). Furthermore, thumbnails and original image data may be compression-coded and accumulated as required and be configured to be linked from the meta-information so that they can be read. Furthermore, if meeting the above function, the image information DB 114 may establish and accumulate a hierarchical data structure by using a language such as XML (Extensible Markup Language) or accumulate it as a DB for each different server. For the image registration, image data may be directly registered in the server apparatus 110 from an image input device such as a scanner and a digital camera.
In step S101, the user instructs the server apparatus 110 to display thumbnails in a list by using the application program 102 of the client apparatus 100.
In step S102, when the instruction for displaying the list is received at the server apparatus 110, the display screen control processing unit 118 generates an initial screen for displaying a thumbnail list as shown in
The thumbnail list display screen of this embodiment is roughly composed of two screen structures. One is the thumbnail list view 302 and the other is the frame of a user interface part and an outer frame part. The application 102 of the client apparatus 100 synthesizes these two frames to generate a display screen for the display device 101. As a result, the screen of
The display screen control processing unit 118 generates the two types of display screens as described above. However, since the outer frame only serves to change the display magnification of the thumbnail list view 302 and the position of the sliders 305 and 306 of the display area, the description thereof is omitted. Here, the screen generation of the thumbnail list view 302 is specifically described.
When generating the initial screen, the display screen control processing unit 118 sets the display magnification (the lowest magnification in
Although the thumbnail list view 302 becomes the screen as shown in FIG. 6B, it is not necessary for the display screen control processing unit 118 to hold such images. It is only necessary for the display screen control processing unit 118 to hold the position information (coordinate information) of an individual display image and the ID information thereof. Furthermore, the thumbnail list view 302 transmits only the images of the display area 307 to the client apparatus 100. The generation of the thumbnail list view 302 is described later. Furthermore, although the center of the screen is enlarged as the display magnification increases, it is necessary to provide the margin of the screen of the
As the generation method for the display screen and the communication method between the server apparatus 110 and the client apparatus 100 described above, various techniques are available. As a commonly used one, a World-Wide-Web based technique using the server apparatus 110 as a Web server can perform these methods. It is possible for the display screen 119 to be written in HTML and a general Web browser to be used as the application 102. Furthermore, in this embodiment, the scrolling sliders for changing the display magnification and the display area are provided in the screen, but a function equivalent to the sliders may be provided to an input device such as a mouse of the client apparatus 100.
Now, let us return to
In step S104, the user using the client apparatus 100 browses the display screen data 119, operates the sliders 305 and 306 for changing the display area to search for a search target image, and operates the slider 304 for setting the display screen magnification to change the display magnification. Accordingly, the user gives an instruction for changing the screen scroll and the display magnification. The operation of the sliders is performed by the use of the input device 103 (not shown).
In step S105, the instruction for changing the screen scroll and the display magnification is converted into display-area data and display-magnification data as the screen control data 120 and transmitted to the server apparatus 110.
In step S106, upon receipt of the screen control data 120, the server apparatus 110 changes the thumbnail list view screen as described below. In step S107, the display screen 119 after being changed is displayed on the display device 101 in the same manner as step S103. In step S108, if the user cannot find the search target image, the operations of steps S104 through S107 are repeated.
In step S201, when the screen control data 120 are input from the client apparatus 100, the display magnification and the display area 307 of the thumbnail list view are set. For an initial screen, the predetermined setting values are stored in the server apparatus 110.
In step S202, the size of a thumbnail to be displayed is set in accordance with the display magnification. In other words, the setting of the size of the thumbnail means to set the type of the thumbnail (Sam1 through Sam8 of
If the size of the thumbnail corresponding to the display magnification falls between the values of
In step S203, the type of the thumbnail corresponding to the image data included in the display area 307 of the thumbnail list view is selected and determined.
In step S204, as for the selected thumbnail, the image in the display area of the thumbnail list view is generated. There is a method of converting the screen data of the thumbnail list view into bitmap data. However, since a method of writing the coordinate information of images and the link information thereof in a structured document is generally used in HTML, it is necessary to transfer the data of each thumbnail image of the structured document and the display area from the server apparatus 110 to the client apparatus 100.
As described above, in the method of searching for an image from the thumbnail list view, this embodiment makes it possible to smoothly and continuously search for a target image while confirming the contents of plural images without opening another window such as a viewer, thereby improving operability. Furthermore, since the size of a thumbnail (or resolution) is changed according to the display magnification to alter the fineness degree of the thumbnail in this embodiment, it is possible to confirm the content of an image without lowering its quality every time the enlargement factor is increased. For example, as a simple method of enlarging an image, an individual thumbnail is typically enlarged for each image. However, this method makes it difficult to discriminate a character image or the like because a fine image cannot be obtained even if the image once reduced in size is enlarged.
Furthermore, since plural thumbnail images different in size for each image are held in this embodiment of the present invention, it is not necessary to transfer a large size image just for confirming the content of the image. That is, since it is only necessary to transfer the thumbnail image of the size adapted to the display magnification, the amount of data to be transferred until the confirmation of the content of the image is small and the transfer time is reduced, to thereby improve search efficiency. Furthermore, when a screen with a large number of thumbnails is displayed, it is possible to use a thumbnail image smaller than the typical one. Therefore, in this case, the transfer time is further reduced to improve search efficiency. Furthermore, since only the data of the thumbnail image in the display area of the screen are transferred, the transfer time is also reduced for a large size thumbnail image to improve search efficiency.
In the first embodiment, there is described the method of constituting the thumbnail list view with the structured document and the link using such as HTML. If there are many thumbnails in the display area as in the case of low magnification, however, the image at the low magnification is generated at the image registration, accumulated in the image information DB 114 together with images and thumbnail data, and processed as image data. Accordingly, the data transfer time is further reduced, the amount of data to be processed by the server apparatus 110 becomes small, and the time waiting for the display of the image on the screen is reduced, so that search efficiency is improved.
Although the arrangement of thumbnails is not particularly taken into consideration in the first embodiment, it is more efficient to search for a target image if there is employed the arrangement in which images having the same attribute are placed near the target image in searching for the image from a thumbnail list view. Accordingly, in this embodiment, image classification processing is performed to represent modes of classification on the screen in order to improve search efficiency. In the following description, a document image frequently used at offices is referred to as a target image. Note that although processing is performed with one image data group in this embodiment, the present invention is not limited to this.
(Classification Processing)
Although various clustering and classification processing techniques for document images have been proposed, here is exemplified a classification processing technique as described below. For example, plural characteristic amounts (color characteristic amount, shape characteristic amount, and layout characteristic amount) are calculated from a registered document image. In other words, the color characteristic amount relating to the color of an image such as the background color and the color distribution of a document image is calculated from the registered document image, and the shape characteristic amount relating to the shape of an image such as the edge and the texture of a document image is calculated from the registered document image. For calculation of the layout characteristic amount, an image is divided into objects on an image-element-by-image-element basis, the attributes of the objects are determined to obtain layout information, and then an arrangement, an area ratio, or the like is calculated for each object attribute (e.g., a title, a character, a graphic, a picture, a table).
With the plural characteristic amounts, the following plural category identification processes are performed. The category type for identification consists of color category identification, shape category identification, layout category identification, and document type identification. In other words, the color category identification is that the background color, the most frequently used color, or the like as the color characteristic amount is input as a representative color and classified into an approximate one of the categories such as red, blue, green, yellow, and white. The shape category identification is such that a document image is classified into a category based on the similarity of plural characteristic amounts such as the edge and the texture of the image. The layout category identification may classify an image in the same manner as the shape category identification. For identification of a document type, an image is classified into a category by the use of the characteristic of the document type such as column setting from among plural layout characteristic amounts in a two-way search manner. Alternatively, pairs of characteristic amount data of layouts and answer data of document types to be identified are previously learned as teacher data by a learning machine for machine learning or the like. The document type is thus identified based on the layout characteristic amount using the learning data.
In this embodiment, the classification is performed based on the above methods.
In step S003, the registration image data 112 are subjected to the classification processing with the classification processing unit 115, and respective category data are registered in the image information DB 114 together with other meta-information.
The classification categories set at the image registration are used to arrange an image on the thumbnail list view 302. Since the operations thereof are the same as those of the first embodiment, they are not described. Below, a description is made of the thumbnail list view of this embodiment.
In
As described above, since the classification is displayed on the thumbnail list view according to this embodiment of the present invention, the images having the same attribute are arranged adjacent to one another. In this case, it is possible to enlarge the display magnification without lowering the image quality. As a result, document images are efficiently refined.
In the first and second embodiments, plural thumbnail images different in size (or resolution) for each registration image are generated, but the data amount accumulated in the image information DB is caused to be increased. Accordingly, in this embodiment, an original image is compressed by hierarchical coding to reduce the data amount stored in the image information DB.
The hierarchical-coding conversion processing unit 116 hierarchically encodes the input registration image data 112. Since image data are generally compressed, they are hierarchically encoded after being decoded and decompressed.
As a hierarchical coding method, for example, a standard method ((part 1), ISO, IS15444-1) of JPEG-2000 is used in the embodiment of the present invention. Next, the encoding method and the progressive order of JPEG-2000 part 1 (hereinafter referred to as JPEG-2000) are briefly described.
Then, the image data are converted into a luminance/color difference signal by a color conversion processing unit 2. In JPEG-2000, two color conversions are defined according to the types (5×3 and 9×7) of a filter used in the Discrete Wavelet Transform (hereinafter referred to as DWT). Prior to the color conversion, a DC level shift is applied to each of the signals of RGB.
After the color conversion, the DWT is applied to the signal for each component by the DWT processing unit 3 to output wavelet coefficients for each component. The DWT is two-dimensionally performed. However, it is generally performed based on the convolution of a one-dimensional filter calculation using a calculation method called lifting calculation.
The sub-band at each decomposition level can be divided into areas called precincts where the aggregation of codes is formed. Furthermore, encoding is performed for each predetermined block called a code block.
Scalar quantization is applied to the wavelet coefficients output from the DWT processing unit 3 by a quantization processing unit 4. However, if lossless transformation is applied to the wavelet coefficient, the scalar quantization is not applied thereto or the wavelet coefficient is quantized as “1.” Furthermore, almost the same effect as the quantization is obtained in the below-described post quantization processing. The scalar quantization allows for the change of parameters for each tile.
Entropy encoding is applied to the quantization data output from the quantization processing unit 4 by an entropy coding processing unit 5. The entropy encoding method of JPEG-2000 divides (or does not divide the sub-band if the size of a sub-band area is smaller than or equal to that of a code block area) the sub-band into rectangular areas called code blocks and performs encoding for each block.
Furthermore, the data of the code block are decomposed into bit planes as shown in
The entropy-encoded code data are subjected to code truncation processing as needed by the post quantization processing unit 6. If it is necessary to output a lossless code, the post quantization processing is not performed. JPEG-2000 allows for the truncation of a code amount after the encoding and provides a configuration (one-pass encoding) of eliminating the feedback to control the code amount as the characteristic thereof. In a code stream generation processing unit 7, the code data after the post quantization processing are subjected to processing in which the codes are sorted in accordance with a predetermined progressive order (decoding order of the code data) and a header is added, thereby completing a code stream for the corresponding tile.
Next, a description is made of the progressive order of JPEG-2000. In JPEG-2000, the following five progressions are defined by changing the priority of four image elements of image quality (layer (L)), resolution (R), component (C), and position (precinct (P)).
(LRCP Progression)
Decoding is performed in the order of the precinct, the component, the resolution level, and the layer. Accordingly, the image quality of an entire image is improved as a layer index increases, so that the progression of the image quality can be achieved. This is also called a layer progression.
(RLCP Progression)
Decoding is performed in the order of the precinct, the component, the layer, and the resolution level. Accordingly, it is possible to achieve the progression of the resolution.
(RPCL Progression)
Decoding is performed in the order of the layer, the component, the precinct, and the resolution level. Accordingly, it is possible to achieve the progression of the resolution as in the case of RPCL progression. However, it is also possible to increase the priority at a specific position.
(PCRL Progression)
Decoding is performed in the order of the layer, the resolution level, the component, and the precinct. Accordingly, the decoding at a specific position is prioritized, so that the progression of a space position can be achieved.
(CPRL Progression)
Decoding is performed in the order of the layer, the resolution level, the precinct, and the component. Accordingly, for example, it is possible to achieve the progression of the component like a case in which a gray image is first reproduced when the progressive decoding is applied to a color image.
With the hierarchical coding method as represented by JPEG-2000, image data are held in the image information DB 114 and a thumbnail image is generated according to the resolution level adapted to the size of a thumbnail. Accordingly, it is possible to generate plural types of thumbnails different in resolution (size) just from the code data of an original image.
In step S302, the resolution level used for the display is set in accordance with the display magnification. In step S303, the image corresponding to the image data included in the display area 307 of the thumbnail list view is selected and determined. In step S304, a thumbnail image is generated based on the resolution level of the selected image data, and the screen of the display area of the thumbnail list view is generated.
As described above, this embodiment of the present invention provides a configuration in which a registration image is converted into hierarchical code instead of generating plural thumbnails different in size, and thumbnails different in resolution (size) are generated from the hierarchical code. Therefore, using only the code data amount of an original image makes it possible to achieve this embodiment of the present invention and reduce the data amount stored in the image information DB. Note that JPEG-2000 is used as the hierarchical coding method in this embodiment, but other hierarchical coding methods may also be used to achieve this embodiment of the present invention as a matter of course.
Although this embodiment is described using the configuration of the first embodiment as an example, it may also be applicable to the configuration of the second embodiment having the classification processing.
The above embodiment describes an example of selecting the image of the display area accumulated in the image information DB 114 at the generation of the display area screen of the thumbnail list view. However, if a large number of thumbnails are to be generated, the processing may be redundant. Accordingly, this embodiment describes an example of solving the redundant processing problem.
The configuration of this embodiment is the same as that of the third embodiment. The mode of accumulating image data in the image information DB of the first through third embodiments is not particularly restricted. For example, there may be employed a type in which individual image files exist in the directory structure such as a personal computer.
In this embodiment, the image of the thumbnail list view as shown in
As described above, this embodiment makes the thumbnail list view be processed as one image data group, to thereby simplify the processing at the generation of the display area screen. As a result, it is possible to reduce the time required for an image to be displayed on the screen and improve search efficiency. Furthermore, since the tiling is performed for each registration image in this embodiment, it is possible to easily cut out the “thumbnail list view image” for each registration image, further simplify the processing, and facilitate the processing for each registration image. For example, even when each registration image is sorted on the thumbnail list view screen, it is possible to achieve the processing just by rewriting header information.
The present invention is not limited to the specifically disclosed embodiments, and variations and modifications may be made without departing from the scope of the present invention.
The present application is based on Japanese Priority Patent Applications No. 2006-304012, filed on Nov. 9, 2006, and No. 2007-116070, filed on Apr. 25, 2007, the entire contents of which are hereby incorporated by reference.
Number | Date | Country | Kind |
---|---|---|---|
2006-304012 | Nov 2006 | JP | national |
2007-116070 | Apr 2007 | JP | national |