This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2016-004797, filed on Jan. 13, 2016, the entire contents of which are incorporated herein by reference.
The embodiments discussed herein are related to an encoding program, an encoding method, an encoding device, a decoding program, a decoding method, and a decoding device.
For example, structured document data such as Extensible Markup Language (XML) have been utilized. XML has widely been spread as, for example, a compatible format for exchanging data between different systems. A variety of document data are disclosed in a structured format such as XML. Herein, structured document data are stored, for example, in such a manner that the whole is compressed in a compression format such as zip, in order to reduce an amount of data for storage or communication thereof. In a case where compressed document data are utilized, decompression of the whole of the compressed document data is executed to restore the document data and a variety of analyses are executed for the restored document data. For example, in a case where whether a specific word is included in a specific hierarchy of document data is searched, a lexical analysis or a structural analysis is executed for restored document data.
Japanese Laid-open Patent Publication No. 2005-215951
Japanese Laid-open Patent Publication No. 2002-297568
Japanese Laid-open Patent Publication No. 2005-018672
However, in a case where document data with the whole having been compressed in a compression format such as zip are utilized, the whole of the compressed document data is decompressed and subsequently a variety of analyses are executed, so that an amount of processing is high. Compressed document data may be utilized in a terminal with a low processing capacity such as a mobile terminal, and when an amount of processing for utilization thereof is high, the processing is time-consuming. Even in a case where only a part of document structure in a structured document is utilized, the whole of the document is decompressed in a case where the whole has been compressed in a compression format such as zip.
According to an aspect of the embodiments, a non-transitory computer-readable recording medium stores therein an encoding program that causes a computer to execute a process including: identifying document structure of a first structured document; and encoding a character string in a specific hierarchy of the first structured document with an encoding rule corresponding to the specific hierarchy utilizing the document structure.
The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention.
Preferred embodiments will be explained with reference to accompanying drawings. The scope of the right is not limited by the embodiments. It is possible to combine the respective embodiments with one another appropriately as long as processing contents thereof are consistent with one another.
Encoding Process
First, an outline of an encoding process will be described by using
The encoding target file 30 stores a document structured by, for example, XML. In XML, a document is structured in such a manner that the document is described by text and elements of the document are delimited by tags. An example of
In a case where encoding of the encoding target file 30 is executed, an encoding processing unit 40 of an encoding device 10 reads a document stored in the encoding target file 30 and identifies document structure of the document (
The encoding processing unit 40 encodes the read document according to an encoding rule for hierarchical structure that corresponds to document structure, and stores encoded data as encoded data 32 (
For example, the encoding processing unit 40 encodes character strings that define document structure according to a common encoding rule. In the example of
For example, the encoding processing unit 40 encodes a character string in each hierarchy according to an encoding rule that corresponds to a characteristic of a character string that appears in the hierarchy. It is sufficient that a code that is assigned in encoding is unique for each encoding rule. Thus, in a case where a character string that appears in a hierarchy is encoded according to an encoding rule for the hierarchy, it is sufficient that a code is unique for each hierarchy. For this reason, as hierarchies are different, an identical code can be assigned to different character strings. In the example of
The encoding processing unit 40 stores a character string to be converted and a code that corresponds to the character string in dictionary data 31 for an encoding rule. In the example of
The encoding processing unit 40 creates, for each encoding rule, an index 33 that indicates a pattern that appears in an encoded character string (
Searching Process
Next, an outline of a searching process that is executed by the encoding device 10 according to a first embodiment will be described by using
A file searching unit 50 of the encoding device 10 accepts input of a search condition. For example, in the example of
The file searching unit 50 searches a file that satisfies a search condition. For example, the file searching unit 50 refers to the dictionary data 31B provided by converting a character string in the hierarchy of “outline” and identifies the code B1 that corresponds to the character string of “XXX” (
As described above, none of the indices 33A and 33B may be created.
The file searching unit 50 accepts input of a search condition. For example, in the example of
The file searching unit 50 searches a file that satisfies a search condition. For example, the file searching unit 50 decodes tags encoded by a common encoding rule. The file searching unit 50 refers to the dictionary data 31B provided by converting a character string in the hierarchy of “outline” and partially decodes a code in the hierarchy of “outline” (
Device Configuration
Next, a configuration of the encoding device 10 will be described.
The storage unit 20 is a storage device such as a hard disk, a Solid State Drive (SSD), or an optical disk. The storage unit 20 may be a data-rewritable semiconductor memory such as a Random Access Memory (RAM), or a flash memory, a Non-Volatile Static Random Access Memory (NVSRAM).
The storage unit 20 stores an Operating System (OS) and a variety of programs that are executed by the control unit 21. For example, the storage unit 20 stores programs for executing an encoding process and a searching process as described later. The storage unit 20 also stores a variety of data that are used for the programs that are executed by the control unit 21. For example, the storage unit 20 stores an encoding target file 30, dictionary data 31, encoded data 32, and an index 33.
The encoding target file 30 are data provided by storing text data of an encoding target. For example, a document structured by XML is stored in the encoding target file 30.
The dictionary data 31 are data of a dictionary that is used for encoding and decoding of data.
In the present embodiment, when a structured document is encoded, an encoding rule is switched depending on structure or an attribute thereof. The dictionary data 31 are data of a dictionary that is used for an encoding rule that uses the dictionary to execute encoding. The dictionary data 31 are provided for each encoding rule that uses a dictionary to execute encoding. For example, the dictionary data 31 are provided for each hierarchy where a dictionary is used to execute encoding or each of hierarchies where a dictionary is used to execute encoding and data attributes thereof are similar to one another, among hierarchies of a hierarchized document. The dictionary data 31 include a static dictionary 34 and a dynamic dictionary 35.
The static dictionary 34 is data that hold a code that corresponds to a pattern with a high appearance frequency depending on structure or an attribute of a document. The dynamic dictionary 35 is data that hold a code that corresponds to a pattern with a low appearance frequency depending on structure or an attribute of a document. The static dictionary 34 is preliminarily provided. The dynamic dictionary 35 is dynamically created as needed.
The static dictionary 34 stores a code that corresponds to a character string, according to a characteristic of a character string that appears in a corresponding hierarchy. For example, the static dictionary 34 stores a code that corresponds to a character string or a pattern such as a number that normally appears in a corresponding hierarchy. The static dictionary 34 stores a pattern with a high appearance frequency in a corresponding hierarchy that has been associated with a short code. For example, a human body temperature normally falls within a range of 35.0° C. to 42.0° C. and is around 36.0° C. with a high appearance frequency. Accordingly, for example, the static dictionary 34 that corresponds to a hierarchy of a body temperature stores numerical values of 35.0 to 42.0 that have been associated with codes, and stores around 36.0 that has been assigned with a short code. In the present embodiment, a character string that appears in an outline is encoded in units of words. For example, in the present embodiment, a general document is analyzed, and thereby, words are classified into high-frequency words with a relatively high appearance frequency and low-frequency words with a relatively low appearance frequency. For example, high-frequency words are basic words from a top to a predetermined ranking in a descending order of an appearance frequency while low-frequency words are basic words from the predetermined ranking to a bottom. A high-frequency word is preliminarily assigned with a short code, and the high-frequency word and the assigned code are associated with one another and stored in the static dictionary 34. For example, a high-frequency word is preliminarily assigned with a 2-byte (16-bit) code and the assigned code is preliminarily stored in the static dictionary 34. A low-frequency word is dynamically assigned with a code when the low-frequency word appears, and the assigned code is stored in the dynamic dictionary 35. That is, a code is preliminarily registered for a high-frequency word, and dynamically assigned and stored in the dynamic dictionary 35 for a low-frequency word. In a case where a character string or a pattern such as a number that appears in an outline is determined as a specific pattern, the specific pattern and a code are associated with one another and preliminarily stored in the static dictionary 34 that corresponds to a hierarchy of the outline.
The dynamic dictionary 35 is data that hold a variety of information with respect to a dynamically assigned code, according to a characteristic of a character string that appears in a corresponding hierarchy. For example, the dynamic dictionary 35 that corresponds to a hierarchy of an outline stores a code that is dynamically assigned to a pattern with a low appearance frequency such as a low-frequency word.
The dictionary data 31 are provided in each hierarchy where a dictionary is used to execute encoding or each of hierarchies where a dictionary is used to execute encoding and data attributes thereof are similar to one another, and for codes of “6*h” to “F*h”, a character string and a code are associated with one another and stored, according to a characteristic of a character string that appears in a hierarchy.
The dictionary data 31 may be capable of dynamically assigning a code to a tag.
By returning to
The control unit 21 is a device that controls the encoding device 10. For the control unit 21, an electronic circuit such as a Central Processing Unit (CPU) or a Micro Processing Unit (MPU), or an integrated circuit such as an Application Specific Integrated Circuit (ASIC) or a Field Programmable Gate Array (FPGA) can be employed. The control unit 21 includes an internal memory for storing a program that defines a variety of processing steps, and control data, and thereby, executes a variety of processes. The control unit 21 functions as a variety of processing units when a variety of programs operate. For example, the control unit 21 includes an encoding processing unit 40, a file searching unit 50, and a decoding processing unit 60.
The encoding processing unit 40 reads a structured document stored in the encoding target file 30 and creates the encoded data 32 provided by encoding the read document according to an encoding rule for hierarchical structure that corresponds to document structure. The encoding processing unit 40 includes an identification unit 41, an encoding unit 42, and a creation unit 43.
The identification unit 41 executes a variety of identification. For example, the identification unit 41 identifies document structure of an XML document stored in the encoding target file 30. For example, in a case where a schema of XML is defined so as to correspond to the encoding target file 30, the identification unit 41 identifies document structure based on the schema that corresponds to the encoding target file 30.
In a case where the XML schema 70 is defined so as to correspond to the encoding target file 30, the identification unit 41 identifies document structure based on the XML schema 70. The identification unit 41 may analyze a document stored in the encoding target file 30 to identify document structure.
The encoding unit 42 executes encoding of a document stored in the encoding target file 30. For example, the encoding unit 42 reads an XML document with document structure identified by the identification unit 41 from the encoding target file 30. The encoding unit 42 encodes the read document according to an encoding rule for hierarchical structure that corresponds to the document structure. For example, the encoding unit 42 sequentially assigns codes to tags that appear in the read document to execute encoding thereof. In a case where a tag that appears in document structure is defined, dictionary data for tag with a tag and a code that have been associated with one another are stored preliminarily, and the encoding unit 42 may use the dictionary data for tag to encode a tag that appears in the read document. Some tags with a high appearance frequency in document structure are stored in dictionary data for tag and the encoding unit 42 may use the dictionary data for tag to encode some tags with high appearance frequency and sequentially assign codes to other tags to execute encoding.
Herein, a structured document includes a document with document structure indicated by delimiting document elements by tags and a document with metadata provided on a part of the document by a tag.
The encoding unit 42 encodes tags according to a common encoding rule. For the document of example 1, tags of “outline” and “body” are encoded according to a common encoding rule. For the document of example 2, tags of “patent”, “title”, “object”, and “advantage” are encoded according to a common encoding rule.
The encoding unit 42 encodes a character string in a part delimited by tags according to an encoding rule that corresponds to each hierarchy. For example, the encoding unit 42 encodes a character string in a part delimited by tags, by using the dictionary data 31 that correspond to each hierarchy. For example, in a case where a word that appears in a character string has been registered in the static dictionary 34 or the dynamic dictionary 35 of the dictionary data 31 that correspond to a hierarchy, the encoding unit 42 encodes the appearing word into a code registered in the static dictionary 34 or the dynamic dictionary 35. In a case where a word that appears in a character string has not been registered in the static dictionary 34 or the dynamic dictionary 35 of the dictionary data 31 that correspond to a hierarchy, the encoding unit 42 dynamically assigns a code thereto, so that the appearing word is encoded into the assigned code. The encoding unit 42 associates the appearing word and the assigned code with one another and registers the appearing word and the assigned code in the dynamic dictionary 35. Thereby, afterward, words registered in the dynamic dictionary 35 are encoded into an identical code by using the dynamic dictionary 35 when the words appear. The encoding unit 42 may encode character strings in hierarchies with similar data attributes according to an identical encoding rule. Thereby, the encoding unit 42 can encode character strings in hierarchies with similar data attributes based on the identical dictionary data 31.
The encoding unit 42 encodes tags according to a common encoding rule. For the document of example 3, a tag of “link” is encoded by a common encoding rule. For the document of example 4, tags of “medical condition”, “disease name”, and “medicine name” are encoded according to a common encoding rule. For the document of example 5, tags of “personal name”, “place name”, and “date” are encoded according to a common encoding rule. The encoding unit 42 encodes a character string on a part delimited by tags according to an encoding rule for each hierarchy. For example, the encoding unit 42 encodes a character string on a part delimited by tags by using the dictionary data 31 that correspond to each hierarchy.
The encoding unit 42 can assign an identical code to different character strings in different hierarchies, and hence, can convert a character string into a short code in each hierarchy. For example, in the example of
The encoding unit 42 may encode a character string on a part delimited by tags without using the dictionary data 31, depending on an attribute or a range of the character string. For example, in a case where a character string on a part delimited by tags is a character string that indicates a numerical value in a range of “0” to “255”, the encoding unit 42 may encode the character string that indicates a numerical value in a range of “0” to “255” into a 1-byte integer-type (for example, int-type) code. That is, in a case where character strings indicate numerical values, the encoding unit 42 may encode the character strings into codes with a data type that corresponds to a range of the numerical values. As character strings that represent numerical values are encoded into codes with a data type of the numerical values, a variety of operations such as comparison or totalization of the numerical values can be executed even in encoded states thereof.
Herein, in a structured document in XML or the like, context is defined by a tag. Context of a structured document is defined by a tag and an element in association with data processing is determined by the context. For example, an element in association with a dictionary is determined, such as a type or a value range of data, or a component of a document (with respect to language, a Japanese word, an English word, or a word in another language). For example, how data content can be utilized, namely, a field of utilization thereof, such as searching or mining for text, or an average value, a total value, or a frequency distribution for numerical values, is determined. As illustrated in
The encoding unit 42 stores encoded data of a document stored in the encoding target file 30 as the encoded data 32.
The creation unit 43 creates, for each encoding rule, the index 33 that indicates a pattern appearing in an encoded character string. For example, the creation unit 43 sequentially provides a file number to the encoding target file 30 that has been encoded. The creation unit 43 creates the index 33 that stores the number of appearances of a pattern such as a numerical value or a word that appears in the encoding target file 30, in association with a file number of the encoding target file 30 that has been encoded.
Herein, a flow of encoding will be described.
The encoding processing unit 40 stores encoded data of the document stored in the encoding target file 30, as the encoded data 32. In an example of
By returning to
The acceptance unit 51 accepts a search condition. For example, the acceptance unit 51 provides an input interface such as an operation screen that accepts input of a keyword or a hierarchy that is a search condition, and accepts input of a character string or a hierarchy that is a search condition.
The searching unit 52 searches a file that satisfies a search condition. For example, the searching unit 52 identifies a code that corresponds to a keyword of a search condition, with reference to the static dictionary 34 and the dynamic dictionary 35 of the dictionary data 31 that correspond to a hierarchy of the search condition. The searching unit 52 identifies a file number of a file with the identified code appearing therein, with reference to the index 33 that corresponds to a hierarchy of a search condition. In a case where a keyword of a search condition includes a plurality of words or numerical values, the searching unit 52 decomposes the keyword into the words or the numerical values to encode the words or the numerical values, and identifies a code that corresponds to each of the words or the numerical values. The searching unit 52 identifies a file number of a file with a code corresponding to each of words or numerical values appearing therein, with reference to the index 33 that corresponds to a hierarchy of a search condition. Herein, for the index 33, it may be impossible to confirm whether an order of appearances of a plurality of words or numerical values that are included in a character string of a search condition is correct. Accordingly, for example, the searching unit 50 searches whether a character string of a search condition is included in the encoding target file 30 with the identified file number. The searching unit 52 may decode a hierarchy of a search condition of the encoded data 32 that correspond to the identified file number to search whether a character string of the search condition is included therein.
Thus, the file searching unit 50 can execute searching without decoding the encoded data 32, and hence, can reduce an amount of processing for searching so that processing time for searching can be reduced.
In a case where the index 33 is not created, the file searching unit 50 decodes only a specified hierarchy to search a specified character string. In this case, the file searching unit 50 can also execute searching by decoding only a specified hierarchy, and hence, can reduce an amount of processing for utilization thereof so that processing time for searching can be reduced, as compared with a case where the whole of encoded data is encoded.
The output unit 53 executes output of a result of searching. For example, in a case where a file number is identified by the searching unit 52, the output unit 53 outputs a file name of a file with the identified file number as a result of searching. On the other hand, in a case where a file number is not identified by the searching unit 52, the output unit 53 outputs no corresponding file as a result of searching.
By returning to
The acceptance unit 61 accepts an instruction of decoding. For example, the acceptance unit 61 provides an input interface such as an operation screen that accepts specification of the encoded data 32 that is a target for decoding, and accepts specification of the encoded data 32 that is a target for decoding. The acceptance unit 61 may accept specification of a hierarchy for decoding as well as the encoded data 32 that is a target for decoding.
The decoding unit 62 decodes the encoded data 32 that have been specified. For example, the decoding unit 62 decodes code data in each hierarchy of the encoded data 32 according to an encoding rule for the hierarchy. For example, the decoding unit 62 decodes code data in each hierarchy of the encoded data 32 into a character string by using the static dictionary 34 and the dynamic dictionary 35 of the dictionary data 31 that correspond to the hierarchy. For example, the decoding unit 62 decodes code data of tags according to a common encoding rule. The decoding unit 62 decodes code data in each hierarchy delimited by tags into a character string, with reference to the static dictionary 34 and the dynamic dictionary 35 of the dictionary data 31 that correspond to the hierarchy. In a case where specification of a hierarchy for decoding is accepted by the acceptance unit 61, the decoding unit 62 may decode only code data in a specified hierarchy.
Processing Flow
A flow of an encoding process for the encoding device 10 according to the present embodiment to encode the encoding target file 30 will be described.
As illustrated in
Next, a flow of a searching process for the encoding device 10 according to the present embodiment to search a file that satisfies a search condition will be described. First, a flow of a searching process in a case where a hierarchy is not specified for a search condition will be described.
As illustrated in
On the other hand, in a case where a code is present (S20, Yes), the searching unit 52 identifies a file number of a file with an identified code appearing therein, with reference to the index 33 (S24).
The output unit 53 outputs a result of searching and the process is ended (S25). For example, the output unit 53 outputs a file name of the encoding target file 30 in a case where the encoding target file 30 that includes a character string of the search condition is searched or a case where a file number of the encoding target file 30 is identified by the searching unit 52.
Next, a flow of a searching process in a case where a hierarchy is specified for a search condition will be described.
As illustrated in
On the other hand, in a case where a code is present (S30, Yes), the searching unit 52 identifies a file number of a file with an identified code appearing therein, with reference to the index 33 in a specified hierarchy (S34).
The output unit 53 outputs a result of searching and the process is ended (S35). For example, the output unit 53 outputs a file name of the encoding target file 30 in a case where the encoding target file 30 that includes a character string of the search condition is searched or a case where a file number of the encoding target file 30 is identified by the searching unit 52.
Next, a flow of a decoding process of the encoding device 10 according to the present embodiment to decode the encoded data 32 will be described.
The decoding unit 62 reads code data from the encoded data 32 that have been specified (S40). The decoding unit 62 decodes the read code data into a character string by using the static dictionary 34 and the dynamic dictionary 35 of the dictionary data 31 that correspond to a hierarchy (S41). The decoding unit 62 determines whether or not reading of the encoded data 32 has been completed (S42). In a case where reading has not been completed (S42, No), transfer to S40 is executed. On the other hand, in a case where reading has been completed (S42, Yes), the process is ended.
Advantage
As described above, the encoding device 10 according to the present embodiment identifies document structure of a structured document. The encoding device 10 encodes a character string in a specific hierarchy in the document with the identified document structure, in an encoding rule for a hierarchical structure that corresponds to the document structure. Thereby, the encoding device 10 can decode only a code in a specific hierarchy part, and hence, can reduce an amount of processing for utilization thereof.
The encoding device 10 according to the present embodiment encodes character strings that define document structure in a document according to a common encoding rule. Thereby, the encoding device 10 can execute decoding in a common encoding rule to restore character strings that define document structure in a document in an identical encoding rule, and hence, the document structure can be identified quickly, so that data in a specific hierarchy can be extracted.
The encoding device 10 according to the present embodiment encodes character strings in hierarchies with similar data attributes in an identical encoding rule. Thereby, the encoding device 10 can encode character strings in hierarchies with similar data attributes by the identical dictionary data 31.
The encoding device 10 according to the present embodiment encodes a character string in a specific hierarchy, according to an encoding rule that corresponds to a characteristic of a character string that appears in the specific hierarchy. Thereby, the encoding device 10 can encode a character string in a specific hierarchy in an encoding rule that corresponds to a characteristic thereof.
The encoding device 10 according to the present embodiment executes encoding according to an encoding rule that converts a pattern with a high appearance frequency into a short code, in a single hierarchy or a plurality of hierarchies with similar data attributes. Thereby, the encoding device 10 can encode the encoding target file 30 at a high compression rate.
The encoding device 10 according to the present embodiment creates the index 33 that indicates a pattern that appears in an encoded character string. Thereby, the encoding device 10 can identify the encoding target file 30 with an appearing pattern based on the index 33.
Although the embodiment that relates to the disclosed device has been described above, the disclosed technique may be implemented in a variety of different embodiments other than the embodiment as described above. Hereinafter, other embodiments that are included in the present invention will be described.
For example, although a case where a code that corresponds to a pattern with a high appearance frequency is preliminarily stored in the static dictionary 34 of the dictionary data 31 has been described in the embodiment as described above, this is not limiting. For example, an appearance frequency of each appearing pattern such as a word or a number in a character string may be obtained by analysis in each hierarchy of a document, so as to assign a short code to a pattern with a high appearance frequency for encoding thereof. The dictionary data 31 may associate the appearing pattern and the assigned code with one another and store the appearing pattern and the assigned code.
Although a case where a code is stored in the dictionary data 31 in units of hierarchical structure has been described in the embodiment describe above, this is not limiting. For example, the common dictionary data 31 may be used. A part of codes may commonly be registered and managed in the dictionary data 31 in units of hierarchical structure.
Each component of each device as illustrated in the drawings is functionally conceptual and is not requested to be physically configured as illustrated in the drawings. That is, a specific state of dispersion or integration in each device is not limited to that illustrated in the drawings, and a configuration thereof can be provided in such a manner that all or a part thereof can be dispersed or integrated functionally or physically in arbitrary units, depending on a variety of loads, usage, or the like. For example, respective processing units of the encoding device 10 that are the identification unit 41, the encoding unit 42, the creation unit 43, the acceptance unit 51, the searching unit 52, the output unit 53, the acceptance unit 61, and the decoding unit 62 may be integrated appropriately. A process for each of the above-mentioned processing units of the encoding device 10 may appropriately be separated into processes for a plurality of processing units. All or any part of each processing function that is executed in each processing unit can be realized by a CPU and a program that is analyzed and executed by the CPU or realized by hardware based on wired logic.
Encoding Program
A variety of processes described for the embodiment as described above can also be realized by executing a preliminarily prepared program in a computer system such as a personal computer or a work station. Hereinafter, an example of a computer system that executes a program that has a function identical to that of the embodiment as described above will be described. First, an encoding program for executing an encoding process will be described.
As illustrated in
The HDD 420 preliminarily stores an encoding program 420a that fulfills functions similar to those of the identification unit 41, the encoding unit 42, and the creation unit 43 of the encoding device 10 as described above. The encoding program 420a may be divided appropriately.
The HDD 420 stores a variety of information. For example, the HDD 420 stores a variety of data that are used for an OS or encoding.
The CPU 410 reads the encoding program 420a from the HDD 420 and executes the encoding program 420a, so that an operation similar to that of each processing unit of the embodiment is executed. That is, the encoding program 420a executes operations similar to those of the identification unit 41, the encoding unit 42, and the creation unit 43.
The encoding program 420a as described above is not requested to be stored in the HDD 420 from the start.
Searching Program
Next, a searching program for searching the encoded data 32 will be described.
As illustrated in
The HDD 420 stores a variety of information. For example, the HDD 420 stores a variety of data that are used for an OS or searching.
The CPU 410 reads the searching program 420b from the HDD 420 and executes the searching program 420b, so that an operation similar to that of each processing unit of the embodiment is executed. That is, the searching program 420b executes operations similar to those of the acceptance unit 51, the searching unit 52, and the output unit 53.
The searching program 420b as described above is also not requested to be stored in the HDD 420 from the start.
Decoding Program
Next, a decoding program for decoding a file that satisfies a searching condition will be described.
As illustrated in
The HDD 420 stores a variety of information. For example, the HDD 420 stores a variety of data that are used for an OS or decoding.
The CPU 410 reads the decoding program 420c from the HDD 420 and executes the decoding program 420c, so that an operation similar to that of each processing unit of the embodiment is executed. That is, the decoding program 420c executes operations similar to those of the acceptance unit 61 and the decoding unit 62.
The decoding program 420c as described above is also not requested to be stored in the HDD 420 from the start.
For example, the encoding program 420a, the searching program 420b, the decoding program 420c may be stored in a “portable physical medium” such as a flexible disk (FD), a CD-ROM, a DVD disk, a magneto-optical disk, or an IC card that is inserted into the computer 400. The computer 400 may read a program from a portable physical medium and execute the program.
A program is stored in “another computer (or server)” or the like that is connected to the computer 400 through a public line, the internet, a LAN, a WAN, or the like. The computer 400 may read a program from another computer (or server) and execute the program.
According to one embodiment, an advantage is provided that encoding that corresponds to document structure can be executed.
All examples and conditional language recited herein are intended for pedagogical purposes of aiding the reader in understanding the invention and the concepts contributed by the inventors to further the art, and are not to be construed as limitations to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although the embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
2016-004797 | Jan 2016 | JP | national |