BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 shows increases in average processing time with increasing number of states;
FIG. 2 shows comparison between Deltarser and existing XML parsers in state transition generation overhead;
FIG. 3 is a functional block diagram of a structured document processing apparatus of an embodiment of the present invention;
FIG. 4 illustrates a method for assigning IDs to state transitions and a method for counting repetitive elements;
FIG. 5 shows an optimized automaton;
FIG. 6 shows a loop automaton;
FIG. 7 shows an expanding automaton;
FIG. 8 shows a first XML instance document;
FIG. 9 shows a structure of an automaton of the first XML instance document before optimization;
FIG. 10 shows a flow of processing nested repetitive elements;
FIG. 11 shows a structure of optimized automaton of the first XML instance document 1;
FIG. 12 shows a second XML instance document;
FIG. 13 shows a structure of optimized automaton of the second XML instance document;
FIG. 14 shows classification of optimizations;
FIG. 15 shows an automaton of simple-type elements before optimization;
FIG. 16 shows an optimized automaton of the simple-type elements;
FIG. 17 shows a first exemplary XML Schema;
FIG. 18 shows a compositor;
FIG. 19 shows a second exemplary XML Schema;
FIG. 20 shows an example of state transitions due to blanks;
FIG. 21 shows an XML Schema used in an experiment in a first embodiment;
FIG. 22 shows an XML instance document used in the experiment in the first embodiment;
FIG. 23 shows result of the experiment in the first embodiment; and
FIG. 24 shows a graph of result of the experiment in the first embodiment.