Referring now to the figures and, in particular,
Next, the characters in the documents are analyzed to determine which characters and character combinations are present and to determine which, if any, glyph variants of the font set are used within the document, and/or whether one or more combination of characters is represented by a single ligature in the font character set used to render the text in the document (step 30). For example, the document may use a font character set in which characters are represented by different glyphs, where a different glyph is used depending on whether the character is the initial, middle or final character in a word, as in Arabic and Indic character sets. Further, the font character set may include ligatures, which represent a combination of characters, such as “fi” which is represented by ligature “fi and “fl” which is represented by ligature “fl.” Therefore, if the document includes one or more ligatures, it will be determined that such a ligature is to be included in a font subset to be associated with the document (step 30).
A font subset is created which contains all of the character forms present in the document based on the analysis of the document, which includes all glyphs and ligatures present in the document (step 40). The font subset does not contain extraneous or unused glyphs or ligatures which may be present in a complete font set but are not used within the document analyzed. For example, should the font set include a character having a glyph form corresponding to when the character is present at the initial position of a word, and the document does not contain a word in which the character is in the initial position of a word, then the font subset created will not include that glyph form.
At step 50, the font subset is associated with the document as an embedded font set.
Referring now to
It will now be apparent to one of ordinary skill in the art that the present method provides features and advantages not found in prior font embedding methods. For example, the embedded fonts associated with text documents only include those characters present in the text of the document and not all characters which are present in the font set used in the document text. As a result, the embedded font subset will have a reduced size as compared with prior embedded fonts created using the prior art method of font subsetting, as the prior embedded font subsets include all character forms or glyphs for all characters of a document, regardless of whether a particular glyph form is actually used in the text of the document. Consequently, a document with embedded fonts, in accordance with the present invention, will have a reduced size, requiring less storage space for the electronic document and requiring a reduced data transmission bandwidth when being sent as an electronic document.
Although the invention has been described above in relation to preferred embodiments thereof, it will be understood by those skilled in the art that variations and modifications can be effected in these preferred embodiments without departing from the scope and spirit of the invention.