The present invention relates to a data processing method and apparatus and, in particular, discloses a data encoding method and apparatus for storing data on photographs using an ink jet printing system using an infra-red ink wherein the data is original image data taken from a camera system which has been transformed by an image processing program, the data also including a copy of the program.
Various methods, systems and apparatus relating to the present invention are disclosed in the following co-pending applications filed by the applicant or assignee of the present invention with the present application:
The disclosures of these co-pending applications are incorporated herein by reference.
Various methods, systems and apparatus relating to the present invention are disclosed in the following co-pending application filed by the applicant or assignee of the present invention on Jul. 10, 1998:
The disclosures of this co-pending application are incorporated herein by reference.
Various methods, systems and apparatus relating to the present invention are disclosed in the following co-pending applications filed by the applicant or assignee of the present invention on Jun. 30, 2000:
The disclosures of this co-pending application are incorporated herein by reference.
As the applicant has previously noted in pending applications U.S. Ser. No. 09/113,070 and U.S. Ser. No. 09/112,785 there is a general need for a print media scanning system that allows for high volumes of computer data to be stored on a simple print media, such as a card while simultaneously tolerating a high degree of corruption when read by a scanning device. For example, the form of distribution can suffer a number of data corruption errors when the surface is scanned by a scanning device. The errors can include:
Ideally, any scanning system should be able to maintain its accuracy in the presence of errors due to the above factors.
In applications U.S. Ser. No. 09/113,070 and U.S. Ser. No. 09/112,785, the applicant disclosed a method and apparatus for printing data in an encoded fault tolerant form on the back of a photograph preferably using black ink on a white background. The data represented the photograph in a digital image file format and/or data comprising a computer programme script which could be run to recreate the image or to apply some effect to the image. A programming language called a VARK script was invented for this purpose which was designed to be portable and device independent.
According to an aspect of the present disclosure, a method of storing data on a photograph containing an image includes the steps of: delimiting a plurality of data block regions on the photograph; compressing data representing the image contained in the photograph with a compression technique; redundantly encoding the compressed data using Reed-Solomon encoding techniques; shuffling the redundantly encoded compressed data in a deterministic manner to reduce potential effects of localized encoded data damage to the image of said photograph; and printing the shuffled data as dots within the data block regions on the photograph. The printing of the shuffled data is performed simultaneously with a printing out of the image of the photograph.
Notwithstanding any other forms which may fall within the scope of the present invention, preferred forms of the invention will now be described, by way of example only, with reference to the accompanying drawings in which:
The present invention preferably uses an ink jet printing system having at least four ink jet print nozzles per dot in a pagewidth printhead. The four inks would be cyan, magenta, and yellow for printing a color image and an infra-red (IR) ink for printing data in an encoded fault tolerant form along with the color image. One such ink jet printhead which can print using four inks is disclosed in the applicant's co-pending applications U.S. Ser. No. 09/608,779, U.S. Ser. No. 09/607,987, U.S. Ser. No. 09/608,776, U.S. Ser. No. 09/607,250, and U.S. Ser. No. 09/607,991.
Infra-red inks suitable for use with the current invention are disclosed in the applicant's co-pending applications, Australian provisional patent applications PQ9412 and PQ9376 both filed on Aug. 14, 2000 and applicant's applications PQ9509 filed on Aug. 18, 2000, and PQ9571, and PQ9561 filed on Aug. 21, 2000.
Techniques that can be used to encode the information for printing an infra-red ink are disclosed applicant's co-pending application U.S. Ser. No. 09/113,070 and U.S. Ser. No. 09/112,785, the description of which is incorporated herein by reference. These techniques were described as Artcard, alternative Artcard or Dotcard formats. In these applications, the data was printed using a black ink on a white background on the back of a card of size 85 mm×55 mm in an active data area of 80 mm×50 mm. In this way 967 Kbytes of data was fault tolerantly encoded as 1.89 Mbytes of data using 15,876,000 printed dots.
Of course, while other encoded data formats are possible, there will now be described one such encoded data format using the “alternative Artcard” format as above referred to) with a number of preferable features.
The Encoded data can be used to recover the image over which it is written or to provide a digital format thereof for manipulation in applications, for example transmission over a digital telecommunication network or image processing in a computer.
Encoded data technology can also be independent of the printing resolution. The notion of storing data as dots on print media simply means that if it is possible to put more dots in the same space (by increasing resolution), then those dots can represent more data. The preferred embodiment assumes utilization of 1600 dpi printing on a 102 mm×152 mm (4″×6″) size photograph as the sample photograph, but it is simple to determine alternative equivalent layouts and data sizes for other photograph sizes and/or other print resolutions. For example, in the applicant's ink jet printing camera system a panoramic print can also be produced which is twice the length of the standard size photograph allowing twice the data to be recorded enhancing redundancy of the image data. Regardless of the print resolution, the reading technique remains the same. After all decoding and other overhead has been taken into account, the encoded data format is capable of storing 3 to 4 Megabyte of data for a 4″×6″ print size at print resolutions up to 1600 dpi. More encoded data can be stored at print resolutions greater than 1600 dpi.
The structure of data on the photograph is therefore specifically designed to aid the recovery of data. This section describes the format of the data on a photograph. This format was previously described in U.S. Ser. No. 09/113,070 and U.S. Ser. No. 09/112,785.
The dots printed on the photograph are in infra-red ink with or over a color image. Consequently a “data dot” is physically different from a “non-data dot”. When the photograph is illuminated by an infra-red source having complementary spectral properties to the absorption characteristics of the IR ink the data appears as a monochrome display of “black” on “white” dots. The black dots correspond to dots were the IR ink is and has absorbed the IR illumination and “white” dots correspond to areas of the color image over which no IR ink has been printed and reflecting the IR illumination substantially unattenuated or only partially attenuated. Hereinafter the terms black and white as just defined will be used when referring to the IR ink dots recording data.
In describing this embodiment, the term dot refers to a physical printed dot (of IR ink) on a photograph. When an encoded data reader scans encoded data, the dots must be sampled at least double the printed resolution to satisfy Nyquist's Theorem. The term pixel refers to a sample value from an encoded data reader device. For example, when 1600 dpi dots are scanned at 4800 dpi there are 3 pixels in each dimension of a dot, or 9 pixels per dot. The sampling process will be further explained hereinafter.
Turning to
Turning now to
Each data block 107 has dimensions of 627×394 dots. Of this, the central area of 595×384 dots is the data region 108. The surrounding dots are used to hold the clock-marks, borders, and targets.
The clock marks are symmetric in that if the encoded data is inserted rotated 180 degrees, the same relative border/clockmark regions will be encountered. The border 112, 113 is intended for use by an encoded data reader to keep vertical tracking as data is read from the data region. The clockmarks 114 are intended to keep horizontal tracking as data is read from the data region. The separation between the border and clockmarks by a white line of dots is desirable as a result of blurring occurring during reading. The border thus becomes a black line with white on either side, making for a good frequency response on reading. The clockmarks alternating between white and black have a similar result, except in the horizontal rather than the vertical dimension. Any encoded data reader must locate the clockmarks and border if it intends to use them for tracking. The next section deals with targets, which are designed to point the way to the clockmarks, border and data.
As shown in
As shown in
As shown in
The simplified schematic illustrations of
As illustrated in
From the encoded data reader's point of view, assuming no degradation to the dots, there are two possibilities:
As shown in
The actual interpretation of the bits derived from the dots, however, requires understanding of the mapping from the original data to the dots in the data regions of the photograph.
There will now be described the process of taking an original data file of maximum size 2,986,206 bytes and mapping it to the dots in the data regions of the 210 data blocks on a 1600 dpi photograph. An encoded data reader would reverse the process in order to extract the original data from the dots on a photograph. At first glance it seems trivial to map data onto dots: binary data is comprised of 1s and 0s, so it would be possible to simply write black and white dots onto the card. This scheme however, does not allow for the fact that ink can fade, parts of a card may be damaged with dirt, grime, or even scratches. Without error-detection encoding, there is no way to detect if the data retrieved from the card is correct. And without redundantly encoding, there is no way to correct the detected errors. The aim of the mapping process then, is to make the data recovery highly robust, and also give the encoded data reader the ability to know it read the data correctly.
There are four basic steps involved in mapping an original data file to data region dots:
The data to be recorded on the photograph may comprise several blocks, e.g.
For a high quality image, the source image data may be 2000×3000 pixels, with 3 bytes per pixel. This results in 18 Mbytes of data, which is more than can be stored in infra-red dots on the photo. The image data can be compressed by a factor of around 10:1 with generally negligible reduction in image quality using an image compression technique. Suitable image compression techniques include JPEG compression based on discrete cosine transforms and Huffman coding, wavelet compression as used in the JPEG2000 standard or fractal compression.
With 10:1 compression, the 18 Mbytes of a high quality image results in 1.8 Mbytes of compressed data.
The audio annotation data can also be compressed using, for example, MP3 compression.
The image processing control scrip will typically not consume more than 10 Kbytes of data, with the exception of images embedded in the script. These images should generally be compressed. A suitable image processing script language designed for photograph processing is the ‘Vark’ language developed by the present applicant and disclosed in U.S. Ser. No. 09/113,070 (Docket No. ART02US). The remaining data is small, and need not be compressed.
The mapping of data to encoded data dots relies heavily on the method of redundancy encoding employed. Reed-Solomon encoding is preferably chosen for its ability to deal with burst errors and effectively detect and correct errors using a minimum of redundancy. Reed Solomon encoding is adequately discussed in the standard texts such as Wicker, S., and Bhargava, V., 1994, Reed-Solomon Codes and their Applications, IEEE Press, Rorabaugh, C, 1996; Error Coding Cookbook, McGraw-Hill, Lyppens, H., 1997; Reed-Solomon Error Correction, Dr. Dobb's Journal, January 1997 (Volume 22, Issue 1).
A variety of different parameters for Reed-Solomon encoding can be used, including different symbol sizes and different levels of redundancy. Preferably, the following encoding parameters are used:
Having m=8 means that the symbol size is 8 bits (1 byte). It also means that each Reed-Solomon encoded block size n is 255 bytes (28-1 symbols). In order to allow correction of up to t symbols, 2t symbols in the final block size must be taken up with redundancy symbols. Having t=64 means that 64 bytes (symbols) can be corrected per block if they are in error. Each 255 byte block therefore has 128 (2×64) redundancy bytes, and the remaining 127 bytes (k=127) are used to hold original data. Thus:
Each of the two Control blocks 132, 133 contain the same encoded information required for decoding the remaining 23,518 Reed-Solomon blocks:
The Control Block is stored twice to give greater chance of it surviving. In addition, the repetition of the data within the Control Block has particular significance when using Reed-Solomon encoding. In an uncorrupted Reed-Solomon encoded block, the first 127 bytes of data are exactly the original data, and can be looked at in an attempt to recover the original message if the Control Block fails decoding (more than 64 symbols are corrupted). Thus, if a Control Block fails decoding, it is possible to examine sets of 3 bytes in an effort to determine the most likely values for the 2 decoding parameters. It is not guaranteed to be recoverable, but it has a better chance through redundancy. Say the last 159 bytes of the Control Block are destroyed, and the first 96 bytes are perfectly ok. Looking at the first 96 bytes will show a repeating set of numbers. These numbers can be sensibly used to decode the remainder of the message in the remaining 23,518 Reed-Solomon blocks.
To store a 3-color image each color either “on” or “off” and size 4″×6″ (102 mm×152 mm) at 1600 dpi resolution requires 210,400 bits or 26,300 bytes of data. A program in Vark script may be approximately 10-15 Kbytes long. If we assume a program of size 13,568 bytes, this requires 39,868 Kbytes to be encoded to store the image data and the program data, according to the invention. The number of Reed-Solomon blocks required is 314. The first 313 Reed-Solomon blocks are completely utilized, consuming 39,751 bytes (313×127). The 314th block has only 117 bytes of data (with the remaining 10 bytes all 0s).
A hex representation of the 127 bytes in each Control Block data before being Reed-Solomon encoded would be as illustrated in
Assuming all the encoded blocks have been stored contiguously in memory, a maximum 5,997,600 bytes of data can be stored on the photograph (2 Control Blocks and 23,518 information blocks, totaling 23,520 Reed-Solomon encoded blocks). Preferably, the data is not directly stored onto the photograph at this stage however, or all 255 bytes of one Reed-Solomon block will be physically together on the card. Any dirt, grime, or stain that causes physical damage to the card has the potential of damaging more than 64 bytes in a single Reed-Solomon block, which would make that block unrecoverable. If there are no duplicates of that Reed-Solomon block, then the entire photograph cannot be decoded.
The solution is to take advantage of the fact that there are a large number of bytes on the photograph, and that the photograph has a reasonable physical size. The data can therefore be scrambled to ensure that symbols from a single Reed-Solomon block are not in close proximity to one another. Of course pathological cases of photograph degradation can cause Reed-Solomon blocks to be unrecoverable, but on average, the scrambling of data makes the data much more robust. The scrambling scheme chosen is simple and is illustrated schematically in
Under this scrambling scheme, complete damage to 16 entire data blocks on the photograph will result in 64 symbol errors per Reed-Solomon block. This means that if there is no other damage to the photograph, the entire data is completely recoverable, even if there is no data duplication.
Once the original data has been Reed-Solomon encoded, duplicated, and scrambled, there are 5,997,600 bytes of data to be stored on the photograph. Each of the data blocks on the photograph stores 28,560 bytes.
The data is simply written out to the photograph data blocks so that the first data block contains the first 28,560 bytes of the scrambled data, the second data block contains the next 28,560 bytes etc.
As illustrated in
For example, a set of 5,997,600 bytes of data can be created by scrambling 23,520 Reed-Solomon encoded blocks to be stored onto an photograph. The first 28,560 bytes of data are written to the first data block. The first 48 bytes of the first 28,560 bytes are written to the first column of the data block, the next 48 bytes to the next column and so on. Suppose the first two bytes of the 28,560 bytes are hex D3 5F. Those first two bytes will be stored in column 0 of the data block. Bit 7 of byte 0 will be stored first, then bit 6 and so on. Then Bit 7 of byte 1 will be stored through to bit 0 of byte 1. Since each “1” is stored as a black dot, and each “0” as a white dot, these two bytes will be represented on the photograph as the following set of dots:
D3 (1101 0011) becomes: black, black, white, black, white, white, black, black
5F (0101 1111) becomes: white, black, white, black, black, black, black, black
The encoded image data is sent to an ink jet printer to drive the infra-red ink jet nozzles while the image data is used to drive the cyan, magenta, and yellow color nozzles while the print media is driven through the printhead of the printer as disclosed in applicant's co-pending applications U.S. Ser. No. 09/113,070 (Docket No. ART02US) and U.S. Ser. No. 09/112,785 (Docket No. ART29).
The image taken by the camera system is now available as a photographic image with the data necessary to reproduce that image printed therewith. It is not necessary to separately locate the negative if another copy of the photograph is desired, the image can be reproduced notwithstanding damage thereto and the image is available in a digital format which can be scanned into a computer system as disclosed in applicant's co-pending applications U.S. Ser. No. 09/113,070 (Docket No. ART02US) and U.S. Ser. No. 09/112,785 (Docket No. ART29) for whatever purpose or transmitted over a telecommunications network.
Another type of format the so-called Artcard format is disclosed in U.S. Ser. No. 09/113,070 (Docket No. ART02US) and U.S. Ser. No. 09/112,785 (Docket No. ART29US) and may equally be used here in place of the “alternative Artcard” format as described above. In the Artcard format a continuous area of data is printed on the print media, in the present case, in infra-red ink on the photograph surrounded by margins printed as targets at the leading and trailing edges of the data area and as other indicia to specify borders and clockmarks along the top and bottom thereof to aid decoding of the data contained in the data area. The targets are used to confirm that the orientation of the card when read is not rotated more than 1° from the horizontal and to detect whether the card has been inserted front or back first. Otherwise the reading of the data would be unreliable.
The foregoing description has been limited to specific embodiments of this invention. It will be apparent, however, that variations and modifications may be made to the invention, with the attainment of some or all of the advantages of the invention. For example, it will be appreciated that the invention may be embodied in either hardware or software in a suitably programmed digital data processing system, both of which are readily accomplished by those of ordinary skill in the respective arts. Therefore, it is the object of the appended claims to cover all such variations and modifications as come within the true spirit and scope of the invention.
This is a Continuation of Ser. No. 12/031,526 filed on Feb. 14, 2008, which is a Continuation of Ser. No. 11/006,577 filed on Dec. 8, 2004, now issued U.S. Pat. No. 7,354,122, which is a Continuation of Ser. No. 09/693,083 filed on Oct. 20, 2000 issued as U.S. Pat. No. 6,859,225, which is herein incorporated by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 12031526 | Feb 2008 | US |
Child | 12966838 | US | |
Parent | 11006577 | Dec 2004 | US |
Child | 12031526 | US | |
Parent | 09693083 | Oct 2000 | US |
Child | 11006577 | US |