Digitizing printed documents provides a variety of benefits. For example, digital documents may be stored and shared electronically and may be more efficiently archived, searched, and reproduced as compared to printed or physical documents. Various types of scanning and image capture devices exist that are able to convert a sheet of paper including text, images, or a combination of both, to a digital format. However, existing scanners and image capture devices are only able to scan or digitize a single page at a time. Thus, processing a stack of pages may be time consuming and inefficient. Moreover, the pages may not be easily separable, as in the case of a book. In such a scenario, the pages of the book must be turned, either manually, or by a machine, in between each page scan.
In addition, it may be desirable to scan the contents of a stack of papers, a book, or an envelope for example, without removing any papers from the stack of papers, without opening the book, or without opening the envelope, for example. Existing scanners do enable such scanning without disturbing the contents of the target to be scanned.
In a method for scanning a book including a plurality of pages, a property of the plurality of pages is measured between a plurality of points on the top of the book and a plurality of points on the bottom of the book to generate a plurality of data values. The plurality of data values are grouped into a plurality of levels corresponding to the plurality of pages. A determination is made for each of the plurality of the points at each of the plurality of levels as to whether ink is present. Pixel data is generated, indicative of one of the presence or absence of ink at each of the plurality of points for each of the plurality of levels. An image is generated using the pixel data for each of the plurality of levels.
A system for scanning a book including a plurality of pages includes a first copper plate, with a first plurality of squares, disposed on a first side of a book. The system further includes a second copper plate, with a second plurality of squares, disposed on a second side of the book. The system further includes a power source, coupled to the first copper plate and the second copper plate, for providing an electric charge between one of the first plurality of squares and one of the second plurality of squares. The system further includes a measurement device for measuring the capacitance of the book between the one of the first plurality of squares and the one of the second plurality of squares. The system further includes a computer with program instructions for identifying a plurality of pixels between the one of the first plurality of squares and the one of the second plurality of squares and for determining whether one of the plurality of pixels between the one of the first plurality of squares and the one of the second plurality of squares comprises ink based on the measured capacitance.
In a method for scanning a plurality of papers, the electrical capacitance of a plurality of pixels of a plurality of papers disposed between a first metal plate and a second metal plate is measured. It is determined whether each of the plurality of pixels includes ink based on the measured electrical capacitance. A plurality of images corresponding to a plurality of papers are generated based on the determination of whether the plurality of pixels comprise ink.
In the accompanying drawings, structures are illustrated that, together with the detailed description provided below, describe example embodiments of the claimed invention. Where appropriate, like elements are identified with the same or similar reference numerals. Elements shown as a single component may be replaced with multiple components. Elements shown as multiple components may be replaced with a single component. The drawings may not be to scale. The proportion of certain elements may be exaggerated for the purpose of illustration.
Scanners or image capture devices operate by analyzing how light is reflected by paper and by ink on the paper. Light does not reach the inside of a book when the book is closed, however. Therefore, scanning or examining the content of a book using known scanners or image capture devices is not feasible without opening the book. Described herein is a system and method for scanning a book and determining the contents of the pages inside without opening the book by relying on alternative properties of the ink and paper. Based on the analysis, the book can be digitized and stored in electronic form for archiving, and for distribution and reproduction.
It should be understood that, although the example systems and methods described herein refer to scanning a closed book and determining the content of the pages of the closed book, the systems and methods may similarly be used in other applications to identify content of papers or pages without disturbing the context of the papers, such as a stack of papers, a sealed envelope, and the like.
It should be further appreciated that the systems and methods described herein may similarly be used to perform CT scans and X-Rays for human beings or to examine inorganic objects such as gold or diamonds for cavities and impurities for example.
A pixel is defined as a portion of a page of the book 102. The metal plates include squares corresponding to a set of vertical pixels going through the book 102. The pages of the book 102 may be divided up into any suitable number of pixels. Dividing up the book 102 into a greater number of pixels results in a greater resolution of a scan of the book 102.
System 100 includes a power source 106, which can be AC or DC, for delivering an electrical voltage to the first semiconductor metal plate 104 and to the second semiconductor metal plate. In one example, power source 106 may be a DC circuit defined by the universal time constant formula:
Vc=Vs(1−e−t/RC) equ. (1)
Where Vc is the voltage across the capacitor;
Vs is the supply voltage;
t is the elapsed time since the application of the supply voltage; and
RC is the time constant of the RC charging circuit.
In another example, power source 106 may be an AC circuit where the current is defined by the equation:
Z=√{square root over (R2+X2)} equ. (2)
where, X=Xc=½πfC when connected to a AC source. Z=V/I is computer where I is the measured current.
Paper that includes ink, however, exhibits different properties than paper without ink and therefore produces different values of capacitance when acting as a dielectric in between the metal grid squares through which the current is active. Measuring the value of the capacitance of the book indicates whether there is any ink present inside the book and how much ink is present.
System 100 includes a measurement device 108 such as an Attofarad Capacitance Measurement Instrument for measuring the capacitance of the book in combination with the two semiconductor metal plates 104. Measurement device 108 produces and stores data readings in output data 110. The data readings include information about whether the measured capacitance indicates the presence of ink or not in the paper. Output data 110 can be a database, a flat file such as an Excel file, or any other suitable data storage format.
For a pixel area of 1 and depth of 1 cm, the following equation ca be used:
Using the above equation, C=k*8.84*10−12*0.070004163889*10−6/0.01, where 0.070004163889 *10−6 m2 is area of 1 pixel, 1 aF=0.000001 pF=10−18F. This calculated C is in Attofarad. In one example where the minimum resolution a measuring instrument may take is in Attofarad, taking a minimum reading in Attofarad might lead to errors. Thus, an area larger than one pixel of the metal plate is covered. The reading C can be taken, after which one pixel line current is dropped and a reading C is taken again. The two readings are subtracted to find the reading C of 1 pixel line of the book with a depth of r pages.
In order to determine the contents of the pages of the book 102, system 100 divides up the book 102 into multiple pixels, applies voltage to the pixels, and takes a capacitance reading at the pixels.
A capacitor 400, as illustrated in
where the constant
∈e=8.854×10−12 F/m equ. (5)
and k is equal to relative permittivity of the dielectric material between the plates. K is approximately equal to 1 for air or free space and k is greater than 1 for all media such as paper. A is equal to the area of metal plates 302 and 304 and D is equal to the distance between the metal plates 302 and 304.
Since book pages are dielectric, attaching metal charged plates 104 to opposite ends of the book 102 causes the combination to act as a capacitor. By dividing up the metal plate 104 into squares 202 or pixels, multiple combinations of capacitors can be formed depending on which squares 202 receive the charge. Each capacitor is one pixel area dielectric and the number of dielectric in series is determined by the depth and the thickness of the book surrounded by metal plates or the semiconductor plates on top and on the bottom.
The book 102 contains n pixels length, m pixels width, 1 pixels height, and a total of r pages. When charged metal conductors are joined by copper wire capacitance can be measured by the formula C=q/V.
Also, 1/C1+1/C2+ . . . +1/Cr=1/C where, C1 and C2 is the capacitance of one pixel cross-section of the book 102, in series.
The book will start acting as a capacitor rather than a resistor as Resistance of the book is typically 1 Mohms and C=3.0989 to 2.92 femtofarad, for metal plates 0.001 m by 0.001 m in area and gap between plates=0.01 m. Thus, RC (time constant) is low, so the capacitor will charge up quickly. The internal resistance of the book 102 does not allow for leakage current to flow through the book capacitor. Thus the book is a good dielectric and increases the capacitance of a capacitor.
To obtain the multiple data points, (n+1)(m+1) readings of voltage drop across opposite facing squares 202 in the book's 102 opposite facing sides are measured, considering across top and bottom only. There are three variables, (l)(m)(n), since the book 102 is 3-dimensional. So to factor in the multiple pages or layers of the book 102, a diagonal reading is taken as well. Thus, one readings of C11 is
Measuring voltage across opposite side ends yields (l)(m)+(n)+(l)+(n)(m) readings.
Measuring diagonal readings cross-section in the book 102 with charged squares 202 on metal plates 104 on the diagonals, the number of readings increases by (n)(m)2 as compared to considering only top down readings. These additional data points enable the analysis of the book in 3-dimensions rather than just a single page. Since the capacitance C measured is either the capacitance of air, paper, or ink, measuring C at a pixel indicates whether there is ink or paper at a pixel.
In one example, a book includes 600 by 825 pixels on the top surface so there are (600*825)̂2 linear equations to solve. For improved efficiency and for facilitating real time calculations, the book may be divided into slices to form a lesser 600̂2 linear equations for each slice. The number of pages r of the book determines r*600 variables. The slices can then be individually solved for more efficiently since a slice of the book includes 600 pixels by 1 pixel. The number of slices, 825 for example, is determined by the length of the book.
Referring back to
To create the digital image, the data conversion software 114 first solves mn*mn linear equations in lmn variables which represent the readings taken across multiple levels corresponding to the multiple pages of the book 102.
To determine the inverse capacitance values D at each of the pixels 11, 12, 21, and 22, 4 equations are solved to find the total inverse of capacitance readings. In particular, Y1, Y2, Y3, and Y4 are solved using an example coefficient matrix (1, 0, 2/3) illustrated in
It should be appreciated that the example illustrated in
It should be appreciated that although the example system and method described measures capacitance across pixels of the book in order to determine the contents of the book, other techniques such as X-Ray CT scanning may be relied on for the purpose of determining the contents of the book.
In one example, the thermal conductivity of paper may be used to determine whether the pixels of paper of a book include ink. Heat is applied to the top of the book where the temperature is monitored. By calculating the temperature gradient of a point or a pixel inside of the book based on the monitored outside temperature, the contents of the inside of the book can be determined since a pixel of paper with ink has a different thermal coefficient and therefore conducts heat differently compared to a pixel of paper without ink. The heat has to reach the opposite pixel of the opposite side of paper stack. The time taken to do that and reach a steady state can be calculated by calibrating for the all cuboid ink case.
dQ/dt=q in watts of microwave and top 1 surface is heated up and the other 5 surfaces are not. Further, suppose there are m pixels of paper on heated of the outermost surfaces and n pixels of ink, then the effective coefficient of thermal conductivity is:
for paralle pixels or perpendicular to heat and
Koff=1/(m/k1+n/k2) equ. (8)
for series pixels or parallel to heat. Therefore:
Assuming isothermal conditions, the book Aq is taken as a whole and ‘a’ is the total number of pixels in y-z plane; ‘b’ is the total number of pixels along the y-axis; and ‘c’ is the total number of pixels along the z-axis.
Assuming adiabatic conditions and simplifying for end temperatures we get:
Here we are assuming adiabatic conditions our outer Δqijkx=(q−heat loss to atmosphere from all six sides)/(A)(2mn+2lm+2nl), where A is top surface area of book and where top side is heated=m*n not in meter2. A in this equation is the area of 1 pixel in meters2. Heat is assumed to be divided equally for all directions for going to all pixels from one pixel.
Thus there are lm+nl+mn equations from the above equation from 2(ml+nl+nm) temperature readings on the outer surfaces and lmn variables inside the book. This leads to solving for lmn variables, since heat travels from pixel 111 to 12n also and takes the shortest route to calculate it.
For the outermost surface, heat lost to atmosphere must also be considered.
Thus q-dQ/qt=heat input in the book.
Newton's cooling law is a solution of the differential equation given by Fourier's law: Where Q is the thermal energy in joules;
h is the heat transfer coefficient (assumed independent of T here) (W/m2K);
a is the surface area of the heat being transferred (m2);
T is the temperature of the object's surface and interior; and
Tenv is the temperature of the environment; i.e. the temperature suitably far from the surface. Thus,
In one example, a book may be scanned using magnetic inductance.
Reluctance (total)=Rpole+Rpole+Rgap+Router/2 and Rgap=reluctance of ink and paper.
Where
etc.
The μ magnetic permeability is found by computing the effective permeability of paper and paper with ink or air contact
AC Current is:
where, X=XL=2πfL when connected to a AC source. Z=V/I is computed where I is the measured current. A DC source inductor follows the same Universal Time Formula as a capacitor except:
τ=L/R
Readings are take at different positions on the book. Some readings are taken diagonally opposite. The number of readings obtained is equal to (n)(m)2 considering only top down readings and (l)(m)(n) variables.
In one example, the thickness of ink and paper may not be constant. Thus, in one example, the systems and methods described herein may be calibrated for variances in thickness as well as humidity, different ink densities, and dust or other particles on the book.
It should be understood that, although some of the examples that have been described make reference to scanning a book, the system and method described herein may be used to scan other objects as well.
Some portions of the detailed descriptions are presented in terms of algorithms and symbolic representations of operations on data bits within a memory. These algorithmic descriptions and representations are the means used by those skilled in the art to convey the substance of their work to others. An algorithm is here, and generally, conceived to be a sequence of operations that produce a result. The operations may include physical manipulations of physical quantities. Usually, though not necessarily, the physical quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated in a logic and the like.
While example systems, methods, and so on, have been illustrated by describing examples, and while the examples have been described in considerable detail, it is not the intention to restrict or in any way limit the scope of the appended claims to such detail. It is simply not possible to describe every conceivable combination of components or methodologies for purposes of describing the systems, methods, and so on. With the benefit of this application, additional advantages and modifications will readily appear to those skilled in the art. The scope of the invention is to be determined by the appended claims and their equivalents.
To the extent that the term “includes” or “including” is used in the specification or the claims, it is intended to be inclusive in a manner similar to the term “comprising” as that term is interpreted when employed as a transitional word in a claim. Furthermore, to the extent that the term “or” is employed (e.g., A or B) it is intended to mean “A or B or both.” When the applicants intend to indicate “only A or B but not both” then the term “only A or B but not both” will be employed. Thus, use of the term “or” herein is the inclusive, and not the exclusive use. See, Bryan A. Garner, A Dictionary of Modern Legal Usage 624 (2d. Ed. 1995). Also, to the extent that the terms “in” or “into” are used in the specification or the claims, it is intended to additionally mean “on” or “onto.” Furthermore, to the extent the term “connect” is used in the specification or claims, it is intended to mean not only “directly connected to,” but also “indirectly connected to” such as connected through another component or components.
This application claims priority from U.S. Provisional Patent Application No. 61/869,362 filed on Aug. 23, 2013, which is incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
61869362 | Aug 2013 | US |