The present invention relates to a data processing apparatus for a comprehensive two-dimensional chromatograph, the apparatus processing data collected by a comprehensive two-dimensional gas chromatograph (GC) or a comprehensive two-dimensional liquid chromatograph (LC).
As a GC analysis technique, a technique called a comprehensive two-dimensional GC (also called “GC×GC”) is known (refer to Patent Literature 1). The comprehensive two-dimensional GC first separates various components in a sample in a first-dimension column (hereinafter, called a “primary column”), and introduces the eluted components into a modulator. The modulator repeats an operation of trapping the introduced components at constant time intervals (typically, about several seconds to several tens of seconds; the time interval is usually called “modulation time”) and subsequently drawing the components in a significantly narrow time band, and introducing the components into a second-dimension column (hereinafter, called a “secondary column”). Typically, in the primary column, components are separated under a separation condition that allows elution similar to that of a typical GC or elution slightly slower than that of a typical GC. On the other hand, as a secondary column, a column that has a different polarity and a smaller inner diameter in comparison with the primary column is adopted. Components are separated under a condition that allows elution to be completed in a predetermined modulation time.
Accordingly, in the comprehensive two-dimensional GC, the secondary column can be used to separate multiple compounds that have not been separated in the primary column and have peaks overlap with each other, thereby allowing the separation performance to be significantly improved in comparison with a typical GC. Therefore, this GC is significantly effective in analyzing a sample that contains many compounds having close retention times, typically in analyzing hydrocarbons in diesel fuel and the like.
Unlike a multi-dimensional GC that adopts multiple detectors corresponding to respective columns, the comprehensive two-dimensional GC obtains a detection signal through a single detector connected at an output port of a secondary column. Accordingly, though components are separated in columns in two stages, data output from the detector is one series of chronological order data items. Therefore, by plotting the thus obtained data items in the occurrence order, a chromatogram similar to that of a typical GC, i.e., a chromatogram where the abscissa indicates the temporal axis and the ordinate indicates the signal strength axis, can be created.
As described above, in many cases, the comprehensive two-dimensional GC includes two columns having different separation characteristics. Accordingly, in order to represent the state of separation in each column in a manner easy to understand, a two-dimensional chromatogram is created where the retention time in the primary column and the retention time in the secondary column are represented in respective two axes orthogonal to each other and the signal strength is represented as contour lines, or a three-dimensional chromatogram is created where the signal strength is represented as the third axis. As data processing software dedicated to a comprehensive two-dimensional GC for creating such a multi-dimensional chromatogram, “GC Image” (refer to Non Patent Literature 1) provided by GC Image LLC in the U.S. is well known.
In a temperature rising analysis, the abscissa indicates the order of boiling points, and the ordinate indicates the polarity order. Accordingly, this two-dimensional chromatogram can facilitate understanding of the characteristics of each compound, and identifying contained compounds even when many types of compounds are contained.
Typically, in the comprehensive two-dimensional GC, data obtained from sample analysis is temporarily stored in a storage device, such as a hard disk. Subsequently, the data is read from the storage device at an appropriate timing, and processed by the dedicated data processing software as described above. The processes of collecting data from the comprehensive two-dimensional GC and storing the collected data in the storage device are performed using software for a typical GC or GC/MS instead of the comprehensive two-dimensional GC. However, the typical GC does not originally have a concept of “modulation time”. Accordingly, the collected chromatogram data does not contain data (parameter information) that indicates the modulation time as one of analysis conditions. Thus, conventionally, an analyst records modulation time information during execution of analysis. When the chromatogram data is read by the data processing software dedicated to the comprehensive two-dimensional GC, the analyst inputs the modulation time as one of processing parameters (refer to Non Patent Literature 2). Such operations are complicated for the analyst, and cause a possibility that input errors and the like generate incorrect results.
The comprehensive two-dimensional LC that executes analysis similar to that of the comprehensive two-dimensional GC is also in situations similar to the above situations.
The present invention has been made in view of the above problems and is addressed to an object to provide a data processing apparatus for a comprehensive two-dimensional chromatograph that can negate the need to input modulation time information required for processing in data processing, such as two-dimensional chromatogram creating, based on chromatogram data obtained through a comprehensive two-dimensional chromatograph.
To solve the aforementioned problems, the present invention provides a data processing apparatus for a comprehensive two-dimensional chromatograph, the apparatus processing chromatogram data collected by the comprehensive two-dimensional chromatograph including a primary column, a modulator, a secondary column, and a detector, the data processing apparatus including:
a) a one-dimensional chromatogram creation unit configured to arrange chromatogram data items acquired by the comprehensive two-dimensional chromatograph according to a chronological order to create a one-dimensional chromatogram that represents a relationship between time and signal strength; and
b) a modulation time estimation unit configured to estimate modulation time in the modulator, based on regularity in appearance time on the one-dimensional chromatogram between a certain peak and one or more other peaks that originate from a compound identical to the compound for the peak but appear at different time.
Here, the comprehensive two-dimensional chromatograph may be any of a comprehensive two-dimensional GC, and a comprehensive two-dimensional LC.
In general, the peak width of a compound separated in the primary column is large. Accordingly, the same compound is introduced into the secondary column over consecutive ranges of modulation time. Therefore, in the chromatogram data corresponding to the ranges of modulation time, peaks originating from the same compound must regularly appear, and the interval between peaks originating from the same compound on the one-dimensional chromatogram in adjacent two ranges of modulation time must be substantially identical to the modulation time. Thus, in the data processing apparatus for a comprehensive two-dimensional chromatograph according to the present invention, the modulation time estimation unit estimates the modulation time using the regularity of appearance of peaks originating from the same compound on the aforementioned one-dimensional chromatogram.
An aspect of the present invention can have a configuration where the modulation time estimation unit examines correlation between the one-dimensional chromatogram and a time-shifted one-dimensional chromatogram obtained by shifting the one-dimensional chromatogram in a temporal axis, and estimates that a shift time maximizing the correlation during variation in the shift time is the modulation time. More specifically, for example, the product of signal strengths in respective times on between the one-dimensional chromatogram and the time-shifted one-dimensional chromatogram is calculated, and the integrated value over the entire measurement time range is obtained. The variation in integrated value according to variation in shift time may be obtained to find a shift time maximizing the integrated value, and this shift time may be estimated as the modulation time.
Such a configuration can estimate the modulation time through repetition of a simple operation. The repetition has a small load on hardware. Accordingly, the modulation time can be estimated in a short time.
Preferably, the data processing apparatus for a comprehensive two-dimensional chromatograph according to the present invention has a configuration that further includes a two-dimensional chromatogram creation unit configured to create a two-dimensional chromatogram having axes that are retention time in the primary column and retention time in the secondary column, based on the modulation time obtained by the modulation time estimation unit.
According to the data processing apparatus for a comprehensive two-dimensional chromatograph according to the present invention, based on one-dimensional chromatogram data collected by a comprehensive two-dimensional GC or a comprehensive two-dimensional LC, modulation time is automatically obtained. Accordingly, during execution of data processing specific to a comprehensive two-dimensional chromatograph, such as two-dimensional chromatogram creation through use of modulation time information, the modulation time information is not required to be input from the outside. This negates the need of efforts by the analyst for information input, thereby improving operation efficiencies, and avoiding inappropriate data processing due to an input error or the like.
One embodiment of a comprehensive two-dimensional GC system adopting a data processing apparatus for a comprehensive two-dimensional chromatograph according to the present invention will be described below with reference to the accompanying drawings.
In this system, an analysis unit 1 includes: a primary column 12; a sample introduction unit 11 that includes a sample vaporization chamber for introducing sample gas into the primary column 12; a modulator 13 that traps components (compounds) eluted from the primary column 12 in constant time (modulation time t) intervals, and temporally compresses and transfers the components; a secondary column 14 that has separation characteristics (typically, different polarity) different from those of the primary column 12 and is capable of high speed separation; and a detector 15 that detects compounds separated in two-stage columns 12 and 14 respectively, and outputs a strength signal according to the detected amount (concentration). The detector 15 may have any configuration that can be generally used as a detector for GC, irrespective of the type (detection scheme). For example, in the case of adopting a mass spectrometer as the detector 15, the mass spectrometer repeatedly performs scanning measurement in a range of predetermined mass-to-charge ratio, and total ion chromatogram data in which the ion intensity data item obtained in each scanning is accumulated be output from the detector 15.
A data processing unit 2 includes: a data collection unit 21 that collects chromatogram data items sequentially output from the detector 15 according to the time lapse as described above; a data storage unit 22 that stores the collected chromatogram data items; and a GC×GC data processor 23 that reads the data items stored in the data storage unit 22, and processes the data items. The GC×GC data processor 23 includes: a modulation time estimation unit 24 that estimates the modulation time tm based on the read data items; and a two-dimensional chromatogram creation unit 25 that creates a two-dimensional chromatogram based on the estimated modulation time tm.
The operation of each component included in the analysis unit 1 is controlled by an analysis control unit 3. A main control unit 4, to which an operation unit 5 as a user interface and a display unit 6 are connected, performs overall control of the system. The main control unit 4, the analysis control unit 3, and the data processing unit 2 can be achieved by adopting a personal computer as hardware resources, and executing dedicated control/processing software having been preliminarily installed in the personal computer. Particularly, in the GC×GC data processor 23, functions other than the modulation time estimation unit 24 can be achieved using software described in Non Patent Literatures 1 and 2.
An analysis operation in the analysis unit 1, that is, a chromatogram data collecting operation, is schematically described.
In the analysis unit 1, the sample introduction unit 11 introduces a sample to be analyzed into carrier gas transferred to the primary column 12 at a substantially constant flow rate, according to an instruction from the analysis control unit 3. Typically, the sample contains many compounds. The various compounds contained in the sample are separated while passing through the primary column 12 temperature-controlled according to a predetermined temperature rising program, and are eluted by being shifted in time. At this time, not all the compounds are necessarily separated sufficiently. Compounds having close retention times in the primary column 12 are eluted in an overlapping manner (in a mixed state).
The modulator 13 repeatedly performs operations that trap all compounds eluted from the primary column 12 over the modulation time tm (typically, about several seconds to several tens of seconds), temporally compress the compounds, and transfer the compounds into the secondary column 14 in a significantly narrow band. Accordingly, compounds eluted from the primary column 12 are transferred into the secondary column 14 without exception. The multiple compounds transferred in every modulation time tmm are eluted while being temporally separated at a high resolution during passing through the secondary column 14, and introduced into the detector 15 in the elution order. As described above, in the case of adopting a mass spectrometer as the detector 15, scanning measurement at an interval shorter than a time width in which one compound is eluted from the secondary column 14 allows all the compounds to be detected without exception.
The detection signal by the detector 15 is converted by an internally included A/D converter into digital data at a predetermined sampling period, and output. The data collection unit 21 collects the chromatogram data items sequentially obtained according to the time lapse in this manner, and stores the data items in the data storage unit 22. Typically, a series of chromatogram data items obtained by execution of comprehensive two-dimensional GC analysis for one sample is integrally stored as one data file.
The GC×GC data processor 23 is a dedicated data processing unit for processing data obtained by the comprehensive two-dimensional GC analysis as described above. Typically, the GC×GC data processor has a function of creating a two-dimensional chromatogram where the abscissa indicates a first-dimensional retention time and the ordinate indicates a second-dimensional retention time. In contrast, the GC×GC data processor 23 of this embodiment not only includes the two-dimensional chromatogram creation unit 25, but also the characteristic modulation time estimation unit 24. Hereinafter, description will be made mainly on the operations of the modulation time estimation unit 24.
For example, when an operation by the analyst through the operation unit 5 instructs creation of a two-dimensional chromatogram on the basis of the analysis result of the predetermined sample, the GC×GC data processor 23 reads designated data items from the data storage unit 22 based on control by the main control unit 4. The modulation time estimation unit 24 arranges the read chromatogram data items according to the time lapse, thereby creating a one-dimensional chromatogram as shown in the diagram (a) of
Since the resolution in the primary column 12 is relatively low, gas eluted from the primary column 12 contains the same compound spreading over multiple ranges of modulation time tm. Accordingly, in the one-dimensional chromatogram, a peak originating from a certain one compound hardly appears only in one range of the modulation time tm. In many cases, a sharp peak originating from the same compound appears in each of consecutive ranges of modulation time tm. From the modulator 13 to the secondary column 14, compounds compressed at intervals of the modulation time tm are introduced. The same compound must have the same retention time in the secondary column 14. Accordingly, the interval between peaks that originate from the same compound and appear in respective adjacent ranges of modulation time tm on the one-dimensional chromatogram has regularity. The interval must coincide with the modulation time tm. Accordingly, when the shift time τ coincides with the modulation time tm, the positions of peaks originating from the same compound coincide with each other between the time-shifted one-dimensional chromatogram and the original one-dimensional chromatogram (refer to the diagram (c)
More specifically, the regularity of peaks originating from the same compound on the one-dimensional chromatogram can be evaluated according to the following method.
Now, suppose the function of the curve of the one-dimensional chromatogram be f(t). An evaluation function with respect to the shift time τ as represented in Expression (1) is considered.
F(τ)=∫f(t)·f(t−τ)dt (1)
This evaluation function is obtained by temporally integrating the product of the signal strengths at the same position (temporal position) between the original one-dimensional chromatogram and the time-shifted one-dimensional chromatogram.
The evaluation function of Expression (1) has peaks coinciding with each other and represents a significantly high value, when the shift time τ is an integral multiple of the modulation time tm. Typically, the modulation time tm is set within a range of about one second to several tens of seconds, and thus the evaluation range of the shift time τ is preferably set to 0 seconds to about an extent ranging from at least 20 seconds to one minute. It is a matter of course that, if the setting range of the modulation time tm is narrower, the evaluation range of the shift time τ can also be set narrower. The integral calculation in Expression (1) may be replaced with total summation of the sampling intervals.
While the shift time τ is changed by each predetermined step width over the evaluation range of the shift time τ as described above, the evaluation value F(τ) based on Expression (1) is calculated. This calculation can obtain the relationship between the shift time τ and the evaluation value F(τ). As described above, if the shift time τ is twice, three times, . . . longer than the modulation time tm, F(τ) becomes large. In the relationship between the shift time τ and the evaluation value F(τ), a predetermined number of shift times τi at which F(τ) becomes a local maximum are selected in a descending order of F(τ). Here, τ0 (i=0)<τ1 . . . <τn. For example, n may be, say, about 1000. After the shift time τi is thus obtained, it is preferred that Δτ1=τ1−τ0. If mΔτ1 resides in a range from τ2 to τn, Δτ1 is set as the modulation time tm. If peaks originating from one compound eluted from the primary column 12 spread over consecutive four ranges of modulation time tm (typically, the resolution in the primary column 12 is set to such an extent), the value of m is set to all integers ranging from 2 to 4.
For determination whether mΔτ1 is in a range from τ2 to τn or not as described above, in consideration of the peak width of the chromatogram and deviation in peak position due to variation in carrier gas flow velocity in the secondary column 14, it is preferred that an error to an extent of about, e.g., 20 ms be neglected.
If mΔτ1 does not reside in a range from τ2 to τn, setting is next made such that Δτ2=τ2−τ0 and it is determined whether mΔτ2 resides in a range from τ3 to τn or not. If mΔτ2 resides in the range from τ3 to τn, Δτ2 is set as the modulation time tm. If mΔτ2 does not reside in a range from τ3 to τn, it is preferred that mΔτ1 be obtained for higher ti and a similar process be repeated. Thus, even if peaks capable of being regarded to originate from the same compound are not detected in adjacent ranges of modulation time owing to, e.g., disturbance, detection of peaks capable of being regarded to originate from the same compound in two or more ranges of modulation time that are not adjacent to each other (e.g., apart by one modulation time) enables a highly accurate modulation time to be estimated.
After the modulation time tm is thus estimated, the two-dimensional chromatogram creation unit 25 two-dimensionally arranges each chromatogram data item according to the obtained modulation time tm, and creates the two-dimensional chromatogram as shown in
The modulation time estimating method is not limited to the aforementioned method. Instead, the method may be any method as long as the method can detect the regularity or periodicity of appearance time of multiple peaks that can be estimated to originate from the same compound on the one-dimensional chromatogram. For example, the method is only required to evaluate correlation in waveform between the original one-dimensional chromatogram and the time-shifted one-dimensional chromatogram. Accordingly, any general method capable of calculating the correlation coefficient can be used.
It is apparent that the embodiment described above is merely an example of the present invention and it is obvious that any change, modification, or addition made as appropriate within the spirit and scope of the present invention is included in the scope of the appended claims. For example, the data processing apparatus according to the present invention is applicable not only to processing of data obtained by the comprehensive two-dimensional GC but also to processing of data obtained by the comprehensive two-dimensional LC.
This application is a National Stage of International Application No. PCT/JP2012/077540 filed Oct. 25, 2012, the contents of all of which are incorporated herein by reference in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2012/077540 | 10/25/2012 | WO | 00 |