Method for determining the size distribution of a mixture of particles using Taylor dispersion, and associated system

FIELD OF THE INVENTION

This invention concerns methods for determining the size distribution of a mixture of particles by implementing the Taylor dispersion and associated system, including the following steps:

- injecting a sample of the mixture to be analyzed inside a capillary in which an eluent is flowing;
- transporting the sample injected along the capillary from an injection section to a detection section thereof, in experimental conditions suitable to generate a Taylor dispersion phenomenon that is measurable at the level of the detection section;
- generating, by means of a suitable sensor included in the detection section, a signal characteristic of the Taylor dispersion of the transported sample;
- acquiring the detection signal in order to obtain an experimental Taylor signal; and
- analysing the experimental Taylor signal.

Below, ‘particle’ refers to any molecule in solution and/or particles in suspension in the mixture.

In this document, a species includes all particles characterized by the same size, e.g., the same hydrodynamic ray. A species is thus associated with a ‘particle size’ value.

BACKGROUND OF THE INVENTION

In this field, the ‘deconvolution’ of a Taylor signal refers to the processing of the experimental Taylor signal leading to the determination of the hydrodynamic ray of each of the species forming the mixture and the determination of the concentration of each of these species.

The international application published under no. WO 2010 009907 A1 discloses a method of the aforementioned type, the analysis step of which implements various deconvolution algorithms for an experimental Taylor signal. However, these algorithms may only be used in the specific case of a binary mixture, i.e., a mixture of two species. Accordingly, these known algorithms do not allow for the analysis of any desired sample, but only of samples of which it is known in advance that they result from the mixture of two species.

SUMMARY OF THE INVENTION

In practice, it is currently considered impossible to solve the general problem of deconvoluting the experimental Taylor signal of a sample of any given mixture of species.

The invention thus seeks to alleviate this problem by proposing, in particular, a method for real-time analysis of an experimental Taylor signal of a sample of any given mixture.

To this end, the invention concerns a method for determining the size distribution of a mixture of molecule or particle species including the following steps:

- injecting a sample of the mixture to be analyzed inside a capillary in which an eluent is flowing;
- transporting the sample injected along the capillary from an injection section to a detection section thereof, in experimental conditions suitable to generate a Taylor dispersion phenomenon that is measurable at the level of the detection section;
- generating, by means of a suitable sensor included in the detection section, a signal characteristic of the Taylor dispersion of the transported sample;
- processing the detection signal in order to obtain an experimental Taylor signal Ŝ(t); and
- analysing (200) the experimental Taylor signal Ŝ(t),

characterized in that the step of analysing an experimental Taylor signal Ŝ(t) of a sample of the mixture consists of searching an amplitude distribution P(G_(c)) that allows the experimental Taylor signal Ŝ(t) to be broken down into a sum of Gaussian functions by means of the equation:

{circumflex over (S)}(t)≡∫₀^∞P(G_(c))G_(c)^c/2exp[−(t−t₀)²G_(c)^c]dG_(c)

where

t is a variable upon which the experimental Taylor signal depends and t₀is a value of the variable t common to the various Gaussian functions and corresponding to the peak of the experimental Taylor signal Ŝ(t);

G_(c)is a characteristic parameter of a Gaussian amplitude function P(G_(c)) and is associated:

where c=1, with the diffusion coefficient D of a species according to the relation

G₍₁₎=12D/(R_c²t₀)

for c=−1, to the hydrodynamic ray R_hof a species according to the relation

$G_{(- 1)} = \frac{2 k_{B} T}{πη R_{c}^{2} t_{0}} R_{h}^{- 1};$

and

for c=−1/d_f=−(1−a)/3, to the molar mass M of a species according to the relation

$G = \frac{2 k_{B} T}{πη R_{c}^{2} t_{0}} {(\frac{10 π N_{a}}{3 K})}^{1 / 3} M^{- (\frac{1 + a}{3})},$

where k_Bis the Boltzmann constant, T is the absolute temperature expressed in Kelvins at which the experiment is conducted, η is the viscosity of the eluent used, R_cis the internal ray of the capillary used, N_ais Avogadro's number, and K and a are Mark Houwink coefficients,

by implementing a constrained regularization algorithm consisting of minimising a cost function H_α including at least one constraint term associated with a constraint that must observe the amplitude distribution P(G_(c)) that is the solution of the foregoing equation, whereby the minimization is carried out on an interval of interest of the values of the parameter G_(c).

According to specific embodiments, the method includes one or more of the following characteristics, taken alone or in all combinations technically possible:

- the foregoing equation is discretized by subdividing the interval of values of the parameter G_(c), whereby each discretization point G_mis indexed by an integer m that varies between the unit value and the value N, whereby the point G_mis at a distance from the point G_m−1corresponding to a sub-interval of the length c_m;
- the cost function takes the form:
  
  H_α=χ²+α²Δ²
  
  Where:
- the first term χ²is a distance term between the experimental Taylor signal Ŝ(t) and a reconstructed Taylor signal defined by:
  
  S(t)=Σ_m=1^Nc_mP(G_m)√{square root over (G_m)}exp[−(t−t₀)²G_m], and
- the second term Δ²is a constraint term associated with the at least one constraint that must observe the amplitude distribution P(G) that is the solution of the foregoing equation, whereby the second term is introduced by a Lagrange coefficient α, allowing the contribution of the second term of the cost function H_ato be adapted to the first term;
- the first term χ²is a distance of the type ‘least squares’ taking the form:
  
  χ²=Σ_k=1^L(S′(t_k)−{circumflex over (S)}(t_k))²

where the experimental Taylor signal Ŝ(t) and the reconstructed function S′(t) are sampled over time, whereby each sample is indexed by an integer k varying between the unit value and the value L;

- the at least one constraint that must be observed by the amplitude distribution P(G) that is the solution of the foregoing equation is a regularity constraint associated with a constraint term Δ²preferably taking the form:
  
  Δ²=Σ_m=2^N−1[P(G_m−1)−2P(G_m)+P(G_m+1)]²;
- the analysis step includes a step of determining the optimal value α₀of the Lagrange coefficient α such that the value of the distance term χ²corresponding to the minimum of the cost function H_α=α0is close by values lower than a statistical error ν, preferably of the form ν=L−N;
- because the interval of interest of the values of the parameter G_(c)are delimited by a minimum G_min and a maximum G_max, the method includes a step of determining the values of the minimum and maximum;
- it includes a step of breaking down a normalized Taylor signal s(t) associated with the experimental Taylor signal Ŝ(t) into components by the relation: s(t)=S(t)/S(t₀), consisting of adjusting to the curve ln [s(t)] a second-order polynomial of the variable (t−t₀)²so as to determine the first-order components Γ₁and second-order components Γ₂, and in that the step of determining the values of the minimum G_minand maximum G_maxuses the equations:

$β = \ln Γ_{1} - \ln (1 + \frac{Γ_{2}}{Γ_{1}^{2}}) and γ = \sqrt{\ln (1 + \frac{Γ_{2}}{Γ_{1}^{2}})}$

where β and γ are respectively the average and the standard deviation of the logarithm of the parameter G of a log-normal distribution, followed by,

G_min=exp(β−k√{square root over (2)}γ) and G_max=exp(β+k√{square root over (2)}γ);

- the step of determining the values of the minimum and maximum of the interval of interest is empirical and consists of:
- determining a normalized Taylor signal s(t) associated with the experimental Taylor signal Ŝ(t) by the relation: s(t)=S(t)/S(t₀),
- calculating the logarithm ln [s(t)],
- determining the derivative

$\frac{\partial \ln s}{\partial x}$

relative to the variable x=(t−t₀)², and

- determining the values of the parameters τ_minand τ_minrelated to the extrema of the derivative according to the relations:

$τ_{\min} = {a_{\min} (| \frac{\partial \ln s}{\partial x} |_{\max})}^{- 1 / 2}$

$τ_{\max} = {a_{\max} (| \frac{\partial \ln s}{\partial x} |_{\min})}^{- 1 / 2}$

with α_min=0.1; α_max=3,

- determining the minimum G_minand maximum G_maxusing the equations:
  
  for c=1: G_min=τ_max⁻²,G_max=τ_min⁻²,
  for c=−1: G_min=τ_min⁻²,G_max=τ_max⁻²,
  for c=−1/d_f=−(1+a)/3:

$G_{\min} = τ_{\min}^{(\frac{6}{1 + a})}, G_{\max} = τ_{\max}^{(\frac{6}{1 + a})};$

- it includes a step of measuring an average in T, G_T, of the parameter G₍₁₎based on the experimental Taylor signal and/or an average in Γ, G_Γ, of the parameter G₍₁₎based on the breakdown, whereby each average may be used in a constraint that must be observed by the amplitude distribution P(G₍₁₎) that is the solution of the foregoing equation;
- the step of determining the values of the minimum G_minand maximum G_maxuses the equations:

$β = \frac{1}{3} \ln {〈 G 〉}_{Γ} + \frac{2}{3} \ln {〈 G 〉}_{T}$

$γ = \sqrt{\frac{2}{3} \ln \frac{{〈 G 〉}_{Γ}}{{〈 G 〉}_{T}}}$

then

G_min=exp(β−k√{square root over (2)}γ) and G_max=exp(β+k√{square root over (2)}γ).

The invention also concerns a data storage medium including instructions for the execution of a method for determining the hydrodynamic ray, diffusion coefficient, or molar mass distribution of a mixture of molecule or particle species as defined above when the instructions are executed by a computer.

The invention lastly concerns a system for determining the hydrodynamic ray, diffusion coefficient, or molar mass distribution of a mixture of molecule or particle species including a computer, whereby the computer is programmed to execute a method for determining the hydrodynamic ray, diffusion coefficient, or molar mass distribution of a mixture of molecule or particle species as defined above.

BRIEF DESCRIPTION OF THE DRAWINGS

Other characteristics and advantages of the invention will become apparent from the following detailed description, provided by way of example and by reference to the attached drawings, in which:

FIG. 1 is a schematic representation of a system for determining the size distirbution of a particle mixture;

FIG. 2 is a schematic block representation of the method for determining the size distirbution of a particle mixture implemented by the system of FIG. 1; and

FIGS. 3-5 (A and B) are graphs showing the results of the implementation of the method of FIG. 2 in the case of an equimassic mixture of two samples of synthetic polymers: FIG. 3 shows the accumulation of three repetitions of the experimental Taylor signal; FIG. 4 shows the hydrodynamic ray distribution obtained by implementing the method of FIG. 2 compared with that obtained by steric exclusion chromatography and provided by the supplier of the synthetic polymer samples; and FIG. 5A shows the adjustment of the Taylor signal by the method of FIG. 2, and FIG. 5B shows the breakdown of the experimental Taylor signal of FIG. 3 into its components.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
Experimental Device

By reference to FIG. 1, the system for determining the size of a mixture of particles 2 includes an experimental device 3 suited to generate a Taylor dispersion phenomenon and to generate an experimental Taylor signal, and an analysis device 5 suited to analyse the experimental Taylor signal output by the experimental device 3 in order to determine in real time the size distribution of the particle mixture, a sample of which was injected into the experimental device.

The experimental device 3 includes, as is known, a capillary 6.

The experimental device 3 includes, in the vicinity of one end of the capillary 6, an injection section 7, and, in the vicinity of the other end of the capillary 6, a detection section 9.

The injection section 7 includes means 11 for injection into the capillary 6 of a sample of the mixture to be analyzed. The injection section 7 also includes means to allow an eluent to flow inside the capillary 6 from the injection section 7 to the detection section 9. These flow means are shown schematically in FIG. 1 by block 13.

The detection section 9 is optical. It is equipped with an optical cell including a light source S and an optical system 15 suited to cause the rays of light emitted by the source S to converge on a narrow portion of the capillary 6. Along the optical axis of the optical system 15, but opposite the lighted side of the capillary 6, the cell includes a CCD, diode array, or photomultiplier sensor 17 suited to collect the light that has passed through the capillary 6 and to generate a detection signal corresponding to the light collected. The sensor 17 is electrically connected to an electronic card 19 for pre-processing and digitising the detection signal generated by the sensor 17. The card 19 outputs a digital measurement signal that is time-dependent. This measurement signal is referred to as the ‘experimental Taylor signal’ or Taylorgram. It is indicated by the notation Ŝ(t) in the following. It depends on the time t.

The experimental Taylor signal Ŝ(t) is sampled at a predetermined temporal frequency, although the sampling points t_k(k=1, . . . , L) are at regular intervals. The experimental Taylor signal Ŝ(t) thus consists of a group of L pairs of data (t_k,Ŝ(t_k)).

In one variant, the detection section may be of another type, e.g., a conductivity detector, using mass spectrometry, fluorescence (laser-induced, if applicable), electrochemical, light diffusion, or more generally, any type of detector used in capillary electrophoresis. In particular, the Taylor signal may not be a time signal (scrolling the Taylor peak in front of a narrow sensor), but rather a spatial signal (instantaneous capture of the Taylor peak in front of an extended sensor). In this case, the variable t does not represent time, but the position along the capillary.

The analysis device 5 consists of a computer including an input/output interface 21 to which the electronic card 19 of the experimental device 3 is connected.

The computer further includes a memory 23, such as a RAM and/or ROM, as well as a processing unit 25, such as a microprocessor. The computer also includes human-machine interface means, indicated by the number 27 in FIG. 1. They include, e.g., a touch screen allowing a user to interact with the computer 5. All of the components of the computer are interconnected in a known fashion, e.g., by a data exchange bus.

The experimental Taylor signal Ŝ(t) is processed by a software application, the instructions of which are stored in the memory 23 and executed by the processing unit 25. This software is shown schematically in FIG. 1 by block 31.

Modelling of the Taylor Signal

In the known manner, assuming that i) the contribution of the diffusion along the axis of the capillary 6 to the dispersion at the level of the peak of the signal is negligible, ii) the injection time of the sample into the capillary 6 is sufficiently short (typically, the injected volume is less than 1% of the volume of the capillary), and iii) the detection device is sensitive to the mass of the molecules, the real Taylor signal S(t) of a monodispersed sample, i.e., a sample having only one species, and, accordingly, characterized by a hydrodynamic ray value R_hor diffusion coefficient D, is modelled by a Gaussian function:

S(t)=CMρ√{square root over (D)}exp[−(t−t₀)²12D/(R_c²t₀)]+B. (1.)

Where:

C is an instrumental constant,

M is the molar mass of the species,

ρ is the molar concentration of the species,

R_cis the internal ray of the capillary,

t₀is the moment corresponding to the peak of the Taylor signal, and

B is an offset constituting a measurement artifact (this term will be omitted in the following for clarity and because it is taken into account in the constrained regularization method).

It should be emphasized that assumption iii) depends on the nature of the sensor used in the detection section, and that the use of another type of sensor results in modifications to the equations shown herein in a manner known to persons skilled in the art.

The real Taylor signal S(t) of a polydispersed sample, i.e., a sample including several species, is modelled for the sum of the contributions of each of the species. Thus, assuming a mixture including a continuum of species, equation (1) is generalized by a continuous sum of Gaussian functions according to:

S(t)=∫₀^∞CM(D)ρ(D)√{square root over (D)}exp[−(t−t₀)²12D/(R_c²t₀)]dD, (2.)

In equation (2), the Gaussian functions are all centred on the same reference time t₀.

The parameter G=12D/(R_C²T₀) is introduced, causing the equation (2) to become:

S(t)=∫₀^∞P(G)√{square root over (G)}exp[−(t−t₀)²G]dG (3.)

where P(G) is referred to in the following as the ‘amplitude distribution’ of the Guassian functions of the parameter G.

A value of the parameter G is associated, via the diffusion coefficient D, with a species. For example, the Stokes-Einstein-Sutherland formula allows for the association of a value of the parameter G with the species characterized by the hydrodynamic ray R_h, according to the relation:

$\begin{matrix} G = \frac{2 k_{B} T}{πη R_{c}^{2} t_{0} R_{h}} & (4.) \end{matrix}$

Where:

k_Bis the Boltzmann constant,

T is the absolute temperature expressed in Kelvins at which the experiment is carried out, and

η is the viscosity of the eluent used.

Thus, in equation (3), each Gaussian function P(G)√{square root over (G)}exp[−(t−t₀)²G] represents the contribution of one species to the total amplitude of the real Taylor signal S(t). The amplitude P(G) of each Gaussian function depends directly on the concentration of the corresponding species in the mixture.

The function of the software 31 is to determine the distribution P(G) that is the solution of the following equation corresponding to equation 83) when the real Taylor signal S(t) is replaced with the experimental Taylor signal Ŝ(t):

{circumflex over (S)}(t)≡∫₀^∞P(G)√{square root over (G)}exp[−(t−t₀)²G]dG (5.)

Measurement Error Introduced by the Experimental Device

Like any measurement device, the detection section 9 introduces a systematic measurement error such that the experimental Taylor signal Ŝ(t) is not exactly equal to the real Taylor signal S(t).

The solution of the equation (5) then results in the determination of several distributions that are each solution to the measurement error. In other words, the solution of the equation (5) results in the identification of several families of Gaussian functions that each result in sum in a reconstructed Taylor signal Ŝ(t)=∫₀^∞P(G)√{square root over (G)}exp[−(t−t₀)²G]dG that is adjusted to the experimental Taylor signal Ŝ(t), with the adjustment criterion taking into account the measurement error introduced by the detection section.

However, amongst the various distributions P(G) that are solutions to the equation (5), only some have physical significance. It is such a ‘physical’ solution that the software 31 is suited to determine.

Method for Determining the Physically Significant Distribution

To solve this problem, the software 31 uses an algorithm that implements a constrained regularization method.

This algorithm is based on the following equation, which results from a discretization relative to the parameter G of the equation (5):

{circumflex over (S)}(t)≡Σ_m=1^N−1c_mP(G_m)√{square root over (G_m)}exp[−(t−t₀)²G_m] (6.)

where the interval of the values of interest of the parameter G, between the predetermined minimum G_min=G₀and maximum G_max=G_N, is subdivided into N sub-intervals identified by the integer m of the length c_m. Preferably, the various sub-intervals have the same length:

$c_{m} = \frac{G_{N} - G_{O}}{N}$

The limits are such that the interval on which equation (6) is discretized exceeds the interval for which the distribution P(G) is not nil. Accordingly, P(G₁)=P(G_N)=0.

The unknowns of equation (6) consist of all amplitudes P(G_m), m=1, . . . N.

In principle, the solution of equation (6) is obtained by a process of adjusting the experimental Taylor signal Ŝ(t) by the reconstructed Taylor signal Ŝ(t)=Σ_m=1^Nc_mP(G_m)√{square root over (G_m)}exp[−(t−t₀)²G_m]. That is, for any time t_k, S′(t_k) must be as near as possible to Ŝ(t_k).

In order to obtain robust results that are physically significant, however, it is necessary to take into account all information available on the amplitudes P(G_m) during the adjustment process so as to reject all solutions that are not physically acceptable.

To this end, the constrained regularization algorithm solves equation (6) by minimising a cost function dependent on the N unknown P(G_m) and translating the information available on the amplitudes P(G_m) into constraints.

In the current embodiment, the cost function H_α takes the following form:

H_α=χ²+αΔ² (7.)

It includes a first term χ²corresponding to a ‘distance’ between the experimental Taylor signal Ŝ(t_k) and a reconstructed Taylor signal S′(t_k).

For example, this first term is a distance of the type ‘least squares’:

χ²=Σ_k=1^L(S′(t_k)−{circumflex over (S)}(t_k))² (8.)

Another distance measure may be used, in particular, one that, in the foregoing sum, weights each term by a coefficient wk inversely proportional to the noise affecting the measurement taken at the instant t_k.

The cost function H_α includes a second term Δ², referred to as the constraint term, expressing a constraint that penalises the amplitudes P(G_m) that have no physical significance.

For example:

Δ²=Σ_m=2^N−1[P(G_m−1)−2P(G_m)+P(G_m+1)]² (9.)

In this example, the constraint term corresponds to the sum of the terms elevated to the square of the second derivative of the distribution P(G). The constraint term translates a regularity constraint. The amplitudes P(G_m) that vary too rapidly relative to their neighbours, P(G_m−1) or P(G_m+1), are thus penalised.

Another example of a regularity constraint is the following:

Δ²=Σ_m=3^N−1[P(G_m−2)−3P(G_m−1)+3P(G_m)−P(G_m+1)]² (9.)

This constraint term corresponds to the sum of the terms elevated to the square of the third derivative of the distribution P(G). The generalization to a regularity constraint based on the n-th derivative (n≧1) is immediate and can be carried out by a person skilled in the art.

In the following, it is assumed that the regularity constraint used is that of the second derivative of the distribution P(G), eq. (9).

The first and second terms of the cost function H_α have relative contributions that may be adapted by selecting the value of a coefficient α, a Lagrange coefficient. This coefficient verifies the size of the constraint term relative to the distance term. If α is very small, the constraint term is negligible. In this case, the minimization of the cost function yields the same result as a simple adjustment to the experimental data. For values of α that are too large, on the other hand, a significant cost is assigned to the constraint on the P(G), and the algorithm will reject the solutions that do not observe the constraint at the risk of preserving a solution that does not properly fit the experimental data.

Supplemental constraints that may be expressed in the form of supplemental equalities or inequalities, or P(G_m) linear inequalities are directly imposed during the search for the minimum of the cost function by limiting the search to the subregions specific to the space of the P(G_m).

For example, the constraint that the amplitudes are positive, P(G_m)≧0 ∀m, is imposed by minimising the cost function only on the half space of the positive amplitudes.

For example, if the value of an average custom character G of the parameter G is determined, a supplemental constraint on the amplitudes P(G_m) that are solutions to the equation (6) is that the amplitudes allow the previously determined average G to be determined at a deviation ε. This constraint is also expressed in the form of P(G_m) linear inequalities.

custom character G−ε≦Σ_m=1^NP(G_m)G_m≦G+ε (11)

With, for example: ε/ custom character G=5%.

Equation 12 may easily be generalized to other types of averages than the arithmetical average:

custom character G=Σ_m=1^NP(G_m)G_m (12)

for example the averages custom character G_Tand G_Γ which will be introduced below.

A crucial point is the choice of the coefficient α of the cost function H_α in equation 7. Two strategies may be used to select the coefficient α:

- A first strategy consists of choosing the greater value α such that the value of the distance term χ²does not exceed the statistically expected value that depends on the measurement error and the number of degrees of freedom in the constrained regularization process. Thus, if, for each experimental point, the standard deviation of the measurement error σ_k, is known, α is selected such that the normalized value χ_norm²of the distance term χ²does not exceed the number of degrees of freedom: ν=L−N, where χ_norm²is given by the equation:

$\begin{matrix} χ_{norm}^{2} = Σ_{k = 1}^{N} \frac{{(S^{'} (t_{k}) - \hat{S} (t_{k}))}^{2}}{σ_{k}^{2}} & (10.) \end{matrix}$

If the standard deviation of the noise σ_kis not known, an estimated value σ_estthereof may be determined based on the mean deviation between the experimental data and the best possible adjustment without taking into account the constraint, i.e., that obtained for α=0:

$\begin{matrix} σ_{est}^{2} = \frac{1}{N} {Σ_{k = 1}^{N} (S^{'} (t_{k}) (α = 0) - \hat{S} (t_{k}))}^{2}, & (11.) \end{matrix}$

where S′(t_k)(α=0) is the value of the k-th point of the Taylor signal reconstructed based on the amplitudes P(G_m) obtained by minimising only the first term of the cost function H_α=0.

Once the noise has been estimated, α is selected such that the normalized value χ_norm²of the distance term χ², calculated by replacing σ_kwith σ_estin equation 12 does not exceed the number of degrees of freedom: ν=L−N

A second strategy consists of selecting α value for a that gives equal weight to the distance term and the constraint term.

In this case, the parameter α that is retained is the one for which, once the constraint function H_α has been minimized, the result is χ²=αΔ².

In practice, the selection of α is made then by scanning a large range of values of the coefficient α. For each value of α, all amplitudes P(G_m) that minimise the cost function H_α are determined. The corresponding values of χ_norm², H_α and P(G_m) are recorded. Amongst all of the trials, the Lagrange coefficient α₀having the greatest value such that χ_norm²≦ν is finally retained.

In order to improve the efficiency of the digital search, the values of α are first scanned on a trial grid with large steps, and then the value of α is refined using a finer grid.

Determining G_Tand G_Γ from an Experimental Taylor Signal

Below, two averages of G are presented that may be obtained directly from the experimental Taylor signal.

The T average of the parameter G is defined as follows:

$\begin{matrix} {〈 G 〉}_{T} = \frac{Σ_{m = 1}^{N} c_{m} P (G_{m})}{Σ_{m = 1}^{N} c_{m} \frac{P (G_{m})}{G_{m}}} = \frac{1}{[\overline{G^{- 1}}]} & (12.) \end{matrix}$

and the Γ average of the parameter G is defined as follows:

$\begin{matrix} {〈 G 〉}_{Γ} = \frac{Σ_{m = 1}^{N} c_{m} P (G_{m}) G_{m}^{3 / 2}}{Σ_{m = 1}^{N} c_{m} P (G_{m}) G_{m}^{1 / 2}}, & (13.) \end{matrix}$

where c_mare those defined in relation to equation (6).

The purpose of determining these averages is twofold:

- it allows constraints on the amplitudes P(G_m) to be added during the solution of equation (6);
- it allows for an estimation of the average and breadth of the distribution P(G). This information is not only interesting in itself, but also allows for a determination of the interval of the values of the parameter G on which a solution to equation (6) can be sought.

custom character G_Tand G_Γ may be calculated respectively based on the temporal variance of the experimental Taylor signal and based on the cumulative approach described below.

Determining G_TBased on the Temporal Variance of the Experimental Taylor Signal

It is shown that:

$\begin{matrix} \frac{\int_{0}^{+ \infty} S (t) ⅆ t}{\int_{0}^{+ \infty} S (t) {(t - t_{0})}^{2} ⅆ t} \approx \frac{\int_{- \infty}^{+ \infty} S (t) ⅆ t}{\int_{- \infty}^{+ \infty} S (t) {(t - t_{0})}^{2} ⅆ t} = \frac{\sqrt{π} \int_{0}^{+ \infty} P (G) ⅆ G}{\frac{\sqrt{π}}{2} \int_{0}^{+ \infty} G^{- 1} P (G) ⅆ G} = {2 [\overline{G^{- 1}}]}^{- 1} = 2 {〈 G 〉}_{T} & (14.) \end{matrix}$

Thus, the T average of the parameter G is accessible by integrating the experimental Taylor signal.

With G=12D/(R_c²t₀), the T average of the parameter D is given by:

$\begin{matrix} {〈 D 〉}_{T} = \frac{R_{c}^{2} t_{0}}{24} \frac{\int_{0}^{+ \infty} S (t) ⅆ t}{\int_{0}^{+ \infty} S (t) {(t - t_{0})}^{2} ⅆ t} & (15.) \end{matrix}$

Determining G_Γ Based on a Breakdown of the Experimental Taylor Signal

In this part, the breakdown of the experimental Taylor signal is described in the case of a sample that is moderately polydispersed.

It is assumed that the size distribution is discrete. Equation (2) then becomes:

S(t)=C′Σ_i=1^Nρ_iM_i√{square root over (D)}_iexp[−(t−t₀)²12D_i/(R_c²t₀)] (16.)

where η_iis the molar concentration of the i-th species in the mixture, and M_iand D_iare respectively the molar mass and diffusion coefficient thereof.

It is useful to ‘normalise’ the Taylor signal relative to the height of its peak by introducing:

s(t)=S(t)/S(t₀=Σ_i=1^Nƒ_iexp[−(t−t₀)²G_i] (17.)

Where:

−G_i=12D_i/(R_c²i₀) and

−ƒ=ρ_iM_i√{square root over (D)}_i/Σ_i=1^N(ρ_iM_i√{square root over (D)}) is the relative contribution of the i-th species in the Taylor signal. It should be noted that t, depends on the diffusion coefficient of the i-th species.

The Γ average of the parameter G is then expressed as follows:

$\begin{matrix} {〈 G 〉}_{Γ} \equiv \frac{\sum_{i = 1}^{N} ρ_{i} M_{i} \sqrt{D_{i}} G_{i}}{\sum_{i = 1}^{N} ρ_{i} M_{i} \sqrt{D_{i}}} = Σ_{i = 1}^{N} f_{i} G_{i} . & (18.) \end{matrix}$

By positing G_i= custom character G_Γ=δG_i, with δG_Γ=0, equation (19) becomes:

s(t)=exp[−(t−t₀)²G_Γ]Σ_i=1^Nƒ_iexp[−(t−t₀)²δG_i], (19.)

which is the product of a Gaussian function by correction terms.

If (t−t₀)²δG_i<<1, i.e., near the peak of the Taylor signal, the limited development of the second term of equation (21) results in:

exp[−(t−t₀)²δG_i]=1−(t−t₀)²δG_i+½[(t−t₀)²δG_i]²+ . . . (20.)

Which leads, in equation (21), using Σ_i=1^Nƒ_i=1 and Σ_i=1^NδG_i=0 to:

s(t)=exp[−(t−t₀)² custom character G_Γ](1+½(t−t₀)⁴δG²_Γ+ . . . ) (21.)

Where the size distribution is not too broad (i.e., a slightly polydispersed sample), it is possible to express the normalized Taylor signal s(t) as the sum of a Gaussian function (as in the case of a monodispersed sample) and correction terms (to take into account a deviation in the case of a monodispersed sample).

Taking the logarithm of the expression (23) and carrying out a new limited development, the following is obtained:

ln [s(t)]=−(t−t₀)² custom character G_Γ+½(t−t₀)⁴δG²Γ+ . . . (22.)

The equation (24) is the desired cumulant development. The coefficients Γ₁= custom character G_Γ and Γ₂δG²_Γ are the first- and second-order cumulants of this development.

They may be obtained by adjusting a second-order polynomial of the variable (t−t₀)²to the function ln [s(t)].

The first cumulant also provides access to the Γ average of the diffusion coefficient:

$\begin{matrix} {〈 D 〉}_{Γ} = {〈 G 〉}_{Γ} \frac{R_{c}^{2} t_{0}}{12} = Γ_{1} \frac{R_{c}^{2} t_{0}}{12} . & (23.) \end{matrix}$

The Γ average is different to the aforementioned T average because the diffusion coefficient D appears there at different powers. These two averages contain different information on the distribution P(G). They allow for the addition of two constraints in the cost function to be minimized.

The second cumulant is linked to the Γ average of the variance of the distribution of the diffusion coefficients, providing an estimate of the polydispersity of the sample. More specifically, the ratio of the second cumulant divided by the square of the first cumulant gives:

$\begin{matrix} \frac{Γ_{2}}{Γ_{1}^{2}} = \frac{{〈 δ G^{2} 〉}_{Γ}}{{〈 G 〉}_{Γ}^{2}} = \frac{{〈 D^{2} 〉}_{Γ} - {〈 D 〉}_{Γ}^{2}}{{〈 D 〉}_{Γ`}^{2}} = \frac{\frac{\overline{D^{5 / 2}}}{\overline{D^{1 / 2}}} - {(\frac{\overline{D^{3 / 2}}}{\overline{D^{1 / 2}}})}^{2}}{{(\frac{\overline{D^{3 / 2}}}{\overline{D^{1 / 2}}})}^{2}} . & (24.) \end{matrix}$

Selection of the Interval of the Values of the Parameter G on Which to Seek a Solution

In the constrained regularization adjustment procedure, the choice of the interval of the values G on which the distribution P(G) is sought is an important factor.

In fact, the number N of points used in the discretization of the equation (6) cannot be too large; otherwise, the calculation time for the adjustment will be too long.

Furthermore, N must be significantly smaller than the number L of digitization points of the experimental Taylor signal.

Typical values of N are in the range of 50-200.

These considerations show that the interval [G_min, G_max] (G_min=G₀and G_max=G_N) must be carefully chosen.

However, the interval [G_min, G_max] must be greater than the interval on which the distribution P(G) is not nil in order to avoid artifacts due to the truncation of this distribution.

Additionally, if the interval on which the distribution P(G) is not nil is a sub-interval of the interval [G_min, G_max] that is too narrow, the details of the distribution P(G) will be weekly resolved during discretization.

It is also essential to define an automatic procedure allowing for the determination of G_minand G_max, such that the user does not waste time selecting the limits of the interval and avoiding a series of trial/error.

We propose three possible approaches to determine G_minand G_max. The first two approaches are based on the calculation of the equivalent log-normal distribution, whilst the third is empirical and based on the representation of ln [S(t)] depending on (t−t₀)²in the same system of axes that is used for the breakdown into cumulants.

Determination of the Interval Based on a Log-Normal Distribution

Equivalent Log-Normal Distribution

A log-normal distribution often allows for a highly accurate description of the size distribution of a polymer or particle sample:

$\begin{matrix} PDF (G) = \frac{1}{G γ \sqrt{2 π}} \exp [- \frac{{(\ln G - β)}^{2}}{2 γ^{2}}] & (25.) \end{matrix}$

Where:

PDF(G) is the probability density that the particles of the sample will have a G value between G and G+dG; and,

β and γ are respectively the average and the standard deviation of the logarithm of the parameter G, ln G.

Although the log-normal distribution may be a poor model for more complex mixtures, the determination of the equivalent log-normal distribution of any mixture is useful to estimate the interval of the values of the parameter G on which the distribution P(G) that is the solution of equation (6) is to be sought.

The probability density PDF (G) depends only on the parameters β and γ. It is possible to determine these two parameters from custom character G_Tand G_Γ.

The definition is:

$\begin{matrix} {〈 G 〉}_{T} = {[\int_{0}^{\infty} G^{- 1} PDF (G) ⅆ G]}^{- 1} & (26.) \\ {〈 G 〉}_{Γ} = \frac{\int_{0}^{\infty} G^{3 / 2} PDF (G) ⅆ G}{\int_{0}^{\infty} G^{1 / 2} PDF (G) ⅆ G} & (27.) \end{matrix}$

By replacing equation (27) in equations (28) and (29), the following is obtained:

$\begin{matrix} β = \frac{1}{3} \ln {〈 G 〉}_{Γ} + \frac{2}{3} \ln {〈 G 〉}_{T} & (28.) \\ γ = \sqrt{\frac{2}{3} \ln \frac{{〈 G 〉}_{Γ}}{{〈 G 〉}_{T}}} & (29.) \end{matrix}$

It is also possible to obtain the equivalent log-normal distribution from first- and second-order cumulants according to the following relations:

$\begin{matrix} β = \ln Γ_{1} - \ln (1 + \frac{Γ_{2}}{Γ_{1}^{2}}) & (30.) \\ γ = \sqrt{\ln (1 + \frac{Γ_{2}}{Γ_{1}^{2}})} & (31.) \end{matrix}$

In conclusion, it is possible to determine the log-normal distribution either from custom character G and G_Γ according to equations (30) and (31) or from Γ₁and Γ₂using equations (32) and (33).

Calculation of the Limits of the Interval Based on the Equivalent Log-Normal Distribution

The G_minet G_maxmay be estimated by replacing the distribution P(G) with an equivalent log-normal distribution.

The objective is for the interval [G_min, G_max] to cover a significant fraction of the log-normal distribution equivalent to the experimental Taylor signal. This fraction of the distribution is yielded by:

ΔQ_G=Q_G(G_max)−Q_G(G_min), (32.)

where Q_G(G) is the cumulative probability defined by:

Q_G(G)=∫₀^GdG′PDF(G′). (33.)

Furthermore, it is preferable for the interval [Q_G(G_min), Q_G(G_max)] to be distributed symmetrically relative to the median value Q_G=½. Thus:

Q_G(G_min)=1−Q_G(G_max) (34.)

In the context of this assumption, equation (36) yields:

$\begin{matrix} \erf (\frac{\ln G_{\max} - β}{\sqrt{2} γ}) = 2 Q_{G} (G_{\max}) - 1 = Δ Q_{G} & (35.) \end{matrix}$

where erf is the error function known to persons skilled in the art.

This results in:

G_max=β+k√{square root over (2)}γ i.e., G_max=exp(β+k√{square root over (2)}γ);G_max=exp(β+k√{square root over (2)}γ), (36.)
and
G_min=β−k√{square root over (2)}γ i.e., G_min=exp(β−k√{square root over (2)}γ);G_min=exp(β−k√{square root over (2)}γ), (37.)

with k=erf⁻¹(ΔQ_G) where erf¹is the inverse error function.

For example, if ΔQ_G=99.53%, then k=2, or if ΔQ_G=99.998%, then k=3.

Empirical Determination of the Interval Based on the Breakdown into Cumulants

For the sake of simplicity, the following notation will be used: x=(t−t₀)².

For a monodispersed sample, ln [s(t)] as a function of x is a line, the gradient of which gives the diffusion coefficient of the species of the sample. That is:

$\begin{matrix} | \frac{\partial \ln s}{\partial x} | = G & (38.) \end{matrix}$

Thus, ∂ ln s/∂x does not depend on x, i.e., it is not time-dependent.

On the other hand, for a polydispersed sample, the curve ln [s(t)] depending on x has a curve that is calculated by determining the derivative ∂ ln s/∂x. This is proportional to the parameter G.

Based on equation (40), it is assumed that G_minis linked to the minimum of the local gradient of ln s (in absolute value), because the species with low diffusion coefficients correspond to low G values, and thus to a slight decrease of the Taylor signal over time. Accordingly, it is assumed that:

$\begin{matrix} G_{\min} = b_{\min} | \frac{\partial \ln s}{\partial x} |_{\min} & (39.) \end{matrix}$

where the minimum of |∂ ln s/∂x| sought on an adapted interval of x, to be determined empirically, and where b_minis a numerical coefficient, also to be determined empirically.

By studying a large number of Taylor signals of samples of all kinds, we have found that the suitable interval of x is that for which the signal s(t) decreases by two decades. This corresponds to an interval in time between the time t₀corresponding to the peak of s(t) and the time t₁such that s(t₁)=S(t₁)/S(t₀)=0.01.

Likewise, it is considered that:

$\begin{matrix} G_{\max} = b_{\max} | \frac{\partial \ln s}{\partial x} |_{\max} & (40.) \end{matrix}$

where the maximum of |∂ ln s/∂x| is sought on the same interval of x.

In analysing an experimental Taylor signal, it is simpler to estimate the characteristic decrease time of s(t). Defining the decrease time as τ=G^−1/2, minimum and maximum decrease times are defined based on the aforementioned relations, according to:

$\begin{matrix} τ_{\min} = {a_{\min} (| \frac{\partial \ln s}{\partial x} |_{\max})}^{- 1 / 2} & (41.) \\ τ_{\max} = {a_{\max} (| \frac{\partial \ln s}{\partial x} |_{\min})}^{- 1 / 2} & (42.) \end{matrix}$

By studying a large number of Taylor signals of samples of all kinds, we have found that the following values of the parameters α_minand α_maxare able to frame the desired min and max values.

α_min=0.1;α_max=3 i.e., b_min=1/9;b_max=100 (43.)

Lastly, the values G_min, G_maxto use in order to adjust the data are calculated according to:

G_min=τ_max⁻² (44.)
G_max=τ_min⁻² (45.)

Calculation of the Distributions According to the Parameter D, R_h, or M Based on the Distribution According to the Parameter G

Once the distribution of the amplitudes P(G) has been obtained, the distribution of the amplitudes P_D(D) according to the diffusion coefficient D, can be easily calculated.

The following equation links the probability distribution P_yof the stochastic variable y to the probability distribution P_xof the stochastic variable x, where x is a function of y:

$\begin{matrix} P_{y} (y) = P_{x} (x) |_{x = x (y)} | \frac{\partial x}{\partial y} |_{x = x (y)} & (46.) \end{matrix}$

with G=12D/(R_c²t₀), the following is obtained:

$\begin{matrix} P_{D} (D) = \frac{\frac{12}{R_{c}^{2} t_{0}} P (G) |_{G = \frac{12 D}{R_{c}^{2} t_{0}}}}{\int_{0}^{\infty} P (G) ⅆ G} & (47.) \end{matrix}$

In this expression, an integral was introduced to the denominator because the distribution P(G) is not necessarily normalized.

It is often desirable to express the polydispersity of the sample in terms of amplitude distribution according to the first hydrodynamic ray R_h, or the parameter of molar mass, M.

These two distributions may be calculated based on the distribution P(G) using equation (46) and the following transformation rules:

$\begin{matrix} G = \frac{2 k_{B} T}{πη R_{c}^{2} t_{0}} R_{h}^{- 1} & (48.) \\ G = \frac{2 k_{B} T}{πη R_{c}^{2} t_{0}} {(\frac{10 π N_{a}}{3 K})}^{1 / 3} M^{- (\frac{1 + a}{3})} & (49.) \end{matrix}$

Equation (48) uses the Stokes-Einstein relation

$D = \frac{k_{B} T}{6 πη R_{h}},$

where k_Bis the Boltzmann constant, T the absolute temperature, and η the viscosity of the eluent.

Equation (49) uses the Einstein equation for the viscosity of a diluted suspension and the Mark Houwink equation linking the intrinsic viscosity [η] to the molar mass according to the relation:

[η]=K M^a (50.)

where K and a are the Mark Houwink coefficients.

The following relation, which gives the hydrodynamic ray as a function of molar mass, may also be used:

$\begin{matrix} R_{h} = {(\frac{3 K}{10 π N_{a}})}^{1 / 3} M^{(\frac{1 + a}{3})} & (51.) \end{matrix}$

where N_ais Avogadro's number and d_f=3/(1+a) is the fractal dimension of the object (e.g., d_f=3 for an ordinary compact object, 2 for a statistical polymer, and 5/3 for a polymer in a good solvent).

Equations (48) and (49) show that G is non-linear in R_hand M, respectively, whilst G is simply proportional to D.

Due to this non-linearity, the transformation of the distribution according to the parameter G identified as the solution of equation (6) results in a distribution according to the parameter R_h, or the parameter M, which does not necessarily observe the constraint of the cost function, in particular the regularity constraint. In most cases, the transform results in fact in the presence of non-physical peaks or oscillations in the distribution according to the parameter R_h, or the parameter M.

This is why, in a variant of the method described in detail above, the method consists of directly seeking the distribution according to the parameter R_h, or the parameter M, which observes the constraint(s) and allows for the correct reproduction of the experimental data, by constrained regularization.

To this end, the experimental Taylor signal is broken down on a family of Gaussians of an adapted parameter: The equation (5) is thus generalized in the form of:

{circumflex over (S)}(t)≡∫₀^∞P(G_(c))G_(c)^c/2exp[−(t−t₀)²G_(c)^c]dG, (52.)

where the three following cases are considered:

1) c=1: to be used when seeking the amplitude distribution according to D;

2) c=−1: to be used when seeking the amplitude distribution according to R_h;

3) c=−1/d_f=−(1+a)/3: to be used when seeking the amplitude distribution according to M;

P_norm(G_(c)) is the distribution P(G_(c)), properly normalized:

$\begin{matrix} P_{norm} (G_{(c)}) = \frac{P (G_{(c)})}{\int_{0}^{\infty} P (G_{(c)}) ⅆ G_{(c)}} . & (53) \end{matrix}$

In case 1), G_(f)=G. Equation (54) then reduces to equation (5). The amplitude distribution P_D(D) is determined based on the amplitude distribution P(G₍₁₎) using the following equation:

$\begin{matrix} P_{D} (D) = \frac{12}{R_{c}^{2} t_{0}} P_{norm} (G_{(1)}) . & (54.) \end{matrix}$

For case 2),

$G_{(- 1)} = \frac{π η R_{c}^{2} t_{0}}{2 k_{B} T} R_{h} .$

The implementation of the constrained regularization algorithm results in the determination of the amplitude distribution P(G₍₋₁₎). The distribution P_R(R_h) is then determined based on P_norm(G₍₋₁₎) according to the relation:

$\begin{matrix} P_{R} (R_{h}) = \frac{π η R_{c}^{2} t_{0}}{2 k_{B} T} P_{norm} (G_{(- 1)}) . & (55.) \end{matrix}$

Lastly, for case 3),

$G_{(- (1 + a) / 3)} = {(\frac{3 K}{10 π N_{a}})}^{(\frac{1}{1 + a})} {(\frac{π η R_{c}^{2} t_{0}}{2 k_{B} T})}^{(\frac{3}{1 + a})} M .$

The implementation of the constrained regularization algorithm results in the determination of the amplitude distribution P(G_(−(1+a)/3)). The distribution P_M(M) is then determined by means of P_norm(G_(−(1+a)/3)) according to the following relation:

$\begin{matrix} P_{M} (M) = {(\frac{3 K}{10 π N_{a}})}^{(\frac{1}{1 + a})} {(\frac{π η R_{c}^{2} t_{0}}{2 k_{B} T})}^{(\frac{3}{1 + a})} P_{norm} (G_{(- 1 (1 + a) / 3)}) . & (56) \end{matrix}$

The manner of selecting the interval [G_(c),min, G_(c),max] on which the distribution P(G_(c)) is to be sought is similar to that described above. In particular, τ_minand τ_maxare calculated according to equations (43) and (44). Lastly, the values G_(c),min, G_(c),maxare determined as follows:

$\begin{matrix} 1) c = 1 : G_{\min} = τ_{\max}^{- 2}, G_{\max} = τ_{\min}^{- 2} . & (57) \\ 2) c = 1 G_{\min} = τ_{\max}^{2}, G_{\max} = τ_{\min}^{2} . & (58) \\ 3) c = - 1 / d_{f} = - (1 + a) / 3 G_{\min} = τ_{\max}^{(\frac{6}{1 + a})}, G_{\max} = τ_{\min}^{(\frac{6}{1 + a})} . & (59) \end{matrix}$

Method for Determining the Size Distribution of a Sample

The method for determining the size distribution of a mixture of particles will now be described by reference to FIG. 2.

The method includes a first step 100 of injecting a sample to be analyzed into the injection section 7 of the experimental device 3.

Then, in step 110, after actuating the means 13 for introducing and circulating an eluent inside the capillary 6, the sample injected is transported from the injection section 7 to the detection section 9 of the experimental device 3. The experimental conditions (nature of the eluent, flow speed of the eluent, transport distance separating the injection section from the detection section, temperature, internal ray of the capillary, etc.) are adapted so that a Taylor dispersion phenomenon will occur that is detectable in the detection section 9. In the experimental examples below, precise experimental conditions are indicated.

In step 120, the sample transported by the eluent passes through the optical cell of the detection section 9. The sensor 17 then generates an electrical measurement signal characteristic of the Taylor dispersion occurring in the sample.

In step 130, the detection signal generated by the sensor 17 is processed by the electronic card 19 so as to deliver a digitized experimental Taylor signal Ŝ(t).

In step 140, the experimental Taylor signal Ŝ(t) is acquired by the computer 5.

It is then analyzed (step 200) by running the software 31 in order to determine a size distribution. The software 31 carries out the following elementary steps.

In step 142, a first adapted menu is presented to the user so that the user may select the parameter according to which the constrained regularization method is to be carried out. The user may thus choose either the diffusion coefficient D (case 1, c=1), the hydrodynamic ray (case 2, c=−1), or the molar mass M (case 3, c=−1/d_f). The user is also asked to choose the number N for the discretization of the distribution sought. In the following, for simplicity, it is assumed that the user chooses the diffusion coefficient D and that the parameter to be taken into account is the parameter G.

In step 144, a second adapted menu is presented to the user so that the user may select the number and nature of the constraints to be taken into account in seeking the distribution that is the solution of equation (5). The proposed constraints to select are, e.g.:

- 1. regularity of distribution;
- 2. the distribution is positive at all points;
- 3. the distribution must result in a predetermined T average plus or minus a deviation to be specified;
- 4. the distribution must result in a predetermined r average plus or minus a deviation to be specified.

In the following, it is assumed that the first constraint is implemented via a Lagrange multiplier in the cost function, whilst the other constraints are implemented directly by appropriately limiting the space of the amplitudes P(G_m) in which an extrema of the cost function is sought.

In step 146, the experimental Taylor signal Ŝ(t) is broken down into cumulants. More specifically, the normalized Taylor signal s(t)=Ŝ(t)/Ŝ(t₀) is first determined, then its logarithm ln [s(t)] is calculated. Lastly, a second-degree polynomial of the variable (t−t₀)²is adjusted to the function ln [s(t)]. The first- and second-order cumulants Γ₁and Γ₂are then determined. Γ₁allows, in particular, for a measurement of the Γ average of the parameter G. Additionally, the T average of the Taylor signal is measured.

In step 148, the limits G_minand G_maxof the interval of the values of the parameter G on which the distribution is sought are calculated based on the equivalent log-normal distribution determined based on the values of the first- and second-order cumulants Γ₁and Γ₂obtained in step 148, using equations (32) and (33) followed by (38) and (39).

In step 150, the cost function H_α is developed from the first constraint selected by the user in step 144. The constraint term associated with the constraint selected is red in the memory of the computer 5.

In step 152, the discretized expression of the cost function H_α is obtained by subdividing the interval G_minand G_maxdetermined in step 154 into N sub-intervals.

In step 154, for each value of the Lagrange coefficient α in a group of test values, the minimum of the cost function H_α is determined. To take into account strict constraints according to which the amplitudes are positive and must result in predetermined T and Γ averages with a predetermined deviation, the minimum of the cost function is sought exclusively on the appropriate subspace of the amplitudes P(G_m) that satisfy these strict constraints.

In step 156, the statistical error ν is determined, and the optimal value α₀of the Lagrange coefficient α is determined by selecting the value of the Lagrange coefficient α, which, in step 156, resulted in the nearest distance term χ₂by values lower than this statistical error ν.

In step 158, the group of distributions P(G) sought is the group of those that minimise the cost function H_α for the optimal value α₀of the Lagrange coefficient determined in step 156.

In step 160, the value related to the size of the particles of the mixture is calculated based on the distribution P(G) obtained in step 158.

Lastly, in step 162, for an adapted transformation, the distributions according to the hydrodynamic ray or molar mass are calculated based on the distribution P(G) obtained in step 158.

If applicable, the various distributions calculated are displayed on the screen of the computer 5. The software 31 includes ‘tools’ allowing the user to carry out the desired calculations on the calculated distributions.

The software 31 thus includes means suited for the execution of each of the steps of the analysis of the experimental Taylor signal.

In one variant, the limits G_minand G_maxof the interval on which the distribution P(G) is sought are calculated empirically. This consists of determining the normalized Taylor signal s(t)=Ŝ(t)/Ŝ(t₀), obtaining its logarithm, and then calculating the derivative |∂ ln s/∂x|. The limits of the interval of interest of the parameter G are finally deduced using equations (43) and (44), followed by equations (46) and (47).

In yet another variant, independent of the preceding variant, the T average of the parameter G, custom character G_T, is calculated by integrating the experimental Taylor signal Ŝ(t) using equation (16), and the Γ average of the parameter G, G_Γ, is calculated based on the determination of the first-order cumulant resulting from the breakdown of the experimental Taylor signal into cumulants. The limits G_min, and G_maxof the interval on which the size distribution is to be sought are calculated based on the T and Γ of the parameter G according to equations (30), (31), and (38), (39). This variant of the method also allows for a constraint term based on one or the other of these averages to be integrated into the cost function.

Constrained Regularization Analysis on a Series of Experimental Taylor Signals of a Single Sample

Adjustment by constrained regularization may be advantageously implemented using several experimental Taylor signals obtained by repeating identical experiments on a group of samples of a single mixture.

Although each repetition may be analyzed independently, and the amplitude distributions obtained may be averaged, it has been found to be more robust to accumulate the various individual Taylor signals into a single global Taylor signal including a number of experimental points equal to the sum of the experimental points of each individual Taylor signal. Secondly, the amplitude distribution is sought on the global Taylor signal by applying the constrained regularization algorithm.

This results in an amplitude distribution P(G) that most closely observes the constraint imposed, e.g., the distribution P(G) is more regular. This also allows the uncertainties and imprecisions affecting the acquisition of the individual Taylor signals to be taken into account.

During this operation, if the reference time t₀is not strictly identical from one experimental Taylor signal to another, the time coordinates are translated such that all experimental Taylor signals have exactly the same reference time t₀.

The correction of the baseline, followed by the normalization of each experimental Taylor signal, may also be necessary before processing.

The software 31 includes a menu allowing users to process several experimental Taylor signals before analysing the global Taylor signal thus obtained.

Advantages

The method just discussed allows the size distribution of a mixture of species, as well as the concentrations of these species in the mixture, to be obtained automatically and in real time, no matter what the polydispersity of the sample analyzed is, i.e., the number of species included in this sample and the respective concentrations thereof.

The fields of application of the device and method described above include the size characterization of polymers, colloids, latex nanomaterials, emulsions, liposomes, vesicles, and molecules or biomolecules in general. One important field of application is the study of the stability/degradation/aggregation of proteins for the pharmaceuticals industry.

The advantages of the characterization of a sample by means of the Taylor dispersion phenomenon are known to persons skilled in the art: Low volume of the sample to be injected into the capillary, no need to calibrate the experimental device, use of an extremely simple experimental device, technique that is particularly well adapted to size measurements of particles smaller than a few nanometers, a signal that is generally sensitive to mass concentration, etc.

EXPERIMENTAL EXAMPLES
Experimental Conditions

Virgin silicon capillary: R_c=50 μm with a distance between the injection and detection sections of 30 cm.

Temperature: T=293° K

Eluent: Sodium borate buffer 80 mM, pH 9.2.

Viscosity of the eluent: η=8.9 10⁻⁴Pa·s.

Sample: Polystyrene sulphonate (PSS) 0.5 g/l.

Injection: 0.3 psi (20 mbar), 9 s, i.e., an injected volume of 8 nl (total capillary volume 589 nl).

UV detection at a wavelength of 200 nm.

Characteristics of the Polymer Standards Injected

$\begin{matrix} 1) PSS 1290 & M_{w} = 1290 g / mol & M_{p} = 1094 g / mol & M_{w} / M_{n} < 1.20 \\ 2) PSS 1590 & M_{w} = 5190 g / mol & M_{p} = 5280 g / mol & M_{w} / M_{n} < 1.20 \\ 3) PSS 29000 & M_{w} = 29000 g / mol & M_{p} = 29500 g / mol & M_{w} / M_{n} < 1.20 \\ 4) PSS 148000 & M_{w} = 145000 g / mol & M_{p} = 148500 g / mol & M_{w} / M_{n} < 1.20 \\ 5) PSS 333000 & M_{w} = 333000 g / mol & M_{p} = 338000 g / mol & M_{w} / M_{n} < 1.20 \end{matrix}$

where M_wis the average molar mass by weight, M_pthe molar mass at the summit of the chromatographic peak, and M_w/M_nis the polydispersity index. The average molar masses and the characteristics of the distribution were given by the supplier, who determines them by steric exclusion chromatography with calibration using polymer standards of the same chemical nature (PSS).

Experimental Taylor Signals of the Polymers Studied

FIG. 3 shows an experimental Taylor signal obtained for an equimassic mixture of PSS 1290 and PSS 29000. In fact, three experimental Taylor signals are aggregated here.

Furthermore, only the left part of the experimental Taylor signal is shown. In fact, generally, taylorgrams are symmetrical. However, in the case of the phenomenon of adsorption to the capillary surface, the right part of the signal, corresponding to the times following the time t₀, may not be exactly symmetrical on the left part of the signal, corresponding to the time preceding the time t₀. In this case, it is preferable to focus the processing of the data on the left part of the experimental Taylor signal. Advantageously, the method described above only takes into account the left part of the signal in order to limit the influence of these possible parasitic phenomena.

FIG. 4 shows the superimposition of the distribution P_R(R_h) obtained by the aforementioned method (software 31) compared to the distribution given by the supplier of the polymers and obtained by a chromatographic method (SEC). FIG. 4 shows good consistency between the hydrodynamic ray distribution given by the supplier and the distribution obtained by running the software 31.

FIG. 5A shows the adjustment between the normalized experimental Taylor signal Ŝ(t) (Data) and the normalized reconstructed signal S′(t) (Fit), and FIG. 5B shows the adjustment between the logarithm of the normalized experimental Taylor signal Ŝ(t)(Data) and the cumulant development (Fit).

Comparison of the Results

Table 1 shows the various averages of the diffusion coefficient D obtained: Directly by breakdown into cumulants (Γ average, column 2) or by integrating the taylorgram (7 average, column 3), and, on the other hand, by running the software 31 (Γ average, column 4 and Taverage, column 5). It should be noted that, in this example, the averages measured directly on the experimental Taylor signal are not used to constrain the deconvolution of the signal, and only a loose regularity constraint and a strict positivity constraint were used.

TABLE 1

1
2
3
4
5

Sample:

custom character

_Γ^a

_T^b

_Γ^c

_T^c

1290
2.534E−06
2.457E−06
2.654E−06
2.595E−06

5190
1.197E−06
1.083E−06
1.197E−06
1.192E−06

29000
4.754E−07
4.222E−07
5.152E−07
5.135E−07

145000
2.014E−07
1.899E−07
2.218E−07
2.201E−07

333000
1.209E−07
1.157E−07
1.413E−07
1.119E−07

1290 + 29000
1.065E−06
8.407E−07
1.900E−06
8.533E−07

1290 + 5190
2.137E−06
1.555E−06
2.082E−06
1.812E−06

5190 + 29000
7.669E−07
6.509E−07
9.237E−07
6.921E−07

^abased on the breakdown into cumulants (equation 25)

^bbased on the integration of the taylorgram (equation 17)

^cbased on the distribution given by the software 31

Overall, the results show great coherence, and the software 31 results in a solution in which the T and Γ averages (columns 4 and 5) are close to the experimental values (columns 2 and 3).

Table 2 shows the values of τ_minand τ_maxdetermined based on the various proposed approaches, i.e., the imperical approach (columns 4 and 5), breakdown into cumulants (columns 6 and 7) based on the cumulants Γ₁and Γ₂(columns 2 and 3), and the approach using the T and Γ averages of the diffusion coefficient (columns 8 and 9 based on the first-order cumulant and the integration of the taylorgram).

TABLE 2

2
3
4^a
5^a
6^b
7^b
8^c
9^c

1
Γ₁
Γ₂
τ_min/α_min
τ_max/α_max
τ_min/α_min
τ_max/α_max
τ_min/α_min
τ_max/α_max

Sample:
s⁻²
s⁻⁴
s^a
s^a
s^b
s^b
en s
en s

1290
6.339E−02
1.231E−05
3.891
4.280
3.679
4.301
3.277
4.914

5190
3.004E−02
5.638E−06
5.554
5.985
5.176
6.471
4.137
8.603

29000
1.189E−02
6.307E−06
7.595
9.052
6.975
12.593
6.408
14.203

145000
5.038E−03
1.651E−06
9.804
13.555
10.195
20.736
10.856
19.016

333000
3.024E−03
5.378E−07
11.279
30.264
13.344
26.235
14.497
23.480

1290 + 29000
2.664E−02
8.047E−05
4.704
8.860
4.067
10.276
3.781
11.622

1290 + 5190
5.210E−02
2.518E−04
4.453
7.740
3.005
6.978
2.540
9.343

5190 + 29000
1.918E−02
2.571E−05
6.625
9.891
5.171
10.784
4.777
12.172

^aempirical approach based on equations (43) and (44)

^bbased on the cumulant breakdown (equations (32-33), (38-39), and (46-47))

^cbased on the T and Γ averages of the diffusion coefficient (equations (17), (25), (30-31), (38-39), and (46-47)).

The orders of magnitude of τ_minand τ_maxare highly consistent no matter what method is considered.

Table 3 compares the average hydrodynamic ray values obtained by breakdown into cumulants (Γ average, column 2), and by running the software 31 by determining the distribution P(G) followed by the Γ T integration (columns 3 and 6), by the reference method by steric exclusion chromatography (columns 4 and 7) following the average in question, by direct integration of the taylorgram on the entire signal (column 5). For the same average (columns 2-4, on the one hand, and column 5-7, on the other hand), the results are homogeneous for all samples considered.

This shows high consistency for all samples for each group of averages in question.

TABLE 3

1
2
3
4
5
6
7

Sample:

\frac{kT}{6 {πη [{〈 D 〉}_{Γ}]}_{cumulant}}

\frac{kT}{6 {πη [{〈 D 〉}_{Γ}]}_{\log iciel 31}}

\frac{kT}{6 {πη [{〈 D 〉}_{Γ}]}_{SEC}}

\frac{kT}{6 {πη [{〈 D 〉}_{T}]}_{Taylor}}

\frac{kT}{6 {πη [{〈 D 〉}_{T}]}_{Logiciel 31}}

\frac{kT}{6 {πη [{〈 D 〉}_{T}]}_{SEC}}

1290
0.968
0.924
0.913^a
0.998^b
0.945^c
1.000^d

5190
2.049
2.048
1.880
2.265
2.057
1.989

29000
5.159
4.761
5.076
5.810
4.776
5.278

145000
12.178
11.057
12.200
12.917
11.144
13.133

333000
20.286
17.359
17.297
21.190
21.917
20.915

1290 + 29000
2.303
1.291
1.319
2.917
2.874
3.449

1290 + 5190
1.148
1.178
1.046
1.578
1.354
1.470

5190 + 29000
3.198
2.655
2.473
3.768
3.544
3.638

^aobtained based on the distribution obtained by SEC.

^bobtained by integrating the taylorgram (based on the variance of the taylorgram).

^cobtained by integrating the weight distribution of the diffusion coefficients obtained by the software 31 (minimisation on D).

^dobtained based on the weight distribution of the hydrodynamic rays originating from the SEC.

Determination of the Time at the Peak

Experimentally, the time at the peak to of the experimental Taylor signal Ŝ(t) is not known with precision due to the measurement noise.

The time at peak t₀affects both a cumulant analysis and the determination of the size distribution obtained by constrained regularization.

Additionally, the cumulant method is based on a limited development for (t−t₀)→0. From an experimental standpoint, it is necessary to choose the range of time t for the analysis wisely: If one limits oneself to a very small interval, the result will be substantially affected by measurement noise. If, on the other hand, too wide an interval is considered, the contribution of the higher-order (t−t₀) terms, which are ignored in the cumulant method, will be significant.

A step for determining a peak time t₀, as well as an optimal range of times suitable to a cumulant analysis, is shown below.

In a first sub-step, a first estimate t_0,guessof the peak time is obtained, e.g., by considering the time for which Ŝ(t) is at its maximum or by adjusting the peak of Ŝ(t) by means of a parabolic or Gaussian function.

In a second sub-step, a list of N peak times to be tested t_0,iis established, with the natural integer i varying between 1 and N, where the times t_0,iare around t_0,guessand regularly spaced by a constant time increment, with:

t_0,1<t_0,2< . . . t_0,N;
t_0,1+1=t_0,i+dt; and
t_0,1=t_0,guess−Δt,t_0,N=t_0,guess+Δt;

where

dt is the time increment between two consecutive tested peak times; and

Δt is a time interval typically on the order of t_0,guess/50.

In a third sub-step, for each of the peak times to be tested t_0,i, a series of cumulant analyses is carried out taking into account various ranges of time of differing lengths.

The time range is, e.g., at a cutoff level of the signal Ŝ(t). For example, for a cutoff level of 0.1, the time range t is considered such that Ŝ(t)>0.1×Ŝ(t_0,i). The value of the first and second cumulant resulting from the adjustment on each range of time is noted.

In a fourth sub-step, the optimal peak time t₀is determined as being between the peak times for which the first cumulant Γ₁diverges towards positive values when the cutoff level increases, and those for which the first cumulant Γ₁diverges towards negative values when the cutoff level increases.

Tracing on a graph, for each peak time to be tested t_0,i, the curve of the first cumulant Γ₁as a function of the cutoff level, the optimal peak time t₀is that for which the curve is located between upward concave curves and downward concave curves. This curve has a smaller variation than the others.

The choice of optimal peak time is made by visual analysis of the aforementioned graphics or automatically, e.g., based on the sign of the second numerical derivative of the first cumulant Γ₁as a function of the cutoff.

Alternatively or optimally, it is possible to do the same with the second cumulant Γ₂and/or the square of the ratio of the second cumulant to the first cumulant. By doing it simultaneously for the first cumulant, the second cumulant, and the square of the ratio of the second cumulant to the first cumulant, the choice of peak time may be made more reliable.

In a fifth step, the optimal cutoff value is determined as the one that is the highest before the data show a significant deviation relative to their general tendancy due to the influence of measurement noise for very high cutoff levels.

Number	Name	Date	Kind
7099778	Chien	Aug 2006	B2
8118986	Cottet	Feb 2012	B2
20110264380	Cottet et al.	Oct 2011	A1

Method for determining the size distribution of a mixture of particles using Taylor dispersion, and associated system

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

Field of Search

US

CPC

International Classifications

Term Extension

Abstract

Description

Claims

Priority Claims (1)

PCT Information

US Referenced Citations (3)

Non-Patent Literature Citations (4)

Related Publications (1)

Entry
Cottet, et al. 2007 “Determination of dendrigraft poly-L-lysine diffusion coefficients by Taylor dispersion analysis” Biomacromolecules, 8(10); 3235-3243.
Cottet, et al. 2007 “Taylor dispersion analysis of mixtures” Analytical Chemistry 79(23); 9066-9073.
Kelly, B. and Leaist, D.G. 2004 “Using Taylor dispersion profiles to characterize polymer molecular weight distributions” Physical Chemistry Chemical Physics 6(24); 5523-5530.
Schure, 1999 “Advances in the theory of particle size distributions by field-flow fractionation—outlet and apparent polydispersity at constant field” Journal of Chromatography 831(1); 89-104.