Glatiramer acetate (GA), a complex heterogeneous mixture of synthetic polypeptides, has been approved as an immunomodulatory drug by the US Food and Drug Administration (FDA) for the treatment of relapsing-remitting multiple sclerosis, the most common disabling neurological disorder of young adults.
Glatiramer acetate (GA) is the active ingredient of COPAXONE® (Teva Pharmaceutical Industries Ltd., Israel), comprises the acetate salts of a synthetic polypeptide mixture containing four naturally occurring amino acids: L-glutamic acid, L-alanine, L-tyrosine, and L-lysine, with a reported average molar fraction of 0.141, 0.427, 0.095, and 0.338, respectively. The average molecular weight of COPAXONE® is between 4,700 and 11,000 daltons. In controlled clinical trials, Copaxone has been demonstrated to have a 75% reduction in relapse rate over 2 years and significantly reduce progression of disability in multiple sclerosis with long-term efficacy, safety, and tolerability. The extensive use and relatively high cost of Copaxone leads to an evolving need for development of other generic versions of GA to increase affordability and access to this medication.
GA is one kind of non-biological complex drugs (NBCDs). Over the years, a robust regulatory system for development of generic versions of small molecule medicines, which can be fully identified and characterized, has been well-established using the concept of pharmaceutical equivalence and bioequivalence. However, the regulatory policies and analytical approaches for biologicals and NBCDs remain under development. Since NBCDs are usually synthesized complex macromolecules/mixtures that cannot be fully characterized, they are suggested to be evaluated based on the “similarity” with the reference listed drug, like “biosimilar approaches” for biologicals.
More than 1036 possible theoretical sequences exist in GA, which makes its components neither fully identifiable nor quantifiable even by the up-to-date analytical techniques. Therefore, no two GA can ever be proved “identical”. Various chemical analyses, including molecular mass distribution profiling by gel permeation chromatography, peptide mapping by capillary electrophoresis, relative amino acid levels at the N-termini by Edman degradation, secondary structure characterization by circular dichroism, and proteolytic digests profiling by reverse-phase high-performance liquid chromatography (RP-HPLC), have been conducted to compare GA drugs.
The present invention developed a hypothesis testing approach to analyze the high-dimensional LC-MS data to assess the extent of similarity between a reference drug and generics. One characteristic of our proposed hypothesis testing approach is to consider the differences in all data points between two sample groups. Besides, additional resampling technique can introduce robust inference procedures, even for a small number of samples. These characteristics lead to the robust results obtained from this approach.
The dash lines on
The phrase “hypothesis testing” as used herein refers to a statistical test used to determine whether the hypothesis assumed for the sample of data stands true for the entire population or not.
The expression “non-biological complex drugs (NBCDs)” as used herein refers to a type of drug with following properties: a) encompassing a complex multitude of closely related structure; b) the properties cannot be fully revealed by physicochemical analysis; c) the entire multitude is the active pharmaceutical ingredient, and d) the consistent, rigorously controlled manufacturing process is essential to reproduce the product.
“Random copolymer drugs” as used herein refers to a drug that is generated from coplymerization process based on the reaction kinetics of chemicals or monomers.
“Polypeptide mixture” as used herein refers to a mixture contains various polypeptides.
“Copolymer mixture” as used herein refers to a mixture containing coplymer.
“Polypeptides” as used herein refers to peptides with short chains of amino acid monomers linked by peptide (amide) bonds.
“Compelex organic molecule” as used herein refers to a polymer-like molecule. Listed reference drug or generic version is digested by Lys-C and followed by UPLC/HILIC-MS analysis. Features in the LC-MS data, identified by the software, such as Progenesis QI for Proteomics software, that can be matched to values in the in-house database were considered to be potential active ingredient of drugs and were further submitted to one developed hypothesis testing approach, sum of squared deviations test, which can process these high-dimensional LC-MS data and evaluate the similarity/difference between sample groups.
The present invention has developed a hypothesis testing approach to assess the similarity between samples. Before performing hypothesis testing on the data points, data points are resampled by resampling technique such as bootstrapping, to regenerate the data points based on the assumption that a statistic can best be assessed by referencing the data it is derived from and is typically used to assess the stability of a statistic or estimate. After obtaining the resampled dataset, we developed a statistical hypothesis testing to compare two datasets. The null hypothesis (H0) is assumed to be that there are differences between two data sets. The alternative hypothesis (Ha) is assumed to be that there is no difference between two data sets, which we conclude when H0 is rejected. This strategy is to perform hypothesis testing on LC-MS data to determine the similarity/difference of potential active ingredients between two random copolymer drugs, such as peptide drugs. It can also be used to quickly check the lot-to-lot variation in the production process. In principle, this approach also can be applied to non-biological complex drugs (NBCDs) sharing the same characteristics that consist of a multitude of closely related structures, and their properties cannot be fully characterized by physicochemical analysis.
Random copolymer drugs are classified as one kind of non-biological complex drugs (NBCDs) defined as: a) encompassing a complex multitude of closely related structure; b) the properties cannot be fully revealed by physicochemical analysis; c) the entire multitude is the active pharmaceutical ingredient and d) the consistent, rigorously controlled manufacturing process is essential to reproduce the product. Over the years, a robust regulatory system for development of generic versions of small molecule medicines, which can be fully identified and characterized, has been well-established using the concept of pharmaceutical equivalence and bioequivalence. However, the regulatory policies and analytical approaches for biologics and NBCDs remain under development. Since NBCDs are mostly synthesized complex macromolecules/mixtures whose total chemical structure cannot be fully characterized, they are suggested to be evaluated based on the “similarity” with the reference-listed drug, such as “biosimilar approaches” for biologics. No two copolymer drugs can ever be proved “identical”. Various chemical analyses, including molecular mass distribution profiling by gel permeation chromatography, peptide mapping by capillary electrophoresis, relative amino acid levels at the N-termini by Edman degradation, secondary structure characterization by circular dichroism, and proteolytic digests profiling by reverse-phase high-performance liquid chromatography (RP-HPLC), have been conducted to compare glatiramer acetate (GA) drugs. Recently, FDA agency has proposed a molecular fingerprinting approach, including liquid chromatography coupled with mass spectrometry (LC-MS), nuclear magnetic resonance (NMR), and asymmetric field flow fractionation coupled with multi-angle light scattering (AFFF-MALS), to distinguish analytical differences between complex mixtures of peptide chains from GA and non-GA compounds. The study also evaluated the methods' ability to detect analytical differences in the mixtures by applying the statistical analyses to the MS and AFFF-MALS data. However, in that approach, the number of data points (266) was too low to meet these (>1000) suggested by the FDA.
Copolymer-1 (20 mg, purchased from Sigma-Aldrich (St. Louis, Mo.)) or GA (20 mg, ScinoPharm Taiwan Ltd.) was dissolved in 1 mL mannitol (40 mg/mL) at the same concentration as Copaxone, and 7 replicate samples of Copolymer-1 or GA were prepared from 30 μL of the solution. Ten samples were prepared from 30 μL of each lot of Copaxone. For digestion, 45 μL of distilled deionized water (ddH2O), 18 μL of ammonium bicarbonate (24 mg/mL, adjusted pH 8.40), and 15 μL of Lys-C (0.2 g/L) were added to each sample. These samples were incubated at 37° C. for 16 hours in a water bath. After incubation, 10 μL trifluoroacetic acid (0.1%, v/v) and 118 μL acetonitrile (100%) were added to stop the reaction. These samples were filtered through a hydrophilic polyvinylidene fluoride membrane filter with pore size 0.22 μm (Millipore, Billerica, Mass.). Before UPLC-MS analysis, the samples were stored at −20° C.
High-Dimensional LC-MS Data Generated from Copolymer-1 Samples
The LC/MS data of the 7 replicate samples from 2 different sources of Copolymer-1 runs, including Copolymer-1 samples and negative control (NC), both look similar (
Evaluation of Similarity by Hypothesis Testing
A statistical hypothesis test is a method of statistical inference and commonly applied to comparison of two or more data sets. In the test method, the statistical hypothesis is a testable hypothesis that is based on the basis of observing a process that is modeled via a set of random variables. We developed a hypothesis testing approach to analyze the high-dimensional LC-MS data to assess the extent of similarity between the reference drug and generics. One characteristic of our proposed hypothesis testing approach is to consider the differences in all data points between two sample groups.
To first evaluate the feasibility of this approach, 10 lots of Copaxone were randomly separated into two groups with 5 lots each and their data points were used for the developed sum of squared deviations test. The was {circumflex over (ρ)}(95%) (p-value<0.01) showing that H0 was rejected and different lots of Copaxone were significantly similar (
A shown in these examples, we developed a hypothesis testing approach on the multivariate (high-dimensional) LC-MS data to assess the extent of similarity between the Copaxone and generics with statistically significance. The statistical significance is used to determine the difference between two groups with probability. In other words, the sameness of profiles between two groups can be determined based on a user setting value.
This application claims priority from U.S. Provisional Patent Application Ser. No. 62/726,342, which was filed on Sep. 3, 2018. The entire content of this provisional application is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
62726342 | Sep 2018 | US |