The present invention relates to compounds, reagent combinations and methods useful in isolating and/or characterizing biomolecules or biomolecule fragments, in particular proteins or protein fragments of complex mixtures such as cell lysates or living cells.
Proteins form the important structural and functional machinery of the cell, and are the molecular entities with which nearly all of today's marketed drugs interact. Proteins are thus drug targets. Furthermore, identifying “non-target” proteins with which the drug interacts to trigger side effects has been especially elusive. It is believed that side effects of many drugs could be diminished with a greater understanding of the mechanism of action involving their target and the non-target proteins, in particular by identifying the drug binding site and/or the cross link site, more particularly the drug binding site, of such proteins.
Thus, there is a need to reduce time and costs of drug development by
There is also a need to develop technologies for analysis of the proteome that allow scaling up to industrial levels with the features of an industrial process: high accuracy, reproducibility and flexibility in that the process is high-throughput, automatable and cost-effective. There is a need to develop technologies that permit probing and identification of proteins in their native conformation using automated protocols and systems therefore.
However, these needs are not only limited to drug development. Many other bioactive small compounds (e.g. natural products or herbicides etc.) interact with proteins. A deeper understanding of such an interaction is essential.
Tri-functional compounds for selective reaction with proteins are known from the WO 2004/064972 A2. These so called capture compounds described therein comprise three essential functionalities X, Y and Q linked through a central core Z. These functions impart the selective affinity targeting of the capture compound for certain target proteins (selectivity function or moiety Y), the light-mediated covalent linkage of the capture compound to the target protein (reactivity function or moiety X), and the subsequent removal or isolation of the capture-compound-protein complex from the mixture by a sorting function Q.
Different methods for analyzing or identifying the isolated (“captured”) proteins can be employed. For high-throughput analysis, mass spectrometry has proven effective. The combination of capture compound isolation and mass spectrometric analysis is referred to as CCMS (Capture Compound Mass Spectrometry). The Capture Compound Mass Spectrometry (CCMS) is a novel platform technology combining two basic technologies, named photo-crosslinking of proteins with small, reactive molecules and liquid chromatography-mass spectrometry (LC-MS).
The isolation step in CCMS involves the equilibration of the capture compound with a complex mixture of proteins, for example a cell homogenate, a cell lysate, an isolated organ, a specific subcellular fraction/organelle, such as nuclei and mitochondria or living cells with intact cellular architecture, or an organelle. This first contacting step is performed in the absence of light able to activate the reactivity function X. The capture compound selectively associates (non-covalently) with certain proteins to which the selectivity function has a high affinity. One example is a capture compound having a pharmaceutical drug as Y, which will then specifically interact with drug target proteins and, in addition, any protein that the drug does also interact with specifically with high affinity but is not designed to do so (“off-targets”). Once equilibration has occurred, the sample is irradiated with light of a wavelength sufficient to activate the reactivity function X, which immediately reacts with any suitable neighboring group, effectively linking the target or off-target protein to the capture compound. The exact position in the amino acid sequences of target and off-target proteins, to which the capture compound is covalently linked, is, if precisely identified, informative on the small molecule (drug) binding site of the target or off-target protein itself.
Subsequently, the capture compound, with any attached protein, is removed from the mixture by contacting the mixture with a ligand for the sorting function that allows for removal of Q-containing molecules by attachment to a surface. One prominent example of Q is biotin, which can mediate removal of biotinylated molecules via highly stable association to streptavidin bound to magnetic particles or surfaces (“matrices”).
During CCMS, matrices to which captured proteins are attached, bind proteins non-specifically to a certain extent. These non-specifically bound proteins form a background “noise” appearing also in control experiments, potentially drowning out signals from proteins that have bound specifically in small quantity.
The WO 2012/156377 A1 describes a possibility to reduce a potential drowning of signals. Aside from the three essential functionalities X (reactivity function), Y (selectivity function) and Q (sorting function), which are linked by linker moieties S to a central core Z, the capture compound comprises further a cleavable function F (which is cleavable by light of a certain wavelength) in the proximity of Q. Thus, the captured proteins can be removed and separated from the matrices (sorting function Q) prior to an analysis. Thus, the background “noise” from said non-specifically bound proteins can be reduced.
However, the remaining capture compound, to which the proteins are attached, comprises still a significant size, due to the remaining reactivity function X, selectivity function Y and the remaining necessary linkers S connected to the before mentioned functions. If the analysis is achieved by mass spectrometry, “bigger” molecules will generally complicate an analysis, since said “big” molecules are more prone to collapse into a multiplicity of fragments than “smaller” molecules, which will hinder the analysis and identification. In particular, the determination of the respective crosslink site is complicated or even—depending on the molecule and captured protein—impossible. In addition, the remaining part of the capture compound after cleavage and cross linked to the affinity-captured protein should be stable under mass spectrometric detection or fragmenting in a reproducible way to only a few defined fragment ions permitting identifying the crosslink peptide after protein digestion and mass spectrometry.
The objective of the present invention is to provide methods, compounds and reagents to overcome the stated problems of the state of the art. This objective is attained by the subject-matter of the independent claims.
As used herein the term “characterising,” refers to the identification of captured target proteins or protein fragments (or biomolecules or biomolecules fragments) as well as to the determination or identification of certain functionalities or aspects of the captured proteins (or protein fragments), in particular the determination or identification of the respective cross-link site of said target or off-target proteins (protein fragments).
As used herein the term “reactivity function” refers to a moiety that can be stimulated (activated) to generate a reactive species such as a nitrene, a carbene or a radical by irradiation such as visible light or UV light. Such a reactive species is capable of quickly forming a covalent bond with a range of suitable partners, for example by addition to a C═C— double bond, or by insertion into a O—H, S—H, N—H or C—H bond. In other words, the reactivity function includes groups that specifically react or interact after activation with functionalities on the surface of a protein such as hydroxyl, amine, amide, sulfide and carboxylic acid groups or interacts with the active site of enzymes yielding covalent bond between the target or off-target protein and the reactivity function. Those skilled in the art can select from a library of functionalities to accomplish this reaction.
As used herein the term “activated function” refers to a structural element of the reactivity function derived from activation of the reactivity function by irradiation and the subsequent forming of a covalent bond to the target compound (proteins or protein fragments).
As used herein the term “selectivity function” refers to a variety of groups, which are capable of interacting non-covalently (selective interaction) with target compounds (proteins) based on affinity. The selectivity function in principle may be any small molecular moiety with a highly selective affinity to a target (or off-target) protein (or protein fragment) in the micromolar, nanomolar or sub-nanomolar range resulting in a functional enrichment based on the selective affinity of the selectivity function to the respective proteins in the protein mixture.
As used herein the term “drug binding site” is the site of a target compound (in particular a protein), which comprises the selective interaction of the selectivity function Y with target compound (in particular the protein).
As used herein the term “cross link site” refers to the site of a target compound (in particular a protein), which comprises the covalent bonding of the reactivity function X with the respective target compound (in particular the protein). Depending on the used capture compound—due to the flexible nature of the capture compound—there may be several possible cross linking sites on the target compound, which are situated—due to the structure of the capture compound—in proximity to the drug binding site.
As used herein the term “cleavable function” refers to a cleavable bond or moiety that is cleaved or cleavable under the specific conditions, such as chemically, enzymatically or by irradiation. A cleavable function is a moiety that can be selectively cleaved without affecting or altering the composition of the other portions of the respective compound of interest. For example, a cleavable moiety of the compounds provided herein is one that can be cleaved by chemical, enzymatic, photolytic, or other means without affecting or altering the composition (e. g. the chemical composition) of a captured protein. Non limiting examples are diarylazo, dithio or silyloxy groups. These moieties can be chemically cleaved in the presence of biopolymers using either reducing agents (e.g. sodium dithionite for azo groups, organothiols for disulfide bridges) or fluorides (for silyloxy groups). A function cleavable by irradiation may also be used. However, in this case, the cleavable function has to be chosen in such a way, that the necessary wavelength for the cleaving step is significantly different from the necessary wavelength for the activation of the reactivity function. As used herein the term “fragment part” of the cleavable function refers to the remaining structural element on the target site after a cleaving step was applied.
As used herein the term “bioactive” refers to a compound which is capable of interaction with or effect on any living matter, in particular cells and tissue in the human body or any other living being. Under this definition falls also a pharmacological activity, which refers to compounds with beneficial or adverse effects on any living matter, in particular to cells and tissue in the human body, animals, insects, plants, etc. Bioactive compounds include—by way of non-limiting example—pharmaceutical drugs to treat humans and animals, insecticides, herbicides, pesticides and fungicides.
As used herein the term “small molecule” refers to a moiety of a molecular mass of less than 1500 Daltons, in particular a moiety of a molecular mass of less than 1000 Daltons, more particularly a moiety of a molecular mass of less than 500 Daltons.
As used herein the term “pharmaceutical drug” refers to a compound, which has properties for use in the medical diagnosis, cure, treatment, or prevention of disease, in particular any compound which may be used in view of restoring, correcting or modifying physiological functions by exerting a pharmacological, immunological or metabolic action, or to making a medical diagnosis
As used herein the term “drug metabolite” refers to any compound that is formed after transformation of a pharmaceutical drug following its metabolism in the body that results in a different molecule that may be responsible for adverse side (off-target) effects.
As used herein the term “prodrug” refers to a compound, which is formulated deliberately so it will break down inside the body to form the active drug.
As used herein the term “drug fragment” refers to a portion or moiety of a pharmaceutical drug, a drug development candidate or a drug metabolite or a prodrug.
As used herein the term “natural product” refers to a compound produced by a living organism and which comprises a pharmacological or biological activity that can be of therapeutic benefit in treating diseases.
As used herein the term “macroscopic carrier” refers to an element, which can be separated due to its physical properties (e.g. by centrifugation, filtering, use of a magnetic field etc.) from a mixture of components and comprises a non-gaseous, non-liquid material having a surface. Thus, a macroscopic carrier can be—by way of non-limiting example—a flat surface constructed, for example, of glass, silicon, metal, plastic or a composite; or can be in the form of a bead such as a silica gel, a controlled pore glass, a magnetic or cellulose bead; or a pin, including an array of pins suitable for combinatorial synthesis or analysis, a hollow fibre, a microtiter plate.
As used herein the term “spacer moieties” or “spacer moiety” refers to a covalently connected straight chain or a ring structure containing carbon, phosphorous, sulphur, silicon, nitrogen and/or oxygen atoms connecting one moiety to another moiety providing a certain distance between these moieties. The spacer moieties may comprise further substituent groups, which are not involved in providing said distance.
As used herein the term “linking function” refers to a functional group capable of forming selectively a covalent bond with another functional group under reaction conditions, which are not leading to a covalent reaction of said linking functions with natural occurring polypeptides, in particular with proteins. Thus, the two linking function allow to covalently connect one moiety comprising on linking function with another moiety comprising the other linking function without a reaction with the respective target compounds (proteins). Non-limiting examples are compounds, which are used in the so called “click chemistry”.
As used herein the term “biological sample” refers to compositions containing biomolecules (in particular proteins) to be isolated and identified. For the purposes herein, sample refers to anything which can contain a target compound (in particular a protein). Thus, biological samples include (e. g. any material obtained from a source originating from a living being (e. g. human, animal, plant, bacteria, fungi or protist). The biological sample can be in any form, including solid materials (e. g. tissue, cell pellets and biopsies, tissues from cadavers) and biological fluids (e. g. urine, blood, saliva, amniotic fluid and mouth wash (containing buccal cells).
As used herein the term “biomolecules” refers to any molecule that is produced by a living organism, including large macromolecules such as proteins, polysaccharides, lipids, and nucleic acids.
As used herein the term “digestive step” refers to a procedure, in which a protein is out enzymatically into a limited number of defined shorter fragments (peptides). These fragments allow for the identification of the protein with their characteristic mass and pattern. The serine protease trypsin can be used—by way of a non limiting example—as an enzyme.
As used herein the term “substituted” refers to the addition of a substituent group to a parent compound.
As used herein the term “substituent group” can be protected or unprotected and can be added to one available site or to many available sites in a parent compound. Substituent groups may also be further substituted with other substituent groups and may be attached directly or by a linking group such as an alkyl, an amide or hydrocarbyl group to a parent compound. “Substituent groups” amenable herein include, without limitation, halogen, oxygen, nitrogen, sulphur, hydroxyl, alkyl, alkenyl, alkynyl, acyl (—C(O)Ra), carboxyl (—C(O)ORa), aliphatic groups, alicyclic groups, alkoxy, substituted oxy (—ORa), aryl, aralkyl, heterocyclic radical, heteroaryl, heteroarylalkyl, amino (—N(Rb)(Rc)), imino (═NRb), amido (—C(O)N(Rb)(Rc) or —N(Rb)C(O)Ra), hydrazine derivates (—C(NH)NRaRb), triazole, tetrazole (CN4H2), azido (—N3), nitro (—NO2), cyano (—CN), isocyano (—NC), cyanato (—OCN), isocyanato (—NCO), thiocyanato (—SCN); isothio-cyanato (—NCS); carbamido (—OC(O)N(Rb)(Rc) or —(Rb)C(O)ORa), thiol (—SRb), sulfinyl (—S(O)Rb), sulfonyl (—S(O)2Rb), sulfonamidyl (—S(O)2N(Rb)(Rc) or —N(Rb)S(O)2Rb) and fluorinated compounds —CF3, —OCF3, —SCF3, —SOCF3 or —SO2CF3. Wherein each Ra, Rb and Rc is, independently. H or a further substituent group with a preferred list including without limitation, H, alkyl, alkenyl, alkynyl, aliphatic, alkoxy, acyl, aryl, heteroaryl, alicyclic, heterocyclic and heteroarylalkyl.
As used herein the term “alkyl,” refers to a saturated straight or branched hydrocarbon moiety containing up to 20, particularly up to 10 carbon atoms. Examples of alkyl groups include, without limitation, methyl, ethyl, propyl, butyl, isopropyl, n-hexyl, octyl, decyl and the like. Alkyl groups typically include from 1 to about 20 carbon atoms, particularly from 1 to about 10 carbon atoms.
As used herein the term “aliphatic chain” refers to a straight or branched chain of carbon atoms derived from alkanes, alkenes, and alkynes. As used herein the term “heteroalkylchain” refers to an aliphatic chain in which at least one carbon atom is replaced with an oxygen, a nitrogen or a sulphur atom.
As used herein the term “cycloalkyl” refers to an interconnected alkyl group forming a ring structure containing 3 to 8, particularly 5 to 6 carbon atoms. Examples of cycloalkyl groups include, without limitation, cyclopropane, cyclopentane, cyclohexane or cyclooctane and the like. Cycloalkyl groups typically include from 5 to 6 carbon atoms (C3-C6 cycloalkyl). The term “unsaturated cycloalkyl” refers to a cycloalkyl group comprising at least one double or triple bond. Cycloalkyl groups as used herein may optionally include further substituent groups.
As used herein the term “unsaturated heterocycle” refers to cycloalkyl compounds in which at least one carbon atom is replaced with an oxygen, a nitrogen or a sulphur atom forming a ring structure. Said ring structure comprising at least one double or triple bond. Unsaturated heterocycle groups as used herein may optionally include further substituent groups.
As used herein the term “alkenyl,” refers to a straight or branched hydrocarbon chain moiety containing up to 20 carbon atoms and having at least one carbon-carbon double bond. Examples of alkenyl groups include, without limitation, ethenyl, propenyl, butenyl, 1-methyl-2-buten-1-yl, dienes such as 1,3-butadiene and the like. Alkenyl groups typically include from 2 to about 20 carbon atoms, more typically from 2 to about 10 carbon atoms. Alkenyl groups as used herein may optionally include further substituent groups.
As used herein the term “alkynyl,” refers to a straight or branched hydrocarbon moiety containing up to 20 carbon atoms and having at least one carbon-carbon triple bond. Examples of alkynyl groups include, without limitation, ethynyl, 1-propynyl, 1-butynyl, and the like. Alkynyl groups typically include from 2 to about 20 carbon atoms, more typically from 2 to about 10 carbon atoms. Alkynyl groups as used herein may optionally include further substituent groups.
As used herein the term “aryl” refers to a hydrocarbon with alternating double and single bonds between the carbon atoms forming a ring structure (in the following also “aromatic hydrocarbon”). The term “heteroaryl” refers to aryl compounds in which at least one carbon atom is replaced with an oxygen, a nitrogen or a sulphur atom. Aryl or hetero aryl groups as used herein may optionally include further substituent groups.
As used herein the term “protein capture compound PCC” refers—due to simplicity reasons—not only to a capture compound CC comprising an interaction with a protein P. as discussed below, but also to a capture compound CC comprising an interaction with a biomolecule BM, as depicted for example in formula 5. If not stated otherwise a “protein capture compound PCC” comprises the structural elements BM or P. however, P is preferred. The same applies to the “protein fragment capture compound PFCC”.
According to a first aspect of the invention provided herein is a process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments, comprising the steps:
E-Siv—B formula 3,
T-X′—S0—F′ formula 4
Any process of the invention is an ex-situ process.
According to a second aspect of the invention provided herein is a compound, particularly a compound for carrying out a process according to the first aspect of the invention, of the general formula 3,
E-Siv—B formula 3
wherein
According to a third aspect of the invention provided herein is a compound, particularly a compound for carrying out a process according to the first aspect of the invention, of the general formula 2,
wherein
According to a fourth aspect of the invention provided herein is a compound, particularly a compound for carrying out a process according to the first aspect of the invention, of the general formula 6,
wherein
According to a fifth aspect of the invention provided herein is a compound, particularly a compound for carrying out a process according to the first aspect of the invention, of the general formula 4,
T-X′—S0—F′ (formula 4),
in particular of the general formula 4b or 4b′, more particularly 4b′,
BMF-X′—S0—F′ (formula 4b),
PF—X′—S0—F′ (formula 4b′),
wherein
According to a sixth aspect of the invention provided herein is a combination of reagents for carrying out a process according to the first aspect of the invention, in particular a combination of reagents for carrying out a process according to at least one of the claims 1 to 12, comprising a first compound according to the second aspect of the invention, in particular according to claim 13, and a second compound according to the third aspect of the invention, in particular according to claim 14.
According to the first aspect, the invention relates to a process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments, comprising the steps:
E-Siv—B formula 3,
T-X′—S0—F′ formula 4
In some embodiments, the provision of an immobilized compound ICa, wherein T is at least one biomolecule BM or at least one protein P, is achieved by means of the following steps:
In some embodiments, the provision of an immobilized compound ICa, wherein T is at least one biomolecule BM or at least one protein P, is achieved by means of the following steps:
In some embodiments, a compound of the formula 1a or 1a′ is provided, in particular according to one of the previously discussed steps, and
BMF-X′—S0—F′ formula 4b,
PF—X′—S0—F′ formula 4b′,
In some embodiments, a compound of the formula 1a or 1a′ is provided, in particular according to one of the previously discussed steps, and a division step is carried out, in which the cleavable function F is cleaved, producing a division compound DCa of the general formula 4a or 4a′, in particular 4a′,
BM-X′—S0—F′ (formula 4a),
P—X′S0—F′ (formula 4a′)
wherein F′, S0 and X have the meanings defined previously, and BM is the biomolecule or P is the protein which is bound to the division compound DCa of formula 4a or 4a′ by the function X′, and subsequently an isolation and/or a characterisation step of said division compound DCa of formula 4a or 4a′ is carried out.
In some embodiments, a compound of the formula 1a or 1a′ is provided, in particular according to one of the previously discussed steps, and a division step is carried out, in which the cleavable function F is cleaved, producing a division compound DCa of the general formula 4a or 4a′, and subsequently the at least one biomolecule BM or the at least one protein P of the compound of the formula 4a or 4a′ is broken down into individual fragment parts in a digestive step, producing a division compound DCb of the general formula 4b or 4b′, in particular 4b,
BMF-X′—S0—F′ (formula 4b),
PF—X′—S0—F′ (formula 4b′), and subsequently
an isolation and/or a characterisation step of said division compound DCb of formula 4b or 4b′ is carried out, wherein F′, S0 and X′ have the meanings defined previously, and BMF and PF are the fragment parts of the at least one digested biomolecule BM or the at least one digested protein P which are bound to the division compound DCb of formula 4b or 4b′ by the function X′ or a characterisation step of said division compound DCb of formula 4b or 4b′ and the further fragment parts of the at least one digested biomolecule BM or the at least one digested protein P. which are not bound to the immobilising compound ICb of formula 1b or 1b′ is carried out
In some embodiments, the process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments comprises
The steps of the previous discussed process are depicted in scheme 1.
In some embodiments, the process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments comprises
The steps of the previous discussed process are depicted in scheme 2.
In some embodiments, the process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments comprises
In some embodiments, the process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments comprises
The steps of the previous discussed process are depicted in scheme 3.
In some embodiments, the process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments comprises
The steps of the previous discussed process are depicted in scheme 4.
In some embodiments, the process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments comprises
The steps of the previous discussed process are depicted in scheme 5.
In some embodiments, the process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments comprises
In some embodiments, the process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments comprises
In some embodiments, an immobilized compound ICb of general formula 1b or 1b′ is provided by means of the following steps:
and subsequently
and subsequently
In some embodiments, the process for isolating and/or characterising biomolecules and/or biomolecule fragments, in particular proteins and/or protein fragments comprises
The steps of the previous discussed process are depicted in scheme 6.
If not stated otherwise, any of the embodiments described below can be combined with all of the above mentioned embodiments of the process according to the invention.
In some embodiments, the target compound comprises, in particular essentially consist of, a biomolecule BM or a biomolecule fragment BMF, more particularly a protein P or a protein fragment PF selected from the group of membrane, fibrous and/or globular proteins, in particular from the group of membrane proteins and cytosolic proteins or proteins derived from subcellular fractions such as nuclei or mitochondria.
The selectivity function Y comprises an affinity to the (non/off-) target protein(s) P (or a biomolecule BM). The affinity is characterized by a dissociation constant below 10 mmol/l. In some embodiments, the dissociation constant is below 1 mmol/l. In some embodiments, the dissociation constant is below 100 μmol/l. In some embodiments, the dissociation constant is below 10 μmol/l. In some embodiments, the dissociation constant is below 1 μmol/l. In some embodiments, the dissociation constant is in the nanomolar range. In some embodiments, the dissociation constant is below <100 nmol/l.
In some embodiments, the reactivity function X is a moiety that can be stimulated to generate a reactive species (activation of the reactivity function) which will quickly form a covalent bond with a range of suitable partners of a target compound. Thus, the reactivity function X forms a covalent bond irreversibly linking the compound comprising the reactivity function X to those target compounds for which there was an affinity.
In some embodiments, the reactivity function X can be activated by exposure to light comprising a wavelength λmax in the range of 250 nm to 400 nm, in particular a wavelength λ of 300 nm to 360 nm. After the activation target compounds (proteins) in close proximity react with the highly reactive species formed by activation of X and form a covalent bond between the target compound T and the compound comprising the reactivity function X. Thus, the activation of the reactivity function X and the subsequent reaction with the target compound T yields a function X′, derived from said reaction, which covalently connects the compound comprising the reactivity function X and the target compound T. In other words, the target compound is now “captured” through a covalent crosslink.
One example for such light source is the “caprobox” (caprotec bioanalytics GmbH, Berlin). Another example is a common trans-illuminator for DNA agarose gel analysis, which generates light of ca. 360 nm and 254 nm. Further examples are the stratalinker (Stratagene/Agilent), which can generate light at different UV wavelengths or a lamp from AlienBees Self-Contained Studio Flash Lamp.
Non-limiting examples for activation reactions for said reactivity functions X are the generation of a carbene by photolysis of a diazo compound, for example a diazirine, or the generation of a nitrene from an azide, or the generation of a (di-)radical from a carbon-hetero atom double bond, any of which reactive intermediate species will quickly go on to react further with—by way of non-limiting example—an aromatic amino acid side chain, or an alcohol-, amino- or thiole function or amide bond of a sterically proximate amino acid of the polypeptide chain.
In some embodiments, X is selected from a diazirine-, an arylazide-, a di- or tetrahalogenarylazide or an arylketone group, wherein X is selected particularly from the group comprising aryltrifluoromethyldiazirine or tetrahalogenarylazide. Aryl azides may comprise one or two azides group on the aromatic ring.
Non-limiting examples for reactivity functions X are:
The covalent link of the reactivity function X to the target compound T can be realized through another than the indicated positions, for example through position 2 or 3 in the ring. The aromatic rings may be substituted by, for example, cyano groups, nitro groups or hydroxyl groups.
The reactivity function X, as discussed above, forms upon activation a covalent bond to any protein in its proximity. In absence of a selectivity function Y the capture compound would capture randomly a more or less representative selection of any proteins present in the sample. The selectivity function restricts the linkage of the capture compound to such proteins that specifically interact with the selectivity function Y based on affinity, as discussed below.
The selectivity function Y selects through a non-covalent interaction the types of target compounds that can interact based on affinity. Thereafter the reactivity function X is activated to freeze-in the equilibrium. The result is a functionally enriched sub-proteome based on the functional relationship between T and Y. The selectivity function Y, generally reduces the number of target compounds T covalently connected (“captured”) to an immobilized compound IC, as described above, so that the target compounds T can then be isolated and/or characterized, for example by mass spectrometry.
The selectivity function Y in principle may be any molecular moiety with highly selective affinity to target (off-target-) proteins in the micromolar, nanomolar or sub-nanomolar range. Selective interaction of Y with a target protein will restrict the covalent linkage of the reactivity function X to certain proteins. The term “target” protein in this context is somewhat arbitrary. When used in the context of pharmaceutical drug development, the term “target” protein is used to designate the protein that the drug is meant to interact with, or assumed to interact with, thus forming the desired target-drug interaction necessary for the therapeutic action. Proteins which interact with the drug molecule that are not involved in the mode of therapeutic action are defined as “non-target” or “off-target” protein, which however may turn out to be important for the leading to adverse side effects or protein targets for secondary medical indications. The selectivity function Y comprises a bioactive small molecule and is allowed to be extensively varied depending on the goal to be achieved.
In some embodiment, the selectivity function Y is a chiral group, which allows for stereoselective capture of target compounds T.
One important field of application for the instant invention is drug development, in particular the analysis of drug-protein interactions and the improvement of specificity and selectivity for the (desired) drug-target interaction, and likewise the modification of (in most cases, undesired) drug-non-target interactions. For embodiments addressing such applications. Y is a pharmaceutical drug, a drug development candidate, a drug fragment, a drug metabolite or a prodrug, wherein all of the before mentioned can be characterized as small molecules. Other important exemplary applications are herbicides, insecticides, pesticides, fungicides or other small organic bioactive molecules of relevance in any biomedical context. In other embodiments, Y is a pharmaceutically active natural product.
In some embodiments, the selectivity function Y is a bioactive small molecule, as discussed above, of a molecular mass of less than 1500 u. In one embodiment, the selectivity function Y is a bioactive small molecule, as discussed above, of a molecular mass of less than 1000 u. In one embodiment, the selectivity function Y is a bioactive small molecule, as discussed above, of a molecular mass of less than 500 u.
In some embodiment. Y is a molecular moiety obeying the “Lipinski Rule of Five”, i.e. Y has a molecular mass between 160 u and 500 u, comprises up to five hydrogen bond donators (e.g. oxygen and or nitrogen atoms with one H attached), up to ten hydrogen bond acceptors (e.g., oxygen or nitrogen atoms) and an octanol-water partition coefficient log P of below 5,6 (any of these characteristics applied to the isolated Y moiety, without regard to the remaining moieties of the respective capture compounds).
Binding of the selectivity function Y to the (on/off-) target protein(s) is effected by non-covalent interaction. The affinity of the bioactive molecule, in particular the bioactive small molecule, that corresponds to the selectivity function Y (measured with no further compound attached) is characterized by a dissociation constant in the micromolar, nanomolar or sub-nanomolar range.
The cleavable function F refers to a bond or moiety (also referred to as a “biocompatible cleaving site”) that is cleaved or cleavable under the specific conditions, such as chemically, enzymatically or by irradiation thereby releasing a portion (target site) from the compound IC of the formula 1 including the target compound T.
As used herein, a “cleavable function” F is a moiety that can be selectively cleaved without affecting or altering the composition of the other portions of the compound of interest or a cleavage of any other moiety of the immobilized compound IC of the formula 1. For example, a cleavable function F of the compounds provided herein is one that can be cleaved by chemical, enzymatic, photolytic, or other means without affecting or altering the composition (e. g. the chemical composition) of the covalently connected target molecule or a cleavage of any other moiety of the immobilized compound IC of the formula 1.
The conditions for a cleavage of the cleavable function F depends on the chosen cleaving function. The respective cleavable function—depending on the target compound and the remaining moieties of the immobilized compound IC—can be chosen from different groups.
In some embodiments, F is a cleavable function, which is cleavable by H+, OH−, thiols, salts from acids, fluorides, hydrazides, nucleophiles, electrophiles, radicals, alkylating agents, oxidation, reduction, enzymes or irradiation or a combination of the before mentioned.
In some embodiments, F is selected from the group comprising an azo-, an acyl-, a disulfide-, a silyoxy-, a carbamate-, an acetal-, a ketal, an arylether, a hydrazone-, an arylketone, or an o-nitrobenzyl group.
In some embodiments, a cleavable function F, is selected from
The abbreviations R and Ar on the left side of the reaction scheme represent the further structures of the compounds of the invention (e.g. —S0—X or S—Z—). The compounds on the right side of the reaction scheme are the chemical structures F′ resulting from the cleavage reaction. The conditions of the before mentioned cleavage reactions are known to a general expert in the field.
In some embodiments, a cleavable function F, is selected from
Scheme 7: Possible cleavable functions F; cleavable under reaction conditions a) buffered Na-dithionite solution; b) thiols, such as thioethanol or DTT or c) fluoride solutions.
The use of a cleavable function allows to selectively remove the captured proteins from the macroscopic carrier B (prior or after a digestion step) and separate the cleaved moiety containing the target compound from the macroscopic carrier B. The advantage is obvious, since proteins, unspecifically bound to the macroscopic carrier B by for example unspecific crosslinking to native proteins would not be removed and therefore isolated together with the specifically captured proteins based on the affinity of Y to the target proteins T. As a result of this cleavage the ratio of specific to unspecific proteins will be improved. Furthermore, the remaining part of the capture compound, to which the proteins are attached, comprises significantly reduced size, in comparison to known cleavable capture compounds after cleavage. Due to position of the cleavable function in proximity to the reactivity function X, not only the macroscopic carrier B but also the selectivity function Y and the remaining necessary linkers S together with the core Z are separated (with the exception of S0). This allows an easier and better characterization of the target compound, due to lesser fragment parts in case of mass spectroscopy. In particular, the determination of the respective crosslink site of the target compound is improved or even—in some cases—possible for the first time with such a cleavable site. The functions, which remain on the cross link site after cleavage (X′, F′ and/or So), may a marker, in particular an isotopic marker, a mass tag, fluorescent tag and/or a multiplex marker.
In some embodiments, B is a macroscopic carrier, which is capable of being separated from a medium by means of his physical properties. The macroscopic carrier B allows removing the immobilized compound IC of formula 1 from the biological sample (in particular a mixture of proteins) selectively, for example by virtue of its magnetic, electrostatic, inductive or optical or particulate properties, e.g. by filtration or centrifugation.
In general, the macroscopic carrier B can comprise any solid material. For example, silica gel, glass (e. g., controlled-pore glass), nylon. Wang resin, Merrifield resin, dextran cross-linked with epichlorohydrin (e. g., Sephadex), agarose (e. g., Sepharose), cellulose, magnetic beads, Dynabeads, a metal surface (e. g., steel, gold, silver, aluminum, silicon and copper), a plastic material (e. g., polyethylene, polypropylene, polyamide, polyester, polyvinylidenedifluoride (PVDF)).
The macroscopic carrier B is in any desired form, including, but not limited to, a bead, capillary, plate, membrane, wafer, comb, pin, a wafer with pits, an array of pits or nanoliter wells and other geometries and forms known to those of skill in the art. Thus, the macroscopic carrier B can be particulate or can be in the form of a continuous surface, such as a microtiter plate, a dish or well, a glass slide, a silicon chip, a nitrocellulose sheet, nylon mesh, or other such materials. When particulate, typically the particles have at least one dimension in the 100 μm range or smaller. Such particles, referred collectively herein as “beads”, are often, but not necessarily, spherical. Reference to “bead”, however, does not constrain the geometry, which can be any shape, including random shapes, needles, fibers, pin (including an array of pins suitable for combinatorial synthesis or analysis) and elongated. “Beads”, particularly microspheres that are sufficiently small to be used in the liquid phase, are also contemplated. The “beads” can include additional components, such as magnetic or paramagnetic particles (see, e. g., Dyna beads (Dynal, Oslo, Norway)) for separation using magnets, as long as the additional components do not interfere with the methods and analyses herein.
In some embodiments, B comprises a polymer material or a monolithic polymer material
In some embodiments, B comprises monolithic silica or polymer silica material. In some embodiments. B comprises CPG (Controlled Porous Glass).
In some embodiments, B comprises an in aqueous media non-swellable or low swellable material.
In some embodiment, B comprises magnetic properties.
In some embodiment, B comprises a magnetic particle(s).
In some embodiment, B comprises an anti-fouling material or anti-fouling coating.
In some embodiment, B comprises the form of non-porous beads, porous beads, flat supports, flat supports with pits, arrays, in particular micro arrays, glass, silicon tubes, chips, channels or hollow fibres, pipette tips or microtiter plates.
In some embodiment, B comprises the form of non-porous beads, porous beads, flat supports, flat supports with pits, arrays, in particular micro arrays, glass, silicon tubes, chips, channels or hollow fibres, pipette tips or microtiter plates and a material selected from polymer material, monolithic silica or polymer silica material, monolithic polymer material, CPG (Controlled Porous Glass), in particular an in aqueous media non-swellable or low swellable material. Additionally, B may comprise magnetic properties, in particular magnetic particle(s), and/or an anti-fouling material or anti-fouling coating.
In some embodiment, B comprises “beads” U. e. particles, typically in the range of less than 200 um or less than 50 pm in their largest dimension) including, but not limited to, polymeric, monolithic silica or polymeric silica material, magnetic, and other such beads. The beads can be made from hydrophobic materials, including polystyrene, polyethylene, polypropylene or teflon, or hydrophilic materials, including, but not limited to, cellulose, dextran cross-linked with epichlorohydrin, agarose, polyacrylamide, silica gel and controlled pore glass beads or particles. These types of capture compounds bound to B can be reacted in liquid phase in suspension, and then spun down or otherwise removed from the reaction medium.
The first linking function D is capable of forming a covalent bond selectively with a second linking function E under reaction conditions not leading to a covalent reaction of D or E with natural occurring polypeptides, in particular with proteins. It has to be noted that the abbreviation spacer moiety S′ of the immobilized compound IC of formula 1, formula 1a or 1a′, formula 1b or 1b′ or formula 6 comprises the spacer moieties S′″ of the capture compound CC formula 2, the protein capture compound PCC of formula 5 or 5′ or the protein fragment capture compound PFCC of formula 5a or 5a′ and spacer moieties Siv of the carrier compound CCB of formula 3 and additionally the structural element D′ and E′ derived from the reaction of the linking functions D and E. The spacer moiety S′ may also be described by the following formula 7
wherein S′″ is connected to Z and S′″ and Siv have the same meaning as defined previously and D′ and E′ represent structural elements (fragments) derived from the reaction of the linking functions D and E.
The reaction partners D and E that mediate coupling of the first and second compound of the invention can be any variants of the reactions commonly referred to as “click” chemistry.
The reaction most often used to date in biological chemistry is the 1,3-dipolar cycloaddition reaction (scheme 8, a) between an azide and an alkyne developed by Huisgen (Rostovtsev et al., Angew. Chem. 2002; van Berkel et al., ChemBioChem 2008, 9, 1805-1815. Another possibility is a Staudinger ligation (Saxon & Bertozzi, Science 2000, 287, 2007-2010; Köhn & Breinbauer, Angew. Chem. 2004, 116, 3168-3178; Scheme 1 b upper panel) between an azide and an arylcarboxylic acid methyl ester-5-ortho-phosphine or a Staudinger phosphite reaction (Serwa et al., Angew. Chem. 2009, 121 Scheme 1 b lower panel) between an azide and a phosphite ester (Scheme 8, b).
Alternatively a Diels-Alder reaction (Scheme 8, c) with the ene component as the linking function D (Song et al., Angew. Chem. 2008, 120, 2874-2877; J. Am. Chem. Soc. 2008, 130, 9654-9655; Devaraj et al., Bioconjugate Chem. 2008, 19, 2297-2299 or N. K. Devaraj, R. Weissleder, S. A. Hilderbrand, Bioconjugate Chem. 2008, 19, 2297-2299) may be applied.
A further alternative (Scheme 8, d) may be the use of a Diels-Alder reaction with the ene component as the linking function E (no catalyst: V. Marchán, S. Ortega, D. Pulido, E. Pedroso, Anna Grandas, Nucl. Acids Res. 2006, 34, e24 1-9; Titanium catalyst: K. E. Litz, Molecules 2007, 12, 1674-1678 or A. Serganov, S. Keiper, L. Malinina, V. Tereshko, E. Skripkin, C. Höbartner, A. Polonskaia, A. T. Phan, R. Wombacher, R. Micura, Z. Dauter, A. Jäschke, D. J. Patel, Nat. Struct. Mol. Biol. 2005, 12, 218-224).
Another alternative (see Scheme 8, e) may be the use of an ene reaction (K. L. Killops, L. M. Campos, C. J. Hawker, J. Am. Chem. Soc. 2008, 130, 5062-5064) or photo-thio-en reaction (S. Narayan, J. Muldoon, M. G. Finn, V. V. Fokin, H. C. Kolb, K. B. Sharpless, Angew. Chem. 2005, 117, 3339-3343), which can be carried out in the caproBox™ due to a photo-induced click process at 300-400 nm.
a) 1,3-dipolar cycloaddition
b) Staudinger ligation and Staudinger phosphite reactions
c) Diels-Alder reactions (D=alkene)
Diels-Alder reactions (D=diene)
e) Examples for further coupling reactions (ene reactions)
Scheme 8: Possible linking groups D and E; reaction conditions concerning a linking reaction of said linking groups D and E; a1: CuSO4, sodium ascorbate; a2: no catalyst/additive; b1: no catalyst/additive; b2: no catalyst/additive; c1: h·v (λ=302 nm); c2: no catalyst/additive; d1: no catalyst/additive or metal salt catalysis; d2: Diels-Alderase (ribozyme); e1: h·v (λ=350 nm); e2: no catalyst/additive.
According to some embodiments, D and E are each selected together from the group of reaction partners, which are capable of performing a reaction according to the click chemistry, wherein in particular
In other words, either D or E is an azide, and the other partner is a member of the group comprising alkyne, aryl carboxylic acid methylester-ortho-phosphine, phosphite ester or a chelatising ligand. Alternatively, either D or E is a tetrazine or tetrazine derivative and the other partner comprises a triple bond or a double bond, particularly a cyclic double bond, such as for example norbornene or cyclooctene or a derivative thereof.
In some embodiments, either D or E is selected from the following tetrazines
and the other partner comprises a triple bond or a double bond.
The reactivity function X may be connected to the cleavable function F by a spacer moiety S0. The selectivity function Y. the cleavable function F and the linking function D may be connected to the central core Z by a spacer moiety S″ (for Y), S (for F) and S′″ (for D). The linking function E may be connected to the macroscopic carrier B by a spacer moiety Siv. Furthermore, the macroscopic carrier B is connected to the central core Z by the spacer moiety S′, wherein the spacer moiety S′ comprises at least the structural elements D′ and E′-derived from the reaction of the linking functions D and E and can comprise the optional spacer moieties S′″ and Siv. Thus, S′ is predefined by the linking functions D and E and the optional spacer moieties S′″ and Siv.
The spacer moieties S0, S″, S′″ and Siv can be any group that provides a spacing (a distance between two functions). Thus, the spacer S0, S″, S′″ and Siv (and in case of formula 1 the spacer S′) can be any chemical moiety that is suited to link the functional moieties X, F, Y and D in formula 2. E and B in formula 3 and X, F, Y and B, for formula 1 with each other or the central core Z.
In some embodiments, each S0, S″, S′″ and Siv are an aliphatic chain or a heteroalkylchain, which comprises carbon and nitrogen, oxygen, sulphur, silicon and/or phosphorous atoms, wherein in particular said heteroalkyl chain comprises at least one oxygen atom, optionally the aliphatic chain or the heteroalkyl chain comprise one or more carbon double or triple bonds.
In some embodiments, at least one S0, S″, S′″ and Siv an aliphatic chain or a heteroalkylchain, which comprises carbon and nitrogen, oxygen, sulphur, silicon and/or phosphorous atoms, wherein in particular said heteroalkyl chain comprises at least one oxygen atom, optionally the aliphatic chain or the heteroalkylchain comprise one or more carbon double or triple bonds.
In some embodiments, each S0, S″, S′″ and Siv comprises chain length of 10 to 20 carbon atoms, in particular of 3 to 10 carbon atoms. The chain may comprise substituents and/or any carbon atom of the chain may be substituted by hetero atoms and contains the hydrogen atoms corresponding to the oxidation state of each non-hydrogen atom. One example for a spacer moiety is the methylenoxy moiety forming the side chain of serine, a CO—NH or NH—CO moiety (linkage by amide on the nitrogen or carboxy carbon), a CO—O or O—CO moiety. One particular example is a PEG-spacer moiety (polyethylene glycol polymer or oligomer). When designing compounds of formula 1, 2 or 3, ease of chemical synthesis, sterical freedom of functions X, F, Y, D, E and B, particularly the interaction of Y with the protein to be isolated, and the requirements regarding diffusion into cells or other biological structures will be taken into account.
Those of skill in the art in the light of the disclosure herein can readily select suitable spacers.
In some embodiments, the spacer moieties S0, S″, S′″ and Siv is selected from (CH2)r, (CH2O), (CH2CH2O)r, (NH(CH2)rC(═O))s, (O(CH)rC(═O))s, —((CH2)r1—C(O)NH—(CH2)r2)s— and —(C(O)NH—(CH2)r)s—, where r, r1, r2 and s are each independently and integer from 1 to 10.
Examples for spacer moieties S0, S″, S′″ and Siv are
with n=0 to n=20.
X and Y should be linked to Z (via F in case of X) by spacer moieties that are sufficiently long to enable both X and Y functions to interact with protein surfaces independently from each other; on the other hand spacer moieties should be minimized to allow for diffusion into cells or organelles, where such diffusion is expected.
In general, Z is the central atom or core bridging the moieties F (and thus X). Y and D in the capture compound CC of formula 2 (and thereby, B in the immobilized compound IC of formula 1 after reaction with the carrier compound CCB of formula 3).
In some embodiments, Z is carbon, nitrogen, phosphorus, silicon, sulfonium, cycloalkyl, unsaturated cycloalkyl, unsaturated hetero cycloalkyl or heteroaryl.
The functional moieties X, F, Y, D/E, and B are linked to each other by bridging moieties (spacers), as discussed previously. In its most simple manifestation. Z is a carbon atom, for example the central carbon atom of an alpha amino acid. Proteinogenic amino acids having a side chain such as lysine, glutamic or aspartic acid or cysteine or serine are suitable core molecules since they provide three functions that can be easily derivatized and coupled to moieties independently from each other. If, for instance, lysine is used, then the central alpha carbon atom can be viewed as the core Z, and the carboxy function, the amino function and the butylamine function can be viewed as the respective spacer moieties. If, for instance, glutamic acid is used, then the central alpha carbon atom can be viewed as the core Z, and the carboxy function, the amino function and the propionic acid function can be viewed as the respective spacer moieties.
Similarly, Z can be a tertiary amine bearing the three functions F (and thus X), Y and D.
In some embodiments, Z is a trisubstituted phosphorus atom, for example a phosphine
In some embodiments, Z is a silicium atom.
In some embodiments, the core Z comprises a ring structure such as a cycloalkyl, unsaturated cycloalkyl, unsaturated heterocycloalkyl or heteroaryl.
The reactivity function X can be attached directly to the cleavable function F or can be attached via a spacer moiety S0 to the cleavable function F.
In some embodiments, S0 is selected from a polyethylene glycol polymer. or oligomer.
In some embodiments, S0 comprises a chain length of up to 20 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds.
In some embodiments, S0 comprises a chain length of up to 10 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds.
In some embodiments, S0 comprises a chain length of up to 6 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds.
In some embodiments, the reactivity function X is attached directly to the cleavable function F in order to minimize the size of the moiety which remains attached after cleavage of F at the captured protein or peptide fragment to facilitate the subsequent analysis by mass spectrometry.
The selectivity function Y can be attached directly to the central core Z or can be attached via a spacer moiety S″ to the central core Z.
In some embodiments, S− is a polyethylene glycol polymer or oligomer.
In some embodiments, S− comprises a chain length of up to 20 carbon atoms optionally including heteroatoms such as O, N, S or P and one or more double or triple bonds
In some embodiments, S− comprises a chain length of up to 10 carbon atoms optionally including heteroatoms such as O, N, S or P and one or more double or triple bonds.
In some embodiments, S− comprises a chain length of up to 6 carbon atoms optionally including heteroatoms such as O, N, S or P and one or more double or triple bonds.
In some embodiments, the selectivity function Y is attached directly to the central core Z.
In some embodiments, the spacer moiety S″ is chosen such that the selectivity function can reach the binding pocket of a target or non-target protein.
The cleavable function F can be attached directly to the central core Z or can be attached via a spacer moiety S to the central core Z.
In some embodiments, S is a polyethylene glycol polymer or oligomer.
In some embodiments, S comprises a chain length of up to 20 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds
In some embodiments, S comprises a chain length of up to 10 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds.
In some embodiments, S comprises a chain length of up to 6 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds.
In some embodiments, the cleavable function F is attached directly to the central core Z.
The linking function D can be attached directly to the central core Z or can be attached via a spacer moiety S′″ to the central core Z.
In some embodiments, S′″ is a polyethylene glycol polymer or oligomer.
In some embodiments, S′″ comprises a chain length of up to 20 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds.
In some embodiments, S′″ comprises a chain length of up to 10 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds.
In some embodiments, the linking function D is attached directly to the central core Z.
The linking function E can be attached directly to macroscopic carrier B or can be attached via a spacer moiety Siv to macroscopic carrier B.
In some embodiments, Siv is a polyethylene glycol polymer or oligomer.
In some embodiments, Siv comprises a chain length of up to 20 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds.
In some embodiments, Siv comprises a chain length of up to 10 carbon atoms optionally including heteroatoms such as O, N, S, Si or P and one or more double or triple bonds.
In some embodiments, the cleavable function F is attached directly to the central core Z.
In some embodiments, S′ (derived from the reaction of —S′″-D and E-Siv—; see formula 7 and discussion above) comprises a chain length up to 40 atoms, wherein the nature of the chain is depending on the functions S′″, D′, E′ and Siv (as discussed previously).
In some embodiments, the functions X, X′, F, F′ and/or So comprise a marker, in particular an isotopic marker, a mass tag, fluorescent tag and/or a multiplex marker. It has to be noted, that a marker can be produced by a cleavage of the cleavable function F yielding the function F′, which can now act after the cleavage as a marker.
Any combination of the structural elements X, X′, Z, P, PF, BM, BMF, So, S, S′, S′″, Siv, Y, B, D, E and F—independently from each other—discussed in the previous section is possible for providing the compounds of the invention or for the process of the invention. A general expert in the field may choose suitable combinations. Reference is also made to the examples in the following sections. The structural elements or similar elements thereof (e.g. F, X, Y . . . ) of the compounds depicted or mentioned below may also be used in a combination—independently from each other—for providing compounds of the invention or for the process of the invention.
In some embodiments, the characterization step is achieved by applying mass spectrometry, in particular matrix assisted laser desorption ionization (MALDI), continuous or pulsed electrospray ionization (ESI), ionspray, thermospray, or massive cluster impact mass spectrometry for ionization, and linear time-of-flight (TOF), reflectron time-of-flight, single quadrupole, multiple quadrupole, single magnetic sector, multiple magnetic sector, Fourier transform ion cyclotron resonance (ICR), orbitrap or ion trap for mass analysis.
According to a second aspect of the invention provided herein is a carrier compound CCB, particularly a carrier compound CCB for carrying out a process according to the first aspect of the invention, of the general formula 3,
E-Siv—B formula 3
wherein SiV comprises a polyethylene glycol group,
Concerning specific embodiments of the functions E and B and the spacer moiety Siv reference is made to the previously discussed embodiments with respect to the first aspect of the invention.
Examples of a carrier compound CCB are:
LCAA refers to a long chain aminoalkyl, CNA refers to long chain alkyl amine and CPG refers to Controlled Porous Glass. Reference is made to the experimental section. R refers to a second linking function E, as discussed previously and as depicted in the examples.
According to a third aspect of the invention provided herein is a capture compound CC, particularly a capture compound CC for carrying out a process according to the first aspect of the invention, of the general formula 2,
wherein
Concerning specific embodiments of the functions X, F, Y, Z and D and the spacer moieties So, S, S″ and S′″ reference is made to the previously discussed embodiments with respect to the first aspect of the invention.
Examples of a capture compound CC are:
Reference is made to the experimental section.
According to a fourth aspect of the invention provided herein is a precursor compound PC, particularly a precursor compound PC for carrying out a process according to the first aspect of the invention, of the general formula 6,
wherein
Concerning specific embodiments of the functions X, F, Y, Z and B and the spacer moieties So, S, S′ and S″ reference is made to the previously discussed embodiments with respect to the first aspect of the invention.
According to a fifth aspect of the invention provided herein is a division compound DC, particularly a division compound DC for carrying out a process according to the first aspect of the invention, of the general formula 4,
T-X′—S0—F′ (formula 4),
in particular a division compound DCb of the general formula 4b or 4b′
BMF-X′—S0—F′ (formula 4b), more particularly
PF—X′—S0—F′ (formula 4b′)
wherein
Concerning specific embodiments of the functions X′, P, PF and F′ and the spacer moiety So reference is made to the previously discussed embodiments with respect to the first aspect of the invention.
According to a sixth aspect of the invention provided herein is a combination of reagents for carrying out a process according to the first aspect of the invention, in particular a combination of reagents for carrying out a process according to at least one of the claims 1 to 12, comprising a first compound according to claim 13 and a second compound according to claim 14.
Concerning specific embodiments of the combination of reagents and the use of the combination of reagents for a process according to the invention reference is made to the previously discussed embodiments with respect to the first, second and third aspect of the invention.
Detailed examples, results and conditions are provided in the example section. The methods, steps and conditions concerning the capturing of biomolecules, in particular proteins is known in the art and can be considered basic knowledge of the general expert in the field. Reference is made to the detailed description of the EP 1 485 707 B1, in particular page 43 to 61 and examples 1 to 9.
In a Pierce® Spin Column a solution of R—COOH (2.2 eq.), DIPEA (5 eq.) and HATU (2.0 eq.) in dry DMA (300 μL) was rotated in a VWR tube rotator for 10 min at 23° C. LCAA/CNA CPG (1.0 eq., Prime Synthesis, 57.0 μmol(NH2)/g, 3000 A, 120-200 mesh) were added and rotation were continued for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with DMA. MeOH and DCM. The LCAA/CNA CPG was incubated with a solution of 1.0 mL acetic anhydride and pyridine (4/1) and rotated for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with H2O, DMA, MeOH and DCM.
A sample of the LCAA/CNA CPG was incubated with a solution of 2-(6-(dimethylamino)-3-(dimethyliminio)-3H-xanthen-9-yl)-5-(prop-2-yn-1-ylcarbamoyl)benzoate [Probe 1] (1.0 eq.), Copper(II) sulfate (1.0 eq.) and Sodium ascorbate (2.5 eq.) in a solvent mixture of H2O and MeOH (150 μL, 1/1) and rotated for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with H2O and MeOH. The reaction solution and the washing solutions were combined and diluted with water. Loading of the LCAA/CNA CPG was determined by comparison of the UV absorption at 562 nm (anthos Reader 2010, Typ: 17550) of the solution to a reference.
In a Pierce® Spin Column a solution of 2-(2-(azidomethyl)-1H-benzo[d]imidazol-1-yl)acetic acid (2.90 mg, 12.5 μmol), DIPEA (5.0 μL, 28.5 μmol) and HATU (4.3 mg, 11.4 μmol) in dry DMA (300 μL) was rotated in a VWR tube rotator for 10 min at 23° C. 100 mg (5.7 μmol) LCAA/CNA CPG (Prime Synthesis, 57.0 μmol(NH2)/g, 3000 A. 120-200 mesh) were added and rotation were continued for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with DMA. MeOH and DCM. The LCAA/CNA CPG was incubated with a solution of 1.0 mL acetic anhydride and pyridine (4/1) and rotated for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with H2O, DMA, MeOH and DCM.
A sample of the LCAA/CNA CPG (3.0 mg) was incubated with a solution of 2-(6-(dimethylamino)-3-(dimethyliminio)-3H-xanthen-9-yl)-5-(prop-2-yn-1-ylcarbamoyl)benzoate [Probe 1] (0.08 mg, 0.17 μmol), Copper(II) sulfate (0.04 mg, 0.17 μmol) and Sodium ascorbate (0.09 mg, 0.43 μmol) in a solvent mixture of H2O and MeOH (150 μL, 1/1) and rotated for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with H2O and MeOH. The reaction solution and the washing solutions were combined and diluted with water. Loading of the LCAA/CNA CPG was determined to 33.0 μmol(N3)/g by comparison of the UV absorption at 562 nm (anthos Reader 2010, Typ: 17550) of the solution to a reference.
In a Pierce® Spin Column a solution of 1-azido-3,6,9,12-tetraoxapentadecan-15-oic acid (166.0 μL, 0.06 μmol), DIPEA (0.05 μL, 0.29 μmol) and HATU (0.02 mg, 0.06 μmol) in dry DMA (300 μL) was rotated in a VWR tube rotator for 10 min at 23° C. 10 mg (0.57 μmol) LCAA/CNA CPG (Prime Synthesis, 57.0 μmol(NH2)/g, 3000 A, 120-200 mesh) were added and rotation were continued for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with DMA. MeOH and DCM. The LCAA/CNA CPG was incubated with a solution of 1.0 mL acetic anhydride and pyridine (4/1) and rotated for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with H2O, DMA, MeOH and DCM.
A sample of the LCAA/CNA CPG (10.0 mg) was incubated with a solution of 2-(6-(dimethylamino)-3-(dimethyliminio)-3H-xanthen-9-yl)-5-(prop-2-yn-1-ylcarbamoyl)benzoate [Probe 1] (0.03 mg, 0.06 μmol), Copper(II) sulfate (0.01 mg, 0.05 μmol) and Sodium ascorbate (0.03 mg, 0.13 μmol) in a solvent mixture of H2O and MeOH (150 μL, 1/1) and rotated for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with H2O and MeOH. The reaction solution and the washing solutions were combined and diluted with water. Loading of the LCAA/CNA CPG was determined to 2.9 μmol(N3)/g by comparison of the UV absorption at 562 nm (anthos Reader 2010, Typ: 17550) of the solution to a reference.
In a Pierce® Spin Column 60 mg (3.42 μmol) LCAA/CNA CPG (Prime Synthesis, 57.0 μmol(NH2)/g, 3000 A. 120-200 mesh) were treated with a solution of (E)-2,5-dioxopyrrolidin-1-yl 1-(cyclooct-4-en-1-yloxy)-1-oxo-5,8,11,14-tetraoxa-2-azaheptadecan-17-oate (5.28 mg, 10.3 μmol) and DIPEA (5.97 μL, 34.2 μmol) in dry DMA (700 μL). The reaction suspension was rotated in a VWR tube rotator for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with DMA. MeOH and DCM. The LCAA/CNA CPG was incubated with a solution of 1.0 mL acetic anhydride and pyridine (4/1) and rotated for 1 h at 23° C. The reaction solution was flushed out and the LCAA/CNA CPG was successively washed with H2O, DMA, MeOH and DCM.
A sample of the LCAA/CNA CPG (0.1 mg) in a solvent mixture of H2O and MeOH (40 μL, 1/1) was incubated with a solution of 3′,6′-bis(dimethylamino)-N-(1-(4-(6-methyl-1,2,4,5-tetrazin-3-yl)phenyl)-1-oxo-6,9,12-trioxa-2-azapentadecan-15-yl)-3-oxo-3H-spiro[isobenzofuran-1,9′-xanthene]-5-carboxamide [Probe 2] (0.285 μL, 20 mM, 5.7 nmol) in DMSO and rotated for 1 h at 23° C. The reaction solution was diluted with water. Loading of the LCAA/CNA CPG was determined to 47 μmol(TCO)/g by comparison of the UV absorption at 562 nm (anthos Reader 2010, Typ: 17550) of the solution to a reference.
A Monotip NH2 (GLScience, 2.0 μmol(NH2)/tip) was incubated with a solution of (E)-2,5-dioxopyrrolidin-1-yl 1-(cyclooct-4-en-1-yloxy)-1-oxo-5,8,11,14-tetraoxa-2-azaheptadecan-17-oate (1.5 eq.) and DIPEA (10 eq.) in dry DMA (150 μL) applying Thermo Finnpipette (200 μL, Up 1, Down 1, Cycles 999). The reaction solution was flushed out. The Monotip NH2 was incubated a second cycle with a solution of (E)-2,5-dioxopyrrolidin-1-yl 1-(cyclooct-4-en-1-yloxy)-1-oxo-5,8,11,14-tetraoxa-2-azaheptadecan-17-oate (0.75 eq.) and DIPEA (5.0 eq.) in dry DMA (150 μL) applying Thermo Finnpipette (200 μL, Up 1, Down 1, Cycles 999). The reaction solution was flushed out and the Monotip NH2 was washed with DMA (Thermo Finnpipette—200 μL, Up 1, Down 1, Cycles 100) and DCM (Thermo Finnpipette—200 μL, Up 1, Down 1, Cycles 100). The Monotip NH2 was incubated with a solution of 2.5 mL acetic anhydride and pyridine (4/1) applying Thermo Finnpipette (200 μL, Up 1, Down 1, Cycles 999). The reaction solution was flushed out. The Monotip NH2 was washed successively with H2O. MeOH and DCM (Thermo Finnpipette—200 μL, Up 1, Down 1, Cycles 100).
The Monotip NH2 was incubated with a solution of 3′,6′-bis(dimethylamino)-N-(1-(4-(6-methyl-1,2,4,5-tetrazin-3-yl)phenyl)-1-oxo-6,9,12-trioxa-2-azapentadecan-15-yl)-3-oxo-3H-spiro[isobenzofuran-1,9′-xanthene]-5-carboxamide [Probe 2] (0.25 eq.) in a solvent mixture of H2O and MeOH (150 μL, 1/1) applying Thermo Finnpipette (200 μL, Up 1, Down 1, Cycles 999). A sample of 5 μL was taken out and diluted with 200 μL water. Loading of the Monotip NH2 was determined by comparison of the UV absorption at 562 nm (anthos Reader 2010, Typ: 17550) of the sample to a reference.
A Monotip NH2 (GLScience, 2.0 μmol(NH2)/tip) was incubated with a solution of (E)-2,5-dioxopyrrolidin-1-yl 1-(cyclooct-4-en-1-yloxy)-1-oxo-5,8,11,14-tetraoxa-2-azaheptadecan-17-oate (1.54 mg, 3.0 μmol) and DIPEA (3.49 μL, 20.0 μmol) in dry DMA (150 μL) applying Thermo Finnpipette (200 μL, Up 1, Down 1, Cycles 999). The reaction solution was flushed out. The Monotip NH2 was incubated a second cycle with a solution of (E)-2,5-dioxopyrrolidin-1-yl 1-(cyclooct-4-en-1-yloxy)-1-oxo-5,8,11,14-tetraoxa-2-azaheptadecan-17-oate (0.77 mg, 1.5 μmol) and DIPEA (1.75 μL, 10.0 μmol) in dry DMA (150 μL) applying Thermo Finnpipette (200 μL, Up 1, Down 1, Cycles 999). The reaction solution was flushed out and the Monotip NH2 was washed with DMA (Thermo Finnpipette—200 μL, Up 1, Down 1, Cycles 100) and DCM (Thermo Finnpipette—200 μL, Up 1, Down 1, Cycles 100). The Monotip NH2 was incubated with a solution of 2.5 mL acetic anhydride and pyridine (4/1) applying Thermo Finnpipette (200 μL, Up 1, Down 1, Cycles 999). The reaction solution was flushed out. The Monotip NH2 was washed successively with H2O. MeOH and DCM (Thermo Finnpipette—200 μL, Up 1, Down 1, Cycles 100).
The Monotip NH2 was incubated with a solution of 3′,6′-bis(dimethylamino)-N-(1-(4-(6-methyl-1,2,4,5-tetrazin-3-yl)phenyl)-1-oxo-6,9,12-trioxa-2-azapentadecan-15-yl)-3-oxo-3H-spiro[isobenzofuran-1,9′-xanthene]-5-carboxamide [Probe 2] (0.49 mg, 0.5 μmol) in a solvent mixture of H2O and MeOH (150 μL, 1/1) applying Thermo Finnpipette (200 μL, Up 1, Down 1, Cycles 999). A sample of 5 μL was taken out and diluted with 200 μL water. Loading of the Monotip NH2 was determined to 0.5 μmol(TCO)/tip by comparison of the UV absorption at 562 nm (anthos Reader 2010, Typ: 17550) of the sample to a reference.
A solution of 3′,6′-bis(dimethylamino)-3-oxo-spiro[isobenzofuran-1,9′-xanthene]-5-carboxylic acid [TAMRA] (40.0 mg, 0.1 mmol) and Diisopropylethylamin [DIPEA] (81.0 μl, 0.47 mmol) in dry DMA (1.0 mL) was treated with prop-2-yn-1-amine (6.6 μL, 0.1 mmol) and 1-[Bis(dimethylamino)methylene]-1H-1,2,3-triazolo[4,5-b]pyridinium 3-oxid hexafluorophosphate [HATU] (35.3 mg, 0.1 mmol). The reaction mixture was stirred for 30 min at 23° C. The solvent was removed under reduced pressure and the crude product was purified by column chromatography to yield probe 1 (25.4 mg, 0.05 mmol, 59%) as a red solid.
A solution of TAMRA (40.0 mg, 0.1 mmol) and tert-butyl (2-aminoethyl)carbamate (16.4 mg, 0.1 mmol) was treated with DIPEA (81.0 μl, 0.47 mmol) and HATU (42.4 mg, 0.11 mmol). The reaction solution was stirred for 30 min at 23° C. The solvent was removed under reduced pressure and the crude product was purified by MPLC to yield the intermediate (40.6 mg, 0.07 mmol, 76%) as a red solid.
The intermediate (2.8 mg, 4.9 μmol) was dissolved in dry dichloromethane [DCM] (0.5 mL) and treated with TFA (0.1 mL). The reaction solution was stirred for 30 min at 23° C. All volatiles were removed under reduced pressure and DCM (0.5 mL) was added. The solution was treated with DIPEA (8.5 μL, 0.05 mmol) and a solution of 2,5-dioxopyrrolidin-1-yl 1-(4-(6-methyl-1,2,4,5-tetrazin-3-yl)phenyl)-3-oxo-6,9,12,15,18-pentaoxa-2-azahenicosan-21-oate (3.0 mg, 4.9 μmol) in dry DCM (0.2 mL). The reaction solution was stirred for 1 h at 23° C. The solvent was removed under reduced pressure and the crude product was purified by MPLC to yield the probe 2 (2.3 mg, 2.4 μmol, 49%) as a red solid.
The intermediate of probe 2 (3.34 mg, 5.83 μmol) was dissolved in dry dichloromethane [DCM] (0.5 mL) and treated with TFA (0.1 mL). The reaction solution was stirred for 30 min at 23° C. All volatiles were removed under reduced pressure and DCM (0.5 mL) was added. The solution was treated with DIPEA (10.2 μL, 0.06 mmol) and a solution of (E)-2,5-dioxopyrrolidin-1-yl 1-(cyclooct-4-en-1-yloxy)-1-oxo-5,8,11,14-tetraoxa-2-azaheptadecan-17-oate (3.0 mg, 5.83 μmol) in dry DCM (0.2 mL). The reaction mixture was stirred for 1 h at 23° C. The solvent was removed under reduced pressure and the crude product was purified by MPLC to yield probe 3 (4.2 mg, 4.8 μmol, 83%) as a red solid.
4-azido-2,3,5,6-tetrafluorobenzaldehyde (B): 545 mg of 2,3,4,5,6-pentafluorobenzaldehyde (A) and 163 mg of NaN3 were given in one microwave flask (10-20 mL). 8 mL acetone and 2 mL water were added and reaction mixture was stirred at room temperature until regents were dissolved. Flask was closed and reaction mixture was stirred in microwave at 100° C. for 5 min.
30 mL water was added and reaction mixture was extracted three times with EtOAc (15 mL). The united organic extract was dried (MgSO4) and concentrated in vacuum to give 539 mg (89%) of the title compound as a yellow solid.
Diethyl-2-(4-azido-2,3,5,6-tetrafluorophenyl)-1,3-dioxane-5,5-dicarboxylate (C): 783 mg of pentafluorobenzaldehyde B and 1.34 g diethyl 2,2-bis(hydroxymethyl)malonate were dissolved in 40 mL toluene. 123 mg of pTsOH were added and the reaction mixture was refluxed at 80° C. for 18 h. After this time 500 μL of Et3N were added and the solvent was removed in vacuum. The residue was purified by column chromatography on silica gel (9:1 cyclohexane/EtOAc+0.1% Et3N) to afford 1.35 g (90%) of the title compound as a colorless oil.
2-(4-azido-2,3,5,6-tetrafluorophenyl)-5-(ethoxycarbonyl)-1,3-dioxane-5-carboxylic acid (D): 1.35 g of acetal C were dissolved in 20 mL THF. 40 mL of water and 169 mg of LiOH were added. Reaction mixture was stirred at room temperature for 2.5 h. After the reaction was completed (TLC-test), 850 μL of 0.5 M HCl were added dropwise until the reaction mixture turned milky (pH=3.0). Water phase was extracted with CH2Cl2 three times. The united organic extract was dried (MgSO4) and concentrated in vacuum to give 447 mg (36%) of the title compound as a white solid.
Ethyl-2-(4-azido-2,3,5,6-tetrafluorophenyl)-5-(prop-2-ynylcarbamoyI)-1,3-dioxane-5-carboxylate (E): To a solution of 153 mg of acetal D in 7 mL DCM, 75 μL of propargylamine were added. 224 mg EDC, 214 μL N-methyl morpholine and 10 mg HOAt were added successively. The reaction mixture was stirred at room temperature for 1 h. After completion (LC-MS test) the reaction mixture was poured into the ice water and extracted twice with CH2Cl2. The united organic extract was dried (MgSO4) and concentrated in vacuum. The residue was purified by column chromatography on silica gel (3:2 cyclohexane/EtOAc) to afford 129 mg (77%) of the title compound as a colorless oil.
2-(4-azido-2,3,5,6-tetrafluorophenyl)-5-(prop-2-ynylcarbamoyl)-1,3-dioxane-5-carboxylic acid (F): 129 mg of acetal E were dissolved in 1 mL THF. 2 mL of water and 50 mg of LiOH were added. Reaction mixture was stirred at room temperature for 18 h. After the reaction was completed (TLC-test), 0.7 mL of 0.5 M HCl was added to adduce pH between 3 and 4. Water phase was extracted with CH2Cl2 twice. The united organic extract was dried (MgSO4) and concentrated in vacuum to give 92 mg (76%) of the title compound as a white solid.
(S)-6-(2-(4-azido-2,3,5,6-tetrafluorophenyl)-5-(prop-2-ynylcarbamoyl)-1,3-dioxane-5-carboxamido)-1-(4-(8-methoxy-8-oxooctanamido)phenethylamino)-N,N,N-trimethyl-1-oxohexan-2-aminium (H): 37 mg (S)-6-amino-1-(4-(8-methoxy-8-oxooctanamido)phenethylamino)-N,N,N-trimethyl-1-oxohexan-2-aminium (G) were dissolved in 5 mL DMA and 37 mg of acetal F was added. After starting material was dissolved 33 mg HATU and 128 μL DIPEA were added to reaction mixture and stirred 3 h at room temperature. After reaction was completed (LC-MS test) DMA was removed in high vacuum and 1 g of crude mixture was obtained. Crude product was purified by MPLC to give 52 mg (78%) of the title compound as a white solid.
(S)-6-(2-(4-azido-2,3,5,6-tetrafluorophenyl)-5-(prop-2-ynylcarbamoyl)-1,3-dioxane-5-carboxamido)-1-(4-(7-carboxyheptanamido)phenethylamino)-N,N,N-trimethyl-1-oxohexan-2-aminium (I): 52 mg of methyl ester H was dissolved in 2 mL THF and 2 mL water and 4.5 mg LiOH were added. Reaction mixture was stirred for 2 h at room temperature. 2 M HCl was added to adduce the pH of the mixture between 6 and 7. THF was removed in vacuum and residue was lyophilized to give 47 mg (92%) of the title product as a white solid.
(S)-6-(2-(4-azido-2,3,5,6-tetrafluorophenyl)-5-(prop-2-ynylcarbamoyl)-1,3-dioxane-5-carboxamido)-1-(4-(8-(hydroxyamino)-8-oxooctanamido)phenethylamino)-N,N,N-trimethyl-1-oxohexan-2-aminium (1): 52 mg of acid I was dissolved in 2 mL THF and 2 mL DMA in vacuum heated flask containing molecular sieves. This solution was cooled to 0° C. and 11.2 μL ethyl formylchloride and 19 μL Et3N were added and the reaction mixture was stirred for 30 min. In separate vacuum heated flask containing molecular sieves 25 mg hydroxylamine hydrochloride and 20 mg KOH were dissolved in 1.5 mL MeOH. After stirring for 15 min, this mixture was filtrated using syringe filter and transferred into another vacuum heated flask containing molecular sieves. So generated NH2OH solution in MeOH was cooled down at 0° C. and the activated acid from the first flask was added to this solution slowly. Reaction mixture was stirred 1 h at 0° C. and then 18 h at rt. 2 M HCl was added to adduce pH between 6 and 7. THF was removed in vacuum and residue was purified by MPLC to give 8 mg (15%) of the title compound as a white solid.
Tert-butyl 2-(4-(3-(trifluoromethyl)-3H-diazirin-3-yl)benzoyl)hydrazinecarboxylate (K): 369 mg of the benzoic acid J were dissolved in 15 mL CH2Cl2. 209 μL of SOCl2 and 20 μL of DMF were added and the reaction mixture was refluxed for 3 h at 60° C. Solvents were than evaporated in vacuum and obtained crude carboxylic acid chloride was dissolved in 5 mL CH2Cl2. This solution was then slowly dropped to the solution of 191 mg BocNHNH2 and 667 μL Et3N in 10 mL CH2Cl2 at 0° C. Obtained reaction mixture was stirred at 23° C. over night.
After this time water was added to the reaction mixture and water phase was extracted with CH2Cl2 twice. The united organic extract was dried (MgSO4) and concentrated in vacuum. The residue was purified by column chromatography on silica gel (20:1 CH2Cl2/MeOH) to afford 462 mg (84%) of the title compound as a white solid.
(E)-methyl 4-(2-(4-(3-(trifluoromethyl)-3H-diazirin-3-yl)benzoyl)hydrazono)butanoate (L): 487 mg of Boc-hydrazin K and 437 μL of methyl dimethoxybutyrate were dissolved in 15 mL CH2Cl2. 8.5 mL of TFA were added and reaction mixture was stirred for 1 h. Solvent was removed in co-destination with toluene. Water was added and the water phase was extracted with EtOAc twice. The united organic extract was dried (MgSO4) and concentrated in vacuum to give 640 mg of yellow oil. The residue was purified by the column chromatography on silica gel (20:1 CH2Cl2/MeOH) to afford 198 mg of still not pure title compound. Additional purification by column chromatography on silica gel (1:1 to 2:3 cyclohexan/EtOAc) yielded 95 mg (20%) of the title compound as a colorless oil.
(E)-methyl 4-(2-methyl-2-(4-(3-(trifluoromethyl)-3H-diazirin-3-yl)benzoyl)hydrazono)butanoate (M): 95 mg of Hydrazone L were dissolved in 15 mL acetone. 173 μL of MeI and 192 mg of K2CO3 were added and reaction mixture was refluxed at 80° C. for 3 h. Acetone was removed in vacuum. Water was added and water phase was extracted with EtOAc twice. United organic extracts was dried (MgSO4) and concentrated in vacuum to give 106 mg of a yellow oil. The residue was purified by column chromatography on silica gel (1:1 cyclohexan/EtOAc) to afford 45 mg (46%) of the title compound as a white solid.
(E)-4-(2-methyl-2-(4-(3-(trifluoromethyl)-3H-diazirin-3-yl)benzoyl)hydrazono)butanoic acid (N): 44 mg of N-Methyl hydrazone M were dissolved in 2 mL THF and 2 mL water. 9 mg of LiOH were added. Reaction mixture was stirred for 1 h at room temperature. 74 μL of 5 M HCl, water and EtOAc were added. Phases were separated and organic phase was dried and concentrated to give 44 mg of the crude title compound which was used for the next reaction without further purification.
(E)-tert-butyl 2-(N-(4-(6-methyl-1,2,4,5-tetrazin-3-yl)benzyl)-4-(2-methyl-2-(4-(3-(trifluoromethyl)-3H-diazirin-3-yl)benzoyl)hydrazono)butanamido)acetate (P): 27 mg of the carboxylic acid N was dissolved in 3 mL DMA. 30 mg HATU and 60 μL DIPEA were added and the reaction mixture was stirred for 5 min. 23 mg of the tetrazine 0 (for the synthesis see Synthesis of capture compound CPT634) were dissolved in 2 mL DMA and added to the reaction mixture. Reaction mixture was stirred at 23° C. for 1 h. After this time the solvent (DMA) was removed in vacuum and the residue was purified by column chromatography on silica gel (20:1 CH2Cl2/MeOH) to afford 24 mg (52%) of the title compound as purple oil.
(2S)-tert-butyl 2-(tert-butoxycarbonylamino)-4-((2,2-dimethyl-6-(6-(4-(2-((E)-N-(4-(6-methyl-1,2,4,5-tetrazin-3-yl)benzyl)-4-(2-methyl-2-(4-(3-(trifluoromethyl)-3H-diazirin-3-yl)benzoyl)hydrazono)butanamido)acetamido)butylamino)-9H-purin-9-yl)tetrahydrofuro[3,4-d][1,3]dioxol-4-yl)methylthio)butanoate (R): Deprotection: 24 mg of tetrazine (P) were dissolved in 2 mL CH2Cl2 and 254 μL of TFA were added. Reaction mixture was stirred at 23° C. over night. After this time the solvent was removed in vacuum using co-distillation with toluene. Coupling: Crude product obtained in the deprotection reaction was dissolved in 3 mL DMA. 25 mg NO modified SAH Q. 45 μL DIPEA and 22 mg PyBOP were added and reaction mixture was stirred for 1.5 h. After this time the solvent (DMA) was removed in vacuum and the residue was purified by column chromatography on silica gel (20:1 CH2Cl2/MeOH) to afford 14 mg (83%) of the title compound as purple oil.
(2S)-2-amino-4-((3,4-dihydroxy-5-(6-(4-(2-((E)-N-(4-(6-methyl-1,2,4,5-tetrazin-3-yl)benzyl)-4-(2-methyl-2-(4-(3-(trifluoromethyl)-3H-diazirin-3-yl)benzoyl)hydrazono)butanamido)acetamido)butylamino)-9H-purin-9-yl)tetrahydrofuran-2-yl)methylthio)butanoic acid 11 (CPT727): 32.5 mg of the protected capture compound R were dissolved in 0.5 mL CH2Cl2 and 500 μL of TFA were added. The reaction mixture was stirred at 23° C. for 4 h. After this time the solvent was removed in vacuum using co-distillation with toluene. The residue was purified by MPLC to afford 5.1 mg (18%) of the title compound as purple solid.
A solution of tetrazine T (36.0 mg, 0.12 mmol) in dry dichloromethane [DCM] (0.5 mL) was treated with TFA (0.1 mL). The reaction solution was stirred for 20 min at 23° C. All volatiles were removed under reduced pressure and THF (300 μL), K2CO3 (38.0 mg, 0.275 mmol) and tert-butyl 2-bromoacetate S (15.9 μL, 0.11 mmol) were added. The reaction mixture was stirred for 17 h at 23° C. Ethyl acetate was added and the organic layer was washed with water. The organic layer was dried over Na2SO4 and all solvents were removed under reduced pressure. The crude product was purified by Isolera to yield compound 0 (24.2 mg, 0.08 μmol, 64%) as a red oil.
A suspension of 4-(3-(trifluoromethyl)-3H-diazirin-3-yl)benzoic acid (19.4 mg, 0.084 mmol) in dry DCM (400 μL) was treated with DIPEA (67.0 μL, 0.38 mmol) and HATU (32.1 mg, 0.084 mmol). The reaction mixture was stirred 15 min before a solution of amine 0 (24.2 mg, 0.08 mmol) in dry DCM (100 μL) was added. The reaction mixture was stirred for 17 h at 23° C. The solvent was removed under reduced pressure and the crude product was directly purified by MPLC to yield compound V (25.1 mg, 0.05 mmol, 62%) as a red solid.
A solution of ester V (25.1 mg, 0.05 μmol) in dry DCM (2.5 mL) was treated with TFA (0.7 mL) and stirred for 4 h at 23° C. All volatiles were removed under reduced pressure and the crude product was directly purified by MPLC to yield compound W (14.6 mg, 0.03 μmol, 65%) as a red solid.
A solution of acid W (14.6 mg, 0.03 mmol) in dry DMA (600 μL) was treated with DIPEA (27.0 μL, 0.16 mmol) and PyBOP (17.7 mg, 0.034 mmol) and stirred for 10 min at 23° C. Then, a solution of N6 modified SAH (20.2 mg, 0.03 mmol) in dry DMA (100 μL) was added and the reaction mixture was stirred for 10 min at 23° C. The solvent was removed under reduced pressure and the crude product was directly purified by MPLC to yield compound X (25.5 mg, 0.023 mmol, 75%) as a red solid.
At −17° C., a solution of compound X (25.5 mg, 0.023 mmol) in dry DCM (23.0 mL) was treated with a solution of hydrochloric acid (0.57 mL, 4.0 M. 2.3 mmol), water (10.4 μl, 0.58 mmol) and Thioanisole (0.07 mL, 0.58 mmol) and stirred for 17 h at 23° C. The reaction mixture was quenched by addition of triethyl amine (0.08 mL, 0.58 mmol) and the solvent was removed by nitrogen stream. The crude product was purified by MPLC to yield compound 16 (5.1 mg, 4.94 μmol, 21%) as a red solid.
In a 2 mL Eppendorf reaction tube, 450 μL 10 g/I azide functionalized CPG beads (CCB 2, CPG-pN3, see structures below) in 20% EtOH were incubated with 275 μL 1.2 mM 1 (CPT600, see structures below) in DMSO and 33 μL 5 mM CuSO4 under rotation at r.t. (room temperature) for 25 min. After brief centrifugation (tabletop centrifuge), the supernatant was transferred into another 2 mL tube, 33 μL 50 mM sodium ascorbate was added, mixed and a sample (8.41 μl) was taken (pre-click.
As a control, the same procedure was performed side-by-side using acetyl functionalized CPG beads (CPG-Ac) instead of CPG-pN3. Again, samples were drawn before (pre-click,
The four 8.41 μL samples were diluted with 330 μl 20% MeCN, respectively, and 10 μl, respectively, were separated via RP-HPLC on a YMC-Pack Octyl C8, 50×2.1 mm, 3 μm column and analyzed with UV and MS Instrumenmtation consisting of a Dionex Ultimate 3000 LC comprising a HPG-3200RS pump, WPS-3000TRS autosampler, TCC-3000RS column thermostat, and DAD-3000 UV detector coupled to a Bruker amaZon speed ion trap mass spectrometer governed by the Compass 1.5 software.
Using CPG-pN3, the amount of 1 (CPT600) decreases by about 10-15% (
In a 100 μL PCR tube, 20 μL 73 μM 1 (CPT600) was irradiated at 4° C. for 5 min in the caproBox (310 nm version, caprotec bioanalytics GmbH, Berlin, Germany). 1.7 μL 37% HCl was added, mixed and the solution was incubated on a thermoshaker at 37° C. for 2 h. Then 100 μL Milli-Q water and 9 μL 2N NaOH were added and mixed. The resulting solution as analyzed by RP-HPLC coupled to a UV detector and a mass spectrometer (
UV irradiation of 1 (CPT600) yields the reaction product of the resulting carbene (cleavage of N2) with water (3) (
Prolonged acidic treatment of 1 (CPT600) after UV irradiation yields various products, all explainable by the hydrolysis of the acetal (5 and 7) and the hydroxamic acid (2, 4, and 7) (
20 μL 10 mg/mL 1 (CPT600)-loaded CPG-pN3 from example 1 were transferred into a 100 μL PCR tube, centrifuged, the supernatant was discarded and 20 μL in water was added. The suspension was irradiated at 4° C. for 5 min (with re-suspending every 30 s) in the caproBox (310 nm version, caprotec bioanalytics GmbH, Berlin, Germany). 1.7 μL 37% HCl was added, mixed and the solution was incubated on a thermoshaker at 37° C. for 2 h. Then 100 μL Milli-Q water and 9 μL 2N NaOH were added and mixed. After centrifugation, the resulting supernatant was analyzed by RP-HPLC coupled to a UV detector and a mass spectrometer as described in example 1. As a control, the same sample but with only very brief incubation under acidic conditions (r.t. for about 10 s instead of 37° C. for 2 h) was prepared.
Further controls included using 1 (CPT600)-treated CPG-Ac from example 1 (short and long acid treatment) instead of 1 (CPT600)-loaded CPG-pN3 and 1 (CPT600) in solution (short and long acid treatment) as described in example 2. All six experiments were performed side-by-side. The results are summarized in
The desired cleavage fragment 6 (see example 4 for verification) is clearly detected in the UV chromatogram of the UV-irradiated 1 (CPT600)-loaded CPG-pN3 sample after 2 h acidic treatment (48.9% of the amount from the respective sample in solution as rated from the respective peak areas in the UV chromatograms). The respective sample employing CPG-Ac instead of CPG-pN3 only showed a minor amount of 6 (8.0% of the amount from the respective CPG-pN3 sample and 3.9% of the amount from the respective sample in solution as rated from the respective peak areas in the UV chromatograms).
Whereas after brief acidic incubation, no 1 (CPT600) derivative could be identified in the supernatant of the samples employing CPG beads, minor amounts of 5 and 7 (3.6% and 16.1%, respectively, compared to solution sample under the same conditions as rated from the respective peak areas in the base peak MS chromatograms) were identified after 2 h acidic treatment. Since 5 and 7 are present to about the some amount in the CPG-pN3 and CPG-Ac samples, these species probably arise from adsorption (non covalent binding) of a small proportion of 3 to the solid support (CPG surface).
The final solution of example 2 (final 130 μL 11 μM 1 (CPT600) after photo-reaction with water, acid cleavage and quenching and dilution with NaOH) was analyzed by RP-HPLC as described in example 1 before and after addition of a 100-fold molar excess of hydrazine 8 (1.1 μL 125 mM 8 as its HCl salt in MeOH) (
Photo-reacted and cleaved 1 (CPT600) shows an UV peak at retention time 2.72 min, which is largely diminished after addition of hydrazine 8 and a new signal is detected at retention time 3.57 min (
A solution of 100 μL 2.5 μM 11 (CPT727, Capture Compound bearing tetrazine as sorting function and a chemically cleavable photo-reactivity function) in phosphate buffered saline (PBS) was irradiated with UV light (λmax=310 nm) for 5 min at 4° C. in the caproBox (caprotec bioanalytics, Berlin. Germany) to activate the reactivity function yielding 12 (Scheme 12).
The resulting solution was incubated with 2 mg 13 (CPT601, CCB 4, trans-cyclooctene (TCO) modified controlled pore glass (CPG), CPG-TCO beads) over night at 4° C. to covalently bind compound 12 to 13 by the TCO-tetrazine click reaction yielding 14 (Scheme 12). During this time, 13 was kept in suspension by rotation.
Unbound 12 was removed by separating the supernating solution from 14 after centrifugation in a table top centrifuge and washing the solid 14 by re-suspending in fresh solution and removal of the supernatant after centrifugation (three times with 200 μl SL Buffer (50 mM Tris/HCl, 320 mM beta-mercaptoethanol, 2.5% sodium dodecylsulfate, 0.05% bromophenol blue, 10% glycerol, pH 6.8), three times with 200 μl 50% acetonitrile).
The washed 14 was divided into two aliquots. One aliquot was incubated with 10 mM 2-nitrobenzhydrazide and 10 mM HCl in 10 μl 50% acetonitrile (final concentrations and volume) at room temperature for 1 h while keeping 14 in suspension by rotation yielding the cleaved off photo-reacted reactivity function 15 (Scheme 12). As a control, the second aliquot of 14 was treated the same way side-by-side but without 2-nitrobenzhydrazide and HCl. The solutions supernating (cleaved) 14 were analyzed by LC-MS.
The same procedure was performed side-by-side using 16 (CPT634, Capture Compound bearing tetrazine as sorting function and non-cleavable photo-reactivity function) instead of 11.
The cleaved off photo-reacted reactivity function 15 was only detected when using 11 and not when using 16. When using 11, 15 was detected with a more than 50-fold higher intensity when 2-nitrobenzhydrazide and HCl were employed than when not (
A solution of 75 μL 56 μM 11 (CPT727, Capture Compound bearing tetrazine as sorting function and a chemically cleavable photo-reactivity function) in 50% acetonitrile was irradiated with UV light (λmax=310 nm) for 5 min at 4° C. in the caproBox to activate the reactivity function yielding 12 (Scheme 12).
Cleavage of the photo-reacted reactivity function (15) from 12 yielding also 17 as a product (Scheme 13) was investigated under several conditions summarized in Table 5. The reaction was performed in a final volume of 10 μl for 1 h at room temperature. Each reaction mixture contained 8 μl of the above described solution of 12 (45 μM final concentration), 1 μl 100 mM or 10 mM 2-nitrobenzhydrazide in DMSO or just DMSO and 1 μl 100 mM or 10 mM HCl in water or just water. Analysis was performed by LC-MS. Integration of peak areas corresponding to the respective species is included into Table 5.
Thus, 1 mM or higher concentration of HCl alone is sufficient for cleaving more than 60% 12 at room temperature within 1 h. At 10 mM HCl and 1 mM or 10 mM 2-nitrobenzhydrazide, the starting material 12 was no longer detected. The highest yield of 15 and 17, respectively, was achieved at 10 mM HCl and 10 mM 2-nitrobenzhydrazide.
Compound 11 (final concentration 2.5 μM) was incubated for 5 min at 4° C. with a lysate (final total protein concentration 5 mg/ml) prepared from the DH5a derivative of the E. coli K-12 strain in phosphate buffered saline (final volume 100 μl). In a so called competition control (designated “C” as opposed to “A” for CCMS assay without the competitor SAH) carried out side-by-side, S-adenosyl-L-homocysteine (SAH) (final concentration 250 μM) was added to the lysate before compound 11 to compete for the binding of the SAH selectivity function of 11 to SAH binding proteins such as S-adenosyl-L-methionine (SAM) dependent methyltransferases.
The samples were irradiated with UV light (λmax=310 nm) for 5 min in the caproBox (caprotec bioanalytics, Berlin, Germany) at about 4° C. to activate the reactivity function of 11 and form the covalent cross-link to proteins leading to 11-protein conjugates. Due to the SAH selectivity function in 11. SAH binding proteins will be selectively cross-linked to 11 and this selective crosslink will be diminished or inhibited in the competition control “C”.
The samples “A” and “C” were incubated with 250 μg 13 (CPT601, CCB 4, trans-cyclooctene (TCO) modified controlled pore glass (CPG). CPG-TCO beads), respectively, over night at 4° C. with rotation to keep the CPG beads (13) in suspension. During this time the 11-protein conjugates covalently bound to 13 by the TCO-tetrazine click reaction to form 13-11-protein conjugates. Selective binding of 11-protein conjugates to 13 was shown by taking 5 μl samples of “A” and “C”, respectively, before and after incubation with 13 and incubating these samples with 18 (final concentration 10 μM, CPT533. Probe 3, TCO-TAMRA) for 15 min at room temperature. Formed by the TCO-tetrazine click reaction, the fluorescent 18-11-protein conjugates were detected as in-gel fluorescent bands after SDS-PAGE. In the samples taken after incubation with 13, less in-gel fluorescent bands (18-11-protein conjugates) were detected than in the samples taken before incubation with 13, showing that 11-protein conjugates had reacted with 13 to form 13-11-protein conjugates. Since the Coomassie stain of the proteins in the same gel after fluorescence detection showed no difference between the samples before and after incubation with 13, general protein precipitation on 13 as the cause for diminished fluorescence signals can be excluded. Moreover, CPG beads not having TCO attached (CPG-Ac, NHAc instead of NHCO-PEG-TCO in 13, see example 1), did not bind 11-protein conjugates shown by the same intensity of in-gel fluorescent bands (18-11-protein conjugates) before and after incubation with CPG-Ac instead of 13.
The supernating solution was separated from the 13-11-protein conjugates after centrifugation in a table top centrifuge and the CPG beads containing the 13-11-protein conjugates were washed by re-suspending in fresh solution and removal of the supernatant after centrifugation (three times with 200 μl 4×SL Buffer, three times with 200 μl 50% acetonitrile).
The washed CPG beads containing the 13-11-protein conjugates were divided into two aliquots, respectively, for “A” and “C”. One aliquot was incubated with 10 mM 2-nitrobenzhydrazide and 10 mM HCl in 10 μl 50% acetonitrile (final concentrations and volume) at room temperature for 1 h while keeping the CPG beads containing the 13-11-protein conjugates in suspension by rotation yielding proteins with attached cleaved off photo-reacted reactivity function. As a control, the second aliquot of the CPG beads containing the 13-11-protein conjugates was treated the same way side-by-side but without 2-nitrobenzhydrazide and HCl. The proteins in the solutions supernating (cleaved) the CPG beads were analyzed by SDS-PAGE (4-20% Tris-Glycine gradient gels, 15 well, 1.0 mm; Novex® life Technologes® Ref. EC60255BOX, 25 min run time at 300 V) followed by silver staining (ProteoSilver Silver Stain Kit, Sigma). The results are summarized in
Gel slices were cut out around 25 kDa in each gel lane corresponding to the molecular weight (height) of the specific gel lane marked by the red arrow in
The same procedure was performed side-by-side using 16 (CPT634, Capture Compound bearing tetrazine as sorting function and a non-cleavable photo-reactivity function) or 19 (CPT594, Capture Compound bearing tetrazine as sorting function linked to the core by PEG and a non-cleavable photo-reactivity function) instead of 11.
A gel band is detected after SDS-PAGE/silver stain corresponding to a SAH binding protein (present in “A” with compound 11, absent in the corresponding SAH competition control “C”) that is only present under cleavage conditions (10 mM 2-nitrobenzhydrazide and 10 mM HCl in 50% acetonitrile) using the cleavable compound 11 (
Table 6 A to 6C: Normalized protein intensities of SAH binding proteins from cut-out gel slices from the gel shown in
Number | Date | Country | Kind |
---|---|---|---|
14152956.0 | Jan 2014 | EP | regional |
14153142.6 | Jan 2014 | EP | regional |
14157389.9 | Feb 2014 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2015/051739 | 1/28/2015 | WO | 00 |