COMPOSITIONS OF MODIFIED TREMS AND USES THEREOF

BACKGROUND

tRNAs are complex RNA molecules that possess a number of functions including the ability to initiate and elongate proteins.

SUMMARY

The present disclosure features, inter alia, a tRNA-based effector molecule (TREM) entity comprising an asialoglycoprotein receptor (ASGPR) binding moiety, as well as compositions and methods of use thereof. The ASGPR binding moiety may be conjugated to a nucleobase within the TREM entity, or within an internucleotide linkage of the TREM entity, or at a terminus (e.g., the 5′ or 3′ terminus) of the TREM entity. In an embodiment, the TREM entity comprises a TREM, a TREM Core Fragment, or a TREM Fragment. In an embodiment, the nucleobase comprises adenine, thymine, cytosine, guanosine, or uracil, or a variant or modified form thereof.

In one aspect, the TREM entity (e.g., TREM) described herein comprises the sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2] (A), wherein, independently, the TREM comprises an ASGPR binding moiety. In an embodiment, the ASGPR binding moiety comprises an ASGPR carbohydrate and an ASGPR linker. In an embodiment, the ASGPR binding moiety comprises a galactose (Gal) and/or N-acetylgalactosamine (GalNAc) moiety. In an embodiment, the ASGPR binding moiety comprises a plurality of Gal and/or GalNAc moieties (e.g., 2, 3, 4, 5, 6, 7, 8, or more Gal and/or GalNAc moieties). In an embodiment, the ASGPR binding moiety comprises a triantennary GalNAc moiety. In an embodiment, the TREM further comprises a chemical modification (e.g., a phosphothiorate internucleotide linkage, or a 2′-modification on a ribose moiety within the TREM).

In an embodiment, the ASGPR binding moiety is present on a nucleobase within a nucleotide in the TREM. In an embodiment, the ASGPR binding moiety is present on the 5′ terminus of the TREM. In an embodiment, the ASGPR binding moiety is present on the 3′ terminus of the TREM.

In an embodiment, the ASGPR binding moiety is present in a TREM domain selected from L1, ASt Domain1, L2, DH Domain, L3, ACH Domain, VL Domain, TH Domain, L4, and ASt Domain2. In an embodiment, the ASGPR binding moiety is present in the L1 region. In an embodiment, the ASGPR binding moiety is present in the AST Domain1. In an embodiment, the ASGPR binding moiety is present in the L2 region. In an embodiment, the ASGPR binding moiety is present in the DH Domain. In an embodiment, the ASGPR binding moiety is present in the L3 region. In an embodiment, the ASGPR binding moiety is present in the ACH Domain.

In an embodiment, the ASGPR binding moiety is present in the VL Domain. In an embodiment, the ASGPR binding moiety is present in the TH Domain. In an embodiment, the ASGPR binding moiety is present in the L4 region. In an embodiment, the ASGPR binding moiety is present in the AST Domain2.

In an embodiment, the TREM comprising an ASGPR binding moiety retains the ability to support protein synthesis, be charged by a synthetase, be bound by an elongation factor, introduce an amino acid into a peptide chain, support elongation, and/or support initiation. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least X contiguous nucleotides without a chemical modification, wherein X is greater than 10. In an embodiment, the TREM comprising an ASGPR binding moiety comprises no more than 5, 10, or 15 nucleotides of a type (e.g., A, T, C, G or U) that do not comprise chemical modification. In an embodiment, the TREM comprising an ASGPR binding moiety comprises no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, or 80 nucleotides of a type (e.g., A, T, C, G or U) that do not comprise a chemical modification. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least X contiguous nucleotides comprising a chemical modification, wherein X is greater than 10. In an embodiment, the TREM comprising an ASGPR binding moiety comprises more than 5, 10, or 15 nucleotides of a type (e.g., A, T, C, G or U) that comprise a chemical modification. In an embodiment, the TREM comprising an ASGPR binding moiety comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, or 80 nucleotides of a type (e.g., A, T, C, G or U) that comprise a chemical modification. In an embodiment, the chemical modification is a naturally occurring chemical modification or a non-naturally occurring chemical modification (e.g., a phosphothiorate internucleotide linkage or a 2′-modification on a ribose moiety within the TREM). In an embodiment, the chemical modification comprises a fluorophore.

In another aspect, a TREM comprising an ASGPR binding moiety, or a composition thereof, described herein may be used to modulate a production parameter (e.g., an expression parameter and/or a signaling parameter) of an RNA corresponding to, or a polypeptide encoded by, a nucleic acid sequence comprising an endogenous open reading frame (ORF) having a premature termination codon (PTC).

In another aspect, a TREM comprising an ASGPR binding moiety, or a composition thereof, described herein may be used in a method of modulating a production parameter of an mRNA corresponding to, or polypeptide encoded by, an endogenous open reading frame (ORF) in a subject, which ORF comprises a premature termination codon (PTC), contacting the subject with a TREM comprising an ASGPR binding moiety or a composition thereof in an amount and/or for a time sufficient to modulate the production parameter of the mRNA or polypeptide, wherein the TREM comprising an ASGPR binding moiety has an anticodon that pairs with the codon having the first sequence, thereby modulating the production parameter in the subject. In an embodiment, the production parameter comprises a signaling parameter and/or an expression parameter, e.g., as described herein.

In another aspect, a TREM comprising an ASGPR binding moiety, or a composition thereof, described herein may be used in a method of treating a subject having an endogenous open reading frame (ORF) which comprises a premature termination codon (PTC), comprising providing a TREM comprising an ASGPR binding moiety, or a composition thereof, wherein the TREM comprising an ASGPR binding moiety comprises an anticodon that pairs with the PTC in the ORF; contacting the subject with the TREM comprising an ASGPR binding moiety or a composition thereof in an amount and/or for a time sufficient to treat the subject, thereby treating the subject. In an embodiment, the PTC comprises UAA, UGA or UAG.

In another aspect, a TREM comprising an ASGPR binding moiety, or a composition thereof, described herein may be used in a method of treating a subject having an disease or disorder associated with a premature termination codon (PTC), comprising providing a TREM comprising an ASGPR binding moiety or a composition described herein; contacting the subject with the TREM comprising an ASGPR binding moiety or a composition thereof in an amount and/or for a time sufficient to treat the subject, thereby treating the subject. In an embodiment, the PTC comprises UAA, UGA or UAG. In an embodiment, the disease or disorder associated with a PTC is a disease or disorder described herein, e.g., a cancer or a monogenic disease.

Additional features of any of the aforesaid TREM entities (e.g., TREMs, TREM core fragments, TREM Fragments, TREM compositions, preparations, methods of making TREM compositions and preparations, and methods of using TREM compositions and preparations include one or more of the following enumerated embodiments).

Those skilled in the art will recognize or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encompassed by the following enumerated embodiments.

BRIEF DESCRIPTIONS OF THE DRAWINGS

FIGS. 1A-1J are images that depict ASGPR-expressing U2OS cells transfected with exemplary TREMs comprising an ASGPR binding moiety described herein. In this experiment, uptake of the TREMs comprising a SEQ ID NO. 650 backbone with ASGPR binding moieties at various positions along the sequence and conjugated to Cy3 was monitored and visualized by fluorescent microscopy.

FIG. 2 is a graphical representation of the fluorescent microscopy results of FIGS. 1A-1J. The results are depicted as the average intensity over the concentration of oligo (nM) given to the cells.

FIGS. 3A-3H are images that depict ASGPR-expressing U2OS cells transfected with exemplary TREMs comprising an ASGPR binding moiety described herein. In this experiment, uptake of the TREMs comprising a SEQ ID NO. 650 backbone with ASGPR binding moieties at various positions along the sequence and conjugated to Cy3 was monitored and visualized by fluorescent microscopy.

FIG. 4 is a graphical representation of the fluorescent microscopy results of FIGS. 3A-3H. The results are depicted as the average intensity over the concentration of oligo (nM) given to the cells.

FIGS. 5A-5J are images that depict ASGPR-expressing U2OS cells transfected with exemplary TREMs comprising an ASGPR binding moiety described herein. In this experiment, uptake of the TREMs comprising a SEQ ID NO. 622 backbone with ASGPR binding moieties at various positions along the sequence and conjugated to Cy3 was monitored and visualized by fluorescent microscopy.

FIG. 6 is a graphical representation of the fluorescent microscopy results of FIGS. 5A-5J. The results are depicted as the average intensity over the concentration of oligo (nM) given to the cells.

FIGS. 7A-7J are images depicting uptake of exemplary TREMs comprising an ASGPR binding moiety as described herein by primary human hepatocytes. In this experiment, uptake of the TREMs comprising a SEQ ID NO. 650 backbone with ASGPR binding moieties at various positions along the sequence and conjugated to Cy3 was monitored and visualized by fluorescent microscopy. FIG. 8 is a graphical representation of the fluorescent microscopy results of FIGS. 7A-7J. The results are depicted as the average intensity over the concentration of oligo (nM) given to the cells.

FIGS. 9A-9H are images depicting uptake of exemplary TREMs comprising an ASGPR binding moiety as described herein by primary human hepatocytes. In this experiment, uptake of the TREMs comprising a SEQ ID NO. 653 backbone with ASGPR binding moieties at various positions along the sequence and conjugated to Cy3 was monitored and visualized by fluorescent microscopy.

FIG. 10 is a graphical representation of the fluorescent microscopy results of FIGS. 9A-9H. The results are depicted as the average intensity over the concentration of oligo (nM) given to the cells.

FIGS. 11A-11J are images depicting uptake of exemplary TREMs comprising an ASGPR binding moiety as described herein by primary human hepatocytes. In this experiment, uptake of the TREMs comprising a SEQ ID NO. 622 backbone with ASGPR binding moieties at various positions along the sequence and conjugated to Cy3 was monitored and visualized by fluorescent microscopy.

FIG. 12 is a graphical representation of the fluorescent microscopy results of FIGS. 11A-11J. The results are depicted as the average intensity over the concentration of oligo (nM) given to the cells.

FIG. 13 is a graph depicting the results of exemplary TREM uptake by ASGPR-expressing U2OS cells transfected with a nLUC-premature terminating codon (PTC) reporter. The exemplary TREMs comprising a SEQ ID NO. 650 backbone comprising an ASGPR-binding moiety at a position along the sequence were transfected using RNAiMAX transfection reagent. The results are shown as fold-change over the mock (no TREM) sample.

FIG. 14 is a graph depicting the results of exemplary TREM uptake by ASGPR-expressing U2OS cells transfected with a nLUC-premature terminating codon (PTC) reporter. The exemplary TREMs comprising a SEQ ID NO. 653 backbone comprising a ASGPR binding moiety at a position along the sequence were transfected using RNAiMAX transfection reagent. The results are shown as fold-change over the mock (no TREM) sample.

FIG. 15 is a graph depicting the results of exemplary TREM uptake by ASGPR-expressing U2OS cells transfected with a nLUC-premature terminating codon (PTC) reporter. The exemplary TREMs comprising a SEQ ID NO. 622 backbone comprising a ASGPR binding moiety at a position along the sequence were transfected using RNAiMAX transfection reagent. The results are shown as fold-change over the mock (no TREM) sample.

DETAILED DESCRIPTION OF CERTAIN EMBODIMENTS

The present disclosure features tRNA-based effector molecule (TREM) entities (e.g., TREMs, TREM Core Fragments, and TREM Fragments) comprising an asialoglycoprotein receptor (ASGPR) binding moiety, as well as compositions and related methods of use thereof. As disclosed herein, TREM entities (e.g., TREMs) are complex molecules which can mediate a variety of cellular processes. Pharmaceutical TREM compositions, e.g., TREMs comprising an ASGPR binding moiety, can be administered to a cell, a tissue, or to a subject to modulate these functions.

Definitions

An “acceptor stem domain (AStD),” as that term is used herein, refers to a domain that binds an amino acid. In an embodiment, an AStD comprises an ASt Domain1 and an ASt Domain 2. For example, the ASt Domain 1 is at or near the 5′ end of the TREM and the ASt Domain 2 is at or near the 3′ end of the TREM. An AStD comprises sufficient RNA sequence to mediate, e.g., when present in an otherwise wildtype tRNA, acceptance of an amino acid, e.g., its cognate amino acid or a non-cognate amino acid, and transfer of the amino acid (AA) in the initiation or elongation of a polypeptide chain. Typically, the AStD comprises a 3′-end adenosine (CCA) for acceptor stem charging which is part of synthetase recognition. In an embodiment the AStD has at least 75, 80, 85, 85, 90, 95, or 100% identity with a naturally occurring AStD, e.g., an AStD encoded by a nucleic acid in Table 1. In an embodiment, the TREM can comprise a fragment or analog of an AStD, e.g., an AStD encoded by a nucleic acid in Table 1, which fragment in embodiments that has AStD activity and in other embodiments do not have AStD activity. One of ordinary skill can determine the relevant corresponding sequence for any of the domains, stems, loops, or other sequence features mentioned herein from a sequence encoded by a nucleic acid in Table 1. For example, one of ordinary skill can determine the sequence which corresponds to an AStD from a tRNA sequence encoded by a nucleic acid in Table 1. In an embodiment, the ASGPR binding moiety is present within the AStD. In an embodiment, the ASGPR binding moiety is bound to a nucleobase within a nucleotide in the AStD. In an embodiment, the ASGPR binding moiety is present within the internucleotide linkage in the AStD. In an embodiment, the ASGPR binding moiety is present on a terminus (e.g., the 5′ or 3′ terminus) within the AStD.

In an embodiment, the ASt Domain 1 comprises positions 1-9 within the TREM sequence. In an embodiment, the ASGPR binding moiety is present within ASt Domain1 (e.g., positions 1-9) within the TREM sequence. In an embodiment, the ASt Domain2 comprises positions 65-76 within the TREM sequence. In an embodiment, the ASPGR binding moiety is present within the ASt Domain 1 (e.g., positions 65-76) within the TREM sequence.

In an embodiment the AStD falls under the corresponding sequence of a consensus sequence provided in the “Consensus Sequence” section or differs from the consensus sequence by no more than 1, 2, 5, or 10 positions. In an embodiment, the ASPGR binding moiety is present with the AStD which falls under the corresponding sequence of a consensus sequence provided in the “Consensus Sequence” section or differs from the consensus sequence by no more than 1, 2, 5, or 10 positions.

In an embodiment, the AStD comprises residues R₁-R₂-R₃-R₄-R₅-R₆-R₇(an exemplary ASt Domian2) and residues R₆₅-R₆₆-R₆₇-R₆₈-R₆₉-R₇₀-R₇₁(an exemplary ASt Domian2) of Formula I_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula I_ZZZrefers to all species.

In an embodiment, the AStD comprises residues R₁-R₂-R₃-R₄-R₅-R₆-R₇and residues R₆₅-R₆₆-R₆₇-R₆₈-R₆₉-R₇₀-R₇₁of Formula II_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula II_ZZZrefers to mammals.

In an embodiment, the AStD comprises residues R₁-R₂-R₃-R₄-R₅-R₆-R₇and residues R₆₅-R₆₆-R₆₇-R₆₈-R₆₉-R₇₀-R₇₁of Formula III_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula III_ZZZrefers to humans.

In an embodiment, ZZZ indicates any of the amino acids: Alanine, Arginine, Asparagine, Aspartate, Cysteine, Glutamine, Glutamate, Glycine, Histidine, Isoleucine, Methionine, Leucine, Lysine, Phenylalanine, Proline, Serine, Threonine, Tryptophan, Tyrosine, or Valine.

An “anticodon hairpin domain (ACHD)”, as that term is used herein, refers to a domain comprising an anticodon that binds a respective codon in an mRNA, and comprises sufficient sequence, e.g., an anticodon triplet, to mediate, e.g., when present in an otherwise wildtype tRNA, pairing (with or without wobble) with a codon. In an embodiment the ACHD has at least 75, 80, 85, 85, 90, 95, or 100% identity with a naturally occurring ACHD, e.g., an ACHD encoded by a nucleic acid in Table 1. In an embodiment, the TREM can comprise a fragment or analog of an ACHD, e.g., an ACHD encoded by a nucleic acid in Table 1, which fragment in embodiments has ACHD activity and in other embodiments does not have ACHD activity. In an embodiment, the ASGPR binding moiety is present within the ACHD. In an embodiment, the ASGPR binding moiety is bound to a nucleobase within a nucleotide in the ACHD.

In an embodiment, the ACHD comprises positions 27-43 within the TREM sequence. In an embodiment, the ASGPR binding moiety is present within the ACHD (e.g., positions 27-43) within the TREM sequence.

In an embodiment the ACHD falls under the corresponding sequence of a consensus sequence provided in the “Consensus Sequence” section or differs from the consensus sequence by no more than 1, 2, 5, or 10 positions. In an embodiment, the ASGPR binding moiety is present within the corresponding sequence of a consensus sequence provided in the “Consensus Sequence” section or a sequence that differs from the consensus sequence by no more than 1, 2, 5, or 10 positions.

In an embodiment, the ACHD comprises residues -R₃₀-R₃₁-R₃₂-R₃₃-R₃₄-R₃₅-R₃₆-R₃₇-R₃₈-R₃₉-R₄₀-R₄₁-R₄₂-R₄₃-R₄₄-R₄₅-R₄₆of Formula I_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula I_ZZZrefers to all species.

In an embodiment, the ACHD comprises residues -R₃₀-R₃₁-R₃₂-R₃₃-R₃₄-R₃₅-R₃₆-R₃₇-R₃₈-R₃₉-R₄₀-R₄₁-R₄₂-R₄₃-R₄₄-R₄₅-R₄₆of Formula II_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula II_ZZZrefers to mammals.

In an embodiment, the ACHD comprises residues -R₃₀-R₃₁-R₃₂-R₃₃-R₃₄-R₃₅-R₃₆-R₃₇-R₃₈-R₃₉-R₄₀-R₄₁-R₄₂-R₄₃-R₄₄-R₄₅-R₄₆of Formula III_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula III_ZZZrefers to humans.

In an embodiment, the anticodon of a TREM entity comprises three nucleotide residues and pairs with a three nucleotide codon. In an embodiment, the anticodon of a TREM entity consists of three nucleotide residues and pairs with an anticodon which consists of three nucleotide residues. In an embodiment the anticodon of the TREM entity does not pair with a codon having four, five or a larger number of nucleotide residues but pairs only with three codon nucleotide residues.

In an embodiment, the TREM entity does not alter the reading frame of an mRNA. In an embodiment, the anti-codon of a TREM entity pairs with a triplet codon of an mRNA and does not pair with an adjacent nucleotide.

In an embodiment, use of the TREM entity does not alter the length of the polypeptide transcribed from the mRNA, e.g., it does not suppress a termination codon, e.g., a premature termination codon. In an embodiment, the TREM does not alter the length of the ORF of an mRNA.

An “asialoglycoprotein receptor (ASGPR) binding moiety,” as that term is used herein, refers to a moiety which binds an asialoglycoprotein receptor. In an embodiment, the ASGPR binding moiety as described herein refers to structure comprising: (i) an ASGPR carbohydrate and (ii) a ASGPR linker (e.g., a linker connecting the carbohydrate to the TREM). Exemplary ASGPR moieties include galactose (Gal), galactosamine (GalNH₂), or an N-acetylgalactosamine (GalNAc) moiety, for example, a Gal, GalNH₂, or GalNAc, or an analog thereof. The ASGPR binding moieties may comprise functional groups (e.g., hydroxyl groups, carboxylate groups, amines) that may be protected by a chemical protecting group, e.g., an acetyl group or methyl group. In an embodiment, the ASGPR binding moiety comprises a triantennary GalNAc moiety. In an embodiment, the ASGPR binding moiety may ASGPR binding moieties are described in further detail herein.

A “cognate adaptor function TREM,” as that term is used herein, refers to a TREM which mediates initiation or elongation with the AA (the cognate AA) associated in nature with the anti-codon of the TREM.

“Decreased expression,” as that term is used herein, refers to a decrease in comparison to a reference, e.g., in the case where altered control region, or addition of an agent, results in a decreased expression of the subject product, it is decreased relative to an otherwise similar cell without the alteration or addition.

A dihydrouridine hairpin domain (DHD), as that term is used herein, refers to a domain which comprises sufficient RNA sequence to mediate, e.g., when present in an otherwise wildtype tRNA, recognition of aminoacyl-tRNA synthetase, e.g., acts as a recognition site for aminoacyl-tRNA synthetase for amino acid charging of the TREM. In embodiments, a DHD mediates the stabilization of the TREM's tertiary structure. In an embodiment the DHD has at least 75, 80, 85, 85, 90, 95, or 100% identity with a naturally occurring DHD, e.g., a DHD encoded by a nucleic acid in Table 1. In an embodiment, the TREM can comprise a fragment or analog of a DHD, e.g., a DHD encoded by a nucleic acid in Table 1, which fragment in embodiments has DHD activity and in other embodiments does not have DHD activity. In an embodiment, the ASGPR binding moiety is present within the DHD. In an embodiment, the ASGPR binding moiety is bound to a nucleobase within a nucleotide in the DHD.

In an embodiment, the DHD comprises positions 10-26 within the TREM sequence. In an embodiment, the ASGPR binding moiety is present within the DHD (e.g., positions 10-26) within the TREM sequence.

In an embodiment the DHD falls under the corresponding sequence of a consensus sequence provided in the “Consensus Sequence” section or differs from the consensus sequence by no more than 1, 2, 5, or 10 positions. In an embodiment, the ASGPR binding moiety is present within the corresponding sequence of a consensus sequence provided in the “Consensus Sequence” section or a sequence that differs from the consensus sequence by no more than 1, 2, 5, or 10 positions.

In an embodiment, the DHD comprises residues R₁₀-R₁₁-R₁₂-R₁₃-R₁₄R₁₅-R₁₆-R₁₇-R₁₈-R₁₉-R₂₀-R₂₁-R₂₂-R₂₃-R₂₄-R₂₅-R₂₆-R₂₇-R₂₈of Formula I_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula I_ZZZrefers to all species.

In an embodiment, the DHD comprises residues R₁₀-R₁₁-R₁₂-R₁₃-R₁₄R₁₅-R₁₆-R₁₇-R₁₈-R₁₉-R₂₀-R₂₁-R₂₂-R₂₃-R₂₄-R₂₅-R₂₆-R₂₇-R₂₈of Formula II_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula II_ZZZrefers to mammals.

In an embodiment, the DHD comprises residues R₁₀-R₁₁-R₁₂-R₁₃-R₁₄R₁₅-R₁₆-R₁₇-R₁₈-R₁₉-R₂₀-R₂₁-R₂₂-R₂₃-R₂₄-R₂₅-R₂₆-R₂₇-R₂₈of Formula III_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula III_ZZZrefers to humans.

An “exogenous nucleic acid,” as that term is used herein, refers to a nucleic acid sequence that is not present in or differs by at least one nucleotide from the closest sequence in a reference cell, e.g., a cell into which the exogenous nucleic acid is introduced. In an embodiment, an exogenous nucleic acid comprises a nucleic acid that encodes a TREM.

An “exogenous TREM,” as that term is used herein, refers to a TREM that:

- (a) differs by at least one nucleotide or one post transcriptional modification from the closest sequence tRNA in a reference cell, e.g., a cell into which the exogenous nucleic acid is introduced;
- (b) has been introduced into a cell other than the cell in which it was transcribed;
- (c) is present in a cell other than one in which it naturally occurs; or
- (d) has an expression profile, e.g., level or distribution, that is non-wildtype, e.g., it is expressed at a higher level than wildtype. In an embodiment, the expression profile can be mediated by a change introduced into a nucleic acid that modulates expression or by addition of an agent that modulates expression of the RNA molecule. In an embodiment an exogenous TREM comprises 1, 2, 3 or 4 of properties (a)-(d).

A “GMP-grade composition,” as that term is used herein, refers to a composition in compliance with current good manufacturing practice (cGMP) guidelines, or other similar requirements. In an embodiment, a GMP-grade composition can be used as a pharmaceutical product.

As used herein, the terms “increasing” and “decreasing” refer to modulating that results in, respectively, greater or lesser amounts of function, expression, or activity of a particular metric relative to a reference. For example, subsequent to administration to a cell, tissue or subject of a TREM described herein, the amount of a marker of a metric (e.g., protein translation, mRNA stability, protein folding) as described herein may be increased or decreased by at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 98%, 2X, 3X, 5X, 10X or more relative to the amount of the marker prior to administration or relative to the effect of a negative control agent. The metric may be measured subsequent to administration at a time that the administration has had the recited effect, e.g., at least 12 hours, 24 hours, one week, one month, 3 months, or 6 months, after a treatment has begun.

“Increased expression,” as that term is used herein, refers to an increase in comparison to a reference, e.g., in the case where altered control region, or addition of an agent, results in an increased expression of the subject product, it is increased relative to an otherwise similar cell without the alteration or addition.

A Linker 2 region (L2), as that term is used herein, refers to a linker comprising residues R₈-R₉of a consensus sequence provided in the “Consensus Sequence” section.

A Linker 3 region (L3), that term is used herein, refers to a linker comprising residue R₂₉of a consensus sequence provided in the “Consensus Sequence” section.

A “Linker 4 region (L4), as that term is used herein, refers to a domain comprising residue R₇₂of a consensus sequence provided in the “Consensus Sequence” section.

A “modification,” as that term is used herein with reference to a nucleotide, refers to a modification of the chemical structure, e.g., a covalent modification, of the subject nucleotide. In an embodiment, the modification is present within the nucleobase, nucleotide sugar, or internucleotide linkage of a nucleotide of the TREM. The modification can be naturally occurring or non-naturally occurring. In an embodiment, the modification is non-naturally occurring. In an embodiment, the modification is naturally occurring. In an embodiment, the modification is a synthetic modification. In an embodiment, the modification is a modification provided in Tables 5, 6, 7, 8 or 9.

A “naturally occurring nucleotide,” as that term is used herein, refers to a nucleotide that does not comprise a non-naturally occurring modification. In an embodiment, it includes a naturally occurring modification.

A “nucleotide,” as that term is used herein, refers to an entity comprising a sugar, typically a pentameric sugar; a nucleobase; and a phosphate linking group (e.g., internucleotide linkage). In an embodiment, a nucleotide comprises a naturally occurring, e.g., naturally occurring in a human cell, nucleotide, e.g., an adenine, thymine, guanine, cytosine, or uracil nucleotide.

A “thymine hairpin domain (THD), as that term is used herein, refers to a domain which comprises sufficient RNA sequence, to mediate, e.g., when present in an otherwise wildtype tRNA, recognition of the ribosome, e.g., acts as a recognition site for the ribosome to form a TREM-ribosome complex during translation. In an embodiment the THD has at least 75, 80, 85, 85, 90, 95, or 100% identity with a naturally occurring THD, e.g., a THD encoded by a nucleic acid in Table 1. In an embodiment, the TREM can comprise a fragment or analog of a THD, e.g., a THD encoded by a nucleic acid in Table 1, which fragment in embodiments has THD activity and in other embodiments does not have THD activity. In an embodiment, the ASPGR binding moiety is present within the THD. In an embodiment, the ASGPR binding moiety is bound to a nucleobase within a nucleotide in the THD.

In an embodiment, the THD comprises positions 50-64 within the TREM sequence. In an embodiment, the ASPGR binding moiety is present within the THD (e.g., positions 50-64) within the TREM sequence.

In an embodiment the THD falls under the corresponding sequence of a consensus sequence provided in the “Consensus Sequence” section or differs from the consensus sequence by no more than 1, 2, 5, or 10 positions.

In an embodiment, the THD comprises residues -R₄₈-R₄₉-R₅₀-R₅₁-R₅₂-R₅₃-R₅₄-R₅₅-R₅₆-R₅₇-R₅₈-R₅₉-R₆₀-R₆₁-R₆₂-R₆₃-R₆₄of Formula I_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula I_ZZZrefers to all species.

In an embodiment, the THD comprises residues -R₄₈-R₄₉-R₅₀-R₅₁-R₅₂-R₅₃-R₅₄-R₅₅-R₅₆-R₅₇-R₅₈-R₅₉-R₆₀-R₆₁-R₆₂-R₆₃-R₆₄of Formula II_ZZZ, wherein ZZZ indicates any of the twenty amino acids. In some embodiments, Formula II_ZZZrefers to mammals.

A “tRNA-based effector molecule” or “TREM,” as that term is used herein, refers to an RNA molecule comprising a structure or property from (a)-(v) below, and which is a recombinant TREM, a synthetic TREM, or a TREM expressed from a heterologous cell. The TREMs described in the present invention are synthetic molecules and are made, e.g., in a cell free reaction, e.g., in a solid state or liquid phase synthetic reaction. TREMs are chemically distinct, e.g., in terms of primary sequence, type or location of modifications from the endogenous tRNA molecules made in cells, e.g., in mammalian cells, e.g., in human cells. A TREM can have a plurality (e.g., 2, 3, 4, 5, 6, 7, 8, 9) of the structures and functions of (a)-(v).

In an embodiment, a TREM is non-native, as evaluated by structure or the way in which it was made.

In an embodiment, a TREM comprises one or more of the following structures or properties:

- (a′) an optional linker region of a consensus sequence provided in the “Consensus Sequence” section, e.g., a Linker 1 region;
- (a) an acceptor stem domain (an AStD), which typically comprises an ASt Domain1 and an ASt Domain2;
- (a′-1) a Linker 2 region (L2) a linker comprising residues R₈-R₉of a consensus sequence provided in the “Consensus Sequence” section, e.g., a Linker 2 region;
- (b) a DHD or dihydrouridine hairpin domain (DHD);
- (b′-1) a Linker 3 region, or L3;
- (c) an ACHD or anticodon hairpin domain;
- (d) a VLD, or variable loop domain (VLD);
- (e) a THD or thymine hairpin domain (THD);
- (e′1) an L4 linker comprising residue R₇₂of a consensus sequence provided in the “Consensus Sequence” section;
- (f) under physiological conditions, it comprises a stem structure and one or a plurality of loop structures, e.g., 1, 2, or 3 loops. A loop can comprise a domain described herein, e.g., a domain selected from (a)-(e). A loop can comprise one or a plurality of domains. In an embodiment, a stem or loop structure has at least 75, 80, 85, 85, 90, 95, or 100% identity with a naturally occurring stem or loop structure, e.g., a stem or loop structure encoded by a nucleic acid in Table 1. In an embodiment, the TREM can comprise a fragment or analog of a stem or loop structure, e.g., a stem or loop structure encoded by a nucleic acid in Table 1, which fragment in embodiments has activity of a stem or loop structure, and in other embodiments does not have activity of a stem or loop structure;
- (g) a tertiary structure, e.g., an L-shaped tertiary structure;
- (h) adaptor function, i.e., the TREM mediates acceptance of an amino acid, e.g., its cognate amino acid and transfer of the AA in the initiation or elongation of a polypeptide chain;
- (i) cognate adaptor function wherein the TREM mediates acceptance and incorporation of an amino acid (e.g., cognate amino acid) associated in nature with the anti-codon of the TREM to initiate or elongate a polypeptide chain;
- (j) non-cognate adaptor function, wherein the TREM mediates acceptance and incorporation of an amino acid (e.g., non-cognate amino acid) other than the amino acid associated in nature with the anti-codon of the TREM in the initiation or elongation of a polypeptide chain;
- (k) a regulatory function, e.g., an epigenetic function (e.g., gene silencing function or signaling pathway modulation function), cell fate modulation function, mRNA stability modulation function, protein stability modulation function, protein transduction modulation function, or protein compartmentalization function;
- (l) a structure which allows for ribosome binding;
- (m) a post-transcriptional modification, e.g., a naturally occurring post-transcriptional modification;
- (n) the ability to inhibit a functional property of a tRNA, e.g., any of properties (h)-(k) possessed by a tRNA;
- (o) the ability to modulate cell fate;
- (p) the ability to modulate ribosome occupancy;
- (q) the ability to modulate protein translation;
- (r) the ability to modulate mRNA stability;
- (s) the ability to modulate protein folding and structure;
- (t) the ability to modulate protein transduction or compartmentalization;
- (u) the ability to modulate protein stability; or
- (v) the ability to modulate a signaling pathway, e.g., a cellular signaling pathway.

In an embodiment, a TREM comprises a full-length tRNA molecule or a fragment thereof.

In an embodiment, a TREM comprises the following properties: (a)-(e).

In an embodiment, a TREM comprises the following properties: (a) and (c).

In an embodiment, a TREM comprises the following properties: (a), (c) and (h).

In an embodiment, a TREM comprises the following properties: (a), (c), (h) and (b).

In an embodiment, a TREM comprises the following properties: (a), (c), (h) and (e).

In an embodiment, a TREM comprises the following properties: (a), (c), (h), (b) and (e).

In an embodiment, a TREM comprises the following properties: (a), (c), (h), (b), (e) and (g).

In an embodiment, a TREM comprises the following properties: (a), (c), (h) and (m).

In an embodiment, a TREM comprises the following properties: (a), (c), (h), (m), and (g).

In an embodiment, a TREM comprises the following properties: (a), (c), (h), (m) and (b).

In an embodiment, a TREM comprises the following properties: (a), (c), (h), (m) and (e).

In an embodiment, a TREM comprises the following properties: (a), (c), (h), (m), (g), (b) and (e).

In an embodiment, a TREM comprises the following properties: (a), (c), (h), (m), (g), (b), (e) and (q).

In an embodiment, a TREM comprises:

- (i) an amino acid attachment domain that binds an amino acid (e.g., an AStD, as described in (a) herein); and
- (ii) an anticodon that binds a respective codon in an mRNA (e.g., an ACHD, as described in (c) herein).

In an embodiment the TREM comprises a flexible RNA linker which provides for covalent linkage of (i) to (ii).

In an embodiment, the TREM mediates protein translation.

In an embodiment a TREM comprises a linker, e.g., an RNA linker, e.g., a flexible RNA linker, which provides for covalent linkage between a first and a second structure or domain. In an embodiment, an RNA linker comprises at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 ribonucleotides. A TREM can comprise one or a plurality of linkers, e.g., in embodiments a TREM comprising (a), (b), (c), (d) and (e) can have a first linker between a first and second domain, and a second linker between a third domain and another domain.

In an embodiment, the TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2].

In an embodiment, a TREM comprises an RNA sequence at least 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98 or 99% identical with, or which differs by no more than 1, 2, 3, 4, 5, 10, 15, 20, 25, or 30 ribonucleotides from, an RNA sequence encoded by a DNA sequence listed in Table 1, or a fragment or functional fragment thereof. In an embodiment, a TREM comprises an RNA sequence encoded by a DNA sequence listed in Table 1, or a fragment or functional fragment thereof. In an embodiment, a TREM comprises an RNA sequence encoded by a DNA sequence at least 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98 or 99% identical with a DNA sequence listed in Table 1, or a fragment or functional fragment thereof. In an embodiment, a TREM comprises a TREM domain, e.g., a domain described herein, comprising at least 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98, or 99% identical with, or which differs by no more than 1, 2, 3, 4, 5, 10, or 15, ribonucleotides from, an RNA encoded by a DNA sequence listed in Table 1, or a fragment or a functional fragment thereof. In an embodiment, a TREM comprises a TREM domain, e.g., a domain described herein, comprising an RNA sequence encoded by DNA sequence listed in Table 1, or a fragment or functional fragment thereof. In an embodiment, a TREM comprises a TREM domain, e.g., a domain described herein, comprising an RNA sequence encoded by DNA sequence at least 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98 or 99% identical with a DNA sequence listed in Table 1, or a fragment or functional fragment thereof.

In an embodiment, a TREM is 76-90 nucleotides in length. In embodiments, a TREM or a fragment or functional fragment thereof is between 10-90 nucleotides, between 10-80 nucleotides, between 10-70 nucleotides, between 10-60 nucleotides, between 10-50 nucleotides, between 10-40 nucleotides, between 10-30 nucleotides, between 10-20 nucleotides, between 20-90 nucleotides, between 20-80 nucleotides, 20-70 nucleotides, between 20-60 nucleotides, between 20-50 nucleotides, between 20-40 nucleotides, between 30-90 nucleotides, between 30-80 nucleotides, between 30-70 nucleotides, between 30-60 nucleotides, or between 30-50 nucleotides.

In an embodiment, a TREM is aminoacylated, e.g., charged, with an amino acid by an aminoacyl tRNA synthetase.

In an embodiment, a TREM is not charged with an amino acid, e.g., an uncharged TREM (uTREM).

In an embodiment, a TREM comprises less than a full length tRNA. In embodiments, a TREM can correspond to a naturally occurring fragment of a tRNA, or to a non-naturally occurring fragment. Exemplary fragments include: TREM halves (e.g., from a cleavage in the ACHD, e.g., in the anticodon sequence, e.g., 5′halves or 3′ halves); a 5′ fragment (e.g., a fragment comprising the 5′ end, e.g., from a cleavage in a DHD or the ACHD); a 3′ fragment (e.g., a fragment comprising the 3′ end, e.g., from a cleavage in the THD); or an internal fragment (e.g., from a cleavage in one or more of the ACHD, DHD or THD).

A “TREM core fragment,” as that term is used herein, refers to a portion of the sequence of Formula B: [L1]_y-[ASt Domain1]_x-[L2]_y-[DH Domain]_y-[L3]_y-[ACH Domain]_x-[VL Domain]_y-[TH Domain]_y-[L4]_y-[ASt Domain2]_x, wherein: x=1 and y=0 or 1.

A “TREM fragment,” as used herein, refers to a portion of a TREM, wherein the TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2].

A “non-cognate adaptor function TREM,” as that term is used herein, refers to a TREM which mediates initiation or elongation with an AA (a non-cognate AA) other than the AA associated in nature with the anti-codon of the TREM. In an embodiment, a non-cognate adaptor function TREM is also referred to as a mischarged TREM (mTREM).

A “non-naturally occurring sequence,” as that term is used herein, refers to a sequence wherein an Adenine is replaced by a residue other than an analog of adenine, a cytosine is replaced by a residue other than an analog of cytosine, a guanine is replaced by a residue other than an analog of guanine, and a uracil is replaced by a residue other than an analog of uracil. An analog refers to any possible derivative of the ribonucleotides, A, G, C or U. In an embodiment, a sequence having a derivative of any one of ribonucleotides A, G, C or U is a non-naturally occurring sequence.

A “pharmaceutical TREM composition,” as that term is used herein, refers to a TREM composition that is suitable for pharmaceutical use. Typically, a pharmaceutical TREM composition comprises a pharmaceutical excipient. In an embodiment the TREM will be the only active ingredient in the pharmaceutical TREM composition. In embodiments the pharmaceutical TREM composition is free, substantially free, or has less than a pharmaceutically acceptable amount, of host cell proteins, DNA, e.g., host cell DNA, endotoxins, and bacteria.

A “post-transcriptional processing,” as that term is used herein, with respect to a subject molecule, e.g., a TREM, RNA or tRNAs, refers to a covalent modification of the subject molecule. In an embodiment, the covalent modification occurs post-transcriptionally. In an embodiment, the covalent modification occurs co-transcriptionally. In an embodiment, the modification is made in vivo, e.g., in a cell used to produce a TREM. In an embodiment the modification is made ex vivo, e.g., it is made on a TREM isolated or obtained from the cell which produced the TREM. In an embodiment, the post-transcriptional modification is selected from a post-transcriptional modification listed in Table 2.

A “subject,” as this term is used herein, includes any organism, such as a human or other animal. In embodiments, the subject is a vertebrate animal (e.g., mammal, bird, fish, reptile, or amphibian). In embodiments, the subject is a mammal, e.g., a human. In embodiments, the method subject is a non-human mammal. In embodiments, the subject is a non-human mammal such as a non-human primate (e.g., monkeys, apes), ungulate (e.g., cattle, buffalo, sheep, goat, pig, camel, llama, alpaca, deer, horses, donkeys), carnivore (e.g., dog, cat), rodent (e.g., rat, mouse), or lagomorph (e.g., rabbit). In embodiments, the subject is a bird, such as a member of the avian taxa Galliformes (e.g., chickens, turkeys, pheasants, quail), Anseriformes (e.g., ducks, geese), Paleaognathae (e.g., ostriches, emus), Columbiformes (e.g., pigeons, doves), or Psittaciformes (e.g., parrots). The subject may be a male or female of any age group, e.g., a pediatric subject (e.g., infant, child, adolescent) or adult subject (e.g., young adult, middle-aged adult, or senior adult)). A non-human subject may be a transgenic animal.

A “synthetic TREM,” as that term is used herein, refers to a TREM which was synthesized other than in or by a cell having an endogenous nucleic acid encoding the TREM, e.g., a synthetic TREM is synthetized by cell-free solid phase synthesis. A synthetic TREM can have the same, or a different, sequence, or tertiary structure, as a native tRNA.

A “recombinant TREM,” as that term is used herein, refers to a TREM that was expressed in a cell modified by human intervention, having a modification that mediates the production of the TREM, e.g., the cell comprises an exogenous sequence encoding the TREM, or a modification that mediates expression, e.g., transcriptional expression or post-transcriptional modification, of the TREM. A recombinant TREM can have the same, or a different, sequence, set of post-transcriptional modifications, or tertiary structure, as a reference tRNA, e.g., a native tRNA.

A “tRNA”, as that term is used herein, refers to a naturally occurring transfer ribonucleic acid in its native state.

A “TREM composition,” as that term is used herein, refers to a composition comprising a plurality of TREMs, a plurality of TREM core fragments and/or a plurality of TREM fragments. A TREM composition can comprise one or more species of TREMs, TREM core fragments or TREM fragments. In an embodiment, the composition comprises only a single species of TREM, TREM core fragment or TREM fragment. In an embodiment, the TREM composition comprises a first TREM, TREM core fragment or TREM fragment species; and a second TREM, TREM core fragment or TREM fragment species. In an embodiment, the TREM composition comprises X TREM, TREM core fragment or TREM fragment species, wherein X=2, 3, 4, 5, 6, 7, 8, 9, or 10. In an embodiment, the TREM, TREM core fragment or TREM fragment has at least 70, 75, 80, 85, 90, or 95, or has 100%, identity with a sequence encoded by a nucleic acid in Table 1. A TREM composition can comprise one or more species of TREMs, TREM core fragments or TREM fragments. In an embodiment, the TREM composition is at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 95 or 99% dry weight TREMs (for a liquid composition dry weight refers to the weight after removal of substantially all liquid, e.g., after lyophilization). In an embodiment, the composition is a liquid. In an embodiment, the composition is dry, e.g., a lyophilized material. In an embodiment, the composition is a frozen composition. In an embodiment, the composition is sterile. In an embodiment, the composition comprises at least 0.5 g, 1.0 g, 5.0 g, 10 g, 15 g, 25 g, 50 g, 100 g, 200 g, 400 g, or 500 g (e.g., as determined by dry weight) of TREM.

In an embodiment, at least X % of the TREMs in a TREM composition comprises a chemical modification at a selected position, and X is 80, 90, 95, 96, 97, 98, 99, or 99.5.

In an embodiment, at least X % of the TREMs in a TREM composition comprises a chemical modification at a first position and a chemical modification at a second position, and X, independently, is 80, 90, 95, 96, 97, 98, 99, or 99.5. In embodiments, the modification at the first and second position is the same. In embodiments, the modification at the first and second position are different. In embodiments, the nucleotide at the first and second position is the same, e.g., both are adenine. In embodiments, the nucleotide at the first and second position are different, e.g., one is adenine and one is thymine.

In an embodiment, at least X % of the TREMs in a TREM composition comprises a chemical modification at a first position and less than Y % have a chemical modification at a second position, wherein X is 80, 90, 95, 96, 97, 98, 99, or 99.5 and Y is 20, 20, 5, 2, 1, 0.1, or 0.01. In embodiments, the nucleotide at the first and second position is the same, e.g., both are adenine. In embodiments the nucleotide at the first and second position are different, e.g., one is adenine and one is thymine.

A “variable loop domain (VLD),” as that term is used herein refers to a domain which comprises sufficient RNA sequence to mediate, e.g., when present in an otherwise wildtype tRNA, recognition of aminoacyl-tRNA synthetase, e.g., acts as a recognition site for aminoacyl-tRNA synthetase for amino acid charging of the TREM. In embodiments, a VLD mediates the stabilization of the TREM's tertiary structure. In an embodiment, a VLD modulates, e.g., increases, the specificity of the TREM, e.g., for its cognate amino acid, e.g., the VLD modulates the TREM's cognate adaptor function. In an embodiment the VLD has at least 75, 80, 85, 85, 90, 95, or 100% identity with a naturally occurring VLD, e.g., a VLD encoded by a nucleic acid in Table 1. In an embodiment, the TREM can comprise a fragment or analog of a VLD, e.g., a VLD encoded by a nucleic acid in Table 1, which fragment in embodiments has VLD activity and in other embodiments does not have VLD activity. In an embodiment, the ASGPR binding moiety is present within the VLD. In an embodiment, the ASGPR binding moiety is bound to a nucleobase within a nucleotide in the VLD.

In an embodiment, the VLD comprises positions 44-49 within the TREM sequence. In an embodiment, the ASGPR binding moiety is present within the VLD (e.g., positions 44-49) within the TREM sequence.

In an embodiment the VLD falls under the corresponding sequence of a consensus sequence provided in the “Consensus Sequence” section.

In an embodiment, the VLD comprises residue -[R₄₇]_xof a consensus sequence provided in the “Consensus Sequence” section, wherein x=1-271 (e.g., x=1-250, x=1-225, x=1-200, x=1-175, x=1-150, x=1-125, x=1-100, x=1-75, x=1-50, x=1-40, x=1-30, x=1-29, x=1-28, x=1-27, x=1-26, x=1-25, x=1-24, x=1-23, x=1-22, x=1-21, x=1-20, x=1-19, x=1-18, x=1-17, x=1-16, x=1-15, x=1-14, x=1-13, x=1-12, x=1-11, x=1-10, x=10-271, x=20-271, x=30-271, x=40-271, x=50-271, x=60-271, x=70-271, x=80-271, x=100-271, x=125-271, x=150-271, x=175-271, x=200-271, x=225-271, x=1, x=2, x=3, x=4, x=5, x=6, x=7, x=8, x=9, x=10, x=11, x=12, x=13, x=14, x=15, x=16, x=17, x=18, x=19, x=20, x=21, x=22, x=23, x=24, x=25, x=26, x=27, x=28, x=29, x=30, x=40, x=50, x=60, x=70, x=80, x=90, x=100, x=110, x=125, x=150, x=175, x=200, x=225, x=250, or x=271).

TREM Entities

Described herein are TREM entities, e.g., a TREM, a TREM Core Fragment, or a TREM Fragment, modified with an asialoglycoprotein receptor (ASGPR) binding moiety, as well as compositions and methods of use thereof. A TREM entity (e.g., a TREM) refers to an RNA molecule comprising one or more of the properties described herein. The ASGPR binding moiety may be conjugated to a nucleobase within the TREM entity, or within an internucleotide linkage of the TREM entity, or at a terminus (e.g., the 5′ or 3′ terminus) of the TREM entity. A TREM entity (e.g., a TREM) can comprise a chemical modification, e.g., as provided in Tables 4, 5, 6 or 7.

In an embodiment, a TREM entity includes a TREM comprising a sequence of Formula A; a TREM core fragment comprising a sequence of Formula B; or a TREM fragment comprising a portion of a TREM which TREM comprises a sequence of Formula A.

In an embodiment, a TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], wherein the ASGPR binding moiety is present within the ASt Domain1 (e.g., on a nucleobase, at a terminus (e.g., the 5′ terminus), or within the internucleotide linkage of ASt Domain1). In an embodiment, the ASGPR binding moiety is present on a nucleobase of a nucleotide within ASt Domain1. In an embodiment, the ASGPR binding moiety is present at the 5′ terminus within ASt Domain1 or at [L1]. In an embodiment, the ASGPR binding moiety is present within an internucleotide linkage of ASt Domain1. In an embodiment, [VL Domain] is optional. In an embodiment, [L1] is optional.

In an embodiment, a TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], wherein the ASGPR binding moiety is present within the ASt Domain2 (e.g., on a nucleobase, at a terminus (e.g., 3′ terminus), or within the internucleotide linkage of ASt Domain2). In an embodiment, the ASGPR binding moiety is present on a nucleobase of a nucleotide within ASt Domain2. In an embodiment, the ASGPR binding moiety is present at the 3′ terminus within ASt Domain2. In an embodiment, the ASGPR binding moiety is present within an internucleotide linkage of ASt Domain2. In an embodiment, [VL Domain] is optional. In an embodiment, [L1] is optional.

In an embodiment, a TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], wherein the ASGPR binding moiety is present within either one or both of ASt Domain1 and ASt Domain2 (e.g., on a nucleobase, at a terminus (e.g., 5′ or 3′ terminus), or within the internucleotide linkage of ASt Domain1 or ASt Domain2). In an embodiment, the ASGPR binding moiety is present on a nucleobase of a nucleotide within ASt Domain1 or ASt Domain2.

In an embodiment, the ASGPR binding moiety is present at the 5′ terminus within ASt Domain1 or [L1] or the 3′ terminus within ASt Domain2. In an embodiment, the ASGPR binding moiety is present within an internucleotide linkage of ASt Domain1 or ASt Domain2. In an embodiment, [VL Domain] is optional. In an embodiment, [L1] is optional.

In an embodiment, a TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], wherein the ASGPR binding moiety is present within the DH Domain (e.g., on a nucleobase or within the internucleotide linkage of the DH Domain). In an embodiment, the ASGPR binding moiety is present on a nucleobase of a nucleotide within the DH Domain. In an embodiment, the ASGPR binding moiety is present within an internucleotide linkage of the DH Domain. In an embodiment, [L1] is optional.

In an embodiment, a TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], wherein the ASGPR binding moiety is within the ACH Domain (e.g., on a nucleobase or within the internucleotide linkage of the ACH Domain). In an embodiment, the ASGPR binding moiety is present on a nucleobase of a nucleotide within the ACH Domain. In an embodiment, the ASGPR binding moiety is present within an internucleotide linkage of the ACH Domain. In an embodiment, [VL Domain] is optional. In an embodiment, [L1] is optional.

In an embodiment, a TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], wherein the ASGPR binding moiety is present within the VL Domain (e.g., on a nucleobase or within the internucleotide linkage of the VL Domain). In an embodiment, the ASGPR binding moiety is present on a nucleobase of a nucleotide within the VL Domain. In an embodiment, the ASGPR binding moiety is present within an internucleotide linkage of the VL Domain. In an embodiment, [L1] is optional.

In an embodiment, a TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], wherein the ASGPR binding moiety is present within the TH Domain (e.g., on a nucleobase or within the internucleotide linkage of the TH Domain). In an embodiment, the ASGPR binding moiety is present on a nucleobase of a nucleotide within the TH Domain. In an embodiment, the ASGPR binding moiety is present within an internucleotide linkage of the TH Domain. In an embodiment, [VL Domain] is optional. In an embodiment, [L1] is optional.

In an embodiment, a TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], wherein the ASGPR binding moiety is bound to a nucleobase within one or more domains selected from [ASt Domain1], [DH Domain], [ACH Domain], [TH Domain], and/or [ASt Domain2]. In an embodiment, [VL Domain] is optional. In an embodiment, [L1] is optional.

In an embodiment, a TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], wherein the ASGPR binding moiety is bound to an internucleotide linkage within one or more domains selected from [ASt Domain1], [DH Domain], [ACH Domain], [TH Domain], and/or [ASt Domain2]. In an embodiment, [VL Domain] is optional. In an embodiment, [L1] is optional.

In an embodiment, a TREM core fragment comprises a sequence of Formula B: [L1]_y-[ASt Domain1]_x-[L2]_y-[DH Domain]_y-[L3]_y-[ACH Domain]_x-[VL Domain]_y-[TH Domain]_y-[L4]_y-[ASt Domain2]x, wherein: x=1 and y=0 or 1, and the ASGPR binding moiety is bound to a nucleobase within a nucleotide within one or both of ASt Domain1 and ASt Domain2. In an embodiment, y=0. In an embodiment, y=1.

In an embodiment, a TREM core fragment comprises a sequence of Formula B: [L1]_y-[ASt Domain1]_x-[L2]_y-[DH Domain]_y-[L3]_y-[ACH Domain]_x-[VL Domain]_y-[TH Domain]_y-[L4]_y-[ASt Domain2]_x, wherein: x=1 and y=0 or 1, and the ASGPR binding moiety is bound to a nucleobase within a nucleotide within the DH Domain. In an embodiment, y=0. In an embodiment, y=1.

In an embodiment, a TREM core fragment comprises a sequence of Formula B: [L1]_y-[ASt Domain1]_x-[L2]_y-[DH Domain]_y-[L3]_y-[ACH Domain]_x-[VL Domain]_y-[TH Domain]_y-[L4]_y-[ASt Domain2]_x, wherein: x=1 and y=0 or 1, and the ASGPR binding moiety is bound to a nucleobase within a nucleotide within the ACH Domain. In an embodiment, y=0. In an embodiment, y=1.

In an embodiment, a TREM core fragment comprises a sequence of Formula B: [L1]_y-[ASt Domain1]_x-[L2]_y-[DH Domain]_y-[L3]_y-[ACH Domain]_x-[VL Domain]_y-[TH Domain]_y-[L4]_y-[ASt Domain2]_x, wherein: x=1 and y=0 or 1, and the ASGPR binding moiety is bound to a nucleobase within a nucleotide within the TH Domain. In an embodiment, y=0. In an embodiment, y=1.

In an embodiment, a TREM core fragment comprises a sequence of Formula B: [L1]_y-[ASt Domain1]_x-[L2]_y-[DH Domain]_y-[L3]_y-[ACH Domain]_x-[VL Domain]_y-[TH Domain]_y-[L4]_y-[ASt Domain2]_x, wherein: x=1 and y=0 or 1, and the ASGPR binding moiety is bound to a nucleobase within one ore more domain selected from [ASt Domain1], [DH Domain], [ACH Domain], [TH Domain], and/or [ASt Domain2]. In an embodiment, y=0. In an embodiment, y=1.

In an embodiment, a TREM fragment comprises a portion of a TREM, wherein the TREM comprises a sequence of Formula A: [L1]-[ASt Domain1]-[L2]-[DH Domain]-[L3]-[ACH Domain]-[VL Domain]-[TH Domain]-[L4]-[ASt Domain2], and wherein the TREM fragment comprises: one, two, three or all or any combination of the following: a TREM half (e.g., from a cleavage in the ACH Domain, e.g., in the anticodon sequence, e.g., a 5′half or a 3′ half); a 5′ fragment (e.g., a fragment comprising the 5′ end, e.g., from a cleavage in a DH Domain or the ACH Domain); a 3′ fragment (e.g., a fragment comprising the 3′ end, e.g., from a cleavage in the TH Domain); or an internal fragment (e.g., from a cleavage in any one of the ACH Domain, DH Domain or TH Domain). Exemplary TREM fragments include TREM halves (e.g., from a cleavage in the ACHD, e.g., 5′TREM halves or 3′ TREM halves), a 5′ fragment (e.g., a fragment comprising the 5′ end, e.g., from a cleavage in a DHD or the ACHD), a 3′ fragment (e.g., a fragment comprising the 3′ end of a TREM, e.g., from a cleavage in the THD), or an internal fragment (e.g., from a cleavage in one or more of the ACHD, DHD or THD).

In an embodiment, a TREM, a TREM core fragment or a TREM fragment can be charged with an amino acid (e.g., a cognate amino acid); charged with a non-cognate amino acid (e.g., a mischarged TREM (mTREM)); or not charged with an amino acid (e.g., an uncharged TREM (uTREM)). In an embodiment, a TREM, a TREM core fragment or a TREM fragment can be charged with an amino acid selected from alanine, arginine, asparagine, aspartate, cysteine, glutamine, glutamate, glycine, histidine, isoleucine, methionine, leucine, lysine, phenylalanine, proline, serine, threonine, tryptophan, tyrosine, or valine.

In an embodiment, the TREM, TREM core fragment or TREM fragment is a cognate TREM. In an embodiment, the TREM, TREM core fragment or TREM fragment is a non-cognate TREM. In an embodiment, the TREM, TREM core fragment or TREM fragment recognizes a codon provided in Table 2 or Table 3.

TABLE 2

List of codons

AAA

AAC

AAG

AAU

ACA

ACC

ACG

ACU

AGA

AGC

AGG

AGU

AUA

AUC

AUG

AUU

CAA

CAC

CAG

CAU

CCA

CCC

CCG

CCU

CGA

CGC

CGG

CGU

CUA

CUC

CUG

CUU

GAA

GAC

GAG

GAU

GCA

GCC

GCG

GCU

GGA

GGC

GGG

GGU

GUA

GUC

GUG

GUU

UAA

UAC

UAG

UAU

UCA

UCC

UCG

UCU

UGA

UGC

UGG

UGU

UUA

UUC

UUG

UUU

TABLE 3

Amino acids and corresponding codons

Amino Acid
mRNA codons

Alanine
GCU, GCC, GCA, GCG

Arginine
CGU, CGC, CGA, CGG, AGA, AGG

Asparagine
AAU, AAC

Aspartate
GAU, GAC

Cysteine
UGU, UGC

Glutamate
GAA, GAG

Glutamine
CAA, CAG

Glycine
GGU, GGC, GGA, GGG

Histidine
CAU, CAC

Isoleucine
AUU, AUC, AUA

Leucine
UUA, UUG, CUU, CUC, CUA, CUG

Lysine
AAA, AAG

Methionine
AUG

Phenylalanine
UUU, UUC

Proline
CCU, CCC, CCA, CCG

Serine
UCU, UCC, UCA, UCG, AGU, AGC

Stop
UAA, UAG, UGA

Threonine
ACU, ACC, ACA, ACG

Tryptophan
UGG

Tyrosine
UAU, UAC

Valine
GUU, GUC, GUA, GUG

In an embodiment, a TREM comprises a ribonucleic acid (RNA) sequence encoded by a deoxyribonucleic acid (DNA) sequence disclosed in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, a TREM comprises an RNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, a TREM comprises an RNA sequence encoded by a DNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1.

In an embodiment, a TREM, a TREM core fragment, or TREM fragment comprises at least 5, 10, 15, 20, 25, or 30 consecutive nucleotides of an RNA sequence encoded by a DNA sequence disclosed in Table 1, e.g., at least 5, 10, 15, 20, 25, or 30 consecutive nucleotides of an RNA sequence encoded by any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, a TREM, a TREM core fragment, or TREM fragment comprises at least 5, 10, 15, 20, 25, or 30 consecutive nucleotides of an RNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, a TREM, a TREM core fragment, or TREM fragment comprises at least 5, 10, 15, 20, 25, or 30 consecutive nucleotides of an RNA sequence encoded by a DNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1.

In an embodiment, a TREM core fragment or a TREM fragment comprises at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% of an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, a TREM core fragment or a TREM fragment comprises at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% of an RNA sequence at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, a TREM core fragment or a TREM fragment comprises at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% of an RNA sequence encoded by a DNA sequence at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1.

In an embodiment, a TREM core fragment or a TREM fragment comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of an RNA sequence encoded by a DNA sequence disclosed in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, a TREM core fragment or a TREM fragment comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of an RNA sequence which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, a TREM core fragment or a TREM fragment comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of an RNA sequence encoded by a DNA sequence with at least 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99% or 100% identity to a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1.

In an embodiment, a TREM core fragment or a TREM fragment comprises a sequence of a length of between 10-90 ribonucleotides (rnt), between 10-80 rnt, between 10-70 rnt, between 10-60 rnt, between 10-50 rnt, between 10-40 rnt, between 10-30 rnt, between 10-20 rnt, between 20-90 rnt, between 20-80 rnt, 20-70 rnt, between 20-60 rnt, between 20-50 rnt, between 20-40 rnt, between 30-90 rnt, between 30-80 rnt, between 30-70 rnt, between 30-60 rnt, or between 30-50 mt.

In any and all embodiments, the TREM described herein comprises a consensus sequence of Formula I_ZZZ,

R₀-R₁-R₂-R₃-R₄-R₅-R₆-R₇-R₈-R₉-R₁₀-R₁₁-R₁₂-R₁₃-R₁₄-R₁₅-R₁₆-R₁₇-R₁₈-R₁₉-R₂₀-R₂₁-R₂₂-

R₂₃-R₂₄-R₂₅-R₂₆-R₂₇-R₂₈-R₂₉-R₃₀-R₃₁-R₃₂-R₃₃-R₃₄-R₃₅-R₃₆-R₃₇-R₃₈-R₃₉-R₄₀-R₄₁-R₄₂-R₄₃-R₄₄-R₄₅-

R₄₆-[R₄₇]_x1-R₄₈-R₄₉-R₅₀-R₅₁-R₅₂-R₅₃-R₅₄-R₅₅-R₅₆-R₅₇-R₅₈-R₅₉-R₆₀-R₆₁-R₆₂-R₆₃-R₆₄-R₆₅-R₆₆-R₆₇-

R₆₈-R₆₉-R₇₀-R₇₁-R₇₂

- wherein (i)_ZZZindicates any of the twenty amino acids; (ii) Formula I corresponds to all species; (iii) x=1-271 (e.g., x=1-250, x=1-225, x=1-200, x=1-175, x=1-150, x=1-125, x=1-100, x=1-75, x=1-50, x=1-40, x=1-30, x=1-29, x=1-28, x=1-27, x=1-26, x=1-25, x=1-24, x=1-23, x=1-22, x=1-21, x=1-20, x=1-19, x=1-18, x=1-17, x=1-16, x=1-15, x=1-14, x=1-13, x=1-12, x=1-11, x=1-10, x=10-271, x=20-271, x=30-271, x=40-271, x=50-271, x=60-271, x=70-271, x=80-271, x=100-271, x=125-271, x=150-271, x=175-271, x=200-271, x=225-271, x=1, x=2, x=3, x=4, x=5, x=6, x=7, x=8, x=9, x=10, x=11, x=12, x=13, x=14, x=15, x=16, x=17, x=18, x=19, x=20, x=21, x=22, x=23, x=24, x=25, x=26, x=27, x=28, x=29, x=30, x=40, x=50, x=60, x=70, x=80, x=90, x=100, x=110, x=125, x=150, x=175, x=200, x=225, x=250, or x=271); and (iv) an ASGPR binding moiety is bound to a nucleobase within one or more of R₀-R₁-R₂-R₃-R₄-R₅-R₆-R₇-R₈or (v) an ASGPR binding moiety is bound to a nucleobase within one or more of R₆₁-R₆₂-R₆₃-R₆₄-R₆₅-R₆₆-R₆₇-R₆₈-R₆₉-R₇₀-R₇₁-R₇₂.

In any and all embodiments, the TREM described herein comprises a consensus sequence of Formula II_ZZZ,

- wherein (i) zzz indicates any of the twenty amino acids; (ii) Formula II corresponds to mammals; (iii) x=1-271 (e.g., x=1-250, x=1-225, x=1-200, x=1-175, x=1-150, x=1-125, x=1-100, x=1-75, x=1-50, x=1-40, x=1-30, x=1-29, x=1-28, x=1-27, x=1-26, x=1-25, x=1- 24, x=1-23, x=1-22, x=1-21, x=1-20, x=1-19, x=1-18, x=1-17, x=1-16, x=1-15, x=1-14, x=1-13, x=1-12, x=1-11, x=1-10, x=10-271, x=20-271, x=30-271, x=40-271, x=50-271, x=60-271, x=70-271, x=80-271, x=100-271, x=125-271, x=150-271, x=175-271, x=200-271, x=225-271, x=1, x=2, x=3, x=4, x=5, x=6, x=7, x=8, x=9, x=10, x=11, x=12, x=13, x=14, x=15, x=16, x=17, x=18, x=19, x=20, x=21, x=22, x=23, x=24, x=25, x=26, x=27, x=28, x=29, x=30, x=40, x=50, x=60, x=70, x=80, x=90, x=100, x=110, x=125, x=150, x=175, x=200, x=225, x=250, or x=271); and (iv) an ASGPR binding moiety is bound to a nucleobase within one or more of R₀-R₁-R₂-R₃-R₄-R₅-R₆-R₇-R₈or (v) an ASGPR binding moiety is bound to a nucleobase within one or more of R₆₁-R₆₂-R₆₃-R₆₄-R₆₅-R₆₆-R₆₇-R₆₈-R₆₉-R₇₀-R₇₁-R₇₂.

In any and all embodiments, the TREM described herein comprises a consensus sequence of Formula IIII_ZZZ,

- wherein (i)_ZZZindicates any of the twenty amino acids; (ii) Formula III corresponds to humans; (iii) x=1-271 (e.g., x=1-250, x=1-225, x=1-200, x=1-175, x=1-150, x=1-125, x=1-100, x=1-75, x=1-50, x=1-40, x=1-30, x=1-29, x=1-28, x=1-27, x=1-26, x=1-25, x=1-24, x=1-23, x=1-22, x=1-21, x=1-20, x=1-19, x=1-18, x=1-17, x=1-16, x=1-15, x=1-14, x=1-13, x=1-12, x1-11, x1-1, x10-271, x20-271, x30-271, x40-271, x50-271, x60-271, x70-271, x80-271, x100-271, x125-271, x150-271, x175-271, x200-271, x225-271, x1, x2, x=3, x=4, x=5, x=6, x=7, x=8, x=9, x=10, x=11, x=12, x=13, x=14, x=15, x=16, x=17, x=18, x=19, x=20, x=21, x=22, x=23, x=24, x=25, x=26, x=27, x=28, x=29, x=30, x=40, x=50, x=60, x=70, x=80, x=90, x=100, x=110, x=125, x=150, x=175, x=200, x=225, x=250, or x=271); and (iv) an ASGPR binding moiety is bound to a nucleobase within one or more of R₀-R₁R₂-R₃-R₄-R₅-R₆-R₇-R₈or (v) an ASGPR binding moiety is bound to a nucleobase within one or more of R₆₁-R₆₂-R₆₃-R₆₄-R₆₅-R₆₆-R₆₇-R₆₈-R₆₉-R₇₀-R₇₁-R₇₂.

TABLE 1

SEQ

ID

NO
tRNA name
tRNA sequence

1
Ala_AGC_chr6:28763741-28763812 (−)
GGGGGTATAGCTCAGTGGTAGAGC

GCGTGCTTAGCATGCACGAGGTCCT

2
Ala_AGC_chr6:26687485-26687557 (+)
GGGGAATTAGCTCAAGTGGTAGAG

CGCTTGCTTAGCACGCAAGAGGTA

3
Ala_AGC_chr6:26572092-26572164 (−)
GGGGAATTAGCTCAAATGGTAGAG

CGCTCGCTTAGCATGCGAGAGGTA

4
Ala_AGC_chr6:26682715-26682787 (+)
GGGGAATTAGCTCAAGTGGTAGAG

CGCTTGCTTAGCATGCAAGAGGTA

5
Ala_AGC_chr6:26705606-26705678 (+)
GGGGAATTAGCTCAAGCGGTAGAG

CGCTTGCTTAGCATGCAAGAGGTA

6
Ala_AGC_chr6:26673590-26673662 (+)
GGGGAATTAGCTCAAGTGGTAGAG

CGCTTGCTTAGCATGCAAGAGGTA

7
Ala_AGC_chr14:89445442-89445514 (+)
GGGGAATTAGCTCAAGTGGTAGAG

CGCTCGCTTAGCATGCGAGAGGTA

8
Ala_AGC_chr6:58196623-58196695 (−)
GGGGAATTAGCCCAAGTGGTAGAG

CGCTTGCTTAGCATGCAAGAGGTA

9
Ala_AGC_chr6:28806221-28806292 (−)
GGGGGTGTAGCTCAGTGGTAGAGC

GCGTGCTTAGCATGCACGAGGCCC

10
Ala_AGC_chr6:28574933-28575004 (+)
GGGGGTGTAGCTCAGTGGTAGAGC

GCGTGCTTAGCATGTACGAGGTCCC

11
Ala_AGC_chr6:28626014-28626085 (−)
GGGGATGTAGCTCAGTGGTAGAGC

GCATGCTTAGCATGCATGAGGTCCC

12
Ala_AGC_chr6:28678366-28678437 (+)
GGGGGTGTAGCTCAGTGGTAGAGC

GCGTGCTTAGCATGCACGAGGCCCT

13
Ala_AGC_chr6:28779849-28779920 (−)
GGGGGTATAGCTCAGCGGTAGAGC

GCGTGCTTAGCATGCACGAGGTCCT

14
Ala_AGC_chr6:28687481-28687552 (+)
GGGGGTGTAGCTCAGTGGTAGAGC

GCGTGCTTAGCATGCACGAGGCCC

15
Ala_AGC_chr2:27274082-27274154 (+)
GGGGGATTAGCTCAAATGGTAGAG

CGCTCGCTTAGCATGCGAGAGGTA

16
Ala_AGC_chr6:26730737-26730809 (+)
GGGGAATTAGCTCAGGCGGTAGAG

CGCTCGCTTAGCATGCGAGAGGTA

17
Ala_CGC_chr6:26553731-26553802 (+)
GGGGATGTAGCTCAGTGGTAGAGC

GCATGCTTCGCATGTATGAGGTCCC

18
Ala_CGC_chr6:28641613-28641684 (−)
GGGGATGTAGCTCAGTGGTAGAGC

GCATGCTTCGCATGTATGAGGCCCC

19
Ala_CGC_chr2:157257281-157257352 (+)
GGGGATGTAGCTCAGTGGTAGAGC

GCGCGCTTCGCATGTGTGAGGTCCC

20
Ala_CGC_chr6:28697092-28697163 (+)
GGGGGTGTAGCTCAGTGGTAGAGC

GCGTGCTTCGCATGTACGAGGCCCC

21
Ala_TGC_chr6:28757547-28757618 (−)
GGGGGTGTAGCTCAGTGGTAGAGC

GCATGCTTTGCATGTATGAGGTCCC

22
Ala_TGC_chr6:28611222-28611293 (+)
GGGGATGTAGCTCAGTGGTAGAGC

GCATGCTTTGCATGTATGAGGTCCC

23
Ala_TGC_chr5:180633868-180633939 (+)
GGGGATGTAGCTCAGTGGTAGAGC

GCATGCTTTGCATGTATGAGGCCCC

24
Ala_TGC_chr12:125424512-125424583 (+)
GGGGATGTAGCTCAGTGGTAGAGC

GCATGCTTTGCACGTATGAGGCCCC

25
Ala_TGC_chr6:28785012-28785083 (−)
GGGGGTGTAGCTCAGTGGTAGAGC

GCATGCTTTGCATGTATGAGGCCTC

26
Ala_TGC_chr6:28726141-28726212 (−)
GGGGGTGTAGCTCAGTGGTAGAGC

ACATGCTTTGCATGTGTGAGGCCCC

27
Ala_TGC_chr6:28770577-28770647 (−)
GGGGGTGTAGCTCAGTGGTAGAGC

GCATGCTTTGCATGTATGAGGCCTC

28
Arg_ACG_chr6:26328368-26328440 (+)
GGGCCAGTGGCGCAATGGATAACG

CGTCTGACTACGGATCAGAAGATTC

29
Arg_ACG_chr3:45730491-45730563 (−)
GGGCCAGTGGCGCAATGGATAACG

CGTCTGACTACGGATCAGAAGATTC

30
Arg_CCG_chr6:28710729-28710801 (−)
GGCCGCGTGGCCTAATGGATAAGG

CGTCTGATTCCGGATCAGAAGATTG

31
Arg_CCG_chr17:66016013-66016085 (−)
GACCCAGTGGCCTAATGGATAAGG

CATCAGCCTCCGGAGCTGGGGATT

32
Arg_CCT_chr17:73030001-73030073 (+)
GCCCCAGTGGCCTAATGGATAAGG

CACTGGCCTCCTAAGCCAGGGATTG

33
Arg_CCT_chr17:73030526-73030598 (−)
GCCCCAGTGGCCTAATGGATAAGG

CACTGGCCTCCTAAGCCAGGGATTG

34
Arg_CCT_chr16:3202901-3202973 (+)
GCCCCGGTGGCCTAATGGATAAGG

CATTGGCCTCCTAAGCCAGGGATTG

35
Arg_CCT_chr7:139025446-139025518 (+)
GCCCCAGTGGCCTAATGGATAAGG

CATTGGCCTCCTAAGCCAGGGATTG

36
Arg_CCT_chr16:3243918-3243990 (+)
GCCCCAGTGGCCTGATGGATAAGG

TACTGGCCTCCTAAGCCAGGGATTG

37
Arg_TCG_chr15:89878304-89878376 (+)
GGCCGCGTGGCCTAATGGATAAGG

CGTCTGACTTCGGATCAGAAGATTG

38
Arg_TCG_chr6:26323046-26323118 (+)
GACCACGTGGCCTAATGGATAAGG

CGTCTGACTTCGGATCAGAAGATTG

39
Arg_TCG_chr17:73031208-73031280 (+)
GACCGCGTGGCCTAATGGATAAGG

CGTCTGACTTCGGATCAGAAGATTG

40
Arg_TCG_chr6:26299905-26299977 (+)
GACCACGTGGCCTAATGGATAAGG

CGTCTGACTTCGGATCAGAAGATTG

41
Arg_TCG_chr6:28510891-28510963 (−)
GACCACGTGGCCTAATGGATAAGG

CGTCTGACTTCGGATCAGAAGATTG

42
Arg_TCG_chr9:112960803-112960875 (+)
GGCCGTGTGGCCTAATGGATAAGG

CGTCTGACTTCGGATCAAAAGATTG

43
Arg_TCT_chr1:94313129-94313213 (+)
GGCTCCGTGGCGCAATGGATAGCG

CATTGGACTTCTAGAGGCTGAAGG

44
Arg_TCT_chr17:8024243-8024330 (+)
GGCTCTGTGGCGCAATGGATAGCG

CATTGGACTTCTAGTGACGAATAGA

45
Arg_TCT_chr9:131102355-131102445 (−)
GGCTCTGTGGCGCAATGGATAGCG

CATTGGACTTCTAGCTGAGCCTAGT

46
Arg_TCT_chr11:59318767-59318852 (+)
GGCTCTGTGGCGCAATGGATAGCG

CATTGGACTTCTAGATAGTTAGAGA

47
Arg_TCT_chr1:159111401-159111474 (−)
GTCTCTGTGGCGCAATGGACGAGC

GCGCTGGACTTCTAATCCAGAGGTT

48
Arg_TCT_chr6:27529963-27530049 (+)
GGCTCTGTGGCGCAATGGATAGCG

CATTGGACTTCTAGCCTAAATCAAG

49
Asn_GTT_chr1:161510031-161510104 (+)
GTCTCTGTGGCGCAATCGGTTAGCG

CGTTCGGCTGTTAACCGAAAGGTTG

50
Asn_GTT_chr1:143879832-143879905 (−)
GTCTCTGTGGCGCAATCGGCTAGCG

CGTTTGGCTGTTAACTAAAAGGTTG

51
Asn_GTT_chr1:144301611-144301684 (+)
GTCTCTGTGGTGCAATCGGTTAGCG

CGTTCCGCTGTTAACCGAAAGCTTG

52
Asn_GTT_chr1:149326272-149326345 (−)
GTCTCTGTGGCGCAATCGGCTAGCG

CGTTTGGCTGTTAACTAAAAAGTTG

53
Asn_GTT_chr1:148248115-148248188 (+)
GTCTCTGTGGCGCAATCGGTTAGCG

CGTTCGGCTGTTAACCGAAAGGTTG

54
Asn_GTT_chr1:148598314-148598387 (−)
GTCTCTGTGGCGCAATCGGTTAGCG

CATTCGGCTGTTAACCGAAAGGTTG

55
Asn_GTT_chr1:17216172-17216245 (+)
GTCTCTGTGGCGCAATCGGTTAGCG

CGTTCGGCTGTTAACCGAAAGATTG

56
Asn_GTT_chr1:16847080-16847153 (−)
GTCTCTGTGGCGCAATCGGTTAGCG

CGTTCGGCTGTTAACTGAAAGGTTG

57
Asn_GTT_chr1:149230570-149230643 (−)
GTCTCTGTGGCGCAATGGGTTAGCG

CGTTCGGCTGTTAACCGAAAGGTTG

58
Asn_GTT_chr1:148000805-148000878 (+)
GTCTCTGTGGCGTAGTCGGTTAGCG

CGTTCGGCTGTTAACCGAAAAGTTG

59
Asn_GTT_chr1:149711798-149711871 (−)
GTCTCTGTGGCGCAATCGGCTAGCG

CGTTTGGCTGTTAACTAAAAGGTTG

60
Asn_GTT_chr1:145979034-145979107 (−)
GTCTCTGTGGCGCAATCGGTTAGCG

CGTTCGGCTGTTAACTGAAAGGTTA

61
Asp_GTC_chr12:98897281-98897352 (+)
TCCTCGTTAGTATAGTGGTTAGTAT

CCCCGCCTGTCACGCGGGAGACCG

62
Asp_GTC_chr1:161410615-161410686 (−)
TCCTCGTTAGTATAGTGGTGAGTAT

CCCCGCCTGTCACGCGGGAGACCG

63
Asp_GTC_chr6:27551236-27551307 (−)
TCCTCGTTAGTATAGTGGTGAGTGT

CCCCGTCTGTCACGCGGGAGACCG

64
Cys_GCA_chr7:149007281-149007352 (+)
GGGGGCATAGCTCAGTGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

65
Cys_GCA_chr7:149074601-149074672 (−)
GGGGGTATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

66
Cys_GCA_chr7:149112229-149112300 (−)
GGGGGTATAGCTTAGCGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

67
Cys_GCA_chr7:149344046-149344117 (−)
GGGGGTATAGCTTAGGGGTAGAGC

ATTTGACTGCAGATCAAAAGGTCCC

68
Cys_GCA_chr7:149052766-149052837 (−)
GGGGGTATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

69
Cys_GCA_chr17:37017937-37018008 (−)
GGGGGTATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAAGTCCC

70
Cys_GCA_chr7:149281816-149281887 (+)
GGGGGTATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCTC

71
Cys_GCA_chr7:149243631-149243702 (+)
GGGGGTATAGCTCAGGGGTAGAGC

ACTTGACTGCAGATCAAGAAGTCCT

72
Cys_GCA_chr7:149388272-149388343 (−)
GGGGATATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

73
Cys_GCA_chr7:149072850-149072921 (−)
GGGGGTATAGTTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

74
Cys_GCA_chr7:149310156-149310227 (−)
GGGGGTATAGCTCAGGGGTAGAGC

ATTTGACTGCAAATCAAGAGGTCCC

75
Cys_GCA_chr4:124430005-124430076 (−)
GGGGGTATAGCTCAGTGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

76
Cys_GCA_chr7:149295046-149295117 (+)
GGGCGTATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

77
Cys_GCA_chr7:149361915-149361986 (+)
GGGGGTATAGCTCACAGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

78
Cys_GCA_chr7:149253802-149253871 (+)
GGGCGTATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

79
Cys_GCA_chr7:149292305-149292376 (−)
GGGGGTATAGCTCACAGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

80
Cys_GCA_chr7:149286164-149286235 (−)
GGGGGTATAGCTCAGGGGTAGAGC

ACTTGACTGCAGATCAAGAGGTCC

81
Cys_GCA_chr17:37025545-37025616 (−)
GGGGGTATAGCTCAGTGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

82
Cys_GCA_chr15:80036997-80037069 (+)
GGGGGTATAGCTCAGTGGGTAGAG

CATTTGACTGCAGATCAAGAGGTCC

83
Cys_GCA_chr3:131947944-131948015 (−)
GGGGGTGTAGCTCAGTGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

84
Cys_GCA_chr1:93981834-93981906 (−)
GGGGGTATAGCTCAGGTGGTAGAG

CATTTGACTGCAGATCAAGAGGTCC

85
Cys_GCA_chr14:73429679-73429750 (+)
GGGGGTATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

86
Cys_GCA_chr3:131950642-131950713 (−)
GGGGGTATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

87
Gln_CTG_chr6:18836402-18836473 (+)
GGTTCCATGGTGTAATGGTTAGCAC

TCTGGACTCTGAATCCAGCGATCCG

88
Gln_CTG_chr6:27515531-27515602 (−)
GGTTCCATGGTGTAATGGTTAGCAC

TCTGGACTCTGAATCCAGCGATCCG

89
Gln_CTG_chr1:145963304-145963375 (+)
GGTTCCATGGTGTAATGGTGAGCAC

TCTGGACTCTGAATCCAGCGATCCG

90
Gln_CTG_chr1:147737382-147737453 (−)
GGTTCCATGGTGTAATGGTAAGCAC

TCTGGACTCTGAATCCAGCGATCCG

91
Gln_CTG_chr6:27263212-27263283 (+)
GGTTCCATGGTGTAATGGTTAGCAC

TCTGGACTCTGAATCCGGTAATCCG

92
Gln_CTG_chr6:27759135-27759206 (−)
GGCCCCATGGTGTAATGGTCAGCA

CTCTGGACTCTGAATCCAGCGATCC

93
Gln_CTG_chr1:147800937-147801008 (+)
GGTTCCATGGTGTAATGGTAAGCAC

TCTGGACTCTGAATCCAGCCATCTG

94
Gln_TTG_chr17:47269890-47269961 (+)
GGTCCCATGGTGTAATGGTTAGCAC

TCTGGACTTTGAATCCAGCGATCCG

95
Gln_TTG_chr6:28557156-28557227 (+)
GGTCCCATGGTGTAATGGTTAGCAC

TCTGGACTTTGAATCCAGCAATCCG

96
Gln_TTG_chr6:26311424-26311495 (−)
GGCCCCATGGTGTAATGGTTAGCAC

TCTGGACTTTGAATCCAGCGATCCG

97
Gln_TTG_chr6:145503859-145503930 (+)
GGTCCCATGGTGTAATGGTTAGCAC

TCTGGGCTTTGAATCCAGCAATCCG

98
Glu_CTC_chr1:145399233-145399304 (−)
TCCCTGGTGGTCTAGTGGTTAGGAT

TCGGCGCTCTCACCGCCGCGGCCCG

99
Glu_CTC_chr1:249168447-249168518 (+)
TCCCTGGTGGTCTAGTGGTTAGGAT

TCGGCGCTCTCACCGCCGCGGCCCG

100
Glu_TTC_chr2:131094701-131094772 (−)
TCCCATATGGTCTAGCGGTTAGGAT

TCCTGGTTTTCACCCAGGTGGCCCG

101
Glu_TTC_chr13:45492062-45492133 (−)
TCCCACATGGTCTAGCGGTTAGGAT

TCCTGGTTTTCACCCAGGCGGCCCG

102
Glu_TTC_chr1:17199078-17199149 (+)
TCCCTGGTGGTCTAGTGGCTAGGAT

TCGGCGCTTTCACCGCCGCGGCCCG

103
Glu_TTC_chr1:16861774-16861845 (−)
TCCCTGGTGGTCTAGTGGCTAGGAT

TCGGCGCTTTCACCGCCGCGGCCCG

104
Gly_CCC_chr1:16872434-16872504 (−)
GCATTGGTGGTTCAGTGGTAGAATT

CTCGCCTCCCACGCGGGAGACCCG

105
Gly_CCC_chr2:70476123-70476193 (−)
GCGCCGCTGGTGTAGTGGTATCATG

CAAGATTCCCATTCTTGCGACCCGG

106
Gly_CCC_chr17:19764175-19764245 (+)
GCATTGGTGGTTCAATGGTAGAATT

CTCGCCTCCCACGCAGGAGACCCA

107
Gly_GCC_chr1:161413094-161413164 (+)
GCATGGGTGGTTCAGTGGTAGAATT

CTCGCCTGCCACGCGGGAGGCCCG

108
Gly_GCC_chr1:161493637-161493707 (−)
GCATTGGTGGTTCAGTGGTAGAATT

CTCGCCTGCCACGCGGGAGGCCCG

109
Gly_GCC_chr16:70812114-70812184 (−)
GCATTGGTGGTTCAGTGGTAGAATT

CTCGCCTGCCACGCGGGAGGCCCG

110
Gly_GCC_chr1:161450356-161450426 (+)
GCATAGGTGGTTCAGTGGTAGAATT

CTTGCCTGCCACGCAGGAGGCCCA

111
Gly_GCC_chr16:70822597-70822667 (+)
GCATTGGTGGTTCAGTGGTAGAATT

CTCGCCTGCCATGCGGGCGGCCGG

112
Gly_TCC_chr19:4724082-4724153 (+)
GCGTTGGTGGTATAGTGGTTAGCAT

AGCTGCCTTCCAAGCAGTTGACCCG

113
Gly_TCC_chr1:145397864-145397935 (−)
GCGTTGGTGGTATAGTGGTGAGCAT

AGCTGCCTTCCAAGCAGTTGACCCG

114
Gly_TCC_chr17:8124866-8124937 (+)
GCGTTGGTGGTATAGTGGTAAGCAT

AGCTGCCTTCCAAGCAGTTGACCCG

115
Gly_TCC_chr1:161409961-161410032 (−)
GCGTTGGTGGTATAGTGGTGAGCAT

AGTTGCCTTCCAAGCAGTTGACCCG

116
His_GTG_chr1:145396881-145396952 (−)
GCCGTGATCGTATAGTGGTTAGTAC

TCTGCGTTGTGGCCGCAGCAACCTC

117
His_GTG_chr1:149155828-149155899 (−)
GCCATGATCGTATAGTGGTTAGTAC

TCTGCGCTGTGGCCGCAGCAACCTC

118
Ile_AAT_chr6:58149254-58149327 (+)
GGCCGGTTAGCTCAGTTGGTTAGAG

CGTGGCGCTAATAACGCCAAGGTC

119
Ile_AAT_chr6:27655967-27656040 (+)
GGCCGGTTAGCTCAGTTGGTTAGAG

CGTGGTGCTAATAACGCCAAGGTC

120
Ile_AAT_chr6:27242990-27243063 (−)
GGCTGGTTAGCTCAGTTGGTTAGAG

CGTGGTGCTAATAACGCCAAGGTC

121
Ile_AAT_chr17:8130309-8130382 (−)
GGCCGGTTAGCTCAGTTGGTTAGAG

CGTGGTGCTAATAACGCCAAGGTC

122
Ile_AAT_chr6:26554350-26554423 (+)
GGCCGGTTAGCTCAGTTGGTTAGAG

CGTGGTGCTAATAACGCCAAGGTC

123
Ile_AAT_chr6:26745255-26745328 (−)
GGCCGGTTAGCTCAGTTGGTTAGAG

CGTGGTGCTAATAACGCTAAGGTC

124
Ile_AAT_chr6:26721221-26721294 (−)
GGCCGGTTAGCTCAGTTGGTCAGA

GCGTGGTGCTAATAACGCCAAGGT

125
Ile_AAT_chr6:27636362-27636435 (+)
GGCCGGTTAGCTCAGTCGGCTAGA

GCGTGGTGCTAATAACGCCAAGGT

126
Ile_AAT_chr6:27241739-27241812 (+)
GGCTGGTTAGTTCAGTTGGTTAGAG

CGTGGTGCTAATAACGCCAAGGTC

127
Ile_GAT_chrX:3756418-3756491 (−)
GGCCGGTTAGCTCAGTTGGTAAGA

GCGTGGTGCTGATAACACCAAGGT

128
Ile_TAT_chr19:39902808-39902900 (−)
GCTCCAGTGGCGCAATCGGTTAGC

GCGCGGTACTTATATGACAGTGCG

129
Ile_TAT_chr2:43037676-43037768 (+)
GCTCCAGTGGCGCAATCGGTTAGC

GCGCGGTACTTATACAGCAGTACAT

130
Ile_TAT_chr6:26988125-26988218 (+)
GCTCCAGTGGCGCAATCGGTTAGC

GCGCGGTACTTATATGGCAGTATGT

131
Ile_TAT_chr6:27599200-27599293 (+)
GCTCCAGTGGCGCAATCGGTTAGC

GCGCGGTACTTATACAACAGTATAT

132
Ile_TAT_chr6:28505367-28505460 (+)
GCTCCAGTGGCGCAATCGGTTAGC

GCGCGGTACTTATAAGACAGTGCA

133
Leu_AAG_chr5:180524474-180524555 (−)
GGTAGCGTGGCCGAGCGGTCTAAG

GCGCTGGATTAAGGCTCCAGTCTCT

134
Leu_AAG_chr5:180614701-180614782 (+)
GGTAGCGTGGCCGAGCGGTCTAAG

GCGCTGGATTAAGGCTCCAGTCTCT

135
Leu_AAG_chr6:28956779-28956860 (+)
GGTAGCGTGGCCGAGCGGTCTAAG

GCGCTGGATTAAGGCTCCAGTCTCT

136
Leu_AAG_chr6:28446400-28446481 (−)
GGTAGCGTGGCCGAGTGGTCTAAG

ACGCTGGATTAAGGCTCCAGTCTCT

137
Leu_CAA_chr6:28864000-28864105 (−)
GTCAGGATGGCCGAGTGGTCTAAG

GCGCCAGACTCAAGCTAAGCTTCCT

138
Leu_CAA_chr6:28908830-28908934 (+)
GTCAGGATGGCCGAGTGGTCTAAG

GCGCCAGACTCAAGCTTGGCTTCCT

139
Leu_CAA_chr6:27573417-27573524 (−)
GTCAGGATGGCCGAGTGGTCTAAG

GCGCCAGACTCAAGCTTACTGCTTC

140
Leu_CAA_chr6:27570348-27570454 (−)
GTCAGGATGGCCGAGTGGTCTAAG

GCGCCAGACTCAAGTTGCTACTTCC

141
Leu_CAA_chr1:249168054-249168159 (+)
GTCAGGATGGCCGAGTGGTCTAAG

GCGCCAGACTCAAGGTAAGCACCT

142
Leu_CAA_chr11:9296790-9296863 (+)
GCCTCCTTAGTGCAGTAGGTAGCGC

ATCAGTCTCAAAATCTGAATGGTCC

143
Leu_CAA_chr1:161581736-161581819 (−)
GTCAGGATGGCCGAGCAGTCTTAA

GGCGCTGCGTTCAAATCGCACCCTC

144
Leu_CAG_chr1:161411323-161411405 (+)
GTCAGGATGGCCGAGCGGTCTAAG

GCGCTGCGTTCAGGTCGCAGTCTCC

145
Leu_CAG_chr16:57333863-57333945 (+)
GTCAGGATGGCCGAGCGGTCTAAG

GCGCTGCGTTCAGGTCGCAGTCTCC

146
Leu_TAA_chr6:144537684-144537766 (+)
ACCAGGATGGCCGAGTGGTTAAGG

CGTTGGACTTAAGATCCAATGGAC

147
Leu_TAA_chr6:27688898-27688980 (−)
ACCGGGATGGCCGAGTGGTTAAGG

CGTTGGACTTAAGATCCAATGGGCT

148
Leu_TAA_chr11:59319228-59319310 (+)
ACCAGAATGGCCGAGTGGTTAAGG

CGTTGGACTTAAGATCCAATGGATT

149
Leu_TAA_chr6:27198334-27198416 (−)
ACCGGGATGGCTGAGTGGTTAAGG

CGTTGGACTTAAGATCCAATGGAC

150
Leu_TAG_chr17:8023632-8023713 (−)
GGTAGCGTGGCCGAGCGGTCTAAG

GCGCTGGATTTAGGCTCCAGTCTCT

151
Leu_TAG_chr14:21093529-21093610 (+)
GGTAGTGTGGCCGAGCGGTCTAAG

GCGCTGGATTTAGGCTCCAGTCTCT

152
Leu_TAG_chr16:22207032-22207113 (−)
GGTAGCGTGGCCGAGTGGTCTAAG

GCGCTGGATTTAGGCTCCAGTCATT

153
Lys_CTT_chr14:58706613-58706685 (−)
GCCCGGCTAGCTCAGTCGGTAGAG

CATGGGACTCTTAATCCCAGGGTCG

154
Lys_CTT_chr19:36066750-36066822 (+)
GCCCAGCTAGCTCAGTCGGTAGAG

CATAAGACTCTTAATCTCAGGGTTG

155
Lys_CTT_chr19:52425393-52425466 (−)
GCAGCTAGCTCAGTCGGTAGAGCA

TGAGACTCTTAATCTCAGGGTCATG

156
Lys_CTT_chr1:145395522-145395594 (−)
GCCCGGCTAGCTCAGTCGGTAGAG

CATGAGACTCTTAATCTCAGGGTCG

157
Lys_CTT_chr16:3207406-3207478 (−)
GCCCGGCTAGCTCAGTCGGTAGAG

CATGAGACCCTTAATCTCAGGGTCG

158
Lys_CTT_chr16:3241501-3241573 (+)
GCCCGGCTAGCTCAGTCGGTAGAG

CATGGGACTCTTAATCTCAGGGTCG

159
Lys_CTT_chr16:3230555-3230627 (−)
GCCCGGCTAGCTCAGTCGATAGAG

CATGAGACTCTTAATCTCAGGGTCG

160
Lys_CTT_chr1:55423542-55423614 (−)
GCCCAGCTAGCTCAGTCGGTAGAG

CATGAGACTCTTAATCTCAGGGTCA

161
Lys_CTT_chr16:3214939-3215011 (+)
GCCTGGCTAGCTCAGTCGGCAAAG

CATGAGACTCTTAATCTCAGGGTCG

162
Lys_CTT_chr5:26198539-26198611 (−)
GCCCGACTACCTCAGTCGGTGGAG

CATGGGACTCTTCATCCCAGGGTTG

163
Lys_TTT_chr16:73512216-73512288 (−)
GCCTGGATAGCTCAGTTGGTAGAG

CATCAGACTTTTAATCTGAGGGTCC

164
Lys_TTT_chr12:27843306-27843378 (+)
ACCCAGATAGCTCAGTCAGTAGAG

CATCAGACTTTTAATCTGAGGGTCC

165
Lys_TTT_chr11:122430655-122430727 (+)
GCCTGGATAGCTCAGTTGGTAGAG

CATCAGACTTTTAATCTGAGGGTCC

166
Lys_TTT_chr1:204475655-204475727 (+)
GCCCGGATAGCTCAGTCGGTAGAG

CATCAGACTTTTAATCTGAGGGTCC

167
Lys_TTT_chr6:27559593-27559665 (−)
GCCTGGATAGCTCAGTCGGTAGAG

CATCAGACTTTTAATCTGAGGGTCC

168
Lys_TTT_chr11:59323902-59323974 (+)
GCCCGGATAGCTCAGTCGGTAGAG

CATCAGACTTTTAATCTGAGGGTCC

169
Lys_TTT_chr6:27302769-27302841 (−)
GCCTGGGTAGCTCAGTCGGTAGAG

CATCAGACTTTTAATCTGAGGGTCC

170
Lys_TTT_chr6:28715521-28715593 (+)
GCCTGGATAGCTCAGTTGGTAGAA

CATCAGACTTTTAATCTGACGGTGC

171
Met_CAT_chr8:124169470-124169542 (−)
GCCTCGTTAGCGCAGTAGGTAGCG

CGTCAGTCTCATAATCTGAAGGTCG

172
Met_CAT_chr16:71460396-71460468 (+)
GCCCTCTTAGCGCAGTGGGCAGCG

CGTCAGTCTCATAATCTGAAGGTCC

173
Met_CAT_chr6:28912352-28912424 (+)
GCCTCCTTAGCGCAGTAGGCAGCG

CGTCAGTCTCATAATCTGAAGGTCC

174
Met_CAT_chr6:26735574-26735646 (−)
GCCCTCTTAGCGCAGCGGGCAGCG

CGTCAGTCTCATAATCTGAAGGTCC

175
Met_CAT_chr6:26701712-26701784 (+)
GCCCTCTTAGCGCAGCTGGCAGCGC

GTCAGTCTCATAATCTGAAGGTCCT

176
Met_CAT_chr16:87417628-87417700 (−)
GCCTCGTTAGCGCAGTAGGCAGCG

CGTCAGTCTCATAATCTGAAGGTCG

177
Met_CAT_chr6:58168492-58168564 (−)
GCCCTCTTAGTGCAGCTGGCAGCGC

GTCAGTTTCATAATCTGAAAGTCCT

178
Phe_GAA_chr6:28758499-28758571 (−)
GCCGAAATAGCTCAGTTGGGAGAG

CGTTAGACTGAAGATCTAAAGGTC

179
Phe_GAA_chr11:59333853-59333925 (−)
GCCGAAATAGCTCAGTTGGGAGAG

CGTTAGACTGAAGATCTAAAGGTC

180
Phe_GAA_chr6:28775610-28775682 (−)
GCCGAGATAGCTCAGTTGGGAGAG

CGTTAGACTGAAGATCTAAAGGTC

181
Phe_GAA_chr6:28791093-28791166 (−)
GCCGAAATAGCTCAGTTGGGAGAG

CGTTAGACCGAAGATCTTAAAGGT

182
Phe_GAA_chr6:28731374-28731447 (−)
GCTGAAATAGCTCAGTTGGGAGAG

CGTTAGACTGAAGATCTTAAAGTTC

183
Pro_AGG_chr16:3241989-3242060 (+)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGCTTAGGATGCGAGAGGTCCC

184
Pro_AGG_chr1:167684725-167684796 (−)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGCTTAGGGTGCGAGAGGTCCC

185
Pro_CGG_chr1:167683962-167684033 (+)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGCTTCGGGTGCGAGAGGTCCCG

186
Pro_CGG_chr6:27059521-27059592 (+)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGCTTCGGGTGTGAGAGGTCCCG

187
Pro_TGG_chr14:21101165-21101236 (+)
GGCTCGTTGGTCTAGTGGTATGATT

CTCGCTTTGGGTGCGAGAGGTCCCG

188
Pro_TGG_chr11:75946869-75946940 (−)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGGTTTGGGTCCGAGAGGTCCCG

189
Pro_TGG_chr5:180615854-180615925 (−)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGCTTTGGGTGCGAGAGGTCCCG

190
SeC_TCA_chr19:45981859-45981945 (−)
GCCCGGATGATCCTCAGTGGTCTGG

GGTGCAGGCTTCAAACCTGTAGCTG

191
SeC_TCA_chr22:44546537-44546620 (+)
GCTCGGATGATCCTCAGTGGTCTGG

GGTGCAGGCTTCAAACCTGTAGCTG

192
Ser_AGA_chr6:27509554-27509635 (−)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTAGAAATCCATTGGGG

193
Ser_AGA_chr6:26327817-26327898 (+)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTAGAAATCCATTGGGG

194
Ser_AGA_chr6:27499987-27500068 (+)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTAGAAATCCATTGGGG

195
Ser_AGA_chr6:27521192-27521273 (−)
GTAGTCGTGGCCGAGTGGTTAAGG

TGATGGACTAGAAACCCATTGGGG

196
Ser_CGA_chr17:8042199-8042280 (−)
GCTGTGATGGCCGAGTGGTTAAGG

CGTTGGACTCGAAATCCAATGGGG

197
Ser_CGA_chr6:27177628-27177709 (+)
GCTGTGATGGCCGAGTGGTTAAGG

CGTTGGACTCGAAATCCAATGGGG

198
Ser_CGA_chr6:27640229-27640310 (−)
GCTGTGATGGCCGAGTGGTTAAGG

TGTTGGACTCGAAATCCAATGGGG

199
Ser_CGA_chr12:56584148-56584229 (+)
GTCACGGTGGCCGAGTGGTTAAGG

CGTTGGACTCGAAATCCAATGGGG

200
Ser_GCT_chr6:27065085-27065166 (+)
GACGAGGTGGCCGAGTGGTTAAGG

CGATGGACTGCTAATCCATTGTGCT

201
Ser_GCT_chr6:27265775-27265856 (+)
GACGAGGTGGCCGAGTGGTTAAGG

CGATGGACTGCTAATCCATTGTGCT

202
Ser_GCT_chr11:66115591-66115672 (+)
GACGAGGTGGCCGAGTGGTTAAGG

CGATGGACTGCTAATCCATTGTGCT

203
Ser_GCT_chr6:28565117-28565198 (−)
GACGAGGTGGCCGAGTGGTTAAGG

CGATGGACTGCTAATCCATTGTGCT

204
Ser_GCT_chr6:28180815-28180896 (+)
GACGAGGTGGCCGAGTGGTTAAGG

CGATGGACTGCTAATCCATTGTGCT

205
Ser_GCT_chr6:26305718-26305801 (−)
GGAGAGGCCTGGCCGAGTGGTTAA

GGCGATGGACTGCTAATCCATTGTG

206
Ser_TGA_chr10:69524261-69524342 (+)
GCAGCGATGGCCGAGTGGTTAAGG

CGTTGGACTTGAAATCCAATGGGGT

207
Ser_TGA_chr6:27513468-27513549 (+)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTTGAAATCCATTGGGGT

208
Ser_TGA_chr6:26312824-26312905 (−)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTTGAAATCCATTGGGGT

209
Ser_TGA_chr6:27473607-27473688 (−)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTTGAAATCCATTGGGGT

210
Thr_AGT_chr17:8090478-8090551 (+)
GGCGCCGTGGCTTAGTTGGTTAAAG

CGCCTGTCTAGTAAACAGGAGATC

211
Thr_AGT_chr6:26533145-26533218 (−)
GGCTCCGTGGCTTAGCTGGTTAAAG

CGCCTGTCTAGTAAACAGGAGATC

212
Thr_AGT_chr6:28693795-28693868 (+)
GGCTCCGTAGCTTAGTTGGTTAAAG

CGCCTGTCTAGTAAACAGGAGATC

213
Thr_AGT_chr6:27694473-27694546 (+)
GGCTTCGTGGCTTAGCTGGTTAAAG

CGCCTGTCTAGTAAACAGGAGATC

214
Thr_AGT_chr17:8042770-8042843 (−)
GGCGCCGTGGCTTAGCTGGTTAAA

GCGCCTGTCTAGTAAACAGGAGAT

215
Thr_AGT_chr6:27130050-27130123 (+)
GGCCCTGTGGCTTAGCTGGTCAAAG

CGCCTGTCTAGTAAACAGGAGATC

216
Thr_CGT_chr6:28456770-28456843 (−)
GGCTCTATGGCTTAGTTGGTTAAAG

CGCCTGTCTCGTAAACAGGAGATCC

217
Thr_CGT_chr16:14379750-14379821 (+)
GGCGCGGTGGCCAAGTGGTAAGGC

GTCGGTCTCGTAAACCGAAGATCA

218
Thr_CGT_chr6:28615984-28616057 (−)
GGCTCTGTGGCTTAGTTGGCTAAAG

CGCCTGTCTCGTAAACAGGAGATCC

219
Thr_CGT_chr17:29877093-29877164 (+)
GGCGCGGTGGCCAAGTGGTAAGGC

GTCGGTCTCGTAAACCGAAGATCG

220
Thr_CGT_chr6:27586135-27586208 (+)
GGCCCTGTAGCTCAGCGGTTGGAG

CGCTGGTCTCGTAAACCTAGGGGTC

221
Thr_TGT_chr6:28442329-28442402 (−)
GGCTCTATGGCTTAGTTGGTTAAAG

CGCCTGTCTTGTAAACAGGAGATCC

222
Thr_TGT_chr1:222638347-222638419 (+)
GGCTCCATAGCTCAGTGGTTAGAGC

ACTGGTCTTGTAAACCAGGGGTCGC

223
Thr_TGT_chr14:21081949-21082021 (−)
GGCTCCATAGCTCAGGGGTTAGAG

CGCTGGTCTTGTAAACCAGGGGTCG

224
Thr_TGT_chr14:21099319-21099391 (−)
GGCTCCATAGCTCAGGGGTTAGAG

CACTGGTCTTGTAAACCAGGGGTCG

225
Thr_TGT_chr14:21149849-21149921 (+)
GGCCCTATAGCTCAGGGGTTAGAG

CACTGGTCTTGTAAACCAGGGGTCG

226
Thr_TGT_chr5:180618687-180618758 (−)
GGCTCCATAGCTCAGGGGTTAGAG

CACTGGTCTTGTAAACCAGGGTCGC

227
Trp_CCA_chr17:8124187-8124258 (−)
GGCCTCGTGGCGCAACGGTAGCGC

GTCTGACTCCAGATCAGAAGGTTGC

228
Trp_CCA_chr17:19411494-19411565 (+)
GACCTCGTGGCGCAATGGTAGCGC

GTCTGACTCCAGATCAGAAGGTTGC

229
Trp_CCA_chr6:26319330-26319401 (−)
GACCTCGTGGCGCAACGGTAGCGC

GTCTGACTCCAGATCAGAAGGTTGC

230
Trp_CCA_chr12:98898030-98898101 (+)
GACCTCGTGGCGCAACGGTAGCGC

GTCTGACTCCAGATCAGAAGGCTG

231
Trp_CCA_chr7:99067307-99067378 (+)
GACCTCGTGGCGCAACGGCAGCGC

GTCTGACTCCAGATCAGAAGGTTGC

232
Tyr_ATA_chr2:219110549-219110641 (+)
CCTTCAATAGTTCAGCTGGTAGAGC

AGAGGACTATAGCTACTTCCTCAGT

233
Tyr_GTA_chr6:26569086-26569176 (+)
CCTTCGATAGCTCAGTTGGTAGAGC

GGAGGACTGTAGTTGGCTGTGTCCT

234
Tyr_GTA_chr2:27273650-27273738 (+)
CCTTCGATAGCTCAGTTGGTAGAGC

GGAGGACTGTAGTGGATAGGGCGT

235
Tyr_GTA_chr6:26577332-26577420 (+)
CCTTCGATAGCTCAGTTGGTAGAGC

GGAGGACTGTAGGCTCATTAAGCA

236
Tyr_GTA_chr14:21125623-21125716 (−)
CCTTCGATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGATTGTATAGACA

237
Tyr_GTA_chr8:67025602-67025694 (+)
CCTTCGATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGCTACTTCCTCAGC

238
Tyr_GTA_chr8:67026223-67026311 (+)
CCTTCGATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGGCGCGCGCCCGT

239
Tyr_GTA_chr14:21121258-21121351 (−)
CCTTCGATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGCCTGTAGAAACA

240
Tyr_GTA_chr14:21131351-21131444 (−)
CCTTCGATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGATTGTACAGACA

241
Tyr_GTA_chr14:21151432-21151520 (+)
CCTTCGATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGTACTTAATGTGTG

242
Tyr_GTA_chr6:26595102-26595190 (+)
CCTTCGATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGGGGTTTGAATGT

243
Tyr_GTA_chr14:21128117-21128210 (−)
CCTTCGATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGACTGCGGAAACG

244
Tyr_GTA_chr6:26575798-26575887 (+)
CTTTCGATAGCTCAGTTGGTAGAGC

GGAGGACTGTAGGTTCATTAAACT

245
Tyr_GTA_chr8:66609532-66609619 (−)
TCTTCAATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGGTGCACGCCCGT

246
Val_AAC_chr3:169490018-169490090 (+)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTAACACGCGAAAGGTCC

247
Val_AAC_chr5:180615416-180615488 (−)
GTTTCCGTAGTGTAGTGGTCATCAC

GTTCGCCTAACACGCGAAAGGTCC

248
Val_AAC_chr6:27618707-27618779 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTAACACGCGAAAGGTCC

249
Val_AAC_chr6:27648885-27648957 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTAACACGCGAAAGGTCC

250
Val_AAC_chr6:27203288-27203360 (+)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTTGCCTAACACGCGAAAGGTCCC

251
Val_AAC_chr6:28703206-28703277 (−)
GGGGGTGTAGCTCAGTGGTAGAGC

GTATGCTTAACATTCATGAGGCTCT

252
Val_CAC_chr1:161369490-161369562 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTCACACGCGAAAGGTCCC

253
Val_CAC_chr6:27248049-27248121 (−)
GCTTCTGTAGTGTAGTGGTTATCAC

GTTCGCCTCACACGCGAAAGGTCCC

254
Val_CAC_chr19:4724647-4724719 (−)
GTTTCCGTAGTGTAGCGGTTATCAC

ATTCGCCTCACACGCGAAAGGTCCC

255
Val_CAC_chr1:149298555-149298627 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTCACACGCGAAAGGTCCC

256
Val_CAC_chr1:149684088-149684161 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTCACACGCGTAAAGGTCC

257
Val_CAC_chr6:27173867-27173939 (−)
GTTTCCGTAGTGGAGTGGTTATCAC

GTTCGCCTCACACGCGAAAGGTCCC

258
Val_TAC_chr11:59318102-59318174 (−)
GGTTCCATAGTGTAGTGGTTATCAC

GTCTGCTTTACACGCAGAAGGTCCT

259
Val_TAC_chr11:59318460-59318532 (−)
GGTTCCATAGTGTAGCGGTTATCAC

GTCTGCTTTACACGCAGAAGGTCCT

260
Val_TAC_chr10:5895674-5895746 (−)
GGTTCCATAGTGTAGTGGTTATCAC

ATCTGCTTTACACGCAGAAGGTCCT

261
Val_TAC_chr6:27258405-27258477 (+)
GTTTCCGTGGTGTAGTGGTTATCAC

ATTCGCCTTACACGCGAAAGGTCCT

262
iMet_CAT_chr1:153643726-153643797 (+)
AGCAGAGTGGCGCAGCGGAAGCGT

GCTGGGCCCATAACCCAGAGGTCG

263
iMet_CAT_chr6:27745664-27745735 (+)
AGCAGAGTGGCGCAGCGGAAGCGT

GCTGGGCCCATAACCCAGAGGTCG

264
Glu_TTC_chr1:16861773-16861845 (−)
TCCCTGGTGGTCTAGTGGCTAGGAT

TCGGCGCTTTCACCGCCGCGGCCCG

265
Gly_CCC_chr1:17004765-17004836 (−)
GCGTTGGTGGTTTAGTGGTAGAATT

CTCGCCTCCCATGCGGGAGACCCG

266
Gly_CCC_chr1:17053779-17053850 (+)
GGCCTTGGTGGTGCAGTGGTAGAA

TTCTCGCCTCCCACGTGGGAGACCC

267
Glu_TTC_chr1:17199077-17199149 (+)
GTCCCTGGTGGTCTAGTGGCTAGGA

TTCGGCGCTTTCACCGCCGCGGCCC

268
Asn_GTT_chr1:17216171-17216245 (+)
TGTCTCTGTGGCGCAATCGGTTAGC

GCGTTCGGCTGTTAACCGAAAGATT

269
Arg_TCT_chr1:94313128-94313213 (+)
TGGCTCCGTGGCGCAATGGATAGC

GCATTGGACTTCTAGAGGCTGAAG

270
Lys_CTT_chr1:145395521-145395594 (−)
GCCCGGCTAGCTCAGTCGGTAGAG

CATGAGACTCTTAATCTCAGGGTCG

271
His_GTG_chr1:145396880-145396952 (−)
GCCGTGATCGTATAGTGGTTAGTAC

TCTGCGTTGTGGCCGCAGCAACCTC

272
Gly_TCC_chr1:145397863-145397935 (−)
GCGTTGGTGGTATAGTGGTGAGCAT

AGCTGCCTTCCAAGCAGTTGACCCG

273
Glu_CTC_chr1:145399232-145399304 (−)
TCCCTGGTGGTCTAGTGGTTAGGAT

TCGGCGCTCTCACCGCCGCGGCCCG

274
Gln_CTG_chr1:145963303-145963375 (+)
AGGTTCCATGGTGTAATGGTGAGC

ACTCTGGACTCTGAATCCAGCGATC

275
Asn_GTT_chr1:148000804-148000878 (+)
TGTCTCTGTGGCGTAGTCGGTTAGC

GCGTTCGGCTGTTAACCGAAAAGTT

276
Asn_GTT_chr1:148248114-148248188 (+)
TGTCTCTGTGGCGCAATCGGTTAGC

GCGTTCGGCTGTTAACCGAAAGGTT

277
Asn_GTT_chr1:148598313-148598387 (−)
GTCTCTGTGGCGCAATCGGTTAGCG

CATTCGGCTGTTAACCGAAAGGTTG

278
Asn_GTT_chr1:149230569-149230643 (−)
GTCTCTGTGGCGCAATGGGTTAGCG

CGTTCGGCTGTTAACCGAAAGGTTG

279
Val_CAC_chr1:149294665-149294736 (−)
GCACTGGTGGTTCAGTGGTAGAATT

CTCGCCTCACACGCGGGACACCCG

280
Val_CAC_chr1:149298554-149298627 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTCACACGCGAAAGGTCCC

281
Gly_CCC_chr1:149680209-149680280 (−)
GCACTGGTGGTTCAGTGGTAGAATT

CTCGCCTCCCACGCGGGAGACCCG

282
Val_CAC_chr1:149684087-149684161 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTCACACGCGTAAAGGTCC

283
Met_CAT_chr1:153643725-153643797 (+)
TAGCAGAGTGGCGCAGCGGAAGCG

TGCTGGGCCCATAACCCAGAGGTC

284
Val_CAC_chr1:161369489-161369562 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTCACACGCGAAAGGTCCC

285
Asp_GTC_chr1:161410614-161410686 (−)
TCCTCGTTAGTATAGTGGTGAGTAT

CCCCGCCTGTCACGCGGGAGACCG

286
Gly_GCC_chr1:161413093-161413164 (+)
TGCATGGGTGGTTCAGTGGTAGAAT

TCTCGCCTGCCACGCGGGAGGCCC

287
Glu_CTC_chr1:161417017-161417089 (−)
TCCCTGGTGGTCTAGTGGTTAGGAT

TCGGCGCTCTCACCGCCGCGGCCCG

288
Asp_GTC_chr1:161492934-161493006 (+)
ATCCTTGTTACTATAGTGGTGAGTA

TCTCTGCCTGTCATGCGTGAGAGAG

289
Gly_GCC_chr1:161493636-161493707 (−)
GCATTGGTGGTTCAGTGGTAGAATT

CTCGCCTGCCACGCGGGAGGCCCG

290
Leu_CAG_chr1:161500131-161500214 (−)
GTCAGGATGGCCGAGCGGTCTAAG

GCGCTGCGTTCAGGTCGCAGTCTCC

291
Gly_TCC_chr1:161500902-161500974 (+)
CGCGTTGGTGGTATAGTGGTGAGC

ATAGCTGCCTTCCAAGCAGTTGACC

292
Asn_GTT_chr1:161510030-161510104 (+)
CGTCTCTGTGGCGCAATCGGTTAGC

GCGTTCGGCTGTTAACCGAAAGGTT

293
Glu_TTC_chr1:161582507-161582579 (+)
CGCGTTGGTGGTGTAGTGGTGAGC

ACAGCTGCCTTTCAAGCAGTTAACG

294
Pro_CGG_chr1:167683961-167684033 (+)
CGGCTCGTTGGTCTAGGGGTATGAT

TCTCGCTTCGGGTGCGAGAGGTCCC

295
Pro_AGG_chr1:167684724-167684796 (−)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGCTTAGGGTGCGAGAGGTCCC

296
Lys_TTT_chr1:204475654-204475727 (+)
CGCCCGGATAGCTCAGTCGGTAGA

GCATCAGACTTTTAATCTGAGGGTC

297
Lys_TTT_chr1:204476157-204476230 (−)
GCCCGGATAGCTCAGTCGGTAGAG

CATCAGACTTTTAATCTGAGGGTCC

298
Leu_CAA_chr1:249168053-249168159 (+)
TGTCAGGATGGCCGAGTGGTCTAA

GGCGCCAGACTCAAGGTAAGCACC

299
Glu_CTC_chr1:249168446-249168518 (+)
TTCCCTGGTGGTCTAGTGGTTAGGA

TTCGGCGCTCTCACCGCCGCGGCCC

300
Tyr_GTA_chr2:27273649-27273738 (+)
GCCTTCGATAGCTCAGTTGGTAGAG

CGGAGGACTGTAGTGGATAGGGCG

301
Ala_AGC_chr2:27274081-27274154 (+)
CGGGGGATTAGCTCAAATGGTAGA

GCGCTCGCTTAGCATGCGAGAGGT

302
Ile_TAT_chr2:43037675-43037768 (+)
AGCTCCAGTGGCGCAATCGGTTAG

CGCGCGGTACTTATACAGCAGTAC

303
Gly_CCC_chr2:70476122-70476193 (−)
GCGCCGCTGGTGTAGTGGTATCATG

CAAGATTCCCATTCTTGCGACCCGG

304
Glu_TTC_chr2:131094700-131094772 (−)
TCCCATATGGTCTAGCGGTTAGGAT

TCCTGGTTTTCACCCAGGTGGCCCG

305
Ala_CGC_chr2:157257280-157257352 (+)
GGGGGATGTAGCTCAGTGGTAGAG

CGCGCGCTTCGCATGTGTGAGGTCC

306
Gly_GCC_chr2:157257658-157257729 (−)
GCATTGGTGGTTCAGTGGTAGAATT

CTCGCCTGCCACGCGGGAGGCCCG

307
Arg_ACG_chr3:45730490-45730563 (−)
GGGCCAGTGGCGCAATGGATAACG

CGTCTGACTACGGATCAGAAGATTC

308
Val_AAC_chr3:169490017-169490090 (+)
GGTTTCCGTAGTGTAGTGGTTATCA

CGTTCGCCTAACACGCGAAAGGTC

309
Val_AAC_chr5:180596609-180596682 (+)
AGTTTCCGTAGTGTAGTGGTTATCA

CGTTCGCCTAACACGCGAAAGGTC

310
Leu_AAG_chr5:180614700-180614782 (+)
AGGTAGCGTGGCCGAGCGGTCTAA

GGCGCTGGATTAAGGCTCCAGTCTC

311
Val_AAC_chr5:180615415-180615488 (−)
GTTTCCGTAGTGTAGTGGTCATCAC

GTTCGCCTAACACGCGAAAGGTCC

312
Pro_TGG_chr5:180615853-180615925 (−)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGCTTTGGGTGCGAGAGGTCCCG

313
Thr_TGT_chr5:180618686-180618758 (−)
GGCTCCATAGCTCAGGGGTTAGAG

CACTGGTCTTGTAAACCAGGGTCGC

314
Ala_TGC_chr5:180633867-180633939 (+)
TGGGGATGTAGCTCAGTGGTAGAG

CGCATGCTTTGCATGTATGAGGCCC

315
Lys_CTT_chr5:180634754-180634827 (+)
CGCCCGGCTAGCTCAGTCGGTAGA

GCATGAGACTCTTAATCTCAGGGTC

316
Val_AAC_chr5:180645269-180645342 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTAACACGCGAAAGGTCC

317
Lys_CTT_chr5:180648978-180649051 (−)
GCCCGGCTAGCTCAGTCGGTAGAG

CATGAGACTCTTAATCTCAGGGTCG

318
Val_CAC_chr5:180649394-180649467 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTCACACGCGAAAGGTCCC

319
Met_CAT_chr6:26286753-26286825 (+)
CAGCAGAGTGGCGCAGCGGAAGCG

TGCTGGGCCCATAACCCAGAGGTC

320
Ser_GCT_chr6:26305717-26305801 (−)
GGAGAGGCCTGGCCGAGTGGTTAA

GGCGATGGACTGCTAATCCATTGTG

321
Gln_TTG_chr6:26311423-26311495 (−)
GGCCCCATGGTGTAATGGTTAGCAC

TCTGGACTTTGAATCCAGCGATCCG

322
Gln_TTG_chr6:26311974-26312046 (−)
GGCCCCATGGTGTAATGGTTAGCAC

TCTGGACTTTGAATCCAGCGATCCG

323
Ser_TGA_chr6:26312823-26312905 (−)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTTGAAATCCATTGGGGT

324
Met_CAT_chr6:26313351-26313423 (−)
AGCAGAGTGGCGCAGCGGAAGCGT

GCTGGGCCCATAACCCAGAGGTCG

325
Arg_TCG_chr6:26323045-26323118 (+)
GGACCACGTGGCCTAATGGATAAG

GCGTCTGACTTCGGATCAGAAGATT

326
Ser_AGA_chr6:26327816-26327898 (+)
TGTAGTCGTGGCCGAGTGGTTAAG

GCGATGGACTAGAAATCCATTGGG

327
Met_CAT_chr6:26330528-26330600 (−)
AGCAGAGTGGCGCAGCGGAAGCGT

GCTGGGCCCATAACCCAGAGGTCG

328
Leu_CAG_chr6:26521435-26521518 (+)
CGTCAGGATGGCCGAGCGGTCTAA

GGCGCTGCGTTCAGGTCGCAGTCTC

329
Thr_AGT_chr6:26533144-26533218 (−)
GGCTCCGTGGCTTAGCTGGTTAAAG

CGCCTGTCTAGTAAACAGGAGATC

330
Arg_ACG_chr6:26537725-26537798 (+)
AGGGCCAGTGGCGCAATGGATAAC

GCGTCTGACTACGGATCAGAAGAT

331
Val_CAC_chr6:26538281-26538354 (+)
GGTTTCCGTAGTGTAGTGGTTATCA

CGTTCGCCTCACACGCGAAAGGTCC

332
Ala_CGC_chr6:26553730-26553802 (+)
AGGGGATGTAGCTCAGTGGTAGAG

CGCATGCTTCGCATGTATGAGGTCC

333
Ile_AAT_chr6:26554349-26554423 (+)
TGGCCGGTTAGCTCAGTTGGTTAGA

GCGTGGTGCTAATAACGCCAAGGT

334
Pro_AGG_chr6:26555497-26555569 (+)
CGGCTCGTTGGTCTAGGGGTATGAT

TCTCGCTTAGGGTGCGAGAGGTCCC

335
Lys_CTT_chr6:26556773-26556846 (+)
AGCCCGGCTAGCTCAGTCGGTAGA

GCATGAGACTCTTAATCTCAGGGTC

336
Tyr_GTA_chr6:26569085-26569176 (+)
TCCTTCGATAGCTCAGTTGGTAGAG

CGGAGGACTGTAGTTGGCTGTGTCC

337
Ala_AGC_chr6:26572091-26572164 (−)
GGGGAATTAGCTCAAATGGTAGAG

CGCTCGCTTAGCATGCGAGAGGTA

338
Met_CAT_chr6:26766443-26766516 (+)
CGCCCTCTTAGCGCAGCGGGCAGC

GCGTCAGTCTCATAATCTGAAGGTC

339
Ile_TAT_chr6:26988124-26988218 (+)
TGCTCCAGTGGCGCAATCGGTTAGC

GCGCGGTACTTATATGGCAGTATGT

340
His_GTG_chr6:27125905-27125977 (+)
TGCCGTGATCGTATAGTGGTTAGTA

CTCTGCGTTGTGGCCGCAGCAACCT

341
Ile_AAT_chr6:27144993-27145067 (−)
GGCCGGTTAGCTCAGTTGGTTAGAG

CGTGGTGCTAATAACGCCAAGGTC

342
Val_AAC_chr6:27203287-27203360 (+)
AGTTTCCGTAGTGTAGTGGTTATCA

CGTTTGCCTAACACGCGAAAGGTCC

343
Val_CAC_chr6:27248048-27248121 (−)
GCTTCTGTAGTGTAGTGGTTATCAC

GTTCGCCTCACACGCGAAAGGTCCC

344
Asp_GTC_chr6:27447452-27447524 (+)
TTCCTCGTTAGTATAGTGGTGAGTA

TCCCCGCCTGTCACGCGGGAGACC

345
Ser_TGA_chr6:27473606-27473688 (−)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTTGAAATCCATTGGGGT

346
Gln_CTG_chr6:27487307-27487379 (+)
AGGTTCCATGGTGTAATGGTTAGCA

CTCTGGACTCTGAATCCAGCGATCC

347
Asp_GTC_chr6:27551235-27551307 (−)
TCCTCGTTAGTATAGTGGTGAGTGT

CCCCGTCTGTCACGCGGGAGACCG

348
Val_AAC_chr6:27618706-27618779 (−)
GTTTCCGTAGTGTAGTGGTTATCAC

GTTCGCCTAACACGCGAAAGGTCC

349
Ile_AAT_chr6:27655966-27656040 (+)
CGGCCGGTTAGCTCAGTTGGTTAGA

GCGTGGTGCTAATAACGCCAAGGT

350
Gln_CTG_chr6:27759134-27759206 (−)
GGCCCCATGGTGTAATGGTCAGCA

CTCTGGACTCTGAATCCAGCGATCC

351
Gln_TTG_chr6:27763639-27763711 (−)
GGCCCCATGGTGTAATGGTTAGCAC

TCTGGACTTTGAATCCAGCGATCCG

352
Ala_AGC_chr6:28574932-28575004 (+)
TGGGGGTGTAGCTCAGTGGTAGAG

CGCGTGCTTAGCATGTACGAGGTCC

353
Ala_AGC_chr6:28626013-28626085 (−)
GGGGATGTAGCTCAGTGGTAGAGC

GCATGCTTAGCATGCATGAGGTCCC

354
Ala_CGC_chr6:28697091-28697163 (+)
AGGGGGTGTAGCTCAGTGGTAGAG

CGCGTGCTTCGCATGTACGAGGCCC

355
Ala_AGC_chr6:28806220-28806292 (−)
GGGGGTGTAGCTCAGTGGTAGAGC

GCGTGCTTAGCATGCACGAGGCCC

356
Ala_AGC_chr6:28831461-28831533 (−)
GGGGGTGTAGCTCAGTGGTAGAGC

GCGTGCTTAGCATGCACGAGGCCC

357
Leu_CAA_chr6:28863999-28864105 (−)
GTCAGGATGGCCGAGTGGTCTAAG

GCGCCAGACTCAAGCTAAGCTTCCT

358
Leu_CAA_chr6:28908829-28908934 (+)
TGTCAGGATGGCCGAGTGGTCTAA

GGCGCCAGACTCAAGCTTGGCTTCC

359
Gln_CTG_chr6:28909377-28909449 (−)
GGTTCCATGGTGTAATGGTTAGCAC

TCTGGACTCTGAATCCAGCGATCCG

360
Leu_AAG_chr6:28911398-28911480 (−)
GGTAGCGTGGCCGAGCGGTCTAAG

GCGCTGGATTAAGGCTCCAGTCTCT

361
Met_CAT_chr6:28912351-28912424 (+)
TGCCTCCTTAGCGCAGTAGGCAGCG

CGTCAGTCTCATAATCTGAAGGTCC

362
Lys_TTT_chr6:28918805-28918878 (+)
AGCCCGGATAGCTCAGTCGGTAGA

GCATCAGACTTTTAATCTGAGGGTC

363
Met_CAT_chr6:28921041-28921114 (−)
GCCTCCTTAGCGCAGTAGGCAGCG

CGTCAGTCTCATAATCTGAAGGTCC

364
Glu_CTC_chr6:28949975-28950047 (+)
TTCCCTGGTGGTCTAGTGGTTAGGA

TTCGGCGCTCTCACCGCCGCGGCCC

365
Leu_TAA_chr6:144537683-144537766 (+)
CACCAGGATGGCCGAGTGGTTAAG

GCGTTGGACTTAAGATCCAATGGA

366
Pro_AGG_chr7:128423503-128423575 (+)
TGGCTCGTTGGTCTAGGGGTATGAT

TCTCGCTTAGGGTGCGAGAGGTCCC

367
Arg_CCT_chr7:139025445-139025518 (+)
AGCCCCAGTGGCCTAATGGATAAG

GCATTGGCCTCCTAAGCCAGGGATT

368
Cys_GCA_chr7:149388271-149388343 (−)
GGGGATATAGCTCAGGGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

369
Tyr_GTA_chr8:67025601-67025694 (+)
CCCTTCGATAGCTCAGCTGGTAGAG

CGGAGGACTGTAGCTACTTCCTCAG

370
Tyr_GTA_chr8:67026222-67026311 (+)
CCCTTCGATAGCTCAGCTGGTAGAG

CGGAGGACTGTAGGCGCGCGCCCG

371
Ala_AGC_chr8:67026423-67026496 (+)
TGGGGGATTAGCTCAAATGGTAGA

GCGCTCGCTTAGCATGCGAGAGGT

372
Ser_AGA_chr8:96281884-96281966 (−)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTAGAAATCCATTGGGG

373
Met_CAT_chr8:124169469-124169542 (−)
GCCTCGTTAGCGCAGTAGGTAGCG

CGTCAGTCTCATAATCTGAAGGTCG

374
Arg_TCT_chr9:131102354-131102445 (−)
GGCTCTGTGGCGCAATGGATAGCG

CATTGGACTTCTAGCTGAGCCTAGT

375
Asn_GTT_chr10:22518437-22518511 (−)
GTCTCTGTGGCGCAATCGGTTAGCG

CGTTCGGCTGTTAACCGAAAGGTTG

376
Ser_TGA_chr10:69524260-69524342 (+)
GGCAGCGATGGCCGAGTGGTTAAG

GCGTTGGACTTGAAATCCAATGGG

377
Val_TAC_chr11:59318101-59318174 (−)
GGTTCCATAGTGTAGTGGTTATCAC

GTCTGCTTTACACGCAGAAGGTCCT

378
Val_TAC_chr11:59318459-59318532 (−)
GGTTCCATAGTGTAGCGGTTATCAC

GTCTGCTTTACACGCAGAAGGTCCT

379
Arg_TCT_chr11:59318766-59318852 (+)
TGGCTCTGTGGCGCAATGGATAGC

GCATTGGACTTCTAGATAGTTAGAG

380
Leu_TAA_chr11:59319227-59319310 (+)
TACCAGAATGGCCGAGTGGTTAAG

GCGTTGGACTTAAGATCCAATGGAT

381
Lys_TTT_chr11:59323901-59323974 (+)
GGCCCGGATAGCTCAGTCGGTAGA

GCATCAGACTTTTAATCTGAGGGTC

382
Phe_GAA_chr11:59324969-59325042 (−)
GCCGAAATAGCTCAGTTGGGAGAG

CGTTAGACTGAAGATCTAAAGGTC

383
Lys_TTT_chr11:59327807-59327880 (−)
GCCCGGATAGCTCAGTCGGTAGAG

CATCAGACTTTTAATCTGAGGGTCC

384
Phe_GAA_chr11:59333852-59333925 (−)
GCCGAAATAGCTCAGTTGGGAGAG

CGTTAGACTGAAGATCTAAAGGTC

385
Ser_GCT_chr11:66115590-66115672 (+)
GGACGAGGTGGCCGAGTGGTTAAG

GCGATGGACTGCTAATCCATTGTGC

386
Pro_TGG_chr11:75946868-75946940 (−)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGGTTTGGGTCCGAGAGGTCCCG

387
Ser_CGA_chr12:56584147-56584229 (+)
AGTCACGGTGGCCGAGTGGTTAAG

GCGTTGGACTCGAAATCCAATGGG

388
Asp_GTC_chr12:98897280-98897352 (+)
CTCCTCGTTAGTATAGTGGTTAGTA

TCCCCGCCTGTCACGCGGGAGACC

389
Trp_CCA_chr12:98898029-98898101 (+)
GGACCTCGTGGCGCAACGGTAGCG

CGTCTGACTCCAGATCAGAAGGCT

390
Ala_TGC_chr12:125406300-125406372 (−)
GGGGATGTAGCTCAGTGGTAGAGC

GCATGCTTTGCATGTATGAGGCCCC

391
Phe_GAA_chr12:125412388-125412461 (−)
GCCGAAATAGCTCAGTTGGGAGAG

CGTTAGACTGAAGATCTAAAGGTC

392
Ala_TGC_chr12:125424511-125424583 (+)
AGGGGATGTAGCTCAGTGGTAGAG

CGCATGCTTTGCACGTATGAGGCCC

393
Asn_GTT_chr13:31248100-31248174 (−)
GTCTCTGTGGCGCAATCGGTTAGCG

CGTTCGGCTGTTAACCGAAAGGTTG

394
Glu_TTC_chr13:45492061-45492133 (−)
TCCCACATGGTCTAGCGGTTAGGAT

TCCTGGTTTTCACCCAGGCGGCCCG

395
Thr_TGT_chr14:21081948-21082021 (−)
GGCTCCATAGCTCAGGGGTTAGAG

CGCTGGTCTTGTAAACCAGGGGTCG

396
Leu_TAG_chr14:21093528-21093610 (+)
TGGTAGTGTGGCCGAGCGGTCTAA

GGCGCTGGATTTAGGCTCCAGTCTC

397
Thr_TGT_chr14:21099318-21099391 (−)
GGCTCCATAGCTCAGGGGTTAGAG

CACTGGTCTTGTAAACCAGGGGTCG

398
Pro_TGG_chr14:21101164-21101236 (+)
TGGCTCGTTGGTCTAGTGGTATGAT

TCTCGCTTTGGGTGCGAGAGGTCCC

399
Tyr_GTA_chr14:21131350-21131444 (−)
CCTTCGATAGCTCAGCTGGTAGAGC

GGAGGACTGTAGATTGTACAGACA

400
Thr_TGT_chr14:21149848-21149921 (+)
AGGCCCTATAGCTCAGGGGTTAGA

GCACTGGTCTTGTAAACCAGGGGTC

401
Tyr_GTA_chr14:21151431-21151520 (+)
TCCTTCGATAGCTCAGCTGGTAGAG

CGGAGGACTGTAGTACTTAATGTGT

402
Pro_TGG_chr14:21152174-21152246 (+)
TGGCTCGTTGGTCTAGGGGTATGAT

TCTCGCTTTGGGTGCGAGAGGTCCC

403
Lys_CTT_chr14:58706612-58706685 (−)
GCCCGGCTAGCTCAGTCGGTAGAG

CATGGGACTCTTAATCCCAGGGTCG

404
Ile_AAT_chr14:102783428-102783502 (+)
CGGCCGGTTAGCTCAGTTGGTTAGA

GCGTGGTGCTAATAACGCCAAGGT

405
Glu_TTC_chr15:26327380-26327452 (−)
TCCCACATGGTCTAGCGGTTAGGAT

TCCTGGTTTTCACCCAGGCGGCCCG

406
Ser_GCT_chr15:40886022-40886104 (−)
GACGAGGTGGCCGAGTGGTTAAGG

CGATGGACTGCTAATCCATTGTGCT

407
His_GTG_chr15:45490803-45490875 (−)
GCCGTGATCGTATAGTGGTTAGTAC

TCTGCGTTGTGGCCGCAGCAACCTC

408
His_GTG_chr15:45493348-45493420 (+)
CGCCGTGATCGTATAGTGGTTAGTA

CTCTGCGTTGTGGCCGCAGCAACCT

409
Gln_CTG_chr15:66161399-66161471 (−)
GGTTCCATGGTGTAATGGTTAGCAC

TCTGGACTCTGAATCCAGCGATCCG

410
Lys_CTT_chr15:79152903-79152976 (+)
TGCCCGGCTAGCTCAGTCGGTAGA

GCATGGGACTCTTAATCCCAGGGTC

411
Arg_TCG_chr15:89878303-89878376 (+)
GGGCCGCGTGGCCTAATGGATAAG

GCGTCTGACTTCGGATCAGAAGATT

412
Gly_CCC_chr16:686735-686806 (−)
GCGCCGCTGGTGTAGTGGTATCATG

CAAGATTCCCATTCTTGCGACCCGG

413
Arg_CCG_chr16:3200674-3200747 (+)
GGGCCGCGTGGCCTAATGGATAAG

GCGTCTGATTCCGGATCAGAAGATT

414
Arg_CCT_chr16:3202900-3202973 (+)
CGCCCCGGTGGCCTAATGGATAAG

GCATTGGCCTCCTAAGCCAGGGATT

415
Lys_CTT_chr16:3207405-3207478 (−)
GCCCGGCTAGCTCAGTCGGTAGAG

CATGAGACCCTTAATCTCAGGGTCG

416
Thr_CGT_chr16:14379749-14379821 (+)
AGGCGCGGTGGCCAAGTGGTAAGG

CGTCGGTCTCGTAAACCGAAGATC

417
Leu_TAG_chr16:22207031-22207113 (−)
GGTAGCGTGGCCGAGTGGTCTAAG

GCGCTGGATTTAGGCTCCAGTCATT

418
Leu_AAG_chr16:22308460-22308542 (+)
GGGTAGCGTGGCCGAGCGGTCTAA

GGCGCTGGATTAAGGCTCCAGTCTC

419
Leu_CAG_chr16:57333862-57333945 (+)
AGTCAGGATGGCCGAGCGGTCTAA

GGCGCTGCGTTCAGGTCGCAGTCTC

420
Leu_CAG_chr16:57334391-57334474 (−)
GTCAGGATGGCCGAGCGGTCTAAG

GCGCTGCGTTCAGGTCGCAGTCTCC

421
Met_CAT_chr16:87417627-87417700 (−)
GCCTCGTTAGCGCAGTAGGCAGCG

CGTCAGTCTCATAATCTGAAGGTCG

422
Leu_TAG_chr17:8023631-8023713 (−)
GGTAGCGTGGCCGAGCGGTCTAAG

GCGCTGGATTTAGGCTCCAGTCTCT

423
Arg_TCT_chr17:8024242-8024330 (+)
TGGCTCTGTGGCGCAATGGATAGC

GCATTGGACTTCTAGTGACGAATAG

424
Gly_GCC_chr17:8029063-8029134 (+)
CGCATTGGTGGTTCAGTGGTAGAAT

TCTCGCCTGCCACGCGGGAGGCCC

425
Ser_CGA_chr17:8042198-8042280 (−)
GCTGTGATGGCCGAGTGGTTAAGG

CGTTGGACTCGAAATCCAATGGGG

426
Thr_AGT_chr17:8042769-8042843 (−)
GGCGCCGTGGCTTAGCTGGTTAAA

GCGCCTGTCTAGTAAACAGGAGAT

427
Trp_CCA_chr17:8089675-8089747 (+)
CGACCTCGTGGCGCAACGGTAGCG

CGTCTGACTCCAGATCAGAAGGTTG

428
Ser_GCT_chr17:8090183-8090265 (+)
AGACGAGGTGGCCGAGTGGTTAAG

GCGATGGACTGCTAATCCATTGTGC

429
Thr_AGT_chr17:8090477-8090551 (+)
CGGCGCCGTGGCTTAGTTGGTTAAA

GCGCCTGTCTAGTAAACAGGAGAT

430
Trp_CCA_chr17:8124186-8124258 (−)
GGCCTCGTGGCGCAACGGTAGCGC

GTCTGACTCCAGATCAGAAGGTTGC

431
Gly_TCC_chr17:8124865-8124937 (+)
AGCGTTGGTGGTATAGTGGTAAGC

ATAGCTGCCTTCCAAGCAGTTGACC

432
Asp_GTC_chr17:8125555-8125627 (−)
TCCTCGTTAGTATAGTGGTGAGTAT

CCCCGCCTGTCACGCGGGAGACCG

433
Pro_CGG_chr17:8126150-8126222 (−)
GGCTCGTTGGTCTAGGGGTATGATT

CTCGCTTCGGGTGCGAGAGGTCCCG

434
Thr_AGT_chr17:8129552-8129626 (−)
GGCGCCGTGGCTTAGTTGGTTAAAG

CGCCTGTCTAGTAAACAGGAGATC

435
Ser_AGA_chr17:8129927-8130009 (−)
GTAGTCGTGGCCGAGTGGTTAAGG

CGATGGACTAGAAATCCATTGGGG

436
Trp_CCA_chr17:19411493-19411565 (+)
TGACCTCGTGGCGCAATGGTAGCG

CGTCTGACTCCAGATCAGAAGGTTG

437
Thr_CGT_chr17:29877092-29877164 (+)
AGGCGCGGTGGCCAAGTGGTAAGG

CGTCGGTCTCGTAAACCGAAGATC

438
Cys_GCA_chr17:37023897-37023969 (+)
AGGGGGTATAGCTCAGTGGTAGAG

CATTTGACTGCAGATCAAGAGGTCC

439
Cys_GCA_chr17:37025544-37025616 (−)
GGGGGTATAGCTCAGTGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

440
Cys_GCA_chr17:37309986-37310058 (−)
GGGGGTATAGCTCAGTGGTAGAGC

ATTTGACTGCAGATCAAGAGGTCCC

441
Gln_TTG_chr17:47269889-47269961 (+)
AGGTCCCATGGTGTAATGGTTAGCA

CTCTGGACTTTGAATCCAGCGATCC

442
Arg_CCG_chr17:66016012-66016085 (−)
GACCCAGTGGCCTAATGGATAAGG

CATCAGCCTCCGGAGCTGGGGATT

443
Arg_CCT_chr17:73030000-73030073 (+)
AGCCCCAGTGGCCTAATGGATAAG

GCACTGGCCTCCTAAGCCAGGGATT

444
Arg_CCT_chr17:73030525-73030598 (−)
GCCCCAGTGGCCTAATGGATAAGG

CACTGGCCTCCTAAGCCAGGGATTG

445
Arg_TCG_chr17:73031207-73031280 (+)
AGACCGCGTGGCCTAATGGATAAG

GCGTCTGACTTCGGATCAGAAGATT

446
Asn_GTT_chr19:1383561-1383635 (+)
CGTCTCTGTGGCGCAATCGGTTAGC

GCGTTCGGCTGTTAACCGAAAGGTT

447
Gly_TCC_chr19:4724081-4724153 (+)
GGCGTTGGTGGTATAGTGGTTAGCA

TAGCTGCCTTCCAAGCAGTTGACCC

448
Val_CAC_chr19:4724646-4724719 (−)
GTTTCCGTAGTGTAGCGGTTATCAC

ATTCGCCTCACACGCGAAAGGTCCC

449
Thr_AGT_chr19:33667962-33668036 (+)
TGGCGCCGTGGCTTAGTTGGTTAAA

GCGCCTGTCTAGTAAACAGGAGAT

450
Ile_TAT_chr19:39902807-39902900 (−)
GCTCCAGTGGCGCAATCGGTTAGC

GCGCGGTACTTATATGACAGTGCG

451
Gly_GCC_chr21:18827106-18827177 (−)
GCATGGGTGGTTCAGTGGTAGAATT

CTCGCCTGCCACGCGGGAGGCCCG

Asialoglycoprotein Receptor Binding Moieties

The present disclosure features a TREM comprising an asialoglycoprotein receptor (ASGPR) binding moiety. The ASGPR is a C-type lectin primarily expressed on the sinusoidal surface of hepatocytes, and comprises a major (48 kDa, ASGPR-1) and a minor (40 kDa, ASGPR-2) subunit. The ASGPR is involved in the binding, internalization, and subsequent clearance of glycoproteins containing an N-terminal galactose (Gal) or N-terminal N-acetylgalactosamine (GalNAc) residues from circulation, such as antibodies. ASGPRs have also been shown to be involved in the clearance of low density lipoprotein, fibronection, and certain immune cells, and may be utilized by certain viruses for hepatocyte entry (see, e.g., Yang J., et al (2006) J Viral Hepat 13:158-165 and Guy, C S et al (2011) Nat Rev Immunol 8:874-887).

The ASGPR binding moiety as described herein may refer to structure comprising: (i) a ASGPR carbohydrate and (ii) an ASGPR linker (e.g., a linker connecting the carbohydrate to the TREM). The term “carbohydrate” as used herein refers to compound comprising one or more monosaccharide moieties comprising at least 3 carbon atoms (e.g., arranged in a linear, branched, or cyclic structure) and an oxygen, nitrogen, or sulfur atom, or a fragment or variant of a monosaccharide moiety comprising at least 3 carbon atoms (e.g., arranged in a linear, branched, or cyclic structure) and an oxygen, nitrogen, or sulfur atom. Each monosaccharide moiety or fragment or variant thereof may be a tetrose, pentose, hexose, or heptose. Each monosaccharide moiety or fragment or variant thereof may exist as an aldose, ketose, sugar alcohol, and, where appropriate, in the L or D form. Exemplary monosaccharide moieties may be amino sugars, N-acetylamino sugars, imino sugars, deoxysugars, or sugar acids. Carbohydrates may comprise individual monosaccharide moieties, or may further comprise a disaccharide, oligosaccharide (e.g., a trisaccharide, tetrasaccharide, pentasaccharide, hexasaccharide, heptasaccharide, octasaccharide), a polysaccharide, or combinations thereof. Exemplary carbohydrates include ribose, arabinose, lyxose, xylose, deoxyribose, ribulose, xylulose, glucose, galactose, mannose, gulose, idose, talose, allose, altrose, psicose, fructose, sorbose, tagatose, rhamnose, pneumose, quinovose, fucose, mannuheptulose, sedoheptulose, galactosamine, mannosamine, glucosamine, N-acetylglucosamine, N-acetylgalactosamine, N-acetylmannosamine, glucuronic acid, galacturonic acid, mannuronic acid, guluronic acid, iduronic acid, tagaturonic acid, frucuronic acid, galactosaminuronic acid, mannosaminuronic acid, glucosaminuronic acid, N-acetylglucosaminuronic acid, N-acetylgalactosaminuronic acid, N-acetylmannosaminuronic acid, maltose, lactose, sucrose, trehalose, gentiobiose, cellobiose, chitobiose, kojibiose, nigerose, sophorose, trehalulose, isomaltose, xylobiose, starch, cellulose, chitin, and dextran.

The carbohydrate may comprise one or more monosaccharide moieties linked by a glycosidic bond. In some embodiments, the glycosidic bond comprises a 1->2 glycosidic bond, a 1->3 glycosidic bond, a 1->4 glycosidic bond, or a 1->6 glycosidic bond. In some embodiments, each glycosidic bonds may be present in the alpha or beta configuration. In an embodiment, the one or more monosaccharide moieties are linked directly by a glycosidic bond or are separated by a linker.

In some embodiments, the ASGPR binding moiety comprises a galactose (Gal), galactosamine (GalNH₂), or an N-acetylgalactosamine (GalNAc) moiety, for example, a Gal, GalNH₂, or GalNAc, or an analog thereof. In an embodiment, the ASGPR binding moiety comprises a GalNAc moiety (e.g., GalNAc). In an embodiment, the ASGPR binding moiety comprises a plurality of GalNAc moieties (e.g., GalNAcs), e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or more GalNAc moieties (e.g., GalNAcs). In an embodiment, the ASGPR binding moiety comprises between 2 and 20 GalNAcs moieties (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 GalNAc moieties). In an embodiment, the ASGPR binding moiety comprises between 2 and 10 GalNAc moieties (e.g., 2, 3, 4, 5, 6, 7, 8, 9, or 10 GalNAc moieties). In an embodiment, the ASGPR binding moiety comprises between 2 and 5 GalNAc moieties (e.g., 2, 3, 4, or 5 GalNAc moieties). In an embodiment, the ASGPR binding moiety comprises 2 GalNAc moieties. In an embodiment, the ASGPR binding moiety comprises 3 GalNAc moieties. In an embodiment, the ASGPR binding moiety comprises 4 GalNAc moieties. In an embodiment, the ASGPR moieties comprises 5 GalNAc moieties.

In some embodiments, the GalNAc moiety comprises a structure of Formula (I):

embedded image

or a salt thereof, wherein each of X and Y is independently O, N(R⁷), or S; each of R¹, R³, R⁴, and R⁵are independently hydrogen, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, heterocyclyl, C(O)-alkyl, C(O)-alkenyl, C(O)-alkynyl, C(O)-heteroalkyl, C(O)-haloalkyl, C(O)-aryl, C(O)-heteroaryl, C(O)-cycloalkyl, or C(O)-heterocyclyl, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, and heterocyclyl is optionally substituted with one or more R⁸; or R³and R⁴are taken together with the oxygen atoms to which they are connected to form a heterocyclyl ring optionally substituted with one or more R⁸; R^2ais hydrogen or alkyl; R^2bis —C(O)alkyl (e.g., C(O)CH₃); each of R^6aand R^6bis hydrogen, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, halo, cyano, nitro, —OR^A, aryl, heteroaryl, cycloalkyl, or heterocyclyl, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, and heterocyclyl is optionally substituted with one or more R⁹; R⁷is hydrogen, alkyl, or C(O)-alkyl; each of R⁸and R⁹is independently hydrogen, halo, cyano, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, or heterocyclyl; R^Ais hydrogen, or alkyl, alkenyl, alkynyl, and n is an integer between 0 and 6, wherein the structure of Formula (I) may be connected to a linker or a nucleobase within the ASt of a TREM.

In some embodiments, X is O. In some embodiments, Y is O. In some embodiments, each of R¹, R³, R⁴, and R⁵are independently hydrogen or alkyl (e.g., CH₃). In some embodiments, R^2ais hydrogen. In some embodiments, R^2bis C(O)CH₃. In some embodiments, each of R^6aand R^6bis hydrogen. In some embodiments, n is 0, 1, 2, or 3. In some embodiments, n is 1, 2, or 3. In some embodiments, n is 1. In some embodiments, the GalNAc moiety is connected to a linker or TREM at R^2a. In some embodiments, the GalNAc moiety is connected to a linker or TREM at R^2b. In some embodiments, the GalNAc moiety is connected to a linker or TREM at R³. In some embodiments, the GalNAc moiety is connected to a linker or TREM at R⁴. In some embodiments, the GalNAc moiety is connected to a linker or TREM at R⁵. In some embodiments, the GalNAc moiety is connected to a linker or TREM at R^6aor R^6b. In some embodiments, the GalNAc moiety is connected to a linker or TREM at a plurality of positions, e.g., at least two of R¹, R^2a, R^2b, R³, R⁴, R⁵, R^6a, and R^6b.

In some embodiments, the GalNAc moiety is comprises a structure of Formula (I-a)

embedded image

or a salt thereof, wherein R^2ais hydrogen or alkyl; R^2bis —C(O)alkyl (e.g., C(O)CH₃); each of R³, R⁴, and R⁵are independently hydrogen, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, heterocyclyl, C(O)-alkyl, C(O)-alkenyl, C(O)-alkynyl, C(O)-heteroalkyl, C(O)-haloalkyl, C(O)-aryl, C(O)-heteroaryl, C(O)-cycloalkyl, or C(O)-heterocyclyl, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, and heterocyclyl is optionally substituted with one or more R⁸; or R³and R⁴are taken together with the oxygen atoms to which they are connected to form a heterocyclyl ring optionally substituted with one or more R⁸; and R⁸is hydrogen, halo, cyano, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, or heterocyclyl, wherein the custom-character represents a bond in any configuration, and represents an attachment point to a TREM, e.g., a linker, a nucleobase, internucleotide linkage, or terminus within the TREM sequence.

In some embodiments, each of R³, R⁴, and R⁵are independently hydrogen or alkyl (e.g., CH₃). In some embodiments, R^2ais hydrogen. In some embodiments, R^2bis C(O)CH₃.

In some embodiments, the GalNAc moiety comprises a structure of Formula (II):

embedded image

or a salt thereof, wherein X is O, N(R⁷), or S; each of W or Y is independently O or C(R^10a)(R^10b), wherein one of W and Y is O; each of R¹, R³, R⁴, and R⁵are independently hydrogen, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, heterocyclyl, C(O)-alkyl, C(O)-alkenyl, C(O)-alkynyl, C(O)-heteroalkyl, C(O)-haloalkyl, C(O)-aryl, C(O)-heteroaryl, C(O)-cycloalkyl, or C(O)-heterocyclyl, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, and heterocyclyl is optionally substituted with one or more R⁸; or R³and R⁴are taken together with the oxygen atoms to which they are connected to form a heterocyclyl ring optionally substituted with one or more R⁸; R^2ais hydrogen or alkyl; R^2bis —C(O)alkyl (e.g., C(O)CH₃); each of R^6aand R^6bis hydrogen, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, halo, cyano, nitro, —OR^A, aryl, heteroaryl, cycloalkyl, or heterocyclyl, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, and heterocyclyl is optionally substituted with one or more R⁹; R⁷is hydrogen, alkyl, or C(O)-alkyl; each of R⁸and R⁹is independently hydrogen, halo, cyano, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, or heterocyclyl; each of R^10aand R^10bis independently hydrogen, heteroalkyl, haloalkyl, or halo; and R^Ais hydrogen, or alkyl, alkenyl, alkynyl, wherein the structure of Formula (I) may be connected to a TREM, e.g., a linker, a nucleobase, internucleotide linkage, or terminus within the TREM sequence.

In some embodiments, the GalNAc moiety comprises a structure of Formula (II-a):

embedded image

or a salt thereof, wherein X is O, N(R⁷), or S; each of R¹, R³, R⁴, and R⁵are independently hydrogen, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, heterocyclyl, C(O)-alkyl, C(O)-alkenyl, C(O)-alkynyl, C(O)-heteroalkyl, C(O)-haloalkyl, C(O)-aryl, C(O)-heteroaryl, C(O)-cycloalkyl, or C(O)-heterocyclyl, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, and heterocyclyl is optionally substituted with one or more R⁸; or R³and R⁴are taken together with the oxygen atoms to which they are connected to form a heterocyclyl ring optionally substituted with one or more R⁸; R^2ais hydrogen or alkyl; R^2bis —C(O)alkyl (e.g., C(O)CH₃); each of R^6aand R^6bis hydrogen, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, halo, cyano, nitro, —OR^A, aryl, heteroaryl, cycloalkyl, or heterocyclyl, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, aryl, heteroaryl, cycloalkyl, and heterocyclyl is optionally substituted with one or more R⁹; R⁷is hydrogen, alkyl, or C(O)-alkyl; each of R⁸and R⁹is independently hydrogen, halo, cyano, alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, or heterocyclyl; and R^Ais hydrogen, or alkyl, alkenyl, alkynyl, wherein the structure of Formula (I) may be connected to a TREM, e.g., a linker, a nucleobase, internucleotide linkage, or terminus within the TREM sequence.

In some embodiments, the GalNAc moiety comprises a structure of Formula (II-b):

embedded image

In some embodiments, the ASGPR binding moiety comprises a structure of Formula (III):

embedded image

or a salt thereof, wherein each of R¹, R^2a, R^2b, R³, R⁴, R⁵, R^6a, and R^6band subvariables thereof are as defined for Formula (I), L is a linker, and n is an integer between 1 and 100, wherein custom-character represents an attachment point to a branching point, additional linker, or TREM, e.g., a linker, a nucleobase, internucleotide linkage, or terminus within the TREM sequence.

In some embodiments, X is O. In some embodiments, each of R¹, R³, R⁴, and R⁵are independently hydrogen or alkyl (e.g., CH₃). In some embodiments, R^2ais hydrogen. In some embodiments, R^2bis C(O)CH₃. In some embodiments, each of R^6aand R^6bis hydrogen. In some embodiments, n is an integer between 1 and 50. In some embodiments, n is an integer between 1 and 25. In some embodiments, n is an integer between 1 and 10. In some embodiments, n is an integer between 1 and 5. In some embodiments, n is 1, 2, 3, 4, or 5. In some embodiments, n is 1.

In an embodiment, L comprises an alkylene, alkenylene, alkynylene, heteroalkylene, or haloalkylene group. In an embodiment, L comprises an ester, amide, disulfide, ether, carbonate, aryl, heteroaryl, cycloalkyl, or heterocyclyl group. In an embodiment, L is cleavable or non-cleavable.

The term “linker” as used herein refers to an organic moiety that connects two or more parts of a compound, e.g., through a covalent bond. A linker may linear or branched. In some embodiments, a linker comprises a heteroatom, such as a nitrogen, sulfur, oxygen, phosphorus, silicon, or boron atom. In some embodiments, the linker comprises a cyclic group (e.g., an aryl, heteroaryl, cycloalkyl, or heterocyclyl group). In some embodiments, a linker comprises a functional group such as an amide, ketone, ester, ether, thioester, thioether, thiol, hydroxyl, amine, cyano, nitro, azide, triazole, pyrroline, p-nitrophenyl, alkene, or alkyne group. Any atom within a linker may be substituted or unsubstituted. In some embodiments, a linker comprises an arylalkyl, arylalkenyl, arylalkynyl, heteroarylalkyl, heteroarylalkenyl, heteroarylalkynyl, heterocyclylalkyl, heterocyclylalkenyl, heterocyclylalkynyl, aryl, heteroaryl, heterocyclyl, cycloalkyl, cycloalkenyl, alkylarylalkyl, alkylarylalkenyl, alkylarylalkynyl, alkenylarylalkyl, alkenylarylalkenyl, alkenylarylalkynyl, alkynylarylalkyl, alkynylarylalkenyl, alkynylarylalkynyl, alkylheteroarylalkyl, alkylheteroarylalkenyl, alkylheteroarylalkynyl, alkenylheteroarylalkyl, alkenylheteroarylalkenyl, alkenylheteroarylalkynyl, alkynylheteroarylalkyl, alkynylheteroarylalkenyl, alkynylheteroarylalkynyl, alkylheterocyclylalkyl, alkylheterocyclylalkenyl, alkylhererocyclylalkynyl, alkenylheterocyclylalkyl, alkenylheterocyclylalkenyl, alkenylheterocyclylalkynyl, alkynylheterocyclylalkyl, alkynylheterocyclylalkenyl, alkynylheterocyclylalkynyl, alkylaryl, alkenylaryl, alkynylaryl, alkylheteroaryl, alkenylheteroaryl, or alkynylhereroaryl group. In some embodiments, a linker comprises a polyethylene glycol group (e.g., PEG1, PEG2, PEG3, PEG4, PEG5, PEG6, PEG7, PEG8, PEG10, PEG12, PEG14, PEG16, PEG18, PEG20, PEG24, PEG28, PEG32, PEG100, PEG200, PEG250, PEG500, PEG600, PEG700, PEG750, PEG800, PEG900, PEG1000, PEG2000, or PEG3000). In some embodiments, L comprises a PEG1, PEG2, PEG3, PEG4, PEG5, or PEG6 group. In some embodiments, L comprises a plurality of PEG1, PEG2, PEG3, PEG4, PEG5, or PEG6 groups (e.g., 2, 3, 4, or 5 PEG1, PEG2, PEG3, PEG4, PEG5, or PEG6 groups). In some embodiments, L comprises a PEG2 group. In some embodiments, L comprises a plurality of PEG2 groups. In some embodiments, L comprises a PEG3 group. In some embodiments, L comprises a plurality of PEG3 groups. In some embodiments, L comprises a PEG4 group. In some embodiments, L comprises a plurality of PEG4 groups.

In some embodiments, the linker comprises between 1 and 1000 atoms (e.g., between 1 and 750 atoms, 1 and 500 atoms, 1 and 250 atoms, 1 and 100 atoms, 1 and 75 atoms, 1 and 50 atoms, 1 and 25 atoms, and 1 and 10 atoms). In some embodiments, the linker comprises between 1 and 100 atoms. In some embodiments, the linker comprises between 1 and 50 atoms. In some embodiments, the linker comprises between 1 and 25 atoms.

In some embodiments, the linker is linear and comprises between 1 and 1000 atoms (e.g., between 1 and 750 atoms, 1 and 500 atoms, 1 and 250 atoms, 1 and 100 atoms, 1 and 75 atoms, 1 and 50 atoms, 1 and 25 atoms, and 1 and 10 atoms). In some embodiments, the linker is linear and comprises between 1 and 100 atoms. In some embodiments, the linker is linear and comprises between 1 and 50 atoms. In some embodiments, the linker is linear and comprises between 1 and 25 atoms.

In some embodiments, the linker is branched, and each branch comprises between 1 and 1000 atoms (e.g., between 1 and 750 atoms, 1 and 500 atoms, 1 and 250 atoms, 1 and 100 atoms, 1 and 75 atoms, 1 and 50 atoms, 1 and 25 atoms, and 1 and 10 atoms). In some embodiments, the linker is branched, and each branch comprises between 1 and 100 atoms. In some embodiments, the linker is branched, and each branch comprises between 1 and 50 atoms. In some embodiments, the linker is branched, and each branch comprises between 1 and 25 atoms.

In some embodiments, the ASGPR binding moiety comprises a structure of Formula (III-a):

embedded image

or a salt thereof, wherein each of R¹, R^2a, R^2b, R³, R⁴, R⁵, R^6a, and R^6band subvariables thereof are as defined for Formula (I), each of L¹and L²is independently a linker, each of m and n is independently an integer between 1 and 100, and M is a linker, wherein “ custom-character ” represents an attachment point to a branching point, additional linker, or TREM, e.g., a linker, a nucleobase, internucleotide linkage, or terminus within the TREM sequence.

In some embodiments, X is O (e.g., X in each of A and B is O). In some embodiments, each of R¹, R³, R⁴, and R⁵are independently hydrogen or alkyl (e.g., CH₃) (e.g., R¹, R³, R⁴, and R⁵in each of A and B is independently hydrogen or alkyl). In some embodiments, R^2ais hydrogen (e.g., R^2ain each of A and B is hydrogen). In some embodiments, R^2bis C(O)CH₃(e.g., R^2bin each of A and B is C(O)CH₃). In some embodiments, each of R^6aand R^6bis hydrogen (e.g., R^6aand R^6bin each of A and B is hydrogen). In some embodiments, each of m and n is independently an integer between 1 and 50. In some embodiments, each of m and n is independently an integer between 1 and 25. In some embodiments, each of m and n is independently an integer between 1 and 10. In some embodiments, each of m and n is independently an integer between 1 and 5. In some embodiments, each of m and n is independently 1, 2, 3, 4, or 5. In some embodiments, each of m and n is independently 1.

In an embodiment, each of L¹and L²independently comprises an alkylene, alkenylene, alkynylene, heteroalkylene, or haloalkylene group. In an embodiment, each of L¹and L²independently comprises an ester, amide, disulfide, ether, carbonate, aryl, heteroaryl, cycloalkyl, or heterocyclyl group. In an embodiment, each of L¹and L²independently is cleavable or non-cleavable. In some embodiments, each of L¹and L²independently comprises a polyethylene glycol group (e.g., PEG1, PEG2, PEG3, PEG4, PEG5, PEG6, PEG7, PEG8, PEG10, PEG12, PEG14, PEG16, PEG18, PEG20, PEG24, PEG28, PEG32, PEG100, PEG200, PEG250, PEG500, PEG600, PEG700, PEG750, PEG800, PEG900, PEG1000, PEG2000, or PEG3000). In some embodiments, each of L¹and L²independently comprises a PEG1, PEG2, PEG3, PEG4, PEG5, or PEG6 group. In some embodiments, each of L¹and L²independently comprises a plurality of PEG1, PEG2, PEG3, PEG4, PEG5, or PEG6 groups (e.g., 2, 3, 4, or 5 PEG1, PEG2, PEG3, PEG4, PEG5, or PEG6 groups). In some embodiments, each of L¹and L²independently comprises a PEG2 group. In some embodiments, each of L¹and L²independently comprises a plurality of PEG2 groups. In some embodiments, each of L¹and L²independently comprises a PEG3 group. In some embodiments, each of L¹and L²independently comprises a plurality of PEG3 groups. In some embodiments, each of L¹and L²independently comprises a PEG4 group. In some embodiments, each of L¹and L²independently comprises a plurality of PEG4 groups.

In some embodiments, M comprises an alkylene, alkenylene, alkynylene, heteroalkylene, or haloalkylene group. In an embodiment, M comprises an ester, amide, disulfide, ether, carbonate, aryl, heteroaryl, cycloalkyl, or heterocyclyl group. In an embodiment, M is cleavable or non-cleavable.

In some embodiments, the ASGPR binding moiety comprises a structure of Formula (III-b):

embedded image

or a salt thereof, wherein each of R¹, R^2a, R^2b, R³, R⁴, R⁵, R^6a, and R^6band subvariables thereof are as defined for Formula (I), each of L¹, L², and L³is independently a linker, each of m, n, and o is independently an integer between 1 and 100, and M is a linker, wherein “ custom-character ” represents an attachment point to a branching point, additional linker, or TREM, e.g., a linker, a nucleobase, internucleotide linkage, or terminus within the TREM sequence.

In some embodiments, X is O (e.g., X in each of A, B, and C is O). In some embodiments, each of R¹, R³, R⁴, and R⁵are independently hydrogen or alkyl (e.g., CH₃) (e.g., R¹, R³, R⁴, and R⁵in each of A, B, and C is independently hydrogen or alkyl). In some embodiments, R^2ais hydrogen (e.g., R^2ain each of A, B, and C is hydrogen). In some embodiments, R^2bis C(O)CH₃(e.g., R^2bin each of A, B, and C is C(O)CH₃). In some embodiments, each of R^6aand R^6bis hydrogen (e.g., R^6aand R^6bin each of A, B, and C is hydrogen). In some embodiments, each of m, n, and o is independently an integer between 1 and 50. In some embodiments, each of m, n, and o is independently an integer between 1 and 25. In some embodiments, each of m, n, and o is independently an integer between 1 and 10. In some embodiments, each of m, n, and o is independently an integer between 1 and 5. In some embodiments, each of m, n, and o is independently 1, 2, 3, 4, or 5. In some embodiments, each of m, n, and o is independently 1.

In an embodiment, each of L¹, L², and L³independently comprises an alkylene, alkenylene, alkynylene, heteroalkylene, or haloalkylene group. In an embodiment, each of L¹, L², and L³independently comprises an ester, amide, disulfide, ether, carbonate, aryl, heteroaryl, cycloalkyl, or heterocyclyl group. In an embodiment, each of L¹, L², and L³independently is cleavable or non-cleavable. In an embodiment, each of L¹and L²independently is cleavable or non-cleavable. In some embodiments, each of L¹, L², and L³independently comprises a polyethylene glycol group (e.g., PEG1, PEG2, PEG3, PEG4, PEG5, PEG6, PEG7, PEG8, PEG10, PEG12, PEG14, PEG16, PEG18, PEG20, PEG24, PEG28, PEG32, PEG100, PEG200, PEG250, PEG500, PEG600, PEG700, PEG750, PEG800, PEG900, PEG1000, PEG2000, or PEG3000). In some embodiments, each of L¹, L², and L³independently comprises a PEG1, PEG2, PEG3, PEG4, PEG5, or PEG6 group. In some embodiments, each of L¹, L², and L³independently comprises a plurality of PEG1, PEG2, PEG3, PEG4, PEG5, or PEG6 groups (e.g., 2, 3, 4, or 5 PEG1, PEG2, PEG3, PEG4, PEG5, or PEG6 groups). In some embodiments, each of L¹, L², and L³independently comprises a PEG2 group. In some embodiments, each of L¹, L², and L³independently comprises a plurality of PEG2 groups. In some embodiments, each of L¹, L², and L³independently comprises a PEG3 group. In some embodiments, each of L¹, L², and L³independently comprises a plurality of PEG3 groups. In some embodiments, each of L¹, L², and L³independently comprises a PEG4 group. In some embodiments, each of L¹, L², and L³independently comprises a plurality of PEG4 groups.

In some embodiments, the ASGPR binding moiety comprises a structure of Formula (III-c):

embedded image

or a salt thereof, wherein each of R^2a, R^2b, R³, R⁴, R⁵, and subvariables thereof are as defined for Formula (I), each of L¹, L², and L³is custom-character independently a linker, and M is a linker, wherein represents an attachment point to a branching point, additional linker, or TREM, e.g., a linker, a nucleobase, internucleotide linkage, or terminus within the TREM sequence.

In some embodiments, each of R³, R⁴, and R⁵are independently hydrogen or alkyl (e.g., CH₃). In some embodiments, R^2ais hydrogen. In some embodiments, R^2bis C(O)CH₃.

In some embodiments, the ASGPR binding moiety comprises a compound selected from:

embedded image

In some embodiments, the ASGPR binding moiety is a compound (X-i). In some embodiments, the ASGPR binding moiety is compound (X-ii). In some embodiments, the ASGPR binding moiety is compound (X-iii). In some embodiments, the ASGPR binding moiety is compound (X-iv). In some embodiments, the ASGPR binding moiety is compound (X-v). In some embodiments, the ASGPR binding moiety is compound (X-vi). In some embodiments, the ASGPR binding moiety is compound (X-vii). In some embodiments, the ASGPR binding moiety is compound (X-viii). In some embodiments, the ASGPR binding moiety is compound (X-ix). In some embodiments, the ASGPR binding moiety is compound (X-x). In some embodiments, the ASGPR binding moiety is compound (X-xi). In some embodiments, the ASGPR binding moiety is compound (X-xii). In some embodiments, the ASGPR binding moiety is compound (X-xiii). In some embodiments, the ASGPR binding moiety is compound (X-xiv). In some embodiments, the ASGPR binding moiety is compound (X-xv). In some embodiments, the ASGPR binding moiety is compound (X-xvi). In some embodiments, the ASGPR binding moiety is compound (X-xvii). In some embodiments, the ASGPR binding moiety is compound (X-xviii). In some embodiments, the ASGPR binding moiety is compound (X-xix). In some embodiments, the ASGPR binding moiety is compound (X-xx). In some embodiments, the ASGPR binding moiety is compound (X-xxi). In some embodiments, the ASGPR binding moiety is compound (X-xxii). In some embodiments, the ASGPR binding moiety is compound (X-xxiii). In some embodiments, the ASGPR binding moiety is a compound selected from compound (X-i), (X-xxii), and (X-xxii).

In some embodiments, the ASGPR binding moiety comprises a linker comprising a cyclic moiety, such as a pyrroline ring. In an embodiment, the ASGPR binding moiety comprises a structure of Formula (CII):

embedded image

or a salt thereof, wherein E is absent or C(O), C(O)O, C(O)NH, C(S), C(S)NH, SO, SO₂, or SO₂NH; R¹¹, R¹², R¹³, R¹⁴, R₁₅, R¹⁶, R¹⁷, and R¹⁸are each independently for each occurrence H, —CH₂OR^a, or OR^b; R^aand R^bare each independently for each occurrence hydrogen, a hydroxyl protecting group, optionally substituted alkyl, optionally substituted aryl, optionally substituted cycloalkyl, optionally substituted aralkyl, optionally substituted alkenyl, optionally substituted heteroaryl, polyethyleneglycol (PEG), a phosphate, a diphosphate, a triphosphate, a phosphonate, a phosphonothioate, a phosphonodithioate, a phosphorothioate, a phosphorothiolate, a phosphorodithioate, a phosphorothiolothionate, a phosphodiester, a phosphotriester, an activated phosphate group, an activated phosphite group, a phosphoramidite, a solid support, P(Z¹)(Z²)—O-nucleoside, P(Z¹)(Z²)—O-oligonucleotide, P(Z¹)(O-linker-R^L)—O-nucleoside, or P(Z¹)(O-linker-R^L) O-oligonucleotide; R³¹is independently for each occurrence -linker-R^Lor R³¹; R^Lis hydrogen or a ligand; R³¹is C(O)CH(N(R³²)₂)(CH₂)_hN(R³²)₂; R³²is independently for each occurrence H, R^L, -linker-R^Lor R³¹; Z¹is independently for each occurrence O or S; Z²is independently for each occurrence O, S, N(alkyl) or optionally substituted alkyl; and h is independently for each occurrence 1-20.

In some embodiments, the compound of Formula (CII) is selected from:

embedded image

In some embodiments, the ASGPR binding moiety is a compound or substructure disclosed in U.S. Pat. No. 8,106,022, which is incorporated herein by reference in its entirety.

In some embodiments, the ASGPR binding moiety is a compound (CII-i). In some embodiments, the ASGPR binding moiety is a compound (CII-ii). In some embodiments, the ASGPR binding moiety is a compound (CII-iii). In some embodiments, the ASGPR binding moiety is a compound (CII-iv). In some embodiments, the ASGPR binding moiety is a compound (CII-v). In some embodiments, the ASGPR binding moiety is a compound (CII-vi).

In some embodiments, the ASGPR binding moiety is a compound of Formula (C-1), (C-2), (C-3) or (C4)

embedded image

or a pharmaceutically acceptable salt thereof, wherein: n is 1, 2, or 3; W is absent or a peptide; L is -(T-Q-T-Q)_m-, wherein each T is independently absent or is (C₁-C₁₀) alkylene, (C₂-C₁₀) alkenylene, or (C₂-C₁₀) alkynylene, wherein one or more carbon groups of said T may each independently be replaced with a heteroatom group independently selected from —O—, —S—, and —N(R⁴)— wherein the heteroatom groups are separated by at least 2 carbon atoms, and wherein alkylene, alkenylene, and alkynylene may each be independently substituted with one or more halo atoms; each Q is independently absent or is C(O), C(O)—NR⁴, NR⁴—C(O), O—C(O)—NR⁴, NR⁴—C(O)—O, —CH₂—, a heteroaryl, or a heteroatom group selected from O, S, S—S, S(O), S(O)₂, and NR⁴, wherein at least two carbon atoms separate the heteroatom groups O, S, S—S, S(O), S(O)₂and NR⁴from any other heteroatom group; each R⁴is independently —H, —(C₁-C₂₀)alkyl, or (C₃-C₅)cycloalkyl wherein one to six —CH₂— groups of the alkyl or cycloalkyl separated by at least two carbon atoms may be replaced with —O—, —S—, or —N(R⁴)—, and —CH₃— of the alkyl may each be independently replaced with a heteroatom group selected from —N(R⁴)₂, —OR⁴, and —S(R⁴) wherein the heteroatom groups are separated by at least 2 carbon atoms; and wherein the alkyl and cycloalkyl may be substituted with halo atoms; and m is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40.

In some embodiments, the ASGPR binding moiety is a compound (C-1). In some embodiments, the ASGPR binding moiety is a compound (C-2). In some embodiments, the ASGPR binding moiety is a compound (C-3). In some embodiments, the ASGPR binding moiety is a compound (C-4).

In some embodiments, the compound of Formula (C-1), (C-2), (C-3) or (C4) comprises:

embedded image

wherein n′ is 1 or 2 or a pharmaceutically acceptable salt thereof.

In some embodiments, the ASGPR binding moiety is a compound of Formula (E):

embedded image

or a pharmaceutically acceptable salt thereof, wherein: n is i, 2 or 3; W is absent or is a peptide; L is -(T-Q-T-Q)_m-, wherein each T is independently absent or is (C₁-C₁₀) alkylene, (C₂-C₁₀) alkenylene, or (C₂-C₁₀) alkynylene, wherein one or more carbon groups of said T may each independently be replaced with a heteroatom group independently selected from —O—, —S—, and —N(R⁴)— wherein the heteroatom groups are separated by at least 2 carbon atoms, wherein said alkylene, alkenylene, alkynylene, may each independently be substituted by one or more halo atoms; each Q is independently absent or is C(O), C(O)— R⁴, R⁴—C(O), O—C(O)— R⁴, R⁴—C(O)—O, —CH₂—, a heteroaryl, or a heteroatom group selected from O, S, S—S, S(O), S(O)₂, and NR⁴, wherein at least two carbon atoms separate the heteroatom groups O, S, S—S, S(O), S(O)₂and NR⁴from any other heteroatom group; each R⁴is independently —H, —(C₁-C₂₀)alkyl, —(C₁-C₂₀)alkenyl, —(C₂-C₂₀)alkynyl, or (C₃-C₆)cycloalkyl wherein one to six —CH₂— groups of the alkyl or cycloalkyl separated by at least two carbon atoms may be replaced with —O—, —S—, or —N(R⁴)—, and —CH₃of the alkyl may be replaced with a heteroatom group selected from —N(R⁴)₂, —OR⁴, and —S(R⁴) wherein the heteroatom groups are separated by at least 2 carbon atoms; and wherein the alkyl, alkenyl, alkynyl, and cycloalkyl may be substituted with halo atoms; each m is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40.

In some embodiments, the compound of Formula (E) is selected from:

embedded image

or a pharmaceutically acceptable salt thereof, and Y is as defined in Formula (E).

In some embodiments. n is 1. In some embodiments, n is 2. In some embodiments, n is 3.

In some embodiments of a compound of Formula (E), the compound is:

embedded image

or a pharmaceutically acceptable salt thereof.

In some embodiments, the ASGPR binding moiety is a compound or substructure disclosed in WO2017/083368, which is incorporated herein by reference in its entirety.

In other embodiments, the ASGPR binding moiety is selected from:

embedded image

wherein one of X or Y is a branching point, a linker, or a TREM, e.g., a linker, a nucleobase, internucleotide linkage, or terminus within the TREM sequence, and the other of X and Y is hydrogen.

In an embodiment, the ASGPR binding moiety comprises a structure of Formula (XII-a):

embedded image

In an embodiment, the ASGPR binding moiety is a compound or substructure disclosed in Nucleic Acids (2016) 5:e317 or WO2015/042447, each of which is incorporated herein by reference in its entirety.

In some embodiments, the ASGPR binding moiety comprises a structure of Formula (V-a):

embedded image

wherein n is an integer from 1 to 20. In some embodiments, the compound of Formula (V-a) is selected from:

embedded image

wherein Z is an oligomeric compound, e.g., a linker or a nucleobase within the ASt of a TREM.

In another embodiment, the ASGPR binding moiety comprises a structure of Formula (V-b):

embedded image

wherein A is O or S, A′ is O, S, or NH, and Z is an oligomeric compound, e.g., a linker or TREM, e.g., a linker, a nucleobase, internucleotide linkage, or terminus within the TREM sequence.

In some embodiments, the ASGPR binding moiety comprises

embedded image

In some embodiments, the ASGPR binding moiety is selected from:

embedded image

In an embodiment, the ASGPR binding moiety is a compound or substructure disclosed in WO 2017/156012, which is incorporated herein by reference in its entirety.

In some embodiments, a hydroxyl group within an ASGPR binding moiety is protected, for example, with an acetyl or acetonide moiety. In some embodiments, a hydroxyl group within an ASGPR binding moiety is protected with an acetyl group. In some embodiments, a hydroxyl group within an ASGPR binding moiety is protected with acetonide group. For example, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, or more hydroxyl groups within an ASGPR binding moiety may be protected, e.g., with an acetyl group or an acetonide group. In some embodiments, all of the hydroxyl groups with in an ASGPR binding moiety are protected.

Exemplary TREMs comprising an ASGPR binding moiety may have a binding affinity for an ASGPR of between 0.01 nM to 100 mM. In some embodiments, a TREM comprising an ASGPR binding moiety has a binding affinity of less than 10 mM, e.g., 7.5 mM, 5 mM, 2.5 mM, 1 mM, 0.75 mM, 0.5 mM, 0.25 mM, 0.1 mM, 75 nM, 50 nM, 25 nM, 10 nM, 5 nM, or less.

Exemplary TREMs comprising an ASGPR binding moiety may be internalized into a cell, e.g., a hepatocyte. In some embodiments, a TREM comprising an ASGPR binding moiety has an increased uptake into a cell compared with a TREM that does not comprise an ASGPR binding moiety. For example, a TREM comprising an ASGPR binding moiety may be internalized into a cell more than 1.1, 1.2, 1.3, 1.4, 1.5, 1.75, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 75, 100 times or more than a TREM that does not comprise an ASGPR binding moiety.

Additional exemplary ASGPR moieties are described in further detail in U.S. Pat. Nos. 8,828,956; 9,867,882; 10,450,568; 10,808,246; U.S. Patent Publication Nos. 2015/0246133; 2015/0203843; and 2012/0095200; and PCT Publication Nos. WO 2013/166155, 2012/030683, and 2013/166121, each of which are incorporated herein by reference in its entirety.

ASGPR Linkers

The ASGPR binding moiety comprises at least one linker that connects the carbohydrate to the TREM. In some embodiments, the TREM is connected to one or more carbohydrates (e.g., GalNAc moieties, e.g., of Formula (I)), through a linker as described herein. The linker may be monovalent or multivalent, e.g., bivalent, trivalent, tetravalent, or pentavalent. In some embodiments, the linker comprises a structure selected from:

embedded image

wherein q2A, q2B, q3A, q3B, q4A, q4B, q5A, q5B and q5C represent independently for each occurrence 0-20 and wherein the repeating unit can be the same or different; P^2Ap^2Bp^3Ap^3BP^4A, p^4B, p^5A, p^5B, P^5C, T^2A, T^2B, T^3A, T^3B, T^4A, T^4B, T^4A, T^5B, T^5Care each independently for each occurrence absent, CO, NH, O, S, OC(O), NHC(O), CH₂, CH₂NH or CH₂O; Q^2A, Q^2B, Q^3A, Q^3B, Q^4A, Q^4B, Q^5A, Q^5B, Q^5Care independently for each occurrence absent, alkylene, substituted alkylene wherein one or more methylenes can be interrupted or terminated by one or more of O, S, S(O), SO₂, N(R^N), C(R′)═C(R″), C≡C or C(O); R^2A, R^2B, R^3A, R^3B, R^4A, R^4B, R^5A, R^5B, R^5Care each independently for each occurrence absent, NH, O, S, CH₂, C(O)O, C(O)NH, NHCH(R^a)C(O), —C(O)—CH(R^a)—NH—, CO, CH═N—O,

embedded image

or heterocyclyl; L^2AL^2B, L^3AL^3BL^4AL^4BL^5AL^5Band L^5Crepresent the ligand; i.e. each independently for each occurrence a monosaccharide (such as GalNAc), disaccharide, trisaccharide, tetrasaccharide, oligosaccharide, or polysaccharide; and R^ais H or amino acid side chain.

In some embodiments, the linker comprises:

embedded image

wherein L^5A, L^5Band L^5Crepresent a monosaccharide, such as GalNAc derivative, e.g., as described herein.

A cleavable linking group is one which is sufficiently stable outside the cell, but which upon entry into a target cell is cleaved to release the two parts the linker is holding together. In a preferred embodiment, the cleavable linking group is cleaved at least about 10 times, 20, times, 30 times, 40 times, 50 times, 60 times, 70 times, 80 times, 90 times or more, or at least about 100 times faster in a target cell or under a first reference condition (which can, e.g., be selected to mimic or represent intracellular conditions) than in the blood of a subject, or under a second reference condition (which can, e.g., be selected to mimic or represent conditions found in the blood or serum).

Cleavable linking groups are susceptible to cleavage agents, e.g., pH, redox potential or the presence of degradative molecules. Generally, cleavage agents are more prevalent or found at higher levels or activities inside cells than in serum or blood. Examples of such degradative agents include: redox agents which are selected for particular substrates or which have no substrate specificity, including, e.g., oxidative or reductive enzymes or reductive agents such as mercaptans, present in cells, that can degrade a redox cleavable linking group by reduction; esterases; endosomes or agents that can create an acidic environment, e.g., those that result in a pH of five or lower; enzymes that can hydrolyze or degrade an acid cleavable linking group by acting as a general acid, peptidases (which can be substrate specific), and phosphatases. A cleavable linkage group, such as a disulfide bond can be susceptible to pH. The pH of human serum is 7.4, while the average intracellular pH is slightly lower, ranging from about 7.1-7.3. Endosomes have a more acidic pH, in the range of 5.5-6.0, and lysosomes have an even more acidic pH at around 5.0. Some linkers will have a cleavable linking group that is cleaved at a preferred pH, thereby releasing a cationic lipid from the ligand inside the cell, or into the desired compartment of the cell.

A linker can include a cleavable linking group that is cleavable by a particular enzyme. The type of cleavable linking group incorporated into a linker can depend on the cell to be targeted. For example, a liver-targeting ligand can be linked to a cationic lipid through a linker that includes an ester group. Liver cells are rich in esterases, and therefore the linker will be cleaved more efficiently in liver cells than in cell types that are not esterase-rich. Other cell-types rich in esterases include cells of the lung, renal cortex, and testis. Linkers that contain peptide bonds can be used when targeting cell types rich in peptidases, such as liver cells and synoviocytes.

In general, the suitability of a candidate cleavable linking group can be evaluated by testing the ability of a degradative agent (or condition) to cleave the candidate linking group. It will also be desirable to also test the candidate cleavable linking group for the ability to resist cleavage in the blood or when in contact with other non-target tissue. Thus, one can determine the relative susceptibility to cleavage between a first and a second condition, where the first is selected to be indicative of cleavage in a target cell and the second is selected to be indicative of cleavage in other tissues or biological fluids, e.g., blood or serum. The evaluations can be carried out in cell free systems, in cells, in cell culture, in organ or tissue culture, or in whole animals. It can be useful to make initial evaluations in cell-free or culture conditions and to confirm by further evaluations in whole animals. In preferred embodiments, useful candidate compounds are cleaved at least about 2, 4, 10, 20, 30, 40, 50, 60, 70, 80, 90, or about 100 times faster in the cell (or under in vitro conditions selected to mimic intracellular conditions) as compared to blood or serum (or under in vitro conditions selected to mimic extracellular conditions).

In one embodiment, a cleavable linking group is a redox cleavable linking group that is cleaved upon reduction or oxidation. An example of reductively cleavable linking group is a disulphide linking group (—S—S—). To determine if a candidate cleavable linking group is a suitable “reductively cleavable linking group,” or for example is suitable for use with a particular TREM moiety and particular targeting agent one can look to methods described herein. For example, a candidate can be evaluated by incubation with dithiothreitol (DTT), or other reducing agent using reagents know in the art, which mimic the rate of cleavage which would be observed in a cell, e.g., a target cell. The candidates can also be evaluated under conditions which are selected to mimic blood or serum conditions. In one, candidate compounds are cleaved by at most about 10% in the blood. In other embodiments, useful candidate compounds are degraded at least about 2, 4, 10, 20, 30, 40, 50, 60, 70, 80, 90, or about 100 times faster in the cell (or under in vitro conditions selected to mimic intracellular conditions) as compared to blood (or under in vitro conditions selected to mimic extracellular conditions). The rate of cleavage of candidate compounds can be determined using standard enzyme kinetics assays under conditions chosen to mimic intracellular media and compared to conditions chosen to mimic extracellular media.

In another embodiment, a cleavable linker comprises a phosphate-based cleavable linking group. A phosphate-based cleavable linking group is cleaved by agents that degrade or hydrolyze the phosphate group. An example of an agent that cleaves phosphate groups in cells are enzymes such as phosphatases in cells. Examples of phosphate-based linking groups are —O—P(O)(ORk)-O—, —O—P(S)(ORk)-O—, —O—P(S)(SRk)-O—, —S—P(O)(ORk)-O—, —O—P(O)(ORk)-S—, —S—P(O)(ORk)-S—, —O—P(S)(ORk)-S—, —S—P(S)(ORk)-O—, —O—P(O)(Rk)-O—, —O—P(S)(Rk)-O—, —S—P(O)(Rk)-O—, —S—P(S)(Rk)-O—, —S—P(O)(Rk)-S—, —O—P(S)(Rk)-S—. Preferred embodiments are —O—P(O)(OH)—O—, —O—P(S)(OH)—O—, —O—P(S)(SH)—O—, —S—P(O)(OH)—O—, —O—P(O)(OH)—S—, —S—P(O)(OH)—S—, —O—P(S)(OH)—S—, —S—P(S)(OH)—O—, —O—P(O)(H)—O—, —O—P(S)(H)—O—, —S—P(O)(H)—O, —S—P(S)(H)—O—, —S—P(O)(H)—S—, —O—P(S)(H)—S—. A preferred embodiment is —O—P(O)(OH)—O—. These candidates can be evaluated using methods analogous to those described above.

In another embodiment, a cleavable linker comprises an acid cleavable linking group. An acid cleavable linking group is a linking group that is cleaved under acidic conditions. In preferred embodiments acid cleavable linking groups are cleaved in an acidic environment with a pH of about 6.5 or lower (e.g., about 6.0, 5.75, 5.5, 5.25, 5.0, or lower), or by agents such as enzymes that can act as a general acid. In a cell, specific low pH organelles, such as endosomes and lysosomes can provide a cleaving environment for acid cleavable linking groups. Examples of acid cleavable linking groups include but are not limited to hydrazones, esters, and esters of amino acids. Acid cleavable groups can have the general formula —C═NN—, C(O)O, or —OC(O). A preferred embodiment is when the carbon attached to the oxygen of the ester (the alkoxy group) is an aryl group, substituted alkyl group, or tertiary alkyl group such as dimethyl pentyl or t-butyl. These candidates can be evaluated using methods analogous to those described above.

In another embodiment, a cleavable linker comprises an ester-based cleavable linking group. An ester-based cleavable linking group is cleaved by enzymes such as esterases and amidases in cells. Examples of ester-based cleavable linking groups include but are not limited to esters of alkylene, alkenylene and alkynylene groups. Ester cleavable linking groups have the general formula —C(O)O—, or —OC(O)—. These candidates can be evaluated using methods analogous to those described above.

In yet another embodiment, a cleavable linker comprises a peptide-based cleavable linking group. A peptide-based cleavable linking group is cleaved by enzymes such as peptidases and proteases in cells. Peptide-based cleavable linking groups are peptide bonds formed between amino acids to yield oligopeptides (e.g., dipeptides, tripeptides etc.) and polypeptides. Peptide-based cleavable groups do not include the amide group (—C(O)NH—). The amide group can be formed between any alkylene, alkenylene or alkynelene. A peptide bond is a special type of amide bond formed between amino acids to yield peptides and proteins. The peptide based cleavage group is generally limited to the peptide bond (i.e., the amide bond) formed between amino acids yielding peptides and proteins and does not include the entire amide functional group. Peptide-based cleavable linking groups have the general formula —NHCHRAC(O)NHCHRBC(O)— (SEQ ID NO: 13), where RA and RB are the R groups of the two adjacent amino acids. These candidates can be evaluated using methods analogous to those described above.

The ASGPR binding moiety may be bound to any nucleotide position within a domain (ASt Domain1, DH Domain, ACH Domain, VL Domain, TH Domain, and/or ASt Domain2) of a TREM. In an embodiment, the ASGPR moiety is bound to a nucleobase, terminus, or internucleotide linkage within a TREM. In an embodiment, the ASGPR moiety is bound to a nucleobase within a TREM. In an embodiment, the ASGPR binding moiety is bound to any adenine nucleobase within a domain (ASt Domain1, DH Domain, ACH Domain, VL Domain, TH Domain, and/or ASt Domain2) of the TREM. In an embodiment, ASGPR binding moiety is bound to any cytosine nucleobase within a domain (ASt Domain1, DH Domain, ACH Domain, VL Domain, TH Domain, and/or ASt Domain2) of the TREM. In an embodiment, it is bound to any guanosine nucleobase within a domain (ASt Domain1, DH Domain, ACH Domain, VL Domain, TH Domain, and/or ASt Domain2) of the TREM. In an embodiment, it is bound to any uracil nucleobase within a domain (ASt Domain1, DH Domain, ACH Domain, VL Domain, TH Domain, and/or ASt Domain2) of the TREM. In an embodiment, it is bound to any thymine nucleobase within a domain (ASt Domain1, DH Domain, ACH Domain, VL Domain, TH Domain, and/or ASt Domain2) of the TREM.

In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 1 (e.g., present within a nucleobase at TREM position 1). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 2 (e.g., present within a nucleobase at TREM position 2). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 3 (e.g., present within a nucleobase at TREM position 3). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 4 (e.g., present within a nucleobase at TREM position 4). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 5 (e.g., present within a nucleobase at TREM position 5). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 6 (e.g., present within a nucleobase at TREM position 6). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 7 (e.g., present within a nucleobase at TREM position 7). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 8 (e.g., present within a nucleobase at TREM position 8). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 9 (e.g., present within a nucleobase at TREM position 9).

In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 10 (e.g., present within a nucleobase at TREM position 10). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 11 (e.g., present within a nucleobase at TREM position 11). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 12 (e.g., present within a nucleobase at TREM position 12). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 13 (e.g., present within a nucleobase at TREM position 13). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 14 (e.g., present within a nucleobase at TREM position 14). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 15 (e.g., present within a nucleobase at TREM position 15). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 16 (e.g., present within a nucleobase at TREM position 16). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 17 (e.g., present within a nucleobase at TREM position 17). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 18 (e.g., present within a nucleobase at TREM position 18). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 19 (e.g., present within a nucleobase at TREM position 19). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 20 (e.g., present within a nucleobase at TREM position 20). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 21 (e.g., present within a nucleobase at TREM position 21). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 22 (e.g., present within a nucleobase at TREM position 22). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 23 (e.g., present within a nucleobase at TREM position 23). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 24 (e.g., present within a nucleobase at TREM position 24). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 25 (e.g., present within a nucleobase at TREM position 25). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 26 (e.g., present within a nucleobase at TREM position 26).

In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 27 (e.g., present within a nucleobase at TREM position 27). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 28 (e.g., present within a nucleobase at TREM position 28). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 29 (e.g., present within a nucleobase at TREM position 29). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 30 (e.g., present within a nucleobase at TREM position 30). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 31 (e.g., present within a nucleobase at TREM position 31). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 32 (e.g., present within a nucleobase at TREM position 32). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 33 (e.g., present within a nucleobase at TREM position 33). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 34 (e.g., present within a nucleobase at TREM position 34). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 35 (e.g., present within a nucleobase at TREM position 35). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 36 (e.g., present within a nucleobase at TREM position 36). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 37 (e.g., present within a nucleobase at TREM position 37). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 38 (e.g., present within a nucleobase at TREM position 38). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 39 (e.g., present within a nucleobase at TREM position 39). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 40 (e.g., present within a nucleobase at TREM position 40). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 41 (e.g., present within a nucleobase at TREM position 41). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 42 (e.g., present within a nucleobase at TREM position 42). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 43 (e.g., present within a nucleobase at TREM position 43).

In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 44 (e.g., present within a nucleobase at TREM position 44). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 45 (e.g., present within a nucleobase at TREM position 45). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 46 (e.g., present within a nucleobase at TREM position 46). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 47 (e.g., present within a nucleobase at TREM position 47). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 48 (e.g., present within a nucleobase at TREM position 48). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 49 (e.g., present within a nucleobase at TREM position 49).

In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 50 (e.g., present within a nucleobase at TREM position 50). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 51 (e.g., present within a nucleobase at TREM position 51). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 52 (e.g., present within a nucleobase at TREM position 52). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 53 (e.g., present within a nucleobase at TREM position 53). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 54 (e.g., present within a nucleobase at TREM position 54). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 55 (e.g., present within a nucleobase at TREM position 55). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 56 (e.g., present within a nucleobase at TREM position 56). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 57 (e.g., present within a nucleobase at TREM position 57). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 58 (e.g., present within a nucleobase at TREM position 58). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 59 (e.g., present within a nucleobase at TREM position 59). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 60 (e.g., present within a nucleobase at TREM position 60). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 61 (e.g., present within a nucleobase at TREM position 61). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 62 (e.g., present within a nucleobase at TREM position 62). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 63 (e.g., present within a nucleobase at TREM position 63). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 64 (e.g., present within a nucleobase at TREM position 64).

In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 65 (e.g., present within a nucleobase at TREM position 65). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 66 (e.g., present within a nucleobase at TREM position 66). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 67 (e.g., present within a nucleobase at TREM position 67). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 68 (e.g., present within a nucleobase at TREM position 68). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 69 (e.g., present within a nucleobase at TREM position 69). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 70 (e.g., present within a nucleobase at TREM position 70). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 71 (e.g., present within a nucleobase at TREM position 71). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 72 (e.g., present within a nucleobase at TREM position 72). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 73 (e.g., present within a nucleobase at TREM position 73). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 74 (e.g., present within a nucleobase at TREM position 74). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 75 (e.g., present within a nucleobase at TREM position 75). In an embodiment, the ASGPR binding moiety is present within a TREM at TREM position 76 (e.g., present within a nucleobase at TREM position 76).

In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 1 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 2 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 3 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 4 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 5 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 6 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 7 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 8 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 9 (G).

In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 10 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 11 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 12 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 13 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 14 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 15 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 16 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 17 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 18 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 19 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 20 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 21 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 22 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 23 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 24 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 25 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 26 (A).

In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 27 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 28 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 29 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 30 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 31 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 32 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 33 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 34 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 35 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 36 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 37 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 38 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 39 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 40 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 41 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 42 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 43 (A).

In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 44 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 45 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 46 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 47 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 48 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 49 (C)

In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 50 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 51 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 52 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 53 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 54 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 55 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 56 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 57 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 58 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 59 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 60 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 61 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 62 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 63 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 64 (G).

In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 76 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 75 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 74 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 73 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 72 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 71 (U). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 70 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 69 (A). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 68 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 67 (G). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 66 (C). In an embodiment, the ASGPR binding moiety is bound to a nucleobase at TREM position 65 (G).

In an embodiment, the TREM comprising an ASGPR binding moiety comprises a ribonucleic acid (RNA) sequence encoded by a deoxyribonucleic acid (DNA) sequence disclosed in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment the TREM comprising an ASGPR binding moiety comprises an RNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, the TREM comprising an ASGPR binding moiety comprises an RNA sequence encoded by a DNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5, 10, 15, 20, 25, or 30 consecutive nucleotides of an RNA sequence encoded by a DNA sequence disclosed in Table 1, e.g., at least 5, 10, 15, 20, 25, or 30 consecutive nucleotides of an RNA sequence encoded by any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5, 10, 15, 20, 25, or 30 consecutive nucleotides of an RNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5, 10, 15, 20, 25, or 30 consecutive nucleotides of an RNA sequence encoded by a DNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% of an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% of an RNA sequence at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5% 10%, 15%, 20%, 25%, 30%, 35%, 40%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% of an RNA sequence encoded by a DNA sequence at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of an RNA sequence encoded by a DNA sequence disclosed in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of an RNA sequence which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to an RNA sequence encoded by a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of an RNA sequence encoded by a DNA sequence with at least 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99% or 100% identity to a DNA sequence provided in Table 1, e.g., any one of SEQ ID NOs: 1-451 disclosed in Table 1.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises a ribonucleic acid (RNA) sequence encoded by a deoxyribonucleic acid (DNA) sequence disclosed in Table 4, e.g., any one of SEQ ID NOs: 452-561 disclosed in Table 4. In an embodiment the TREM comprising an ASGPR binding moiety comprises an RNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to an RNA sequence encoded by a DNA sequence provided in Table 4, e.g., any one of SEQ ID NOs: 452-561 disclosed in Table 4. In an embodiment, the TREM comprising an ASGPR binding moiety comprises an RNA sequence encoded by a DNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to a DNA sequence provided in Table 4, e.g., any one of SEQ ID NOs: 452-561 disclosed in Table 4.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of an RNA sequence encoded by a DNA sequence provided in Table 4, e.g., any one of SEQ ID NOs: 452-561 disclosed in Table 4. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of an RNA sequence which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to an RNA sequence encoded by a DNA sequence provided in Table 4, e.g., any one of SEQ ID NOs: 452-561 disclosed in Table 4. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of an RNA sequence encoded by a DNA sequence with at least 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, 99% or 100% identity to a DNA sequence provided in Table 4, e.g., any one of SEQ ID NOs: 452-561 disclosed in Table 4.

In an embodiment, the TREM comprising an ASGPR binding moiety is a compound provided in Table 12, e.g., any one of Compound Nos. 99-131. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 99. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 100. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 101. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 102. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 103. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 104. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 105. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 106. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 107. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 108. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 109. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 110. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 111. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 112. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 113. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 114. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 115. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 116. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 117. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 118. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 119. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 120. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 121. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 122. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 123. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 124. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 125. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 126. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 127. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 128. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 129. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 130. In an embodiment, the TREM comprising an ASGPR binding moiety is Compound 131.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises a compound having an RNA sequence at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to an RNA sequence of a TREM provided in Table 12, e.g., any one of Compounds 100-131 provided in Table 12. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of a TREM provided in Table 12, e.g., any one of Compounds 100-131 disclosed in Table 12. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of a TREM which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to TREM provided in Table 12, e.g., any one of Compounds 100-131 disclosed in Table 12.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises a sequence provided in Table 12, e.g., any one of SEQ ID NOs: 622-654. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 622. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 623. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 624. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 625. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 626. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 627. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 628. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 629. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 630. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 631. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 632. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 633. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 634. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 635. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 636. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 637. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 638. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 639. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 640. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 641. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 642. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 643. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 644. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 645. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 646. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 647. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 648. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 649. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 650. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 651. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 652. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 653. In an embodiment, the TREM comprising an ASGPR binding moiety comprises SEQ ID NO. 654.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises a sequence that is at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to a sequence of a TREM provided in Table 12, e.g., any one of SEQ ID NOs. 622-654 provided in Table 12. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of a TREM provided in Table 12, e.g., any one of SEQ ID NOs. 622-654 disclosed in Table 12. In an embodiment, the TREM comprising an ASGPR binding moiety comprises at least 5 ribonucleotides (nt), 10 nt, 15 nt, 20 nt, 25 nt, 30 nt, 35 nt, 40 nt, 45 nt, 50 nt, 55 nt or 60 nt (but less than the full length) of a TREM which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to TREM provided in Table 12, e.g., any one of SEQ ID NOs. 622-654 disclosed in Table 12.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises a sequence that differs no more than 1 ribonucleotide (nt), 2 nt, 3 nt, 4 nt, 5 nt, 6 nt, 7 nt, 8 nt, 9 nt, 10 nt, 12 nt, 14 nt, 16 nt, 18, nt, or 20 nt from a TREM provided in Table 12, e.g., any one of SEQ ID NOs. 622-652 provided in Table 12.

In an embodiment, the TREM comprising an ASGPR binding moiety is at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO. 622. In an embodiment, the TREM comprising an ASGPR binding moiety is at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO. 650. In an embodiment, the TREM comprising an ASGPR binding moiety is at least 60%, 65%, 70%, 75%, 80%, 82%, 85%, 87%, 88%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO. 653.

In an embodiment, the TREM comprising an ASGPR binding moiety comprises a sequence that differs comprises by least 1 ribonucleotide (nt), 2 nt, 3 nt, 4 nt, 5 nt, 6 nt, 7 nt, 8 nt, 9 nt, 10 nt, 12 nt, 14 nt, 16 nt, 18 nt, 20 nt, 25 nt, 30 nt, 40 nt, 45 nt, 50 nt, 55 nt, or more from SEQ ID NO. 622. In an embodiment, the TREM comprising an ASGPR binding moiety comprises a sequence that differs no more than 1 ribonucleotide (nt), 2 nt, 3 nt, 4 nt, 5 nt, 6 nt, 7 nt, 8 nt, 9 nt, 10 nt, 12 nt, 14 nt, 16 nt, 18, nt, or 20 nt from SEQ ID NO. 622. In an embodiment, the TREM comprising an ASGPR binding moiety comprises a sequence that differs comprises by least 1 ribonucleotide (nt), 2 nt, 3 nt, 4 nt, 5 nt, 6 nt, 7 nt, 8 nt, 9 nt, 10 nt, 12 nt, 14 nt, 16 nt, 18 nt, 20 nt, 25 nt, 30 nt, 40 nt, 45 nt, 50 nt, 55 nt, or more from SEQ ID NO. 650. In an embodiment, the TREM comprising an ASGPR binding moiety comprises a sequence that differs no more than 1 ribonucleotide (nt), 2 nt, 3 nt, 4 nt, 5 nt, 6 nt, 7 nt, 8 nt, 9 nt, 10 nt, 12 nt, 14 nt, 16 nt, 18, nt, or 20 nt from SEQ ID NO. 650. In an embodiment, the TREM comprising an ASGPR binding moiety comprises a sequence that differs comprises by least 1 ribonucleotide (nt), 2 nt, 3 nt, 4 nt, 5 nt, 6 nt, 7 nt, 8 nt, 9 nt, 10 nt, 12 nt, 14 nt, 16 nt, 18 nt, 20 nt, 25 nt, 30 nt, 40 nt, 45 nt, 50 nt, 55 nt, or more from SEQ ID NO. 653. In an embodiment, the TREM comprising an ASGPR binding moiety comprises a sequence that differs no more than 1 ribonucleotide (nt), 2 nt, 3 nt, 4 nt, 5 nt, 6 nt, 7 nt, 8 nt, 9 nt, 10 nt, 12 nt, 14 nt, 16 nt, 18, nt, or 20 nt from SEQ ID NO. 653.

Chemically Modified TREMs In some embodiments, a TREM entity (e.g, a TREM, a TREM core fragment or a TREM fragment described herein) further comprises a chemical modification, e.g., a modification described in any one of Tables 5-9, in addition to an ASGPR binding moiety. A chemical modification can be made according to methods known in the art. In an embodiment, a chemical modification is a modification that a cell, e.g., a human cell, does not make on an endogenous tRNA.

In an embodiment, a chemical modification is a modification that a cell, e.g., a human cell, can make on an endogenous tRNA, but wherein such modification is in a location in which it does not occur on a native tRNA. In an embodiment, the chemical modification is in a domain, linker or arm which does not have such modification in nature. In an embodiment, the chemical modification is at a position within a domain, linker or arm, which does not have such modification in nature. In an embodiment, the chemical modification is on a nucleotide which does not have such modification in nature. In an embodiment, the chemical modification is on a nucleotide at a position within a domain, linker or arm, which does not have such modification in nature.

Any of the nucleic acids featured in the disclosure can be synthesized and/or modified by methods well established in the art, such as those described in “Current protocols in nucleic acid chemistry,” Beaucage, S. L. et al. (Edrs.), John Wiley & Sons, Inc., New York, NY, USA, which is hereby incorporated herein by reference. Modifications include, for example, end modifications, e.g., 5′-end modifications (phosphorylation, conjugation, inverted linkages) or 3 ′-end modifications (conjugation, DNA nucleotides, inverted linkages, etc.); base modifications, e.g., replacement with stabilizing bases, destabilizing bases, or bases that base pair with an expanded repertoire of partners, removal of bases (abasic nucleotides), or conjugated bases; sugar modifications (e.g., at the 2′-position or 4′-position) or replacement of the sugar; and/or backbone modifications, including modification or replacement of the phosphodiester linkages. Specific examples of TREM compounds useful in the embodiments described herein include, but are not limited to TREMs containing modified backbones or no natural internucleoside linkages. TREMs having modified backbones include, among others, those that do not have a phosphorus atom in the backbone. For the purposes of this specification, and as sometimes referenced in the art, modified RNAs that do not have a phosphorus atom in their internucleoside backbone can also be considered to be oligonucleosides. In some embodiments, a modified TREMs will have a phosphorus atom in its internucleoside backbone.

Modified TREM backbones include, for example, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3′-alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3′-5′ linkages, 2′-5′-linked analogs of these, and those having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3′-5′ to 5′-3′ or 2′-5′ to 5′-2′. Various salts, mixed salts and free acid forms are also included.

Representative U.S. patents that disclose the preparation of the above phosphorus-containing linkages include, but are not limited to, U.S. Pat. Nos. 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,195; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,316; 5,550,111; 5,563,253; 5,571,799; 5,587,361; 5,625,050; 6,028,188; 6,124,445; 6,160,109; 6,169,170; 6,172,209; 6,239,265; 6,277,603; 6,326,199; 6,346,614; 6,444,423; 6,531,590; 6,534,639; 6,608,035; 6,683,167; 6,858,715; 6,867,294; 6,878,805; 7,015,315; 7,041,816; 7,273,933; 7,321,029; and U.S. Pat. RE39464, the entire contents of each of which are hereby incorporated herein by reference.

Modified TREM backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatoms and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH₂component parts. Representative U.S. patents that teach the preparation of the above oligonucleosides include, but are not limited to, U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,64,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and, 5,677,439, the entire contents of each of which are hereby incorporated herein by reference.

In other embodiments, suitable RNA mimetics are contemplated for use in TREMs, in which both the sugar and the internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization with an appropriate nucleic acid target compound. One such oligomeric compound, an RNA mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the sugar backbone of an RNA is replaced with an amide containing backbone, in particular an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative U.S. patents that teach the preparation of PNA compounds include, but are not limited to, U.S. Pat. Nos. 5,539,082; 5,714,331; and 5,719,262, the entire contents of each of which are hereby incorporated herein by reference. Additional PNA compounds suitable for use in the TREMs of the disclosure are described in, for example, in Nielsen et al., Science, 1991, 254, 1497-1500.

Some embodiments featured in the disclosure include TREMs with phosphorothioate backbones and oligonucleosides with heteroatom backbones, and in particular —CH₂—NH—CH₂—, —CH₂—N(CH₃)—O—CH₂— [known as a methylene (methylimino) or MMI backbone], —CH₂—O—N(CH₃)—CH₂—, —CH₂—N(CH₃)—N(CH₃)—CH₂— and —N(CH₃)—CH₂—CH₂— [wherein the native phosphodiester backbone is represented as —O—P—O—CH₂— ] of the above-referenced U.S. Pat. No. 5,489,677, and the amide backbones of the above-referenced U.S. Pat. No. 5,602,240. In some embodiments, the TREMs featured herein have morpholino backbone structures of the above-referenced U.S. Pat. No. 5,034,506.

The TREMs featured herein can include one of the following at the 2′-position: OH; F; 0-S—, or N-alkyl; O—, S—, or N-alkenyl; O—, S- or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl can be substituted or unsubstituted Ci to C₁₀alkyl or C₂to C₁₀alkenyl and alkynyl. Exemplary suitable modifications include O[(CH₂)_nO]_mCH₃, O(CH₂)·_nOCH₃, O(CH₂)_nNH₂, O(CH₂)_nCH₃, O(CH₂)_nONH₂, and O(CH₂)_nON[(CH₂)_nCH₃)]₂, where n and m are from 1 to about 10. In other embodiments, TREMs may include one of the following at the 2′ position: Ci to C₁₀lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O— alkaryl or O-aralkyl, SH, SCH₃, OCN, Cl, Br, CN, CF₃, OCF₃, SOCH₃, SO₂CH₃, ONO₂, NO₂, N3, NH₂, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of a TREM, or a group for improving the pharmacodynamic properties of a TREM, and other substituents having similar properties.

In some embodiments, the modification includes a 2′-methoxyethoxy (2′-O—CH₂CH₂OCH₃, also known as 2′-O—(2-methoxyethyl) or 2′-MOE) (Martin et al., Helv. Chim. Acta, 1995, 78:486-504) i.e., an alkoxy-alkoxy group. Another exemplary modification is 2′-dimethylaminooxyethoxy, i.e., a O(CH₂)₂ON(CH₃)₂group, also known as 2′-DMAOE, as described in examples herein below, and 2′-dimethylaminoethoxyethoxy (also known in the art as 2′—O-dimethylaminoethoxyethyl or 2′-DMAEOE), i.e., 2′-O—CH₂—O—CH₂—N(CH₂)₂.

Other modifications include 2′-methoxy (2′-OCH₃), 2′-aminopropoxy (2′-OCH₂CH₂CH₂NH₂) and 2′-fluoro (2′-F). Similar modifications can also be made at other positions within the TREM, particularly the 3′ position of the sugar on the 3′ terminal nucleotide or in 2′-5′ linked TREMs and the 5′ position of 5′ terminal nucleotide. TREMs can also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. Representative U.S. patents that teach the preparation of such modified sugar structures include, but are not limited to, U.S. Pat. Nos. 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 5,658,873; 5,670,633; and 5,700,920, certain of which are commonly owned with the instant application. The entire contents of each of the foregoing are hereby incorporated herein by reference.

TREMs can also include nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases include other synthetic and natural nucleobases such as deoxy-thymine (dT), 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl anal other 8-substituted adenines and guanines, 5-halo, particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-daazaadenine and 3-deazaguanine and 3-deazaadenine. Further nucleobases include those disclosed in U.S. Pat. No. 3,687,808, those disclosed in Modified Nucleosides in Biochemistry, Biotechnology and Medicine, Herdewijn, P. ed. Wiley-VCH, 2008; those disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 858-859, Kroschwitz, J. L, ed. John Wiley & Sons, 1990, these disclosed by Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed by Sanghvi, Y S., Chapter 15, dsRNA Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B., Ed., CRC Press, 1993. Certain of these nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds featured in the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and 0-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2° C. (Sanghvi, Y. S., Crooke, S. T. and Lebleu, B., Eds., dsRNA Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and are exemplary base substitutions, even more particularly when combined with 2′—O-methoxyethyl sugar modifications.

Representative U.S. patents that teach the preparation of certain of the above noted modified nucleobases as well as other modified nucleobases include, but are not limited to, the above noted U.S. Pat. Nos. 3,687,808, 4,845,205; 5,130,30; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; 5,681,941; 5,750,692; 6,015,886; 6,147,200; 6,166,197; 6,222,025; 6,235,887; 6,380,368; 6,528,640; 6,639,062; 6,617,438; 7,045,610; 7,427,672; and 7,495,088, the entire contents of each of which are hereby incorporated herein by reference.

The TREM can also be modified to include one or more bicyclic sugar moieties. A “bicyclic sugar” is a furanosyl ring modified by the bridging of two atoms. A“bicyclic nucleoside” (“BNA”) is a nucleoside having a sugar moiety comprising a bridge connecting two carbon atoms of the sugar ring, thereby forming a bicyclic ring system. In certain embodiments, the bridge connects the 4′-carbon and the 2′-carbon of the sugar ring. Thus, in some embodiments an agent of the invention may include the RNA of a TREM can also be modified to include one or more locked nucleic acids (LNA). A locked nucleic acid is a nucleotide having a modified ribose moiety in which the ribose moiety comprises an extra bridge connecting the 2′ and 4′ carbons. In other words, an LNA is a nucleotide comprising a bicyclic sugar moiety comprising a 4′—CH₂—O—2′ bridge. This structure effectively “locks” the ribose in the 3′-endo structural conformation. The addition of locked nucleic acids to oligonucleotide sequences has been shown to increase their stability in serum, and to reduce off-target effects (Elmen, J. et al, (2005) Nucleic Acids Research 33(1):439-447; Mook, O R. et al, (2007) Mol Cane Ther 6(3):833-843; Grunweller, A. et al, (2003) Nucleic Acids Research 31(12):3185-3193)

In an embodiment, a TREM, a TREM core fragment or a TREM fragment described herein comprises a chemical modification provided in Table 5, or a combination thereof.

TABLE 5

Exemplary modifications

Name

7-deaza-adenosine

Nl-methyl-adenosine

N6,N6 (dimethyl)adenine

N6-cis-hydroxy-isopentenyl-adenosine

thio-adenosine

2-(amino)adenine

2-(aminopropyl)adenine

2-(methylthio) N6 (isopentenyl)adenine

2-(alkyl)adenine

2-(aminoalkyl)adenine

2-(aminopropyl)adenine

2-(halo)adenine

2-(propyl)adenine

2′-azido-2′-deoxy-adenosine

2′-Deoxy-2′-alpha-aminoadenosine

2′-Deoxy-2′-alpha-azidoadenosine

6-(alkyl)adenine

6-(methyl)adenine

6-(alkyl)adenine

6-(methyl)adenine

7-(deaza)adenine

8-(alkenyl)adenine

8-(alkynyl)adenine

8-(amino)adenine

8-(thioalkyl)adenine

8-(alkenyl)adenine

8-(alkyl)adenine

8-(alkynyl)adenine

8-(amino)adenine

8-(halo)adenine

8-(hydroxyl)adenine

8-(thioalkyl)adenine

8-(thiol)adenine

8-azido-adenosine

azaadenine

deazaadenine

N6-(methyl)adenine

N6-(isopentyl)adenine

7-deaza-8-aza-adenosine

7-methyladenine

1-deazaadenosine

2′-Fluoro-N6-Bz-deoxyadenosine

2′-OMe-2-Amino-adenosine

2′O-methyl-N6-Bz-deoxyadenosine

2′-alpha-ethynyladenosine

2-aminoadenine

2-Aminoadenosine

2-Amino-adenosine

2′-alpha-Trifluoromethyladenosine

2-Azidoadenosine

2′-beta-Ethynyladenosine

2-Bromoadenosine

2′-beta-Trifluoromethyladenosine

2-Chloroadenosine

2′-Deoxy-2′,2′-difluoroadenosine

2′-Deoxy-2′-alpha-mercaptoadenosine

2′-Deoxy-2′-alpha-thiomethoxyadenosine

2′-Deoxy-2′-beta-aminoadenosine

2′-Deoxy-2′-beta-azidoadenosine

2′-Deoxy-2′-beta-bromoadenosine

2′-Deoxy-2′-beta-chloroadenosine

2′-Deoxy-2′-beta-fluoroadenosine

2′-Deoxy-2′-beta-iodoadenosine

2′-Deoxy-2′-beta-mercaptoadenosine

2′-Deoxy-2′-beta-thiomethoxyadenosine

2-Fluoroadenosine

2-Iodoadenosine

2-Mercaptoadenosine

2-methoxy-adenine

2-methylthio-adenine

2-Trifluoromethyladenosine

3-Deaza-3-bromoadenosine

3-Deaza-3-chloroadenosine

3-Deaza-3-fluoroadenosine

3-Deaza-3-iodoadenosine

3-Deazaadenosine

4′-Azidoadenosine

4′-Carbocyclic adenosine

4′-Ethynyladenosine

5′-Homo-adenosine

8-Aza-adenosine

8-bromo-adenosine

8-Trifluoromethyladenosine

9-Deazaadenosine

2-aminopurine

7-deaza-2,6-diaminopurine

7-deaza-8-aza-2,6-diaminopurine

7-deaza-8-aza-2-aminopurine

2,6-diaminopurine

7-deaza-8-aza-adenine, 7-deaza-2-aminopurine

4-methylcytidine

5-aza-cytidine

Pseudo-iso-cytidine

pyrrolo-cytidine

alpha-thio-cytidine

2-(thio)cytosine

2′-Amino-2′-deoxy-cytosine

2′-Azido-2′-deoxy-cytosine

2′-Deoxy-2′-alpha-aminocytidine

2′-Deoxy-2′-alpha-azidocytidine

3 (deaza) 5 (aza)cytosine

3 (methyl)cytosine

3-(alkyl)cytosine

3-(deaza) 5 (aza)cytosine

3-(methyl)cytidine

4,2′-O-dimethylcytidine

5 (halo)cytosine

5 (methyl)cytosine

5 (propynyl)cytosine

5 (trifluoromethyl)cytosine

5-(alkyl)cytosine

5-(alkynyl)cytosine

5-(halo)cytosine

5-(propynyl)cytosine

5-(trifluoromethyl)cytosine

5-bromo-cytidine

5-iodo-cytidine

5-propynyl cytosine

6-(azo)cytosine

6-aza-cytidine

aza cytosine

deaza cytosine

N4 (acetyl)cytosine

1-methyl-1-deaza-pseudoisocytidine

1-methyl-pseudoisocytidine

2-methoxy-5-methyl-cytidine

2-methoxy-cytidine

2-thio-5-methyl-cytidine

4-methoxy-1-methyl-pseudoisocytidine

4-methoxy-pseudoisocytidine

4-thio-1-methyl-1-deaza-pseudoisocytidine

4-thio-1-methyl-pseudoisocytidine

4-thio-pseudoisocytidine

5-aza-zebularine

5-methyl-zebularine

pyrrolo-pseudoisocytidine

zebularine

(E)-5-(2-Bromo-vinyl)cytidine

2,2′-anhydro-cytidine

2′-Fluor-N4-Bz-cytidine

2′-Fluoro-N4-Acetyl-cytidine

2′-O-Methyl-N4-Acetyl-cytidine

2′-O-methyl-N4-Bz-cytidine

2′-a-Ethynylcytidine

2′-a-Trifluoromethylcytidine

2′-b-Ethynylcytidine

2′-b-Trifluoromethylcytidine

2′-Deoxy-2′,2′-difluorocytidine

2′-Deoxy-2′-alpha-mercaptocytidine

2′-Deoxy-2′-alpha-thiomethoxycytidine

2′-Deoxy-2′-betab-aminocytidine

2′-Deoxy-2′-beta-azidocytidine

2′-Deoxy-2′-beta-bromocytidine

2′-Deoxy-2′-beta-chlorocytidine

2′-Deoxy-2′-beta-fluorocytidine

2′-Deoxy-2′-beta-iodocytidine

2′-Deoxy-2′-beta-mercaptocytidine

2′-Deoxy-2′-beta-thiomethoxycytidine TP

2′-O-Methyl-5-(1-propynyl)cytidine

3′-Ethynylcytidine

4′-Azidocytidine

4′-Carbocyclic cytidine

4′-Ethynylcytidine

5-(1-Propynyl)ara-cytidine

5-(2-Chloro-phenyl)-2-thiocytidine

5-(4-Amino-phenyl)-2-thiocytidine

5-Aminoallyl-cytosine

5-Cyanocytidine

5-Ethynylara-cytidine

5-Ethynylcytidine

5′-Homo-cytidine

5-Methoxycytidine

5-Trifluoromethyl-Cytidine

N4-Amino-cytidine

N4-Benzoyl-cytidine

pseudoisocytidine

6-thio-guanosine

7-deaza-guanosine

8-oxo-guanosine

Nl-methyl-guanosine

alpha-thio-guanosine

2-(propyl)guanine

2-(alkyl)guanine

2′-Amino-2′-deoxy-guanosine

2′-Azido-2′-deoxy-guanosine

2′-Deoxy-2′-alpha-aminoguanosine

2′-Deoxy-2′-alpha-azidoguanosine

6-(methyl)guanine

6-(alky1)guanine

6-(methyl)guanine

6-methyl-guanosine

7-(alkyl)guanine

7-(deaza)guanine

7-(methyl)guanine

7-(alkyl)guanine

7-(deaza)guanine

7-(methyl)guanine

8-(alkyl)guanine

8-(alkynyl)guanine

8-(halo)guanine

8-(thioalkyl)guanine

8-(alkenyl)guanine

8-(alkyl)guanine

8-(alkynyl)guanine

8-(amino)guanine

8-(halo)guanine

8-(hydroxyl)guanine

8-(thioalkyl)guanine

8-(thiol)guanine

azaguanine

deaza guanine

N (methyl)guanine

N-(methyl)guanine

1-methyl-6-thio-guanosine

6-methoxy-guanosine

6-thio-7-deaza-8-aza-guanosine

6-thio-7-deaza-guanosine

6-thio-7-methyl-guanosine

7-deaza-8-aza-guanosine

7-methyl-8-oxo-guanosine

N2,N2-dimethyl-6-thio-guanosine

N2-methyl-6-thio-guanosine

1-Me-guanosine

2′Fluoro-N2-isobutyl-guanosine

2′O-methyl-N2-isobutyl-guanosine

2′-alpha-Ethynylguanosine

2′-alpha-Trifluoromethylguanosine

2′-beta-Ethynylguanosine

2′-beta-Trifluoromethylguanosine

2′-Deoxy-2′,2′-difluoroguanosine

2′-Deoxy-2′-alpha-mercaptoguanosine

2′-Deoxy-2′-alpha-thiomethoxyguanosine

2′-Deoxy-2′-beta-aminoguanosine

2′-Deoxy-2′-beta-azidoguanosine

2′-Deoxy-2′-beta-bromoguanosine

2′-Deoxy-2′-beta-chloroguanosine

2′-Deoxy-2′-beta-fluoroguanosine

2′-Deoxy-2′-beta-mercaptoguanosine

2′-Deoxy-2′-beta-iodoguanosine

2′-Deoxy-2′-beta-thiomethoxyguanosine

4′-Azidoguanosine

4′-Carbocyclic guanosine

4′-Ethynylguanosine

5′-Homo-guanosine

8-bromo-guanosine

9-Deazaguanosine

N2-isobutyl-guanosine

7-methylinosine

allyamino-thymidine

aza thymidine

deaza thymidine

deoxy-thymidine

5-propynyl uracil

alpha-thio-uridine

1-(aminoalkylamino-carbonylethylenyl)-2(thio)-pseudouracil

1-(aminoalkylaminocarbonylethylenyl)-2,4-(dithio)pseudouracil

1-(aminoalkylaminocarbonylethylenyl)-4 (thio)pseudouracil

1-(aminoalkylaminocarbonylethylenyl)-pseudouracil

1-(aminocarbonylethylenyl)-2(thio)-pseudouracil

1-(aminocarbonylethylenyl)-2,4-(dithio)pseudouracil

1-(aminocarbonylethylenyl)-4 (thio)pseudouracil

1-(aminocarbonylethylenyl)-pseudouracil

1-substituted 2-(thio)-pseudouracil

1-substituted 2,4-(dithio)pseudouracil

1-substituted 4 (thio)pseudouracil

1-substituted pseudouracil

1-(aminoalkylamino-carbonylethylenyl)-2-(thio)-pseudouracil

1-Methyl-3-(3-amino-3-carboxypropyl) pseudouridine

1-Methyl-3-(3-amino-3-carboxyproovl)pseudo-Uradine

1-Methyl-pseudo-UTP

2 (thio)pseudouracil

2′ deoxy uridine

2′ fluorouridine

2-(thio)uracil

2,4-(dithio)psuedouracil

2′-methyl, 2′-amino, 2′azido, 2′fluro-guanosine

2′-Amino-2′-deoxy-uridine

2′-Azido-2′-deoxy-uridine

2′-Azido-deoxyuridine

2′-O-methylpseudouridine

2′ deoxyuridine

2′ fluorouridine

2′-Deoxy-2′-alpha-aminouridine TP

2′-Deoxy-2′-alpha-azidouridine TP

2-methylpseudouridine

3-(3 amino-3-carboxypropyl)uracil

4-(thio)pseudouracil

4-(thio)pseudouracil

4-(thio)uracil

4-thiouracil

5-(1,3-diazole-1-alkyl)uracil

5-(2-aminopropyl)uracil

5-(aminoalkyl)uracil

5-(dimethylaminoalkyl)uracil

5-(guanidiniumalkyl)uracil

5-(methoxycarbonylmethyl)-2-(thio)uracil

5-(methoxycarbonyl-methyl)uracil

5-(methyl)-2-(thio)uracil

5-(methyl)-2,4-(dithio)uracil

5 (methyl) 4 (thio)uracil

5 (methylaminomethyl)-2 (thio)uracil

5 (methylaminomethyl)-2,4 (dithio)uracil

5 (methylaminomethyl)-4 (thio)uracil

5 (propynyl)uracil

5 (trifluoromethyl)uracil

5-(2-aminopropyl)uracil

5-(alky1)-2-(thio)pseudouracil

5-(alkyl)-2,4 (dithio)pseudouracil

5-(alky1)-4 (thio)pseudouracil

5-(alkyl)pseudouracil

5-(alkyl)uracil

5-(alkynyl)uracil

5-(allylamino)uracil

5-(cyanoalkyl)uracil

5-(dialkylaminoalkyl)uracil

5-(dimethylaminoalkyl)uracil

5-(guanidiniumalkyl)uracil

5-(halo)uracil

5-(1,3-diazole-1-alkyl)uracil

5-(methoxy)uracil

5-(methoxycarbonylmethyl)-2-(thio)uracil

5-(methoxycarbonyl-methyl)uracil

5-(methyl) 2(thio)uracil

5-(methyl) 2,4 (dithio)uracil

5-(methyl) 4 (thio)uracil

5-(methyl)-2-(thio)pseudouracil

5-(methyl)-2,4 (dithio)pseudouracil

5-(methyl)-4 (thio)pseudouracil

5-(methyl)pseudouracil

5-(methylaminomethyl)-2 (thio)uracil

5-(methylaminomethyl)-2,4(dithio)uracil

5-(methylaminomethyl)-4-(thio)uracil

5-(propyny1)uracil

5-(trifluoromethyl)uracil

5-aminoallyl-uridine

5-bromo-uridine

5-iodo-uridine

5-uracil

6 (azo)uracil

6-(azo)uracil

6-aza-uridine

allyamino-uracil

aza uracil

deaza uracil

N3 (methyl)uracil

Pseudo-uridine-1-2-ethanoic acid

pseudouracil

4-Thio-pseudouridine

1-carboxymethyl-pseudouridine

1-methyl-1-deaza-pseudouridine

1-propynyl-uridine

1-taurinomethyl-1-methyl-uridine

1-taurinomethyl-4-thio-uridine

1-taurinomethyl-pseudouridine

2-methoxy-4-thio-pseudouridine

2-thio-1-methyl-1-deaza-pseudouridine

2-thio-1-methyl-pseudouridine

2-thio-5-aza-uridine

2-thio-dihydropseudouridine

2-thio-dihydrouridine

2-thio-pseudouridine

4-methoxy-2-thio-pseudouridine

4-methoxy-pseudouridine

4-thio-1-methyl-pseudouridine

4-thio-pseudouridine

5-aza-uridine

dihydropseudouridine

(±)1-(2-Hydroxypropyl)pseudouridine

(2S)-1-(2-Hydroxypropyl)pseudouridine

(2R)-1-(2-Hydroxypropyl)pseudouridine

(E)-5-(2-Bromo-vinyl)ara-uridine

(E)-5-(2-Bromo-vinyl)uridine

(Z)-5-(2-Bromo-vinyl)ara-uridine

(Z)-5-(2-Bromo-vinyl)uridine

1-(2,2,2-Trifluoroethyl)-pseudouridine

1-(2,2,3,3,3-Pentafluoropropyl)pseudouridine

1-(2,2-Diethoxyethy1)pseudouridine

1-(2,4,6-Trimethylbenzyl)pseudouridine

1-(2,4,6-Trimethyl-benzyl)pseudo-uridine

1-(2,4,6-Trimethyl-phenyl)pseudo-uridine

1-(2-Amino-2-carboxyethyl)pseudo-uridine

1-(2-Amino-ethyl)pseudouridine

1-(2-Hydroxyethyl)pseudouridine

1-(2-Methoxyethyl)pseudouridine

1-(3,4-Bis-trifluoromethoxvbenzvl)pseudouridine

1-(3,4-Dimethoxybenzyl)pseudouridine

1-(3-Amino-3-carboxypropyl)pseudo-uridine

1-(3-Amino-propyl)pseudouridine

1-(3-Cyclopropyl-prop-2-ynyl)pseudouridine TP

1-(4-Amino-4-carboxybutyl)pseudouridine

1-(4-Amino-benzyl)pseudouridine

1-(4-Amino-butyl)pseudouridine

1-(4-Amino-phenyl)pseudouridine

1-(4-Azidobenzyl)pseudouridine

1-(4-Bromobenzyl)pseudouridine

1-(4-Chlorobenzyl)pseudouridine

1-(4-Fluorobenzyl)pseudouridin

1-(4-Iodobenzyl)pseudouridine

1-(4-Methanesulfonvlbenzvl)pseudouridine

1-(4-Methoxybenzyl)pseudouridine

1-(4-Methoxy-benzyl)pseudouridine

1-(4-Methoxy-phenyl)pseudouridine

1-(4-Methylbenzyl)pseudouridine

1-(4-Methyl-benzyl)pseudouridine

1-(4-Nitrobenzyl)pseudouridine

1-(4-Nitro-benzy!)pseudouridine

1(4-Nitro-phenyl)pseudouridine

1-(4-Thiomethoxybenzyl)pseudouridine

1-(4-Trifluoromethoxybenzvl)pseudouridine

1-(4-Trifluoromethylbenzyl)pseudouridine

1-(5-Amino-pentyl)pseudouridine

1-(6-Amino-hexyl)pseudouridine

1,6-Dimethyl-pseudouridine

1-[3-(2-{2-[2-(2-Aminoethoxy)-ethoxy]-ethoxy}-ethoxy)-propionyl]pseudouridine

1-{3-[2-(2-Aminoethoxy)-ethoxy]-propionvl} pseudouridine

1-Acetylpseudouridine

1-Alkyl-6-(1-propynyl)-pseudo-uridine

1-Alkyl-6-(2-propynyl)-pseudo-uridine

1-Alkyl-6-allyl-pseudo-uridine

1-Alkyl-6-ethynyl-pseudo-uridine

1-Alkyl-6-homoallyl-pseudo-uridine

1-Alkyl-6-vinyl-pseudo-uridine

1-Allylpseudouridine

1-Aminomethyl-pseudo-uridine

1-Benzoylpseudouridine

1-Benzyloxymethylpseudouridine

1-Benzyl-pseudo-uridine

1-Biotinyl-PEG2-pseudouridine

1-Biotinylpseudouridine

1-Butyl-pseudo-uridine

1-Cyanomethylpseudouridine

1-Cyclobutylmethyl-pseudo-uridine

1-Cyclobutyl-pseudo-uridine

1-Cycloheptylmethyl-pseudo-uridine

1-Cycloheptyl-pseudo-uridine

1-Cyclohexylmethyl-pseudo-uridine

1-Cyclohexyl-pseudo-uridine

1-Cyclooctylmethyl-pseudo-uridine

1-Cyclooctyl-pseudo-uridine

1-Cyclopentylmethyl-pseudo-uridine

1-Cyclopentyl-pseudo-uridine

1-Cyclopropylmethyl-pseudo-uridine

1-Cyclopropyl-pseudo-uridine

1-Ethyl-pseudo-uridine

1-Hexyl-pseudo-uridine

1-Homoallylpseudouridine

1-Hydroxymethylpseudouridine

1-iso-propyl-pseudo-uridine

1-Me-2-thio-pseudo-uridine

1-Me-4-thio-pseudo-uridine

1-Me-alpha-thio-pseudo-uridine

1-Methanesulfonylmethylpseudouridine

1-Methoxymethylpseudouridine uridine

1-Methyl-6-(2,2,2-Trifluoroethyl)pseudo-uridine

1-Methyl-6-(4-morpholino)-pseudo-uridine

1-Methyl-6-(4-thiomorpholino)-pseudo-uridine

1-Methyl-6-(substituted phenyl)pseudo-uridine

1-Methyl-6-amino-pseudo-uridine

1-Methyl-6-azido-pseudo-uridine

1-Methyl-6-bromo-pseudo-uridine

1-Methyl-6-butyl-pseudo-uridine

1-Methyl-6-chloro-pseudo-uridine

1-Methyl-6-cyano-pseudo-uridine

1-Methyl-6-dimethylamino-pseudo-uridine

1-Methyl-6-ethoxy-pseudo-uridine

1-Methyl-6-ethylcarboxylate-pseudo-uridine

1-Methyl-6-ethyl-pseudo-uridine

1-Methyl-6-fluoro-pseudo-uridine

1-Methyl-6-formyl-pseudo-uridine

1-Methyl-6-hydroxyamino-pseudo-uridine

1-Methyl-6-hydroxy-pseudo-uridine

1-Methyl-6-iodo-pseudo-uridine

1-Methyl-6-iso-propyl-pseudo-uridine

1-Methyl-6-methoxy-pseudo-uridine

1-Methyl-6-methylamino-pseudo-uridine

1-Methyl-6-phenyl-pseudo-uridine

1-Methyl-6-propyl-pseudo-uridine

1-Methyl-6-tert-butyl-pseudo-uridine

1-Methyl-6-trifluoromethoxy-pseudo-uridine

1-Methyl-6-trifluoromethyl-pseudo-uridine

1-Morpholinomethylpseudouridine

1-Pentyl-pseudo-uridineuridine

1-Phenyl-pseudo-uridine

1-Pivaloylpseudouridine

1-Propargylpseudouridine

1-Propyl-pseudo-uridine

1-propynyl-pseudouridine

1-p-tolyl-pseudo-uridine

1-tert-Butyl-pseudo-uridine

1-Thiomethoxymethylpseudouridine

1-Thiomorpholinomethylpseudouridine

1-Trifluoroacetylpseudouridine

1-Trifluoromethyl-pseudouridine

1-Vinylpseudouridine

2,2′-anhydro-uridine

2′-bromo-deoxyuridine

2′-F-5-Methyl-2′-deoxy-uridine

2′-OMe-5-Me-uridine

2′-OMe-pseudouridine

2′-alpha-Ethynyluridine

2′-alpha-Trifluoromethyluridine

2′-beta-Ethynyluridine

2′-beta-Trifluoromethyluridiner

2′-Deoxy-2′,2′-difluorouridine

2′-Deoxy-2′-a-mercaptouridin

2′-Deoxy-2′-alpha-thiomethoxyuridine

2′-Deoxy-2′-beta-aminouridine

2′-Deoxy-2′-beta-azidouridine

2′-Deoxy-2′-beta-bromouridine

2′-Deoxy-2′-beta-chlorouridine

2′-Deoxy-2′-beta-fluorouridine

2′-Deoxy-2′-beta-iodouridine

2′-Deoxy-2′-beta-mercaptouridine

2′-Deoxy-2′-beta-thiomethoxyuridine

2-methoxy-4-thio-uridine

2-methoxyuridine

2′-O-Methyl-5-(1-propynyl)uridine

3-Alkyl-pseudo-uridine

4′-Azidouridine

4′-Carbocyclic uridine

4′-Ethynyluridine

5-(1-Propynyl)ara-uridine

5-(2-Furanyl)uridine

5-Cyanouridine

5-Dimethylaminouridine

5′-Homo-uridine

5-iodo-2′-fluoro-deoxyuridine

5-Phenylethynyluridine

5-Trideuteromethyl-6-deuterouridine

5-Trifluoromethyl-Uridine

5-Vinylarauridine

6-(2,2,2-Trifluoroethyl)-pseudo-uridine

6-(4-Morpholino)-pseudo-uridine

6-(4-Thiomorpholino)-pseudo-uridine

6-(Substituted-Phenyl)-pseudo-uridine

6-Amino-pseudo-uridine

6-Azido-pseudo-uridine

6-Bromo-pseudo-uridine

6-Butyl-pseudo-uridine

6-Chloro-pseudo-uridine

6-Cyano-pseudo-uridine

6-Dimethylamino-pseudo-uridine

6-Ethoxy-pseudo-uridine

6-Ethylcarboxylate-pseudo-uridine

6-Ethyl-pseudo-uridine

6-Fluoro-pseudo-uridine

6-Formyl-pseudo-uridine

6-Hydroxyamino-pseudo-uridine

6-Hydroxy-pseudo-uridine

6-Iodo-pseudo-uridine

6-iso-Propyl-pseudo-uridine

6-Methoxy-pseudo-uridine

6-Methylamino-pseudo-uridine

6-Methyl-pseudo-uridine

6-Phenyl-pseudo-uridine

6-Phenyl-pseudo-uridine

6-Propyl-pseudo-uridine

6-tert-Butyl-pseudo-uridine

6-Trifluoromethoxy-pseudo-uridine

6-Trifluoromethyl-pseudo-uridine

Alpha-thio-pseudo-uridine

Pseudouridine 1-(4-methylbenzenesulfonic acid) TP

Pseudouridine 1-(4-methylbenzoic acid) TP

Pseudouridine 1-[3-(2-ethoxy)]propionic acid

Pseudouridine 1-[3-{2-(2-[2-(2-ethoxy)-ethoxy]-ethoxy)-ethoxy}]propionic acid

Pseudouridine 1-[3-{2-(2-[2-{2(2-ethoxy)-ethoxy}-ethoxy]-ethoxy)-ethoxy}]propionic acid

Pseudouridine 1-[3-{2-(2-[2-ethoxy]-ethoxy)-ethoxv}]propionic acid

Pseudouridine 1-[3-{2-(2-ethoxy)-ethoxy}] propionic acid

Pseudouridine 1-methylphosphonicacid

Pseudouridine TP 1-methylphosphonic acid diethyl ester

Pseudo-uridine-N1-3-propionic acid

Pseudo-uridine-N1-4-butanoic acid

Pseudo-uridine-N 1-5-pentanoic acid

Pseudo-uridine-N1-6-hexanoic acid

Pseudo-uridine-N1-7-heptanoic acid

Pseudo-uridine-N1-methyl-p-benzoic acid

Pseudo-uridine-N1-p-benzoic acid

In an embodiment, a TREM, a TREM core fragment or a TREM fragment described herein comprises a modification provided in Table 6, or a combination thereof. The modifications provided in Table 6 occur naturally in RNAs, and are used herein on a synthetic TREM, a TREM core fragment or a TREM fragment at a position that does not occur in nature.

TABLE 6

Additional exemplary modifications

Name

2-methylthio-N6-(cis-hvdroxvisopentenvl)adenosine

2-methylthio-N6-methyladenosine

2-methylthio-N6-threonyl carbamoyladenosine

N6-glycinylcarbamoyladenosine

N6-isopentenyladenosine

N6-methyladenosine

N6-threonylcarbamoyladenosine

1,2′-O-dimethyladenosine

1-methyladenosine

2′-O-methyladenosine

2′-O-ribosyladenosine (phosphate)

2-methyladenosine

2-methylthio-N6 isopentenyladenosine

2-methylthio-N6-hydroxynorvalyl carbamoyladenosine

2′-O-methyladenosine

2′-O-ribosyladenosine (phosphate)

isopenteny ladenosine

N6-(cis-hydroxyisopentenyl)adenosine

N6,2′-O-dimethyladenosine

N6,2′-O-dimethyladenosine

N6,N6,2′-O-trimethyladenosine

N6,N6-dimethyladenosine

N6-acetyladenosine

N6-hydroxynorvalylcarbamoyladenosine

N6-methyl-N6-threonylcarbamoyladenosine

2-methyladenosine

2-methylthio-N⁶-isopentenyladenosine

2-thiocytidine

3-methylcytidine

5-formylcytidine

5-hydroxymethylcytidine

5-methylcytidine

N4-acetylcytidine

2′-O-methylcytidine

2′-O-methylcytidine

5,2′-O-dimethylcytidine

5-formyl-2′-O-methylcytidine

lysidine

N4,2′-O-dimethy lcytidine

N4-acetyl-2′-O-methylcytidine

N4-methylcytidine

N4,N4-Dimethyl-2′-OMe-Cytidine

7-methylguanosine

N2,2′-O-dimethylguanosine

N2-methylguanosine

wyosme

1,2′-O-dimethylguanosine

1-methylguanosine

2′-O-methylguanosine

2′-O-ribosylguanosine (phosphate)

2′-O-methylguanosine

2′-O-ribosylguanosine (phosphate)

7-aminomethyl-7-deazaguanosine

7-cyano-7-deazaguanosine

archaeosine

methylwyosine

N2,7-dimethylguanosine

N2,N2,2′-O-trimethylguanosine

N2,N2,7-trimethylguanosine

N2,N2-dimethylguanosine

N2,7,2 ′-O-trimethylguanosine

1-methylinosine

mosme

1,2′-O-dimethylinosine

2′-O-methylinosine

2′-O-methylinosine

epoxyqueuosine

galactosyl-queuosine

mannosyl-queuosine

2′-O-methyluridine

2-thiouridine

3-methyluridine

5-carboxymethyluridine

5-hydroxyuridine

5-methyluridine

5-taurinomethyl-2-thiouridine

5-taurinomethyluridine

dihydrouridine

pseudouridine

(3-(3-amino-3-carboxypropyl)uridine

1-methyl-3-(3-amino-5-carboxypropyl)pseudouridine

1-methylpseduouridine

1-methyl-pseudouridine

2′-O-methyluridine

2′-O-methylpseudouridine

2′-O-methyluridine

2-thio-2′-O-methyluridine

3-(3-amino-3-carboxypropyl)uridine

3,2′-0-dimethyluridine

3-Methyl-pseudo-Uridine

4-thiouridine

5-(carboxyhydroxymethyl)uridine

5-(carboxyhydroxymethyl)uridine methyl ester

5,2′-O-dimethyluridine

5,6-dihydro-uridine

5-aminomethy1-2-thiouridine

5-carbamoylmethyl-2′-0-methyluridine

5-carbamoylmethyluridine

5-carboxyhydroxymethyluridine

5-carboxyhydroxymethyluridine methyl ester

5-carboxymethylaminomethyl-2′-O-methyluridine

5-carboxymethylaminomethyl-2-thiouridine

5-carboxymethylaminomethyl-2-thiouridine

5-carboxymethylaminomethyluridine

5-carboxymethylaminomethyluridine

5-Carbamoylmethyluridine

5-methoxycarbonylmethyl-2′-O-methyluridine

5-methoxycarbonylmethy1-2-thiouridine

5-methoxycarbonylmethyluridine

5-methoxyuridine

5-methyl-2-thiouridine

5-methylaminomethyl-2-selenouridine

5-methylaminomethyl-2-thiouridine

5-methylaminomethyluridine

5-Methyldihydrouridine

5-Oxyacetic acid-Uridine

5-Oxyacetic acid-methyl ester-Uridin Nl-methyl-pseudo-uridine

uridine 5-oxyacetic acid

uridine 5-oxyacetic acid methyl ester

3-(3-Amino-3-carboxypropyl)-Uridine

5-(iso-Pentenylaminomethyl)-2-thiouridine

5-(iso-Pentenylaminomethyl)-2′-O-methyluridine

5-(iso-Pentenylaminomethyl)uridine

wybutosine

hydroxywybutosine

isowyosme

peroxywybutosine

undermodified hydroxywybutosine

4-demethylwyosine

altriol

In an embodiment, a TREM, a TREM core fragment or a TREM fragment described herein comprises a chemical modification provided in Table 7, or a combination thereof.

TABLE 7

Additional exemplary chemical modifications

Name

2,6-(diamino)purine

1-(aza)-2-(thio)-3-(aza)-phenoxazin-1-yl

1,3-(diaza)-2-(oxo)-phenthiazin-1-yl

1,3-(diaza)-2-(oxo)-phenoxazin-1-yl

1,3,5-(triaza)-2,6-(dioxa)-naphthalene

2 (amino)purine

2,4,5-(trimethyl)phenyl

2′ methyl, 2′amino, 2′azido, 2′fluro-cytidine

2′ methyl, 2′amino, 2′azido, 2′fluro-adenine

2′methyl, 2′amino, 2′azido, 2′fluro-uridine

2′-amino-2′-deoxyribose

2-amino-6-Chloro-purine

2-aza-inosinyl

2′-azido-2′-deoxyribose

2′fluoro-2′-deoxyribose

2′-fluoro-modified bases

2′-O-methyl-ribose

2-oxo-7-aminopyridopyrimidin-3-yl

2-oxo-pyridopyrimidine-3-yl

2-pyridinone

3 nitropyrrole

3-(methyl)-7-(propynyl)isocarbostyrilyl

3-(methyl)isocarbostyrilyl

4-(fluoro)-6-(methyl)benzimidazole

4-(methyl)benzimidazole

4-(methyl)indolyl

4,6-(dimethyl)indolyl

5 nitroindole

5 substituted pyrimidines

5-(methyl)isocarbostyrilyl

5-nitroindole

6-(aza)pyrimidine

6-(azo)thymine

6-(methyl)-7-(aza)indolyl

6-chloro-purine

6-phenyl-pyrrolo-pyrimidin-2-on-3-yl

7-(aminoalkylhydroxy)-1-(aza)-2-(thio)-3-(aza)-phenthiazin-1-yl

7-(aminoalkylhydroxy)-1-(aza)-2-(thio)-3-(aza)-phenoxazin-1-yl

7-(aminoalkylhydroxy)-1,3-(diaza)-2-(oxo)-phenoxazin-1-yl

7-(aminoalkylhydroxy)-1,3-(diaza)-2-(oxo)-phenthiazin-1-yl

7-(aminoalkylhydroxy)-1,3-(diaza)-2-(oxo)-phenoxazin-1-yl

7-(aza)indolyl

7-(guanidiniumalkylhydroxy)-1-(aza)-2-(thio)-3-(aza)-phenoxazinl-yl

7-(guanidiniumalkylhydroxy)-1-(aza)-2-(thio)-3-(aza)-phenthiazin-1-yl

7-(guanidiniumalkylhydroxy)-1-(aza)-2-(thio)-3-(aza)-phenoxazin-1-yl

7-(guanidiniumalkylhydroxy)-1,3-(diaza)-2-(oxo)-phenoxazin-1-yl

7-(guanidiniumalkyl-hydroxy)-1,3-(diaza)-2-(oxo)-phenthiazin-1-yl

7-(guanidiniumalkylhydroxy)-1,3-(diaza)-2-(oxo)-phenoxazin-1-yl

7-(propynyl)isocarbostyrilyl

7-(propynyl)isocarbostyrilyl, propynyl-7-(aza)indolyl

7-deaza-inosinyl

7-substituted 1-(aza)-2-(thio)-3-(aza)-phenoxazin-1-yl

7-substituted 1,3-(diaza)-2-(oxo)-phenoxazin-1-yl

9-(methyl)-imidizopyridinyl

aminoindolyl

anthracenyl

bis-ortho-(aminoalkylhydroxy)-6-phenyl-pyrrolo-nvrimidin-2-on-3-yl

bis-ortho-substituted-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl

difluorotolyl

hypoxanthine

imidizopyridinyl

inosinyl

isocarbostyrilyl

isoguanosine

N2-substituted purines

N6-methyl-2-amino-purine

N6-substituted purines

N-alkylated derivative

napthalenyl

nitrobenzimidazolyl

nitroimidazolyl

nitroindazolyl

nitropyrazolyl

nubularine

O-alkylated derivative

O6-substituted purines

ortho-(aminoalkylhydroxy)-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl

ortho-substituted-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl

Oxoformycin TP

para-(aminoalkylhydroxy)-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl

para-substituted-6-phenyl-pyrrolo-pyrimidin-2-on-3-yl

pentacenyl

phenanthracenyl

phenyl

propynyl-7-(aza)indolyl

pyrenyl

pyridopyrimidin-3-yl

pyridopyrimidin-3-yl, 2-oxo-7-amino-pyridopyrimidin-3-yl

pyrrolo-pyrimidin-2-on-3-yl

pyrrolopyrimidinyl

pyrrolopyrizinyl

stilbenzyl

substituted 1,2,4-triazoles

tetraceny1

tubercidine

xanthine

Xanthosine

2-thio-zebularine

5-aza-2-thio-zebularine

7-deaza-2-amino-purine

pyridin-4-one ribonucleoside

2-Amino-riboside

Formycin A

Formycin B

Pyrrolosine

2′-OH-ara-adenosine

2′-OH-ara-cytidine

2′-OH-ara-uridine

2′-OH-ara-guanosine

5-(2-carbomethoxyvinyl)uridine

N6-(19-Amino-pentaoxanonadecyl)adenosine

In an embodiment, a TREM, a TREM core fragment or a TREM fragment described herein comprises a chemical modification provided in Table 8, or a combination thereof.

TABLE 8

Exemplary backbone modifications

Name

3′-alkylene phosphonates

3′-amino phosphoramidate

alkene containing backbones

aminoalkylphosphoramidates

aminoalkylphosphotriesters

boranophosphates

—CH2-0-N(CH3)—CH2—

—CH2—N(CH3)—N(CH3)—CH2—

—CH2—NH—CH2—

chiral phosphonates

chiral phosphorothioates

formacetyl and thioformacetyl backbones

methylene (methylimino)

methylene formacetyl and thioformacetyl backbones

methyleneimino and methylenehydrazino backbones

morpholino linkages

—N(CH3)—CH2—CH2—

oligonucleosides with heteroatom intenucleoside linkage

phosphinates

phosphoramidates

phosphorodithioates

phosphorothioate intenucleoside linkages

phosphorothioates

phosphotriesters

PNA

siloxane backbones

sulfamate backbones

sulfide sulfoxide and sulfone backbones

sulfonate and sulfonamide backbones

thionoalkylphosphotriesters

thionoalkylphosphonates

thionophosphoramidates

methylphosphonates

phosphonoacetates

Phosphorothioate

Constrained nucleic acid (CNA)

2′-O-methyl

2′-O-methoxyethyl (MOE)

2′ Fluoro

Locked nucleic acid (LNA)

(S)-constrained ethyl (cEt)

Fluoro hexitol nucleic acid (FHNA)

5′-phosphorothioate

Phosphorodiamidate Morpholino Oligomer (PMO)

Tricyclo-DNA (tcDNA)

(S) 5′-C-methyl

(E)-vinylphosphonate

Methyl phosphonate

(S) 5′-C-methyl with phosphate

(R) 5′-C-methyl with phosphate

DNA

(R) 5′-C-methyl

GNA (glycol nucleic acid)

alkyl phosphonates

Phosphorothioate

Constrained nucleic acid (CNA)

2′-O-methyl

2′-O-methoxyethyl (MOE)

2′ Fluoro

Locked nucleic acid (LNA)

(S)-constrained ethyl (cEt)

Fluoro hexitol nucleic acid (FHNA)

5′-phosphorothioate

Phosphorodiamidate Morpholino Oligomer (PMO)

Tricyclo-DNA (tcDNA)

(S) 5′-C-methyl

(E)-vinylphosphonate

Methyl phosphonate

(S) 5′-C-methyl with phosphate

(R) 5′-C-methyl with phosphate

DNA

(R) 5′-C-methyl

GNA (glycol nucleic acid)

alkyl phosphonates

In an embodiment, a TREM, a TREM core fragment or a TREM fragment described herein comprises a non-naturally occurring modification provided in Table 9, or a combination thereof.

TABLE 9

Exemplary non-naturally occurring backbone modifications

Name of synthetic backbone modifications

Phosphorothioate

Constrained nucleic acid (CNA)

2′ O′methylation

2-O-methoxyethylribose (MOE)

2 Fluoro

Locked nucleic acid (LNA)

(S)-constrained ethyl (cEt)

Fluoro hexitol nucleic acid (FHNA)

5 phosphorothioate

Phosphorodiamidate Morpholino Oligomer (PMO)

Tricyclo-DNA (tcDNA)

(S) 5-C-methyl

(E)-vinylphosphonate

Methyl phosphonate

(S) 5-C-methyl with phosphate

TREM, TREM Core Fragment and TREM Fragment Fusions

In an embodiment, a TREM, a TREM core fragment or a TREM fragment disclosed herein comprises an additional moiety, e.g., a fusion moiety. In an embodiment, the fusion moiety can be used for purification, to alter folding of the TREM, TREM core fragment or TREM fragment, or as a targeting moiety. In an embodiment, the fusion moiety can comprise a tag, a linker, can be cleavable or can include a binding site for an enzyme. In an embodiment, the fusion moiety can be disposed at the N terminal of the TREM or at the C terminal of the TREM, TREM core fragment or TREM fragment. In an embodiment, the fusion moiety can be encoded by the same or different nucleic acid molecule that encodes the TREM, TREM core fragment or TREM fragment.

TREM Consensus Sequence

In an embodiment, a TREM disclosed herein comprises a consensus sequence provided herein.

In an embodiment, a TREM disclosed herein comprises a consensus sequence of Formula I_ZZZ, wherein zzz indicates any of the twenty amino acids and Formula I corresponds to all species.

In an embodiment, a TREM disclosed herein comprises a consensus sequence of Formula II_ZZZ, wherein zzz indicates any of the twenty amino acids and Formula II corresponds to mammals.

In an embodiment, a TREM disclosed herein comprises a consensus sequence of Formula III_ZZZ, wherein zzz indicates any of the twenty amino acids and Formula III corresponds to humans.

In an embodiment, zzz indicates any of the twenty amino acids: alanine, arginine, asparagine, aspartate, cysteine, glutamine, glutamate, glycine, histidine, isoleucine, methionine, leucine, lysine, phenylalanine, proline, serine, threonine, tryptophan, tyrosine, or valine.

In an embodiment, a TREM disclosed herein comprises a property selected from the following:

- a) under physiological conditions residue R₀forms a linker region, e.g., a Linker 1 region;
- b) under physiological conditions residues R₁-R₂-R₃-R₄-R₅-R₆-R₇and residues R₆₅-R₆₆-R₆₇-R₆₈-R₆₉-R₇₀-R₇₁form a stem region, e.g., an AStD stem region;
- c) under physiological conditions residues R₈-R₉forms a linker region, e.g., a Linker 2 region;
- d) under physiological conditions residues -R₁₀-R₁₁-R₁₂-R₁₃-R₁₄R₁₅-R₁₆-R₁₇-R₁₈-R₁₉-R₂₀-R₂₁-R₂₂-R₂₃-R₂₄-R₂₅-R₂₆-R₂₇-R₂₈form a stem-loop region, e.g., a D arm Region;
- e) under physiological conditions residue -R₂₉forms a linker region, e.g., a Linker 3 Region;
- f) under physiological conditions residues -R₃₀-R₃₁-R₃₂-R₃₃-R₃₄-R₃₅-R₃₆-R₃₇-R₃₈-R₃₉-R₄₀-R₄₁-R₄₂-R₄₃-R₄₄-R₄₅-R₄₆form a stem-loop region, e.g., an AC arm region;
- g) under physiological conditions residue -[R₄₇]_xcomprises a variable region, e.g., as described herein;
- h) under physiological conditions residues -R₄₈-R₄₉-R₅₀-R₅₁-R₅₂-R₅₃-R₅₄-R₅₅-R₅₆-R₅₇-R₅₅-R₅₉-R₆₀-R₆₁-R₆₂-R₆₃-R₆₄form a stem-loop region, e.g., a T arm Region; or
- i) under physiological conditions residue R₇₂forms a linker region, e.g., a Linker 4 region.

Alanine TREM Consensus Sequence

In an embodiment, a TREM disclosed herein comprises the sequence of Formula I_ALA(SEQ ID NO: 562),