The technology described herein relates to sensory neurons and altering odor responses.
Scents are perceived by the olfactory sensory neurons (OSNs) that line the upper nasal cavity. Each OSN expresses one odorant receptor, and these odorant receptors contact the scent molecules. Methods for determining which odorant receptor(s) are activated by a scent are lacking, making it difficult to replicate or improve scents.
The inventors have identified a suite of genes whose expression is altered when an odorant receptor is activated by a scent (e.g., one or more volatilized chemical compounds). This discovery has been applied to provide methods of identifying which odorant receptors perceive a scent. The method can be further used to screen scents for those that activate a given odorant receptor or set of odorant receptors, e.g., to provide a mimic of a benchmark scent. The ability to match odors to their receptors is groundbreaking for scent and fragrance use, as it permits the rational design of odors for foods and fragrances.
In one aspect of any of the embodiments, described herein is a method comprising measuring the expression of at least one odor response gene in a first olfactory sensory neuron (OSN) of a first subtype that has been contacted with at least one volatilized chemical compound.
In one embodiment in any of the aspects, the method further comprises measuring the expression of at least one odor response gene in a second OSN of the first subtype that has not been contacted with the at least one volatilized chemical compound. In one embodiment of any of the aspects described herein, the method further comprises determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN is different from a reference level of expression in a second OSN of the first subtype that has not been contacted with the at least one volatilized chemical compound.
In one aspect of any of the embodiments, described herein is a method comprising: measuring the expression of at least one odor response gene in a first OSN of a first subtype that has been contacted with at least one volatilized chemical compound; measuring expression of at least one odor response gene in a second OSN of the first subtype that has not been contacted with the at least one volatilized chemical compound; and determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN and the second OSN are different.
In one embodiment in any of the aspects, the at least one odor response gene is selected from the group consisting of: Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Gm13889, Golga4, Heg1, Cdkn1a, Gpr137c, Zfp655, Ublcp1, Efr3b, Etf1, Srrm4, Epcam, Trp53inp2, Arhgap17, Ric8b, 9530034E10Rik, Oaz2, Ctxn3, Slc38a2, Exoc6b, Nfatc1, Junb, Tle3, Siah2, Fos, Ago2, Arhgef28, Ddx3x, Hspa8, Hnrnph3, Erf, Ak2, Pnrc1, Mafg, D1Ertd622c, Arid5b, Scn3b, Phtf2, Hsph1, Cpeb4, Ubc, Sema7a, Dleu2, Rc3h1, Pcnx4, Btg2, 6430548M08Rik, Selenok, Psmd8, Map9, Cry2, Hspa41, Mcl1, Dnaja1, Fth1, Fam208a, Bmpr2, Zfp608, BC005561, Rab11fip4, Smim13, Ppp2ca, Sod1, Neat1, Brinp2, Tuba4a, Cdc14a, Kdm6b, Uckl1os, Stk40, Dnajb2, Dnajb9, Spsb3, Sub1, Rheb, Manea, Fzd3, Zfand5, Uhrf1bp11, Cebpb, Wdfy3, Mfgc8, Inpp4a, Nab1, Tceal9, Ube2g1, Pcdh7, Eef1a1, Tnrc6c, Cd82, BC005537, Sdcbp, Ddx5, Olfm1, Srpk1, M6pr, Braf, Ahi1, Clcn3, Eef2, Rbm26, Arfgef3, Eif5, Tmem150c, Acbp2, Snrk, Bex3, Epb4112, Samd4b, Stard10, Pappa, Grsf1, Ralgds, Arih1, Micu3, Ptges3, Kras, Morf412, Rtp2, Wdr26, Hspa5, Zmynd8, Sv2c, Paip2, Rab39b, Ywhaz, Gpat3, Psen2, Pde4a, Atp2b1, Irs2, Cfap69, Hspa9, Tob1, Hspa14, Pard6a, Mapk4, Nr4a1, Sik3, Epha7, Hagh, Kalrn, Prkar1a, Lmbrd1, Erich3, Sf3b1, Ksr2, Ybx1, Nfix, Hivep1, Mfn2, Elof1, Wdfy2, Emd, Azin1, Wbp1, Nop53, Syt7, Ncoa1, Smarca5, Aes, Rnf150, Plxnb2, Wdsub1, Dyrk4, Epb4111, Pcdh8, Daam1, Prpf4b, Tnrc6a, Rufy3, Ptdss2, Tmem230, Scn3a, Rock1, Dph3, Zbtb7b, Yipf4, Ift74, Ebf1, Wsb2, Rnf182, Arglu1, Ppm11, Ccdc189, Ptov1, Ccdc157, Ppa1, Pnpla8, Rack1, Rsrp1, Ttc9, Hon2, Ccdc40, Kat2b, Cngb1, Ndufa3, Eefla2, Fmn2, Macrod1, Polr2i, 1700016K19Rik, Jade1, Slcla2, Ralbp1, Eif4c3, Morn2, Trappc21, Scrn1, Rtp1, Phyh, Fetub, Mapre3, Dnajb13, Coa3, Spef1, 1810058124Rik, Cep290, Bbs4, Nin, D430042009Rik, Tex9, Sdc3, Ebf4, Oaz1, Napa, Ttc8, Sdhaf4, Cuta, Commd6, Sfr1, Hcfc1r1, Nxn12, Kcnh3, Trpm7, Glb112, Slc25a39, Nsmf, Pifo, Oscp1, 1110008P14Rik, Egr1, Ttll6, Sem1, Aplg1, Fkbp2, Suclg1, Cidea, Dynlrb2, Ndufa2, Map1a, Sun1, Rufy2, Rnh1, Dtna, Anapc16, Pdhb, A430035B10Rik, Faim2, Ebf2, Ccdc28b, Ttll3, Drc1, Sys1, Prdx5, Palm, Slu7, Dnajc15, Elmod1, Paqr9, Tubb3, Ascc1, Ndufc1, Pon2, Fkbp4, Trp1, Cldn3, Cby1, Tctex1d2, Slc48a1, Nme5, 1110004E09Rik, Cnppd1, Naa38, Vdac3, Puf60, Cers5, Cbr1, Ift43, Fam217a, Ankrd10, Ubc2n, Pfdn6, Mpc2, Ncbp2, Timm13, Lamtor5, Uqer10, Slc27a2, Arpc51, Adh5, Gkap1, Ndufaf4, Psip1, Omp, Bcl11a, Ift81, 2010107E04Rik, Ubxn1, Ech1, Tnrc6b, Tmpo, Ssbp4, 2410015M20Rik, Mapk8ip2, Naxc, Gm13589, Hdgfl3, Slc22a23, Cacna1h, Slc25a5, Ndufa13, Anp32a, Ndufb10, Lhx2, Cenpx, Rnf220, Ankrd54, Ppdpf, Dtd1, Auh, Mrp151, Psmd13, Lamtor4, Hist2h2bc, Mapkap1, Arhgdig, Ndufv3, Ilkap, Ctsa, Ift27, Lpcat4, Atp50, Ypel3, Gad11, Msln1, Hint2, Acbd7, Manf, Lamtor2, Ak1, Arl6, Uxs1, Sri, Tomm7, Ubxn6, Cd47, Snrpd3, Ppid, Rsph9, Rbfox2, Pigp, C1qbp, Fmc1, Bcl7a, Krtcap2, Flrt1, Sdhd, Bphl, Sod2, Ndufb7, Prrc2b, Rex1bd, Rab36, Pdrg1, Pfn2, Rarg, B230118H07Rik, Elavl3, Mlf1, Eif1, Mettl26, Kmt5a, Ndufa5, Ndufb3, Emc8, Bad, Smc3, Dpm3, Ndufa11, Cisd1, Mrpl27, Cldn9, Sec61b, Tpd5212, Polr2c, Ndufa7, Atf5, Gabarap, Srsf3, Psmb3, Bax, Atox1, 2410004P03Rik, Atpla1, Ptprs, Cib1, Hist1h2bc, Atp5k, Hmox2, Fam92b, Chd4, Elob, Kif5a, Tbcb, Ndufa4, Uqer11, Hras, Atp6vlf, Pam16, Pim3, Lrriq1, Nfu1, Supt4a, Swi5, Mtx2, Faim, Taldo1, Ccdc34, Rbm39, 2210016L21Rik, Dpy30, Tomm22, Pygo1, Mat2b, Mrp128, Eid1, Snrpe, Atp5j, Ndufs7, Park7, Ndufs4, Cirbp, Polr2f, Fbx09, Mgst3, Gtf2h5, Mtch1, Mrps24, Actr1b, Sh3bgrl3, Bmpr1a, Pfn1, Tesc, Tmem258, Mrps14, Abhd16a, Vdac2, Nxph3, Polr2g, Them6, Cfap126, Pitpnc1, Kctd1, and Dmkn. In one embodiment of any of the aspects, at least one odor response gene is selected from the group consisting of: Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Nr4a2, Btg2, Egr1, Fos, Fosb, Gm13889, and Junb. In one embodiment of any of the aspects, the at least one odor response gene comprises at least one odor response gene, at least two odor response genes, at least three odor response genes, at least five odor response genes, at least ten odor response genes, at least fifty odor response genes, or at least one hundred odor response genes.
In one embodiment of any of the aspects, the method further comprises measuring the expression of at least one odorant receptor gene.
In one embodiment of any of the aspects, the expression of the at least one odor response gene is measured 2 or more hours after contacting the OSN with the at least one volatilized chemical compound. In one embodiment of any of the aspects, the expression of the at least one odor response gene is measured 3 days or more after contacting the OSN with the at least one volatilized chemical compound.
In one embodiment of any of the aspects, the first subtype is an odorant receptor subtype. In one embodiment of any of the aspects, the first subtype expresses one odorant receptor. In one embodiment of any of the aspects, the odorant receptor is a human odorant receptor. In one embodiment of any of the aspects, the odorant receptor is a murine odorant receptor. In one embodiment of any of the aspects, the human odorant receptor is selected from the group consisting of: human OR family 1, human OR family 2, human OR family 3, human OR family 4, human OR family 5, human OR family 6, human OR family 7, human OR family 8, human OR family 9, human OR family 10, human OR family 11, human OR family 12, human OR family 13, human OR family 14, human OR family 51, human OR family 52, human OR family 55, human OR family 56, and any variant thereof having an amino acid sequence having at least 80% identity with the amino acid sequence of one of said human odorant receptors. In one embodiment of any of the aspects, the odorant receptor is selected from the group consisting of: Olfr 545, Olfr 160, Olfr 17, Olfr 727, Olfr 728, and Olfr 729. In one embodiment of any of the aspects, the odorant receptor is selected from the group consisting of: Olfr 545, Olfr 160, and Olfr 17. In one embodiment of any of the aspects, the murine odorant receptor is selected from the group consisting of: murine OR family 1, murine OR family 2, murine OR family 3, murine OR family 4, murine OR family 5, murine OR family 6, murine OR family 7, murine OR family 8, murine OR family 9, murine OR family 10-15, murine OR family 16-34.
In one embodiment of any of the aspects, the first OSN and/or the second OSN expresses at least one olfactory receptor. In one embodiment of any of the aspects, the first OSN and/or the second OSN is in vitro during the contacting.
In one embodiment of any of the aspects, the first OSN and/or the second OSN is in vivo during the contacting. In one embodiment of any of the aspects, the contacting the first OSN comprises contacting a subject comprising the first OSN with air comprising the at least one volatilized chemical compound.
In one embodiment of any of the aspects, the not contacting the second OSN comprises: occluding the nostrils of a subject comprising the second OSN, or contacting a subject comprising the second OSN with air not comprising the at least one volatilized chemical compound. In one embodiment of any of the aspects, the second OSN is not contacted with the at least one volatilized chemical compound for at least two hours prior to the measuring.
In one embodiment of any of the aspects, measuring the expression comprises measuring the level of one or more mRNAs. In one embodiment of any of the aspects, measuring the level of one or more mRNAs comprises measuring the level of the one or more mRNAs in a single cell. In one embodiment of any of the aspects, measuring the level of one or more mRNAs comprises single-cell sequencing.
In one embodiment of any of the aspects, the method further comprises measuring the expression of at least one odor response gene in a plurality of OSNs contacted with at least one volatilized chemical compound, each of the plurality of OSNs expressing a different odorant receptor. In one embodiment of any of the aspects, the method further comprises measuring the expression of at least one odor response gene in a plurality of OSNs, each of the plurality of OSNs having been contacted with a different at least one volatilized chemical compound. In one embodiment of any of the aspects, each of the plurality of OSNs is cultured in different wells of a multi-well plate or different cell culture containers. In one embodiment of any of the aspects, the plurality of OSNs are cultured in the same well of a multi-well plate or the same cell culture container. In one embodiment of any of the aspects, each well of a multi-well plate or each cell culture container is contacted with the same at least one volatilized chemical compound. In one embodiment of any of the aspects, each well of a multi-well plate or cell culture container is contacted with a different at least one volatilized chemical compound.
In one embodiment of any of the aspects, each of the plurality of OSNs is present in a different subject during the contacting. In one embodiment of any of the aspects, the plurality of OSNs are present in the same subject during the contacting. In one embodiment of any of the aspects, each subject is contacted with the same at least one volatilized chemical compound. In one embodiment of any of the aspects, each subject is contacted with a different at least one volatilized chemical compound.
In one embodiment of any of the aspects, the method further comprises obtaining a sample comprising the first OSN and/or second OSN from the nose of a subject after the contacting and before the measuring. In one embodiment of any of the aspects, sample is obtained from the olfactory bulb, the olfactory epithelium, the nerve endings, and/or the nasal cavity. In one embodiment of any of the aspects, the sample comprises epithelial cells, endothelial cells, olfactory sensory neurons, goblet cells, cilia, and/or mast cells.
In one embodiment of any of the aspects, the subject is a mammal. In one embodiment of any of the aspects, the subject is a mouse or human.
In one embodiment of any of the aspects, the method further comprises comparing the expression of the at least one odor response gene in the first OSN to the expression of the at least one odor response genes in an OSN of the first subtype that has been contacted with a benchmark scent.
In one aspect of any of the embodiments, described herein is a method of identifying an odorant receptor ligand, or measuring the strength of an odorant receptor ligand's binding, the method comprising measuring the expression of at least one odor response gene in a first olfactory sensory neuron (OSN) expressing a first odorant receptor that has been contacted with a candidate odorant receptor ligand; and determining that the candidate odorant receptor ligand binds the first odorant receptor if the expression of the at least one odor response gene in the first OSN is different from a reference level of expression in OSN expressing the first odorant receptor that has not been contacted with the candidate odorant receptor ligand.
In one embodiment of any of the aspects, the method further comprises measuring the expression of at least odor response gene in a plurality of OSNs, each of the plurality of OSNs having been contacted with a different candidate odorant receptor ligand. In one embodiment of any of the aspects, the magnitude of the difference in expression of the at least one odor response gene in the first OSN from the reference level of expression in an OSN expressing the first odorant receptor that has not been contacted with the candidate odorant receptor ligand correlates to the strength of the binding of the candidate odorant receptor ligand and the odorant receptor.
In one aspect of any of the embodiments, described herein is a method comprising measuring the expression of at least one odor response gene in a first olfactory sensory neuron (OSN) that has been contacted with at least one volatilized chemical compound, wherein the first OSN is obtained from or present in a subject; and determining that the subject has a loss of smell if the expression of the at least one odor response gene in the first OSN is different from the expression in a second OSN of the first subtype obtained from or present in normal, healthy individual. In one embodiment of any of the aspects, the loss of smell indicates the subject has or is at risk of having Parkinson's Disease. Alzheimer's Disease COVID-19, SARS-COV, MERS-COV, coronavirus, influenza, staphalococcus, and streptococcus. In one embodiment of any of the aspects, the method further comprises administering a treatment for Parkinson's Disease, Alzheimer's Disease, COVID-19, SARS-COV, MERS-COV, coronavirus, influenza, staphalococcus, and streptococcus if the subject is determined to have a loss of smell.
In mammals, the sense of smell is controlled by olfactory sensory neurons (OSNs). Each OSN expresses an odorant receptor that binds to a volatizlied chemical compound (e.g., a scent molecule or a molecule that is part of a scent). The binding of the odorant receptor to the volatilized chemical compound is then transmitted to the central nervous system by the OSN, and the mammal perceives a scent. Importantly, each OSN expresses only one odorant receptor and there are approximately 1,000 different odorant receptors. It has therefore proven extremely difficult to determine which odorant receptor binds to which volatilized chemical compound(s), particularly in vivo.
The inventors have now discovered downstream transcriptional targets of odorant receptor activation. Combined with single cell sensitivity in expression assays (e.g., sequencing), this makes it possible to identify the individual OSNs that are responsive to any given volatilized chemical compound and thereby identify the odorant receptor that binds that volatilized chemical compound. This ability to dissect the chemical and genetic basis of scent perception permits improved methods of replicating or modulating scents.
In one aspect of any of the embodiments, described herein is a method comprising: measuring the expression of at least one odor response gene in a first olfactory sensory neuron (OSN) of a first subtype that has been contacted with at least one volatilized chemical compound. In some embodiments of any of the aspects, the method further comprises measuring the expression of at least one odor response gene in a second OSN that has not been contacted with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the method further comprises determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN is different from a reference level of expression in a second OSN that has not been contacted with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the method further comprises determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN is increased relative to a reference level of expression in a second OSN that has not been contacted with the at least one volatilized chemical compound.
In some embodiments of any of the aspects, the method further comprises measuring the expression of at least one odor response gene in a second OSN of the first subtype that has not been contacted with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the method further comprises determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN is different from a reference level of expression in a second OSN of the first subtype that has not been contacted with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the method further comprises determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN is increased relative to a reference level of expression in a second OSN of the first subtype that has not been contacted with the at least one volatilized chemical compound.
In one aspect of any of the embodiments, described herein is a method comprising: measuring the expression of at least one odor response gene in a first OSN of a first subtype that has been contacted with at least one volatilized chemical compound; measuring expression of at least one odor response gene in a second OSN that has not been contacted with the at least one volatilized chemical compound; and determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN and the second OSN are different. In one aspect of any of the embodiments, described herein is a method comprising: measuring the expression of at least one odor response gene in a first OSN of a first subtype that has been contacted with at least one volatilized chemical compound; measuring expression of at least one odor response gene in a second OSN of the first subtype that has not been contacted with the at least one volatilized chemical compound; and determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN and the second OSN are different.
In one aspect of any of the embodiments, described herein is a method comprising: measuring the expression of at least one odor response gene in a first OSN of a first subtype that has been contacted with at least one volatilized chemical compound; measuring expression of at least one odor response gene in a second OSN that has not been contacted with the at least one volatilized chemical compound; and determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN is increased relative to the expression of the at least one odor response gene in the second OSN. In one aspect of any of the embodiments, described herein is a method comprising: measuring the expression of at least one odor response gene in a first OSN of a first subtype that has been contacted with at least one volatilized chemical compound; measuring expression of at least one odor response gene in a second OSN of the first subtype that has not been contacted with the at least one volatilized chemical compound; and determining that the first subtype expresses an odorant receptor that binds to the at least one volatilized chemical compound if the expression of the at least one odor response gene in the first OSN is increased relative to the expression of the at least one odor response gene in the second OSN.
As used herein, an “odor response gene” is defined as a gene whose expression alters when a chemoreceptor binds its cognate scent molecule. These chemoreceptors, which are also known as odorant receptors, are expressed in the cell membranes of olfactory receptor neurons and are responsible for the detection of odorants (for example, compounds that have an odor) which give rise to the sense of smell. Activated odorant receptors trigger nerve impulses which transmit information about odor to the brain. Described herein are at least 800 human odor response genes at least 1400 murine odor response genes.
Examples of odor response genes include, but are not limited to Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Gm13889, Golga4, Heg1, Cdkn1a, Gpr137c, Zfp655, Ublcp1, Efr3b, Etf1, Srrm4, Epcam, Trp53inp2, Arhgap17, Ric8b, 9530034E10Rik, Oaz2, Ctxn3, Slc38a2, Exoc6b, Nfatc1, Junb, Tle3, Siah2, Fos, Ago2, Arhgef28, Ddx3x, Hspa8, Hnmnph3, Erf, Ak2, Pnrc1, Mafg, D1Ertd622c, Arid5b, Scn3b, Phtf2, Hsph1, Cpeb4, Ubc, Sema7a, Dleu2, Rc3h1, Pcnx4, Btg2, 6430548M08Rik, Selenok, Psmd8, Map9, Cry2, Hspa41, Mcl1, Dnaja1, Fth1, Fam208a, Bmpr2, Zfp608, BC005561, Rab11fip4, Smim13, Ppp2ca, Sod1, Neat1, Brinp2, Tuba4a, Cdc14a, Kdm6b, Uckl1os, Stk40, Dnajb2, Dnajb9, Spsb3, Sub1, Rheb, Manea, Fzd3, Zfand5, Uhrflbp1l, Cebpb, Wdfy3, Mfgc8, Inpp4a, Nab1, Tccal9, Ubc2g1, Pcdh7, Eefla1, Tnrc6c, Cd82, BC005537, Sdcbp, Ddx5, Olfm1, Srpk1, M6pr, Braf, Ahi1, Clcn3, Eef2, Rbm26, Arfgef3, Eif5, Tmem150c, Acbp2, Snrk, Bcx3, Epb4112, Samd4b, Stard10, Pappa, Grsf1, Ralgds, Arih1, Micu3, Ptges3, Kras, Morf412, Rtp2, Wdr26, Hspa5, Zmynd8, Sv2c, Paip2, Rab39b, Ywhaz, Gpat3, Psen2, Pde4a, Atp2b1, Irs2, Cfap69, Hspa9, Tob1, Hspa14, Pard6a, Mapk4, Nr4a1, Sik3, Epha7, Hagh, Kalrn, Prkar1a, Lmbrd1, Erich3, Sf3b1, Ksr2, Ybx1, Nfix, Hivep1, Mfn2, Elof1, Wdfy2, Emd, Azin1, Wbp1, Nop53, Syt7, Ncoa1, Smarca5, Aes, Rnf150, Plxnb2, Wdsub1, Dyrk4, Epb4111, Pcdh8, Daam1, Prpf4b, Tnrc6a, Rufy3, Ptdss2, Tmem230, Scn3a, Rock 1, Dph3, Zbtb7b, Yipf4, Ift74, Ebf1, Wsb2, Rnf182, Arglu1, Ppm11, Ccdc189, Ptov1, Ccdc157, Ppa1, Pnpla8, Rack1, Rsrp1, Ttc9, Hcn2, Ccdc40, Kat2b, Cngb1, Ndufa3, Eefla2, Fmn2, Macrod1, Polr2i, 1700016K19Rik, Jade1, Slcla2, Ralbp1, Eif4c3, Morn2, Trappc21, Scrn1, Rtp1, Phyh, Fetub, Mapre3, Dnajb13, Coa3, Spef1, 1810058124Rik, Ccp290, Bbs4, Nin, D430042009Rik, Tex9, Sdc3, Ebf4, Oaz1, Napa, Ttc8, Sdhaf4, Cuta, Commd6, Sfr1, Hcfclr1, Nxn12, Kcnh3, Trpm7, Glb112, Slc25a39, Nsmf, Pifo, Oscp1, 1110008P14Rik, Egr1, Ttll6, Sem1, Aplg1, Fkbp2, Suclg1, Cidea, Dynlrb2, Ndufa2, Map1a, Sun1, Rufy2, Rnh1, Dtna, Anapc16, Pdhb, A430035B10Rik, Faim2, Ebf2, Ccdc28b, Ttll3, Drc1, Sys1, Prdx5, Palm, Slu7, Dnajc15, Elmod1, Paqr9, Tubb3, Ascc1, Ndufc1, Pon2, Fkbp4, Trp1, Cldn3, Cby1, Tctex1d2, Slc48a1, Nme5, 1110004E09Rik, Cnppd1, Naa38, Vdac3, Puf60, Cers5, Cbr1, Ift43, Fam217a, Ankrd10, Ubc2n, Pfdn6, Mpc2, Ncbp2, Timm13, Lamtor5, Uqcr10, Slc27a2, Arpc51, Adh5, Gkap1, Ndufaf4, Psip1, Omp, Bcl11a, Ift81, 2010107E04Rik, Ubxn1, Ech1, Tnrc6b, Tmpo, Ssbp4, 2410015M20Rik, Mapk8ip2, Naxc, Gm13589, Hdgf13, Slc22a23, Cacna1h, Slc25a5, Ndufa13, Anp32a, Ndufb10, Lhx2, Cenpx, Rnf220, Ankrd54, Ppdpf, Dtd1, Auh, Mrpl51, Psmd13, Lamtor4, Hist2h2be, Mapkap1, Arhgdig, Ndufv3, Ilkap, Ctsa, Ift27, Lpcat4, Atp50, Ypel3, Gadl1, Msln1, Hint2, Acbd7, Manf, Lamtor2, Ak1, Arl6, Uxs1, Sri, Tomm7, Ubxn6, Cd47, Snrpd3, Ppid, Rsph9, Rbfox2, Pigp, C1qbp, Fmc1, Bcl7a, Krtcap2, Flrt1, Sdhd, Bphl, Sod2, Ndufb7, Prrc2b, Rex1bd, Rab36, Pdrg1, Pfn2, Rarg, B230118H07Rik, Elavl3, Mlf1, Eif1, Mett126, Kmt5a, Ndufa5, Ndufb3, Emc8, Bad, Smc3, Dpm3, Ndufa11, Cisd1, Mrpl27, Cldn9, Sec61b, Tpd5212, Polr2c, Ndufa7, Atf5, Gabarap, Srsf3, Psmb3, Bax, Atox1, 2410004P03Rik, Atpla1, Ptprs, Cib1, Hist1h2bc, Atp5k, Hmox2, Fam92b, Chd4, Elob, Kif5a, Tbcb, Ndufa4, Uqcr11, Hras, Atp6vlf, Pam16, Pim3, Lrriq1, Nfu1, Supt4a, Swi5, Mtx2, Faim, Taldo1, Ccdc34, Rbm39, 2210016L21Rik, Dpy30, Tomm22, Pygo1, Mat2b, Mrpl28, Eid1, Snrpe, Atp5j, Ndufs7, Park7, Ndufs4, Cirbp, Polr2f, Fbxo9, Mgst3, Gtf2h5, Mtch1, Mrps24, Actr1b, Sh3bgrl3, Bmpr1a, Pfn1, Tesc, Tmem258, Mrps14, Abhd16a, Vdac2, Nxph3, Polr2g, Them6, Cfap126, Pitpnc1, Kctd1, and Dmkn. In some embodiments, the at least one odor response gene is selected from Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Btg2, Egr1, Fos, Fosb, Gm13889, Junb, and Nr4a1. In some embodiments, the at least one odor response gene is selected from the group consisting of: Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1. In some embodiments, the at least one odor response gene is selected from the at least one odor response gene is selected from the group consisting of: Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, and Srxn1. In some embodiments, the at least one odor response gene is selected from the group consisting of: Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, and Srxn1. In some embodiments, the at least one odor response gene is selected from the group consisting of: Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14.
The sequences of the odor response genes are known in the art for a number of species. For example, the sequences of the odor response genes can be the sequences available in the NCBI database for the Gene ID Nos, in Table 1, e.g., as of Dec. 2, 2022. The NCBI database entries provide genomic and mRNA sequences, including splice variants and homologs and orthologs for a number of species. As a further example, the sequences of the odor response genes can be the sequences available in the ENSEMBL database for the ESEMBL Gene ID Nos, in Table 1, e.g., as of Dec. 2, 2022 (ENSEMBL release 108). The ENSEMBL database entries provide genomic and mRNA sequences, including splice variants and homologs and orthologs for a number of species.
In some embodiments of any of the aspects, the odor response genes comprise at least 1 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 2 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 3 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 4 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 5 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 6 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 10 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 20 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 30 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 40 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 50 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 100 of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise each of the genes of Table 1.
In some embodiments of any of the aspects, the odor response genes comprise at least 1 human ortholog of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 2 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 3 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 4 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 5 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 6 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 10 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 20 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 30 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 40 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 50 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise at least 100 human orthologs of the genes of Table 1. In some embodiments of any of the aspects, the odor response genes comprise human orthologs each of the genes of Table 1.
In some embodiments, measuring the expression of at least one odor response gene can comprise measuring the expression of at least one odor response gene, at least two odor response genes, at least three odor response genes, at least four response genes, at least five odor response genes, at least six odor response genes, at least seven odor response genes, at least eight odor response genes, at least nine odor response genes, at least ten odor response genes, at least 11 odor response genes, at least 12 odor response genes, at least 13 odor response genes, at least 14 odor response genes, at least 15 odor response genes, at least 16 odor response genes, at least 17 odor response genes, at least 18 odor response genes, at least 19 odor response genes, at least 20 odor response genes, at least 21 odor response genes, at least 22 odor response genes, at least 23 odor response genes, at least 24 odor response genes, at least 25 odor response genes, at least 26 odor response genes, at least 27 odor response genes, at least 28 odor response genes, at least 29 odor response genes, at least 30 odor response genes, at least 31 odor response genes, at least 32 odor response genes, at least 33 odor response genes, at least 34 odor response genes, at least 35 odor response genes, at least 36 odor response genes, at least 37 odor response genes, at least 38 odor response genes, at least 39 odor response genes, at least 40 odor response genes, at least 41 odor response genes, at least 42 odor response genes, at least 43 odor response genes, at least 44 odor response genes, at least 45 odor response genes, at least 46 odor response genes, at least 47 odor response genes, at least 48 odor response genes, at least 49 odor response genes, at least fifty odor response genes, at least 51 odor response genes, at least 52 odor response genes, at least 53 odor response genes, at least 54 odor response genes, at least 55 odor response genes, at least 55 odor response genes, at least 56 odor response genes, at least 57 odor response genes, at least 58 odor response genes, at least 59 odor response genes, at least 60 odor response genes, at least 61 odor response genes, at least 62 odor response genes, at least 63 odor response genes, at least 64 odor response genes, at least 65 odor response genes, at least 66 odor response genes, at least 67 odor response genes, at least 68 odor response genes, at least 69 odor response genes, at least 70 odor response genes, at least 71 odor response genes, at least 72 odor response genes, at least 73 odor response genes, at least 74 odor response genes, at least 75 odor response genes, at least 76 odor response genes, at least 77 odor response genes, at least 78 odor response genes, at least 79 odor response genes, at least 80 odor response genes, at least 81 odor response genes, at least 82 odor response genes, at least 83 odor response genes, at least 84 odor response genes, at least 85 odor response genes, at least 86 odor response genes, at least 87 odor response genes, at least 88 odor response genes, at least 89 odor response genes, at least 90 odor response genes, at least 91 odor response genes, at least 92 odor response genes, at least 93 odor response genes, at least 94 odor response genes, at least 95 odor response genes, at least 96 odor response genes, at least 97 odor response genes, at least 98 odor response genes, at least 99 odor response genes, at least one hundred odor response genes or more.
In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1. In some embodiments of any of the aspects, the odor response genes comprise Rcan1. In some embodiments of any of the aspects, the odor response genes comprise Fosb. In some embodiments of any of the aspects, the odor response genes comprise Pcdh10. In some embodiments of any of the aspects, the odor response genes comprise Dusp14. In some embodiments of any of the aspects, the odor response genes comprise Srxn1. In some embodiments of any of the aspects, the odor response genes comprise Fos. In some embodiments of any of the aspects, the odor response genes comprise Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1 and Rcan1. In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1 and Fosb. In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1 and Pcdh10. In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1 and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1 and Srxn1. In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1 and Fos. In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1 and Nr4a1 In some embodiments of any of the aspects, the odor response genes comprise Rcan1 and Fosb. In some embodiments of any of the aspects, the odor response genes comprise Rcan1 and Pcdh10. In some embodiments of any of the aspects, the odor response genes comprise Rcan1 and Dsup14. In some embodiments of any of the aspects, the odor response genes comprise Rcan1 and Srxn1. In some embodiments of any of the aspects, the odor response genes comprise Rcan1 and Fos. In some embodiments of any of the aspects, the odor response genes comprise Rcan1 and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise Pcdh10 and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise Pcdh10 and Srxn1. In some embodiments of any of the aspects, the odor response genes comprise Pcdh10 and Fos. In some embodiments of any of the aspects, the odor response genes comprise Pcdh10 and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise Dusp14 and Srxn1. In some embodiments of any of the aspects, the odor response genes comprise Dusp14 and Fos. In some embodiments of any of the aspects, the odor response genes comprise Dusp14 and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise Srxn1 and Fos. In some embodiments of any of the aspects, the odor response genes comprise Srxn1 and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise Fos and Nr4a1.
In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1, In some embodiments of any of the aspects, the odor response genes comprise at least 1 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise at least 2 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise at least 3 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise at least 4 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise at least 5 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise at least 6 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1. In some embodiments of any of the aspects, the odor response genes comprise at least 7 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, Srxn1, Fos, and Nr4a1.
In some embodiments of any of the aspects, the odor response genes comprise Btg2, Egr1, Fos, Fosb, Gm13889, and Junb. In some embodiments of any of the aspects, the odor response genes comprise at least 1 of Btg2, Egr1, Fos, Fosb, Gm13889, and Junb. In some embodiments of any of the aspects, the odor response genes comprise at least 2 of Btg2, Egr1, Fos, Fosb, Gm13889, and Junb. In some embodiments of any of the aspects, the odor response genes comprise at least 3 of Btg2, Egr1, Fos, Fosb, Gm13889, and Junb. In some embodiments of any of the aspects, the odor response genes comprise at least 4 of Btg2, Egr1, Fos, Fosb, Gm13889, and Junb. In some embodiments of any of the aspects, the odor response genes comprise at least 5 of Btg2. Egr1. Fos. Fosb. Gm13889, and Junb.
In some embodiments of any of the aspects, the odor response genes comprise Btg2 and Egr1. In some embodiments of any of the aspects, the odor response genes comprise Btg2 and Fos. In some embodiments of any of the aspects, the odor response genes comprise Btg2 and Fosb. In some embodiments of any of the aspects, the odor response genes comprise Btg2 and Gm13889. In some embodiments of any of the aspects, the odor response genes comprise Btg2 and Junb. In some embodiments of any of the aspects, the odor response genes comprise Egr1 and Fos. In some embodiments of any of the aspects, the odor response genes comprise Egr1 and Fosb. In some embodiments of any of the aspects, the odor response genes comprise Egr1 and Gm13889. In some embodiments of any of the aspects, the odor response genes comprise Egr1 and Junb. In some embodiments of any of the aspects, the odor response genes comprise Fos and Fosb. In some embodiments of any of the aspects, the odor response genes comprise Fos and Gm13889. In some embodiments of any of the aspects, the odor response genes comprise Fos and Junb. In some embodiments of any of the aspects, the odor response genes comprise Fosb and Gm13889. In some embodiments of any of the aspects, the odor response genes comprise Fosb and Junb. In some embodiments of any of the aspects, the odor response genes comprise Gm13889 and Junb.
In some embodiments of any of the aspects, the odor response genes comprise Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, and Srxn1. In some embodiments of any of the aspects, the odor response genes comprise at least 1 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, and Srxn1. In some embodiments of any of the aspects, the odor response genes comprise at least 2 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, and Srxn1. In some embodiments of any of the aspects, the odor response genes comprise at least 3 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, and Srxn1. In some embodiments of any of the aspects, the odor response genes comprise at least 4 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, and Srxn1. In some embodiments of any of the aspects, the odor response genes comprise at least 5 of Tsc22d1, Rcan1, Fosb, Pcdh10, Dusp14, and Srxn1.
In some embodiments of any of the aspects, the odor response genes comprise Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 1 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 2 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 3 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 4 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 5 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 6 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 7 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 8 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 9 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 10 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 11 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14. In some embodiments of any of the aspects, the odor response genes comprise at least 12 of Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, Srxn1, Tsc22d1, Rcan1, and Dusp14.
An “olfactory sensory neuron (OSN)” is defined as a bipolar neuron with dendrites facing the external surface of the cribriform plate with axons that pass through the cribriform foramina with terminal end at olfactory bulbs. An OSN can also be referred to as an olfactory receptor neuron (ORN). Humans have between 10 and 20 million OSNs that are located in the olfactory epithelium in the nasal cavity. The cell bodies of the OSNs are distributed among all three of the stratified layers of the olfactory epithelium.
As used herein, a “subtype” of an OSN refers to an OSN or group of OSN expressing a specific odorant receptor. When a first OSN and a second OSN are of the same subtype, the first OSN and second OSN are characterized by expressing the same odorant receptor. In some embodiments of any of the aspects, the first subtype expresses one odorant receptor. In some embodiments of any of the aspects, the first subtype expresses a first odorant receptor.
An odor receptor, olfactory receptor, or “odorant receptor”, is a chemoreceptor expressed in the cell membranes of olfactory sensory neurons which is responsible for the detection of one or more odorants (for example, compounds that have an odor) which give rise to the sense of smell. Odorant receptors are located on the membranes of the cilia of the OSNs have been classified as a complex type of ligand-gated metabotropic channels. An odorant will dissolve into the mucus of the olfactory epithelium and then bind to an odor receptor. Odor receptors can bind to a variety of odor molecules, with varying affinities. The difference in affinities causes differences in activation patterns resulting in unique odorant profiles. The activated odor receptor in turn activates the intracellular G-protein. GOLF (GNAL), adenylate cyclase and production of cyclic AMP (CAMP) opens ion channels in the cell membrane, resulting in an influx of sodium and calcium ions into the cell, and an efflux of chloride ions. This influx of positive ions and efflux of negative ions causes the neuron to depolarize, generating an action potential. As described herein, the inventors have identified odor response genes, the expression of which in an OSN is modulated by activation of the odorant receptor.
In some embodiments of any of the aspects, the odorant receptor is a human odorant receptor. In some embodiments of any of the aspects, the odorant receptor is a murine odorant receptor. Odorant receptors and their sequences and relationships are known in the art. For further discussion of odorant receptor sequences and relationships, see. e.g., Glusman G. Yanai I. Rubin I. Lancet D (May 2001). “The complete human olfactory subgenome”. Genome Research. 11 (5); 685-702. doi: 10.1101/gr.171001. PMID 11337468 and Godfrey. P et al. (2004). “The mouse olfactory receptor gene family”. PNAS. 101 (7): 2156-61; each of which are incorporated by reference herein in their entireties.).
In some embodiments of any of the aspects, the human odorant receptor is selected from the group consisting of: human OR family 1, human OR family 2, human OR family 3, human OR family 4, human OR family 5, human OR family 6, human OR family 7, human OR family 8, human OR family 9, human OR family 10, human OR family 11, human OR family 12, human OR family 13, human OR family 14, human OR family 51, human OR family 52, human OR family 55, human OR family 56, and any variant thereof having an amino acid sequence having at least 80% identity with the amino acid sequence of one of said human odorant receptors.
In some embodiments of any of the aspects, the odorant receptor is selected from the group consisting of: Olfr 545, Olfr 160, Olfr 17, Olfr 727, Olfr 728, and Olfr 729.
In some embodiments of any of the aspects, the odorant receptor is selected from the group consisting of: Olfr 545, Olfr 160, and Olfr 17.
In some embodiments of any of the aspects, the murine odorant receptor is selected from the group consisting of: murine OR family 1, murine OR family 2, murine OR family 3, murine OR family 4, murine OR family 5, murine OR family 6, murine OR family 7, murine OR family 8, murine OR family 9, murine OR family 10-15, murine OR family 16-34.
In some embodiments of any of the aspects, the first OSN expresses at least one olfactory receptor. In some embodiments of any of the aspects, the second OSN expresses at least one olfactory receptor.
In some embodiments of any of the aspects, the method further comprises measuring the expression of at least one odorant receptor gene. An OSN expresses one odorant receptor gene, and thus the subtype of the OSN can be identified by detecting which odorant receptor gene is most highly expressed, e.g., in parallel or in the same assay as the measurement of the expression of the at least one odor response gene.
In some embodiments of any of the aspects, a first OSN and a second OSN are isogenic. In some embodiments of any of the aspects, a first OSN and a second OSN are derived from the same cell line. In some embodiments of any of the aspects, a first OSN and a second OSN are clonal. In some embodiments of any of the aspects, a first OSN and a second OSN are obtained from the same subject. In some embodiments of any of the aspects, a first OSN and a second OSN are of the same species.
In some embodiments of any of the aspects, a first OSN and a second OSN of the same subtype are isogenic. In some embodiments of any of the aspects, a first OSN and a second OSN of the same subtype are derived from the same cell line. In some embodiments of any of the aspects, a first OSN and a second OSN of the same subtype are clonal. In some embodiments of any of the aspects, a first OSN and a second OSN of the same subtype are obtained from the same subject. In some embodiments of any of the aspects, a first OSN and a second OSN of the same subtype are of the same species.
An “odor” or “odorant” is caused by one or more volatilized chemical compounds that are generally found in low concentrations that humans and animals can perceive via their sense of smell. An odor can also be referred to as a “smell” or a “scent”, which can refer to either a pleasant or an unpleasant odor. Volatile organic compounds (VOCs) are organic chemicals that have a high vapour pressure at room temperature. High vapor pressure correlates with a low boiling point, which relates to the number of the sample's molecules in the surrounding air, a trait known as volatility.
A volatilized chemical compound refers to a chemical compound that is or can be evaporated or dispersed in vapor. For an individual chemical or class of chemical compounds to be perceived as an odor, it must be sufficiently volatile for transmission via the air to the olfactory system in the upper part of the nose. A scent or odor can comprise one or more volatilized chemical compounds. In the methods described herein, an OSN can be contacted (or not contacted) with at least volatilized chemical compound, e.g., 1 volatilized chemical compound, multiple volatilized chemical compounds, or a scent or odor. Examples of volatilized chemical compounds include but are not limited to isoprene, terpenes, pinene isomers, sesquiterpenes, and methanol. Volatilized chemical compounds that are perceived by humans or mice, and/or which are common in scents and odors are known in the art. For example, see, Fahlbusch, Karl-Georg; Hammerschmidt, Franz-Josef; Panten, Johannes; Pickenhagen, Wilhelm; Schatkowski, Dietmar; Bauer, Kurt; Garbe, Dorothea; Surburg, Horst. “Flavors and fragrances”. Ullmann's Encyclopedia of Industrial Chemistry. Weinheim: Wiley-VCH, doi: 10.1002/14356007.a11_141, which is incorporated by reference herein in its entirety.
In some embodiments of any of the aspects, the olfactory sensory neuron has been contacted with or exposed to at least one volatilized chemical compound (e.g., odorant). In some embodiments, an OSN is in vitro during the contacting. In other embodiments, an OSN is in vivo during the contacting.
Contacting an OSN with at least one volatilized chemical compound can comprise contacting with air containing the at least one volatilized chemical compound, piping air comprising the at least one volatilized chemical compound towards the OSN/subject or into a space holding the OSN/subject, placing a source of the at least one volatilized chemical compound in a room or enclosure with the subject or OSN, or placing the at least one volatilized chemical compound on an object that is brought in close proximity to the subject's nose. In some embodiments of any of the aspects, the contacting an OSN with at least one volatilized chemical compound comprises contacting a subject comprising the OSN with air comprising the at least one volatilized chemical compound.
In other embodiments, the olfactory sensory neuron has not been contacted with or exposed to an odorant. In some embodiments, an OSN is in vitro during the not contacting. In other embodiments, an OSN is in vivo during the not contacting.
Not contacting an OSN with at least one volatilized chemical compound can comprise occluding the nostrils of a subject comprising the second OSN or contacting a subject comprising the second OSN with air not comprising the at least one volatilized chemical compound. Occluding the nostrils of a subject can comprise the use of a physical device (e.g., nose plug or mask) that blocks the nostrils, physical action (e.g., holding the subject's nose using fingers), providing instructions to the subject to breathe through their mouth, exposing the subject to an allergen or infectious agent that results in increased mucosal production and/or inflammatory cytokine production in the nostrils, exposing the subject to a physical or chemical irritant of the epithelial lining of the nostril (e.g., cigarette or cigar smoke), or selecting a human subject that is over the age of 60.
As described above herein, the methods of the instant technology relate to determining or measuring the expression of at least one odor response gene and/or odorant receptor (which are collectively referred to herein as “targets”). In some embodiments, the first subtype expresses an odorant receptor that binds to at least one volatilized chemical compound. The binding to at least one volatilized chemical compound activates odor response genes and initiates transcription of those genes. The expression level of the odor response genes in a first OSN is compared to the expression level of odor response genes in a second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)) that has not been contacted with the at least least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is log 2 fold-change of −0.5 or greater relative to the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is log 2 fold-change of −1 or greater relative to the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is log 2 fold-change of −2 or greater relative to the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is log 2 fold-change of −3 or greater relative to the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is log 2 fold-change of −4 or greater relative to the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is log 2 fold-change of −5 or greater relative to the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. If the expression level of the odor response genes in the first OSN is the same or less than the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then it is considered to be that the odorant receptor does not bind specifically to the at least one volatilized chemical compound.
In some embodiments, if the expression level of the odor response genes in the first OSN is at least 1.5× the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is at least 2.5× the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is at least 2.5× the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is at least 3× the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is at least 4× the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is at least 5× the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound. In some embodiments, if the expression level of the odor response genes in the first OSN is at least 10× the expression level of the odor response genes in the second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)), then the odorant receptor of the OSNs binds specifically with the at least one volatilized chemical compound.
Alternatively, a plurality of first OSNs and a plurality of second OSNs (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)) can be used. In such cases, an odorant receptor can be considered to bind specifically to an at least one volatilized chemical compound if at least 50% of the first OSNs exhibit an increased in the expression level of the at least one odor response genes. In some embodiments, an odorant receptor can be considered to bind specifically to an at least one volatilized chemical compound if at least 60% of the first OSNs exhibit an increased in the expression level of the at least one odor response genes. In such cases, an odorant receptor can be considered to bind specifically to an at least one volatilized chemical compound if at least 70% of the first OSNs exhibit an increased in the expression level of the at least one odor response genes.
In some embodiments, an odorant receptor can be considered to bind specifically to an at least one volatilized chemical compound if at least 50% of the first OSNs exhibit an increased in the expression level of at least 4 odor response genes. In some embodiments, an odorant receptor can be considered to bind specifically to an at least one volatilized chemical compound if at least 60% of the first OSNs exhibit an increased in the expression level of at least 4 odor response genes. In such cases, an odorant receptor can be considered to bind specifically to an at least one volatilized chemical compound if at least 70% of the first OSNs exhibit an increased in the expression level of at least 4 odor response genes.
In some embodiments of any of the aspects, the first OSN and/or the second OSN is in vitro during the contacting. In some embodiments of any of the aspects, the first OSN and/or the second OSN is in vivo during the contacting.
In some embodiments of any of the aspects, the contacting the first OSN comprises contacting a subject comprising the first OSN with air comprising the at least one volatilized chemical compound.
In some embodiments of any of the aspects, the not contacting the second OSN comprises: occluding the nostrils of a subject comprising the second OSN, or contacting a subject comprising the second OSN with air not comprising the at least one volatilized chemical compound.
In some embodiments of any of the aspects, the second OSN is not contacted with the at least one volatilized chemical compound for at least two hours prior to the measuring.
In some embodiments of any of the aspects, measurement of the level of a target and/or detection of the level or presence of a target, e.g. of an expression product (nucleic acid or polypeptide of one of the genes described herein) can comprise a transformation. As used herein, the term “transforming” or “transformation” refers to changing an object or a substance, e.g., biological sample, nucleic acid or protein, into another substance. The transformation can be physical, biological or chemical. Exemplary physical transformation includes, but is not limited to, pre-treatment of a biological sample, e.g., from whole blood to blood serum by differential centrifugation. A biological/chemical transformation can involve the action of at least one enzyme and/or a chemical reagent in a reaction. For example, a DNA sample can be digested into fragments by one or more restriction enzymes, or an exogenous molecule can be attached to a fragmented DNA sample with a ligase. In some embodiments of any of the aspects, a DNA sample can undergo enzymatic replication, e.g., by polymerase chain reaction (PCR).
Transformation, measurement, and/or detection of a target molecule, e.g. a odor response gene and/or odorant receptor mRNA or polypeptide can comprise contacting a sample (or OSN) with a reagent (e.g. a detection reagent) which is specific for the target, e.g., a target-specific reagent. In some embodiments of any of the aspects, the target-specific reagent is detectably labeled. In some embodiments of any of the aspects, the target-specific reagent is capable of generating a detectable signal. In some embodiments of any of the aspects, the target-specific reagent generates a detectable signal when the target molecule is present.
In certain embodiments, the gene expression products as described herein can be measured by determining the level of messenger RNA (mRNA) expression of the genes (e.g., targets) described herein. Such molecules can be isolated, derived, or amplified from a biological sample, such as an epithelialsample or OSN. Techniques for the detection of mRNA expression are known by persons skilled in the art, and can include but are not limited to, PCR procedures, RT-PCR, quantitative RT-PCR Northern blot analysis, differential gene expression. RNAse protection assay, microarray based analysis, next-generation sequencing: hybridization methods, etc.
In general, a PCR procedure relates a method of gene amplification which is comprised of (i) sequence-specific hybridization of primers to specific genes or sequences within a nucleic acid sample or library. (ii) subsequent amplification involving multiple rounds of annealing, elongation, and denaturation using a thermostable DNA polymerase, and (iii) screening the PCR products for a band of the correct size. The primers used are oligonucleotides of sufficient length and appropriate sequence to provide initiation of polymerization. i.e. each primer is specifically designed to be complementary to a strand of the genomic locus to be amplified. In an alternative embodiment. mRNA level of gene expression products described herein can be determined by reverse-transcription (RT) PCR and by quantitative RT-PCR (QRT-PCR) or real-time PCR methods. Methods of RT-PCR and QRT-PCR are well known in the art.
In some embodiments of any of the aspects, the level of an mRNA can be measured by a quantitative sequencing technology, e.g. a quantitative next-generation sequence technology. Methods of sequencing a nucleic acid sequence are well known in the art. Briefly, a sample obtained from a subject can be contacted with one or more primers which specifically hybridize to a single-strand nucleic acid sequence flanking the target gene sequence and a complementary strand is synthesized. In some next-generation technologies, an adaptor (double or single-stranded) is ligated to nucleic acid molecules in the sample and synthesis proceeds from the adaptor or adaptor compatible primers. In some third-generation technologies, the sequence can be determined, e.g. by determining the location and pattern of the hybridization of probes, or measuring one or more characteristics of a single molecule as it passes through a sensor (e.g. the modulation of an electrical field as a nucleic acid molecule passes through a nanopore). Exemplary methods of sequencing include, but are not limited to, Sanger sequencing, dideoxy chain termination, high-throughput sequencing, next generation sequencing, 454 sequencing. SOLID sequencing, polony sequencing, Illumina sequencing, Ion Torrent sequencing, sequencing by hybridization, nanopore sequencing, Helioscope sequencing, single molecule real time sequencing, RNAP sequencing, and the like. Methods and protocols for performing these sequencing methods are known in the art, see, e.g. “Next Generation Genome Sequencing” Ed. Michal Janitz, Wiley-VCH; “High-Throughput Next Generation Sequencing” Eds. Kwon and Ricke, Humanna Press, 2011; and Sambrook et al., Molecular Cloning; A Laboratory Manual (4 ed.). Cold Spring Harbor Laboratory Press. Cold Spring Harbor, N.Y., USA (2012); which are incorporated by reference herein in their entireties.
In some embodiments of any of the aspects, measuring the level of one or more mRNAs can be performed using single cell sequencing, q-RT-PCR, MERFISH and related multiplexed in situ detection methods. MERFISH is an imaging method capable of simultaneously measuring the copy number and spatial distribution of hundreds to thousands of RNA species in single cells (See e.g.; Moffit, J. M. et al. Chapter One-RNA Imaging with Multiplexed Error-Robus Fluorescence In Situ Hybridization (MERFISH). Methods in Enzymology. 572. 1-49, which is incorporated by reference in its entirety). One who is skilled in the art will be able to perform these techniques.
In some embodiments of any of the aspects, measuring the expression comprises measuring the level of one or more mRNAs. In some embodiments of any of the aspects, measuring the expression comprises measuring the level of the mRNA(s) of the at least one odor response gene.
In some embodiments of any of the aspects, measuring the level of one or more mRNAs comprises measuring the level of the one or more mRNAs in a single cell. In some embodiments of any of the aspects, measuring the level of one or more mRNAs comprises single-cell sequencing.
Nucleic acid and ribonucleic acid (RNA) molecules can be isolated from a particular biological sample using any of a number of procedures, which are well-known in the art, the particular isolation procedure chosen being appropriate for the particular biological sample. For example, freeze-thaw and alkaline lysis procedures can be useful for obtaining nucleic acid molecules from solid materials; heat and alkaline lysis procedures can be useful for obtaining nucleic acid molecules from urine; and proteinase K extraction can be used to obtain nucleic acid from blood (Roiff. A et al. PCR: Clinical Diagnostics and Research. Springer (1994)).
In some embodiments of any of the aspects, measuring the expression of a target can comprise measuring the level of a polypeptide gene expression product. Such methods to measure gene expression products, e.g., protein level, include immunocytochemistry. ELISA (enzyme linked immunosorbent assay), western blot, immunoprecipitation, LFIA, immunoprecipitation: enzyme-linked immunosorbent assay (ELISA); radioimmunological assay (RIA); sandwich assay; fluorescence in situ hybridization (FISH); immunohistological staining: radioimmunometric assay; immunofluorescence assay: mass spectroscopy: immunoelectrophoresis assay; and/or immunofluorescence using detection reagents such as an antibody or protein binding agents. Alternatively, a peptide can be detected in a subject by introducing into a subject a labeled anti-peptide antibody and other types of detection agent. For example, the antibody can be labeled with a detectable marker whose presence and location in the subject is detected by standard imaging techniques.
In some embodiments of any of the aspects, one or more of the reagents (e.g. an antibody reagent and/or nucleic acid probe) described herein can comprise a detectable label and/or comprise the ability to generate a detectable signal (e.g. by catalyzing reaction converting a compound to a detectable product). Detectable labels can comprise, for example, a light-absorbing dye, a fluorescent dye, or a radioactive label. Detectable labels, methods of detecting them, and methods of incorporating them into reagents (e.g. antibodies and nucleic acid probes) are well known in the art.
In some embodiments of any of the aspects, detectable labels can include labels that can be detected by spectroscopic, photochemical, biochemical, immunochemical, electromagnetic, radiochemical, or chemical means, such as fluorescence, chemifluorescence, or chemiluminescence, or any other appropriate means. The detectable labels used in the methods described herein can be primary labels (where the label comprises a moiety that is directly detectable or that produces a directly detectable moiety) or secondary labels (where the detectable label binds to another moiety to produce a detectable signal, e.g., as is common in immunological labeling using secondary and tertiary antibodies). The detectable label can be linked by covalent or non-covalent means to the reagent. Alternatively, a detectable label can be linked such as by directly labeling a molecule that achieves binding to the reagent via a ligand-receptor binding pair arrangement or other such specific recognition molecules. Detectable labels can include, but are not limited to radioisotopes, bioluminescent compounds, chromophores, antibodies, chemiluminescent compounds, fluorescent compounds, metal chelates, and enzymes.
In other embodiments, the detection reagent is label with a fluorescent compound. When the fluorescently labeled reagent is exposed to light of the proper wavelength, its presence can then be detected due to fluorescence. In some embodiments of any of the aspects, a detectable label can be a fluorescent dye molecule, or fluorophore including, but not limited to fluorescein, phycocrythrin, phycocyanin, o-phthaldehyde, fluorescamine, Cy3™, Cy5™, allophycocyanine, Texas Red, peridenin chlorophyll, cyanine, tandem conjugates such as phycoerythrin-Cy5™, green fluorescent protein, rhodamine, fluorescein isothiocyanate (FITC) and Oregon Green™, rhodamine and derivatives (e.g., Texas red and tetrarhodimine isothiocynate (TRITC)), biotin, phycocrythrin, AMCA, CyDyes™, 6-carboxyfhiorescein (commonly known by the abbreviations FAM and F), 6-carboxy-2′,4′,7′,4,7-hexachlorofluorescein (HEX), 6-carboxy-4′,5′-dichloro-2′,7′-dimethoxyfluorescein (JOE or J). N,N,N′,N′-tetramethyl-6carboxyrhodamine (TAMRA or T), 6-carboxy-X-rhodamine (ROX or R), 5-carboxyrhodamine-6G (R6G5 or G5), 6-carboxyrhodamine-6G (R6G6 or G6), and rhodamine 110; cyanine dyes. e.g. Cy3, Cy5 and Cy7 dyes: coumarins, e.g umbelliferone; benzimide dyes, e.g. Hoechst 33258; phenanthridine dyes, e.g. Texas Red; ethidium dyes; acridine dyes; carbazole dyes; phenoxazine dyes; porphyrin dyes; polymethine dyes, e.g. cyanine dyes such as Cy3, Cy5, etc; BODIPY dyes and quinoline dyes. In some embodiments of any of the aspects, a detectable label can be a radiolabel including, but not limited to 3H. 125I, 35S, 14C, 32P, and 33P. In some embodiments of any of the aspects, a detectable label can be an enzyme including, but not limited to horseradish peroxidase and alkaline phosphatase. An enzymatic label can produce, for example, a chemiluminescent signal, a color signal, or a fluorescent signal. Enzymes contemplated for use to detectably label an antibody reagent include, but are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-V-steroid isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urcase, catalase, glucose-VI-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. In some embodiments of any of the aspects, a detectable label is a chemiluminescent label, including, but not limited to lucigenin, luminol, luciferin, isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate ester. In some embodiments of any of the aspects, a detectable label can be a spectral colorimetric label including, but not limited to colloidal gold or colored glass or plastic (e.g., polystyrene, polypropylene, and latex) beads.
In some embodiments of any of the aspects, detection reagents can also be labeled with a detectable tag, such as c-Myc, HA, VSV-G, HSV, FLAG, V5, HIS, or biotin, Other detection systems can also be used, for example, a biotin-streptavidin system. In this system, the antibodies immunoreactive (i.e. specific for) with the biomarker of interest is biotinylated. Quantity of biotinylated antibody bound to the biomarker is determined using a streptavidin-peroxidase conjugate and a chromagenic substrate. Such streptavidin peroxidase detection kits are commercially available, e.g. from DAKO; Carpinteria, CA. A reagent can also be detectably labeled using fluorescence emitting metals such as 152Eu, or others of the lanthanide series. These metals can be attached to the reagent using such metal chelating groups as diethylenetriaminepentaacetic acid (DTPA) or ethylenediaminetetraacetic acid (EDTA).
A level which is less than a reference level can be a level which is less by at least about 10%, at least about 20%, at least about 50%, at least about 60%, at least about 80%, at least about 90%, or less relative to the reference level. In some embodiments of any of the aspects, a level which is less than a reference level can be a level which is statistically significantly less than the reference level.
A level which is more than a reference level can be a level which is greater by at least about 10%, at least about 20%, at least about 50%, at least about 60%, at least about 80%, at least about 90%, at least about 100%, at least about 200%, at least about 300%, at least about 500% or more than the reference level. In some embodiments of any of the aspects, a level which is more than a reference level can be a level which is statistically significantly greater than the reference level.
In some embodiments of any of the aspects, the reference can be a level of the target molecule in a OSN or subject not contacted with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the reference can also be a level of expression of the target molecule in a control sample, a pooled sample of control individuals or a numeric value or range of values based on the same. In some embodiments of any of the aspects, the reference can be the level of a target molecule in a sample obtained from the same subject at an earlier point in time. e.g., the methods described herein can be used to determine if a subject's sensitivity or response is changing over time.
In some embodiments of any of the aspects, the reference can be a level of the target molecule in a population of subjects who do not have or are not diagnosed as having, and/or do not exhibit signs or symptoms of Parkington's Disease, Alzheimer's Disease, COVID-19, SARS-COV, MERS-COV, related coronavirus infections, and infections from common viral and bacterial agents like influenza, staphalococcus, and streptococcus. In some embodiments of any of the aspects, the reference can also be a level of expression of the target molecule in a control sample, a pooled sample of control individuals or a numeric value or range of values based on the same. In some embodiments of any of the aspects, the reference can be the level of a target molecule in a sample obtained from the same subject at an earlier point in time, e.g., the methods described herein can be used to determine if a subject's sensitivity or response to a given therapy is changing over time.
In some embodiments of any of the aspects, the expression of a target described herein (e.g. the at least one odor response gene and/or the odorant receptor) is measured after contacting the OSN with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the expression of a target described herein (e.g, the at least one odor response gene and/or the odorant receptor) is measured 10 minutes or more after contacting the OSN with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the expression of a target described herein (e.g, the at least one odor response gene and/or the odorant receptor) is measured 20 minutes or more after contacting the OSN with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the expression of the a target described herein (e.g, the at least one odor response gene and/or the odorant receptor) is measured 30 minutes or more after contacting the OSN with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the expression of the a target described herein (e.g, the at least one odor response gene and/or the odorant receptor) is measured 1 hour or more after contacting the OSN with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the expression of the a target described herein (e.g, the at least one odor response gene and/or the odorant receptor) is measured 2 or more hours after contacting the OSN with the at least one volatilized chemical compound. In some embodiments of any of the aspects, the expression of the a target described herein (e.g, the at least one odor response gene and/or the odorant receptor) is measured 3 days or more after contacting the OSN with the at least one volatilized chemical compound.
In some embodiments, the expression of a target described herein (e.g, the at least one odor response gene and/or the odorant receptor is measured between 0.5 hours and 6 weeks, measured between 0.5 hours and 5.5 weeks, measured between 0.5 hours and 5 weeks, measured between 0.5 hours and 4.5 weeks, measured between 0.5 hours and 4 weeks, measured between 0.5 hours and 3.5 weeks, measured between 0.5 hours and 3 weeks, measured between 0.5 hours and 2.5 weeks, measured between 0.5 hours and 2 weeks, measured between 0.5 hours and 1.5 weeks, measured between 0.5 hours and 1 week, measured between 0.5 hours and 6.5 days, measured between 0.5 hours and 6 days, measured between 0.5 hours and 5.5 days, measured between 0.5 hours and 5 days, measured between 0.5 hours and 4.5 days, measured between 0.5 hours and 4 days, measured between 0.5 hours and 3.5 days, measured between 0.5 hours and 3 days, measured between 0.5 hours and 2.5 days, measured between 0.5 hours and 2 days, measured between 0.5 hours and 1.5 days, measured between 0.5 hours and 24 hours, measured between 0.5 hours and 23.5 hours, measured between 0.5 hours, and 23 hours, measured between 0.5 hours and 22.5 hours, measured between 0.5 hours and 22 hours, measured between 0.5 hours and 21.5 hours, measured between 0.5 hours and 21 hours, measured between 0.5 hours and 20.5 hours, measured between 0.5 hours and 20.5 hours, measured between 0.5 hours and 20 hours, measured between 0.5 hours and 20 hours, measured between 0.5 hours and 19.5 hours, measured between 0.5 hours and 19 hours, measured between 0.5 hours and 18.5 hours, measured between 0.5 hours and 18 hours, measured between 0.5 hours and 17.5 hours, measured between 0.5 hours and 17 hours, measured between 0.5 hours and 16.5 hours, measured between 0.5 hours and 16 hours, measured between 0.5 hours and 15.5 hours, measured between 0.5 hours and 15 hours, measured between 0.5 hours and 14.5 hours, measured between 0.5 hours and 14 hours, measured between 0.5 hours and 13.5 hours, measured between 0.5 hours and 13 hours, measured between 0.5 hours and 12.5 hours, measured between 0.5 hours and 12 hours, measured between 0.5 hours and 11.5 hours, measured between 0.5 hours and 11 hours, measured between 0.5 hours and 10.5 hours, measured between 0.5 hours and 10 hours, measured between 0.5 hours and 9.5 hours, measured between 0.5 hours and 9 hours, measured between 0.5 hours and 8.5 hours, measured between 0.5 hours and 8 hours, measured between 0.5 hours and 7.5 hours, measured between 0.5 hours and 7.0 hours, measured between 0.5 hours and 6.5 hours, measured between 0.5 hours and 6 hours, measured between 0.5 hours and 6 hours, measured between 0.5 hours and 5.5 hours, measured between 0.5 hours and 5 hours, measured between 0.5 hours and 4.5 hours, measured between 0.5 hours and 4 hours, measured between 0.5 hours and 3.5 hours, measured between 0.5 hours and 3 hours, measured between 0.5 hours and 2.5 hours, measured between 0.5 hours and 2 hours, measured between 0.5 hours and 1.5 hours, or measured between 0.5 hours and 1 hour.
In some embodiments, the expression of a target described herein (e.g., the at least one odor response gene and/or the odorant receptor) is measured after at least 2 hours after contacting an OSN with at least one volatilized chemical compound, after at least 3 hours, after at least 4 hours, after at least 5 hours, after at least 6 hours, after at least 7 hours, after at least 8 hours, after at least 9 hours, after at least 10 hours, after at least 11 hours, after at least 12 hours, after at least 13 hours, after at least 14 hours, after at least 15 hours, after at least 16 hours, after at least 17 hours, after at least 18 hours, after at least 19 hours, after at least 20 hours, after at least 21 hours, after at least 22 hours, after at least 23 hours, after at least 24 hours, after at least 25 hours, after at least 26 hours, after at least 27 hours, after at least 28 hours, after at least 29 hours, after at least 30 hours, after at least 31 hours, after at least 32 hours, after at least 33 hours, after at least 34 hours, after at least 35 hours, after at least 36 hours, after at least 37 hours, after at least 38 hours, after at least 39 hours, after at least 40 hours, after at least 41 hours, after at least 42 hours, after at least 43 hours, after at least 44 hours, after at least 45 hours, after at least 46 hours, after at least 47 hours, after at least 48 hours, after at least 49 hours, after at least 50 hours, after at least 51 hours, after at least 52 hours, after at least 53 hours, after at least 54 hours, after at least 55 hours, after at least 56 hours, after at least 57 hours, after at least 58 hours, after at least 59 hours, after at least 60 hours, after at least 61 hours, after at least 62 hours, after at least 63 hours, after at least 64 hours, after at least 65 hours, after at least 66 hours, after at least 67 hours, after at least 68 hours, after at least 69 hours, after at least 70 hours, after at least 71 hours, after at least 72 hours, after at least 73 hours, after at least 74 hours, after at least 75 hours, after at least 76 hours, after at least 77 hours, after at least 78 hours, after at least 79 hours, after at least 80 hours, after at least 81 hours, after at least 82 hours, after at least 83 hours, after at least 84 hours, after at least 85 hours, after at least 86 hours, after at least 87 hours, after at least 88 hours, after at least 89 hours, after at least 90 hours, after at least 91 hours, after at least 92 hours, after at least 92 hours, after at least 93 hours, after at least 94 hours, after at least 95 hours, after at least 96 hours, after at least 97 hours, after at least 98 hours, after at least 99 hours, after at least 100 hours, after at least 101 hours, after at least 102 hours, after at least 103 hours, after at least 104 hours, after at least 105 hours, after at least 106 hours, after at least 107 hours, after at least 108 hours, after at least 109 hours, after at least 110 hours, after at least 111 hours, after at least 112 hours, after at least 113 hours, after at least 114 hours, after at least 115 hours, after at least 116 hours, after at least 117 hours, after at least 118 hours, after at least 119 hours, or after at least 120 hours or more.
The methods described herein also relate to screens, e.g., for a volatilized chemical compound or mixture of volatilized chemical compounds that activates a particular OSN subtype, or activates a plurality of OSN subtypes in a specific pattern, or which activates or or more OSN subtypes in a way that most closely resembles a reference volatilized chemical compound or benchmark scent.
As used herein, the term “benchmark scent” is a selected scent that one is trying to mimic or improve. e.g. by using it as a benchmark to compare a library of compounds/scents to.
As used herein. “candidate” refers to an element, e.g., a ligand, volatilized chemical compounds, or odorant receptor, that are to be screened and/or analyzed for the strength of their interaction with one or more other elements. A “candidate” element can be known to have the relevant activity or structure, but be a candidate in the sense of being a candidate for binding of a particular level of avidity or specificity, a candidate for binding in particular physical conditions, or a candidate for binding as compared to or in direct competition with other candidates. Where a “candidate” element is referred to herein, the reference includes known elements of that category. For example, where “candidate ligand” is used herein, it encompasses volatilized chemical compounds known in the art. For the methods described herein, candidates may be screened individually, or in groups. Group screening is particularly useful where hit rates for effective candidates are expected to be low such that one would not expect more than one positive result for a given group.
As used herein, the term “ligand” refers to a molecule that binds specifically with an odorant receptor to. The ligand can be a volatilized chemical compound. When the ligand is bound to the odorant receptor, odor response genes are modulated as described elsewhere herein.
In some embodiments of any of the aspects, the method further comprises measuring the expression of at least one odor response gene in a plurality of OSNs contacted with at least one volatilized chemical compound, each of the plurality of OSNs expressing a different odorant receptor. Such methods permit identification of which odorant receptors an at least one volatilized chemical compound activates, both qualitatively and quantitatively. Such methods can be used to identify volatilized chemical compounds that activate a set of odorant receptors in a pattern that mimics a benchmark scent.
In some embodiments of any of the aspects, the method further comprises measuring the expression of at least one odor response gene in a plurality of OSNs, each of the plurality of OSNs having been contacted with a different at least one volatilized chemical compound. Such methods permit identification of which compounds activate a particular odorant receptor, either quantitatively or qualitatively. Such methods can be used to identify volatilized chemical compounds that activate a particular odorant receptor to a degree that mimics a benchmark scent.
In some embodiments of any of the aspects, each of the plurality of OSNs is cultured in different wells of a multi-well plate or different cell culture containers. In some embodiments of any of the aspects, the plurality of OSNs are cultured in the same well of a multi-well plate or the same cell culture container. In some embodiments of any of the aspects, each well of a multi-well plate or each cell culture container is contacted with the same at least one volatilized chemical compound. In some embodiments of any of the aspects, each well of a multi-well plate or cell culture container is contacted with a different at least one volatilized chemical compound.
In some embodiments of any of the aspects, each of the plurality of OSNs is present in a different subject during the contacting. Such approaches can permit a larger sample pool, accounting for individual differences in scent perception. In some embodiments of any of the aspects, the plurality of OSNs are present in the same subject during the contacting, e.g., to reflect the complexity of scent perception, which typically involves multiple odorant receptors in multiple OSNs at one time. In some embodiments of any of the aspects, each subject is contacted with the same at least one volatilized chemical compound. In some embodiments of any of the aspects, each subject is contacted with a different at least one volatilized chemical compound.
In some embodiments of any of the aspects, the method further comprising obtaining a sample comprising the first OSN and/or second OSN from the nose of a subject after the contacting and before the measuring, e.g., the contacting occurs in vivo and the measuring is done ex vivo. The obtaining of the sample can be done noninvasively, e.g., with nasal swabbing or scrapping. In some embodiments of any of the aspects, the obtaining of the sample does not comprise surgical methods.
In some embodiments of any of the aspects, the method further comprises comparing the expression of the at least one odor response gene in the first OSN to the expression of the at least one odor response genes in an OSN of the first subtype that has been contacted with a benchmark scent. In some embodiments of any of the aspects, a method of identifying an odorant receptor ligand, or measuring the strength of an odorant receptor ligand's binding comprises: measuring the expression of at least one odor response gene in a first olfactory sensory neuron (OSN) expressing a first odorant receptor that has been contacted with a candidate odorant receptor ligand; and determining that the candidate odorant receptor ligand binds the first odorant receptor if the expression of the at least one odor response gene in the first OSN is different from a reference level of expression in OSN expressing the first odorant receptor that has not been contacted with the candidate odorant receptor ligand. In some embodiments of any of the aspects, the method further comprises measuring the expression of at least odor response gene in a plurality of OSNs, each of the plurality of OSNs having been contacted with a different candidate odorant receptor ligand. In some embodiments of any of the aspects, the magnitude of the difference in expression of the at least one odor response gene in the first OSN from the reference level of expression in an OSN expressing the first odorant receptor that has not been contacted with the candidate odorant receptor ligand correlates to the strength of the binding of the candidate odorant receptor ligand and the odorant receptor.
In some embodiments of any of the aspects, a method of identifying an odorant receptor ligand, or measuring the strength of an odorant receptor ligand's binding comprises: measuring the expression of at least one odor response gene in a first olfactory sensory neuron (OSN) expressing a first odorant receptor that has been contacted with a candidate odorant receptor ligand; and determining that the candidate odorant receptor ligand binds the first odorant receptor if the expression of the at least one odor response gene in the first OSN is increased relative to a reference level of expression in OSN expressing the first odorant receptor that has not been contacted with the candidate odorant receptor ligand. In some embodiments of any of the aspects, the method further comprises measuring the expression of at least odor response gene in a plurality of OSNs, each of the plurality of OSNs having been contacted with a different candidate odorant receptor ligand. In some embodiments of any of the aspects, the magnitude of the increase in expression of the at least one odor response gene in the first OSN relative to the reference level of expression in an OSN expressing the first odorant receptor that has not been contacted with the candidate odorant receptor ligand correlates to the strength of the binding of the candidate odorant receptor ligand and the odorant receptor.
In some embodiments of any of the aspects, the level of expression products of no more than 1,000 other genes is determined. In some embodiments of any of the aspects, the level of expression products of no more than 500 other genes is determined. In some embodiments of any of the aspects, the level of expression products of no more than 200 other genes is determined. In some embodiments of any of the aspects, the level of expression products of no more than 100 other genes is determined. In some embodiments of any of the aspects, the level of expression products of no more than 20 other genes is determined. In some embodiments of any of the aspects, the level of expression products of no more than 10 other genes is determined. In some embodiments of any of the aspects, the level of expression of no more than 6 other genes is determined.
In some embodiments of the foregoing aspects, the expression level of a given gene can be normalized relative to the expression level of one or more reference genes or reference proteins.
In some embodiments, the reference level can be the level in a sample of similar cell type, sample type, sample processing, and/or obtained from a subject of similar age, sex and other demographic parameters as the sample/subject for which the level of odor response gene activity is to be determined. In some embodiments, the test sample and control reference sample are of the same type, that is, obtained from the same biological source, and comprising the same composition, e.g, the same number and type of cells.
In some embodiments of any of the aspects, the sample is obtained from the olfactory bulb, the olfactory epithelium, the nerve endings, and/or the nasal cavity. In some embodiments of any of the aspects, the sample is obtained from the nasal cavity. In some embodiments of any of the aspects, the sample is obtained from the olfactory bulb. In some embodiments of any of the aspects, the sample is obtained from the olfactory epithelium. In some embodiments of any of the aspects, the sample comprises epithelial cells, endothelial cells, olfactory sensory neurons, goblet cells, cilia, and/or mast cells. In some embodiments of any of the aspects, a biological sample can comprise epithelial cells, endothelial cells, olfactory sensory neurons, goblet cells, cilia, and/or mast cells that originate from the nostril of a nose of a subject.
The term “sample” or “test sample” as used herein denotes a sample taken or isolated from a biological organism. e.g., an epithelial or tissue sample from a subject. In some embodiments of any of the aspects, the present invention encompasses several examples of a biological sample. In some embodiments of any of the aspects, the biological sample is cells, or tissue, or bodily fluid. Exemplary biological samples include, but are not limited to, a biopsy, biofluid sample mucus; tissue biopsy; organ biopsy; mucosal secretion; saliva; and/or tissue sample etc. The term also includes a mixture of the above-mentioned samples. The term “test sample” also includes untreated or pretreated (or pre-processed) biological samples. In some embodiments of any of the aspects, a test sample can comprise cells from a subject. In some embodiments of any of the aspects, the test sample can be epithelial cells, endothelial cells, olfactory sensory neurons, goblet cells, cilia, and/or mast cells that originate from the nostril of a nose of a subject.
The sample can be obtained by removing a sample from a subject, but can also be accomplished by using a previously isolated sample (e.g. isolated at a prior timepoint and isolated by the same or another person).
In some embodiments of any of the aspects, the test sample can be an untreated test sample. As used herein, the phrase “untreated test sample” refers to a test sample that has not had any prior sample pre-treatment except for dilution and/or suspension in a solution. Exemplary methods for treating a test sample include, but are not limited to, centrifugation, filtration, sonication, homogenization, heating, freezing and thawing, and combinations thereof. In some embodiments of any of the aspects, the test sample can be a frozen test sample. e.g., a frozen tissue. The frozen sample can be thawed before employing methods, assays and systems described herein. After thawing, a frozen sample can be centrifuged before being subjected to methods, assays and systems described herein. In some embodiments of any of the aspects, the test sample is a clarified test sample, for example, by centrifugation and collection of a supernatant comprising the clarified test sample. In some embodiments of any of the aspects, a test sample can be a pre-processed test sample, for example, supernatant or filtrate resulting from a treatment selected from the group consisting of centrifugation, filtration, thawing, purification, and any combinations thereof. In some embodiments of any of the aspects, the test sample can be treated with a chemical and/or biological reagent. Chemical and/or biological reagents can be employed to protect and/or maintain the stability of the sample, including biomolecules (e.g., nucleic acid and protein) therein, during processing. One exemplary reagent is a protease inhibitor, which is generally used to protect or maintain the stability of protein during processing. The skilled artisan is well aware of methods and processes appropriate for pre-processing of biological samples required for determination of the level of an expression product as described herein.
In some embodiments of any of the aspects, the methods, assays, and systems described herein can further comprise a step of obtaining or having obtained a test sample from a subject. In some embodiments of any of the aspects, the subject is a mammal. In some embodiments of any of the aspects, the subject is a mouse. In some embodiments of any of the aspects, the subject is a mammal. In some embodiments of any of the aspects, the subject is a human. In some embodiments of any of the aspects, the subject can be a subject in need of treatment for (e.g. having or diagnosed as having) Parkinson's Disease, Alzheimer's Disease, COVID-19, SARS-COV, MERS-COV, and related coronavirus infections, as well as infections from common viral and bacterial agents such as influenza, staphalococcus, streptococcus or a subject at risk of or at increased risk of developing Parkinson's Disease, Alzheimer's Disease, COVID-19, SARS-COV, MERS-COV, and related coronavirus infections, as well as infections from common viral and bacterial agents such as influenza, staphalococcus, streptococcus as described elsewhere herein.
In one aspect of any of the embodiments, described herein is a kit comprising primers that hybridize to and permit amplification of at least one odor response gene mRNA. The kit can optionally include primers that hybridize and permit amplication of at least one odorant receptor mRNA, one more more volatilized chemical compounds, a benchmark scent, or an OSN as described herein. The primers and other elements of the kit can be present in separate formulations of the kit, e.g., for separate administration or for mixing prior to administration.
A kit is any manufacture (e.g., a package or container) comprising at least one reagent, e.g., primers that hybridize with at least one odor response gene, the manufacture being promoted, distributed, or sold as a unit for performing the methods described herein. The kits described herein can optionally comprise additional components useful for performing the methods described herein. By way of example, the kit can comprise fluids and compositions (e.g., buffers, needles, syringes etc.) suitable for performing one or more of the administrations according to the methods described herein, an instructional material which describes performance of a method as described herein, and the like. Additionally, the kit may comprise an instruction leaflet.
Several diseases are characterized by a loss or reduction in the sense of smell. Accordingly, the quantitative measures of scent perception used herein can be used to detect or diagnose such diseases or symptoms. In some embodiments of any of the aspects, the method comprises measuring the expression of at least one odor response gene in a first olfactory sensory neuron (OSN) that has been contacted with at least one volatilized chemical compound, wherein the first OSN is obtained from or present in a subject; and determining that the subject has a loss of smell if the expression of the at least one odor response gene in the first OSN is different from the expression in a second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)) that has been contacted with the at least one volatilized chemical compound and obtained from or is present in normal, healthy individual. In some embodiments of any of the aspects, the method comprises measuring the expression of at least one odor response gene in a first olfactory sensory neuron (OSN) that has been contacted with at least one volatilized chemical compound, wherein the first OSN is obtained from or present in a subject; and determining that the subject has a loss of smell if the expression of the at least one odor response gene in the first OSN is lower than from the expression in a second OSN (optionally of the same subtype (e.g., expressing the same odorant receptor as the first OSN)) that has been contacted with the at least one volatilized chemical compound and obtained from or is present in normal, healthy individual. In some embodiments of any of the aspects, the loss of smell indicates the subject has or is at risk of having Parkinson's Disease, Alzheimer's Disease, COVID-19, SARS-COV, MERS-COV, and related coronavirus infections, as well as infections from common viral and bacterial agents such as influenza, staphalococcus, streptococcus. In some embodiments of any of the aspects, the method further comprises administering a treatment for Parkinson's Disease, Alzheimer's Disease, COVID-19, SARS-COV, MERS-COV, and related coronavirus infections, as well as infections from common viral and bacterial agents such as influenza, staphalococcus, streptococcus, if the subject is determined to have a loss of smell. Treatments for such conditions are known in the art and include monoclonal antibody therapties, antivirals, antibiotics, antidepressants (eg., rasagiline, selegiline, etc), anti-tremor medications, dopamine promoters, memantine, rivastigmine, galantamine, etc.
In one respect, the present invention relates to the herein described compositions, methods, and respective component(s) thereof, as essential to the technology, yet open to the inclusion of unspecified elements, essential or not (“comprising). In some embodiments of any of the aspects, other elements to be included in the description of the composition, method or respective component thereof are limited to those that do not materially affect the basic and novel characteristic(s) of the technology (e.g., the composition, method, or respective component thereof “consists essentially of” the elements described herein). This applies equally to steps within a described method as well as compositions and components therein. In other embodiments of any of the aspects, the compositions, methods, and respective components thereof, described herein are intended to be exclusive of any element not deemed an essential element to the component, composition or method (e.g., the composition, method, or respective component thereof “consists of” the elements described herein). This applies equally to steps within a described method as well as compositions and components therein.
For convenience, the meaning of some terms and phrases used in the specification, examples, and appended claims, are provided below. Unless stated otherwise, or implicit from context, the following terms and phrases include the meanings provided below. The definitions are provided to aid in describing particular embodiments, and are not intended to limit the claimed invention, because the scope of the invention is limited only by the claims. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. If there is an apparent discrepancy between the usage of a term in the art and its definition provided herein, the definition provided within the specification shall prevail.
For convenience, certain terms employed herein, in the specification, examples and appended claims are collected here.
The terms “decrease”, “reduced”, “reduction”, or “inhibit” are all used herein to mean a decrease by a statistically significant amount. In some embodiments, “reduce,” “reduction” or “decrease” or “inhibit” typically means a decrease by at least 10% as compared to a reference level (e.g, the absence of a given treatment or agent) and can include, for example, a decrease by at least about 10%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 98%, at least about 99%, or more. As used herein, “reduction” or “inhibition” does not encompass a complete inhibition or reduction as compared to a reference level. “Complete inhibition” is a 100% inhibition as compared to a reference level.
The terms “increased”, “increase”, “enhance”, or “activate” are all used herein to mean an increase by a statically significant amount. In some embodiments, the terms “increased”, “increase”, “enhance”, or “activate” can mean an increase of at least 10% as compared to a reference level, for example an increase of at least about 20%, or at least about 30%, or at least about 40%, or at least about 50%, or at least about 60%, or at least about 70%, or at least about 80%, or at least about 90% or up to and including a 100% increase or any increase between 10-100% as compared to a reference level, or at least about a 2-fold, or at least about a 3-fold, or at least about a 4-fold, or at least about a 5-fold or at least about a 10-fold increase, or any increase between 2-fold and 10-fold or greater as compared to a reference level. In the context of a marker or symptom, an “increase” is a statistically significant increase in such level.
As used herein, a “subject” means a human or animal. Usually the animal is a vertebrate such as a primate, rodent, domestic animal or game animal. Primates include chimpanzees, cynomologus monkeys, spider monkeys, and macaques, e.g., Rhesus. Rodents include mice, rats, woodchucks, ferrets, rabbits and hamsters. Domestic and game animals include cows, horses, pigs, deer, bison, buffalo, feline species. e.g., domestic cat, canine species, e.g., dog, fox, wolf, avian species, e.g., chicken, emu, ostrich, and fish, e.g., trout, catfish and salmon. In some embodiments, the subject is a mammal, e.g., a primate. e.g., a human. The terms, “individual,” “patient” and “subject” are used interchangeably herein.
Preferably, the subject is a mammal. The mammal can be a human, non-human primate, mouse, rat, dog, cat, horse, or cow, but is not limited to these examples. Mammals other than humans can be advantageously used as subjects. A subject can be male or female.
A subject can be one who has been previously diagnosed with or identified as suffering from or having a condition in need of treatment (e.g., being unable to detect odor) or one or more complications related to such a condition, and optionally, have already undergone treatment for the condition or the one or more complications related to the condition. Alternatively, a subject can also be one who has not been previously diagnosed as having the condition or one or more complications related to the condition. For example, a subject can be one who exhibits one or more risk factors for the condition or one or more complications related to the condition or a subject who does not exhibit risk factors.
A “subject in need” of treatment for a particular condition can be a subject having that condition, diagnosed as having that condition, or at risk of developing that condition.
As used herein, the terms “protein” and “polypeptide” are used interchangeably herein to designate a series of amino acid residues, connected to each other by peptide bonds between the alpha-amino and carboxy groups of adjacent residues. A protein or polypeptide can be produced naturally or in vitro by synthetic means. The terms “protein”, and “polypeptide” refer to a polymer of amino acids, including modified amino acids (e.g., phosphorylated, glycated, glycosylated, etc.) and amino acid analogs, regardless of its size or function. “Protein” and “polypeptide” are often used in reference to relatively large polypeptides, whereas the term “peptide” is often used in reference to small polypeptides, but usage of these terms in the art overlaps. Polypeptides can also undergo maturation or post-translational modification processes that may include, but are not limited to: glycosylation, proteolytic cleavage, lipidization, signal peptide cleavage, pro peptide cleavage, phosphorylation, and such like.
The terms “protein” and “polypeptide” are used interchangeably herein when referring to a gene product and fragments thereof. Thus, exemplary polypeptides or proteins include gene products, naturally occurring proteins, homologs, orthologs, paralogs, fragments and other equivalents, variants, fragments, and analogs of the foregoing. The terms also refer to fragments or variants of the polypeptide that maintain at least 50% of the activity or effect, e.g. odor detection by the full length polypeptide. Conservative substitution variants that maintain the activity of a wildtype protein will include a conservative substitution as defined herein. The identification of amino acids most likely to be tolerant of conservative substitution while maintaining at least 50% of the activity of the wildtype is guided by, for example, sequence alignment with homologs or paralogs from other species. Amino acids that are identical between homologs are less likely to tolerate change, while those showing conservative differences are obviously much more likely to tolerate conservative change in the context of an artificial variant. Similarly, positions with non-conservative differences are less likely to be critical to function and more likely to tolerate conservative substitution in an artificial variant. Variants, fragments, and/or fusion proteins can be tested for activity, for example, by administering the variant to an appropriate animal model of loss of smell as described herein.
In some embodiments, a polypeptide, can be a variant of a sequence described herein. In some embodiments, the variant is a conservative substitution variant. Variants can be obtained by mutations of native nucleotide sequences, for example. A “variant.” as referred to herein, is a polypeptide substantially homologous to a native or reference polypeptide, but which has an amino acid sequence different from that of the native or reference polypeptide because of one or a plurality of deletions, insertions or substitutions. Polypeptide-encoding DNA sequences encompass sequences that comprise one or more additions, deletions, or substitutions of nucleotides when compared to a native or reference DNA sequence, but that encode a variant protein or fragment thereof that retains the relevant biological activity relative to the reference protein, e.g., at least 50% of the wildtype protein. As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters a single amino acid or a small percentage. (i.e., 5% or fewer. e.g. 4% or fewer, or 3% or fewer, or 1% or fewer) of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the substitution of an amino acid with a chemically similar amino acid. It is contemplated that some changes can potentially improve the relevant activity, such that a variant, whether conservative or not, has more than 100% of the activity of the wildtype, e.g., 110%, 125%, 150%, 175%, 200%, 500%, 1000% or more.
One method of identifying amino acid residues which can be substituted is to align, for example, human to a homolog from one or more non-human species. Alignment can provide guidance regarding not only residues likely to be necessary for function but also, conversely, those residues likely to tolerate change. Where, for example, an alignment shows two identical or similar amino acids at corresponding positions, it is more likely that that site is important functionally. Where, conversely, alignment shows residues in corresponding positions to differ significantly in size, charge, hydrophobicity, etc., it is more likely that that site can tolerate variation in a functional polypeptide. The variant amino acid or DNA sequence can be at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or more, identical to a native or reference sequence, e.g., a nucleic acid encoding one of those amino acid sequences. The degree of homology (percent identity) between a native and a mutant sequence can be determined, for example, by comparing the two sequences using freely available computer programs commonly employed for this purpose on the world wide web. The variant amino acid or DNA sequence can be at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or more, similar to the sequence from which it is derived (referred to herein as an “original” sequence). The degree of similarity (percent similarity) between an original and a mutant sequence can be determined, for example, by using a similarity matrix. Similarity matrices are well known in the art and a number of tools for comparing two sequences using similarity matrices are freely available online, e.g., BLASTp or BLASTn (available on the world wide web at blast.ncbi.nlm.nih.gov), with default parameters set.
In the various embodiments described herein, it is further contemplated that variants (naturally occurring or otherwise), alleles, homologs, conservatively modified variants, and/or conservative substitution variants of any of the particular polypeptides described are encompassed. As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters a single amino acid or a small percentage of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the substitution of an amino acid with a chemically similar amino acid and retains the desired activity of the polypeptide. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles consistent with the disclosure.
A given amino acid can be replaced by a residue having similar physiochemical characteristics, e.g., substituting one aliphatic residue for another (such as Ile. Val. Leu, or Ala for one another), or substitution of one polar residue for another (such as between Lys and Arg: Glu and Asp; or Gin and Asn). Other such conservative substitutions, e.g., substitutions of entire regions having similar hydrophobicity characteristics, are well known. Polypeptides comprising conservative amino acid substitutions can be tested in any one of the assays described herein to confirm that a desired activity. e.g., the activity and/or specificity of a native or reference polypeptide is retained.
A given amino acid can be replaced by a residue having similar physiochemical characteristics, e.g., substituting one aliphatic residue for another (such as Ile, Val, Leu, or Ala for one another), or substitution of one polar residue for another (such as between Lys and Arg; Glu and Asp; or Gin and Asn). Other such conservative substitutions, e.g., substitutions of entire regions having similar hydrophobicity characteristics, are well known. Polypeptides comprising conservative amino acid substitutions can be tested in any one of the assays described herein to confirm that a desired activity of a native or reference polypeptide is retained. Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles consistent with the disclosure.
Amino acids can be grouped according to similarities in the properties of their side chains (in A. L. Lehninger, in Biochemistry, second ed., pp. 73-75. Worth Publishers. New York (1975)): (1) non-polar: Ala (A), Val (V), Leu (L), Ile (I), Pro (P), Phe (F), Trp (W), Met (M); (2) uncharged polar; Gly (G), Ser(S), Thr (T), Cys (C), Tyr (Y), Asn (N), Gln (Q); (3) acidic: Asp (D), Glu (E); (4) basic; Lys (K), Arg (R), His (H), Alternatively, naturally occurring residues can be divided into groups based on common side-chain properties: (1) hydrophobic: Norleucine, Met, Ala, Val, Leu, Ile; (2) neutral hydrophilic: Cys, Ser, Thr, Asn, Gln; (3) acidic: Asp, Glu; (4) basic: His, Lys, Arg; (5) residues that influence chain orientation: Gly, Pro; (6) aromatic: Trp, Tyr, Phe. Non-conservative substitutions will entail exchanging a member of one of these classes for another class. Particular conservative substitutions include, for example: Ala into Gly or into Ser; Arg into Lys; Asn into Gln or into His; Asp into Glu; Cys into Ser; Gln into Asn; Glu into Asp; Gly into Ala or into Pro; His into Asn or into Gln; Ile into Leu or into Val; Leu into Ile or into Val; Lys into Arg, into Gln or into Glu; Met into Leu, into Tyr or into Ile; Phe into Met, into Leu or into Tyr; Ser into Thr; Thr into Ser; Trp into Tyr; Tyr into Trp; and/or Phe into Val, into Ile or into Leu. Typically conservative substitutions for one another also include: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine(S), Threonine (T); and 8) Cysteine (C), Methionine (M) (sec. e.g., Creighton, Proteins (1984)).
In some embodiments, the polypeptide described herein (or a nucleic acid encoding such a polypeptide) can be a functional fragment of one of the amino acid sequences described herein. As used herein, a “functional fragment” is a fragment or segment of a peptide which retains at least 50% of the wildtype reference polypeptide's activity according to the assays described below herein. A functional fragment can comprise conservative substitutions of the sequences disclosed herein.
In some embodiments, the polypeptide described herein can be a variant of a sequence described herein. In some embodiments, the variant is a conservatively modified variant. Conservative substitution variants can be obtained by mutations of native nucleotide sequences, for example, A “variant,” as referred to herein, is a polypeptide substantially homologous to a native or reference polypeptide, but which has an amino acid sequence different from that of the native or reference polypeptide because of one or a plurality of deletions, insertions or substitutions. Variant polypeptide-encoding DNA sequences encompass sequences that comprise one or more additions, deletions, or substitutions of nucleotides when compared to a native or reference DNA sequence, but that encode a variant protein or fragment thereof that retains activity. A wide variety of PCR-based site-specific mutagenesis approaches are known in the art and can be applied by the ordinarily skilled artisan.
In some embodiments, a polypeptide can comprise one or more amino acid substitutions or modifications. In some embodiments, the substitutions and/or modifications can prevent or reduce proteolytic degradation and/or prolong half-life of the polypeptide in a subject. In some embodiments, a polypeptide can be modified by conjugating or fusing it to other polypeptide or polypeptide domains such as, by way of non-limiting example, transferrin (WO06096515A2), albumin (Yeh et al., 1992), growth hormone (US2003104578AA); cellulose (Levy and Shoseyov. 2002); and/or Fc fragments (Ashkenazi and Chamow, 1997). The references in the foregoing paragraph are incorporated by reference herein in their entireties.
In some embodiments, a polypeptide as described herein can comprise at least one peptide bond replacement. A polypeptide as described herein can comprise one type of peptide bond replacement or multiple types of peptide bond replacements, e.g., 2 types, 3 types, 4 types, 5 types, or more types of peptide bond replacements. Non-limiting examples of peptide bond replacements include urea, thiourea, carbamate, sulfonyl urea, trifluoroethylamine, ortho-(aminoalkyl)-phenylacetic acid, para-(aminoalkyl)-phenylacetic acid, meta-(aminoalkyl)-phenylacetic acid, thioamide, tetrazole, boronic ester, olefinic group, and derivatives thereof.
In some embodiments, a polypeptide as described herein can comprise naturally occurring amino acids commonly found in polypeptides and/or proteins produced by living organisms. e.g. Ala (A), Val (V), Leu (L), Ile (I), Pro (P), Phe (F), Trp (W), Met (M), Gly (G), Ser(S), Thr (T), Cys (C), Tyr (Y), Asn (N), Gln (Q), Asp (D), Glu (E), Lys (K), Arg (R), and His (H). In some embodiments, a polypeptide as described herein can comprise alternative amino acids. Non-limiting examples of alternative amino acids include, D-amino acids; beta-amino acids; homocysteine, phosphoserine, phosphothreonine, phosphotyrosine, hydroxyproline, gamma-carboxyglutamate; hippuric acid, octahydroindole-2-carboxylic acid, statine, 1,2,3,4,-tetrahydroisoquinoline-3-carboxylic acid, penicillamine (3-mercapto-D-valine), ornithine, citruline, alpha-methyl-alanine, para-benzoylphenylalanine, para-amino phenylalanine, p-fluorophenylalanine, phenylglycine, propargylglycine, sarcosine, and tert-butylglycine), diaminobutyric acid, 7-hydroxy-tetrahydroisoquinoline carboxylic acid, naphthylalanine, biphenylalanine, cyclohexylalanine, amino-isobutyric acid, norvaline, norleucine, tert-leucine, tetrahydroisoquinoline carboxylic acid, pipecolic acid, phenylglycine, homophenylalanine, cyclohexylglycine, dehydroleucine, 2,2-diethylglycine, 1-amino-1-cyclopentanecarboxylic acid, 1-amino-1-cyclohexanecarboxylic acid, amino-benzoic acid, amino-naphthoic acid, gamma-aminobutyric acid, difluorophenylalanine, nipecotic acid, alpha-amino butyric acid, thienyl-alanine, t-butylglycine, trifluorovaline; hexafluoroleucine; fluorinated analogs; azide-modified amino acids; alkyne-modified amino acids; cyano-modified amino acids; and derivatives thereof.
As used herein, the term “nucleic acid” or “nucleic acid sequence” refers to any molecule, preferably a polymeric molecule, incorporating units of ribonucleic acid, deoxyribonucleic acid or an analog thereof. The nucleic acid can be either single-stranded or double-stranded. A single-stranded nucleic acid can be one nucleic acid strand of a denatured double-stranded DNA. Alternatively, it can be a single-stranded nucleic acid not derived from any double-stranded DNA. In one aspect, the nucleic acid can be DNA. In another aspect, the nucleic acid can be RNA. Suitable DNA can include, e.g., genomic DNA or cDNA. Suitable RNA can include, e.g., mRNA.
Nucleic acids may further include modified DNA or RNA, for example DNA or RNA that has been methylated, or RNA that has been subject to post-translational modification, for example 5′-capping with 7-methylguanosine, 3′-processing such as cleavage and polyadenylation, and splicing. Nucleic acids may also include synthetic nucleic acids (XNA), such as hexitol nucleic acid (HNA), cyclohexene nucleic acid (CeNA), threose nucleic acid (TNA), glycerol nucleic acid (GNA), locked nucleic acid (LNA) and peptide nucleic acid (PNA).
In some embodiments, the expression of a biomarker(s), target(s), or gene/polypeptide described herein is/are tissue-specific. In some embodiments, the expression of a biomarker(s), target(s), or gene/polypeptide described herein is/are global. In some embodiments, the expression of a biomarker(s), target(s), or gene/polypeptide described herein is systemic.
As used herein, the terms “treat,” “treatment,” “treating,” or “amelioration” refer to therapeutic treatments, wherein the object is to reverse, alleviate, ameliorate, inhibit, slow down or stop the progression or severity of a condition associated with a disease or disorder, e.g., an infectious disease. The term “treating” includes reducing or alleviating at least one adverse effect or symptom of a condition, disease or disorder associated with an infectious disease. Treatment is generally “effective” if one or more symptoms or clinical markers are reduced. Alternatively, treatment is “effective” if the progression of a disease is reduced or halted. That is. “treatment” includes not just the improvement of symptoms or markers, but also a cessation of, or at least slowing of, progress or worsening of symptoms compared to what would be expected in the absence of treatment. Beneficial or desired clinical results include, but are not limited to, alleviation of one or more symptom(s), diminishment of extent of disease, stabilized (i.e., not worsening) state of disease, delay or slowing of disease progression, amelioration or palliation of the disease state, remission (whether partial or total), and/or decreased mortality, whether detectable or undetectable. The term “treatment” of a disease also includes providing relief from the symptoms or side-effects of the disease (including palliative treatment).
In some embodiments of any of the aspects, described herein is a prophylactic method of treatment. As used herein “prophylactic” refers to the timing and intent of a treatment relative to a disease or symptom, that is, the treatment is administered prior to clinical detection or diagnosis of that particular disease or symptom in order to protect the patient from the disease or symptom. Prophylactic treatment can encompass a reduction in the severity or speed of onset of the disease or symptom, or contribute to faster recovery from the disease or symptom. Accordingly, the methods described herein can be prophylactic relative to a loss of smell or a disease characterized by a loss of smell. In some embodiments of any of the aspects, prophylactic treatment is not prevention of all symptoms or signs of a disease.
As used herein, the term “administering.” refers to the placement of a compound as disclosed herein into a subject by a method or route which results in at least partial delivery of the agent at a desired site. Pharmaceutical compositions comprising the compounds disclosed herein can be administered by any appropriate route which results in an effective treatment in the subject. In some embodiments, administration comprises physical human activity, e.g., an injection, act of ingestion, an act of application, and/or manipulation of a delivery device or machine. Such activity can be performed, e.g., by a medical professional and/or the subject being treated.
As used herein. “contacting” refers to any suitable means for delivering, or exposing, an agent to at least one cell. Exemplary delivery methods include, but are not limited to, direct delivery to cell culture medium, perfusion, injection, or other delivery method well known to one skilled in the art. In some embodiments, contacting comprises physical human activity, e.g., an injection: an act of dispensing, mixing, and/or decanting; and/or manipulation of a delivery device or machine.
The term “statistically significant” or “significantly.” refers to statistical significance and generally means a two standard deviation (2SD) or greater difference.
Other than in the operating examples, or where otherwise indicated, all numbers expressing quantities of ingredients or reaction conditions used herein should be understood as modified in all instances by the term “about.” The term “about” when used in connection with percentages can mean±1%.
Other than in the operating examples, or where otherwise indicated, all numbers expressing quantities of ingredients or reaction conditions used herein should be understood as modified in all instances by the term “about.” The term “about” when used in connection with percentages can mean±1%.
As used herein, the term “comprising” means that other elements can also be present in addition to the defined elements presented. The use of “comprising” indicates inclusion rather than limitation.
The term “consisting of” refers to compositions, methods, and respective components thereof as described herein, which are exclusive of any element not recited in that description of the embodiment.
As used herein the term “consisting essentially of” refers to those elements required for a given embodiment. The term permits the presence of additional elements that do not materially affect the basic and novel or functional characteristic(s) of that embodiment of the invention.
As used herein, the term “corresponding to” refers to an amino acid or nucleotide at the enumerated position in a first polypeptide or nucleic acid, or an amino acid or nucleotide that is equivalent to an enumerated amino acid or nucleotide in a second polypeptide or nucleic acid. Equivalent enumerated amino acids or nucleotides can be determined by alignment of candidate sequences using degree of homology programs known in the art, e.g., BLAST.
As used herein, the term “specific binding” refers to a chemical interaction between two molecules, compounds, cells and/or particles wherein the first entity binds to the second, target entity with greater specificity and affinity than it binds to a third entity which is a non-target. In some embodiments, specific binding can refer to an affinity of the first entity for the second target entity which is at least 10 times, at least 50 times, at least 100 times, at least 500 times, at least 1000 times or greater than the affinity for the third nontarget entity. A reagent specific for a given target is one that exhibits specific binding for that target under the conditions of the assay being utilized.
The singular terms “a.” “an.” and “the” include plural referents unless context clearly indicates otherwise. Similarly, the word “or” is intended to include “and” unless the context clearly indicates otherwise. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of this disclosure, suitable methods and materials are described below. The abbreviation. “e.g.” is derived from the Latin exempli gratia, and is used herein to indicate a non-limiting example. Thus, the abbreviation “e.g.” is synonymous with the term “for example.”
Groupings of alternative elements or embodiments of the invention disclosed herein are not to be construed as limitations. Each group member can be referred to and claimed individually or in any combination with other members of the group or other elements found herein. One or more members of a group can be included in, or deleted from, a group for reasons of convenience and/or patentability. When any such inclusion or deletion occurs, the specification is herein deemed to contain the group as modified thus fulfilling the written description of all Markush groups used in the appended claims.
Unless otherwise defined herein, scientific and technical terms used in connection with the present application shall have the meanings that are commonly understood by those of ordinary skill in the art to which this disclosure belongs. It should be understood that this invention is not limited to the particular methodology, protocols, and reagents, etc., described herein and as such can vary. The terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention, which is defined solely by the claims. Definitions of common terms in immunology and molecular biology can be found in The Merck Manual of Diagnosis and Therapy, 20th Edition, published by Merck Sharp & Dohme Corp., 2018 (ISBN 0911910190, 978-0911910421); Robert S. Porter et al. (eds.). The Encyclopedia of Molecular Cell Biology and Molecular Medicine, published by Blackwell Science Ltd., 1999-2012 (ISBN 9783527600908); and Robert A. Meyers (ed.). Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers. Inc., 1995 (ISBN 1-56081-569-8); Immunology by Werner Luttmann, published by Elsevier, 2006; Janeway's Immunobiology. Kenneth Murphy. Allan Mowat. Casey Weaver (eds.). W. W. Norton & Company, 2016 (ISBN 0815345054, 978-0815345053); Lewin's Genes XI, published by Jones & Bartlett Publishers, 2014 (ISBN-1449659055); Michael Richard Green and Joseph Sambrook. Molecular Cloning: A Laboratory Manual, 4th ed., Cold Spring Harbor Laboratory Press. Cold Spring Harbor, N.Y., USA (2012) (ISBN 1936113414); Davis et al., Basic Methods in Molecular Biology. Elsevier Science Publishing. Inc., New York. USA (2012) (ISBN 044460149X); Laboratory Methods in Enzymology: DNA. Jon Lorsch (ed.) Elsevier, 2013 (ISBN 0124199542); Current Protocols in Molecular Biology (CPMB), Frederick M. Ausubel (ed.). John Wiley and Sons, 2014 (ISBN 047150338X, 9780471503385). Current Protocols in Protein Science (CPPS), John E. Coligan (ed.). John Wiley and Sons. Inc., 2005; and Current Protocols in Immunology (CPI) (John E. Coligan. ADA M Kruisbeck. David H Margulies. Ethan M Shevach. Warren Strobe. (eds.) John Wiley and Sons, Inc., 2003 (ISBN 0471142735, 9780471142737), the contents of which are all incorporated by reference herein in their entireties.
Other terms are defined herein within the description of the various aspects of the invention.
All patents and other publications: including literature references, issued patents, published patent applications, and co-pending patent applications: cited throughout this application are expressly incorporated herein by reference for the purpose of describing and disclosing, for example, the methodologies described in such publications that might be used in connection with the technology described herein. These publications are provided solely for their disclosure prior to the filing date of the present application. Nothing in this regard should be construed as an admission that the inventors are not entitled to antedate such disclosure by virtue of prior invention or for any other reason. All statements as to the date or representation as to the contents of these documents is based on the information available to the applicants and does not constitute any admission as to the correctness of the dates or contents of these documents.
The description of embodiments of the disclosure is not intended to be exhaustive or to limit the disclosure to the precise form disclosed. While specific embodiments of, and examples for, the disclosure are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the disclosure, as those skilled in the relevant art will recognize. For example, while method steps or functions are presented in a given order, alternative embodiments may perform functions in a different order, or functions may be performed substantially concurrently. The teachings of the disclosure provided herein can be applied to other procedures or methods as appropriate. The various embodiments described herein can be combined to provide further embodiments. Aspects of the disclosure can be modified, if necessary, to employ the compositions, functions and concepts of the above references and application to provide yet further embodiments of the disclosure. Moreover, due to biological functional equivalency considerations, some changes can be made in protein structure without affecting the biological or chemical action in kind or amount. These and other changes can be made to the disclosure in light of the detailed description. All such modifications are intended to be included within the scope of the appended claims.
Specific elements of any of the foregoing embodiments can be combined or substituted for elements in other embodiments. Furthermore, while advantages associated with certain embodiments of the disclosure have been described in the context of these embodiments, other embodiments may also exhibit such advantages, and not all embodiments need necessarily exhibit such advantages to fall within the scope of the disclosure.
In some embodiments, the present technology may be defined in any of the following numbered paragraphs:
1. A method comprising:
2. The method of paragraph 1, wherein the method further comprises measuring the expression of at least one odor response gene in a second OSN that has not been contacted with the at least one volatized chemical compound.
3. The method of any one of the preceding paragraphs, further comprising determining that the first subtype expresses an odorant receptor that binds to the at least one volatized chemical compound if the expression of the at least one odor response gene in the first OSN is different from a reference level of expression in a second OSN that has not been contacted with the at least one volatized chemical compound.
4. A method comprising:
5. The method of any one of the preceding paragraphs, wherein the at least one odor response gene is selected from the group consisting of:
6. The method of any one of the preceding paragraphs, wherein the at least one odor response gene is selected from the group consisting of:
7. The method of any one of the preceding paragraphs, wherein the at least one odor response gene is selected from the group consisting of:
8. The method of any one of the preceding paragraphs, wherein the at least one odor response gene is selected from the group consisting of:
9. The method of any one of the preceding paragraphs, wherein the at least one odor response gene is selected from the group consisting of:
10. The method of any one of the preceding paragraphs, wherein the at least one odor response gene comprises wat least one odor response gene, at least two odor response genes, at least three odor response genes, at least five odor response genes, at least ten odor response genes, at least fifty odor response genes, or at least one hundred odor response genes.
11. The method of any one of preceding paragraphs, further comprising measuring the expression of at least one odorant receptor gene.
12. The method of any one of the preceding paragraphs, wherein the expression of the at least one odor response gene is measured 2 or more hours after contacting the OSN with the at least one volatized chemical compound.
13. The method of any one of the preceding paragraphs, wherein the expression of the at least one odor response gene is measured 3 days or more after contacting the OSN with the at least one volatized chemical compound.
14. The method of any one of the preceding paragraphs, wherein the first subtype is an odorant receptor subtype.
15. The method of any one of the preceding paragraphs, wherein the first subtype expresses one odorant receptor.
16. The method of any one of the preceding paragraphs, wherein the odorant receptor is a human odorant receptor.
17. The method of any one of the preceding paragraphs, wherein the odorant receptor is a murine odorant receptor.
18. The method of paragraph 16, wherein the human odorant receptor is selected from the group consisting of:
19. The method of any one of the preceding paragraphs, wherein the odorant receptor is selected from the group consisting of:
20. The method of paragraph any one of the preceding paragraphs, wherein the odorant receptor is selected from the group consisting of:
21. The method of paragraph 17, wherein the murine odorant receptor is selected from the group consisting of:
22. The method of any one of the preceding paragraphs, wherein the first OSN and/or the second OSN expresses at least one olfactory receptor.
23. The method of any one of the preceding paragraphs, wherein the first OSN and/or the second OSN is in vitro during the contacting.
24. The method of any one of the preceding paragraphs, wherein the first OSN and/or the second OSN is in vivo during the contacting.
25. The method of any one of the preceding paragraphs, wherein the contacting the first OSN comprises contacting a subject comprising the first OSN with air comprising the at least one volatilized chemical compound.
26. The method of any one of the preceding paragraphs, wherein the not contacting the second OSN comprises:
27. The method of any one of the preceding paragraphs, wherein the second OSN is not contacted with the at least one volatilized chemical compound for at least two hours prior to the measuring.
28. The method of any one of the preceding paragraphs, wherein measuring the expression comprises measuring the level of one or more mRNAs.
29. The method of paragraph 28, wherein measuring the level of one or more mRNAs comprises measuring the level of the one or more mRNAs in a single cell.
30. The method of paragraph 29, wherein measuring the level of one or more mRNAs comprises single-cell sequencing.
31. The method of any one of the preceding paragraphs, further comprising measuring the expression of at least one odor response gene in a plurality of OSNs contacted with at least one volatilized chemical compound, each of the plurality of OSNs expressing a different odorant receptor.
32. The method of any one of the preceding paragraphs, further comprising measuring the expression of at least one odor response gene in a plurality of OSNs, each of the plurality of OSNs having been contacted with a different at least one volatilized chemical compound.
33. The method of any one of paragraphs 31-32, wherein each of the plurality of OSNs is cultured in different wells of a multi-well plate or different cell culture containers.
34. The method of any one of paragraphs 31-32, wherein the plurality of OSNs are cultured in the same well of a multi-well plate or the same cell culture container.
35. The method of any one of paragraphs 33-34, wherein each well of a multi-well plate or each cell culture container is contacted with the same at least one volatilized chemical compound.
36. The method of any one of the preceding paragraphs, wherein each well of a multi-well plate or cell culture container is contacted with a different at least one volatilized chemical compound.
37. The method of any one of paragraphs 31-32, wherein each of the plurality of OSNs is present in a different subject during the contacting.
38. The method of any one of paragraphs 31-32, wherein the plurality of OSNs are present in the same subject during the contacting.
39. The method of any one of paragraphs 37-38, wherein each subject is contacted with the same at least one volatilized chemical compound.
40. The method of any one of paragraphs 37-38, wherein each subject is contacted with a different at least one volatilized chemical compound.
41. The method of any one of the preceding paragraphs, further comprising obtaining a sample comprising the first OSN and/or second OSN from the nose of a subject after the contacting and before the measuring.
42. The method of any one of the preceding paragraphs, wherein the sample is obtained from the olfactory bulb, the olfactory epithelium, the nerve endings, and/or the nasal cavity.
43. The method of any one of the preceding paragraphs, wherein the sample comprises epithelial cells, endothelial cells, olfactory sensory neurons, goblet cells, cilia, and/or mast cells.
44. The method of any one of the preceding paragraphs, wherein the subject is a mammal.
45. The method of any one of the preceding paragraphs, wherein the subject is a mouse or human.
46. The method of any one of the preceding paragraphs, further comprising comparing the expression of the at least one odor response gene in the first OSN to the expression of the at least one odor response genes in an OSN of the first subtype that has been contacted with a benchmark scent.
47. A method of identifying an odorant receptor ligand, or measuring the strength of an odorant receptor ligand's binding, the method comprising
48. The method of paragraph 47, further comprising measuring the expression of at least odor response gene in a plurality of OSNs, each of the plurality of OSNs having been contacted with a different candidate odorant receptor ligand.
49. The method of paragraphs 47 or 48, wherein the magnitude of the difference in expression of the at least one odor response gene in the first OSN from the reference level of expression in an OSN expressing the first odorant receptor that has not been contacted with the candidate odorant receptor ligand correlates to the strength of the binding of the candidate odorant receptor ligand and the odorant receptor.
50. A method comprising
51. The method of paragraph 50, wherein the loss of smell indicates the subject has or is at risk of having Parkinson's Disease, Alzheimer's Disease, COVID-19, SARS-COV, MERS-COV, coronavirus, influenza, staphalococcus, and streptococcus.
52. The method of any one of paragraphs 50 or 51, further comprising administering a treatment for Parkinson's Disease, Alzheimer's Disease, COVID-19, SARS-COV, MERS-COV, coronavirus, influenza, staphalococcus, and streptococcus if the subject is determined to have a loss of smell.
The technology described herein is further illustrated by the examples which in no way should be construed as being further limiting.
Animals traversing different environments encounter both stable background stimuli and novel cues, which are thought to be detected by primary sensory neurons and then distinguished by downstream brain circuits. Here it is shown that that each of the ˜1000 olfactory sensory neuron (OSN) subtypes in the mouse harbors a distinct transcriptome whose content is precisely determined by interactions between its odorant receptor and the environment. This transcriptional variation is systematically organized to support sensory adaptation: expression levels of more than 70 genes relevant to transforming odors into spikes continuously vary across OSN subtypes, dynamically adjust to new environments over hours, and accurately predict acute OSN-specific odor responses. The sensory periphery therefore separates salient signals from predictable background via a transcriptional rheostat whose moment-to-moment state reflects the past and constrains the future; these findings show a general model in which structured transcriptional variation within a cell type reflects individual experience.
Sensory adaptation allows neurons and networks to minimize responses to background stimuli, thereby building more efficient neural codes that emphasize surprising or novel information (Attneave, 1954; Barlow, 1961; Benda, 2021; Weber et al., 2019). Rapid (milliseconds to seconds) sensory adaptation via post-translational mechanisms occurs in most sensory neurons, and plays an important role in shaping peripheral responses to highly dynamic stimuli (Kostal et al., 2008; Martelli and Storace, 2021; Moore, 1994). However, it is not clear how sensory neurons adapt at the longer timescales over which animals traverse different environments—and therefore encounter different background stimuli—as is common during a typical circadian cycle.
Activity-dependent transcription evolves over minutes to hours and therefore in principle could support longer-term adaptation (Yap and Greenberg, 2018). For example, mature brain neurons are thought to use the regulated transcription of synaptic proteins and ion channels to homeostatically maintain their firing rates within a narrow target range (Davis, 2006, 2013; Marder, 2011; Marder and Goaillard, 2006; Turrigiano, 2011; Turrigiano, 1999; Turrigiano, 2017). If similar transcriptional mechanisms underlic adaptation in the periphery, then sensory neurons of a given type would be expected to occupy a range of transcriptional states, with the specific state adopted by any given neuron both reflecting its prior activity history and predicting its future responses to sensory inputs (Tyssowski and Gray, 2019). However, the relationship between activity-dependent gene regulation and functional adaptation remains elusive, in part because it has not yet been shown in either peripheral or central neurons that analog changes in activity yield proportional, bidirectional and systematic changes in the expression of genes that tune neural function (Davis, 2013).
There are ˜1000 different olfactory sensory neuron (OSN) subtypes in the mouse, each of which can be identified across individuals based upon its expression of a single odorant receptor (OR) (Monahan and Lomvardas, 2015). ORs are G-protein coupled receptors that transduce odor binding into an influx of calcium, which leads to action potentials. The expressed OR endows each OSN with a specific level of neural activity that depends upon the odors present within a given environment. Although post-translational mechanisms sculpt OSN responses to rapidly fluctuating odor plumes, at longer timescales it is thought that OSNs faithfully report odor-receptor interactions to the brain. For example, OSN responses to odors (as assessed by functional imaging of OSN axons in the olfactory bulb) appear similar in individual mice tested at different times, across different mice, and in mice before and after training to associate a test odorant with a reward (Bozza et al., 2004; Chu et al., 2017; Kato et al., 2012; Rubin and Katz, 1999; Soucy et al., 2009; Spors and Grinvald, 2002; Wachowiak and Cohen, 2001). In contrast, olfactory bulb and cortical neurons adapt to the repeated presentation of an odor on timescales of minutes (Best et al., 2005; Cleland and Linster, 2005; McNamara et al., 2008; Wilson, 1998; Wilson, 2000; Yadon and Wilson, 2005). These findings indicate that ORs confer stable odor response properties upon OSNs, while the brain separates salient odors from background; this idea has pervasively influenced theoretical and computational models of odor coding (Brann and Datta, 2020; Chong et al., 2020; Grabska-Barwińska et al., 2016; Hopfield, 1991; Litwin-Kumar et al., 2017; Schaffer et al., 2018; Teşilcanu et al., 2019; Wilson et al., 2017; Zwieker et al., 2016).
Consistent with this view, transcriptional analyses indicate that the genes expressed by mature OSNs are largely similar across subtypes, with the exception of the ORs themselves, a handful of genes related to dorsoventral position in the olfactory epithelium, and ˜10 axon guidance genes that enable glomerular targeting (Fletcher et al., 2017; Hanchate et al., 2015; Mori and Sakano, 2011; Nakashima et al., 2013; Scholz et al., 2016; Vihani et al., 2020; Wu et al., 2018). This homogeneity indicates that individual OSN subtypes-which vary widely in their level of activity within a given environment-do not adapt their odor responses by actively modulating gene expression. In contrast, bulk RNA and activity measurements that pool across all OSNs in mice have hinted that OSNs may use transcription to compensate for the chronic absence or presence of activity, although results and interpretations have varied across experiments (Barber and Coppola, 2015; Cadiou et al., 2014; Coppola and Waggener, 2012; Coppola et al., 2006; Fischl et al., 2014; Fitzwater and Coppola, 2020; Hagendorf et al., 2009; Kass et al., 2012; Waggener and Coppola, 2007; Wang et al., 1993; Wang et al., 2017). Resolving these discrepancies—and ultimately testing the hypothesis that dynamic gene expression mediates long-term adaptation in sensory neurons—requires asking how odor-evoked activity influences gene expression within particular OSN subtypes; this level of specificity is essential to clarify whether OSN transcriptomes are changed in an OR- and environment-specific manner: whether these changes are limited to small numbers of genes (as appears to be the case in C. elegans (Cho et al., 2016; Juang et al., 2013; L'Etoile et al., 2002)) or organized into a systematic program of gene expression; and whether environment-dependent transcriptional changes actually adapt OR-specific odor responses.
To address these questions the inventors characterized, via single cell RNA sequencing (scSeq), transcriptional variation across the extraordinarily large array of identifiable OSN subtypes. These experiments reveal that each OSN subtype expresses a distinguishable set of transcripts that is stereotyped across mice within a given environment. The main axis of transcriptional variability includes more than 70 functionally-relevant genes that adaptively attenuate or amplify the transformation of odors into spikes. Furthermore, the expression of these genes coordinately and continuously varies across OSN subtypes, is modulated by specific interactions between ORs and the environment, and predicts OSN responses to new odors. Transcriptional variation among OSNs is therefore systematically organized through a rheostat-like mechanism (akin to a balance control on a stereo) whose setting in each OSN is defined by OR-environment interactions. The findings indicate that the olfactory system uses this transcriptional rheostat to proportionally and bidirectionally adapt to persistent background odors—thereby enabling OSNs to engage in a form of sensory predictive coding—before odor information is transmitted to neural circuits responsible for perception. These data reveal that peripheral odor codes are flexible rather than fixed, and support a broad model in which neurons continuously individualize their transcriptomes to facilitate functional adaptation.
To assess relationships between transcription and function in the peripheral olfactory system scSeq was performed on −770,000 mature mouse OSNs. The initial analysis focused on −40,000 mature OSNs derived from adult mice housed in a typical home cage environment; collectively these OSNs expressed over 1,000 odorant receptors (ORs), with nearly every mature neuron expressing only one OR (
Given the few known transcriptional differences among mature OSNs, it was expected that OR-defined OSN subpopulations would overlap in UMAP space: instead. OSNs expressing each OR were locally clustered and separated from those expressing different ORs, showing that each OSN subtype is transcriptionally unique (
The unexpectedly close relationship between OR expression and OSN transcriptomes could result from shared regulatory mechanisms, or from each OR determining its associated transcriptome. To distinguish these possibilities, scSeq was performed in mice with swapped coding regions for the M72 (Olfr160) and S50 (Olfr545) receptor genes (
OSNs express a limited number of transcription factors and axon guidance genes in an OR-dependent manner during development (Mori and Sakano, 2011; Parrilla et al., 2016). However, genes from these categories were neither required for accurate OR predictions nor were as predictive as either the complete set or the most predictive subset of HVGs (
Decomposing OSN Transcriptomes into Identity and Activity Gene Expression Programs
Alternatively, the diversity in OSN transcriptomes might reflect OR-associated differences in neural activity. To test this possibility, consensus non-negative matrix factorization (cNMF) was performed, which decomposes gene expression patterns observed across OSNs into sets of co-expressed genes called gene expression programs (GEPs) (
Of the 10 identified GEPs, the seven whose constituent genes indicated putative functions were focused on (
It was hypothesized that persistent interactions between each OR and the wide variety of odors in a given environment (like the food, bedding and semiochemical odors present in a typical home cage) might explain the continuous variation of GEPLow or GEPHigh across OSN subtypes. To explore this idea, a metric was developed (simply the difference between GEPHigh and GEPLow) that is provisionally refer to as the environmental state(ES) score, (
Consistent with the hypothesis that GEPHigh and GEPLow genes, and thus ES scores, are sensitive to the chronic activity state of each OSN, artificially lowering (via nares occlusion for a month) or raising (via pulsed optogenetic stimulation for 12 hours) OSN activity systematically decreased or increased ES scores, respectively (
Three additional lines of evidence argue that ES scores reflect specific interactions between each OR and each environment. First, occlusion-dependent changes in ES scores for each OR-defined OSN subtype were negatively correlated with the ES score in the open nostril (
Several alternative explanations were ruled out for environment-dependent changes in OSN ES scores. Odor-dependent modulation in OR expression levels or β-Arresting-mediated OR endocytosis could, in principle, influence ES scores (Ibarra-Soria et al., 2017; Mashukova et al., 2006; von der Weid et al., 2015). However, across all three of the experimental manipulations. OR gene expression and the usage of identity-related GEPs remained constant (
What functions might environment-dependent gene expression confer upon mature OSNs? Inspection revealed that GEPHigh and GEPLow include 73 genes with known or likely roles in shaping OSN sensory responses (e.g., calcium homeostasis. OR signaling, and intrinsic excitability), and an additional −40 genes putatively involved in axon guidance or synaptic transmission (
To test the possibility that the continuous variation in functional gene expression uniquely adapts the odor responses of each OR-defined OSN subtype to each environment, the olfactory epithelium was subjected to Act-Seq, a scSeq variant in which neural activity is read out as a rapid change in gene expression (Wu et al., 2017). Act-Seq reliably identified a subset of OSNs (and therefore ORs) that acutely responded after a two-hour exposure to the volatile odorants acetophenone and octanal, as assessed by immediate early gene (IEG) expression (with similar results obtained using unsupervised analysis of odor-evoked transcriptional changes:
Acetophenone and octanal elicited similar acute transcriptional changes across many genes (
To test whether activation scores also capture differences in binding affinities between different ligands for the same receptor. 2-hydroxyacetophenone (2-HA) was used which, relative to acetophenone, binds more strongly to the M72 receptor (Arncodo et al., 2018; Zhang et al., 2012). Act-Seq revealed that 2-HA indeed elicits higher activation scores than acetophenone in OSNs expressing M72 (
Consistent with the hypothesis that the transcriptome of each OSN shapes its acute odor responses. ES scores were negatively correlated with odor activation: the higher the ES score for a given acetophenone-responsive OSN subtype (identified across experiments via its OR), the lower its acute, acetophenone-evoked response (
If functional gene expression indeed plays a causal role in determining odor responses, then experimentally raising or lowering ES scores should predictably change acute odor responses (
Similarly, switching mice from the home cage into a new odor environment (which bidirectionally modulates ES scores (
It was hypothesized how different environments induce changes in OSN transcriptomes, and that acute activity triggered by new odor environments sculpts long term functional gene expression (
It was observed that the long-term ES score changes observed after chronic acetophenone exposure could be predicted by the baseline ES score of each acetophenone-responsive OSN subtype in the home cage: a similar relationship was observed following optogenetic stimulation (
To test whether the ambient environment also shapes odor codes in the brain, we performed presynaptic functional imaging of OSN axons in the olfactory bulb (
To determine how environmental history influences odor coding, responses were imaged to a panel of 16 odors that broadly activated dorsal glomeruli (
OSNs are thought to primarily communicate information about odor-OR interactions, which is used by the brain to facilitate odor perception (Buck. 1996; Cleland and Linster. 2005; de March et al., 2020; Firestein. 2001; Imai et al., 2010; Sullivan et al., 1995; Touhara. 2002). Here it is found that each of the ˜1000 OR-defined OSN subtypes in the mouse olfactory system adapt to the environment through a common mechanism: a transcriptional rheostat. This rheostat is composed of more than 70 genes relevant to OSN function, whose expression levels vary continuously across OSNs in an OR-dependent manner and adapt dynamically as mice traverse distinct environments. The specific position adopted by this rheostat (i.e., the OR-specific pattern of functional gene expression) predicts the odor response amplitude of a given OSN: furthermore, acute odor responses triggered by novel odor environments predict future patterns of functional gene expression (i.e., the setting of the rheostat).
These observations indicate a closed regulatory loop in which analog changes in environment-driven activity yield proportional transcriptional changes that predictably influence future neural responses to odors. Thus, rather than faithfully reporting the extent of odor-OR interactions, the peripheral olfactory system uses gene expression to instantiate expectation, thereby building odor codes that are personalized by each animal's experience. While fast, post-translational mechanisms for sensory adaptation have been observed across modalities—and many circuit-level mechanisms have been characterized that adapt central sensory responses—the data reveal a systematic, large-scale adaptive transcriptional program that operates at the level of sensory neurons themselves (Benda. 2021; Burns and Baylor. 2001; Fettiplace and Ricci. 2003; Kadohisa and Wilson. 2006; Martelli and Storace. 2021; Wark et al., 2007; Zufall and Leinders-Zufall, 2000).
The data demonstrate that the OR repertoire is densely activated by airflow and odors in our environments, indicating that every OSN must contend with some degree of chronic activation. Because experience bidirectionally alters ES gene expression on timescales of hours, transcription-mediated adaptation helps to center OSNs in their dynamic range as animals traverse different odor contexts, or as odor environments evolve during circadian cycles. As such, the adaptive mechanism identified here serves a distinct purpose from those that rapidly truncate responses to odor filaments (and which support e.g., odor-guided navigation), and from central habituation mechanisms that sparsen odor representations on the minutes-long timescale but cannot restore information that is lost when OSNs operate outside their dynamic range (Kadohisa and Wilson, 2006; Kostal et al., 2008; Lecoq et al., 2009; Martelli and Storace. 2021; Moore. 1994; Nagel and Wilson, 2011; Wilson, 2009).
Transcription factors and axon guidance genes appear insufficient to effectively distinguish OSN transcriptomes or predict OR expression. Although known identity markers have been thought to only assign OSNs to broad dorsal or ventral domains (Bozza et al., 2009; Kobayakawa et al., 2007; Tan and Xie, 2018), the dorsal, ventral, anterior and posterior GEPs are sufficiently diverse and organized to support OR predictions (
There is a close conceptual relationship between the bidirectional sensory adaptation we describe here and firing rate homeostasis (FRH), the process through which neurons in a network maintain stable firing rates by adjusting both synaptic weights and intrinsic neural properties (Davis. 2006, 2013; Marder. 2011; Marder and Goaillard. 2006; Turrigiano, 2011; Turrigiano. 1999; Turrigiano. 2008, 2017). Invertebrate neurons of a given type with identical firing properties can exhibit significant variation in ion channel expression, demonstrating that there are many transcriptional means to achieving the same functional end (Goldman et al., 2001; Marder. 2011; O'Leary et al. 2013; Schulz et al., 2007). In contrast, the experiments in OSNs identify a deterministic and proportional interdependence between environment-dependent activity and functional gene expression patterns. It has been proposed that structured correlations in the expression of functional genes can be used to define different cell types (O'Leary et al., 2013; Schulz et al., 2007); OSNs, whose functional genes exhibit near-perfect expression correlations along a continuum of activity levels, clearly meet this operational definition.
scSeq analysis reveals a startling diversity of transcriptomes associated with neurons of a single putative type (Kim et al., 2019; Li et al., 2017; Tasic et al., 2018). Together, the findings—enabled by the ability to query gene expression and activity in ˜1000 different OSN subtypes across experiments—are consistent with a general model in which neurons systematically modulate their transcriptomes to continuously adapt to their inputs.
C57BL/6J. OMP-IRES-GEP. Ai95D (GCaMP6f) and β-arrestin2 knock out mice were obtained from Jackson Laboratory (stock number 000664, 006667, 028865, 011130). “OR-swap” mice (M72 (S50 locus) and S50 (M72 locus)) were maintained in the Bozza laboratory (and are available from Jackson Laboratory with stock numbers 006715, 006714) (Bozza et al., 2009). P2-IRES-GFP mice (P2 (P2 locus)) were obtained from the Lomvardas laboratory (and are available from Jackson Laboratory with stock number 006669) (Feinstein and Mombaerts, 2004). OMP-IRES-Cre mice were obtained from the Axel laboratory (Eggan et al., 2004). OMP-ChR2 (H134R)-Venus mice were maintained in the Bozza laboratory and obtained from the Rinberg laboratory (Li et al., 2014). Mice of either sex between 6-16 weeks-old were used for experiments. Mouse husbandry and experiments were performed following institutional and federal guidelines and were approved by Harvard Medical School's Institutional Animal Care and Use Committee.
To identify gene expression programs sensitive to ongoing activity from environmental odorants, 7 day old mice were anesthetized on ice, and one of the two nostrils was occluded by cautery, as previously described (Fischl et al., 2014). This age was chosen to minimize deficits in axon targeting that occur from manipulating activity levels at earlier post-natal timepoints (Ma et al., 2014). Unilateral naris occlusion was confirmed using a dissection microscope. After ˜1 month, mice were used for scSeq experiments. Cells isolated from occluded and open nostrils were processed and analyzed separately.
Act-Seq (Wu et al., 2017) was performed following odor exposure to quantify odor-evoked responses in each OSN subtype (defined by their expressed OR) across the entire OR repertoire. Mice were habituated to a reversed light cycle for at least for 1 week before odor exposure, transferred to a new disposable cage with regular bedding/food and kept overnight in a satellite animal facility. On the experimental day, each mouse was first transferred to a new empty disposable cage during the dark cycle and habituated for 20 minutes. Empty cages were used to avoid any activation by odorants present within the environments in which the mice were housed. Odors were added to a piece of filter paper (100 μL of odorant) in a 35 mm petri dish and a cotton ball (200 μL of odorant) and placed in each cage. After 2 hours, mice were euthanized, and dissociation was performed as described below. Odors used in this study were as follows. Dipropylene glycol (DPG, control solvent), and 10% of acetophenone, octanal, 2-hydroxy acetophenone, 4-methyl acetophenone, and methyl salicylate. As a control, mice were exposed to DPG alone for 30 min. This control condition was used to account for any drift in transcriptomes during the overnight housing and habituation periods prior to odor exposure and designed to capture the transcriptome in the state it would have been in the odor-exposed animals prior to odor exposure. The 30 min DPG control was used for all Act-Seq experiments, except following transient naris occlusion (see below); in the Act-Seq experiments our results did not depend upon the specific control condition we used, which likely reflects the relative stability of the transcriptome for each OSN subtype. Act-Seq was also performed using lower concentrations of acetophenone (1.0, 0.1, 0.01%) and 2-hydroxy acetophenone (0.1%, 0.01%). All odor dilutions were made with DPG, and all odors and DPG were obtained from Sigma. To assess the persistence of any transcriptional changes observed as a result of odor exposure, a cohort of mice was exposed to acetophenone or DPG for 2 hours, transferred to new clean regular home cages, and subjected to scSeq after an additional 22 hours.
Because odor responses in occluded OSNs cannot be examined in the chronic occlusion experiment due to permanent cautery, transient unilateral naris occlusion was performed using removable nasal plugs. Adult mice (6-12 weeks old) were anesthetized by isoflurane and removal nasal plugs were inserted into one of the two nostrils, as previously described (Galliano et al., 2021; Kass et al., 2012). The occlusion was confirmed by measuring airflow from the occluded nostril via thermocouple. After 5 days, a subset of mice was used for scSeq experiment directly (control mice) to assess the effects of naris occlusion on changes in gene expression. Under brief anesthesia by isoflurane, the remaining mice were unplugged and transferred to a new empty disposable cage containing either DPG alone or 10% acetophenone. After 2 hours, mice were subjected to scSeq. Each of the two nostrils was used separately in both control (open and plugged) and odor-exposed mice (open and unplugged). Mice whose nostrils did not experience airflow after unplugged, as assessed transcriptionally, were excluded from any analyses.
To examine environment-dependent changes in gene expression and their effects on odor-evoked responses. Act-Seq was performed 2 weeks after environment switches. Adult mice (6-12 weeks old) were group-housed in the regular home-cage environment for at least one week before the transfer to their novel environments. Individual mice were transferred to a new disposable cage containing materials for a novel environment in the satellite animal facility. Two different novel environments were used, with each containing non-overlapping contents; Environment A (paper bedding, hay, dried flowers (such as marigold), peanuts, seeds (e.g., sunflower seeds, pumpkin seeds), puffed rice), and Environment B (garden soil, aspen shavings, coconut husks, dried fruits (e.g., berries, mango, pineapple), dried vegetables (e.g., green peas, corn, bell peppers), millet, fresh fruits (banana, apple, peach), corn flakes). Mice were singly-housed throughout the entire 2 weeks and were transferred to newly prepared environmental cages every 12 hours to minimize the contributions of any murine odors, which are likely the predominant odorants present within standard home-cage housing environments. After 2 weeks, individual mice were transferred to new empty disposable cages, and Act-Seq using acetophenone was performed as above.
To assess the dynamics of gene expression changes in response to novel environments, mice were housed overnight in a new disposable cage with regular bedding/food in a satellite animal facility. On the experimental day, each mouse was first transferred to Environment A cages and subjected to scSeq after 45 minutes. 2, 4, 12, 24 hours, or 3, 5, 14 days (with cages refreshed every 12 hours for durations longer than 12 hours). To account for any non-specific effects resulting from transferring mice from the animal facility to disposable cages within the satellite facility, data from mice transferred and housed in new disposable cages with regular bedding and food overnight were used as controls for downstream analyses of the effects of Environment A on gene expression.
For β-arrestin2 knock out mice, scSeq was performed on cohorts of mice that were either housed in home cages, housed in new disposable cages with regular bedding and food overnight and then transferred to Environment A cages for 5 days, or housed in new disposable cages with regular bedding and food overnight as controls, as described above. Each of the three knock out datasets was compared to its respective dataset from wild-type mice housed in the same condition.
To determine whether OSN activation reflects salient differences between environments, one set of mice were housed in a new disposable cage with regular bedding/food overnight in a satellite animal facility and then transferred to Environment A for 5 days. On the experimental day, mice were transferred to either a new Environment A cage or a home cage for two hours before being subjected to scSeq. An additional cohort of mice was housed in the home cage. The day before the experiment, they were transferred to a new disposable cage with regular bedding/food overnight in a satellite animal facility. On the experimental day they were transferred to a new home cage for two hours before being subjected to scSeq. These three conditions were compared to mice transferred to Environment A cages for 2 hours as part of the time course described above, thus generating all four combinations of environmental switches between mice housed in Environment A and home cages.
To compare the effects of chronic odor exposure with the acute activation observed via Act-Seq, scSeq was performed after exposure to DPG alone or 0.1, 1. 10% of acetophenone (diluted by DPG) for 5 days. Adult mice were individually housed in regular home cages in a satellite animal facility overnight, then two cotton balls, each of which was soaked with 300 μL of DPG or diluted acetophenone solution, were added into each cage. The odorized cotton balls were replaced every 12 hours and cages were replaced every 2 days.
To characterize quantitative changes in gene expression after activation of OSNs. OMP-ChR2 (H134R)-Venus mice were subject to optogenetic activation and analyzed via scSeq. Mice (N=14) were anesthetized with 2% isoflurane and injected with bupivacaine (1.25 mg/kg) under the scalp. An incision was then made to expose the dorsal skull and a scalpel (Aspen Surgical 372615 Bard-Parker) was used to thin the bone overlying the right olfactory bulb, after which cyanoacrylate glue (Loctite Glass Gluc) was applied over the thinned bone. A blue LED (470 nm, Lumileds LXZI-PB01, Digikey) was soldered to a two-pin connector (Millmax, Digikey ED8450-ND) and manually placed over the bulb. A titanium head bar was then placed over the caudal end of the skull, and the whole dorsal surface was covered with dental cement (Metabond, Parkell). After the dental cement dried, a layer of black nail polish was applied to minimize the leak of blue light during stimulation. Postoperative care included a subcutaneous injection of buprenorphine SR (1 mg/kg, given 1 hour prior to surgery start) and carprofen (5 mg/kg) administered through drinking water. Before experiments, mice were tethered to a cable connecting the LED to a microcontroller (Teensy 3.2. Adafruit) that provided a custom stimulation pattern. The LED emitted 100 mW at 470 nm, measured with a light meter (Thorlabs PM100D).
For chronic activation. OSNs were activated by 5 pulses (50 msec on. 50 msec off cycle) with a 10 second inter-pulse interval for 12 hours before being subjected to scSeq. For acute activation. OSNs were activated by the same cycle as above with a 5, 10, or 20 second inter-pulse interval for 2 hours (denoted as High, Medium, and Low, respectively) and then subjected to scSeq. For both chronic and acute activation, control mice underwent the same surgery and tethering, but were implanted with a dummy connector and did not receive LED stimulation. Home cage control samples for optogenetic experiments were mice that underwent the same surgery but were subjected to scSeq directly from the home cage without any optogenetic stimulation. Because the LED was placed over the right olfactory bulb, only the right nostril was dissociated and used for scSeq for all optogenetic stimulation and control experiments.
Preparation of Single Cell Suspensions for scSeq
The main olfactory epithelium was dissected in Earle's Balanced Salt Solution (EBSS, pre-treated with carbogen for at least 5 minutes before each use, Worthington), then transferred to a round-bottomed glass dish containing 750 μL of papain solution (one vial of Papain (Worthington) dissolved in 5 mL of EBSS and then equilibrated 10 minutes at 37° C.) and 100 μL of DNase solution (one vial of DNase-I (Worthington) dissolved in 500 μL of EBSS). Bone was removed from the epithelium under a dissecting microscope and the resulting epithelial tissue was placed in a 5 mL tube (Becton Dickinson) with an additional 1.75 mL of Papain solution and 200 μL of DNase-I solution and rocked gently for-60 minutes at 37° C. The tissue was then gently triturated with a 5 mL pipette 10-15 times, passed through a 40 μm cell strainer (Becton Dickinson), and washed with 1 mL of Hibernate-A medium containing 10% Fetal Bovine Serum (FBS. GIBCO). Filtered cells were transferred to a new 5 mL tube and centrifuged 5 minutes at 300×g. The supernatant was decanted, the cells were washed once with 4 mL Hibernate-A containing 10% FBS and resuspended in 1 mL Hibernate-A containing 0.2% FBS. Importantly, each of the solutions above contained transcriptional inhibitors (5 μg/mL of Actinomycin D, 10 μg/mL of Anisomycin and 10 μM of Triptolide, all obtained from Sigma) for all experiments, especially the Act-Seq experiments, except where noted.
Dissociated cells were stained with propidium iodide (final concentration 1.65 μg/mL) and subjected to FACS to remove dead cells and obvious doublets. For fluorescent reporter-expressing mouse lines (OMP-IRES-GFP. P2 (P2 locus), M72 (S50 locus), S50 (M72 locus), OMP-ChR2 (H134R)-Venus), fluorescence positive and negative cells were sorted separately using a FITC filter. Cells were sorted into Hibernate-A containing 5% FBS (and 0.1× concentration of transcriptional inhibitors), centrifuged for 5 minutes at 300 g, and resuspended with PBS. For samples from OMP-IRES-GFP and OMP-ChR2 (H134R)-Venus mice, fluorescence positive cells were used for single cell RNA-seq (scSeq) experiments. For samples from OR lines (P2 (P2 locus), M72 (S50 locus) and S50 (M72 locus)), fluorescence positive and negative cells were combined for scSeq experiments.
Single cell RNA-seq library were prepared from the single cell suspensions via the chromium single cell gene expression system (chromium single cell 3′ reagents and GEM v2, v3 or v3.1 dual index, 10× genomics), using the default protocols provided by 10× genomics. Each replicate, other than GFP-positive ones, was loaded at a concentration predicted to yield 10,000 cells, for which the expected multiplet rate is-8.0%.
Sequencing library fragments were examined using the Agilent High Sensitivity DNA kit (Agilent) and quantified via qPCR by the KAPA library quantification kit (Roche). NextSeq and NovaSeq platforms were used for sequencing libraries. 75 cycle High output kit was used for NextSeq (Read1=26 cycle. Index (17)=8 cycle. Read2=58 cycle). Full flowcells of 100 cycle SP/S1/S2 kits, full flowcells or single lanes of 200 cycle S4 kit were used for NovaSeq (minimum read lengths: Read1=28 cycle. Index (17)=8 cycle. Read2=89 cycle for v2 and v3 kits and Read1=28 cycle. Index (17)=10 cycle. Index (15)=10 cycle. Read2=77 cycle for v3.1 dual index kits).
Demultiplexed fastq files were generated by mkfastq function in Cell Ranger software.
Generating gene expression matrix (raw UMI count) from sequencing data Demultiplexed fastq files were aligned to the mouse reference genome mm 10 (Ensembl 93) and converted into gene expression matrices using the 10× Genomics Cell Ranger software (version 2.2.0 (v2 and v3 samples) or 4.0.0 (v3.1 samples)) with two key modifications. First, despite the fact that the 10× genomic platforms have relatively high rates of intronic priming (La Manno et al., 2018). Cell Ranger by default considers multi-mapped reads that map to a single exonic locus as well as to non-exonic (e.g. intronic) locations as being uniquely mapped to that gene (and modifies the MAPQ scores accordingly), even when the read may have better alignment to the non-exonic loci. These multimapped reads (typically 510% of all reads) were filtered out by removing any read in the bam file with a MM:i:1 tag. Second, the output BAM files from Cell Ranger also contained many reads that were uniquely mapped to separate genes despite sharing the same cell barcode and unique molecular identifier (UMI), which is biologically implausible given that the cell barcode and UMI should uniquely identify each individual transcript. This could be either the result of UMI collision or misalignment. In support of the latter possibility, different reads of individual UMIs were often each uniquely aligned to multiple related genes, such as the large family of olfactory receptors (ORs). To avoid double-counting UMIs and inflating the numbers of genes. ORs, or UMIs detected in each cell, all ambiguous UMIs that mapped to multiple genes were removed from the BAM file with custom scripts, using samtools and pysam. The gene expression matrix was then recomputed by counting the number of distinct UMIs for each cell barcode for each gene.
Extracting Mature Olfactory Sensory Neurons from all Cells
Mature OSNs were identified using an iterative subclustering procedure via the Scanpy python package (Wolf et al., 2018). In each cell, the UMIs were total-count normalized and scaled by 10,000 (TPT normalized). Variable genes that were overdispersed relative to their mean were identified using the SPRING gene filtering function “filter genes” with parameters (90, 3, 10; see (Weinreb et al., 2018)). Ribosomal, mitochondrial, and OR genes were excluded from the variable genes. To identify mature OSNs, the gene expression of the set of variable genes was log-normalized, and for each gene, the residuals from linear regression models using the total number of UMIs and percent of UMIs for mitochondrial genes as predictors were then scaled via z-scoring and reduced to a smaller number of dimensions via principal component analysis (PCA). Cells were clustered using the top 35 PCs via the Leiden algorithm (resolution=1.2). Clusters containing mature OSNs were identified based on their expression of known mature OSN marker genes (e.g. Omp, Stoml3, Cnga2, Adcy3), the presence of cells expressing OR genes, as well as the absence of immature OSN marker genes or genes found in non-neuronal cells (e.g. Sox11, Gap43, Chr2, Clqb, Aqp3). A small cluster containing the non-canonical Gucy2d (GCD) and Gucylb2 expressing neurons was routinely identified but not used for any analyses in this paper. Clusters containing dying cells with low numbers of total counts, low numbers of genes, and high percentages of mitochondrial genes were removed. This clustering procedure typically revealed a cluster of OSNs that had higher than average total counts (often close to double the mean of other OSN clusters) and the majority of cells in such clusters often expressed multiple ORs; these cells are likely OSN-OSN doublets and were not considered further. Similarly, clusters containing mixtures of OSN and non-neuronal markers were also likely doublets between OSNs and other cell types and were not considered further. Using only the mature OSN clusters that passed this initial filtering step (˜60-65% of cells), the above procedure (starting from variable gene identification but using only 20 PCs) was repeated multiple times to remove any additional immature or unhealthy OSNs (with low counts or >10% of total counts in mitochondrial genes) until stable results were obtained. Importantly, although immature OSNs (e.g. Gap43+ and weakly Omp+) routinely express OR genes and other mature OSN markers, these OSNs were excluded by the conservative subclustering procedure described above so that any comparisons between cells expressing the same OR would not be confounded by any differences in OSN maturity.
The Ensembl database (Release 93) was used for OR gene annotation. Functional OR genes were identified as genes that are associated with GO term “olfactory receptor activity” (GO: 0004984); the TAAR family of receptors was also included (GO: 0001594. GO: 1990081). To identify the OR class for each mouse OR, phylogenetic analysis was performed using all mouse and zebrafish OR proteins in MEGA7 with default parameters. As previously described. OR proteins cluster into two main phylogenetic clusters (Zhang and Firestein. 2002). The ORs that clustered together with zebrafish ORs are the class I ORs and the remaining ORs are class II ORs. To compare OR protein similarity, the pairwise phylogenetic distance matrix between OR protein sequences was calculated by taking the cophenetic distance of the phylogenetic tree of all ORs. For each pair of ORs, the cophenetic distance is equal to the height of a dendrogram in which the two branches of the phylogenetic tree containing both ORs first merge into a single cluster. Unsupervised graph-based clustering of cells identified a cluster of OSNs expressing Cd36, as previously described (Oberland et al., 2015; Xavier et al., 2016). ORs were identified as Cd36-positive if the majority of OSNs expressing that OR were found within the Cd36-positive clusters. OSNs were categorized as dorsal or ventral based on their expression of known dorsal (Nqo1, Acsm4) and ventral (Ncam2, Nfix) marker genes. ORs expressed in dorsal or ventral OSNs were referred to as dorsal or ventral ORs. With the exception of three known ventral class I ORs, all other class I ORs are dorsal, whereas OSNs expressing individual class II OR expressed either dorsal or ventral marker genes. The OR genes are located in gene clusters across most chromosomes in the genome: OR genes whose transcription start sites (TSS) were within 3 megabases of any other OR gene were considered part of the same gene cluster (Monahan et al., 2017).
Of the 1172 identified functional ORs, an OR was considered expressed in any given OSN if at least 3 UMIs were detected (except for the experiments using the 10×v2 chemistry, where a threshold of 2 UMIs was used). Mature OSNs expressing single ORs were identified based on the OR that they expressed, and all cells expressing the same OR were considered as the same OSN subtype for downstream analyses. The vast majority of mature OSNs expressed a single OR (Figure S1D). OSNs expressing multiple ORs at high levels typically had higher numbers of total counts and were likely cell doublets that remained even after the previous filtering steps. OSN expressing multiple ORs with higher expression for a single OR gene also sometimes expressed other OR genes at low levels, as seen during development (Hanchate et al., 2015), or due to contamination from ambient RNA or errors in gene alignment. Regardless, any OSN expressing cither zero or multiple OR genes was not considered further for any downstream analyses. OR frequency was defined empirically based on the fraction of all mature OSNs expressing a given OR. The expression level for each OR was determined based on the normalized expression (see below) of that OR gene within the OSNs singly-expressing that OR. In the “OR-swap” mice, in which the coding sequence for each OR was knocked into the locus of the other, cells expressing swapped ORs were identified based on the gene counts for the locus rather than the expressed OR (e.g. in M72 (S50 locus) mice. OR transcripts that originated from the S50 locus indicate the expression of M72 swap cells while OSNs expressing M72 from the endogenous locus were not considered and vice versa for S50 (M72 locus) mice) due to fact that only coding sequences were swapped whereas the 3′ UTR for the original OR remain and are detected given the 3′ bias of the 10× data that arises from the use of oligo (dT) primers. The median number of UMIs/genes detected per mature OSNs and number of mature OSNs used for each replicate are summarized.
ORs were expressed at a wide range of frequencies, spanning multiple orders of magnitude (Figure S1A-B). For instance, given that the 200 OR genes with the highest frequencies are expressed in ˜50% of all OSNs, if standard algorithms like PCA are applied on all OSNs at once, they would weight ORs expressed in 100 cells 100× more than an OR expressed in a single cell, and the resulting top PCs would capture the axes of transcriptional variation that distinguished highly-expressed ORs rather than all OSN subtypes. Similarly, classifiers trained on all cells would more easily distinguish highly-expressed ORs (from their higher influence on the PCs, increased presence in training data, and higher likelihood of accurate predictions by chance), and one could achieve high levels of overall prediction accuracy even with models that were unable to distinguish among the majority of ORs. Therefore, to account for OR frequency, most analyses were performed using equal numbers of OSNs per OR by subsampling from the population of OSNs that expressed the set of ORs each detected in at least a certain number of cells (e.g. at least 4-10, as specified, depending on the analysis). This subsampling procedure was performed 1,000 times and results were summarized for each OSN subtype (as defined by the expressed OR) across the 1,000 restarts. Analyses at the OSN subtype level were performed by first averaging across all OSNs expressing the same OR within a given condition.
The filtered mature OSN datasets containing only cells expressing single ORs were renormalized, starting again from the raw UMI counts for each gene. Normalization was performed by dividing the counts for each gene in each cell by the total counts across all genes (excluding mitochondrial and ribosomal genes, as well as highly-expressed lncRNAs and lincRNAs like Malat1 and Gm42418, and sex-specific transcripts like Xist) in that cell and multiplying by 10,000 to yield tags per ten thousand (TPT) normalized data. Normalized expression in the figures and throughout refers to TPT-normalized data, where a value of 1 indicates on average 1 UMI for that gene per 10,000 UMIs. For analyses that required scaled data (e.g. PCA and classification), the logarithm of the TPT-normalized data was used (log (TPT+1)). Scaling via z-scoring was performed for each gene, using means and standard deviations identified by taking the mean of the values from 100 restarts containing equal numbers of cells per OR. The home-cage data consisted of six replicates distributed across 2 batches: separate scaling was used for each batch, but similar results were obtained by scaling both batches together. For the “OR-swap” experiments, the means and standard deviations were fit using only cells from wild-type mice collected with the same 10×v2 chemistry platform, and these were applied to the “OR-swap” and P2 (P2 locus) OSNs. In the chronic occlusion data, the means and standard deviations were recomputed using data from both nostrils. In the environment switch experiments, the gene scaling was recomputed using data from all environments.
Importantly, besides scaling and normalizing the gene expression data, no methods for batch correction or dataset integration were performed for any dataset. The mature OSNs were readily identified and separately isolated from each dataset (obviating the need for any additional alignment of cell types across datasets), and batch effects were rarely observed when comparing the normalized expression between datasets (apart from changes in the levels of mitochondrial and ribosomal genes, which were excluded from normalization procedures and downstream analyses). This analytical choice was deliberate, as data integration procedures are affected by changes in OSN composition (due to differences in the set and frequency of ORs detected across datasets and the wide range of OR frequencies), can modify the transcriptomes by averaging and leaking data across cells (which may express different ORs), and could make OSNs transcriptomes more or less similar to each other in a manner that would hinder and bias downstream comparisons between cells expressing the same OR across experimental conditions. For instance, in the Act-Seq experiments, our expectation was that a small fraction of OSNs would exhibit difference in their transcriptomes upon being activated by odorants; however, dataset integration procedures typically attempt to normalize such differences, and thus tend to make control OSNs appear more activated and odor-exposed OSNs appear less activated. Some datasets were collected with different 10× kits (v2, v3, and v3.1); as these kits differ in their sensitivities, each sample was compared with appropriate control samples collected with the same platform.
A consensus set of genes whose expression varied across mature OSNs were identified and used as the basis of most downstream analyses where transcriptomes were compared across OSNs and conditions. After using the OR genes to identify the OR expressed in each cell, the OR genes were not considered further, and were subsequently removed from all downstream analyses. Mitochondrial and ribosomal genes, and the lncRNAs mentioned above were also excluded. Using the home-cage data, the population of OSNs that expressed 831 ORs in at least 10 cells was used. To identify genes whose expression varied across cells expressing different ORs. 10 cells were selected for each OR and the F-score between the scaled gene expression and OR identity was computed for each gene, using the set of ˜15.000 genes that were expressed in at least 0.2% of OSNs. This procedure was performed 1000 times and genes that were consistently strongly significant (Benjamini-Hochberg FDR-corrected p-value≤1×10−7 for at least 90% of restarts) were considered highly-variable genes (HVGs). This procedure identified a set of 1350 non-OR genes that were used for downstream analyses. Varying thresholds, such as the number of restarts to consider a gene as a HVG, yielded different numbers of HVGs but gave consistent results with respect to other analyses (Figure S1I).
Principal component analysis (PCA) was performed using equal numbers of cells for each OR to avoid any OR having undue weights on any PC based on its frequency of expression. All versions of PCA shown used the scaled expression of the set of 1350 HVGs, and these HVGs were reduced to the top 20 PCs: however, stable results were obtained with different numbers of HVGs and PCs (Figure S1I). After fitting PCA in this manner using equal numbers of cells for each OR the fitted gene loadings (VT) were applied to the scaled HVG expression for all cells to obtain PCA scores for the remaining cells. In analyses in which this subsampling procedure was repeated across restarts. PCA was fit on the subset of cells sampled for each restart. For UMAP visualization, the results from a single restart are shown (although qualitatively similar restarts were obtained across restarts). In the “OR-swap” experiments. PCA was fit using only OSNs from wild-type mice (with equal numbers of cells per OR) and then applied to the cells from the “OR-swap” and P2 (P2 locus) mouse lines. In the chronic occlusion experiments. PCA was refit using equal numbers of cells per each OR for each nostril, and in the environment switch experiments. PCA was fit using equal cells for each OR from each of the DPG-exposed control mice from each of the three environments.
UMAP was used solely for visualization purposes (McInnes et al., 2018). To generate UMAP visualizations, the gene expression of each cell was further reduced from 20 PCs to 2 dimensions using the Uniform Manifold Approximation and Projection (UMAP) technique with parameters n_neighbors=25, min_dist=0.6, and metric=“euclidean”. As for PCA. UMAP embeddings were fit using balanced numbers of cells for each OR (and only for wild-type OSNs in the “OR-swap” experiments), and these embeddings were then used to transform to the PCA scores for all cells of a given dataset into the same UMAP space. UMAP was fit using 1000 epochs and a learning rate of 0.25 to improve the stability of this transformation step. The UMAP embeddings were refit separately using the PCA scores from each dataset, yielding embeddings that differed qualitatively in appearance yet retained similar features (dorsal, ventral, and (′d36-positive OSNs were separated, the main axis was activity-dependent, cells expressing the same OR were locally clustered). UMAP and PCA were also refit using only cells from the occluded nostrils to determine if OSN subtypes cluster based on intrinsic activity in the absence of odor-evoked activity.
To measure the similarity of cells based on their gene expression, the transcriptome distance was defined as the cosine distance (1-cosine similarity) between the PC scores of pairs of OSNs. The cosine distance is bounded between 0 (an angle of 0° indicating similar transcriptomes). 1 (an angle of 90° indicating orthogonality) and 2 (an angle of 180° indicating opposing transcriptomes).
Equal numbers of cells (10) were selected for each OR (of the 831 ORs detected in at least 10 cells in the home-cage dataset). PCA was refit based on the scaled HVG expression of these 8310 cells and the pairwise transcriptome distance matrix was then evaluated. Within-OR distances we calculated, for each OR, as the median pairwise distance between all pairs of cells sharing that OR. For between-OR distances, the median pairwise distance between all pairs of ORs was calculated. Then, since each OR is more similar and dissimilar to specific subsets of other ORs, for each OR, the distribution of between-OR distances were summarized across all pairs of ORs sharing that OR. Starting from subsampling OSNs, the within- and between-OR distances were recomputed across 1000 restarts, and the distribution of within-OR distances across ORs was compared to the distribution of pairwise distances between each OR and all other ORs. On each restart, the OR labels were also shuffled across cells, and within-OR distances were recomputed (and were close to 1 indicating that the transcriptomes of random cells were dissimilar). Similar results were also obtained using the Euclidean and Correlation distance metrics. To evaluate whether these distributions of within- and between-OR transcriptome distances remained separable (even though distances increase with increasing dimensionality) across various choices of gene sets and numbers of PCs, the Jensen-Shannon divergence was computed by comparing the discretized versions of these two distributions on each restart (with 111 evenly spaced bins between 0 and 2) and comparing the entropy of the average distribution to that of the average of the entropics of each individual distribution.
k-Nearest Neighbor Analysis
To determine whether cells sharing the same OR are clustered locally within gene expression space, a k-nearest neighbor (KNN) approach was used. For each of 1,000 restarts, using the pairwise distance matrix between cells described above, for each cell for each OR, the fraction of all the other cells that share the same OR within k neighbors was computed empirically for progressively larger groups of k neighbors. The fraction of same-OR cells found within k neighbors was averaged for each OR across restarts and was then summarized across the entire set of 831 ORs. The local clustering of cells sharing the same-OR was not apparent when shuffling the OR labels across the cells from each restart.
In all instances, classification was performed in a cross-validated manner using linear classifiers fit on equal numbers of cells for each OR. Equal numbers of cells were randomly selected for each OR (10 cells for each of the 831 ORs expressed in at least 10 OSNs in the home-cage data). For both pairwise and OR identity classification, the number of cross-validation folds was chosen to be equal to the number of subsampled cells (10) for each OR. For each fold, one cell expressing each OR was held out and the remained cells were used to train linear support vector machine (SVM) models (SVC with kernel=“linear”, decision_function_shape=“one-vs-one”, and regularization parameter C=0.03). The hyperparameters for the classification pipeline were identified via a random search, but similar results were obtained across a range of hyperparameters. The input to the classifiers was the scaled gene expression (excluding any OR, ribosomal, or mitochondrial genes), using all genes (˜13,000) that were expressed in at least 0.5% of all cells. During each cross-validation fold, feature selection was performed by keeping the top 10% of genes with the highest F-score with the OR labels of the training fold cells. Importantly, although this is the same procedure used for HVG selection described above and the 10% threshold was designed to yield a similar number of genes, because feature selection was performed in a cross-validated manner, the resulting set of 1,300 genes identified for each fold may differ slightly from the set of HVGs used for other analyses). Next, the dimensionality of these 1,300 genes was further reduced to 20 dimensions with linear transformations via PCA and Linear Discriminant Analysis (LDA), also using only the cells in the training fold. Importantly, in addition to using equal numbers of cells per OR (to make predictions at similar chance levels for each OR), the feature selection. PCA and LDA transformations, and SVM decision boundaries were all fit using only the data in the training folds and then applied to the held-out cells in the test fold to generate predictions of the expressed OR in each held-out cell. This classification pipeline therefore avoids data leakage (which could aid in the prediction of OR identities and potentially bias results) between training and test folds.
Pairwise classification was performed using the same classification pipeline. For each training fold, as described above, the gene sets. PCA and LDA transformations were fit using all cells in the training fold, and then applied to transform all held-out cells into the same 20D subspace. Then, using the training data, separate SVM models were fit for each pair of ORs (831C2=344.865 pairs) and these decisions boundaries were applied to the held-out cells of that OR pair to predict which OR within that pair each held-out cell expressed. This cross-validation procedure was repeated for each cross-validation fold, and the mean classification accuracy was computed for each OR pair. The entire pairwise classification procedure (starting from subsampling OSNs) was repeated across 1,000 restarts (yielding in total 344.865×10×1,000=3.45 billion models). The mean accuracy for each OR pair was summarized across the 1000 restarts, and the distributions of classifications accuracies were shown for the 344.865 OR pairs. The median prediction accuracy was 100% (compared to a chance accuracy of 50%), indicating that in the majority of OR pairs, the OR identity for every OSN of that OR pair was correctly predicted in each of the 1,000 restarts. The same classification procedure was performed separately for both nostrils for the 436 ORs detected in at least 10 cells in each nostril in the chronic occlusion data.
The same cross-validation and classification procedure was used to predict which OR (out of 831) was expressed in each held-out cell. “One-vs-one” SVM models were first fit on the PCA and LDA-transformed gene expression of all OR pairs in the training set and then these transformations and models were applied to each cell in the test fold: this procedure was repeated for each fold. The “decision function” from this procedure thus returned for each held-out cell the predictions from the model fitted on each pair of ORs in the respective training fold, yielding a matrix of 8310 cells×344.865 models that indicates for each of the 344.865 “one-vs-one” binary pairwise classifiers which OR within that pair was more likely to be expressed. Predictions were generated using the standard voting procedure: for each cell, for each OR, the number of times that the subset of 831 pairwise models containing that OR predicted that specific OR (rather than the other OR in each pair) as more likely to be expressed were summed. The OR with the highest number of “+1” votes (out of a maximum possible 831) was predicted as the OR that was most likely to be expressed in a given cell. Since this procedure calculated the total number of votes for each OR for each cell, it also generated the likelihood (based on the ranks of the number of “+1” votes for each OR) any given OR was expressed in any given cell. Cells whose OR identity was correctly predicted were those in which the actual OR expressed in that cell was the one that received the most votes (or on rare occasions tied for the most votes).
Given that correctly predicting the exact OR (out of 831) of each held-out cell given only 9 training cells expressing the same OR and 7470 (830×9) cells expressing other ORs is a difficult task (with chance accuracy 1/831), the frequency in which the OR expressed in each held-out cell was within the top-N % of predicted ORs was also calculated using the ranks of the number of votes for each OR described above. In this case, predictions were considered as accurate at increasing percent thresholds (e.g. 0.25%=the actual OR was within the top 2 ORs with the most votes and 1%=within the top 8 ORs out of 831). For both the exact and top-N % classification procedures, the classification accuracy for each OR was summarized across the 10 cells expressing that OR in each restart and the accuracy for each OR was then averaged across 1000 restarts yielding the average classification accuracy for each of the 831 ORs, which are shown in the figures. Across the 831 ORs, the median classification accuracy was-50%. Given the balanced number of cells per OR, classification accuracy did not depend on OR frequency (Figure S1J) and was at chance levels for classifiers fit on the same data in each restart but with shuffled OR labels (1/831=0.1% for the exact predictions and 1% for the top-1% predictions). This same OR identity classification procedure was performed separately for both nostrils for the 436 ORs detected in at least 10 cells in each nostril in the chronic occlusion data.
Classification with Different Numbers of Genes
OR identity classification was performed as described above, but the % threshold for the F-score feature selection procedure of the classification pipeline was modified to assess the classification performance with varying numbers of genes. First, the high threshold was progressively increased from 0.5 to 12.5% of the −13.000 expressed genes (compared to the default of 10%). Next, starting from the default threshold of 10%, the most informative genes were progressively excluded by increasing the low threshold in 0.5% intervals such that the top 0.5%, then the top 1%, were excluded until only genes in the top 9.5-10% remained. 10 restarts were performed for each gene threshold, and the mean accuracy across ORs was calculated.
Classification with Different Gene Sets
OR identity classification was performed as above for the different gene sets, using genes from each set that were expressed in at least 0.5% of cells. These gene sets included the set of 543 mouse transcription factors detected in the OSNs (Aibar et al., 2017), and a set of 119 axon guidance genes (defined as known cell adhesion molecules with the following
prefixes: Slit, Robo, Epha, Ephb, Efn, Nrp, Sema, Plx, Kirrel, Pcdh, Cdh, Tenm; using these prefixes was important because common GO terms for axon guidance genes also include OSN-specific transcription factors like Lhx2 and Bcl11b and other genes like ion channels that are not directly involved in axon guidance). As a comparison, classification was performed using only the top 0.75% of highly variable genes based on their F-score (rather than the default used above of 10%, thus yielding −100 rather than 1,300 genes). Optimal hyperparameters for feature selection (based on their F-score in the training folds, as before) for the percent of genes from each gene set to keep, as well as the number of dimensions for PCA and LDA and SVM regularization strength were identified via randomized searches for each gene set. Classification was also performed after removing all transcription factors and axon guidance genes (and increasing the F-score threshold to 10.5% to again use the top −1,300 genes). Models trained with only axon guidance genes or transcription factors performed substantially worse than those with the top 100 F-score genes, and excluding all axon guidance genes and transcription factors genes had little effect on classification accuracy (Figure S1M).
Pairwise and OR identity classification was performed with the same pipeline described above. Rather than just generalizing across held-out cells (which may have been from either the same or other mice), here classification was performed by leaving out all of the OSNs from single mouse, while the remaining 5 out of 6 mice were used for training to fit the classification pipeline (gene selection, PCA and LDA transformations, and the SVM decision boundaries). For each mouse, ORs found in at least 10 cells in the 5 training mice and at least 4 cells in the test mouse were used, and classification accuracies for each OR in the test mouse were evaluated across 1,000 restarts. Similar accuracies were observed when shuffling mouse identities, demonstrating that models successful generalize across mice. Generalization performance was at chance levels when trained on models with shuffled OR labels.
Classification was performed for the “OR-swap” mice using the PCs fit on the set of 1350 HVGs. The top 10 PCs were used as inputs to “one-vs-one” SVM linear classifiers trained to distinguish between M72, S50, and P2. Using the 42/18/84 cells for M72/S50/P2 in the wild-type data, the models were trained using 15-fold cross validation (subsampling 15 cells for each OR) to evaluate their ability to distinguish between wild-type OSNs expressing the three ORs. Models fit on only the 15 wild-type cells subsampled for each OR also successfully generalized to accurately predict the OR identities in the more than 500 held-out cells for each OR from the “OR-swap” and P2 (P2 locus) mouse lines. Classification accuracy was summarized across 1,000 restarts.
Classification of OR class was performed at the OSN subtype level (by taking the means of all the OSNs expressing each OR) between the 504/207/105 ventral class II, dorsal class II, and dorsal class I ORs. For each of 1,000 restarts. 80 OSN subtypes were selected for each OR type and classification was performed using 10-fold cross-validation with a pipeline that used linear SVMs (SVC with kernel=“linear”, decision_function_shape=“one-vs-one”, and regularization parameter C=0.03) fit on the top 10 PCs in each training fold to predict the OR class of the held-out ORs. Classification accuracies were at chance performance (33.3%) upon shuffling the OR type labels across OSN subtypes.
Consensus Non-Negative Matrix Factorization (cNMF)
Non-negative Matrix Factorization (NMF) was used to decompose the transcriptomes of each OSN into a smaller set of interpretable factors. NMF decomposes the gene expression matrix (cells×HVGs) into a combination of two matrices (W*H), one containing gene loadings (H) for each factor and another containing the usages (W) of these factors in each cell (
In brief, as previously described (Kotliar et al., 2019), cNMF applies NMF multiple times and, after excluding outliers (based on their distance to nearest neighbors, density_threshold=0.3), clusters the gene loadings across restarts with k-means clustering and uses the median of these clusters as the new gene loadings. In each of 20 restarts in which 10 cells were resampled for each of the 831 ORs. NMF was refit 150 times across a range of factors. A rank 18 decomposition, which balanced the tradeoff between reconstruction error (as measured by the Frobenius norm) and the consistency of the identified GEPs across restarts (as measured by the silhouette score), was used for the analyses presented here, but similar GEPs were observed with factorizations of other ranks. To match factors across restarts, the 54.000×54.000 distance matrix (18×150×20) was subject to k-means clustering using the value of k (19) that maximized the silhouette score. Clusters for stable factors should have close to 3,000 members (20×150). For each of the 19 clusters, the median loading across cluster members for each gene were used as the consensus gene loadings for that GEP. Three of these 19 clusters either appeared in few restarts or had non-zero usages only in small numbers of cells and were not considered further. For the 16 remaining GEPs, the loadings were normalized to sum to 1 for each GEP. Using this set of 16 GEPs as the new H matrix, the usages of these GEPs for each OSN were calculated by reapplying NMF while keeping the gene loadings constant (with update_H=False).
Individual GEPs may explain variation across ORs, allowing for accurate classification of OR identities, or they may explain other, independent, aspects of variation. To distinguish between these possibilities, across 10,000 restarts 10 cells were randomly subsampled for each OR, and the correlation between the mean GEP usages for cells expressing each OR (vectors of length 831 for each of the 16 GEPs) were compared between the first and second half of cells for each OR. The 10 GEPs with consistently-high correlations (R>0.9) were considered “OR-specific”, and each had higher gene loadings in a restricted set of HVGs (Figure S2A-B).
GEPs were identified based on manual inspection of genes with high loadings and high correlations with the usages for that GEP: four GEPs included known dorsal, ventral, anterior or posterior identity marker genes, while one GEP was specific to genes associated with CD36 positivity. One GEP contained genes like Ddit3 (CHOP) that are associated with ER stress and the unfolded protein response, but this GEP was not identified as OR-specific in our mature OSN dataset. GEPs that were not OR-specific (GEPs 11-16) and whose means across cells expressing the same OR are therefore not meaningful, or GEPs whose functions were unable to be identified based on their gene loadings (GEPs 8-10, which also had usages that were weakly correlated with other GEPs) were not considered for downstream analyses.
Two of the OR-specific GEPs encoded many genes associated with neural activity in OSNs (reviewed in (Wang et al., 2017)). One included S100a5, Pcp411, and Kirrel2 (which are downregulated by chronic sensory deprivation), while another included Calb2, Kirrel3, Ppp3ca (which are upregulated after chronic sensory deprivation: see
Among the identity GEPs, GEPDorsal and GEPVentral, as well as GEPAnterior and GEPPosterior also varied smoothly and in a similar mutually-exclusive manner across ORs; their differences in usages were therefor also summarized into single metrics (DV score=GEPDorsal-GEPVentral and AP score=GEPAnterior-GEPPosterior) to capture the continuous variation across these 3 independent axes (DV. AP, and ES: see Figure S2). The DV scores were distributed bimodally, and OSNs whose DV score=GEPDorsal-GEPVentral was at least 40 were considered as dorsal OSNs, yielding consistent results as the manual gene-based identification of dorsal OSNs described above.
Genes associated with each of GEPHigh and GEPLow were identified by two criteria: 1) the gene was within the top 200 genes with the highest loading for that GEP and 2) the expression of that gene was correlated with the GEP usage on OR-by-OR basis in the home cage data (the Spearman's rank correlation between the vectors of GEP means and gene means across the 831 ORs was 0.25). This approach identified 168 and 177 genes in GEPHigh and GEPLow, respectively. The potential functions/ontologies of these genes were manually annotated, and 63 and 54 “neuronal” genes whose known or proposed function in neurons were identified for the two ES GEPs. These neuronal genes were categorized according to their putative roles in either calcium homeostasis (calcium binding, calcium signaling, inositol phosphatide related) ion transfer (ion channel, ion pump/transporter), axon guidance (axon guidance, cell adhesion), synapse (synapse, secretory peptide), or protein transport (ER/golgi/cilia related). The remaining genes with high loadings and correlations with the ES GEPs had similar patterns of gene expression in OSNs as the above genes, but their functions in OSNs remain unannotated. The “neuronal” genes were further restricted to include only the “functional” gene categories that are predicted to directly impact OSN odor responses, and which included the 73 genes that were part of the calcium homeostasis, ion transfer and OR signaling categories. Of note, while many known components of OR signaling are included in this list, the expression of Ano2 and Adcy3 remained constant across OSN subtypes and did not meet the above correlation thresholds (p<0.2 for GEPHigh. GEPLow, and the ES score for both genes). Furthermore, although Omp was weakly correlated with the ES score (p=0.36), because it was not part of the HVGs it did not contribute to the usages of any GEP.
Applying cNME from Home-Cage Data to Other Data Sets
Although cNMF gene loadings were identified using the home-cage data, they were subsequently applied to other datasets to thus compare, on an OR-by-OR basis, changes in gene expression and GEP usages across experimental conditions. For all datasets, the TPT-normalized HVG expression from that dataset was scaled by the standard deviations from the home-cage data that were used when initially fitting cNMF. Using the fixed set of gene loadings (H), new usages (W) were solved by refitting NMF (update_H=False). To compare GEP usages across conditions, the mean GEP usage in that condition was calculated for each OSN subtype for all cells expressing that OR, and then differences between conditions were computed for each OSn subtype; distributions depict the change across OSN subtypes.
Changes in ES scores were evaluated by comparing the ES scores for each OSN subtype across experimental conditions. For chronic and transient occlusion ES scores from occluded nostrils were compared to those from open nostrils. For environment switches ES scores were compared for each OSN subtype between pairs of environments. For optogenetic stimulation, the change in ES scores was compared between mice housed in the home cages and those that either were or were not optogenetically-stimulated. In the chronic acetophenone experiments. ES scores were compared between mice that were chronically exposed to various concentrations of acetophenone and those that were exposed to the control solvent DPG. The changes in ES scores and z-scored functional gene expression across conditions for each OSN subtype are described. The relative ES score change for each OSN subtype was calculated by subtracting the mean across all OSN subtypes, to normalize for any global changes across conditions that were present across the population of OSNs.
The Spearman's rank correlation was also used to compare the consistency of ES scores in OSNs expressing the same OR across datasets. ORs identified in each dataset were compared, and the correlation coefficient was calculated on an OR-by-OR basis between the vectors of mean usages for each OSN subtype in each dataset (where each value is the mean ES score across all OSNs that express that OR in each dataset), On-diagonal values for pairwise correlation matrices were calculated via bootstrapping, by resampling with replacement the ES scores for each OSN subtype from the OSNs expressing that OR and taking the mean of the pairwise correlations (of the OR-by-OR ES score means) between all pairs (10,000) from 100 bootstrap restarts.
In all experiments, identity-related GEP usages were stable for each OR across conditions (p>0.95), whereas the ES GEPs and ES scores moved predictably with changes in activity. To assess how the change in ES scores related to the overall change in transcriptomes across conditions, equal numbers of cells were subsampled for each OR, and the difference in the mean ES score for each OR was correlated with the PCs of the \ HVG vs OR matrix (the difference in mean expression of each HVG between cells expressing each OR in each condition). In the chronic occlusion, optogenetic stimulation, and environment switch experiments, the changes in ES scores were strongly correlated (p>0.8) with the top PC of this matrix, and thus well aligned with the main axis of overall gene expression changes. Given that ES scores are also aligned with the top PC of the transcriptomes in the home cage, these data together indicate that activity-regulated ES genes define a main axis of transcriptional variation, and experimental manipulation causes restricted changes in gene expression along this axis.
Identifying ORs with Significant ES Score Changes Across Datasets
ORs with significant changes in their associated ES scores across datasets were identified empirically using the d-prime metric, which measured the separability of the ES score distributions for each OSN subtype. Within each OSN subtype, the d-prime statistic was calculated by taking the difference in the mean ES scores for each condition and dividing by the square-root of the average of the variance in the two conditions. This observed d-prime statistic was compared to the distribution of resulting d-prime values from permuting the condition labels across all cells of a given OSN subtype 10,000 times. P-values were calculated empirically, corrections for multiple comparisons were performed via the Benjamini-Hochberg FDR procedure, and FDR-corrected p-values≤0.01 were considered as statistically significant. ORs with significant shifts in the chronic occlusion and environment switch experiments had mean d-prime values of 5.63 and 2.50, respectively, indicating excellent separability. OSN subtypes with significant ES score changes were separated into those whose mean ES scores increased and decreases based on the sign of the difference in mean ES scores across datasets. In the environment switch experiments, each pair of environments was analyzed separately, but results are concatenated across the 3 environment pairs (and therefore in plots depicting OSN subtypes there are 3 datapoints for OSN subtypes identified in all 3 environments).
Classification of OR Identity with NMF GEPs
Pairwise OR classification and classification of OR identity was performed using the same cross-validation and balanced subsampling procedures described above. However, rather than performing gene selection and dimensionality reduction during training, the direct input to the linear SVM classifiers was the GEP usages for each OSN for either all 10 OR-specific GEPs or only the two ES GEPs, as specified. Classification was performed using the 831 ORs found in at least 10 cells in the home-cage data, the 436 ORs found in at least 10 cells in both nostrils in the chronic occlusion data, and in the 488/496/522 ORs found in at least 8 cells in home-cage, environment A and environment B datasets from the environment switch experiments. Despite using only 2 or 10 dimensions to distinguish the entire set of ORs, classification accuracies were above chance for all datasets, and at chance levels upon shuffling OR labels for each OSN.
Rather than identifying the OR expressed in each cell, linear SVM classifiers were also used to test whether OSNs expressing the same OR were distinguishable across datasets based on their GEP usages. Separate classifiers were used for each OSN subtype and k-fold cross-validation was used, with k set to be equal to the numbers of cells subsampled per condition. As expected, classification accuracy was significantly above chance for the chronic occlusion and environment switch experiments, especially for the ORs with significant ES shifts (but ORs with smaller but non-significant shifts were still discriminable at above-chance levels). Classification performance was similar using all 10 GEPs or only the 2 ES GEPs, and at chance levels when the dataset label was permuted across OSNs for each of 1,000 restarts, further demonstrating that the main change across conditions is restricted to changes in the ES score axis.
Classification of environment identity was also performed across subpopulations of OSN subtypes, using a minimum distance classification procedure. Across 1,000 restarts, six OSNs were subsampled for each OSN subtype for each environment (from the 477 subtypes present with enough cells in all environments), and five OSNs were used for training. The mean ES score for each subtype for each of the three environments was calculated (yielding three training vectors containing the mean ES scores across the population of 477 subtypes). The ES scores of the held-out cells for each subtype were similarly concatenated into three test vectors. For each of the 1,000 restarts, subpopulations of the OSN subtypes (containing 1-50 subtypes) were randomly sub-selected 1,000 times and the training vector for that subpopulation with the minimum distance to the same subpopulation from each test vector was the predicted environment label: predictions were considered accurate if the training vector with the minimum distance came from the same environment as the test vector. Accuracies were summarized across restarts, and classification performance was at chance levels when the environment labels were shuffled across training cells for each OSN subtype.
Modified ES Scores with HVG Subsets
Modified versions of the ES GEPs were constructed to assess the contributions of individual genes within the ES GEPs. Although many genes have low loadings for each GEP and a restricted subset have higher loadings, each GEP was defined based on their loadings across the entire set of 1350 HVGs. To test whether the 73 functional genes could substitute for the entire set of HVGs, the loadings for all of the remaining HVGs were set to 0, the total loadings for each GEP were rescaled to sum to 1 across the 73 functional genes, and this new gene loading (H) matrix was used to calculate updated usages (W) by reapplying NMF (with update_H=False).
The “functional” ES score, constructed again as GEPHigh-GEPLow from the new set of usages was highly correlated (p>0.99) with the ES score from all HVGs, decreased as expected upon occlusion, and could distinguish between OSNs expressing different pairs of ORs using the classification procedure described above, thus indicating that the “functional” genes and their expression can substitute for the entire set of HVGs (Figure S4). We also observed that the levels of some of the activation genes varied slightly across environments. To assess the contribution of these activation genes on ES scores and on the ES scores changes measured after environmental shifts, the loadings for the 169 activation genes that were part of the HVGs were set to 0 for all GEPs and the GEP usage and ES score was recalculated as described above. The ES scores and changes in ES scores calculated using all HVGs or excluding the activation genes were highly correlated (p>0.99), indicating that activation genes (whose expression primarily changes during periods of acute activation) do not account for the significant shifts in ES scores observed across environments.
Pseudotime analysis was performed on the immature OSNs identified from mice housed in the home cages. The Leiden clustering procedure that was used to identify mature OSNs also identified clusters of immature OSNs identified, which were used for pseudotime analysis. Diffusion pscudotime (n_des=10) was calculated in Scanpy on the nearest neighbor graph (n_pcs=10, n_neighbors=10), and normalized to the lowest pseudotime values of immature OSNs that had undergone OR choice and were expressing ORs. To compare gene expression patterns and GEP usages in immature OSNs expressing given ORs with those of their respective mature OSN subtype, only immature OSNs that had chosen and expressed ORs were used. Gene expression and GEP usage as a function of pseudotime were evaluated using linear generalized additive models (GAMs) in pyGAM. A linear GAM was fit using 15 basis splines (lambda=1) for individual genes and fit using 50 splines (lambda=5) for GEP usages, and the mean and 95% confidence intervals of the mean were evaluated across a grid of 500 evenly-spaced pseudotime values. The mature ES scores across OSN subtypes were discretized into five quantiles, and separate GAMs were fit using the immature OSNs expressing ORs from each mature ES score quantile.
Given the wide variation in gene expression across OSNs in the home-cage condition, differential expression testing was performed at the OR-by-OR level to identify genes that consistently changed across multiple OSN subtypes. Differential expression (DE) testing was performed using ORs found in at least 6 cells in each dataset to balance the trade-off between the number of OSN subtypes and within-subtype variance. Testing was performed via the Wilcoxon signed-rank test (comparing, for each gene, the differences between means for each OSN subtype across conditions). P-values were corrected for multiple comparisons across genes via the Benjamini-Hochberg FDR procedure.
For chronic occlusion, the DE genes with the most significant changes were defined as those among the top 500 genes with the lowest p-values that had consistent changes across ORs (same sign of change in at least 50% of the 673 OSN subtypes detected in at least 6 cells in each nostril and absolute value of the median log 2-fold change across OSN subtypes of at least 0.5); however, consistent results were obtained across a wide range of thresholds given that most DE genes changed in the majority of the OSN subtypes. As expected, genes that increased or decreased during occlusion had high loadings for GEPLow and GEPHigh,
For each odor, the same DE testing procedure was performed separately for activated and non-activated OSN subtypes (as defined based on the percentage of activated cells expressing that OR; see below). To exclude the small subsets of genes that changed indiscriminately across all OSN subtypes regardless of whether they had been activated by odor, genes whose p-value was higher for non-activated ORs or those whose mean change in expression across OSN subtypes was less than 1.5 times higher in the activated ORs than non-activated ORs were removed and not considered for downstream analyses.
OSN subtypes that were responsive to odorants were identified via a two-step process. First, individual OSNs (as defined by the expressed OR) were determined as activated based on their mean z-scored immediate early genes (IEG) expression. A set of 10 IEGs were used (Btg2, Egr1, Fos, Fosb, Gm13889, Junb, Nr4a1, Nr4a2, Pcdh10, and Srxn1). These genes were reliably and strongly induced in odor-exposed mice and, unlike many of the activity-dependent genes that are part of the ES GEPs and whose expression varies widely across OSNs expressing different ORs, these IEGs had consistently low expression across OSNs in control mice (average expression −0.15 in the control conditions and average log 2 fold-change of −5 in the odor conditions). The set of 10 IEGs was used since even though all of the IEGs were specific to acute activation by odors, they were still only detected in a minority of cells due to the sparse nature of scSeq. The means and standard deviations for IEG z-scoring were fit on the TPT-normalized IEG expression (without the log transformation due to the sparsity of expression) across 100 restarts containing equal numbers of subsampled cells for each OR from mice that were exposed to either acetophenone or octanal. OSNs whose mean z-scored expression was above 0.2 (−15% of OSNs in odor-exposed mice but less than 2% of OSNs in control mice exposed to the DPG solvent alone).
Second, as expected, the distribution of the percent of OSNs per OSN subtype was bimodal, with a majority of OSN subtypes showing low percentages of activated OSNs with a smaller subset showing high percentages of activated cells. Using the ORs detected in at least 4 cells in both control and odor-exposed conditions, activated ORs were defined as those in which less than 20% of cells expressing that OR were activated in the control conditions and more than 70% of cells were activated in the odor-exposed conditions, with the additional requirement of non-zero expression of at least 4 IEGs (to avoid false positives from OSNs activated by semiochemicals in the empty cages used for Act-Seq or those that strongly induced a small number of IEGs). Given that the set of ORs and the number of cells expressing each OR differed across datasets, odor-responsive OSN subtypes were re-evaluated for each dataset. Overall, odor-responsive OSN subtypes that were detected in multiple datasets were consistently identified as odor-responsive, though a small fraction of them (and thus their expressed ORs) were identified as activated in some datasets but only partially activated in others. These results are likely due to both the consequences of experimental manipulations, which were designed to affect odor responses, as well as the relatively conservative thresholds used above to identify ORs as activated.
For the environment switch experiments and the Act-Seq experiments using the four acetophenone-related odors, activated ORs were identified as described above, except using an IEG threshold of −0.05 because of the higher levels of IEG expression in the 10×v3.1 data. For the acetophenone concentration series experiment, activated ORs were either identified for each concentration, or the ORs identified as activated at 10% were also examined at lower concentrations, as specified in the text. In the optogenetic stimulation experiments, no IEG thresholds were used to identify activated cells, and instead all dorsal OSN subtypes were used for downstream analyses. In the chronic acetophenone exposure experiments, the set of acetophenone-responsive ORs identified via the acute Act-Seq experiments were used.
Although IEG induction could identify activated OSNs in a binary manner, the sparse nature of the IEGs made them less suitable for measuring differences in the degree of activation. Therefore, a larger set of odor-responsive genes (from the DE genes identified above) were used to construct an analog metric for activation and therefore compare
differences in the amount of activation across OSN subtypes (as defined by their expressed ORs). The set of “activation” genes used for this metric satisfied the following criteria 1) significant responses (FDR≤1×10−3) to both acetophenone and octanal, and 2) average TPT-expression of at least 0.8 in activated ORs (such that the metric wouldn't be biased by the presence of zeros). Acetophenone and octanal induced similar changes in gene expression in their respective responsive OSN subtypes. Of the ˜500 genes passing these criteria, the small number of genes that had high (p>0.4) correlations with the ES score in the control mice were removed, leaving a set of 472 “activation” genes that were used for all Act-Seq experiments.
Consistent with activation score genes generally not being differentially used in the home cage, the majority of these activation genes were not part of the 1350 HVGs in the home-cage data, and thus do not contribute to any GEP usages. The remaining 169 activation genes that were also considered as HVGs often had weak loadings across multiple GEPs. Only 16 of these 169 genes had high loadings for GEPHigh, and removing all activation genes from the HVGs and recomputing GEP usages yielded similar results. Together, these results indicate that activation gene expression is specific to the acute effects of odor exposure and does not contribute significantly to baseline variation in transcriptional identities or ES scores across OSN subtypes. To summarize the effects of odor activation across this set of activation genes, the TPT-expression of these activation genes were z-scored (using means and standard deviations for these genes in the control, acetophenone, and octanal datasets, identified by subsampling equal numbers of cells across all ORs expressed in at least 6 cells across 100 restarts). Next, using the combined set of activated acetophenone- and octanal-responsive OSN subtypes. PCA was performed on the differences in z-scored activation gene expression between control and odor conditions. The top PC of this activation gene delta matrix was kept, and the activation score was defined as the product of the weights for this top PC and the difference in z-scored activation gene expression between odor and control conditions.
Thus, the activation score summarizes for each OSN subtype the overall movement of the activation genes between control and odor conditions along this main activation axis. Although the activation score weights were fit using the combined set of acetophenone- and octanal-responsive OSN subtypes expressed in at least 6 cells for stability purposes, all subsequent Act-Seq results are shown using the larger sets of ORs detected in at least 4 cells in both control and odor conditions. To obtain activation scores for OSN subtypes from other datasets (such as those in the transient occlusion and environment switch experiments), the activation score was evaluated in a similar manner by multiplying the activation PC weights and the delta activation gene expression (odor-control) for each OSN subtype. Only activated OSNs from the odor-exposed mice were used for calculating the activation score. The weights were kept constant across experiments, but three separate versions of the gene scaling was used for the ACE and OCT Act-Seq experiments, the transient occlusion experiments, and all other Act-Seq and environment switch experiments, respectively, to account for changes in the gene means due to differences in 10× kit versions and sequencing depth across these datasets. In all experiments, activation scores were higher in odor-exposed and odor-responsive OSN subtypes. Across replicates from the same condition. OSN subtypes had consistent levels of activation, demonstrating that the activation score successfully captures differences in the amount of activation in each condition across OSNs expressing different ORs.
Two additional complementary methods were used to identify activated cells and measure activation in the ACE and OCT Act-Seq experiments. First unsupervised graph-based Leiden clustering (resolution=0.6) was fit on the top 20 PCs from the scaled HVGs (where PCA was fit using equal numbers of cells from control, acetophenone, and octanal mice and applied to all cells). A single cluster bore the hallmarks of activated cells, and primarily contained cells from the odor condition with increased IEG expression. Additionally, nearly all of the OSNs from odor-exposed mice that expressed activated ORs, as identified via the IEG approach described above, were part of this activated cluster.
Second, activated ORs and weights on genes affected by odor exposure were simultaneously identified via unsupervised tensor component analysis (TCA). Nonnegative tensor decomposition was fit using the hierarchical alternating least-squares algorithm (ncp_hals) in the tensor tools python package (Williams et al., 2018), and the tensor of conditions×activation genes×OSN subtypes was reduced with a rank 4 decomposition into 4 rank-1 tensors. One of these tensors represented activation and had 1) high weights in the odor but not control condition. 2) high weights for activation genes affected by acute odor exposure (like Pcdh10 and Fos), as well as 3) high weights for the odor-responsive OSN subtypes identified above based on their IEG expressions: furthermore, the TCA weights for the activation factor each OSN subtype was correlated with the mean activation score for that subtype (p>0.85 for both ACE and OCT).
ES scores were compared to the activation scores for each OSN subtype to test how OSN transcriptional variation affects odor responses. All correlations were calculated between the ES scores in control mice and the activation scores from separate odor-exposed mice, using the set of odor-responsive OSN subtypes detected in a least 4 cells in both control and odor conditions. Comparisons were made at the OSN subtype level, between the mean ES score from the control condition and that OSN subtype's activation score. When comparing the change in ES score and change in activation scores across datasets, only OSN subtypes present in at least 4 cells in both control and odor conditions for each dataset being compared were used.
Predictions of Activation Scores from Control ES Scores and Functional Gene Expression Via Linear Regression Models
Cross-validated linear regression models were constructed using the TPT-normalized expression of the functional genes in the control mice as predictors of the activation score in odor-exposed mice. Regularization was performed using elastic-net regularized linear models, where the optimal hyperparameters (alpha, the 11-ratio, and standardization of regressors) were identified through grid searches for each dataset. 5-fold cross-validation was used, and model performance was evaluated via the mean squared error (MSE) between predicated and observed activation scores from each test fold. This cross-validation procedure was repeated 1,000 times and the distribution of observed MSEs was compared to the predictions obtained when the same models were trained and tested on data in which the activation scores were permuted across OSN subtypes: this shuffling procedure represents performance similar to that of null models that predict the mean activation score for all subtypes. Reported errors were normalized as a percentage of the observed data to facilitate comparisons across datasets. For a subset of datasets, the predictions of the activation score from the functional genes were also compared to predictions from ordinary least squares (OLS) linear regression models that used the control ES score as a single predictor. Predictions of the changes in activation scores were evaluated in a similar way to those of the activation score, except the changes in functional gene expression for each OSN subtype were used as predictors. In addition to shuffling the change in activation score across OSN subtypes, these models were also evaluated on data where the condition label was permuted across OSNs for each OSN subtype before calculating the changes in the functional gene expression for that subtype.
Classification of Odors and Environments from Activation Scores
Classification of odor identity for the acetophenone analogs based on activation scores was performed using a minimum distance procedure in a similar manner to classification of environmental identity via ES scores. The input was the set of 106 OSN subtypes whose ORs were activated by at least one of the four acetophenone-related odors and were detected in at least four OSNs for each odor (since few ORs were present and activated in enough cells across all four datasets: but see below). Across 1,000 restarts, three OSNs per odor were used for training and the fourth was used for testing. The mean activation score for each subtype for each odor was calculated. For each of the 1,000 restarts, subpopulations of the OSN subtypes were randomly sub-selected 1,000 times and predictions were considered accurate if, across the subpopulation of activation scores, the training vector with the minimum distance for each test vector shared the same odor label. Predictions were at chance levels upon shuffling odor labels across cells for each OSN subtype. The same classification procedure was also used for each pair of acetophenone odors, but the input was the set of ORs activated by both analogs for each pair to determine if the pattern of activation scores across OSN subtypes can distinguish between odors, even for ORs activated by both odors.
Classification of environments using OSN activation scores after odor exposure as predictors was performed using cross-validated linear SVM classifiers. Activation scores were computed for each OSN (rather than the OSN subtype as described above) by applying the activation gene weights to the difference in expression between the activation gene expression in each cell in the odor condition and that of the mean of its respective OSN subtype in the control condition. Classification was performed as before by subsampling equal numbers of cells per OR on each of 1,000 restarts, and prediction accuracies were at chance levels when the OR labels were shuffled across OR pairs or the environment pair labels were shuffled across the OSNs expression each OR.
The set of activated ORs identified via Act-Seq were compared to those identified via the DREAM and phospho-S6 immunoprecipitation methods (Jiang et al., 2015; von der Weid et al., 2015). In vitro EC50 values to acetophenone for its receptors were obtained from (Jiang et al., 2015; Saito et al., 2009). Only ORs from these prior papers that were also identified in the Act-Seq experiments described above were used for comparisons. The 717 ORs detected in four or more OSNs in both DPG and acetophenone-exposed animals included 16/22 DREAM and 25/49 phospho-S6 ACE ORs. ORs that were not identified by Act-Seq were also not identified in more than 4 OSNs in these experiments, indicating that these receptors might be “missing” because they are expressed in particularly small numbers of OSNs: note that this observation raises the possibility that these receptors are also near the noise floor in the DREAM and pS6 methods, since these approaches rely on bulk sequencing approaches that are also likely affected by OR frequency. In contrast to the relationship observed between the control ES score and activation scores, there was no relationship between in vitro EC50 values and activation scores from any concentration of ACE and OLS linear regression models were unable to predict activation scores from the EC50 values alone.
OSN specific activation scores were measured after mice were shifted from the home-cage to environment A (envA). The activation score at these timepoints was then compared to the two-week changes in the ES score (as measured as the difference in the ES score between home-cage and env-housed mice in the environment switch experiments). The activation score was calculated by applying the activation gene weights to the activation gene deltas between mice from each timepoint and control mice housed overnight. The activation scores were normalized by subtracting the mean activation score across OSN subtypes, to account for any non-specific changes in activation across the entire population (e.g. due to changes in odor sampling that may occur upon the transfer to new environment cages). The resulting “relative activation scores” were compared between OSN subtypes with significant increases and decreases in their ES scores after two weeks in envA, for subtypes present in at least 4 cells in all 4 conditions (home-cage and envA after two-weeks, as well as in the overnight control data and in the envA two hour dataset). As expected. OSN subtypes with significant increases/decreases in their ES scores after two weeks showed increased/decreased activation (relative to the mean) at each timepoint. In contrast, the opposite effects were observed for mice switched from envA back to home cages, and little activation was observed between mice switched to cages of the same environment.
The correlation between the activation scores at two hours and the two-week ES score shifts was evaluated using all OSN subtypes, and the two-week ES score shifts were also compared to the change in the ES score after 24 hours. Predictions of the sign of the two-week ES score based on the activation score at each timepoint was performed using 10-fold cross-validated linear SVMs (fit at the OSN subtype level using a random subset of 80 of the OSN subtypes whose ES scores significantly increased and 80 that decreased after two-weeks for classification on each of 1,000 restarts). Classification accuracy was consistently above chance levels but was at chance levels upon shuffling the sign of the two-week ES score change for each restart.
All surgical procedures were performed in accordance with the guidelines provided by IACUC at Harvard Medical School. Prior to surgery. 4-6 weeks old Omp-IRES-Cre: Ai951) (GCaMP6f) mice, were subcutaneously injected with the analgesic Buprenorphine-SR (1 mg/kg) and anesthetized 1-2 hours later with isoflurane (3% induction. 1-2% maintenance). The skin overlying the left olfactory bulb was retracted and the underlying periosteum was scraped off with a scalpel. The skull was subsequently cleaned with saline and allowed to dry. A 2.5 mm circular craniotomy was made using a biopsy punch (Integra Miltex) taking care not to damage the underlying dura. Gelfoam saturated with cold saline was placed over the craniotomy to reduce swelling and suppress minor bleeds. A custom cranial window was constructed using two circular coverslips (Warner Instruments, 3 mm. #1 thickness). The first coverslip was ground down to 2.5 mm in diameter using a diamond scribe and subsequently bonded to the second 3 mm coverslip using Norland 71 optical epoxy. With the smaller coverslip in contact with the dura, the exposed edges of the 3 mm coverslip were gently depressed until flush with the surrounding skull and secured in place with glass glue (Loctite. P.N. 233841). This “top hat” design ensured a tight seal around the craniotomy. For head-fixation, a custom titanium head plate was positioned over the skull and, using a manual manipulator, aligned to be coplanar with the cranial window. Following alignment, the head-plate was secured to the skull using dental cement (C&B Metabond. Parkell) and the animal was returned to the home cage for recovery. The supplementary analgesic Carprofen (5 mg/kg) was provided in drinking water for 4 days following surgery. Throughout the 1-2 weeks prior to chronic imaging, the craniotomy was regularly inspected for any changes in clarity. Animals with excessive overgrowth of dura and skull tissue were not imaged and were removed from the study. In order to keep the cranial window clear of debris between imaging sessions, the dorsal surface of the skull was covered with silicone sealant (Kwik-Cast. World Precision Instruments).
A 23-valve olfactometer was used to present odors, as previously described (Island Motion Systems, see (Pashkovski et al., 2020)). In brief, custom Arduino software was used to control valve opening and closing, thereby enabling switching between odor vials and the blank vial. This software also controlled the output of two mass flow controllers (MFC). The first MFC delivered a constant carrier flow at 0.75 L min−1 of purified air into a common channel: the second MFC supplied a constant flow at 0.25 L min−1 of clean air that was injected into an odor vial (see below) and then merged with the carrier flow 1 inch (2.54 cm) in front of the mouse's nose. A larger exhaust fan drew air from the cage enclosing the imaging rig to prevent any cross-contamination between odors. Monomolecular odors were diluted in Mineral oil (Sigma), and this vapor-phase concentration was further diluted 1:4 by the carrier airflow. Odor presentations lasted for two seconds and were interleaved by 30 seconds of blank (Mineral oil) delivery. The order of presentation of odors was pseudo-randomized for each experiment, such that on any given trial, odors were presented once in no predictable order.
Chronic imaging of the dorsal surface of the olfactory bulb was performed after recovery from the surgery (1-2 weeks) under ketamine (100 mg/kg) and xylazine (10 mg/kg) induced anesthesia, to specifically isolate responses in OSN axons with less modulation from local interneurons and long-range cortical feedback. Depth of anesthesia was assessed every 15 minutes via the toc-pinch reflex and intraperitoneal ketamine was administered at 1/3 concentration of the induction dose for anesthesia maintenance. Multiphoton imaging was performed at 60 Hz using a 16 kHz resonant galvo scanner (Cambridge Technologies) with a chameleon laser tuned to 920 nm delivered to 30-50 mW of excited power at the front end of the objective (10× Olympus XLUMPLFL10XW-SP lens. NA 0.6), as previously described (Pashkovski et al. 2020). Emitted fluorescence was detected using Hamamatsu H10770PA-40 PMTs. Scanimage 5 was used for hardware control and image data acquisition: WaveSurfer was used for recording of acquisition, odor and frame triggers, as well as sniffing data in a subset of mice. To obtain consistent field of views across sessions. 0.5 mg of Texas-Red conjugated Dextran (3000 MW. D3328 Thermo Fisher) dye was injected subcutaneously, and the field of view was aligned using both red and green channels based on patterns of vasculature and glomeruli, respectively (using the first session as a template). Animals were monitored throughout each imaging session for any drift in the field of view across trials.
Initial experiments consisted of a cohort of four mice imaged across multiple environment shifts. Each imaging session consisted of at least 8 trials, in which each of the 7 odors were presented. The odors used included 3 odors that activate broad sets of the dorsal glomeruli and which were presented at both a high/low concentration (pentanone (25/2.5%), ethyl butyrate (2.5/0.25%), and pentanal (25/2.5%)) as well as a 7th blank odorant (Mineral oil). When mice were not being imaged, they were housed in either regular home-cage or Environment A (envA) cages, consisting of the same contents as used for the scSeq experiments. Throughout the course of the experiment, mice were sequentially housed in the regular home cages (home1), envA cages (envA), and then regular home cages again (home2), 4-6 sessions were obtained for home1, 4-6 sessions were obtained for envA (across 14 days of housing in envA), and 2-3 sessions were obtained for home2 (across an additional 14 days of housing in the regular home-cage environments).
In an additional cohort of two mice, the responses to a broader panel of 16 odors (to facilitate decoding analyses) that densely activated the dorsal olfactory bulb were imaged. Each session consisted of at least 6 trials, in which a blank odor and each of the 16 odors (2.5% of Pentanal, Pentanone, Ethyl butyrate, Heptanal, Hexanal, Propanal, Butanal, Butanone, Hexanone, Ethyl acetate, Propyl acetate, Butyl acetate, Benzaldehyde, Tiglaldehyde, Methyl tiglate, Methyl valerate, all of which were obtained from Sigma except for Methyl tiglate, which was obtained from Thermo Fisher) were presented pseudo-randomly. Throughout the course of the experiment, mice were sequentially housed in the regular home cages (home-cage) and envA cages (envA).
For alignment of the FOV across sessions, baseline-subtracted images were downsampled (from 60 Hz to 10 Hz) and the mean image from each session was used to register the field of view across imaging sessions. Registration across sessions was performed using an iterative approach to align each session to a common template session. First, the optimal geometric transformation that maximized the overall cross correlation between mean images was identified using OpenCV (findTransformECC with warp_mode=MOTION_HOMOGRAPHY and gaussFiltSize=1). Next, local deformations were further aligned with Dipy (SymmetricDiffcomorphicRegistration with metric-CCMetric, level_iters=[250, 100, 50, 25], and inv_iter=50). The resulting procedure gave an invertible warping for each session that was subsequently applied to all frames from that session. Throughout a session, the field of view was stable, and no motion correction was applied: therefore, after registration, glomeruli were stably aligned across all trials and sessions. The z-depth of the image plane was aligned for every imaging session based on the specific pattern of vasculature and glomeruli found in each mouse, and this single plane was imaged throughout each session. Figure S7B shows that FOVs are stable across sessions and environments, as assessed by the pairwise enhanced correlation coefficient between mean images of the entire FOVs (computeECC function in OpenCV). Regions of interest (ROIs, glomeruli) were identified via Suite2p (skipping the registration and deconvolution steps) from the entire set of registered baseline-subtracted and down-sampled images from all sessions (Pachitariu et al., 2017). ROIs that covered multiple glomeruli or masks glomeruli split into multiple ROIs were manually corrected. Using the set of identified ROIs, fluorescence traces were manually extracted from the raw movies from each session by applying the session-specific warping described above to each movie and then taking the dot product of the fluorescence in the aligned movies with the pixel weights of each ROI mask. Neuropil subtraction was not performed since annuli from each glomerulus would overlap with other glomeruli and there was little background fluorescence outside of activated glomeruli.
For each trial and glomerulus, GCaMP signals were baseline-subtracted and normalized relative to their baseline (dF/F). The 10th percentile of the ten seconds prior to each odor presentation was considered as the baseline, dF/F values were interpolated to 50 Hz and smoothed by convolving with a gaussian filter (length=21, standard deviation=5). Because of the long duration of the initial cohort, in which mice were imaged across multiple environment shifts over more than a month, dF/F traces were z-scored for each trial: dF/F values were not z-scored for the later cohort, which was imaged for a shorter duration across a single environment shift.
Responsive glomeruli were identified separately for each environment. Responsive glomeruli were defined as those whose mean response from 0.0 to 3.0 seconds after odor valve onset was significantly higher than that of the prior 10 seconds (p≤1×10−3, via the Wilcoxon signed-rank test and adjusted via the Holm-Bonferroni method across odors for each glomerulus) and whose mean z-scored dF/F was greater than 0.5 (or mean dF/F was greater than 0.125 for the second cohort).
Responses were aligned across trials using the population mean to account for the variability in response onset present because odors were given in open loop without controlling for sniffing. The set of glomeruli that responded in at least two environments with a positive mean odor response were used for aligning odor responses across trials. The population mean was computed across these responsive glomeruli for each trial and the population onset was defined as the first timepoint in which the population exceeded a threshold (0.75 for z-scored dF/F data and 0.25 for dF/F); traces were aligned across trials by shifting them based on the identified population onset. This alignment procedure increased the correlation of the odor response across trials for each odor, indicating that responses are more reliable once aligned to population, rather than odor, onset. Sniffing was also recorded with a nasal pressure sensor (Honeywell AWM3100V) in a subset of mice to further assess the performance of this alignment procedure. Sniffing signals were uncorrelated across trials when using the odor valve times but were correlated across trials when using the shifts identified via the population onset. After alignment, the population response reliably increased following inspiration, further indicating that this alignment procedure helps to correct to variability in timing of odor responses across trials due to sniffing. After alignment relative to the population mean, responsive glomeruli were re-identified used the same thresholds and procedure described above, except now using the first 3 seconds after population onset, and responsive glomeruli kept for downstream analyses were those that were identified as responsive in at least two environments, and whose response to a given odor was a least twice that of the response to the blank.
For the responsive glomeruli for each odor, the mean response amplitudes for each trial (the mean dF/F or z-scored dF/F values in the first 3 seconds following population onset) were averaged across trials within each environment. Glomeruli whose odor response amplitude differed across environment pairs were identified via permutation testing. The observed differences in means across environments was compared to a null distribution calculated by following the same procedure after permuting environment labels across trials 100,000 times. P-values were calculated empirically for each glomerulus-odor pair and were adjusted pairs by the Benjamini-Hochberg FDR procedure. Pairs whose FDR-corrected p-value was ≤0.01 were considered as significantly different across environments. Across mice and odors. 20-80% of responsive glomeruli demonstrated significant shifts in response amplitudes across the shift from home-cage to envA. In contrast, fewer glomerular-odor pairs were significantly different between home1 and home2. Glomerular responses were considered as reverted if the response amplitudes in home2 were more similar to home1 than envA. While one would naively expect reversions via regression to the mean if data were independent, no glomeruli were identified as environmentally sensitive when environmental labels were shuffled before calculating the observed difference, indicating that the observed differences in response amplitudes between home1 and envA (as well as those between envA and home2) are not the results of spurious partitioning of responses. Together, these results indicate that a large fraction of glomeruli responsive to a given odor are stable within environments but vary across environments.
Population odor responses were analyzed using pseudo-populations of glomeruli imaged across mice. Pentanal elicited the broadest responses across the population of glomeruli and was therefore used to assess how the population response differed between environments. To test whether differences at the population level in the pentanal responses in envA compared to home1 were reverted in home2, an “environmental axis” that separated the home1 and envA responses was constructed. For each mouse, two testing sessions were held out for both home1 and envA, and the remaining sessions were used for training, and the response at each timepoint in these training sessions was averaged across all training trials. PCA was fit on the population responses in the training data to reduce the dimensionality of the glomerular population to 10 dimensions, and the mean response was calculated in 30 overlapping windows in time (each 2 seconds long starting from −0.2 s and each subsequent one offset by 0.05 s from odor onset). The differences across the 10 PCs between the response in envA and home1 was calculated for each window, and the weights were averaged across the 30 windows and then normalized by their L-2 norm. This procedure thus returns the projection vector designed to maximally separate the odor responses in the two environments. To test whether this projection generalized to held-out sessions, the PCA transformation was applied to the mean response in the held-out sessions for home1 and envA, as well as to the home2 sessions. The difference between the population responses along this environment axis between the held-out home1 and envA session sessions was compared to the difference between the home2 and held-out home1 sessions. This cross-validation procedure was repeated across 1,000 restarts, and the mean and standard deviations across restarts were evaluated. Across all restarts, the response in envA readily diverged after odor onset, whereas the response in home2 remained similar to that in home1 at all timepoints after odor onset.
Pairwise Odor Correlations within and Across Environments
Pairwise correlations for each odor pair were calculated for each environment using the population vectors of mean odor responses (averaged across the first 3 seconds following population onset). For each odor pair, the union of responsive glomeruli was used. The pairwise correlation distance between all trials for each odor was computed and the median across trial pairs was computed for each odor pair. This same procedure also applied to pairs of the same odor, and the on-diagonal of the resulting correlation matrix represents the median across trials pairs and demonstrates the stability of the population response to the same odor across trials. The difference in correlation matrices across environments was computed, and the observed difference in the mean absolute change in pairwise correlations of the lower triangle (excluding the diagonal) was compared to a null distribution of means from shuffling the environment identity of each trial across 10,000 shuffles. For both mice, the observed difference was greater than that observed in any shuffle. Correlation matrices are shown after hierarchical clustering odors using Ward's method.
Classification of environment identity was performed for each odor separately using the responsive glomeruli for each odor. Classification was performed using pseudo-populations of glomeruli combined across mice, using the mean response for each glomerulus across the first 3 seconds following population onset. For each of 100 restarts. 18 trials were randomly chosen for each mouse for each environment and were split into 12 training and 6 test trials. Glomerular responses were permuted across trials within each mouse and then concatenated across mice to generate pseudo-trials for model training and testing. On each restart, subpopulations of glomeruli were randomly selected 100 times, thus yielding 10,000 models. Standardization across trials for each glomerulus via z-scoring and linear SVM classifiers (regularization parameter C=0.1) were both fit on the training data and applied to predict the environment label of the held-out test pseudo-trials, and the accuracy was summarized for each subpopulation across restarts. Performance was a chance levels upon shuffling the environment labels of the training and test data.
Odor identity was predicted in a similar manner, using the mean responses across the first 3 seconds following population onset in pseudo-populations of glomeruli. For each of 100 restarts, for each environment, 6 test trials (containing the responses to each of the 16 odors) were randomly held out, and the glomeruli were randomly permuted across test trials 10 times to generate pseudo-populations for 60 test pseudo-trials for each of the 16 odors. The remaining training trials were randomly subsampled from each environment to yield 24 training trials for each odor. On each restart, subpopulations of glomeruli were randomly selected 20 times, thus yielding 2.000 (20×100) models for each threshold. A classification pipeline was fit on the training data, consisting of standardization across trials for each glomerulus via z-scoring, reducing the dimensionality via PCA (n_components=5 when the number of glomeruli was <10, else n_components=10), and fitting linear SVM classifiers (“one-vs-one” configuration with regularization parameter C=0.5), and then applied to the test data. The training data from each environment and the test data from held-out trials from each environment were each used for training and testing, thus generating all four combinations (2×2) of training and testing data from either the same or different environments. Classification performance was summarized across restarts for each threshold for each of the within-environment and between-environment training/testing combinations and compared to 1) performance when environment labels were shuffled across the training trials and across the test trials from each environment and 2) performance when the odor labels were shuffled. Classification performance quickly saturated with increasing number of glomeruli, was higher for held-out trials from the same environment, and was at chance performance when odor labels were shuffled.
Odor generalization was performed to determine how the changes in pairwise odor correlations observed across environments impacted odor coding. The same pseudo-populations were constructed for training and test trials from each environment as described above for odor decoding. The entire pseudo-population of glomeruli was used without subsampling. The training and test data was first transformed with the z-scoring and PCA transformation (n_components=10) described above and fit on the training data. However, rather than fitting a single classifier to all trials from all odors, a separate classifier was fit on each odor pair in the training data (16C2=120 pairs) and applied to the remaining odor pairs (119) in the test data, thus yielding 120×119=14.280 predictions. This same procedure was repeated for each of the four within- and between-environment training/testing combinations, and the entire procedure was repeated when the environment labels were shuffled across the training trials and across the test trials from each environment. Classification accuracy from classifiers trained and tested on different odor pairs from the same environment (within-environment) was sorted across the 14.280 pairs and the accuracy for each pair in the within-environment condition was compared to that of the between-environment and shuffled-environment conditions. Although the overall distribution of classification accuracies was similar for each of the three conditions, the mean absolute difference between within-environment and between-environment accuracies for each pair was greater for the observed, rather than the shuffled environmental labels, further demonstrating that pairwise relationships between odors are altered across environments.
Single-cell analyses were performed in python (versions 3.6-3.8) using the Scanpy package (Wolf et al., 2018) as well as custom-written scripts using the open-source python scientific stack (pysam, SciPy, NumPy, scikit-learn, umap-learn, pandas, statsmodels, matplotlib, seaborn, numba, dask). cNMF was performed using modified versions of the code described in (Kotliar et al., 2019) and accessed on the world wide web at github.com/dylkot/cNMF. Imaging data was analyzed using additional packages, including OpenCV and Suite2p (Pachitariu et al., 2017). Scripts to replicate data analyses are available on the world wide web at github.com/dattalab/Tsukahara_Brann_OSN.
Hypothesis testing was performed using non-parametric statistical tests, including the Mann-Whitney U Test and the Wilcoxon signed-rank test, except in the cases where p-values were calculated empirically using resampling-based permutation tests. Distributions were compared via the Kolmogorov-Smirnov (KS) test. Experiments with multiple factors were tested with either a two-way ANOVA or with the Jonckheere-Terpstra trend test (calculated empirically via 10,000 permutations), a non-parametric extension of the Kruskal-Wallis test to evaluate trends across groups. Statistical tests that were performed for multiple genes, ORs, GEPs, glomeruli, and odors were corrected for multiple comparisons using the Benjamini-Hochberg FDR Procedure or the Holm-Bonferroni method, as specified in the text. The results of statistical tests are listed in the figure legends. The precision of sample statistics and regression trend lines were evaluated using bootstrapping, and, except where noted, plots and error bars depict the mean and the 95% confidence intervals of the mean across 1,000-10,000 bootstraps. Results from classification, regression, and other analyses were repeated across 1,000 restarts, using balanced subsamples of the data in analyses at the OSN-level to account for differences in OR frequencies. Throughout the paper, a non-parametric version of the box plot (also known as a letter-value plot) was used to represent multiple quantiles and tails of large distributions of data (e.g. to summarize across OSN subtypes or sets of ORs or OR pairs) in an agnostic way that does require setting bandwidth parameters as in violin plots or kernel density estimates. Like a conventional box plot, the largest box represents the interquartile range (25-75 percentile) and the median (dotted-line). Subsequent boxes recursively represent exponentially-smaller quantiles (the 12.5-25 and 75-87.5 percentiles, then 6.25-12.5 and 87.5-93.75 percentiles, then 3.125-6.25 and 93.75-96.875 percentiles, and so forth).
This application claims benefit under 35 U.S.C. 119 (e) of U.S. Provisional Application No. 63/285,307, filed Dec. 2, 2021, the contents of which are incorporated herein by reference in their entirety.
This invention was made with government support under Grant Nos. DC016222 and NS112953 awarded by the National Institutes of Health (NIH). The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2022/051618 | 12/2/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63285307 | Dec 2021 | US |