As a general overview, mass spectrometry (MS) is an analytical technique for the detection and quantitation of chemical compounds based on the analysis of mass-to-charge (m/z) values of ions formed from those compounds. MS involves ionization of one or more compounds of interest from a sample, producing precursor ions, and mass analysis of the precursor ions. Tandem mass spectrometry or mass spectrometry/mass spectrometry (MS/MS) involves ionization of one or more compounds of interest from a sample, selection of one or more precursor ions of the one or more compounds, fragmentation of the one or more precursor ions into product ions, and mass analysis of the product ions.
Both MS and MS/MS can provide qualitative and quantitative information. The measured precursor or product ion spectrum can be used to identify a molecule of interest. The intensities of precursor ions and product ions can also be used to quantitate the amount of the compound present in a sample.
Mass spectrometry techniques often generate mass spectrum data utilizing a mass-to-charge ratio (m/z) for detected ions. Knowledge of the actual charge or mass of the detected ions, however, is often not directly measurable. As a result, some overlap of detected ions may occur in certain scenarios. For example, a singularly charged ion with a mass may appear in the mass spectrum as having the same mass-to-charge ratio as a doubly charged ion with double the mass. This issue may generally be referred to as a peak overlapping problem.
In top-down mass spectrometry (MS) protein analysis, for example, overlapping of mass or mass-to-charge (m/z) peaks in a mass spectrum is a significant problem. In this type of analysis, a very wide range of different fragment or product ions are produced, including product ions that have lengths of 1-200 amino acids and have 1-50 different charge states. The product ion peaks are heavily overlapped with each other in a single spectrum. In addition, the overlap can be so extensive that even mass spectrometers with the highest mass resolution (Fourier transform ion cyclotron resonance (FT-ICR) or orbitrap) cannot deconvolve such overlapped peaks. As a result, large product ions are often lost in top-down protein analysis, limiting the sequence coverage of large proteins. International Publication WO2020/157720, published on Aug. 6, 2020, and International Publication WO2019/197983, published on Oct. 17, 2019, both provide additional discussion of a top-down MS protein analysis and associated challenges.
In an aspect, the technology relates to a method for classifying a charge state of detected ions, the method comprising: generating a pulse for each ion in a plurality of ions detected by a detector, wherein each pulse has a pulse characteristic; generating a pulse-characteristic distribution of the generated pulses; and based on the pulse-characteristic distribution, generating an identification of the charge state of one or more ions in the plurality of ions.
In an example, the pulse-characteristic distribution is a plot of probability versus pulse characteristic. In another example, the pulse characteristic is at least one of a pulse height, a pulse width, or a pulse area. In a further example, the pulse characteristic is pulse height, and the pulse height is a maximum voltage of the pulse. In yet another example, the detector is an electron multiplier detector and the detector is configured to detect predominantly single ion events. In still another example, generating an identification of the charge state comprises comparing the pulse-characteristic distribution with a reference pulse-characteristic distribution. In still yet another example, the ions detected by the detector are generated from ionization of a sample, and the reference pulse-characteristic distribution is identified based on known characteristics of the sample.
In another example, the generated identification comprises a probability of the charge state. In a further example, the method further includes based on the identification of the charge state, generating a deconvolved mass spectrum for the detected ions, wherein one axis of the mass spectrum is mass rather than mass per charge (m/z). In still another example, the plurality of ions are grouped using m/z domain into different groups, and the identification of charge state based on the pulse-characteristic distribution is performed for each group. In yet another example, the grouping step comprises generating mass spectrums based on the plurality of detected ions; identifying a first peak in the mass spectrum, wherein the first peak has a mass per charge (m/z) value; and grouping ions within a mass per charge (m/z) range, based on the m/z value of the first peak. In still yet another example, the grouping step comprises selecting a first subset of the plurality of ions into a first intensity band and selecting a second subset of the plurality of ions into a second intensity band; generating a first mass spectrum for the first intensity band; generating a second mass spectrum for the second intensity band; identifying a first peak in at least one of the mass spectra, wherein the first peak has a mass per charge (m/z) value; and grouping ions within a mass per charge (m/z) range, based on the m/z value of the first peak.
In another example, the method further includes generating a second pulse-characteristic distribution for the ions within the m/z range, and generating the identification includes: determining that ions forming the first pulse-characteristic distribution have a first charge state; and determining that ions forming the second pulse-characteristic distribution have a second charge state. In a further example, the method further includes based on the identification of the charge state, determining one or more isotopes corresponding to the one or more ions forming the first peak. In yet another example, generating an identification of the charge state comprises comparing the first pulse-characteristic distribution with a reference pulse-characteristic distribution. In still another example, the ions detected by the detector are generated from ionization of a sample, and the reference pulse-characteristic distribution is identified based on known characteristics of the sample. In still yet another example, the generated identification comprises a probability of the charge state. In another example, the method is performed as part of a top-down protein analysis.
In another example, the method further includes identifying a second peak; based on at least the first peak and the second peak, determining a consensus m/z distance; identifying that the first peak and the second peak form a feature; and wherein generating the identification of the charge state is based on the consensus distance. In a further example, the step of identifying that the peaks form the feature comprises comparing pulse-characteristic distributions of said peaks and selecting peaks with substantially the same pulse-characteristic distributions. In still another example, the comparison of pulse-characteristic distribution is performed by calculating Euclidean distance between said pulse-characteristic distributions and comparing it with a predetermined threshold. In yet another example, the method further includes, based on the consensus distance, identifying a missing peak corresponding to the feature. In still yet another example, the method further comprises, based on the identification of the charge state of the ions, generating a deconvolved mass spectrum for the detected ions, wherein one axis of the mass spectrum is mass rather than m/z. In another example, the pulse characteristic is a maximum voltage of the pulse.
In another aspect, the technology relates to a mass analysis system. The mass analysis system includes a detector to configured to generate a pulse for each ion detected by the detector; a processor; and a memory storing instructions that are configured to, when executed by the processor, cause the system to perform a set of operations. The operations include generating a pulse for each ion in a plurality of ions impacting an electron multiplier detector, wherein each pulse has a pulse characteristic; generating a pulse-characteristic distribution of the generated pulses; and based on the pulse-characteristic distribution, generating an identification of a charge state of one or more ions in the plurality of ions. In an example, the detector is an electron multiplier detector. In another example, the mass analysis system further includes an ion source device, a dissociation device, and a mass analyzer.
In another aspect, the technology relates to a method for classifying a charge state of detected ions, the method includes detecting a transient time-domain signal induced on an image-charge detector of the mass analyzer by oscillations of a plurality ions in the mass analyzer using a processor; converting the transient time-domain signal to a plurality of frequency-domain (FD) peaks corresponding to ions in the plurality of ions; generating an FD-peak-characteristic distribution of the generated pulses; and based on the FD-peak-characteristic distribution, generating an identification of the charge state of one or more ions in the plurality of ions.
In an example, the FD-peak-characteristic distribution is a plot of probability versus FD-peak characteristic. In another example, the FD-peak characteristic is a peak intensity. In still another example, generating an identification of the charge state includes comparing the FD-peak-characteristic distribution with a reference FD-peak-characteristic distribution. In a further example, the ions detected by the detector are generated from ionization of a sample, and the reference FD-peak-characteristic distribution is identified based on known characteristics of the sample. In still another example, the generated identification comprises a probability of the charge state. In yet another example, the method further includes, based on the identification of the charge state, generating a deconvolved mass spectrum for the detected ions, wherein one axis of the mass spectrum is mass rather than mass per charge (m/z).
In another aspect, the technology relates to a method for classifying a charge state of detected ions. The method includes generating a pulse for each ion in a plurality of ions detected by a detector, wherein each pulse has a pulse characteristic; generating pulse-characteristic distributions of the generated pulses; and based on the pulse-characteristic distributions, identifying a coarse charge state; identifying a peak pair of a first ion peak and a second ion peak, such that said peaks have adjacent charge states; and based on an m/z value for the first ion peak, an m/z value for the second ion peak, and a mass of a charge carrier, determining a refined charge state of the second ion peak.
In an example, the coarse charge state identification is accurate to a range of possible charge states for at least one peak forming a pair and at least one charge state from the range is adjacent to the charge state identified for the second peak. In another example, the method further includes accepting the refined charge state identification if the refined identified charge state is an integer within a certain threshold. In still another example, a third peak with an adjacent charge state is identified and the third peak forms a pair with at least one of the peaks and charge state identifications for said common peak are matching in both pairs. In yet another example, the method further includes, based on the determined charge state of the second ion peak, determining the charge state of the first ion peak. In still yet another example, the method further includes obtaining the mass of the charge carrier based on known characteristics of a sample ionized to generate the plurality of ions.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter. Additional aspects, features, and/or advantages of examples will be set forth in part in the description which follows and, in part, will be apparent from the description, or may be learned by practice of the disclosure.
Non-limiting and non-exhaustive examples are described with reference to the following figures.
As briefly discussed above, peak overlapping of detected ions is problematic for analysis of MS results. To resolve this peak overlapping problem, one solution is to determine or infer the charge state of the ions forming the peak. By determining the charge state, the mass of the ion may then be resolved and ions from different species can be distinguished from one other. In addition, multiple peaks in a mass spectrum may be represent an isotopic cluster or feature. In some cases, it may be unclear however which peaks belong to which cluster. Charge state identification techniques can further resolve the identification of the proper peaks for such clusters.
Analog to Digital Conversion (ADC) banding methods have previously been proposed for separating ions based on their intensity. One such example of a banding method is disclosed in International Publication WO2020/157720 (the '720 Publication), published on Aug. 6, 2020, which is incorporated by reference herein in its entirety. Such separation based on ion intensity facilitates charge state separation, thus improving peak capacity. However, the described methods do not teach how to separate ions based on their charge states. This leads to two problems: first, the signal originating from the same specie is diluted between multiple data channels, second, there is no proposed way to construct deconvolved mass spectrum convenient for subsequent data interpretation. It is therefore desired to have an improved method, which can assign charge states to individual ion detection events.
One of such methods has been recently published in the following paper: Kafader et al., Multiplexed mass spectrometry of individual ions improves measurement of proteoforms and their complexes, Nature Methods, Nature Methods volume 17, pages 391-394(2020). However, the described method in that paper is limited to mass spectrometers with detection systems where the detected signal has a deterministic relationship with respect to measured charge (e.g. image-charge induced detectors). Accordingly, among other things, the described method in that paper does not teach how to setup charge assignment for each individual ion measurement event for a mass spectrometer based on detection system where the measured signal has probabilistic relationship with respect to measured charge (e.g., electron-multiplier based detection systems).
In some of such new class of acquisition strategies, direct identification of charge state is attempted prior to co-adding corresponding signal to a mass spectrum (see Kafader et. al.). However, such strategies are not applicable for charge state assignment in electron-multiplier detection systems. In many cases for systems based on electron-multiplier detection systems, each charge state does not have a distinct detector response, but instead has a distinct pulse height distribution (more generically intensity distribution) specific to each charge state and m/z values.
The present technology allows for a determination or inference of a charge state of an ion according to characteristics of a pulse generated by the detector upon detecting the ion. To do so, the present technology generates a distribution of a pulse characteristic for multiple detected ions. The pulse characteristic may include a pulse height, a pulse width, or a pulse area, among other possible characteristics. The distribution of the pulse characteristics form distinct profiles depending on the charge state of the detected ions. Thus, charge state may be determined the pulse-characteristic distributions. Once the charge state of the ion is determined, the mass of the ion may be determined based on m/z of the ion, and the ion may be distinguished from other ions. Ultimately, the compound that is being analyzed by the MS technique may be identified based on the determined charge states of the detected ions. Thus, by identifying and/or assigning the charge state of an ion, the measurement capabilities of a mass analysis instrument are improved. The accuracy of the mass analysis instrument may also similarly be improved.
The mass analyzer 103 can be any type of mass analyzer used for a desired technique, such as a time-of-flight (TOF), an ion trap, or a quadrupole mass analyzer. The detector 104 may be an appropriate detector for detection ions and generating the signals discussed herein. For example, the detector 104 may include an electron multiplier detector that may include analog-to-digital conversion (ADC) circuitry. The detector 104 may also be an image charge induced detector. The detector 104 produces detection pulses for detected ions.
The computing elements of the system 100, such as the processor 105 and memory 106, may be included in the mass spectrometer itself, located adjacent to the mass spectrometer, or be located remotely from the mass spectrometer. In general, the computing elements of the system may be in electronic communication with the detector 104 such that the computing elements are able to receive the signals generated from the detector 104. The processor 105 may include multiple processors and may include any type of suitable processing components for processing the signals and generating the results discussed herein. Depending on the exact configuration, memory 106 (storing, among other things, mass analysis programs and instructions to perform the operations disclosed herein) can be volatile (such as RAM), non-volatile (such as ROM, flash memory, etc.), or some combination of the two. Other computing elements may also be included in the system 100. For instance, the system 100 may include storage devices (removable and/or non-removable) including, but not limited to, solid-state devices, magnetic or optical disks, or tape. The system 100 may also have input device(s) such as touch screens, keyboard, mouse, pen, voice input, etc., and/or output device(s) such as a display, speakers, printer, etc. One or more communication connections, such as local-area network (LAN), wide-area network (WAN), point-to-point, Bluetooth, RF, etc., may also be incorporated into the system 100.
In
Each of the pulses may be characterized by pulse characteristics. The pulse characteristics may include characteristics such as pulse height, pulse width, and/or area under the curve of the pulse. The pulse height of each pulse is indicated by the rectangles 131, 132, and 132. The pulse height may be the maximum pulse height for the respective pulse, and the pulse height may have units of voltage. The pulse width may be at any point of the pulse, but one measure of pulse width may be the full width at half maximum (FWHM). The pulse width may have units of time. The area under the pulse curve may be generated by integrating the area under the respective pulse signal for each pulse.
The pulse characteristics may be used to separate the detected ions into different bands.
A first pulse-height distribution 202 and a second pulse-height distribution 204 are depicted in the plot 200. As can be seen from the plot 200, the pulse-height distributions overlap, but the first pulse-height distribution 202 has a profile that is distinct from the profile of the second pulse-height distribution 204. The difference in profile shape is predominately due to a different in charge state of the detected ions forming the respective pulse-height distributions. For instance, the detected ions forming the first pulse-height distribution 202 correspond to a 3+ charge ion, and the detected ions forming the second pulse-height distribution 204 correspond to a 7+ charged ion. Accordingly, once various pulse-height distributions have been established or generated, it may be possible to determine a charge state of any single detected ion by determining on which pulse-height distribution profile the corresponding ion pulse fits.
As some additional detail, the pulse-height distributions 202, 204 were generated for product ions having very similar m/z values at approximately 517. The product ions were generated from a top-down ECD analysis of carbonic anhydrase 2 (CA2). As discussed above, as can be seen in plot 200, the pulse-height distributions originating from different charge states can significantly overlap. In case of such overlap, a single intensity data point is insufficient, and any charge state determination based on a single intensity data point will have a significant chance being incorrect.
However, for ensembles of single ion detection events originating from the same sample, it is sometimes possible to infer charge state. This can be achieved either by comparing the pulse-height distribution of such ensemble to a set of pulse-height distributions of known compounds with similar m/z and charge states or using extra information usually available based on the nature of the analyzed sample. An example of such extra information can be a distinct isotope pattern, which if resolved by mass spectrometer encode charge state information in their relative positions in m/z space. Alternatively, a charge state distribution can be used for this purpose, which also encode charge state information in their relative positions in m/z space. In this disclosure, a class of methods for charge state identification of single detection events based on grouping of similar detection events in ensembles, assignment of charge state to the ensembles, and subsequent assignment of charge state information for individual events is described.
In an example, a data set containing pulse-height distributions for different charge states and m/z is collected. The pulse-height distributions for the group of ions may be used to assign a charge state for individual detection events. For a detection event corresponding to unknown specie, an ensemble of detection events may be selected based on their relative proximity in m/z space. A pulse-height distribution may be calculated for such ensemble. This pulse-height distribution may then be matched to the known pulse-height distributions from a similar m/z and a “best” match is selected. As will be appreciated, the term “best” may be used to identify a relatively determined optimum state given the data acquired and the determination effort applied. A charge state of best match is then inferred for each detection event and/or corresponding ion from the ensemble. Alternatively or additionally, a model may be constructed based on well characterized data of intensity distributions for different m/z and charge state can be predicted based on this model. This can be achieved for example using machine learning techniques, where training data set may contain annotated data collected a priori.
In another example, in the first step, collected data of individual detection events may be summed to form a conventional mass spectrum or alternatively multiple mass spectra based on its intensity. In the second step, algorithms are employed to do feature extraction and feature charge state assignment, where the feature is an isotope cluster or charge state cluster corresponding to the same molecule. In the third step, the determination of ensembles of ion detection events corresponding to those features and subsequent inference of the charge state is performed.
In another example, a data acquisition and processing strategy is presented, which combines both the information on pulse-height distributions of individual groups of ions and extra information known about the nature of the sample, e.g., isotope pattern or charge state distributions, to identify ensembles of detection events originating from similar ions and subsequently assign charge states to the individual ions.
Based on the pulse-characteristic distribution(s) generated in operation 302, an identification of the charge state of one or more ions in the plurality of ions may be generated at operation 303. For example, generated pulse-characteristic distribution may be compared to a set of reference pulse-characteristic distributions with known charge states to determine the closest match to the generated pulse-characteristic distribution. The charge state of the ions forming the generated pulse-characteristic distribution may then be assigned the charge state associated with the reference pulse-characteristic distribution. The number of reference pulse-characteristic distributions for which the generated pulse-characteristic distribution is compared may be limited or reduced based on external information or known characteristics about the sample being analyzed and/or the m/z values of the ions forming the generated pulse-characteristic distribution. For instance, a subset of reference pulse-characteristic distributions may exist for a particular m/z value or range, and/or a subset of reference pulse-characteristic distributions may correspond to particular isotopes, compounds, and/or samples.
In other examples, a machine learning model (e.g., neural network) may be trained based on a set of reference pulse-characteristic distributions with known charge states. The generated pulse-characteristic distribution may be provided as an input into the trained machine learning model. The trained machine learning model processes the input generated pulse-characteristic distribution, and the output of the trained machine learning model is indicative of the charge state or a likely charge state that corresponds to the generated pulse-characteristic distribution. For instance, the output of the machine learning model may be a charge state indication and/or an indication of the reference pulse-characteristic distribution that most closely matches the generated pulse-characteristic distribution. In some examples, the input to the trained machine learning model may also include an m/z value or an m/z range for the ions forming the generated pulse-characteristic distribution. Additionally or alternatively, the input may also include external data regarding the sample, such as an expected compound or isotope type. In other examples, different machine learning models may be trained for different types of samples, and the machine learning model for the sample being analyzed may be selected for use in analyzing the generated pulse-characteristic distribution.
The identification or assignment of charge state may also, or alternatively, be based on grouping peaks together as features and then analyzing the relative distance between grouped peaks. Additional detail regarding such a charge assignment process is discussed in further detail below with respect to method 3000 in
Returning to method 300 in
At operation 314, one or more pulse-characteristic distributions are generated for the ions forming the peak identified in operation 313. The ions forming the peak may be ions that are identified via a peak finding algorithm. The ions may also be ions that are within a particular m/z range of the m/z value of the identified peak. For instance, the m/z range may be selected based on characteristics of the peak and/or may be a preset range (e.g., a fixed m/z value). As an example, the m/z range may be from the beginning m/z value of the peak (i.e., where the peak begins) to the ending m/z value of the peak (i.e., wherein the peak ends).
For each of the ions forming the peak and/or within the m/z range, the corresponding pulses for those ions may be accessed. The pulses may be plotted or stored in manner that indicates probability or frequency versus pulse characteristic (e.g., pulse height). For instance, the plot may be similar to the plot in
At operation 315, based on a pulse-characteristic distribution generated in operation 314, a charge state of one or more ions forming the identified peak is identified and/or assigned. Operation 315 may be similar to operation 303 in
At operation 323, a mass spectrum for each of the intensity bands may be generated. For instance, where two intensity bands are utilized, a first mass spectrum for the first intensity band may be generated, and second mass spectrum from the second intensity band may be generated. The mass spectrums may be similar to the mass spectrums depicted in
At operation 324, one or more peaks are identified from the mass spectrums generated in operation 323. Peak identification and/or selection may be performed in the same or similar manner as in operation 313 discussed above. By generating the mass spectrums prior to identifying the peak(s), background noise may be reduced or removed. For instance, one or more of the intensity bands may present cleaner signals and/or more well-defined peaks than a single aggregated mass spectrum for all ions. As such, peaks may be more accurately identified.
At operation 325, one or more pulse-characteristic distributions may be generated for the ions forming the peak(s) identified in operation 323 and/or within an m/z range of the m/z values of the identified peak(s). For example, a first pulse-characteristic distribution corresponding to ions having a first charge state and a second pulse-characteristic distribution corresponding to ions having a second charge state may be generated in operation 314. The pulse-characteristic distributions may be generated as discussed above, for instance with reference to operations 315 and 303.
At operation 326, based on at least one of the pulse-characteristic distribution(s) generated in operation 325, an identification of a charge state of ions forming the identified peak(s) is generated. Identifying the charge state from the pulse-characteristic distribution(s) may be performed as discussed above. For instance, operation 326 may be similar to operation 303 in
Returning to
In step 3020, the recorded data is summed to a single spectrum or multiple mass spectra based on peak intensity as described above and/or in the '720 Publication. In this example, the data is summed to form multiple mass spectra corresponding to different intensity bands.
Returning to
In step 3040, pulse-characteristic distributions are calculated for each peak using the detection events filtered by their proximity to the peak apex. For instance, all pulses corresponding to ions in the highlighted area for a particular peak in
The generated pulse-characteristic distributions may be pulse-height distributions. An example of calculated pulse-height distributions for multiple peaks of
Returning to
Returning to
An example of such a calculation is shown in
Returning to
An example of using such algorithm may be demonstrated with reference to
Another algorithm may also be alternatively or additionally employed to search for missing peaks, which is not limited to finding only internal peaks. This algorithm computes positions of possible adjacent peaks and then extracts the recorded signals corresponding to such positions. This signal is further processed to form pulse-characteristic distributions, which then can be compared to one (or alternatively average) of the pulse-characteristic distributions computed for the peaks in the group using similar methods as described in steps 3050-3060. A confirmed peak is then added to the peak list of the feature.
Step 3080 may include performing an analysis to find overlapping peaks in multiple features. This step can be accomplished by comparing peak positions in each feature and finding peaks at substantially the same positions. For example, a newly found peak with m/z of 517.994 from previous step 3070 is at substantially the same position as peak 0 from feature 2 (see
Once an overlapping peak has been identified, an extra step of finding contributions of each feature to an overlapping peak can be performed. For this a system of linear equations can be written and solved approximately with constraints or without constraints. Such constraints can include a requirement of each contribution to be non-negative. This step can be accomplished for example using algorithms of non-negative least square approximations.
At step 3090, detection events (e.g., ions corresponding to pulses) may be assigned to the features. Step 3090 may be performed for example using following algorithm. First, using a consensus distance from step 3070 and one or more additional instrumental parameters such as instrument resolution, an m/z distribution can be modelled for each peak for the feature. Second, using a consensus pulse-characteristic distribution for said feature (which can be calculated as an average of pulse-characteristic distributions of all non-overlapping peaks) and the m/z distribution from the first part of this step, two values reflecting the probability of a feature to have such detection event can be computed. These values are intersections of the m/z position and calculated m/z distribution, and similar intersection of the detection event intensity and a consensus intensity distribution attributed to a feature. A product of those values is a score, which can be used to attribute a detection event to a feature using a threshold.
An example of calculating such a score may be provided with reference to
In case of multiple overlapping features, the feature that results in the highest score is selected. Additional constraints that balance the total contribution of said feature to an overlapped peak may also be set and implemented. Additional or alternative algorithms may also be implemented for this step to determine the most appropriate feature for which to assign a detection event corresponding to a detected ion. Such algorithms may estimate probabilities of the detection event belonging to a certain feature, such as by using a Bayesian framework. In step 3010, the feature charge state is assigned to the detection event.
At operation 3206, based on the pulse-characteristic distributions, ion peaks having adjacent charge states are identified. Identifying peaks having adjacent charge states may include determining an estimate or coarse charge state based on the pulse-characteristic distributions. The coarse charge state identification may only be accurate to a range of possible charge states for at least one peak forming a pair and at least one charge state from this range is adjacent to the charge state identified for the second peak.
An example of ion peaks having adjacent charge states are shown in the example plot 1100 in
At operation 3208, the charge state of the charge states of the ions forming the peaks may be further determined based on the following equations:
Equation 1 expresses a relationship of the m/z position of the second peak ((m/z)2) with the non-charge carrier mass of the ion (M), the charge state of the second peak (z2), and the mass of the charge carrier (X). Equation 2 expresses the relationship of the m/z position of the first peak ((m/z)1) with the non-charge carrier mass of the ion (M), the charge state of the second peak (z2), and the mass of the charge carrier (X). Of note, Equation 2 assumes that the charge state difference between the first peak and the second peak is 1. Thus, in other examples where the estimated charge state difference between the first peak and the second peak is value other than 1, the 1 in Equation 2 is replaced with that value.
Equation 3, which is based on Equations 1 and 2, expresses a relationship between the charge state for ions in the second peak (z2) with the m/z position of the second peak ((m/z)2), the m/z position of the first peak ((m/z)1), and the mass of the charge carrier (M). Each of these values is either measured by the detector or is known from the ionized sample and/or sample preparation process. For instance, a common charge carrier is proton, which has a mass of ˜1 atomic mass units (AMU). Accordingly, the charge state of the ions forming the second peak (z2) may be determined using Equation 3. Based on the determined charge state for the ions forming the second peak (z2), the charge state for ions forming the first peak (z1) may be determined. The determined charge state may be a refined charge state as compared to the coarse charge state initially estimated or determined.
The refined charge state should be an integer or near an integer, such as within a threshold of an integer. If it is not, the coarse charge state identification or the refined charge state identification may be incorrect. Accordingly, the coarse and/or refined charge state identification may only be accepted if the refined identified charge state is an integer within a certain threshold. If not, the coarse charge state may be re-estimated, and the method is performed again with the revised coarse charge state. In addition, to potentially increase confidence in the assignment, a third peak with an adjacent charge state may be identified. The third peak forms a pair with at least one of the first two peaks, and the charge state identifications for said common peak are matching in both pairs.
For image-charge detectors, therefore, the intensity of frequency-domain signals or peaks are proportional to the charge state of the underlying ions similar as to how the pulses described above are proportional to charge state. Thus, the intensity or other characteristics of the frequency-domain (FD) peaks may be used to generate distributions similar to the pulse-characteristic distributions discussed above. Distributions generated from the characteristics of the FD peaks may be referred to as FD-peak-characteristic distributions or FD-peak-intensity distributions where intensity of the FD peak is used as the characteristic of interest. The FD-peak-characteristic distributions may then be used in substantially the same manner as the pulse-characteristic distributions to determine charge state.
Returning to
At operation 3306, one or more FD-peak-characteristic distributions are generated. For example, the generated FD-peak-characteristic distributions may be generated for a particular FD-peak-characteristic, such as intensity. At operation 3308, based on the one or more FD-peak-characteristic distributions generated in operation 3306, an identification of a charge state of one or more ions in the plurality of ions detected in operation 3302 is generated. Identifying the charge state based on the FD-peak-characteristic distributions may be performed using any of the method described herein using the pulse-characteristic distributions. For instance, the FD-peak characteristic distributions may be used in place of the pulse-characteristic distributions.
At operation 3310, a mass spectrum is generated for the detected ions. The mass spectrum may be generated based on the identified charge state(s) in operation 3308. For example, with the known charge state, overlapping peaks may be resolved or otherwise indicated in the mass spectrum. For instance, because the charge state for ions forming the mass spectrum is known or identified in operation 3308, a deconvolved mass spectrum for the detected ions may be generated. For the deconvolved mass spectrum, one axis of the mass spectrum may be mass rather than mass per charge (m/z). At operation 3312, a compound, or an amount of a compound, in the sample corresponding to the detected ions may be identified. The compound or compound amount may be identified from the mass spectrum generated in operation 304 and/or from the charge state(s) identified in operation 3312.
Mass spectrometer 1310 includes mass analyzer 1317. Mass analyzer 1317 includes image-charge detector 1318. Image-charge detector 1318 produces oscillating signals or transient time-domain signals for detected ions with amplitudes that are proportional to the ion charge state. Mass analyzer 1317 can be any type of mass analyzer that can detect ions using an image-charge detector including, but not limited to, an electrostatic linear ion trap (ELIT), an FT-ICR, or an orbitrap mass analyzer. Mass analyzer 1317 is shown in
The mass analyzer 1317 detects transient time-domain signal 1319 induced on image-charge detector 1318 by oscillations of a plurality of ions in mass analyzer 1317. The plurality of ions is transmitted to mass analyzer 1317 by mass spectrometer 1310. Processor 1320 converts transient time-domain signal 1319 to a plurality of frequency-domain pulses or peaks 1321. Each frequency-domain signal corresponds to an ion of the plurality of ions. Processor 1320 converts transient time-domain signal 1319 to a plurality of frequency-domain peaks 1321 using a Fourier transform, for example.
Processor 1320 may compare an intensity of each frequency-domain peak of plurality of frequency-domain peaks 1321 to two or more different predetermined intensity ranges corresponding to two or more different charge state ranges. Processor 1320 may store each frequency-domain peak in one of two or more data sets 1322 corresponding to the two or more predetermined intensity ranges based on the comparison. Processor 1320 may create a mass spectrum based on frequency-domain peaks and/or the identified charge states discussed herein.
In various embodiments, processor 1320 converts transient time-domain signal 1319 to plurality of frequency-domain peaks 1321, compares an intensity of each frequency-domain peak to two or more different predetermined intensity ranges, and stores each frequency-domain peak in one of two or more data sets 1322 during acquisition. In an alternative embodiment, processor 1320 converts transient time-domain signal 1319 to plurality of frequency-domain peaks 1321, compares an intensity of each frequency-domain peak to two or more different predetermined intensity ranges, and stores each frequency-domain peak in one of two or more data sets 1322 after acquisition.
As described above, if multiple copies of the same ion are oscillating in mass analyzer 1317 at the same time, the measured intensity may not be proportional to the charge state. As a result, in various embodiments, mass spectrometer 1310 transmits ions to mass analyzer 1317 so that mass analyzer 1317 only includes a single ion of a specific m/z and charge state at any given time.
In various embodiments, the system of
In addition, mass spectrometer 1310 further includes a dissociation device. The dissociation device can be, but is not limited to, ExD device 1315 or CID device 1313. A dissociation device can be used for top-down protein analysis, for example.
In top-down protein analysis, ion source device 1311 ionizes a protein of a sample, producing a plurality of precursor ions for the protein in an ion beam. The dissociation device dissociates the plurality of precursor ions in the ion beam, producing a plurality of product ions with different charge states in the ion beam. The mass spectrometer 1310 transmits the plurality of product ions to mass analyzer 1317 so that the plurality of product ions are the plurality of ions transmitted to mass analyzer 1317 by mass spectrometer 1310, as described above.
In various embodiments, processor 1320 is used to control or provide instructions to ion source device 1311 and mass spectrometer 1310 and to analyze data collected. Processor 1320 controls or provides instructions by, for example, controlling one or more voltage, current, or pressure sources (not shown).
In another example, the methods in
For all these methods, three positive outcomes can generally be envisaged and may be practically identical in their utility. First, using the information (e.g., charge state) about an individual detection events, a deconvolved mass spectrum may be generated using formulae (m/z−mp)*Z, Z is the determined charge state and where mp is a mass of proton. Second, this information may be used to compose a set of spectra for each charge state covering either whole m/z range or sections of m/z ranges. Third, this information may be used to generate a list of individual features each having m/z and z assigned to it. A compound and/or an amount of a compound present in an analyzed sample may then be determined or generated based on the identified features.
For all described embodiments, the step of assigning a charge state to an individual detection event can be substituted with or include assigning probabilities of said detection event originating from the ions forming a feature and hence assigning a probability of said detection event being associated with a charge state assigned to this feature. Such probabilities may, for instance, be calculated using a Bayesian framework. At a subsequent step of collating a representative mass spectrum (either entire or partial), a proportional contribution from the detection event can then be accordingly distributed among the features it represents with their respective positions within the mass spectrum.
While the present teachings are described in conjunction with various embodiments, it is not intended that the present teachings be limited to such embodiments. On the contrary, the present teachings encompass various alternatives, modifications, and equivalents, as will be appreciated by those of skill in the art.
Aspects of the present disclosure, for example, are described above with reference to block diagrams and/or operational illustrations of methods, systems, and computer program products according to aspects of the disclosure. The functions/acts noted in the blocks may occur out of the order as shown in any flowchart. For example, two blocks shown in succession may in fact be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/acts involved. Further, as used herein and in the claims, the phrase “at least one of element A, element B, or element C” is intended to convey any of: element A, element B, element C, elements A and B, elements A and C, elements B and C, and elements A, B, and C.
The description and illustration of one or more aspects provided in this application are not intended to limit or restrict the scope of the disclosure as claimed in any way. The aspects, examples, and details provided in this application are considered sufficient to convey possession and enable others to make and use the best mode of claimed disclosure. The claimed disclosure should not be construed as being limited to any aspect, example, or detail provided in this application. Regardless of whether shown and described in combination or separately, the various features (both structural and methodological) are intended to be selectively included or omitted to produce an embodiment with a particular set of features. Having been provided with the description and illustration of the present application, one skilled in the art may envision variations, modifications, and alternate aspects falling within the spirit of the broader aspects of the general inventive concept embodied in this application that do not depart from the broader scope of the claimed disclosure.
This application is being filed on May 14, 2021, as a PCT International Patent Application and claims the benefit of priority to U.S. Patent Application Ser. No. 63/024,987, filed May 14, 2020, the entire disclosure of which is hereby incorporated by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2021/000330 | 5/14/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63024897 | May 2020 | US |