The present invention relates to mass spectrometers and mass spectrometric analyses.
Mass spectrometry is a well-established technology for analyzing the presence and concentration (or amount) of a wide variety of chemical constituents with high sensitivity. Such studies often employ the well-known technique of tandem mass spectrometry, often referred to as MS/MS or MS2 mass spectrometry, in which particular precursor ions are selected, isolated and fragmented or otherwise reacted (e.g., in a collision cell or a reaction cell), and the reaction-product ions or fragment ions (all of which are referred to as product ions) are mass analyzed in a mass analyzer. The mass analysis generates a record of the intensities of the various fragment-ion or other product-ion species as a function of their mass-to-charge ratio (m/z) values. The MS2 method can be extended by further fragmentation or reaction of selected and isolated first-generation product ion species so as to generate second-generation product ion species, yet further fragmentation or reaction of selected and isolated second-generation product-ion species so as to generate third-generation product-ion species and so on, with possible mass analysis of the resulting product ions of each generation. Such extensions of the tandem mass spectrometry technique are typically referred to an MSn spectrometry, with “n” indicating the number of steps of mass analysis and the number of generations of ions, with the original unfragmented and unreacted ions comprising the first generation.
As each sample portion or fraction is received by the mass spectrometer 13, it is ionized by an ionization source 14. The ionization source may produce a plurality of ions comprising a plurality of ion species (i.e., a plurality of precursor ion species) comprising differing charges or masses from each chemical component. Thus, a plurality of ion species of differing respective m/z ratios may be produced for each chemical component, each such component eluting from the chromatograph at its own characteristic time. These various ion species are mass analyzed by a mass analyzer 15 of the mass spectrometer and detected by a detector 16. As a result of this process, the ion species may be appropriately identified according to their various m/z ratios. As illustrated in
Still referring to
Frequently, tandem mass spectra of natural samples are highly complex, with each one of many chemical components represented by peaks corresponding to a plurality of different-size fragment ions and each fragment of each original molecule represented by peaks corresponding to a plurality of charge states. If a fractionation apparatus 11 (
Typically, post-data-acquisition mathematical or logical data analysis of the measured MS2 mass spectra may be subsequently carried out in order to identify compounds present in the sample and to calculate their absolute or relative abundances within the sample. The data analyses may make use of searchable mass spectral libraries in order to match MS1 precursor ions and their corresponding MS2 (or MSn) product ions to known compounds. The rate of success in identifying and quantifying the compounds during such post-acquisition analysis often depends upon the quality of the prior real-time decision making processed during the earlier data acquisition. However, the number of choices that are available for any real-time analysis decision are limited, since the various decision trees are either coded into instrument control software or are input into memory by a user during development of an analysis method.
Recently, with the advent of faster computer processors and speedier data transfer rates, the concept of real-time searching of tabulated mass spectral data or real-time algorithmic computation of fragmentation patterns has been introduced in order to better inform automated DDA decision-making (e.g., Erickson, Brian K., Julian Mintseris, Devin K. Schweppe, Jose Navarrete-Perea, Alison R. Erickson, David P. Nusinow, Joao A. Paulo, and Steven P. Gygi. “Active instrument engagement combined with a real-time database search for improved performance of sample multiplexing workflows.” Journal of proteome research 18, no. 3 (2019): 1299-1306. Schweppe, Devin K., Jimmy K. Eng, Cling Yu, Derek Bailey, Ramin Rad, Jose Navarrete-Perea, Edward L. Huttlin, Brian K. Erickson, Joao A. Paulo, and Steven P. Gygi. “Full-featured, real-time database searching platform enables fast and accurate multiplexed quantitative proteomics.”Journal of proteome research 19, no. 5 (2020): 2026-2034.) In this sense, the term “real-time” refers to performing such database consultation or algorithmic fragmentation prediction at the same time that a mass spectral experiment is in progress.
Real-time search was introduced commercially along with the launch of the ORBITRAP™ ECLIPSE™ line of mass spectrometer systems by Thermo Fisher Scientific of Waltham, Mass. USA. Real-time library searching was commercially introduced with the ORBITRAP™ IQ.-X™. Following the work of Schweppe et al., the commercial implementation permits users to submit MS2 spectra for real-time searching search by the common open-source peptide database matching program COMET. In a typical workflow employing real-time search, MS2 spectra are submitted to the search engine employing one of two methods: (1) comparison against theoretical spectra computed in real-time; or (2) comparison against previously acquired experimental spectra stored in a library. Typically, the result returned from the search engine results in a pass/fail or “go/no-go” result for a subsequent stage of data-dependent acquisition. Results from the search are typically used to control subsequent data-dependent mass spectral data acquisitions, most typically MS3 data acquisitions, that are triggered only after the real-time search or real-time computation results have been used to make a confident compound identification from an immediately preceding MS2 spectrum. The results of such analyses have shown that such a workflow increases instrument efficiency, resulting in improved performance as measured by the number and quality of quantified proteins in a tandem-mass-tags (TMT) experiment. (Schweppe et al.)
In step 258 of the method 250 (
Either one of the search engine and the theoretical peptide listing may be located either at the site of the analysis instrument or else at a site that is remote relative the analysis instrument. Communications between the site at which the analysis instrument is located and the remote site may be made through the Internet. For example,
In operation, the information routing service 210 waits to receive spectra from the instrument 204 which typically occurs after each execution of step 258 (
The search results 214 include specific m/z values that correspond to MS2 ion species that are to be isolated in step 262 and subsequently fragmented in step 264. For example, the m/z values of the search results 214 may be used to populate a mass spectrometer inclusion list that controls the looping of execution of the method 250 from step 264 back to step 262. The MS2 m/z values that are provided the mass spectrometer instrument 204 in search results 214 correspond to recognized known or potential matches of the MS1 and MS2 spectral data to known compounds in the listing that is being searched, as determined by the search engine 212. Since the confidence levels of the various matches will normally differ between matches, the search results 214 may include confidence scores in addition to the m/z values. These confidence scores may be utilized by the mass spectrometer firmware 206 to prioritize the order of MS2 ion species isolation and fragmentation that occurs during the multiple iterations of steps 262-264.
Each fragmentation that is performed in step 264 generates an MS3 spectrum that corresponds to a respective ion species isolation in step 256. Once all MS2 species of interest that correspond to a particular MS1 peak (e.g., as isolated in step 256) have been isolated (step 262) and fragmented (step 264), then, if the mass spectral signal has not been lost due to the end of elution, execution of the method 250 returns to step 256 whereby a next MS1 peak is isolated and the steps 258-264 are repeated. Otherwise, if the mass spectral signal of an elution peak has been lost but a chromatographic separation has not completed, then execution returns to step 252 and the steps 254-264 are repeated. Otherwise, the method 250 may terminate.
The workflow outlined in the method 250 of
Regardless of whether a theoretical database listing or a mass spectral library is employed for real-time searching, situations may arise in which the returned identification of a mass spectrum is ambiguous or indeterminate. In such cases, the search may fail if scoring criteria are not met for any of the database or library entries, or if only the best-matching database or library entry is returned, even if the scores for the top several matches are all very similar to each other. Accordingly, the present inventors have recognized that it is advantageous to leverage additional information and metadata that may be embedded in the search tools to resolve such ambiguities. The inventors have further recognized that, to further increase the likelihood of obtaining a confident match, it is advantageous to allow real-time modification of data acquisition strategies based on metadata and other extra information that can be included or derived from a mass spectral library or other search engine tools.
This invention proposes embedding information in the search engine—either in a computational algorithm or in a mass spectral library—that would, in the case of ambiguous results or other conditions, not only trigger a new data-dependent scan, but also could change the parameters of the next data-dependent scan. This disclosure thus describes extensions to the conventional real-time search functionality described above that would allow a search engine to package certain kinds of metadata together with returned search results. These metadata would provide the instrument firmware with additional information and/or commands necessary to effectively rewrite the downstream data dependencies and acquire, during analysis of a sample, potentially any number, m, of MSn scans at any number, n, (i.e., as used in the “MSn” designation) of fragmentation levels, where the integers m and n are not known prior to the analysis and wherein the integer n is greater than a default maximum value. It is anticipated that this flexibility will allow the overall method to accommodate numerous and diverse workflows for different compound classes, as well as opening the door for exploration of “intelligent” data acquisition strategies.
According to a first aspect of the present teachings, a method of mass spectral analysis of a sample comprises:
Some embodiments may include determining based on the determined measure of success, whether or not to block execution of one or more previously planned subsequent actions. The determination of whether to execute subsequent actions and/or the determination of whether to block execution of pre-planned actions, based on the measure of success, may be formulated in several different ways. For example, the determination may be based on whether the measure of success is greater than or less than a designated threshold value. In some embodiments, the determination may be made as the result of either a “success” or “failure” designation returned by a searching algorithm. In some embodiments, the determination may be based on whether the search of the mass spectral database returns certain keywords (e.g., names of potentially matching compounds, names of potentially matching classes of compounds, etc.) The determined measure of success may be based on standard statistical measures of confidence in any proposed matches, as may be returned by a searching algorithm.
Some embodiments may include determining to both block certain previously planned actions and to, instead, execute certain replacement actions, based on the determined measure of success. For example, planned fragmentation and subsequent fragment-ion analysis of certain previously targeted ion species may be abandoned in favor of fragmentation and subsequent fragment-ion analyses of other ion species that were not targeted prior to the analysis. Such blocking of planned or anticipated actions in favor of other unplanned replacement actions may occur, for example, when a searching algorithm finds matches to previously unexpected compounds of interest or else when a database search indicates that the fragmentation and fragment-ion analyses of the other ion species may yield improved detection or quantification of the originally targeted ion species. According to some embodiments, the fragmentation and subsequent fragment-ion analysis may be performed on ions that are matched by the search algorithm; according to some other embodiments, the fragmentation and subsequent fragment-ion analysis may be performed on observed ions for which matches were not found by the search algorithm.
In some embodiments, the fragmentation conditions used during fragmentation of the isolated first portion of the primary ion species include application of a first collision energy to the isolated first portion of the selected primary ion species and the fragmentation conditions employed during fragmentation of the isolated second portion of the primary ion species comprise application of a second collision energy, different than the first applied collision energy, to the isolated second portion of the primary ion species. In such instances, the second collision energy may be determined by consultation of the mass spectral database or mass spectral library.
In some embodiments, the fragmentation conditions used during fragmentation of the isolated first portion of the primary ion species comprise a first one of the group of techniques consisting of: resonantly-excited collision-induced dissociation (CID), linearly-accelerated ion beam type collision-induced dissociation (referred to either as “CID” or “HCD”), electron transfer dissociation (ETD), electron capture dissociation (ECD), surface-induced dissociation (SID), Infrared multiple photon dissociation (IRMPD), ultra-violet photo-dissociation (UVPD), etc. and the fragmentation conditions used during fragmentation of the isolated second portion of the primary ion species comprise a second, different one of said group of techniques.
In some embodiments, the one or more altered or replacement procedures comprise: isolating a second portion of the selected primary ion species; and subjecting the isolated second portion of the selected primary ion species to proton transfer reaction (PTR). In such instances, the one or more altered or replacement procedures may further comprise fragmenting the reaction products of the proton transfer reaction.
According to a second aspect of the present teachings, an analysis system is provided, the system comprising:
Accordingly, the present invention extends current capabilities of real-time search to encompass the ability for the real-time information derived from the search engine to direct downstream data acquisition. Such search-directed acquisition has the potential to push further the idea of “intelligent” data acquisition, where instrument efficiency is optimized and less instrument time is spent acquiring data that are either of poor quality or have little use for the experiment at hand.
The above noted and various other aspects of the present invention will become apparent from the following description which is given by way of example only and with reference to the accompanying drawings, not necessarily drawn to scale, in which:
The following description is presented to enable any person skilled in the art to make and use the invention, and is provided in the context of a particular application and its requirements. Various modifications to the described embodiments will be readily apparent to those skilled in the art and the generic principles herein may be applied to other embodiments. Thus, the present invention is not intended to be limited to the embodiments and examples shown but is to be accorded the widest possible scope in accordance with the features and principles shown and described. To fully appreciate the features of the present invention in greater detail, please refer to
In the description of the invention herein, it is understood that a word appearing in the singular encompasses its plural counterpart, and a word appearing in the plural encompasses its singular counterpart, unless implicitly or explicitly understood or stated otherwise. Furthermore, it is understood that, for any given component or embodiment described herein, any of the possible candidates or alternatives listed for that component may generally be used individually or in combination with one another, unless implicitly or explicitly understood or stated otherwise. Moreover, it is to be appreciated that the figures, as shown herein, are not necessarily drawn to scale, wherein some of the elements may be drawn merely for clarity of the invention. Also, reference numerals may be repeated among the various figures to show corresponding or analogous elements. Additionally, it will be understood that any list of candidates or alternatives is merely illustrative, not limiting, unless implicitly or explicitly understood or stated otherwise.
In order to emphasize the specific differences between methods of the present teachings and conventionally employed real-time data-dependent mass spectrometric data acquisition methods, the flow diagram method 350 is organized in parallel with the flow diagram of the conventional method 250. Specifically, in similarity to the flow diagram of the method 250 (
Steps 352, 354, 356 and 358 of the method 350 are similar to the steps 252, 254, 256 and 258 of the method 250 (
In step 360b of the method 350 (
If, in step 360b, the analysis service determines to execute the option (i) of simply forwarding the search results to the mass spectrometer, then execution of the method 350 branches to the default analysis steps 362 and 364, which are similar to the steps 262 and 264 of the conventional method 250. These steps may be executed when the analysis in step 360b determines, with high confidence, that a most recent search was successful and unambiguous. Otherwise, if the analysis service determines to execute either the option (ii) or the option (iii) as described above, then the execution of the method 350 branches to step 361, in which the search results and/or metadata are sent to the mass spectrometer and retrieved thereat. In this case the metadata may include altered instructions or replacement instructions that override the default analysis steps. These altered or replacement instructions are then carried out by the mass spectrometer in step 366. The steps 361 and 366 may be carried out when the analysis in step 360b determines that a most recent search was either unsuccessful or ambiguous.
Generally, the execution of the new instructions in step 366 will cause the mass spectrometer to generate new mass spectral data comprising determined m/z values of new ion species that are generated by the execution of those instructions. This newly generated mass spectral data may, itself, be submitted back to the database or library in a re-iteration of step 360 and, under such circumstances, the newly created search results are themselves analyzed in a re-iteration of step 360b. If the new analysis of the new search results in a successful match between the acquired data and an entry for a compound in the database, then steps 362 and 364 are executed after which the method 350 return to steps 356 and 358 in which a new MS1 peak is selected and isolated, respectively. Otherwise, if the new analysis of the newly generated data fails to produce a confident or unambiguous match, then either: (i) steps 361 and 366 may be executed again with yet new instructions to create yet new mass spectral data that is itself submitted for search, or (ii) if a stopping condition has been reached, then execution of the method 350 may break out of the loop of steps 360-366 with a “fail” indication. For example, the stopping condition may be a maximum permitted number of iterations of the loop of steps 360-366. The following examples illustrate a few of the ways in which the mass spectrometer operations may be altered or replaced during the execution of step 366.
Spectra of many classes of metabolites are present in libraries under many experimental conditions, most notably numerous different values of collision energy. These values may not match what is defined in a user's data acquisition method, and so while an identification might be obtained via real-time spectral library search (step 360), the search results may not satisfy threshold scoring criteria, as determined by the search engine 212 and/or the analysis service module of component 310 (
In some cases, a preset canonical activation method for general mass spectrometry analysis, collision induced dissociation (either beam type or resonance type), may not be effective at all for generating useful information for members of a compound class under study. In such cases, auxiliary mass spectral libraries having additional content, such as mass spectral data pertaining to fragment ions generated from members of the class using ultraviolet photodissociation (UVPD) could be consulted. Then, when and if collision-induced fragmentation does not yield useful information, the analysis service module may send commands 314 to the mass spectrometer that direct the instrument to acquire a fragmentation spectrum (e.g., at step 366) of a same precursor ion species that was isolated in step 358, but using UVPD instead of collision-induced fragmentation. The analysis service module, if present, may then direct the search engine 212 to consult the auxiliary database, instead of the preset database, for matches at the next iteration of step 360. This example could clearly be extended to other activation or reaction types such as electron transfer dissociation (ETD) electron capture dissociation (ECD), Surface-Induced dissociation (SID), Infrared multiple photon dissociation (IRMPD), Proton Transfer Reaction (PTR), etc. that cause fragmentation or other reaction of precursor ions. For example, a second portion of primary ions may be subjected to PTR procedure in order to generate PTR reaction product ions having charge states that are reduced relative to the charge states of the original primary ions. The reduced charge states may more closely correspond to entries in a searchable database or library. The PTR reaction product ions may then be subjected to fragmentation.
In some cases, a user's data acquisition method using real-time search may require, prior to triggering an MS(n+1) analysis to quantify a compound (e.g., step 364, where n+1=3), a high degree of confidence in the identification of the compound as obtained from searching observed MSn peaks (e.g., step 360, where n=2) against a library or database. In the case that the initial search returns ambiguous results, the analysis service module, if present, could survey the available spectra and find that the ambiguity could be resolved by the acquisition of an appropriate additional MS3 spectrum. The information routing and analysis module 310 could then send (at step 361) metadata commands 314 to the mass spectrometer 204 that direct the instrument to acquire the additional spectrum (step 366) before issuing a pass/fail result. In other cases, a sub-par match may need to be confirmed by acquisition of higher-level MSn spectra, such as MS4 mass spectra. Rather than acquiring MS4 spectra of every precursor ion, the search engine can direct the instrument to only acquire MS4-level spectra when it is likely to yield useful information.
The discussion included in this application is intended to serve as a basic description. The present invention is not intended to be limited in scope by the specific embodiments described herein, which are intended as single illustrations of individual aspects of the invention. Functionally equivalent methods and components are within the scope of the invention. Various other modifications of the invention, in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description and accompanying drawings. Any technical publications, patents or patent application publications mentioned herein are hereby incorporated by reference herein in their entirety.
The present application claims, under 35 U.S.C. 119(e), the benefit of the filing date and the right of priority to co-pending U.S. Provisional Application No. 63/273,404, filed on Oct. 29, 2021 and titled “Methods for Modifying Mass Spectral Data Acquisition in Real Time”, the disclosure of which is hereby incorporated by reference herein in its entirety. Additionally, the present application is related to commonly-assigned and co-pending U.S. application Ser. No. 17/825,230, filed May 26, 2022 and titled “Mass Spectrometer Utilizing Mass Spectral Database Search for Compound Identification”, said co-pending application hereby incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
63273404 | Oct 2021 | US |