Compositions and methods for diagnosing and treating inflammatory bowel disease and related disorders

Information

  • Patent Grant
  • 10730916
  • Patent Number
    10,730,916
  • Date Filed
    Friday, May 20, 2016
    8 years ago
  • Date Issued
    Tuesday, August 4, 2020
    3 years ago
Abstract
The present invention features biomarkers capable of diagnosing inflammatory bowel disease and methods of using such biomarkers to diagnose and selecting treatments for inflammatory bowel diseases.
Description
BACKGROUND OF THE INVENTION

Crohn's disease (CD) and ulcerative colitis (UC) are chronic, idiopathic and clinically heterogeneous intestinal disorders collectively known as inflammatory bowel disease (IBD). At present, a combination of clinical, endoscopic and radiological criteria is used to differentiate CD from US. Because the symptoms of Crohn's disease are similar to other intestinal disorders, such as ulcerative colitis, it can be difficult to diagnose. Ulcerative colitis causes inflammation and ulcers in the top layer of the lining of the large intestine. In Crohn's disease, all layers of the intestine may be involved, and normal healthy bowel can be found between sections of diseased bowel. Complications of Crohn's disease include intestinal blockages, which may require surgery, as well as fistulas and fissues. To avoid such complications, it is important to get an accurate diagnosis early in the course of the illness to ensure that appropriate therapies are selected. Current diagnostic methods for inflammatory bowel disease are invasive and patients typically find these tests unpleasant. To improve patient compliance, diagnostic accuracy, and early and appropriate treatment selection, new methods for distinguishing among inflammatory bowel diseases are required.


SUMMARY OF THE INVENTION

As described below, the present invention features biomarkers capable of diagnosing a subject as having inflammatory bowel disease and methods of using such biomarkers to diagnose, monitor and select approprate treatments for said subject.


In one aspect, the invention provides a microchip containing at least about 85, 90, 95 or 100% of the E. coli proteome.


In another aspect, the invention features a microchip containing a set of biomarkers for characterizing an inflammatory bowel disease (IBD) in a subject, where the set is selected from any one or more of E. coli polypeptides delineated herein, pairs and sets of polypeptides features in Tables 2-5, 7, and FIG. 5, or any of (yhcP), (yhhT), (yhiW), 16-3B0, 214#3, 233#6, 273#6, 280#1, 316#4, 321#3, 323#1, 331#2, 356#7, 406#7, 409#5, 411#1, 420#7, 452#13, 610#6.1, aceF, allP, ansP, aqpZ, atoE, brnQ, celD, cobU, codB, cybB, cydB, cydC, dgt, dnaQ, ebgA, emrB, emrD, exuR, fabH, fabZ, fadA, fepD, flhD, glnQ, glpF, gppA, greA, hemY, JW0438, JW1949, lipA, lpxC, malX, malZ, menG, mrdB, murG, mutT, narU, nfrB, nrfE, ompC, oppC, oppF, pbuX, pheP, phsE, pnuC, potC, pssR, ptsH, putP, queA, rfaL, rffG, rocE, rplO, sdhD, secB, sfsA, slyX, sucB, sucD, tauB, thiL, trkH, udk, uidB, virK, yaaH, yabK, yadQ, yaeG, yagG, yagM, yaiV, yajR, ybaN, ybdS, ybfB, ybfC, ybgE, ybhA, ybhL, ybhM, ybhN, ybhR, ycaD, yccY, ycdG, yciQ, yciR, yciS, ydcD, yddH, ydeF, ydeZ, ydfO, ydjS, ydjZ, yeaS, yehK, yehY, yejF, yfjY, ygeD, ygfF, yggH, yghK, yghT, ygjQ, yhaH, yhaO, yhbX, yhcO, yhdM, yhdT, yheG, yheU, yhfU, yhhL, yhhS, yhiP, yhiQ, yhjX, yiaL, yiaQ, yibL, yibQ, yicO, yidY, yifE, yigF, yihG, yjeM, yjfF, yjfP, yjfY, yjhB, ymdD, ynaJ, yneC, yneG, ynjC, yoaA, yohG, yphA, yphG, yqcE, yzgL, rpsK, rpsL, sixA, ycfF, yhdN, yjhA, (gntU), (phnE), (rcsC), (thiS), (ycfA), (yfjV), 221#15, 267#6, 304#1, 319#17, 336#6, 348#4, 405#2, 411#4, 416#1, 430#8, 445#15, 448#2, 633#5, aceB, agaZ, aidA, argB, argC, aroF, aroK, aspA, atpB, baiF, basS, cedA, citB, citG, clpX, cysD, cysJ, cysM, cysW, dgxA, dicC, dinD, ecnB, eno, fadB, fba, fdhE, fecB, fecR, fimB, fimC, fliA, fliS, flxA, folP, frvX, ftn, fumB, gabD, galR, gcpE, glvB, grxA, grxC, gst, hemX, hflX, hisS, hofH, hoxK, hsdM, hycA, hycF, hypC, ibpB, infC, ivbL, lasT, LDR-ABC, LDR-D, lldR, mcrD, metB, metJ, mltB, mviN, narY, nuoE, phbA, phnB, phnG, phnM, ppdB, ptpS, ptsN, purM, radC, rbfA, rbsB, recN, rffD, rpiB, rplP, rplQ, rplT, rpmJ, rpsG, rpsR, selD, slyA, slyD, ssi6, sugE, tdcB, thiF, torA, trmD, ubiG, umuD, vacJ, wcaA, ybbA, ybbQ, ycbF, ycdB, yceD, ycgN, ydfA, ydfD, ydfR, ydhL, ydiL, yedF, yehR, yejG, yejO, yfeC, yfhD, yfiC, yfiD, yfjI, yfjQ, ygbA, ygbD, ygbO, ygcQ, ygeW, ygfY, yhaA, yhaC, yhaN, yhcI, yhcJ, yhfR, yhfV, yhgH, yhgI, yicC, yieF, yieJ, yigK, yihA, yihD, yihK, yiiF, yijD, yijF, yjaI, yjbI, yjbL, yjbQ, yjbR, yjcS, yjeB, yjeJ, yjgF, yjhC, yjhE, yjhF, yjhO, yjiA, yliG, ymfE, yohL, yphC, yrbB, (rtn), cadA, lueO, mesJ, mhpF, modC, murA, nrfF, prpE, purK, tpiA, yciD, yejA, ygcE, ygfQ, yhjC, yjfH, yjiJ, (yeeF), 211#11, 23-12A0, 279#6, 427#1, dgkA, dinI, emrY, focA, folK, fsr, glnD, kch, maoC, msbA, nac, nagE, narI, ppx, prtC, rfaB, secF, secY/prlA, trkG, yafJ, yaiM, ybbC, ycbM, ydaA, ydbD, ydhV, yefI, yeiO, ygjR, yhiN, yjgT, yojl, and/or yphD, or fragments thereof.


In another aspect, the invention features a microchip containing a set of biomarkers for characterizing Chrohn's disease in a subject, where the set is any one or more of E. coli polypeptides rpsK, rpsL, sixA, ycfF, yhdN, yjhA, (gntU), (phnE), (rcsC), (thiS), (ycfA), (yfjV), 221#15, 267#6, 304#1, 319#17, 336#6, 348#4, 405#2, 411#4, 416#1, 430#8, 445#15, 448#2, 633#5, aceB, agaZ, aidA, argB, argC, aroF, aroK, aspA, atpB, baiF, basS, cedA, citB, citG, clpX, cysD, cysJ, cysM, cysW, dgxA, dicC, dinD, ecnB, eno, fadB, fba, fdhE, fecB, fecR, fimB, fimC, fliA, fliS, flxA, folP, frvX, ftn, fumB, gabD, galR, gcpE, glvB, grxA, grxC, gst, hemX, hflX, hisS, hofH, hoxK, hsdM, hycA, hycF, hypC, ibpB, infC, ivbL, lasT, LDR-ABC, LDR-D, lldR, mcrD, metB, metJ, mltB, mviN, narY, nuoE, phbA, phnB, phnG, phnM, ppdB, ptpS, ptsN, purM, radC, rbfA, rbsB, recN, rffD, rpiB, rplP, rplQ, rplT, rpmJ, rpsG, rpsR, selD, slyA, slyD, ssi6, sugE, tdcB, thiF, torA, trmD, ubiG, umuD, vacJ, wcaA, ybbA, ybbQ, ycbF, ycdB, yceD, ycgN, ydfA, ydfD, ydfR, ydhL, ydiL, yedF, yehR, yejG, yejO, yfeC, yfhD, yfiC, yfiD, yfjI, yfjQ, ygbA, ygbD, ygbO, ygcQ, ygeW, ygfY, yhaA, yhaC, yhaN, yhcI, yhcJ, yhfR, yhfV, yhgH, yhgI, yicC, yieF, yieJ, yigK, yihA, yihD, yihK, yiiF, yijD, yijF, yjaI, yjbI, yjbL, yjbQ, yjbR, yjcS, yjeB, yjeJ, yjgF, yjhC, yjhE, yjhF, yjhO, yjiA, yliG, ymfE, yohL, yphC, yrbB, (yhcP), (yhhT), (yhiW), 16-3B0, 214#3, 233#6, 273#6, 280#1, 316#4, 321#3, 323#1, 331#2, 356#7, 406#7, 409#5, 411#1, 420#7, 452#13, 610#6.1, aceF, allP, ansP, aqpZ, atoE, brnQ, celD, cobU, codB, cybB, cydB, cydC, dgt, dnaQ, ebgA, emrB, emrD, exuR, fabH, fabZ, fadA, fepD, flhD, glnQ, glpF, gppA, greA, hemY, JW0438, JW1949, lipA, lpxC, malX, malZ, menG, mrdB, murG, mutT, narU, nfrB, nrfE, ompC, oppC, oppF, pbuX, pheP, phsE, pnuC, potC, pssR, ptsH, putP, queA, rfaL, rffG, rocE, rplO, sdhD, secB, sfsA, slyX, sucB, sucD, tauB, thiL, trkH, udk, uidB, virK, yaaH, yabK, yadQ, yaeG, yagG, yagM, yaiV, yajR, ybaN, ybdS, ybfB, ybfC, ybgE, ybhA, ybhL, ybhM, ybhN, ybhR, ycaD, yccY, ycdG, yciQ, yciR, yciS, ydcD, yddH, ydeF, ydeZ, ydfO, ydjS, ydjZ, yeaS, yehK, yehY, yejF, yfjY, ygeD, ygfF, yggH, yghK, yghT, ygjQ, yhaH, yhaO, yhbX, yhcO, yhdM, yhdT, yheG, yheU, yhfU, yhhL, yhhS, yhiP, yhiQ, yhjX, yiaL, yiaQ, yibL, yibQ, yicO, yidY, yifE, yigF, yihG, yjeM, yjfF, yjfP, yjfY, yjhB, ymdD, ynaJ, yneC, yneG, ynjC, yoaA, yohG, yphA, yphG, yqcE, and/or yzgL, or fragments thereof.


In one embodiment of the above aspect, the set is any one or more of E. coli polypeptides rpsK, rpsL, sixA, ycfF, yhdN, yjhA, (gntU), (phnE), (rcsC), (thiS), (ycfA), (yfjV), 221#15, 267#6, 304#1, 319#17, 336#6, 348#4, 405#2, 411#4, 416#1, 430#8, 445#15, 448#2, 633#5, aceB, agaZ, aidA, argB, argC, aroF, aroK, aspA, atpB, baiF, basS, cedA, citB, citG, clpX, cysD, cysJ, cysM, cysW, dgxA, dicC, dinD, ecnB, eno, fadB, fba, fdhE, fecB, fecR, fimB, fimC, fliA, fliS, flxA, folP, frvX, ftn, fumB, gabD, galR, gcpE, glvB, grxA, grxC, gst, hemX, hflX, hisS, hofH, hoxK, hsdM, hycA, hycF, hypC, ibpB, infC, ivbL, lasT, LDR-ABC, LDR-D, lldR, mcrD, metB, metJ, mltB, mviN, narY, nuoE, phbA, phnB, phnG, phnM, ppdB, ptpS, ptsN, purM, radC, rbfA, rbsB, recN, rffD, rpiB, rplP, rplQ, rplT, rpmJ, rpsG, rpsR, selD, slyA, slyD, ssi6, sugE, tdcB, thiF, torA, trmD, ubiG, umuD, vacJ, wcaA, ybbA, ybbQ, ycbF, ycdB, yceD, ycgN, ydfA, ydfD, ydfR, ydhL, ydiL, yedF, yehR, yejG, yejO, yfeC, yfhD, yfiC, yfiD, yfjI, yfjQ, ygbA, ygbD, ygbO, ygcQ, ygeW, ygfY, yhaA, yhaC, yhaN, yhcI, yhcJ, yhfR, yhfV, yhgH, yhgI, yicC, yieF, yieJ, yigK, yihA, yihD, yihK, yiiF, yijD, yijF, yjaI, yjbI, yjbL, yjbQ, yjbR, yjcS, yjeB, yjeJ, yjgF, yjhC, yjhE, yjhF, yjhO, yjiA, yliG, ymfE, yohL, yphC, and yrbB, or fragments thereof.


In another aspect, the invention features a microchip containing a set of biomarkers for distinguishing Crohn's Disease from ulcerative colitis, the set containing E. coli polypeptides era, ybaN, yhgN, focA, ga bT and ycdG, or fragments thereof.


In another aspect, the invention features a microchip containing a set of biomarkers for diagnosing ulcerative colitis, where the set is any one or more of E. coli polypeptides (rtn), cadA, lueO, mesJ, mhpF, modC, murA, nrfF, prpE, purK, tpiA, yciD, yejA, ygcE, ygfQ, yhjC, yjfH, yjiJ, (yeeF), 211#11, 23-12A0, 279#6, 427#1, dgkA, dinI, emrY, focA, folK, fsr, glnD, kch, maoC, msbA, nac, nagE, narI, ppx, prtC, rfaB, secF, secY/prlA, trkG, yafJ, yaiM, ybbC, ycbM, ydaA, ydbD, ydhV, yefI, yeiO, ygjR, yhiN, yjgT, yojl, yphD, (yhcP), (yhhT), (yhiW), 16-3B0, 214#3, 233#6, 273#6, 280#1, 316#4, 321#3, 323#1, 331#2, 356#7, 406#7, 409#5, 411#1, 420#7, 452#13, 610#6.1, aceF, allP, ansP, aqpZ, atoE, brnQ, celD, cobU, codB, cybB, cydB, cydC, dgt, dnaQ, ebgA, emrB, emrD, exuR, fabH, fabZ, fadA, fepD, flhD, glnQ, glpF, gppA, greA, hemY, JW0438, JW1949, lipA, lpxC, malX, malZ, menG, mrdB, murG, mutT, narU, nfrB, nrfE, ompC, oppC, oppF, pbuX, pheP, phsE, pnuC, potC, pssR, ptsH, putP, queA, rfaL, rffG, rocE, rplO, sdhD, secB, sfsA, slyX, sucB, sucD, tauB, thiL, trkH, udk, uidB, virK, yaaH, yabK, yadQ, yaeG, yagG, yagM, yaiV, yajR, ybaN, ybdS, ybfB, ybfC, ybgE, ybhA, ybhL, ybhM, ybhN, ybhR, ycaD, yccY, ycdG, yciQ, yciR, yciS, ydcD, yddH, ydeF, ydeZ, ydfO, ydjS, ydjZ, yeaS, yehK, yehY, yejF, yfjY, ygeD, ygfF, yggH, yghK, yghT, ygjQ, yhaH, yhaO, yhbX, yhcO, yhdM, yhdT, yheG, yheU, yhfU, yhhL, yhhS, yhiP, yhiQ, yhjX, yiaL, yiaQ, yibL, yibQ, yicO, yidY, yifE, yigF, yihG, yjeM, yjfF, yjfP, yjfY, yjhB, ymdD, ynaJ, yneC, yneG, ynjC, yoaA, yohG, yphA, yphG, yqcE, and yzgL, or fragments thereof. In one embodiment, the set of biomarkers is any one or more of (rtn), cadA, lueO, mesJ, mhpF, modC, murA, nrfF, prpE, purK, tpiA, yciD, yejA, ygcE, ygfQ, yhjC, yjfH, yjiJ, (yeeF), 211#11, 23-12A0, 279#6, 427#1, dgkA, dinI, emrY, focA, folK, fsr, glnD, kch, maoC, msbA, nac, nagE, narI, ppx, prtC, rfaB, secF, secY/prlA, trkG, yafJ, yaiM, ybbC, ycbM, ydaA, ydbD, ydhV, yefI, yeiO, ygjR, yhiN, yjgT, yojl, and yphD, or fragments thereof.


In yet another aspect, the invention features a microchip containing a set of biomarkers for distinguishing Chrohn's disease from ulcerative colitis, the chip containing a set of biomarkers that is any one or more of rpsK, rpsL, sixA, ycfF, yhdN, yjhA, (gntU), (phnE), (rcsC), (thiS), (ycfA), (yfjV), 221#15, 267#6, 304#1, 319#17, 336#6, 348#4, 405#2, 411#4, 416#1, 430#8, 445#15, 448#2, 633#5, aceB, agaZ, aidA, argB, argC, aroF, aroK, aspA, atpB, baiF, basS, cedA, citB, citG, clpX, cysD, cysJ, cysM, cysW, dgxA, dicC, dinD, ecnB, eno, fadB, fba, fdhE, fecB, fecR, fimB, fimC, fliA, fliS, flxA, folP, frvX, ftn, fumB, gabD, galR, gcpE, glvB, grxA, grxC, gst, hemX, hflX, hisS, hofH, hoxK, hsdM, hycA, hycF, hypC, ibpB, infC, ivbL, lasT, LDR-ABC, LDR-D, lldR, mcrD, metB, metJ, mltB, mviN, narY, nuoE, phbA, phnB, phnG, phnM, ppdB, ptpS, ptsN, purM, radC, rbfA, rbsB, recN, rffD, rpiB, rplP, rplQ, rplT, rpmJ, rpsG, rpsR, selD, slyA, slyD, ssi6, sugE, tdcB, thiF, torA, trmD, ubiG, umuD, vacJ, wcaA, ybbA, ybbQ, ycbF, ycdB, yceD, ycgN, ydfA, ydfD, ydfR, ydhL, ydiL, yedF, yehR, yejG, yejO, yfeC, yfhD, yfiC, yfiD, yfjI, yfjQ, ygbA, ygbD, ygbO, ygcQ, ygeW, ygfY, yhaA, yhaC, yhaN, yhcI, yhcJ, yhfR, yhfV, yhgH, yhgI, yicC, yieF, yieJ, yigK, yihA, yihD, yihK, yiiF, yijD, yijF, yjaI, yjbI, yjbL, yjbQ, yjbR, yjcS, yjeB, yjeJ, yjgF, yjhC, yjhE, yjhF, yjhO, yjiA, yliG, ymfE, yohL, yphC, yrbB, (rtn), cadA, lueO, mesJ, mhpF, modC, murA, nrfF, prpE, purK, tpiA, yciD, yejA, ygcE, ygfQ, yhjC, yjfH, yjiJ, (yeeF), 211#11, 23-12A0, 279#6, 427#1, dgkA, dinI, emrY, focA, folK, fsr, glnD, kch, maoC, msbA, nac, nagE, narI, ppx, prtC, rfaB, secF, secY/prlA, trkG, yafJ, yaiM, ybbC, ycbM, ydaA, ydbD, ydhV, yefI, yeiO, ygjR, yhiN, yjgT, yojl, and yphD, or fragments thereof.


In various embodiments of the above aspects, the microarrays further contain one or more biomarkers any one or more of antibodies that specifically bind chitobioside IgA (ACCA), laminaribioside IgG (ALCA), manobioside IgG (AMCA), Man α-1,3 Man α-1,2 Man (ΣMan3), Man α-1,3 Man α-1,2 Man α-1,2 Man (ΣMan4) pANCA, antineutrophil cytoplasmic antibody, yeast oligoinanna, Saccharomyces cerevisiae, RSCA, bacterial outer membrane porin C (OmpC), Pseudomonas fluorescens bacterial sequence I2, and bacterial flagellin (Cbir).


In one embodiment of the above aspects, the E. coli polypeptide induces an imm response when injected into a subject.


In still another aspect, the invention features a microchip containing a polypeptide, polypeptide set, or polypeptide pair delineated in any of Tables 2-5, 7, or FIG. 5.


In another aspect, the invention features a method for characterizing a condition associated with a dysregulated immune response to a polypeptide in a subject (e.g., human), the method involving contacting a microarray containing a set of polypeptides with sera derived from the subject, and detecting differential antibody binding to a polypeptide on the microarray in the subject sera relative to a control, where detection of differential antibody binding identifies the subject as having a condition associated with a dysregulated immune response to a.


In another aspect, the invention features a method for diagnosing a subject as having or having a propensity to develop inflammatory bowel disease, the method involving contacting an array containing a set of E. coli polypeptides with sera derived from the subject, and detecting differential antibody binding to the polypeptide on the array in the subject sera relative to a healthy control, thereby characterizing inflammatory bowel disease in the subject. In one embodiment, antibodies that bind an E. coli polypeptide are any one or more of (yhcP), (yhhT), (yhiW), 16-3B0, 214#3, 233#6, 273#6, 280#1, 316#4, 321#3, 323#1, 331#2, 356#7, 406#7, 409#5, 411#1, 420#7, 452#13, 610#6.1, aceF, allP, ansP, aqpZ, atoE, brnQ, celD, cobU, codB, cybB, cydB, cydC, dgt, dnaQ, ebgA, emrB, emrD, exuR, fabH, fabZ, fadA, fepD, flhD, glnQ, glpF, gppA, greA, hemY, JW0438, JW1949, lipA, lpxC, malX, malZ, menG, mrdB, murG, mutT, narU, nfrB, nrfE, ompC, oppC, oppF, pbuX, pheP, phsE, pnuC, potC, pssR, ptsH, putP, queA, rfaL, rffG, rocE, rplO, sdhD, secB, sfsA, slyX, sucB, sucD, tauB, thiL, trkH, udk, uidB, virK, yaaH, yabK, yadQ, yaeG, yagG, yagM, yaiV, yajR, ybaN, ybdS, ybfB, ybfC, ybgE, ybhA, ybhL, ybhM, ybhN, ybhR, ycaD, yccY, ycdG, yciQ, yciR, yciS, ydcD, yddH, ydeF, ydeZ, ydfO, ydjS, ydjZ, yeaS, yehK, yehY, yejF, yfjY, ygeD, ygfF, yggH, yghK, yghT, ygjQ, yhaH, yhaO, yhbX, yhcO, yhdM, yhdT, yheG, yheU, yhfU, yhhL, yhhS, yhiP, yhiQ, yhjX, yiaL, yiaQ, yibL, yibQ, yicO, yidY, yifE, yigF, yihG, yjeM, yjfF, yjfP, yjfY, yjhB, ymdD, ynaJ, yneC, yneG, ynjC, yoaA, yohG, yphA, yphG, yqcE, and yzgL. In another embodiment, antibodies that bind an E. coli polypeptide are any one or more of rpsK, rpsL, sixA, ycfF, yhdN, yjhA, (gntU), (phnE), (rcsC), (thiS), (ycfA), (yfjV), 221#15, 267#6, 304#1, 319#17, 336#6, 348#4, 405#2, 411#4, 416#1, 430#8, 445#15, 448#2, 633#5, aceB, agaZ, aidA, argB, argC, aroF, aroK, aspA, atpB, baiF, basS, cedA, citB, citG, clpX, cysD, cysJ, cysM, cysW, dgxA, dicC, dinD, ecnB, eno, fadB, fba, fdhE, fecB, fecR, fimB, fimC, fliA, fliS, flxA, folP, frvX, ftn, fumB, gabD, galR, gcpE, glvB, grxA, grxC, gst, hemX, hflX, hisS, hofH, hoxK, hsdM, hycA, hycF, hypC, ibpB, infC, ivbL, lasT, LDR-ABC, LDR-D, lldR, mcrD, metB, metJ, mltB, mviN, narY, nuoE, phbA, phnB, phnG, phnM, ppdB, ptpS, ptsN, purM, radC, rbfA, rbsB, recN, rffD, rpiB, rplP, rplQ, rplT, rpmJ, rpsG, rpsR, selD, slyA, slyD, ssi6, sugE, tdcB, thiF, torA, trmD, ubiG, umuD, vacJ, wcaA, ybbA, ybbQ, ycbF, ycdB, yceD, ycgN, ydfA, ydfD, ydfR, ydhL, ydiL, yedF, yehR, yejG, yejO, yfeC, yfhD, yfiC, yfiD, yfjI, yfjQ, ygbA, ygbD, ygbO, ygcQ, ygeW, ygfY, yhaA, yhaC, yhaN, yhcI, yhcJ, yhfR, yhfV, yhgH, yhgI, yicC, yieF, yieJ, yigK, yihA, yihD, yihK, yiiF, yijD, yijF, yjaI, yjbI, yjbL, yjbQ, yjbR, yjcS, yjeB, yjeJ, yjgF, yjhC, yjhE, yjhF, yjhO, yjiA, yliG, ymfE, yohL, yphC, and yrbB. In yet another embodiment, antibodies that bind an E. coli polypeptide are any one or more of (rtn), cadA, lueO, mesJ, mhpF, modC, murA, nrfF, prpE, purK, tpiA, yciD, yejA, ygcE, ygfQ, yhjC, yjfH, yjiJ, (yeeF), 211#11, 23-12A0, 279#6, 427#1, dgkA, dinI, emrY, focA, folK, fsr, glnD, kch, maoC, msbA, nac, nagE, narI, ppx, prtC, rfaB, secF, secY/prlA, trkG, yafJ, yaiM, ybbC, ycbM, ydaA, ydbD, ydhV, yefI, yeiO, ygjR, yhiN, yjgT, yojl, and yphD. In one embodiment, an increase in levels of antibodies that specifically bind one or more E. coli polypeptides any one or more of rpsK, rpsL, sixA, ycfF, yhdN, yjhA, (gntU), (phnE), (rcsC), (thiS), (ycfA), (yfjV), 221#15, 267#6, 304#1, 319#17, 336#6, 348#4, 405#2, 411#4, 416#1, 430#8, 445#15, 448#2, 633#5, aceB, agaZ, aidA, argB, argC, aroF, aroK, aspA, atpB, baiF, basS, cedA, citB, citG, clpX, cysD, cysJ, cysM, cysW, dgxA, dicC, dinD, ecnB, eno, fadB, fba, fdhE, fecB, fecR, fimB, fimC, fliA, fliS, flxA, folP, frvX, ftn, fumB, gabD, galR, gcpE, glvB, grxA, grxC, gst, hemX, hflX, hisS, hofH, hoxK, hsdM, hycA, hycF, hypC, ibpB, infC, ivbL, lasT, LDR-ABC, LDR-D, lldR, mcrD, metB, metJ, mltB, mviN, narY, nuoE, phbA, phnB, phnG, phnM, ppdB, ptpS, ptsN, purM, radC, rbfA, rbsB, recN, rffD, rpiB, rplP, rplQ, rplT, rpmJ, rpsG, rpsR, selD, slyA, slyD, ssi6, sugE, tdcB, thiF, torA, trmD, ubiG, umuD, vacJ, wcaA, ybbA, ybbQ, ycbF, ycdB, yceD, ycgN, ydfA, ydfD, ydfR, ydhL, ydiL, yedF, yehR, yejG, yejO, yfeC, yfhD, yfiC, yfiD, yfjI, yfjQ, ygbA, ygbD, ygbO, ygcQ, ygeW, ygfY, yhaA, yhaC, yhaN, yhcI, yhcJ, yhfR, yhfV, yhgH, yhgI, yicC, yieF, yieJ, yigK, yihA, yihD, yihK, yiiF, yijD, yijF, yjaI, yjbI, yjbL, yjbQ, yjbR, yjcS, yjeB, yjeJ, yjgF, yjhC, yjhE, yjhF, yjhO, yjiA, yliG, ymfE, yohL, yphC, and yrbB identifies the subject as having Crohn's disease.


In various embodiments of the above aspects, an increase in levels of antibodies that specifically bind one or more E. coli polypeptides any one or more of (rtn), cadA, lueO, mesJ, mhpF, modC, murA, nrfF, prpE, purK, tpiA, yciD, yejA, ygcE, ygfQ, yhjC, yjfH, yjiJ, (yeeF), 211#11, 23-12A0, 279#6, 427#1, dgkA, dinI, emrY, focA, folK, fsr, glnD, kch, maoC, msbA, nac, nagE, narI, ppx, prtC, rfaB, secF, secY/prlA, trkG, yafJ, yaiM, ybbC, ycbM, ydaA, ydbD, ydhV, yefI, yeiO, ygjR, yhiN, yjgT, yojl, and yphD identifies the subject as having ulcerative colitis. In other embodiments, an increase in levels of antibodies that specifically bind one or more E. coli polypeptides any one or more of (yhcP), (yhhT), (yhiW), 16-3B0, 214#3, 233#6, 273#6, 280#1, 316#4, 321#3, 323#1, 331#2, 356#7, 406#7, 409#5, 411#1, 420#7, 452#13, 610#6.1, aceF, allP, ansP, aqpZ, atoE, brnQ, celD, cobU, codB, cybB, cydB, cydC, dgt, dnaQ, ebgA, emrB, emrD, exuR, fabH, fabZ, fadA, fepD, flhD, glnQ, glpF, gppA, greA, hemY, JW0438, JW1949, lipA, lpxC, malX, malZ, menG, mrdB, murG, mutT, narU, nfrB, nrfE, ompC, oppC, oppF, pbuX, pheP, phsE, pnuC, potC, pssR, ptsH, putP, queA, rfaL, rffG, rocE, rplO, sdhD, secB, sfsA, slyX, sucB, sucD, tauB, thiL, trkH, udk, uidB, virK, yaaH, yabK, yadQ, yaeG, yagG, yagM, yaiV, yajR, ybaN, ybdS, ybfB, ybfC, ybgE, ybhA, ybhL, ybhM, ybhN, ybhR, ycaD, yccY, ycdG, yciQ, yciR, yciS, ydcD, yddH, ydeF, ydeZ, ydfO, ydjS, ydjZ, yeaS, yehK, yehY, yejF, yfjY, ygeD, ygfF, yggH, yghK, yghT, ygjQ, yhaH, yhaO, yhbX, yhcO, yhdM, yhdT, yheG, yheU, yhfU, yhhL, yhhS, yhiP, yhiQ, yhjX, yiaL, yiaQ, yibL, yibQ, yicO, yidY, yifE, yigF, yihG, yjeM, yjfF, yjfP, yjfY, yjhB, ymdD, ynaJ, yneC, yneG, ynjC, yoaA, yohG, yphA, yphG, yqcE, and yzgL identifies the subject as a healthy control.


In another aspect, the invention features a method for diagnosing Crohn's disease, the method involving contacting an array containing a set of E. coli polypeptides with sera derived from the subject, and detecting in said subect's sera greater immunogenic reactivity to era than to ybaN, greater immunogenic reactivity to yhgN than to focA, and/or greater immunogenic reactivity to gabT than to ycdG, thereby identifying the subject as having Crohn's Disease.


In another aspect, the invention features a method for diagnosing ulcerative colitis in a subject, the method involving contacting an array containing a set of E. coli polypeptides with sera derived from the subject, and detecting in said subject's sera greater immunogenic reactivity to relE>cysE/wcaB, pyrI>yjgK, lnt>ybiO, ftsE>pssR, yhgN>yhfG, yafN>dsbB, yihI>yabK, 421#15>yhdN, hisP>rplO, cml>nuoM, yieC>nuol, thereby identifying the subject as having ulcerative colitis.


In another aspect, the invention features a method for diagnosing ulcerative colitis, the method involving contacting an array involving a set of E. coli polypeptides with sera derived from the subject, and detecting in said subect's sera greater immunogenic reactivity to frvX than to yidX identifies a subject as having ulcerative colitis.


In another aspect, the invention features a method for selecting an appropriate treatment for a subject, the method involving contacting a microarray delineated herein with subject sera and detecting binding to a polypeptide that identifies the subject as having inflammatory bowel disease, thereby indicating that inflammatory bowel disease therapy is appropriate for said subject. In one embodiment, the subject is identified as having Crohn's disease or ulcerative colitis.


In another aspect, the invention features a method for selecting surgery for a subject, the method involving contacting a microarray of any of claims 1-11 with subject sera and detecting binding to a polypeptide that identifies the subject as having inflammatory bowel disease, thereby indicating that surgery is appropriate for said subject. In one embodiment, the method detects greater immunogenic reactivity to era than to ybaN, greater immunogenic reactivity to yhgN than to focA, and/or greater immunogenic reactivity to gabT than to ycdG. In another embodiment, the method detects relE>cysE/wcaB, pyrI>yjgK, lnt>ybiO, ftsE>pssR, yhgN>yhfG, yafN>dsbB, yihI>yabK, 421#15>yhdN, hisP>rplO, cml>nuoM, and/or yieC>nuol.


In embodiments of the previous aspects, the method further involves detecting an antibody that specifically binds any one or more of chitobioside IgA (ACCA), laminaribioside IgG (ALCA), manobioside IgG (AMCA), Man α-1,3 Man α-1,2 Man (ΣMan3), Man α-1,3 Man α-1,2 Man α-1,2 Man (ΣMan4) pANCA, antineutrophil cytoplasmic antibody, yeast oligomanna, Saccharomyces cerevisiae, ASCA, bacterial outer membrane p rin C (OmpC), Pseudomonas fluorescens bacterial sequence 12, and bacterial flagellin.


In another aspect, the invention features a method for selecting an appropriate treatment method for a subject, the method involving contacting a microarray delineated herein with subject sera and detecting binding to a polypeptide that identifies the subject as not having inflammatory bowel disease, thereby indicating that inflammatory bowel disease therapy is not appropriate for said subject.


In another aspect, the invention features a method for monitoring the condition of a subject having Crohn's disease, the method involving contacting an array containing a set of E. coli polypeptides with sera derived from the subject, and detecting in said subect's sera immunogenic reactivity to era relative to ybaN, immunogenic reactivity to yhgN relative to focA, and immunogenic reactivity to gabT relative to ycdG, where a reduction in said immunogenic reactivity identifies an improvement in the subject's condition, and an increase in said immunogenic reactivity identifies a worsening in the subject's condition.


In yet another aspect, the invention features a method for monitoring the condition of a subject having ulcerative colitis, the method involving contacting an array containing a set of E. coli polypeptides with sera derived from the subject, and detecting in said subject's sera immunogenic reactivity to frvX relative to yidX, where a reduction in said immunogenic reactivity identifies an improvement in the subject's condition, and an increase in said immunogenic reactivity identifies a worsening in the subject's condition.


In another aspect, the invention features a method for determining whether a therapy is efficacious for a subject, the method involving contacting a microarray of any previous aspect with subject sera collected at a first time and detecting binding to a polypeptide that identifies the subject as having inflammatory bowel disease, and contacting a microarray of any previous aspect with subject sera collected at a second time and detecting binding to a polypeptide that identifies the subject as having inflammatory bowel disease, where detection of a reduction in binding at the second time relative to the first indicates that said therapy is efficacious and a failure to detect a reduction in binding indicates that said therapy is not efficacious.


In another aspect, the invention features a kit containing a microarray of any previous aspect, and instructions for use of the array in diagnosing inflammatory bowel disease, Crohn's disease, or ulcerative colitis.


In various embodiments of the above aspect, or any other method delineated herein, binding is detected in an immunoassay (e.g., ELISA). In other embodiments of the above aspects, the control is a healthy subject, a subject with Crohn's disease, or a subject with ulcerative colitis. In other embodiments of the above aspects, the detecting is of differential binding of a pair of antibodies to a pair of polypeptides on the array (i.e., comparing binding of one antibody to one polypeptide relative to the binding of the other antibody to the other polypeptide). In other embodiments of the above aspects, the array comprises cell wall polypeptides, intracellular polypeptides, and macromolecular complex polypeptides. In other embodiments, an increase in subject antibody binding to a polypeptide relative to healthy control antibody binding identifies the polypeptide as immunogenic in subjects having a dysregulated immune response to the polypeptide. In still other embodiments, the condition is any one or more of inflammatory bowel disease, Crohn's disease, ulcerative colitis, and indeterminate colitis. In various embodiments of the above aspects, the array comprises at least about 85, 90, 95, or 100% of the E. coli proteome. In still other embodiments, the polypeptides are differentially immunogenic in healthy controls, Crohn's disease, and/or ulcerative colitis. In still other embodiments, the method further involves detecting an antibody that specifically binds any one or more of chitobioside IgA (ACCA), laminaribioside IgG (ALCA), manobioside IgG (AMCA), Man α-1,3 Man α-1,2 Man (ΣMan3), Man α-1,3 Man α-1,2 Man α-1,2 Man (ΣMan4) pANCA, antineutrophil cytoplasmic antibody, yeast oligomanna, Saccharomyces cerevisiae, ASCA, bacterial outer membrane porin C (OmpC), Pseudomonas fluorescens bacterial sequence I2, and bacterial flagellin. In still other embodiments, the methods further involve stool sample analysis, colonoscopy, sigmoidoscopy, barium x-ray, computerized axial tomography, and/or capsule endoscopy. In still other embodiments, the method identifies the subject as a healthy control or as not having Crohn's disease or ulcerative colitis. In other embodiments of the above aspects, the invention features a microchip containing a polypeptide, polypeptide set, or polypeptide pair delineated in any of Tables 2-5, 7, or FIG. 5; accordingly, the invention further provides for the detection of differential immunogenicity between pairs of polypeptides or sets of polypeptides relative to a control (e.g., healthy control, UC, or CD)


The invention provides compositions and methods useful for the diagnosis of inflammatory bowel diseases, including distinguishing Crohn's disease from healthy controls and ulcerative colitis. Compositions and articles defined by the invention were isolated or otherwise manufactured in connection with the examples provided below. Other features and advantages of the invention will be apparent from the detailed description, and from the claims.


Definitions

By “inflammatory bowel disease” is meant a disease characterized by inflammation of the small and/or large intestines.


By “Crohn's disease” is meant an inflammatory bowel disease characterized by chronic inflammation of the gastrointestinal tract.


By “ulcerative colitis” is meant an inflammatory bowel disease characterized by inflammation of the rectum and/or large intestine.


By “agent” is meant any small molecule chemical compound, antibody, nucleic acid molecule, or polypeptide, or fragments thereof.


By “ameliorate” is meant decrease, suppress, attenuate, diminish, arrest, or stabilize the development or progression of a disease.


By “alteration” is meant a change (increase or decrease) in the expression levels or activity of a gene or polypeptide as detected by standard art known methods such as those described herein. As used herein, an alteration includes a 10%-100% change in expression levels (e.g., 10, 20, 30, 40, 50, 60, 75, 80, 85, 90, 95, 100%) change in expression levels.


By “derived from” is meant isolated from or having the sequence of a naturally-occurring sequence (e.g., a cDNA, genomic DNA, synthetic, or combination thereof).


By “microarray” is meant an organized collection of at least two proteins or polypeptides affixed to a solid support. In some embodiments, a polypeptide microarray contains at least a polypeptide or fragment thereof (e.g., 10, 20, 30, 40, 50, 75, or 100 amino acids) listed in any of FIG. 5 and Tables 2-5, and 7. A microarray contains at least 2, 5, 10, 25, 50, 75, 100, 150, 200, 250, or 300 polypeptide or nucleic acid molecule members. Frequently, the surface of the microarray comprises a plurality of addressable locations, each of which location has the adsorbent bound there.


By “biomarker” is meant a polypeptide, polynucleotide, or other molecule that is altered in level or activity in a disease state relative to the level or activity present in a healthy control, or from one disease type (such as Crohn's) from another (such as UC). In one embodiment, a biomarker is a polypeptide that is differentially immunogenic, i.e., that induces an immune response that differs between healthy control subjects and subjects having a disease or disorder. In another embodiment, a biomarker is a serum antibody that binds to a polypeptide where the serum antibody is differentially present in a subject having a disease or disorder relative to a healthy control subject or a subject not having the disease or disorder.


In this disclosure, “comprises,” “comprising,” “containing” and “having” and the like can have the meaning ascribed to them in U.S. Patent law and can mean “includes,” “including,” and the like; “consisting essentially of” or “consists essentially” likewise has the meaning ascribed in U.S. Patent law and the term is open-ended, allowing for the presence of more than that which is recited so long as basic or novel characteristics of that which is recited is not changed by the presence of more than that which is recited, but excludes prior art embodiments.


“Detect” refers to identifying the presence, absence or amount of an analyte to be detected.


By “detectable label” is meant a composition that when linked to a molecule of interest renders the latter detectable, via spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For example, useful labels include radioactive isotopes, magnetic beads, metallic beads, colloidal particles, fluorescent dyes, electron-dense reagents, enzymes (for example, as commonly used in an ELISA), biotin, digoxigenin, or haptens.


“Diagnostic” means identifying the presence or nature of a pathologic condition. While a particular diagnostic method may not provide a definitive diagnosis of a condition, it suffices if the method provides a positive indication that aids in diagnosis.


By “differentially immunogenic” is meant that a polypeptide induces an altered immune response in a subject having a disease relative to the immune response that the polypeptide induces in a healthy control or a subject not having the disease, or a subject having one type of disease (such as CD) relative to a subject having another disease (such as UC) or vice versa. This difference may be either an increase or a decrease in immune response when compared to control conditions. Preferably, the increase or decrease is at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or even 100%.


By “disease” is meant any condition or disorder that damages or interferes with the normal function of a cell, tissue, or organ.


The invention provides a number of targets that are useful for the development of highly specific drugs to treat inflammatory bowel disease characterized by the methods delineated herein. In addition, the methods of the invention provide a facile means to identify therapies that are safe for use in eukaryotic host organisms. In addition, the methods of the invention provide a route for analyzing virtually any number of compounds for effects on a disease described herein with high-volume throughput, high sensitivity, and low complexity.


By “dysregulated immune response to a pathogen” is meant an excessive or undesirable immune response that causes cell, tissue or organ damage.


By “E. coli polypeptide” is meant a protein that naturally occurs in E. coli. Such polypeptides are available in Genbank or in the E. coli genome and proteome database.


By “fragment” is meant a portion of a polypeptide. This portion contains, preferably, at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% of the entire length of the reference polypeptide. A fragment may contain 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100, 200, 300, 400, 500, 600, 700, 800, 900, or 1000 amino acids.


By “function” is meant any biological activity of a polypeptide or polynucleotide. In one embodiment, a polypeptide is an antibody. In another embodiment, a biological activity is immunogenicity.


As used herein, the terms “gene” and “recombinant gene” refer to nucleic acid molecules comprising an open reading frame encoding a marker protein.


By “immunological assay” is meant an assay that relies on an immunological reaction, for example, antibody binding to an antigen. Examples of immunological assays include ELISAs, Western blots, immunoprecipitations, and other assays known to the skilled artisan.


By “polypeptide” is meant any chain of amino acids, regardless of length or post-translational modification (for example, glycosylation or phosphorylation). In one example, an antibody is a polypeptide.


By an “isolated polypeptide” is meant a polypeptide of the invention that has been separated from components that naturally accompany it. Typically, the polypeptide is isolated when it is at least 60%, by weight, free from the proteins and naturally occurring organic molecules with which it is naturally associated. Preferably, the preparation is at least 75%, more preferably at least 90%, and most preferably at least 99%, by weight, a polypeptide of the invention. An isolated polypeptide of the invention may be obtained, for example, by extraction from a natural source, by expression of a recombinant nucleic acid encoding such a polypeptide; or by chemically synthesizing the protein. Purity can be measured by any appropriate method, for example, column chromatography, polyacrylamide gel electrophoresis, or by HPLC analysis.


By “microarray” is meant an organized collection of at least two polypeptides, polynucleotides, or fragments thereof affixed to a solid support. A polypeptide microarray contains one or more polypeptides (e.g., 10, 20, 30, 40, 50, 75, or 100 amino acids) delineated herein. A microarray contains at least 1, 2, 3, 4, 5, 6 polypeptide or nucleic acid molecules delineated herein.


“Monitoring” refers to recording changes in a varying parameter (e.g. monitoring progression of a disease).


As used herein, “obtaining” as in “obtaining an agent” includes synthesizing, purchasing, or otherwise acquiring the agent.


By “pathogen” is meant a bacteria, mycobacteria, fungi (including yeast), virus, or other microbe associated with disease. Exemplary pathogen's include various E. coli strains, C. difficle, B. fragilis, E. coli LF-82 and H. hepaticus, all of which have been demonstrated to be pathogenic to IBD. In certain embodiments, the term pathogen is applied to microbes that are not typically associated with disease in healthy individuals, but that are associated with disease in individuals having a dysregulated immune response (e.g., E. coli K-12 in Crohn's disease and ulcerative colitis).


By “portion” is meant a fragment of a protein or nucleic acid that is substantially identical to a reference protein or nucleic acid. In some embodiments the portion retains at least 50% 75%, or 80%, or more preferably 90%, 95%, or even 99% of the biological activity of the reference protein or nucleic acid described herein.


By “reduces” is meant a negative alteration of at least 10%, 25%, 50%, 75%, or 100%.


By “reference” is meant a standard or control condition.


As used herein, “sample” or “biological sample” refers to anything, which may contain an analyte (e.g., polypeptide, polynucleotide, or fragment thereof) for which an analyte assay is desired. The sample may be a biological sample, such as a biological fluid or a biological tissue. Examples of biological fluids include urine, blood, plasma, serum, saliva, semen, stool, sputum, cerebral spinal fluid, tears, mucus, amniotic fluid or the like. In one embodiment, a biological sample is blood, plasma or serum.


By “a set” is meant a group having more than one member. The set may be composed of 2, 4, 5, 8, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 200, 250, or 300 polypeptide, nucleic acid molecule, or chemical compound members.


As used herein, the term “sensitivity” is the percentage of marker-detected subjects with a particular disease.


By “specifically binds” is meant an agent (e.g., antibody) which recognizes and binds a polypeptide of the invention, but that does not substantially recognize and bind other molecules.


As used herein, the term “specificity” is the percentage of subjects correctly identified as having a particular disease. For example, the specificity is calculated as the number of subjects with a particular disease as compared to normal healthy subjects.


By “subject” is meant a mammal, including, but not limited to, a human or non-human mammal, such as a bovine, equine, canine, ovine, rodent, or feline.


By “specifically binds” is meant a compound or antibody that recognizes and binds a polypeptide of the invention, but which does not substantially recognize and bind other molecules in a sample, for example, a biological sample, which naturally includes a polypeptide of the invention.


As used herein, “obtaining” as in “obtaining an agent” includes synthesizing, purchasing, or otherwise acquiring the agent.


Ranges provided herein are understood to be shorthand for all of the values within the range. For example, a range of 1 to 50 is understood to include any number, combination of numbers, or sub-range from the group consisting 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50.


As used herein, the terms “treat,” treating,” “treatment,” and the like refer to reducing or ameliorating a disorder and/or symptoms associated therewith. It will be appreciated that, although not precluded, treating a disorder or condition does not require that the disorder, condition or symptoms associated therewith be completely eliminated.


Unless specifically stated or obvious from context, as used herein, the term “or” is understood to be inclusive. Unless specifically stated or obvious from context, as used herein, the terms “a”, “an”, and “the” are understood to be singular or plural.


Unless specifically stated or obvious from context, as used herein, the term “about” is understood as within a range of normal tolerance in the art, for example within 2 standard deviations of the mean. About can be understood as within 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1%, 0.05%, or 0.01% of the stated value. Unless otherwise clear from context, all numerical values provided herein are modified by the term about.


The recitation of a listing of chemical groups in any definition of a variable herein includes definitions of that variable as any single group or combination of listed groups. The recitation of an embodiment for a variable or aspect herein includes that embodiment as any single embodiment or in combination with any other embodiments or portions thereof.


Any compositions or methods provided herein can be combined with one or more of any of the other compositions and methods provided herein.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a schematic diagram showing the overall strategy used for the identification of novel serological biomarkers for inflammatory bowel disease using E. coli whole proteome chip. To fabricate the whole proteome chip, >4,000 E. coli proteins were cloned and expressed. These proteins were purified using high-throughput protein purification protocol and printed onto FullMoon slides using a ChipWriter Pro robot. 134 patient sera were collected from the Johns Hopkins Hospital for this analysis. These sera were screen by E. coli proteome chips. Two-level of data analyses were performed: (i) global IBD analysis was performed to identify differentially immunogenic proteins in healthy control, CD and ulcerative colitis using Significance Analysis of Microarray (SAM) and Gene Ontology (GO) enrichment analysis; and (ii) serological IBD biomarkers discovery using k-TSP algorithm.



FIGS. 2A and 2B show representative images of E. coli proteome chips and a scatter plot, respectively. The proteome chip in FIG. 2A was probed with sera from Crohn's Disease patients and healthy controls, respectively. Two E. coli proteome chips probed with sera from a Crohn's Disease (CD) patient (left panel) and a healthy control (HC) (right panel). To identify the proteins that can be recognized by reactive serum antibodies, each E. coli protein chip was incubated with a serum from healthy control or Crohn's Disease, as illustrated in FIG. 1. Cy3-labeled anti-human immunoglobulin antibodies were then probed on the chips, allowing visualization of immunoreactive protein spots. The immunogenic profiles of both the IBD patients and healthy control were acquired by the resulting fluorescent signals. Green spots are spots of E. coli protein in the chips detected by serum antibodies, representing immunogenic reactions. The intensity of the protein spots reflects immunogenicity of the proteins. Middle panel shows some representative images of immunogenic spots of three pairs of specific proteins (see more information of these proteins in FIG. 5 and Tables 1-3) from these proteome chips. Every E. coli protein is spotted in duplicate on the chip. Crohn's Disease vs ulcerative colitis vs healthy control can be distinguished by comparing the signal intensities between protein spots on the E. coli proteome chips. FIG. 2B is a scatter plot showing duplicate spots are highly correlated with each other (R=0.985). Each point is the plot of the original protein spot expression vs. the technical replicate protein spot expression. All 4265 proteins of a single array are displayed in this scatter plot.



FIGS. 3A to 3D show global immunogenic profiles of IBD patients' sera against E. coli proteins. FIG. 3A is a heatmap of 273 differentially immunogenic proteins between healthy controls (HC) and Crohn's Disease (CD) samples identified by SAM analysis. Yellow and blue colors indicate high and low immunogenic response, respectively. FIG. 3B is a heatmap of the 188 differentially immunogenic proteins between Crohn's Disease and ulcerative colitis samples identified by SAM analysis; and FIG. 3C illustrates 33 differentially immunogenic proteins between healthy controls and ulcerative colitis samples as identified by SAM analysis. Each row corresponds to a protein and each column corresponds to a sample. The expression level for each protein is normalized across the samples such that the mean is 0 and the standard deviation is 1. Blue and yellow indicates high and low immunogenic proteins, respectively. FIG. 3D is a Venn diagram of these differentially immunogenic proteins showing only limited overlapping among healthy control vs Crohn's Disease vs ulcerative colitis.



FIG. 4 is a graph showing the distribution of the cellular component terms in the highly immunogenic response proteins of healthy controls (HC), CD and ulcerative colitis. Six Cellular Component terms from the Gene Ontology were examined. Cell projection term contains flagellum and fimbrium proteins. The main messages include: 1) approximately 80% of the highly immunogenic proteins are either membrane proteins in healthy control (p<0.0001), compared to only ˜37% of the top immunogenic proteins in Crohn's Disease patients (not statistically significant); 2) conversely, ˜30% of top immunogenic proteins in Crohn's Disease patients are intracellular proteins (p<0.05) compared to only ˜7% in healthy control (not statistically significant); 3) a significant higher percentage of cell wall proteins (˜26%) are immunogenic in ulcerative colitis (p<0.05) compared to those in healthy control and Crohn's Disease (not significant); and 4) a significant percentage of macromolecular complex proteins (˜16%; p<0.05) in Crohn's Disease compared to those in healthy control or ulcerative colitis (not statistically significant). No statistically significant enrichment of proteins of periplasmic space and cell projection were found in healthy control, Crohn's Disease and ulcerative colitis.



FIGS. 5A to 5D show three heat maps showing that k-TSP identified the top three pairs of biomarkers that can discriminate controls from Crohn's Disease patients and protein spots. Each column of the heat maps represents the immunogenic reactivity by individual IBD patients or healthy control. Within a column, each row represents ratio of the immunogenic reactivity of a top scoring pair of proteins. The expression values represented are the ratio of immunogenic reactivity (fluorescent signal or intensity) to protein X divided by the signals to protein Y, referred to as the TSP ratio (X and Y being example proteins). If the immunogenic reactivity of a patient to protein X was greater than the reactivity to protein Y, the box will appear yellow, and blue for vice versa (see examples below). FIG. 5A depicts the classifier for healthy control vs Crohn's Disease (yellow=CD, blue=HC). For example, if immunogenic reaction against era is >ybaN, the subject is identifed as having Crohn's Disease and shows as yellow (light shading). If not, then its indicative of a healthy control (blue) (dark shading). FIG. 5B displays healthy control vs. ulcerative colitis classifier (yellow=ulcerative colitis, blue=healthy control). For example, if immunogenic reaction against relE is >cysE_wcaB, the subject is identified as having ulcerative colitis and shows as yellow, If not, then its indicative of it is classified as a healthy control (blue). FIG. 5C shows the Crohn's Disease vs. ulcerative colitis classifier (yellow=ulcerative colitis, blue=Crohn's Disease). If frvX is ≥yidX, it is a ulcerative colitis (yellow), or else a CD (blue). See representative images of some of those protein pairs in FIG. 2. FIG. 5D shows representative protein spots that were differentially recognized by sera from Crohn's Disease vs ulcerative colitis, respectively. This figure shows the relative immunogenic reactivity (fluorescent signals) of frvX and yidX by serum antibodies from a CD and ulcerative colitis patient.



FIGS. 6A-6C are scatter plots showing the immunogenic reactivity (signal) of the samples to era and ybaN individually (FIGS. 6A & B, respectively) and the TSP ratio (era/ybaN). FIG. 6C shows that immunogenic reactivity to era or ybaN alone (the top scoring pair in the HC vs CD k-TSP classifier) does not allow for class separation of the data; no threshold level would clearly separate healthy controls from Crohn's Disease. However, the ratio of the two features (top-scoring pair ratio) results in clear separation in the data lending well to classification FIG. 6C. Similar results were found when scatter plot analysis were done for the other two TSP pairs from the HC vs Crohn's Disease classifier as shown in FIG. 5A.



FIGS. 7A and 7B show scatter plots of immunoreactivity of OmpC and fliC, respectively, which were carried out as described in FIG. 6C. The present study found that OmpC and fliC (Cbir), two of the known serological markers, performed poorly in the present study. The scatter plots display the normalized immunogenic signal to each protein for every serum sample. The samples are separated along the x-axis according to class (healthy control, CD, and ulcerative colitis). Both statistical analysis and visual inspection demonstrate that antibodies against neither protein are capable of discriminating among the classes.





DETAILED DESCRIPTION OF THE INVENTION

The present invention provides biomarkers for use in serological testing for inflammatory bowel disease, and methods of using such markers to distinguish among intestinal disorders and selected effective therapies.


The invention is based, at least in part, on the discovery of new serological markers using a whole E. coli proteome microarray as a novel high-throughput proteomic approach to screening and identifying IBD markers. Each protein array, which contains 4,256 E. coli K12 proteins, was screened using individual serum from healthy controls (n=39) and clinically well-characterized patients with IBD [66 Crohn's disease (CD) and 29 ulcerative colitis (UC)]. Proteins that could be recognized by serum antibodies were visualized and quantified using Cy3-labeled goat anti-human antibodies. Surprisingly, SAM (significant analysis of microarray) analysis identified a total of 417 E. coli proteins that were differentially recognized by serum antibodies between healthy controls and Crohn's Disease or ulcerative colitis. Among those, 169 proteins were identified as highly immunogenic in healthy controls, 186 proteins are highly immunogenic in Crohn's Disease, only 19 in ulcerative colitis. Using a supervised learning algorithm (k-Top Scoring Pairs), two sets of serum antibodies were identified that were novel biomarkers for specifically distinguishing Crohn's Disease from healthy controls (accuracy: 86±4%; p<0.01), and Crohn's Disease from ulcerative colitis (accuracy: 80±2%; p<0.01), respectively. The Set 1 antibodies recognized three pairs of E. coli proteins: era vs ybaN, yhgN vs focA, and gabT vs ycdG and the Set 2 antibodies recognized yidX vs frvX. The specificity and sensitivity of Set 1 antibodies were 81±5% and 89±3%, respectively, while those of set 2 antibodies were 84±1% and 70±6%, respectively. Serum antibodies identified for distinguishing healthy controls vs ulcerative colitis were only marginal, since their accuracy, specificity and sensitivity were 66±5%, 69±5%, and 61±7%, respectively (p<0.04). Taken together, novel sets of serological biomarkers have been identified for diagnosis of Crohn's disease vs healthy control and Crohn's disease vs ulcerative colitis.


The use of biomarkers is particularly important because Crohn's disease and ulcerative colitis share many symptoms, both clinically and histologically. This makes the diagnosis of these two diseases difficult. The differences between Crohn's disease and ulcerative colitis exist at many levels. Crohn's disease may occur anywhere along the digestive tract from the mouth to the anus (although in most cases distal ileum and colon are affected). In ulcerative colitis, the large intestine (colon) is typically the only site that is affected. Second, the pattern of inflammation may be different. Ulcerative colitis tends to be continuous thoughout the inflamed area, while Crohn's disease exhibits skipped lessions or ranulomas (intermittent patterns between inflamed and healthy-looking tissues. Third, there can be difference in the degree of tissue penetration. In ulcerative colitis, the colonic mucosal lining is ulcerated, but this does not extend beyong the mucosal lining. In Crohn's, such ulceration is typically deeper and may extend to virtually any layers of colon wall. Finally, the complications associated with the disease may differ. In Crohn's disease patients may experience complications, such as fistulizing and structuring. These complications are much less frequent in ulcerative colitis. In up to 15% Crohn's patients, extra-intestinal manifistitions of disease can also occur. These may include inflammation in tissues or organs outside the gastrointestinal tract. Interesting general, smoking is bad and of a risk factor for Crohn's disease but protective or therapeutic for UC.


Inflammatory Bowel Disease


Serological testing is a non-invasive method for diagnosing IBD, and differentiating ulcerative colitis from Crohn's disease (Li et al., (2008) World J. Gastroenterol. 14, 5115-5124; Peyrin-Biroulet et al. (2007) Inflamm. Bowel. Dis. 13, 1561-1566; Vermeire et al. (2008) Gastroenterol. Clin. North Am. 37, 429-438). Several serological IBD biomarkers have been identified in the past decade, and some have been used in the clinics of IBD (Li et al., (2008) World J. Gastroenterol. 14, 5115-5124; Peyrin-Biroulet et al. (2007) Inflamm. Bowel. Dis. 13, 1561-1566; Vermeire et al. (2008) Gastroenterol. Clin. North Am. 37, 429-438). Many of these antibodies are produced on intestinal exposure to normal commensal bacteria in genetically susceptible individuals. Although it is not known whether these antibodies are pathogenic or not, they are specific to patients with either Crohn's disease or ulcerative colitis, and may reflect a dysregulated immune inflammatory response to intestinal bacterial antigens (Xavier (2007) Nature 448, 427-434, Strober (2002) Annu. Rev. Immunol. 20, 495-549; Blumberg (1999) Curr. Opin. Immunol. 11, 648-656; Papp et al., (2007) Inflamm. Bowel. Dis. 13, 984-992). Work on several experimental animal models of IBD have led to the suggestion that the pathogenesis of IBD may be the result of an aberrant immune response to normal commensal bacteria in genetically susceptible individuals. In fact, most of the major serological biomarkers being used in IBD clinics are antibodies to microbial antigens, including yeast oligomanna (anti-Saccharomyces cerevisiae, ASCA), bacterial outer membrane porin C (OmpC), Pseudomonas fluorescens bacterial sequence 12 (anti-I2), and most recently bacterial flagellin (CBir 1) ((Li et al., (2008) World J. Gastroenterol. 14, 5115-5124; Peyrin-Biroulet et al. (2007) Inflamm. Bowel. Dis. 13, 1561-1566; Vermeire et al. (2008) Gastroenterol. Clin. North Am. 37, 429-438)). All of these anti-microbial antibodies show preponderance in patients with Crohn's Disease. However, ASCA has been identified in up to 5% of patients with ulcerative colitis.


In comparison, IBD-specific pANCA or antineutrophil cytoplasmic antibody w perinuclear highlighting was first described in 1990. Although generally considered an autoantibody, the specific antigenic stimulation for pANCA production remains unclear. This auto-antibody is present in up to 70% of patients with ulcerative colitis, and in up to 20% of patients with CD. Recently, a panel of five new anti-glycan antibodies have been identified, including anti-chitobioside IgA (ACCA), anti-laminaribioside IgG (ALCA), anti-manobioside IgG (AMCA), and antibodies against chemically synthesized (Σ) two major oligomannose epitopes, Man α-1,3 Man α-1,2 Man (ΣMan3) and Man α-1,3 Man α-1,2 Man α-1,2 Man (ΣMan4) (Li (2008) World J. Gastroenterol. 14, 5115-5124, 13, 15). If desired, these conventional biomarkers may be used in combination with the new serological biomarkers delineated herein (e.g., FIG. 5 and Tables 2-5, and 7).


Collectively, these antibodies are not generally present in either children or adults with non-IBD disease, and may represent serological markers of intestinal inflammation specific to ulcerative colitis or Crohn's disease. Though encouraging, none of the current commercially available biomarker tests/assays, including all of those mentioned above, can be used as stand-alone tools in clinics, and therefore they are currently only recommended as an adjunct to endoscopy in diagnosis and prognosis of the disease (Li (2008) World J. Gastroenterol. 14, 5115-5124; 16, 17). Therefore, additional specific and sensitive IBD biomarkers are needed.


Proteomic technologies, such as 2-dimensional gel electrophoresis, various variations of mass spectrometry and protein chip (array) technology are now proving to be powerful tools in biomarker discovery and are beginning to be utilized in IBD biomarker discovery (Li (2008) World J. Gastroenterol. 14, 5115-5124; 18). These technologies enable robust, and/or large-scale and high-throughput identification and analysis of differential protein expression when comparing disease to control. Blood-based (serum or plasma-based) proteomics hold particular promises for biomarker discovery of various human diseases such as neurodegenerative diseases and cancers (Goldknopf (2008) Expert Rev. Proteomics. 5, 1-8; Maurya et al., (2007) Anticancer Res. 27, 1247-1255; Veenstra et al. (2005) 4, 409-418.-21). Antigen microarrays are also powerful tools that allow high-throughput serum analysis of aberrant immune responses in autoimmune diseases, as well as efficient discovery of biomarkers for infectious pathogens. The present invention provides methods of using an E coli proteome microarray to characterize differential immune responses (serum anti-E. coli antibodies) among patients clinically classified as having Crohn's disease, ulcerative colitis and healthy controls. In addition, the invention provides novel IBD-specific anti microbial antibodies, particularly anti-E. coli antibodies, which are present in IBD patients and were identified by screening the sera with E. coli protein arrays.


Serum Antibody Biomarkers


The present invention provides serum antibody biomarkers that are differentially present in subjects having an inflammatory bowel disease, such as Crohn's disease or ulcerative colitis. These serum antibody biomarkers specifically bind to particular E. coli polypeptides, which are delineated in FIG. 5 and Tables 2-5, and 7. In particular, the invention provides that serum antibody biomarkers of the invention may be used individually or in combination with other markers to provide a method of diagnosing an inflammatory bowel disease. In one embodiment, the diagnosis of an inflammatory bowel disease involves distinguishing an inflammatory bowel disease from healthy controls. In certain embodiments, biomarkers comprises the pairs and sets of E. coli polypeptides delineated in FIGS. 5A, 5B, and 5C and Table 7 and the corresponding serum antibodies. In another embodiment, the diagnosis of an inflammatory bowel disease involves distinguishing Crohn's disease from ulcerative colitis. The invention further provides methods for selecting or monitoring the efficacy of a therapeutic regimen in a subject having a inflammatory bowel disease Inflammatory bowel diseases include, but are not limited to Crohn's disease, ulcerative colitis, and indeterminate colitis.


Serum antibody biomarkers that are differentially present in samples of subjects having a inflammatory bowel disease and healthy control subjects find application in methods and kits for diagnosing an inflammatory bowel disease, such as Crohn's disease or ulcerative colitis, or distinguishing inflammatory bowel disease from healthy control. Accordingly, methods are provided for identifying inflammatory bowel disease in a subject, which involve detecting a differential presence of a serum antibody biomarker in subjects with a inflammatory bowel disease in a biological sample (e.g., blood, sera, plasma) obtained from the subject. The amount of one or more serum antibody biomarkers found in a test sample compared to a control, or the presence or absence of one or more serum antibody biomarkers in the test sample provides useful information regarding the inflammatory bowel disease status of the patient.


The serum antibody biomarkers can be measured in different types of biological samples. Preferably, the sample is a blood sample.


Detection Methods


The invention provides methods of detecting one or more serum antibody biomarkers associated with a inflammatory bowel disease or control markers in a blood sample obtained from a subject. The invention specifically describes the use of immunoassays to detect serum antibody biomarkers that specifically bind certain polypeptides or that measure the relative immune reaction against certain polypeptides. For example, the invention provides for the detection of greater immunogenic reactivity to era than to ybaN, greater immunogenic reactivity to yhgN than to focA, and greater immunogenic reactivity to gabT than ycdG. When each of these differential immunogenic reactivities is present then the subject is identified as having Crohn's Disease. In another example, the detection of greater immunogenic reactivity to frvX than to yidX identifies a subject as having ulcerative colitis.


In other embodiments, the invention provides at Table 7 methods for distinguishing Crohn's disease, ulcerative colitis, and healthy controls. In particular, microarrays comprising E. coli polypeptides delineated herein are useful for measuring immunogenic reactivity present in subject sera. Measurements can be relative to the immunogenic reactivity of another E. coli polypeptide. In one embodiment, the method provides that the following pairs can be used to measure relative levels of immunogenic reactivity.


For example, era>ybaN=Crohn's disease (CD)


The sequence of E. coli polypeptides are known in the art and can be identified in public databases by searching on the gene or polypeptide name. For example, the E. coli era polypeptide is NCBI Reference Sequence: AAA03242.1. The amino acid sequence of an exemplary era polypeptide is provided below.











  1
msidksycgf iaivgrpnvg kstllnkllg qkisitsrka qttrhrivgi htegayqaly






 61
vdtpglhmee krainrlmnk aasssigdve lvifvvegtr wtpddemyln klregkapvi





121
lavnkvdnvq ekadllphlq flasqmnfld ivpisaetgl nvdtiaaivr khlpeathhf





181
pedyitdrsq rfmaseiire klmrflgael pysvtveier fvsnerggyd inglilvere





241
gqkkmvignk gakiktigie arkdmqemfe apvhlelwvk yksgwadder alrslgyvdd





301
l






The sequence of an exemplary E. coli ybaN polypeptide (NCBI Reference Sequence: AP_001117.1) is provided below:











  1
mqriiliiig wlavvlgtlg vvlpvlpttp fillaawcfa rssprfhawl lyrswfgsyl






 61
rfwqkhhamp rgvkpraill illtfaislw fvqmpwvrim llvilacllf ymwripvide





121
kqekh






In another example, yhgN>focA=CD. The sequence of an exemplary E. coli yhgN (NCBI Reference Sequence: AP_004357) is provided below:











  1
mneiisaavl lilimdplgn lpifmsvlkh tepkrrraim vrelliallv mlvflfagek






 61
ilaflslrae tvsisggiil fliaikmifp sasgnssglp ageepfivpl aiplvagpti





121
latlmllshq ypnqmghlvi alllawggtf villqsslfl rllgekgvna lerlmglilv





181
mmatqmfldg irmwmkg






The sequence of an exemplary E. coli focA (NCBI Reference Sequence: AP_001534) is provided below:











  1
mkadnpfdll lpaamakvae eagvykatkh plktfylait agvfisiafv fyitattgtg






 61
tmpfgmaklv ggicfslgli lcvvcgadlf tstvlivvak asgritwgql aknwlnvyfg





121
nlvgallfvl lmwlsgeymt angqwglnvl qtadhkvhht fieavclgil anlmvclavw





181
msysgrslmd kafimvlpva mfvasgfehs ianmfmipmg ivirdfaspe fwtavgsape





241
nfshltvmnf itdnlipvti gniigggllv gltywviylr endhh






For example, gabT>ycdG=CD The sequence of an exemplary E. coli gabT (NCBI Reference Sequence: AP_003235.1) is provided below











  1
mnsnkelmqr rsgaiprgvg qihpifadra encrvwdveg reyldfaggi avintghlhp






 61
kvvaaveaql kklshtcfqv layepylelc eimnqkvpgd fakktllvtt gseavenavk





121
iaraatkrsg tiafsgayhg rthytlaltg kvnpysagmg lmpghvyral ypcplhgise





181
ddaiasihri fkndaapedi aaiviepvqg eggfyasspa fmqrlralcd ehgimliade





241
vqsgagrtgt lfameqmgva pdlttfaksi aggfplagvt graevmdava pgglggtyag





301
npiacvaale vlkvfeqenl lqkandlgqk lkdgllaiae khpeigdvrg lgamiaielf





361
edgdhnkpda kltaeivara rdkglillsc gpyynvlril vpltiedaqi rqgleiisqc





421
fdeakq






The sequence of an exemplary E. coli ycdG (NCBI Reference Sequence: AP_001637.1) is provided below.











  1
mamfgfphwq lkststesgv vapderlpfa qtavmgvqha vamfgatvlm pilmgldpnl






 61
silmsgigtl lfffitggry psylgssaaf vgvviaatgf ngqginpnis ialggiiacg





121
lvytviglvv mkigtrwier lmppvvtgav vmaiglnlap iavksysasa fdswmavmtv





181
lciglvavft rgmiqrllil vglivaclly gvmtnvlglg kavdftivsh aawfglphfs





241
tpafngqamm liapvavilv aenlghlkav agmtgrnmdp ymgrafvgdg latmlsgsvg





301
gsgvttyaen igvmavtkvy stivfvaaav iamllgfspk fgalihtipa aviggasivv





361
fgliavagar iwvqnrvdls qngnlimvav tivlgagdfa ltlggftlgg igtatfgail





421
lnallsrklv dvpppevvhq ep






In other examples yidX (NCBI AP_004097)>frvX=UC; relE (NCBI ABD51640.1)>cysE/wcaB (NCBI CAQ33933.1)=UC; lnt (NCBI AP_001306.1)>ybiO (NCBI AP_001439.1)=UC; ftsE (NCBI AP_004329.1)>pssR (NCBI F65179)=UC; yhgN (NCBI AP_004357.1)>yhfG (NCBI AP_004427.1)=UC; yafN (NCBI AP_000885.1)>dsbB(NCBI AP_001810.1)=UC; yihI (NCBI AP_003942.1)>yabK (NCBI AAC73178.1)=UC 421#15>yhdN (NCBI AAC76318.1)=UC; hisP (NCBI AAC75366.1)>rplO (NCBI AAC76326.1)=UC; cml (NCBI P12056.1)>nuoM (NCBI AP_002875.1)=UC; yieC (NCBI AAC76743.1)>nuol (NCBI AP_002879.1)=UC.


One of skill in the art will recognize that any suitable method can be used to detect the serum antibody biomarkers described herein. Successful practice of the invention can be achieved with one or a combination of methods that can detect and/or quantify the markers. Such methods include, without limitation, hybridization-based methods including those employed in microarrays, mass spectrometry (e.g., laser desorption/ionization mass spectrometry), fluorescence (e.g. sandwich immunoassay), surface plasmon resonance, ellipsometry, atomic force microscopy, and 2-dimensional gel electrophoresis. Methods may further include, one or more of electrospray ionization mass spectrometry (ESI-MS), ESI-MS/MS, ESI-MS/(MS)n, matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF-MS), surface-enhanced laser desorption/ionization time-of-flight mass spectrometry (SELDI-TOF-MS), desorption/ionization on silicon (DIOS), secondary ion mass spectrometry (SIMS), quadrupole time-of-flight (Q-TOF), atmospheric pressure chemical ionization mass spectrometry (APCI-MS), atmospheric pressure photoionization mass spectrometry (APPI-MS), quadrupole mass spectrometry, fourier transform mass spectrometry (FTMS), and ion trap mass spectrometry. In one preferred embodiment, detection methods employ a microchip array comprising immunogenic pathogen (e.g., E. coli) polypeptides.


Microarrays


As described herein, collections of immunogenic E. coli polypeptides may be used to identify serum antibody biomarker profiles that are associated with inflammatory bowel disease. These collections preferably include polypeptides that are differentially immunogenic (e.g., polypeptides that induce serum antibody biomarkers in healthy controls, but not in inflammatory bowel disease, or polypeptides that induce serum antibody biomarkers in inflammatory bowel disease or ulcerative colitis, but not in healthy controls). Such polypeptides of the invention are useful as hybridizable array elements in a microarray. Polypeptides useful in arrays of the invention include, but are not limited to, those polypeptides delineated in FIG. 5 and Tables 2-5, and 7. The array elements are organized in an ordered fashion such that each element is present at a specified location (i.e., an addressable location) on the substrate. Useful substrate materials include membranes, composed of paper, nylon or other materials, filters, chips, glass slides, and other solid supports. The ordered arrangement of the array elements allows hybridization patterns and intensities to be interpreted as levels of particular serum antibody biomarkers. Methods for making polypeptide microarrays are described, for example, by Ge (Nucleic Acids Res. 28: e3. i-e3. vii, 2000), MacBeath et al., (Science 289:1760-1763, 2000), Zhu et al (Nature Genet. 26:283-289), and in U.S. Pat. No. 6,436,665, hereby incorporated by reference.


Serum antibody biomarkers associated with inflammatory bowel disease may be analyzed using protein microarrays comprising the entire E. coli proteome, or comprising as few as one, two, three, four, five, or six E. coli proteins. Typically, protein microarrays feature a protein, or fragment thereof, bound to a solid support. Suitable solid supports include membranes (e.g., membranes composed of nitrocellulose, paper, or other material), polymer-based films (e.g., polystyrene), beads, or glass slides. For some applications, proteins are spotted on a substrate using any convenient method known to the skilled artisan (e.g., by hand or by inkjet printer). Preferably, such methods retain the biological activity or function of the protein bound to the substrate (Ge et al., supra; Zhu et al., supra).


The protein microarray is hybridized with blood, serum, or plasma derived from a subject. The sample comprises antibodies that specifically bind an E. coli polypeptide, thereby acting as probes. Probes can also include antibodies, candidate peptides, nucleic acids, or small molecule compounds derived from a peptide, nucleic acid, or chemical library. Hybridization conditions (e.g., temperature, pH, protein concentration, and ionic strength) are optimized to promote specific interactions. Such conditions are known to the skilled artisan and are described, for example, in Harlow, E. and Lane, D., Using Antibodies: A Laboratory Manual. 1998, New York: Cold Spring Harbor Laboratories. After removal of non-specific probes, specifically bound probes are detected, for example, by fluorescence, enzyme activity (e.g., an enzyme-linked calorimetric assay), direct immunoassay, radiometric assay, or any other suitable detectable method known to the skilled artisan.


The biochip surfaces may, for example, be ionic, anionic, hydrophobic; comprised of immobilized nickel or copper ions, comprised of a mixture of positive and negative ions; and/or comprised of one or more antibodies, single or double stranded nucleic acids, proteins, peptides or fragments thereof, amino acid probes, or phage display libraries. Many protein biochips are described in the art. These include, for example, protein biochips produced by Ciphergen Biosystems (Fremont, Calif.), Packard BioScience Company (Meriden Conn.), Zyomyx (Hayward, Calif.) and Phylos (Lexington, Mass.). Examples of such protein biochips are described in the following patents or patent applications: U.S. Pat. No. 6,225,047 (Hutchens and Yip, “Use of retentate chromatography to generate difference maps,” May 1, 2001); International publication WO 99/51773 (Kuimelis and Wagner, “Addressable protein arrays,” Oct. 14, 1999); U.S. Pat. No. 6,329,209 (Wagner et al., “Arrays of protein-capture agents and methods of use thereof,” Dec. 11, 2001) and International publication WO 00/56934 (Englert et al., “Continuous porous matrix arrays,” Sep. 28, 2000).


Serum antibody biomarkers may be captured with capture reagents (e.g., target polypeptides) immobilized to a solid support, such as a biochip, a multiwell microtiter plate, a resin, or nitrocellulose membranes that are subsequently probed for the presence of proteins. Capture can be on a chromatographic surface or a biospecific surface. For example, a serological sample containing the serum antibody biomarkers may be contacted with the active surface of a biochip for a sufficient time to allow binding. Then, unbound molecules are washed from the surface using a suitable eluant, such as phosphate buffered saline. In general, the more stringent the eluant, the more tightly the proteins must be bound to be retained after the wash.


Upon capture on a biochip, analytes can be detected by a variety of detection methods selected from, for example, a gas phase ion spectrometry method, an optical method, an electrochemical method, atomic force microscopy and a radio frequency method. Gas phase ion spectrometry methods are described herein. Of particular interest is the use of mass spectrometry, and in particular, SELDI. Optical methods include, for example, detection of fluorescence, luminescence, chemiluminescence, absorbance, reflectance, transmittance, birefringence or refractive index (e.g., surface plasmon resonance, ellipsometry, a resonant mirror method, a grating coupler waveguide method or interferometry). Optical methods include microscopy (both confocal and non-confocal), imaging methods and non-imaging methods. Immunoassays in various formats (e.g., ELISA) are popular methods for detection of analytes captured on a solid phase. Electrochemical methods include voltametry and amperometry methods. Radio frequency methods include multipolar resonance spectroscopy.


Mass spectrometry (MS) is a well-known tool for analyzing chemical compounds. Thus, in one embodiment, the methods of the present invention comprise performing quantitative MS to measure serum antibody biomarkers present in a serological sample. The method may be performed in an automated (Villanueva, et al., Nature Protocols (2006) 1(2):880-891) or semi-automated format. This can be accomplished, for example with MS operably linked to a liquid chromatography device (LC-MS/MS or LC-MS) or gas chromatography device (GC-MS or GC-MS/MS). Methods for performing MS are known in the field and have been disclosed, for example, in US Patent Application Publication Nos: 20050023454; 20050035286; U.S. Pat. No. 5,800,979 and references disclosed therein.


The protein fragments, whether they are peptides derived from the main chain of the protein or are residues of a side-chain, are collected on the collection layer. They may then be analyzed by a spectroscopic method based on matrix-assisted laser desorption/ionization (MALDI) or electrospray ionization (ESI). The preferred procedure is MALDI with time of flight (TOF) analysis, known as MALDI-TOF MS. This involves forming a matrix on the membrane, e.g. as described in the literature, with an agent which absorbs the incident light strongly at the particular wavelength employed. The sample is excited by UV, or IR laser light into the vapour phase in the MALDI mass spectrometer. Ions are generated by the vaporization and form an ion plume. The ions are accelerated in an electric field and separated according to their time of travel along a given distance, giving a mass/charge (m/z) reading which is very accurate and sensitive. MALDI spectrometers are commercially available from PerSeptive Biosystems, Inc. (Frazingham, Mass., USA) and are described in the literature, e.g. M. Kussmann and P. Roepstorff, cited above.


Diagnostics


Levels of particular serum antibody biomarkers have been correlated with a particular inflammatory bowel disease state, and thus are useful in diagnosis. In one embodiment, a patient having a inflammatory bowel disease will show an alteration in the expression of one or more serum antibody biomarkers delineated herein. In another embodiment, a patient having a inflammatory bowel disease will have a particular expression profile that includes significantly altered expression of two or more serum antibody biomarkers. Alterations in serum antibody biomarkers levels are detected using methods known to the skilled artisan and described herein.


If desired, biomarkers delineated herein are used alone or in combination with convention biomarkers, which include anti-glycan antibodies (e.g., anti-chitobioside IgA (ACCA), anti-laminaribioside IgG (ALCA), anti-manobioside IgG (AMCA)), antibodies against chemically synthesized (Σ) two major oligomannose epitopes, Man α-1,3 Man α-1,2 Man (ΣMan3) and Man α-1,3 Man α-1,2 Man α-1,2 Man (ΣMan4) (Li (2008) World J. Gastroenterol. 14, 5115-5124; 13, 15), IBD-specific pANCA or antineutrophil cytoplasmic antibody, antibodies to microbial antigens (e.g., yeast oligomanna (anti-Saccharomyces cerevisiae, RSCA), bacterial outer membrane porin C (OmpC), Pseudomonas fluorescens bacterial sequence I2 (anti-I2), and antibodies against bacterial flagellin).


In one embodiment, E. coli polypeptides or fragments derived from these polypeptides may be used as targets in a microarray. The microarray is used to assay the level of large numbers of serum antibody biomarkers simultaneously and to identify alterations in the overall or relative levels of expression. Such information can be used to diagnose a inflammatory bowel disease or a subject having a propensity to develop such a condition.


In one embodiment, an increased level of a serum antibody biomarker that specifically binds frvX relative to the level of serum antibody biomarker that binds yidX identifies a subject as having ulcerative colitis. In another embodiment an increased level of serum antibody biomarker binding to era relative to ybaN, increased serum antibody biomarker binding to yhgN relative to focA, and/or increased serum antibody biomarker binding to gabT relative to ycdG identifies a subject as having Crohn's Disease. A variety of protocols for measuring an alteration in the expression of such polypeptides are known, including immunological methods (such as ELISAs and RIAs), and provide a basis for diagnosing an inflammatory bowel disease.


In additional embodiment of the methods of the present invention, multiple markers are measured. The use of multiple markers increases the predictive value of the test and provides greater utility in diagnosis, treatment selection, patient stratification and patient monitoring. The process detects serum antibody biomarker profiles formed by the analysis of multiple markers. Such analysis may improve the sensitivity and specificity of tests delineated herein. Subtle variations in data from clinical samples indicate that certain patterns of serum antibody biomarker expression can predict phenotypes such as the presence or absence of a certain disease, a particular stage of disease progression, or a positive or adverse response to drug treatments.


Data generated by detection of serum antibody biomarkers can be analyzed using any suitable means. In one embodiment, data is analyzed with the use of a programmable digital computer. The computer program generally contains a readable medium that stores data. This data can indicate the number of serum antibody biomarkers detected, including the strength of the signal generated by each marker. Data analysis can include the steps of determining signal strength of a marker detected. When the sample is measured and data is generated, the data is then analyzed by a computer software program


As indicated above, the invention provides methods for aiding a human inflammatory bowel disease diagnosis using one or more serum antibody biomarkers, as specified herein. These markers can be used alone, in combination with other markers in any set, or with entirely different markers in aiding human inflammatory bowel disease diagnosis. The serum antibody biomarkers are differentially present in samples of a subject having or having a propensity to develop a inflammatory bowel disease and a healthy control subject in whom inflammatory bowel disease is undetectable. For example, some of the serum antibody biomarkers are expressed at an elevated level and/or are present at a higher frequency in human inflammatory bowel disease subjects than in normal subjects, while some of the serum antibody biomarkers are expressed at a decreased level and/or are present at a lower frequency in human inflammatory bowel disease subjects than in normal subjects. Therefore, detection of one or more of these markers in a person would provide useful information regarding the probability that the person may have an inflammatory bowel disease.


The detection of a marker is then correlated with a probable diagnosis of inflammatory bowel disease. In some embodiments, the detection of the mere presence or absence of a marker, without quantifying the amount thereof, is useful and can be correlated with a probable diagnosis of inflammatory bowel disease. The measurement of markers may also involve quantifying the markers to correlate the detection of markers with a probable diagnosis of inflammatory bowel disease. Thus, if the amount of the markers detected in a subject being tested is different compared to a control amount (i.e., higher or lower than the control, depending on the marker), then the subject being tested has a higher probability of having inflammatory bowel disease.


The correlation may take into account the amount of the serum antibody biomarkers in the sample compared to a control amount of the serum antibody biomarkers (up or down regulation of the marker or markers) in normal subjects or in subjects where inflammatory bowel disease is undetectable. A control can be, e.g., the average or median amount of marker present in comparable samples of normal subjects. The control amount is measured under the same or substantially similar experimental conditions as in measuring the test amount. As a result, the control can be employed as a reference standard, where the normal phenotype is known, and each result can be compared to that standard, rather than re-running a control.


Accordingly, a serum antibody biomarkers profile may be obtained from a subject sample and compared to a reference marker profile obtained from a reference population, so that it is possible to classify the subject as belonging to or not belonging to the reference population. The correlation may take into account the presence or absence of the markers in a test sample and the frequency of detection of the same markers in a control. The correlation may take into account both of such factors to facilitate determination inflammatory bowel disease status.


Any marker, individually, is useful in aiding in the determination of inflammatory bowel disease status. First, the selected serum antibody biomarkers is detected in a subject sample using the methods described herein (e.g. microarray analysis). Then, the result is compared with a control that distinguishes inflammatory bowel disease status from non-inflammatory bowel disease status. As is well understood in the art, the techniques can be adjusted to increase sensitivity or specificity of the diagnostic assay depending on the preference of the diagnostician.


While individual serum antibody biomarkers are useful diagnostic markers, in some instances, a combination of markers provides greater predictive value than single markers alone. The detection of a plurality of markers (or absence thereof, as the case may be) in a sample can increase the percentage of true positive and true negative diagnoses and decrease the percentage of false positive or false negative diagnoses. Thus, one method of the present invention provides for the measurement of more than one marker.


Optionally, methods described herein may be combined with any conventional method for the diagnosis of IBD (e.g., stool sample analysis, colonoscopy or sigmoidoscopy, barium x-ray, computerized axial tomography, and or capsule endoscopy).


Monitoring


Methods of characterizing inflammatory bowel disease in a subject are also useful in managing subject treatment based on the subject's status. The invention provides for such methods where the serum antibody biomarkers (or specific combinations of markers) are measured before and again after subject management. In these cases, the methods are used to monitor the status of the inflammatory bowel disease, e.g., response to inflammatory bowel disease treatment, amelioration of the disease or progression of the disease.


For example, markers of the invention (e.g., antibodies that bind an E. coli polypeptide listed in FIG. 5 and Tables 2-5, and 7) can be used to monitor a subject's response to certain treatments of inflammatory bowel disease. The level or function of a marker delineated herein may be measured before treatment, during treatment, or following the conclusion of a treatment regimen. Preferably, multiple measurements (e.g., 2, 3, 4, 5) are made at one or more of those times. Measurements are made, for example, using an immunoassay, microarray or other method to determine the expression profile of one or more serum antibody biomarkers. Such monitoring may be useful, for example, in assessing the efficacy of a particular drug in a patient. Therapeutics that normalize the levels of a serum antibody biomarker (e.g., that increase or reduce levels to correspond to levels present in a healthy control subject) are taken as particularly useful in the invention.


Kits


In one aspect, the invention provides kits for monitoring and diagnosing inflammatory bowel disease, wherein the kits can be used to detect the markers described herein. For example, the kits can be used to detect any one or more of the markers differentially present in samples of inflammatory bowel disease subjects vs. normal subjects. If desired a kit of the invention includes any one or more of the E. coli polypeptides listed in FIG. 5 and Tables 2-5, and 7. In one embodiment, the kit comprises a set of biomarkers for distinguishing Crohn's Disease from healthy control, the set comprising era, ybaN, yhgN, focA, ga bT and ycdG. In another embodiment, the kit comprises the set of biomarkers for distinguishing Crohns from UC, which is yidx/frvx. If desired, the kit comprises reagents suitable for measuring conventional IBD biomarkers, including anti-glycan antibodies (e.g., anti-chitobioside IgA (ACCA), anti-laminaribioside IgG (ALCA), anti-manobioside IgG (AMCA)), antibodies against chemically synthesized (Σ) two major oligomannose epitopes, Man α-1,3 Man α-1,2 Man (ΣMan3) and Man α-1,3 Man α-1,2 Man α-1,2 Man (ΣMan4) (Li (2008) World J. Gastroenterol. 14, 5115-512413, 15), IBD-specific pANCA or antineutrophil cytoplasmic antibody, antibodies to microbial antigens (e.g., yeast oligomanna (anti-Saccharomyces cerevisiae, RSCA), bacterial outer membrane porin C (OmpC), Pseudomonas fluorescens bacterial sequence 12 (anti-12), and antibodies against bacterial flagellin (air).


The kits of the invention have many applications. For example, the kits can be used to distinguish between inflammatory bowel disease and control, to determine if a subject has a Crohn's Disease or ulcerative colitis, or to determine that the subject does not have inflammatory bowel disease, thus aiding in inflammatory bowel disease diagnosis. The kits can also be used to identify compounds that modulate expression of one or more of the serum antibody biomarkers in an animal model of inflammatory bowel disease.


The kits of the invention may include instructions for the assay, reagents, testing equipment (test tubes, reaction vessels, needles, syringes, etc.), standards for calibrating the assay, and/or equipment provided or used to conduct the assay. Reagents may include acids, bases, oxidizing agents, marker species. The instructions provided in a kit according to the invention may be directed to suitable operational parameters in the form of a label or a separate insert.


The kits may also include an adsorbent, wherein the adsorbent retains one or more markers selected from one or more of the markers described herein, and written instructions for use of the kit for detection of an inflammatory bowel disease. Such a kit could, for example, comprise: (a) a substrate comprising an adsorbent thereon, wherein the adsorbent is suitable for binding a serum antibody biomarkers, and (b) instructions to detect the serum antibody biomarkers by contacting a sample with the adsorbent and detecting the serum antibody biomarkers retained by the adsorbent. Accordingly, the kit could further comprise a detection reagent.


Optionally, the kit may further comprise a standard or control information so that the test sample can be compared with the control information standard to determine if the test amount of a marker detected in a sample is a diagnostic amount consistent with a diagnosis of inflammatory bowel disease.


Selection of a Treatment Method


After a subject is diagnosed as having inflammatory bowel disease a method of treatment is selected. Because inflammatory bowel disease typically involves an excessive or undesirable immune response, therapies often involve treatment with immunosuppressive agents. Such therapies would not be appropriate for a subject that has irritable bowel syndrome. Thus, the invention provides methods for selecting an appropriate therapy for a subject, the method involving identifying a subject as having inflammatory bowel disease, Crohn's disease or ulcerative colitis, and administering to the subject a therapeutic treatment appropriate for that disease. Exemplary treatments for IBD include but are not limited to aminosalicylates, immunomodulators, infliximab, adalimumab, certolizumab, and/or antibiotics.


Biomarkers identified herein are useful for identifying subjects in need of surgery. In particular embodiments, pairs and sets of biomarkers delineated in Tables 2-5, 7, and FIG. 5 are useful alone or in combination with existing biomarkers to identify subjects that could benefit from surgery.



E. coli Polypeptides and Analogs


Also included in the invention are E. coli polypeptides or fragments thereof that are modified in ways that enhance or do not inhibit their ability to bind a serum antibody. In one embodiment, the invention provides methods for optimizing an E. coli amino acid sequence or nucleic acid sequence by producing an alteration. Such changes may include certain mutations, deletions, insertions, or post-translational modifications. In one preferred embodiment, the E. coli amino acid sequence is modified to enhance protease resistance. Accordingly, the invention further includes polypeptides of other yeast or bacteria having at least 85%, 90%, 95% or greater sequence identity to an E. coli polypeptide delineated herein. In other embodiments, the invention includes analogs of any naturally-occurring polypeptide of the invention. Analogs can differ from the naturally-occurring the polypeptide of the invention by amino acid sequence differences, by post-translational modifications, or by both. Analogs of the invention will generally exhibit at least 85%, more preferably 90%, and most preferably 95% or even 99% identity with all or part of a naturally-occurring amino, acid sequence of the invention. The length of sequence comparison is at least 10, 13, 15 amino acid residues, preferably at least 25 amino acid residues, and more preferably more than 35 amino acid residues. Again, in an exemplary approach to determining the degree of identity, a BLAST program may be used, with a probability score between e−3 and e−100 indicating a closely related sequence. Modifications include in vivo and in vitro chemical derivatization of polypeptides, e.g., acetylation, carboxylation, phosphorylation, or glycosylation; such modifications may occur during polypeptide synthesis or processing or following treatment with isolated modifying enzymes. Analogs can also differ from the naturally-occurring polypeptides of the invention by alterations in primary sequence. These include genetic variants, both natural and induced (for example, resulting from random mutagenesis by irradiation or exposure to ethanemethylsulfate or by site-specific mutagenesis as described in Sambrook, Fritsch and Maniatis, Molecular Cloning: A Laboratory Manual (2d ed.), CSH Press, 1989, or Ausubel et al., supra). Also included are cyclized peptides, molecules, and analogs which contain residues other than L-amino acids, e.g., D-amino acids or non-naturally occurring or synthetic amino acids, e.g., .beta. or .gamma. amino acids.


In addition to full-length polypeptides, the invention also includes fragments of any one of the polypeptides of the invention. As used herein, the term “a fragment” means at least 5, 10, 13, or 15 amino acids in length. In other embodiments a fragment is at least 20 contiguous amino acids, at least 30 contiguous amino acids, or at least 50 contiguous amino acids, and in other embodiments at least 60 to 80 or more contiguous amino acids. Fragments of the invention can be generated by methods known to those skilled in the art or may result from normal protein processing (e.g., removal of amino acids from the nascent polypeptide that are not required for biological activity or removal of amino acids by alternative mRNA splicing or alternative protein processing events).


Screening Assays


Methods of the invention are useful for the high-throughput low-cost screening of candidate agents that bind an E. coli polypeptide described herein. A candidate agent that specifically binds to a E. coli is then isolated and tested for activity in an in vitro assay or in vivo assay. If desired, the candidate agent comprises a detectable label. In one embodiment, such polypeptides are subsequently screened for an effect on bacterial proliferation or as agents that block antibody binding to a E. coli polypeptide listed herein. One skilled in the art appreciates that the effects of a candidate agent on a cell is typically compared to a corresponding control cell not contacted with the candidate agent. Thus, the screening methods include comparing the effect of a candidate agent with an untreated control cell.


In one embodiment, candidate compounds may be identified by first assaying those that specifically bind to an E. coli polypeptide of the invention. Such an interaction can be readily assayed using any number of standard binding techniques and functional assays (e.g., those described in Ausubel et al., supra). For example, a candidate compound may be tested in vitro for interaction and binding with a polypeptide of the invention and its ability to modulate bacterial proliferation may be assayed by any standard assays (e.g., those described herein)


In one particular example, a candidate compound that binds to an E. coli polypeptide may be identified using a chromatography-based technique. For example, a recombinant E. coli polypeptide of the invention may be purified by standard techniques from cells engineered to express the polypeptide, or may be chemically synthesized, once purified the peptide is immobilized on a column. A solution of candidate agents is then passed through the column, and an agent that specifically binds the polypeptide or a fragment thereof is identified on the basis of its ability to bind to polypeptide and to be immobilized on the column. To isolate the agent, the column is washed to remove non-specifically bound molecules, and the agent of interest is then released from the column and collected. Agents isolated by this method (or any other appropriate method) may, if desired, be further purified (e.g., by high performance liquid chromatography). In addition, these candidate agents may be tested for their ability to reduce bacterial proliferation or block serum antibody binding to an E. coli polypeptide. Agents isolated by this approach may also be used, for example, as therapeutics to treat or prevent inflammatory bowel disease (e.g., Crohn's, ulcerative colitis). Compounds that are identified as binding to a an E. coli polypeptide with an affinity constant less than or equal to 1 nM, 5 nM, 10 nM, 100 nM, 1 mM or 10 mM are considered particularly useful in the invention.


Such agents may be used, for example, as a therapeutic to combat the pathogenicity of an bacterial pathogen. Optionally, agents identified in any of the above-described assays may be confirmed as useful in conferring protection against the development of a pathogen infection in any standard animal model and, if successful, may be used as anti-pathogen therapeutics.


Each of the protein sequences provided herein may also be used in the discovery and development of antipathogenic compounds (e.g., antibiotics). The E. coli protein, upon expression, can be used as a target for the screening of drugs to treat or prevent IBD.


Test Compounds and Extracts


In general, candidate agents are identified from large libraries of natural product or synthetic (or semi-synthetic) extracts or chemical libraries or from polypeptide or nucleic acid libraries, according to methods known in the art. Those skilled in the field of drug discovery and development will understand that the precise source of test extracts or compounds is not critical to the screening procedure(s) of the invention. Agents used in screens may include known those known as therapeutics for the treatment of pathogen infections. Alternatively, virtually any number of unknown chemical extracts or compounds can be screened using the methods described herein. Examples of such extracts or compounds include, but are not limited to, plant-, fungal-, prokaryotic- or animal-based extracts, fermentation broths, and synthetic compounds, as well as the modification of existing polypeptides.


Libraries of natural polypeptides in the form of bacterial, fungal, plant, and animal extracts are commercially available from a number of sources, including Biotics (Sussex, UK), Xenova (Slough, UK), Harbor Branch Oceangraphics Institute (Ft. Pierce, Fla.), and PharmaMar, U.S.A. (Cambridge, Mass.). Such polypeptides can be modified to include a protein transduction domain using methods known in the art and described herein. In addition, natural and synthetically produced libraries are produced, if desired, according to methods known in the art, e.g., by standard extraction and fractionation methods. Examples of methods for the synthesis of molecular libraries can be found in the art, for example in: DeWitt et al., Proc. Natl. Acad. Sci. U.S.A. 90:6909, 1993; Erb et al., Proc. Natl. Acad. Sci. USA 91:11422, 1994; Zuckermann et al., J. Med. Chem. 37:2678, 1994; Cho et al., Science 261:1303, 1993; Carrell et al., Angew. Chem. Int. Ed. Engl. 33:2059, 1994; Carell et al., Angew. Chem. Int. Ed. Engl. 33:2061, 1994; and Gallop et al., J. Med. Chem. 37:1233, 1994. Furthermore, if desired, any library or compound is readily modified using standard chemical, physical, or biochemical methods.


Numerous methods are also available for generating random or directed synthesis (e.g., semi-synthesis or total synthesis) of any number of polypeptides, chemical compounds, including, but not limited to, saccharide-, lipid-, peptide-, and nucleic acid-based compounds.


Synthetic compound libraries are commercially available from Brandon Associates (Merrimack, N.H.) and Aldrich Chemical (Milwaukee, Wis.). Alternatively, chemical compounds to be used as candidate compounds can be synthesized from readily available starting materials using standard synthetic techniques and methodologies known to those of ordinary skill in the art. Synthetic chemistry transformations and protecting group methodologies (protection and deprotection) useful in synthesizing the compounds identified by the methods described herein are known in the art and include, for example, those such as described in R. Larock, Comprehensive Organic Transformations, VCH Publishers (1989); T. W. Greene and P. G. M. Wuts, Protective Groups in Organic Synthesis, 2nd ed., John Wiley and Sons (1991); L. Fieser and M. Fieser, Fieser and Fieser's Reagents for Organic Synthesis, John Wiley and Sons (1994); and L. Paquette, ed., Encyclopedia of Reagents for Organic Synthesis, John Wiley and Sons (1995), and subsequent editions thereof.


Libraries of compounds may be presented in solution (e.g., Houghten, Biotechniques 13:412-421, 1992), or on beads (Lam, Nature 354:82-84, 1991), chips (Fodor, Nature 364:555-556, 1993), bacteria (Ladner, U.S. Pat. No. 5,223,409), spores (Ladner U.S. Pat. No. 5,223,409), plasmids (Cull et al., Proc Natl Acad Sci USA 89:1865-1869, 1992) or on phage (Scott and Smith, Science 249:386-390, 1990; Devlin, Science 249:404-406, 1990; Cwirla et al. Proc. Natl. Acad. Sci. 87:6378-6382, 1990; Felici, J. Mol. Biol. 222:301-310, 1991; Ladner supra.).


In addition, those skilled in the art of drug discovery and development readily understand that methods for dereplication (e.g., taxonomic dereplication, biological dereplication, and chemical dereplication, or any combination thereof) or the elimination of replicates or repeats of materials already known for their activity should be employed whenever possible.


When a crude extract is found to have E. coli polypeptide binding activity further fractionation of the positive lead extract is necessary to isolate molecular constituents responsible for the observed effect. Thus, the goal of the extraction, fractionation, and purification process is the careful characterization and identification of a chemical entity within the crude extract that treats or prevents IBD or acts as an antibiotic. Methods of fractionation and purification of such heterogenous extracts are known in the art. If desired, compounds shown to be useful as therapeutics are chemically modified according to methods known in the art.


The present invention provides methods of treating inflammatory bowel disease and/or disorders or symptoms thereof which comprise administering a therapeutically effective amount of a pharmaceutical composition comprising a compound of the formulae herein to a subject (e.g., a mammal such as a human). Thus, one embodiment is a method of treating a subject suffering from or susceptible to a inflammatory bowel disease or disorder or symptom thereof. The method includes the step of administering to the mammal a therapeutic amount of an amount of a compound herein sufficient to treat the disease or disorder or symptom thereof, under conditions such that the disease or disorder is treated.


The methods herein include administering to the subject (including a subject identified as in need of such treatment) an effective amount of a compound described herein, or a composition described herein to produce such effect. Identifying a subject in need of such treatment can be in the judgment of a subject or a health care professional and can be subjective (e.g. opinion) or objective (e.g. measurable by a test or diagnostic method). As used herein, the terms “treat,” treating,” “treatment,” and the like refer to reducing or ameliorating a disorder and/or symptoms associated therewith. It will be appreciated that, although not precluded, treating a disorder or condition does not require that the disorder, condition or symptoms associated therewith be completely eliminated. As used herein, the terms “prevent,” “preventing,” “prevention,” “prophylactic treatment” and the like refer to reducing the probability of developing a disorder or condition in a subject, who does not have, but is at risk of or susceptible to developing a disorder or condition.


The therapeutic methods of the invention (which include prophylactic treatment) in general comprise administration of a therapeutically effective amount of the compounds herein, such as a compound of the formulae herein to a subject (e.g., animal, human) in need thereof, including a mammal, particularly a human. Such treatment will be suitably administered to subjects, particularly humans, suffering from, having, susceptible to, or at risk for a disease, disorder, or symptom thereof. Determination of those subjects “at risk” can be made by any objective or subjective determination by a diagnostic test or opinion of a subject or health care provider (e.g., genetic test, enzyme or protein marker, Marker (as defined herein), family history, and the like). The compounds herein may be also used in the treatment of any other disorders in which inflammation of the intestine may be implicated.


In one embodiment, the invention provides a method of monitoring treatment progress. The method includes the step of determining a level of diagnostic marker (Marker) (e.g., any target delineated herein) or diagnostic measurement (e.g., screen, assay) in a subject suffering from or susceptible to a inflammatory bowel disease, or disorder or symptoms thereof associated with intestinal inflammation. The level of Marker determined in the method can be compared to known levels of Marker in either healthy normal controls or in other afflicted patients to establish the subject's disease status. In preferred embodiments, a second level of Marker in the subject is determined at a time point later than the determination of the first level, and the two levels are compared to monitor the course of disease or the efficacy of the therapy. In certain preferred embodiments, a pre-treatment level of Marker in the subject is determined prior to beginning treatment according to this invention; this pre-treatment level of Marker can then be compared to the level of Marker in the subject after the treatment commences, to determine the efficacy of the treatment.


The practice of the present invention employs, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry and immunology, which are well within the purview of the skilled artisan. Such techniques are explained fully in the literature, such as, “Molecular Cloning: A Laboratory Manual”, second edition (Sambrook, 1989); “Oligonucleotide Synthesis” (Gait, 1984); “Animal Cell Culture” (Freshney, 1987); “Methods in Enzymology” “Handbook of Experimental Immunology” (Weir, 1996); “Gene Transfer Vectors for Mammalian Cells” (Miller and Calos, 1987); “Current Protocols in Molecular Biology” (Ausubel, 1987); “PCR: The Polymerase Chain Reaction”, (Mullis, 1994); “Current Protocols in Immunology” (Coligan, 1991). These techniques are applicable to the production of the polynucleotides and polypeptides of the invention, and, as such, may be considered in making and practicing the invention. Particularly useful techniques for particular embodiments will be discussed in the sections that follow.


The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the assay, screening, and therapeutic methods of the invention, and are not intended to limit the scope of what the inventors regard as their invention.


EXAMPLES
Example 1: Identification of IBD Serological Markers from E. coli Proteome Chips

Sera was collected from 134 individuals (29 healthy control, 66 CD and 39 ulcerative colitis) from the Johns Hopkins Medical Institutes (Table 1).









TABLE 1







Demographic and clinical information


of IBD Patient and healthy controls.











CD
UC
HC


Patient Data/Characteristics
(n = 66)
(n = 29)
(n = 39)





Gender: female %
55  
53
43


Age: mean/st dev (yrs)
36.7 ± 13.1
38 ± 14.5
47 ± 12.4


Age at diagnosis: mean (yrs)
36.7
28


Duration of disease (yrs)
12.3
  8.9


Extraintestinal disease: n (%)
 7 (11)
 3 (10)


Surgery: n (%)
46 (70)
 4 (12)


Ethnicity: n (%)


African American
29 (44)
10 (34)


Caucasian
35 (53)
22 (75)


Hispanic
2 (3)
 0


Smoking: n (%)


Past or present
17 (26)
 5 (17)


Nonsmoker
40 (60)
18 (62)


Unknown
 9 (14)
 9 (35)


Medications: n (%)


Antibiotics
23 (35)
 4 (14)


5-ASA
49 (74)
26 (90)


Corticosteroids
16 (24)
 9 (35)


AZA/6-MP
26 (39)
17 (59)


Methotrexate
0 
1 (3)


Infliximab
15 (23)
2 (7)


Crohn's Disease Subtype: n (total)


Nonstricturing and nonpenetrating
18 (27)


Penetrating
26 (40)


Stricturing
14 (21)


Penetrating and stricturing
 8 (12)


Ulcerative Colitis: n (total)


Left Sided Colitis

13 (45)


Pancolitis

18 (62)









To identify potential biomarkers for IBD diagnosis, the antibody repertoire of the IBD patients was profiled using the E. coli proteome chips that each contained more than 4,200 individual proteins (see schematic illustration of our strategy in (FIG. 1). Since each protein was spotted in duplicate on the chip, the reproducibility of duplicates of each protein was first analyzed. As shown in FIG. 2A, the visual appearance of duplicate spots was very similar. As shown in FIG. 2B in scatter plot, the duplicate spots of each protein were highly correlated, indicating the good quality of the array manufacturing. To recognize those reactive antibodies on the chips, the chips were probed with Cy3-labeled anti-human immunoglobulin antibodies. The immunogenic profiles of both the IBD patients and healthy control were acquired by the resulting fluorescent signals. CD vs ulcerative colitis vs healthy control can be distinguished by comparing the signal intensities between protein spots on the E. coli proteome chips (see FIG. 2A, which shows visual appearance of two representative chips probed with sera from CD and healthy control, respectively). Two-level of data analyses were performed with these immunogenic profiles (i) to identify differential immunogenic responses among CD vs ulcerative colitis vs healthy control using Significance Analysis of Microarray (SAM) and Gene Ontology (GO) enrichment analysis; and (ii) to construct robust classifiers to distinguish CD vs ulcerative colitis vs healthy control using k-TSP method.


Example 2: Global Immunogenic Profiles of IBD Against E. coli

Sera samples from healthy control subjects (n=39), patients with Crohn's Disease (n=66), and patients with ulcerative colitis (n=29) (Table 1) were used to compare differences between healthy control and IBD immunogenic profiles. To investigate the differential global changes in immunogenic response to E. coli proteins among healthy control vs Crohn's Disease vs ulcerative colitis, SAM was employed as described herein above for the immunogenic profiles. For convenience, the E. coli proteins that were differentially recognized by serum antibodies from healthy control, Crohn's Disease or ulcerative colitis are referred to as “differentially-expressed immunogenic proteins” throughout. Heat maps shown in FIGS. 3A-C present a visual illustration of the differentially immunogenic proteins for each phenotype. 273 differentially immunogenic proteins were identified by SAM when compared healthy control with CD samples. 81 proteins are highly immunogenic in CD samples and 192 are highly immunogenic in healthy control samples (FIG. 3A). Conversely, 188 proteins have different immunogenic responses in the IBD subtypes, 51 and 137 are highly immunogenic in ulcerative colitis and CD samples, respectively (FIG. 3B). When healthy control and ulcerative colitis samples are compared, only 27 and 6 proteins are discriminatory and highly immunogenic in healthy control and ulcerative colitis samples, respectively (FIG. 3C). A full list of the immunogenic E. coli proteins in FIGS. 3A-C can be found in Tables 2-4, respectively.









TABLE 2







SAM ANALYSIS OF HEALTHY CONTROLS (HC) VERSUS CROHN'S DISEASE (CD) (see FIG. 3A)












SPOT
PROTEIN
NAME
GO BP
GO CC
GO MF










81 Highly immunogenic proteins in CD












yfiC
yfiC
Hypothetical protein yfiC ///


8168 // methyltransferase activity //




predicted S-adenosyl-L-


inferred from electronic annotation




methionine-dependent


/// 16740 // transferase activity //




methyltransferase


inferred from electronic annotation


era
era
GTP-binding protein Era
50875 // cellular
5622 //
166 // nucleotide binding // inferred





physiological
intracellular //
from electronic annotation /// 3676





process // inferred
inferred from
// nucleic acid binding // inferred





from electronic
electronic
from electronic annotation /// 3723





annotation
annotation ///
// RNA binding // inferred from






16020 //
electronic annotation /// 5525 //






membrane //
GTP binding // inferred from






inferred from
electronic annotation






electronic






annotation


ygbD
ygbD
nitric oxide reductase
6118 // electron

15036 // disulfide oxidoreductase





transport // inferred

activity // inferred from electronic





from electronic

annotation /// 16491 //





annotation

oxidoreductase activity // inferred







from electronic annotation /// 16731







// oxidoreductase activity, acting on







iron-sulfur proteins as donors, NAD







or NADP as acceptors


yjhO
yjhO ///
KpLE2 phage-like element;


16787 // hydrolase activity //



sgcX
predicted endoglucanase with


inferred from electronic annotation




Zn-dependent exopeptidase




domain


aidA
aidA
DNA-3-methyladenine
6281 // DNA repair

5515 // protein binding // inferred




glycosylase II /// 3-methyl-
// inferred from

from physical interaction /// 3677 //




adenine DNA glycosylase II
electronic

DNA binding // inferred from





annotation /// 6284

electronic annotation /// 3905 //





// base-excision

alkylbase DNA N-glycosylase





repair // inferred

activity // inferred from electronic





from electronic

annotation /// 16787 // hydrolase





annotation /// 6974

activity // inferred from electronic





// response to DNA

annotation





damage stimulus //





inferred from





electronic





annotation /// 5975





// carbohydrate





metabolism // inf


yhcI
yhcI ///
N-acetylmannosamine kinase
5975 //

166 // nucleotide binding // inferred



nanK

carbohydrate

from electronic annotation /// 5524





metabolism //

// ATP binding // inferred from





inferred from

electronic annotation /// 9384 // N-





electronic

acylmannosamine kinase activity //





annotation /// 6051

inferred from electronic annotation





// N-

/// 16301 // kinase activity //





acetylmannosamine

inferred from electronic annotation





metabolism //





inferred from





electronic





annotation


fliS
fliS
flagellar protein FliS
9296 // flagellum
9288 // flagellum






biogenesis //
(sensu Bacteria)





inferred from
// inferred from





electronic
electronic





annotation
annotation ///






19861 //






flagellum //






inferred from






electronic






annotation


infC
infC
Translation initiation factor IF-3
6412 // protein

3743 // translation initiation factor





biosynthesis //

activity // inferred from electronic





inferred from

annotation /// 3723 // RNA binding





electronic

// inferred from electronic





annotation /// 6413

annotation





// translational





initiation // inferred





from electronic





annotation /// 6417





// regulation of





protein biosynthesis





// inferred from





electronic





annotation /// 6445





// regulation of


metB
metB
cystathionine gamma-synthase
6520 // amino acid

3962 // cystathionine gamma-





metabolism //

synthase activity // inferred from





inferred from

electronic annotation /// 16740 //





electronic

transferase activity // inferred from





annotation /// 8652

electronic annotation /// 16829 //





// amino acid

lyase activity // inferred from





biosynthesis //

electronic annotation





inferred from





electronic





annotation /// 9086





// methionine





biosynthesis //





inferred from





electronic





annotation


purM
purM
phosphoribosylaminoimidazole
6164 // purine
5737 //
3824 // catalytic activity // inferred




synthetase
nucleotide
cytoplasm //
from electronic annotation /// 4641





biosynthesis //
inferred from
//





inferred from
electronic
phosphoribosylformylglycinamidine





electronic
annotation
cyclo-ligase activity // inferred from





annotation /// 6189

electronic annotation /// 16874 //





// ‘de novo’ IMP

ligase activity // inferred from





biosynthesis //

electronic annotation





inferred from





electronic





annotation


argC
argC
N-acetyl-gamma-glutamyl-
6520 // amino acid
5737 //
3942 // N-acetyl-gamma-glutamyl-




phosphate reductase
metabolism //
cytoplasm //
phosphate reductase activity //





inferred from
inferred from
inferred from electronic annotation





electronic
electronic
/// 16491 // oxidoreductase activity





annotation /// 6526
annotation
// inferred from electronic





// arginine

annotation /// 16620 //





biosynthesis //

oxidoreductase activity, acting on





inferred from

the aldehyde or oxo group of do





electronic





annotation /// 8652





// amino acid





biosynthesis //





inferred from





electronic





annotation /// 9085





// lysine





biosynthesis // inf


phnB
phnB
PhnB protein /// hypothetical







protein


torA
torA
Trimethylamine-N-oxide
6118 // electron
42597 //
16491 // oxidoreductase activity //




reductase 1 precursor ///
transport // inferred
periplasmic
inferred from electronic annotation




trimethylamine N-oxide
from electronic
space // inferred
/// 30151 // molybdenum ion




(TMAO) reductase I, catalytic
annotation
from electronic
binding // inferred from electronic




subunit

annotation
annotation /// 50626 //







trimethylamine-N-oxide reductase







(cytochrome c) activity // inferred







from electronic annotation


ibpB
ibpB
16 kDa heat shock protein B ///
6457 // protein

5515 // protein binding // inferred




heat shock chaperone
folding // inferred

from physical interaction /// 51082





from electronic

// unfolded protein binding //





annotation /// 6986

inferred from electronic annotation





// response to





unfolded protein //





inferred from





electronic





annotation /// 50821





// protein





stabilization //





inferred from





electronic





annotation


hycF
hycF
hydrogenase 4 Fe—S subunit ///
6118 // electron

5506 // iron ion binding // inferred




formate hydrogenlyase
transport // inferred

from electronic annotation /// 9055




complex iron-sulfur protein
from electronic

// electron carrier activity // inferred





annotation /// 6810

from electronic annotation /// 46872





// transport //

// metal ion binding // inferred from





inferred from

electronic annotation /// 51536 //





electronic

iron-sulfur cluster binding //





annotation


ycbF
ycbF
predicted periplasmic pilini
6457 // protein
9289 // fimbrium
5515 // protein binding // inferred




chaperone
folding // inferred
// inferred from
from electronic annotation /// 51082





from electronic
electronic
// unfolded protein binding //





annotation /// 7047
annotation ///
inferred from electronic annotation





// cell wall
30288 //





organization and
periplasmic





biogenesis //
space (sensu





inferred from
Proteobacteria) //





electronic
inferred from





annotation
electronic






annotation ///






42597 //






periplasmic






space // inferred






from electronic






annotation


ssi6
ssi6
hypothetical protein





yjhE
yjhE
KpLE2 phage-like element;
6810 // transport //
16020 //
5215 // transporter activity //




predicted membrane protein
inferred from
membrane //
inferred from electronic annotation




(pseudogene)
electronic
inferred from





annotation
electronic






annotation ///






16021 // integral






to membrane //






inferred from






electronic






annotation


ygeW
ygeW
ornithine carbamoyltransferase
6207 // ‘de novo’

4070 // aspartate





pyrimidine base

carbamoyltransferase activity //





biosynthesis //

inferred from electronic annotation





inferred from

/// 16597 // amino acid binding //





electronic

inferred from electronic annotation





annotation /// 6520

/// 16743 // carboxyl- and





// amino acid

carbamoyltransferase activity //





metabolism //

inferred from electronic annotation





inferred from





electronic





annotation


hofH
hofH ///
Putative general secretion
6810 // transport //
15627 // type II
8565 // protein transporter activity //



gspH
pathway protein H precursor ///
inferred from
protein secretion
inferred from electronic annotation




predicted general secretory
electronic
system complex




pathway component, cryptic
annotation /// 15628
// inferred from





// type II protein
electronic





secretion system //
annotation





inferred from





electronic





annotation


rffD
rffD ///
UDP-N-acetyl-D-
6118 // electron

16491 // oxidoreductase activity //



wecC
mannosamine dehydrogenase
transport // inferred

inferred from electronic annotation




/// UDP-N-acetyl-D-
from electronic




mannosaminuronic acid
annotation




dehydrogenase


yjhC
yjhC
KpLE2 phage-like element;
6118 // electron

16491 // oxidoreductase activity //




predicted oxidoreductase
transport // inferred

inferred from electronic annotation





from electronic





annotation /// 8152





// metabolism //





inferred from





electronic





annotation


yjcS
yjcS
Hypothetical protein yjcS


5488 // binding // inferred from







electronic annotation


ftn
ftn
Ferritin 1 /// ferritin iron
6826 // iron ion

4322 // ferroxidase activity //




storage protein (cytoplasmic)
transport // inferred

inferred from electronic annotation





from electronic

/// 5488 // binding // inferred from





annotation /// 6879

electronic annotation /// 5506 // iron





// iron ion

ion binding // inferred from





homeostasis //

electronic annotation /// 8199 //





inferred from

ferric iron binding // inferred from





electronic

electronic





annotation


ybbQ
ybbQ
2-hydroxy-3-oxopropionate
6098 // pentose-

4616 // phosphogluconate




reductase
phosphate shunt //

dehydrogenase (decarboxylating)





inferred from

activity // inferred from electronic





electronic

annotation /// 8442 // 3-





annotation /// 6573

hydroxyisobutyrate dehydrogenase





// valine metabolism

activity // inferred from electronic





// inferred from

annotation /// 8679 // 2-hydroxy-3-





electronic

oxopropionate reductase activit





annotation /// 46487





// glyoxylate





metabolism //





inferred from





electronic





annotation


ppdB
ppdB
Prepilin peptidase dependent







protein B precursor ///




hypothetical protein


fimC
fimC
Chaperone protein fimC
6457 // protein
9289 // fimbrium
5515 // protein binding // inferred




precursor /// chaperone,
folding // inferred
// inferred from
from electronic annotation /// 51082




periplasmic
from electronic
electronic
// unfolded protein binding //





annotation /// 7047
annotation ///
inferred from electronic annotation





// cell wall
30288 //





organization and
periplasmic





biogenesis //
space (sensu





inferred from
Proteobacteria) //





electronic
inferred from





annotation
electronic






annotation ///






42597 //






periplasmic






space // inferred






from electronic






annotation


dgxA
dgxA
hypothetical protein





fumB
fumB
Fumarate hydratase class I,
6091 // generation

3824 // catalytic activity // inferred




anaerobic /// anaerobic class I
of precursor

from electronic annotation /// 4333




fumarate hydratase (fumarase
metabolites and

// fumarate hydratase activity //




B)
energy // inferred

inferred from electronic annotation





from electronic

/// 5506 // iron ion binding //





annotation /// 6099

inferred from electronic annotation





// tricarboxylic acid

/// 16829 // lyase activity // inferred





cycle // inferred

from electronic annotation





from electronic





annotation


(thiS)
thiS
sulfur carrier protein ThiS
6790 // sulfur







metabolism //





inferred from





electronic





annotation


yjeJ
yjeJ
Hypothetical protein yjeJ ///







hypothetical protein


cedA
cedA
Cell division activator cedA ///
7049 // cell cycle //






cell division modulator
inferred from





electronic





annotation /// 51301





// cell division //





inferred from





electronic





annotation


cysW
cysW
sulfate/thiosulfate transporter
6810 // transport //
9276 // cell wall
5215 // transporter activity //




subunit
inferred from
(sensu
inferred from electronic annotation





electronic
Proteobacteria) //
/// 15116 // sulfate transporter





annotation /// 8272
inferred from
activity // inferred from electronic





// sulfate transport //
electronic
annotation /// 15563 // uptake





inferred from
annotation ///
permease activity // inferred from





electronic
16020 //
electronic annotation





annotation
membrane //






inferred from






electronic






annotation ///






16021 // integral






to membrane //






inferred from






electronic






annotation


ygcQ
ygcQ
Putative electron transfer
6118 // electron

9055 // electron carrier activity //




flavoprotein subunit ygcQ
transport // inferred

inferred from electronic annotation





from electronic

/// 50660 // FAD binding // inferred





annotation

from electronic annotation


rpsR
rpsR
30S ribosomal protein S18
6412 // protein
5622 //
3723 // RNA binding // inferred





biosynthesis //
intracellular //
from electronic annotation /// 3735





inferred from
inferred from
// structural constituent of ribosome





electronic
electronic
// inferred from electronic





annotation
annotation ///
annotation /// 19843 // rRNA






5840 // ribosome
binding // inferred from electronic






// inferred from
annotation






electronic






annotation ///






30529 //






ribonucleoprotein






complex //






inferred from






electronic






annotation


narY
narY
nitrate reductase 2 (NRZ), beta
6118 // electron
9325 // nitrate
5506 // iron ion binding // inferred




subunit
transport // inferred
reductase
from electronic annotation /// 8940





from electronic
complex //
// nitrate reductase activity //





annotation /// 6810
inferred from
inferred from electronic annotation





// transport //
electronic
/// 9055 // electron carrier activity //





inferred from
annotation ///
inferred from electronic annotation





electronic
16020 //
/// 16491 // oxidoreductase activity





annotation /// 42126
membrane //





// nitrate
inferred from





metabolism //
electronic





inferred from
annotation





electronic





annotation /// 42128





// nitrate





assimilation //





inferred from electr


citB
citB
Transcriptional Regulatory
160 // two-
5737 //
156 // two-component response




protein dpiA /// DNA-binding
component signal
cytoplasm //
regulator activity // inferred from




response regulator in two-
transduction system
inferred from
electronic annotation /// 3677 //




component regulatory system
(phosphorelay) //
electronic
DNA binding // inferred from




with citA
inferred from
annotation
electronic annotation /// 30528 //





electronic

transcription regulator activity //





annotation /// 6350

inferred from electronic annotation





// transcription //





inferred from





electronic





annotation /// 6355





// regulation of





transcription, DNA-





dependent //





inferred from





electronic





annotation


yjbR
yjbR
Protein yjbR /// hypothetical







protein


ybbA
ybbA
Hypothetical ABC transporter
6810 // transport //

166 // nucleotide binding // inferred




ATP-binding protein ybbA ///
inferred from

from electronic annotation /// 5524




predicted transporter subunit:
electronic

// ATP binding // inferred from




ATP-binding component of
annotation

electronic annotation /// 5525 //




ABC superfamily


GTP binding // inferred from







electronic annotation /// 16887 //







ATPase activity // inferred from







electronic annotation


gst
gst
Glutathione S-transferase ///


4364 // glutathione transferase




glutathionine S-transferase


activity // inferred from electronic







annotation /// 16740 // transferase







activity // inferred from electronic







annotation


grxC
grxC
Glutaredoxin 3
6118 // electron

9055 // electron carrier activity //





transport // inferred

inferred from electronic annotation





from electronic

/// 15035 // protein disulfide





annotation /// 6810

oxidoreductase activity // inferred





// transport //

from electronic annotation





inferred from





electronic





annotation /// 9263





//





deoxyribonucleotide





biosynthesis //





inferred from





electronic





annotation /// 45454





// cell redox





homeostasis // infer


cysD
cysD
sulfate adenylyltransferase
103 // sulfate

4781 // sulfate adenylyltransferase




subunit 2
assimilation //

(ATP) activity // inferred from





inferred from

electronic annotation /// 16740 //





electronic

transferase activity // inferred from





annotation /// 8152

electronic annotation /// 16779 //





// metabolism //

nucleotidyltransferase activity //





inferred from

inferred from electronic annotation





electronic





annotation /// 8652





// amino acid





biosynthesis //





inferred from





electronic





annotation /// 19344





// cysteine





biosynthesis //





inferred from





electronic





annotation


radC
radC
DNA repair protein RadC
6281 // DNA repair







// inferred from





electronic





annotation /// 6974





// response to DNA





damage stimulus //





inferred from





electronic





annotation


citG
citG
2-(5″-triphosphoribosyl)-3′-


16740 // transferase activity //




dephosphocoenzyme-A


inferred from electronic annotation




synthase /// triphosphoribosyl-


/// 46917 // triphosphoribosyl-




dephospho-CoA transferase


dephospho-CoA synthase activity //







inferred from electronic annotation


fdhE
fdhE
formate dehydrogenase
6118 // electron

5506 // iron ion binding // inferred




accessory protein FdhE
transport // inferred

from electronic annotation /// 9055





from electronic

// electron carrier activity // inferred





annotation

from electronic annotation /// 20037







// heme binding // inferred from







electronic annotation


fecB
fecB
KpLE2 phage-like element;
6810 // transport //
42597 //
5381 // iron ion transporter activity




iron-dicitrate transporter
inferred from
periplasmic
// inferred from electronic




subunit
electronic
space // inferred
annotation /// 5506 // iron ion





annotation /// 6811
from electronic
binding // inferred from electronic





// ion transport //
annotation
annotation





inferred from





electronic





annotation /// 6826





// iron ion transport





// inferred from





electronic





annotation /// 6827





// high affinity iron





ion transport





inferred from





electronic





annotation


yhgH
yhgH ///
Hypothetical protein yhgH ///
9116 // nucleoside





gntX
gluconate periplasmic binding
metabolism //




protein with
inferred from




phosphoribosyltransferase
electronic




domain, GNT I system
annotation


(phnE)
phnE
membrane channel protein
6810 // transport //
5887 // integral
5215 // transporter activity //




component of Pn transporter
inferred from
to plasma
inferred from electronic annotation





electronic
membrane //
/// 15604 // phosphonate transporter





annotation /// 15716
inferred from
activity // inferred from electronic





// phosphonate
electronic
annotation





transport // inferred
annotation ///





from electronic
16020 //





annotation
membrane //






inferred from






electronic






annotation ///






16021 // integral






to membrane //






inferred from






electronic






annotation


cysJ
cysJ
Sulfite reductase [NADPH]
103 // sulfate

5515 // protein binding // inferred




flavoprotein alpha-component
assimilation //

from physical interaction /// 4783 //




/// sulfite reductase, alpha
inferred from

sulfite reductase (NADPH) activity




subunit, flavoprotein
electronic

// inferred from electronic





annotation /// 6118

annotation /// 5506 // iron ion





// electron transport

binding // inferred from electronic





// inferred from

annotation /// 9055 // electron





electronic

carrier activity/





annotation /// 6810





// transport //





inferred from





electronic





annotation /// 8652





// amino acid





biosynthesis //





inferred from





electronic





annotation


445#15
ygaX
Putative transport protein ///
6810 // transport
16020 //
5215 // transporter activity




predicted transporter

membrane ///






16021 // integral






to membrane


fba
fba ///
fructose-bisphosphate aldolase
6096 // glycolysis //

5515 // protein binding // inferred



fbaA
/// fructose-bisphosphate
inferred from

from physical interaction /// 4332 //




aldolase
electronic

fructose-bisphosphate aldolase





annotation

activity // inferred from electronic







annotation /// 8270 // zinc ion







binding // inferred from electronic







annotation /// 16829 // lyase activity







// inferred from electronic







annotation


yjbI
yjbI
hypothetical protein





yfjQ
yfjQ
CP4-57 prophage; predicted







protein


mltB
mltB
Membrane-bound lytic murein
5975 //

16787 // hydrolase activity //




transglycosylase B precursor
carbohydrate

inferred from electronic annotation





metabolism //

/// 16798 // hydrolase activity,





inferred from

acting on glycosyl bonds // inferred





electronic

from electronic annotation





annotation


yhaA
yhaA ///
propionate kinase/acetate
6082 // organic acid
5622 //
8776 // acetate kinase activity //



tdcD
kinase C, anaerobic
metabolism //
intracellular //
inferred from electronic annotation





inferred from
inferred from
/// 16301 // kinase activity //





electronic
electronic
inferred from electronic annotation





annotation /// 8152
annotation
/// 16740 // transferase activity //





// metabolism //

inferred from electronic annotation





inferred from

/// 16774 // phosphotransferase





electronic

activity, c





annotation /// 16310





// phosphorylation //





inferred from





electronic





annotation


yjeB
yjeB
Hypothetical protein yjeB ///
6412 // protein

3677 // DNA binding // inferred




predicted DNA-binding
biosynthesis //

from electronic annotation /// 4826




transcriptional regulator
inferred from

// phenylalanine-tRNA ligase





electronic

activity // inferred from electronic





annotation

annotation /// 5524 // ATP binding







// inferred from electronic







annotation


thiF
thiF
thiamine biosynthesis protein
9228 // thiamin

3824 // catalytic activity // inferred




ThiF
biosynthesis //

from electronic annotation /// 16740





inferred from

// transferase activity // inferred





electronic

from electronic annotation /// 16779





annotation

// nucleotidyltransferase activity //







inferred from electronic annotation


gcpE
gcpE ///
4-hydroxy-3-methylbut-2-en-
8299 // isoprenoid

5506 // iron ion binding // inferred



ispG
1-yl diphosphate synthase /// 4-
biosynthesis //

from electronic annotation /// 16491




hydroxy-3-methylbut-2-en-1-yl
inferred from

// oxidoreductase activity // inferred




diphosphate synthase
electronic

from electronic annotation /// 16728





annotation /// 16114

// oxidoreductase activity, acting on





// terpenoid

CH2 groups, disulfide as acceptor //





biosynthesis //

inferred from electronic annotation





inferred from





electronic





annotation


mviN
mviN
Virulence factor mviN
9405 //
16020 //





homolog /// predicted inner
pathogenesis //
membrane //




membrane protein
inferred from
inferred from





electronic
electronic





annotation
annotation ///






16021 // integral






to membrane //






inferred from






electronic






annotation


yihK
yihK ///
GTP-binding protein
6412 // protein
5622 //
5515 // protein binding // inferred



bipA
typA/BipA /// GTP-binding
biosynthesis //
intracellular //
from physical interaction /// 166 //




protein
inferred from
inferred from
nucleotide binding // inferred from





electronic
electronic
electronic annotation /// 5525 //





annotation
annotation
GTP binding // inferred from







electronic annotation


ubiG
ubiG
3-demethylubiquinone-9 3-
6744 // ubiquinone

5515 // protein binding // inferred




methyltransferase
biosynthesis //

from physical interaction /// 8168 //





inferred from

methyltransferase activity // inferred





electronic

from electronic annotation /// 8425





annotation

// 2-polyprenyl-6-methoxy-1,4-







benzoquinone methyltransferase







activity // inferred from electronic







annotati


yejG
yejG
Hypothetical protein yejG ///







hypothetical protein


304#1
lsrB
AI2 transporter





ygfY
ygfY
Hypothetical protein ygfY ///







hypothetical protein


319#17
ydhZ
Hypothetical protein ydhZ ///







hypothetical protein


336#6


430#8
iscR
Hypothetical protein yfhP ///







DNA-binding transcriptional




repressor


yhfR
yhfR ///
predicted DNA-binding
6350 // transcription
5622 //
5515 // protein binding // inferred



frlR
transcriptional regulator
// inferred from
intracellular //
from physical interaction /// 3677 //





electronic
inferred from
DNA binding // inferred from





annotation /// 6355
electronic
electronic annotation /// 3700 //





// regulation of
annotation
transcription factor activity //





transcription, DNA-

inferred from electronic annotation





dependent //

/// 30528 // transcription regulator





inferred from

activity //





electronic





annotation /// 45449





// regulation of





transcription //





inferred from





electronic





annotation


phnG
phnG
PhnG protein /// carbon-
15716 //






phosphorus lyase complex
phosphonate




subunit
transport // inferred





from electronic





annotation /// 19634





// phosphonate





metabolism //





inferred from





electronic





annotation


ymfE
ymfE
e14 prophage; predicted inner

16020 //





membrane protein

membrane //






inferred from






electronic






annotation ///






16021 // integral






to membrane //






inferred from






electronic






annotation


yejO
yejO
predicted autotransporter outer
7155 // cell
16020 //
5524 // ATP binding // inferred




membrane protein
adhesion // inferred
membrane //
from electronic annotation





from electronic
inferred from





annotation
electronic






annotation ///






19867 // outer






membrane //






inferred from






electronic






annotation


dicC
dicC
Qin prophage; DNA-binding
6350 // transcription

3677 // DNA binding // inferred




transcriptional regulator for
// inferred from

from electronic annotation




DicB
electronic





annotation /// 6355





// regulation of





transcription, DNA-





dependent //





inferred from





electronic





annotation /// 7049





// cell cycle //





inferred from





electronic





annotation /// 51301





// cell division //





inferred from





electronic





annotation


galR
galR
Galactose operon repressor ///
5975 //
5622 //
3677 // DNA binding // inferred




DNA-binding transcriptional
carbohydrate
intracellular //
from electronic annotation /// 3700




repressor
metabolism //
inferred from
// transcription factor activity //





inferred from
electronic
inferred from electronic annotation





electronic
annotation





annotation /// 6012





// galactose





metabolism //





inferred from





electronic





annotation /// 6350





// transcription //





inferred from





electronic





annotation /// 6355





// regulation of





transcription, DNA-





de


yphC
yphC
Hypothetical zinc-type alcohol


8270 // zinc ion binding // inferred




dehydrogenase-like protein


from electronic annotation /// 16491




yphC


// oxidoreductase activity // inferred







from electronic annotation /// 46872







// metal ion binding // inferred from







electronic annotation


rplT
rplT
50S ribosomal protein L20
27 // ribosomal
5622 //
3723 // RNA binding // inferred





large subunit
intracellular //
from electronic annotation /// 3735





assembly and
inferred from
// structural constituent of ribosome





maintenance //
electronic
// inferred from electronic





inferred from
annotation ///
annotation /// 19843 // rRNA





electronic
5840 // ribosome
binding // inferred from electronic





annotation /// 6412
// inferred from
annotation





// protein
electronic





biosynthesis //
annotation ///





inferred from
30529 //





electronic
ribonucleoprotein





annotation
complex //






inferred from






electronic






annotation


267#6
paaJ
acetyl-CoA acetyltransferase


8415 // acyltransferase activity ///







16740 // transferase activity


selD
selD
selenophosphate synthetase


166 // nucleotide binding // inferred







from electronic annotation /// 287 //







magnesium ion binding // inferred







from electronic annotation /// 3824







// catalytic activity // inferred from







electronic annotation /// 4756 //







selenide, water dikinase activity //


tdcB
tdcB
threonine dehydratase
6520 // amino acid

3824 // catalytic activity // inferred





metabolism //

from electronic annotation /// 4794





inferred from

// threonine ammonia-lyase activity





electronic

// inferred from electronic





annotation /// 8152

annotation /// 16829 // lyase activity





// metabolism //

// inferred from electronic





inferred from

annotation





electronic





annotation


yhfV
yhfV
Phosphotriesterase homology
9056 // catabolism //

8270 // zinc ion binding // inferred




protein
inferred from

from electronic annotation /// 16788





electronic

// hydrolase activity, acting on ester





annotation

bonds // inferred from electronic







annotation


yjaI
yjaI ///
Zinc resistance-associated

42597 //
8270 // zinc ion binding // inferred



zraP
protein precursor /// Zn-

periplasmic
from electronic annotation




binding periplasmic protein

space // inferred






from electronic






annotation


hycA
hycA
Formate hydrogenlyase
6350 // transcription

16829 // lyase activity // inferred




Regulatory protein hycA ///
// inferred from

from electronic annotation




regulator of the transcriptional
electronic




regulator FhlA
annotation /// 6355





// regulation of





transcription, DNA-





dependent //





inferred from





electronic





annotation







192 Highly immunogenic response proteins in HC












pbuX
pbuX
hypothetical protein





fabH
fabH
3-oxoacyl-(acyl carrier protein) synthase
6633 // fatty acid

3824 // catalytic





biosynthesis // inferred from

activity // inferred





electronic annotation ///

from electronic





8610 // lipid biosynthesis //

annotation /// 4315





inferred from electronic

// 3-oxoacyl-[acyl-





annotation

carrier protein]







synthase activity //







inferred from







electronic







annotation /// 8415







// acyltransferase







activity // inferred







from electronic







annotation /// 16740/


glpF
glpF
Glycerol uptake facilitator protein ///
6810 // transport // inferred
16020 // membrane //
287 // magnesium




glycerol facilitator
from electronic annotation
inferred from
ion binding //






electronic annotation
inferred from






/// 16021 // integral
electronic






to membrane //
annotation /// 5215






inferred from
// transporter






electronic annotation
activity // inferred







from electronic







annotation /// 46872







// metal ion binding







// inferred from







electronic







annotation


273#6
ydcU
Hypothetical ABC transporter permease
6810 // transport
16020 // membrane
5215 // transporter




protein ydcU /// predicted

/// 16021 // integral
activity




spermidine/putrescine transporter subunit

to membrane


ybhR
ybhR
Hypothetical protein ybhR /// predicted
6810 // transport // inferred
16020 // membrane //
5524 // ATP binding




transporter subunit: membrane component
from electronic annotation
inferred from
// inferred from




of ABC superfamily

electronic annotation
electronic






/// 16021 // integral
annotation /// 42626






to membrane //
// ATPase activity,






inferred from
coupled to






electronic annotation
transmembrane







movement of







substances //







inferred from







electronic







annotation


yqcE
yqcE
Hypothetical protein yqcE /// predicted
6810 // transport // inferred
16020 // membrane //
5215 // transporter




transporter
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


flhD
flhD
transcriptional activator FlhD
6350 // transcription //
19861 // flagellum //
3677 // DNA





inferred from electronic
inferred from
binding // inferred





annotation /// 6355 //
electronic annotation
from electronic





regulation of transcription,

annotation /// 3700





DNA-dependent // inferred

// transcription





from electronic annotation

factor activity //





/// 9296 // flagellum

inferred from





biogenesis // inferred from

electronic





electronic annotation ///

annotation /// 16563





45893 // positive regula

// transcriptional







activator activity //







inferred from







electronic







annotation


trkG
trkG
Rac prophage; potassium transporter
6810 // transport // inferred
16020 // membrane //
8324 // cation




subunit
from electronic annotation
inferred from
transporter activity





/// 6811 // ion transport //
electronic annotation
// inferred from





inferred from electronic
/// 16021 // integral
electronic





annotation /// 6812 // cation
to membrane //
annotation /// 30955





transport // inferred from
inferred from
// potassium ion





electronic annotation ///
electronic annotation
binding // inferred





6813 // potassium ion

from electronic





transport // inferred from

annotation





electronic annotation


ybdS
ybdS
Citrate carrier/transporter
6814 // sodium ion transport
16020 // membrane //
5215 // transporter





// inferred from electronic
inferred from
activity // inferred





annotation
electronic annotation
from electronic







annotation


brnQ
brnQ
Branched-chain amino acid transport
6810 // transport // inferred
16020 // membrane //
15171 // amino acid




system II carrier protein /// predicted
from electronic annotation
inferred from
transporter activity




branched chain amino acid transporter
/// 6865 // amino acid
electronic annotation
// inferred from




(LIV-II)
transport // inferred from
/// 16021 // integral
electronic





electronic annotation ///
to membrane //
annotation /// 15658





15803 // branched-chain
inferred from
// branched-chain





aliphatic amino acid
electronic annotation
aliphatic amino acid





transport // inferred from

transporter activity





electronic annotation

// inferred from







electronic







annotation


ycaD
ycaD
putative MFS family transporter protein
6810 // transport // inferred
16020 // membrane //
5215 // transporter





from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


ybhN
ybhN
Hypothetical protein ybhN /// conserved

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


yabK
yabK ///
thiamin ABC transporter membrane
6810 // transport // inferred
9276 // cell wall
5215 // transporter



thiP
component
from electronic annotation
(sensu
activity // inferred






Proteobacteria) //
from electronic






inferred from
annotation






electronic annotation






/// 16020 //






membrane // inferred






from electronic






annotation /// 16021






// integral to






membrane // inferred






from electronic






annotation


ycdG
ycdG
Putative purine permease ycdG ///
6810 // transport // inferred
16020 // membrane //
5215 // transporter




predicted transporter
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


yojI
yojI
Hypothetical ABC transporter ATP-
6810 // transport // inferred
16020 // membrane //
166 // nucleotide




binding protein yojI /// fused predicted
from electronic annotation
inferred from
binding // inferred




multidrug transport subunits of ABC
/// 15833 // peptide transport
electronic annotation
from electronic




superfamily: membrane component/ATP-
// inferred from electronic
/// 16021 // integral
annotation /// 5524




binding component
annotation /// 46677 //
to membrane //
// ATP binding //





response to antibiotic //
inferred from
inferred from





inferred from electronic
electronic annotation
electronic





annotation

annotation /// 15197







// peptide







transporter activity







// inferred from







electronic







annotation /// 16887







// ATPase activity //







inferred from







electronic







annotation


ybaN
ybaN
Hypothetical protein ybaN /// conserved

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


focA
focA
F1C major fimbrial subunit precursor
7155 // cell adhesion //
9289 // fimbrium //






inferred from electronic
inferred from





annotation
electronic annotation


321#3


yciR
yciR
Hypothetical protein yciR
7165 // signal transduction //

4871 // signal





inferred from electronic

transducer activity //





annotation

inferred from







electronic







annotation


427#1
yfgF
Hypothetical protein yfgF /// predicted

16020 // membrane





inner membrane protein

/// 16021 // integral






to membrane


celD
celD ///
Cel operon repressor /// DNA-binding
6350 // transcription //
5622 // intracellular
3677 // DNA



chbR
transcriptional dual regulator
inferred from electronic
// inferred from
binding // inferred





annotation /// 6355 //
electronic annotation
from electronic





regulation of transcription,

annotation /// 3700





DNA-dependent // inferred

// transcription





from electronic annotation

factor activity //





/// 45449 // regulation of

inferred from





transcription // inferred from

electronic





electronic annotation

annotation /// 43565







// sequence-specific







DNA binding //







inferred from







electronic







annotation


uidB
uidB
Glucuronide carrier protein /// glucuronide
6810 // transport // inferred
16020 // membrane //
5215 // transporter




transporter
from electronic annotation
inferred from
activity // inferred





/// 6814 // sodium ion
electronic annotation
from electronic





transport // inferred from
/// 16021 // integral
annotation /// 5351





electronic annotation
to membrane //
// sugar porter






inferred from
activity // inferred






electronic annotation
from electronic







annotation /// 15293







// symporter activity







// inferred from







electronic







annotation


ydjS
ydjS ///
succinylglutamate desuccinylase ///
6525 // arginine metabolism

8270 // zinc ion



astE
succinylglutamate desuccinylase
// inferred from electronic

binding // inferred





annotation /// 6527 //

from electronic





arginine catabolism //

annotation /// 16787





inferred from electronic

// hydrolase activity





annotation /// 8152 //

// inferred from





metabolism // inferred from

electronic





electronic annotation

annotation /// 16788







// hydrolase activity,







acting on ester







bonds // inferred







from electronic







annotation /// 46872







// metal ion bin


rocE
rocE
hypothetical protein





emrY
emrY
Multidrug resistance protein Y /// predicted
6810 // transport // inferred
16020 // membrane //
5215 // transporter




multidrug efflux system
from electronic annotation
inferred from
activity // inferred





/// 6306 // DNA methylation
electronic annotation
from electronic





// inferred from electronic
/// 16021 // integral
annotation /// 3677





annotation
to membrane //
// DNA binding //






inferred from
inferred from






electronic annotation
electronic







annotation


cydC
cydC
Transport ATP-binding protein cydC ///
6810 // transport // inferred
16020 // membrane //
166 // nucleotide




fused cysteine transporter subunits of ABC
from electronic annotation
inferred from
binding // inferred




superfamily: membrane component/ATP-

electronic annotation
from electronic




binding component

/// 16021 // integral
annotation /// 5524






to membrane //
// ATP binding //






inferred from
inferred from






electronic annotation
electronic







annotation /// 16887







// ATPase activity //







inferred from







electronic







annotation /// 17111







// nucleoside-







triphosphatase







activity // inferred







from electronic







annotation


yhhS
yhhS
hypothetical protein /// predicted
6810 // transport // inferred
16020 // membrane //
5215 // transporter




transporter
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


406#7
yfcH
Hypothetical protein yfcH /// conserved
9225 // nucleotide-sugar

3824 // catalytic




protein with NAD(P)-binding Rossmann-
metabolism

activity /// 51287 //




fold domain


NAD binding


atoE
atoE
Short-chain fatty acids transporter /// short
6810 // transport // inferred
16020 // membrane //
15635 // short-chain




chain fatty acid transporter
from electronic annotation
inferred from
fatty acid





/// 15912 // short-chain fatty
electronic annotation
transporter activity





acid transport // inferred
/// 16021 // integral
// inferred from





from electronic annotation
to membrane //
electronic






inferred from
annotation






electronic annotation


ybgE
ybgE
Protein ybgE /// conserved inner







membrane protein


JW0438
mdlA
Multidrug resistance-like ATP-binding
6810 // transport
16021 // integral to
166 // nucleotide




protein mdlA

membrane
binding /// 5524 //







ATP binding ///







16887 // ATPase







activity /// 17111 //







nucleoside-







triphosphatase







activity /// 42626 //







ATPase activity,







coupled to







transmembrane







movement of







substances


(yhhT)
yhhT
Hypothetical protein yhhT /// predicted

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


ybhM
ybhM
Hypothetical protein ybhM





yicO
yicO
Hypothetical protein yicO /// predicted
6810 // transport // inferred
16020 // membrane //
5215 // transporter




xanthine/uracil permase
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


ybhL
ybhL
Hypothetical protein ybhL /// predicted

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


yhiQ
yhiQ
Hypothetical protein yhiQ





ydaA
ydaA ///
Protein ydaA /// stress-induced protein
6950 // response to stress //





uspE

inferred from electronic





annotation


ydjZ
ydjZ
Hypothetical protein ydjZ /// conserved

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


dnaQ
dnaQ
DNA polymerase III subunit epsilon
6260 // DNA replication //
5622 // intracellular
287 // magnesium





inferred from electronic
// inferred from
ion binding //





annotation
electronic annotation
inferred from







electronic







annotation /// 3677







// DNA binding //







inferred from







electronic







annotation /// 3887







// DNA-directed







DNA polymerase







activity // inferred







from electronic







annotation /// 4518







// nuclease activity







// inferred from







electronic







annotation


yidY
yidY ///
Hypothetical transport protein yidY ///
6810 // transport // inferred
16020 // membrane //
5215 // transporter



mdtL
multidrug efflux system protein
from electronic annotation
inferred from
activity // inferred





/// 46677 // response to
electronic annotation
from electronic





antibiotic // inferred from
/// 16021 // integral
annotation





electronic annotation
to membrane //






inferred from






electronic annotation


211#11


dgkA
dgkA
Diacylglycerol kinase
8654 // phospholipid
16020 // membrane //
4143 //





biosynthesis // inferred from
inferred from
diacylglycerol





electronic annotation
electronic annotation
kinase activity //






/// 16021 // integral
inferred from






to membrane //
electronic






inferred from
annotation /// 16301






electronic annotation
// kinase activity //







inferred from







electronic







annotation /// 16740







// transferase







activity // inferred







from electronic







annotation


secF
secF
protein export protein SecF
6605 // protein targeting //
9276 // cell wall
8565 // protein





inferred from electronic
(sensu
transporter activity





annotation /// 6810 //
Proteobacteria) //
// inferred from





transport // inferred from
inferred from
electronic





electronic annotation ///
electronic annotation
annotation /// 15450





6886 // intracellular protein
/// 15627 // type II
// protein





transport // inferred from
protein secretion
translocase activity





electronic annotation ///
system complex //
// inferred from





15031 // protein transport //
inferred from
electronic





inferred from electronic
electronic annotation
annotation





annotation
/// 16020 //






membrane // inferred






from electronic






annotation /// 16021






// integral


ybbC
ybbC
hypothetical protein





fadA
fadA
acetyl-CoA acetyltransferase
6629 // lipid metabolism //

3988 // acetyl-CoA





inferred from electronic

C-acyltransferase





annotation /// 6631 // fatty

activity // inferred





acid metabolism // inferred

from electronic





from electronic annotation

annotation /// 8415





/// 16042 // lipid catabolism

// acyltransferase





// inferred from electronic

activity // inferred





annotation

from electronic







annotation /// 16740







// transferase







activity // inferred







from electronic







annotation


fepD
fepD
Ferric enterobactin transport system
6810 // transport // inferred
16020 // membrane //
5215 // transporter




permease protein fepD
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic







annotation


sdhD
sdhD
succinate dehydrogenase cytochrome b556
6099 // tricarboxylic acid
16020 // membrane //
5506 // iron ion




small membrane subunit
cycle // inferred from
inferred from
binding // inferred





electronic annotation ///
electronic annotation
from electronic





6118 // electron transport //
/// 16021 // integral
annotation /// 46872





inferred from electronic
to membrane //
// metal ion binding





annotation /// 6810 //
inferred from
// inferred from





transport // inferred from
electronic annotation
electronic





electronic annotation

annotation


yeiO
yeiO ///
Sugar efflux transporter B ///
6810 // transport // inferred
16020 // membrane //
5215 // transporter



setB
lactose/glucose efflux system
from electronic annotation
inferred from
activity // inferred





/// 8643 // carbohydrate
electronic annotation
from electronic





transport // inferred from
/// 16021 // integral
annotation /// 5351





electronic annotation
to membrane //
// sugar porter






inferred from
activity // inferred






electronic annotation
from electronic







annotation /// 15542







// sugar efflux







transporter activity







// inferred from







electronic







annotation


yhfU
yhfU
Hypothetical protein yhfU /// hypothetical







protein


(yeeF)
yeeF
Hypothetical transport protein yeeF ///
6810 // transport // inferred
16020 // membrane //
5279 // amino acid-




predicted amino-acid transporter
from electronic annotation
inferred from
polyamine





/// 6865 // amino acid
electronic annotation
transporter activity





transport // inferred from
/// 16021 // integral
// inferred from





electronic annotation
to membrane //
electronic






inferred from
annotation /// 15171






electronic annotation
// amino acid







transporter activity







// inferred from







electronic







annotation


yaeG
yaeG ///
Carbohydrate diacid regulator /// DNA-
6350 // transcription //





cdaR
binding transcriptional activator
inferred from electronic





annotation /// 6355 //





regulation of transcription,





DNA-dependent // inferred





from electronic annotation


nac
nac
Nitrogen assimilation Regulatory protein
6350 // transcription //

3677 // DNA




nac /// DNA-binding transcriptional dual
inferred from electronic

binding // inferred




regulator of nitrogen assimilation
annotation /// 6355 //

from electronic





regulation of transcription,

annotation /// 3700





DNA-dependent // inferred

// transcription





from electronic annotation

factor activity //





/// 42128 // nitrate

inferred from





assimilation // inferred from

electronic





electronic annotation

annotation


msbA
msbA
Probable transport ATP-binding protein
6810 // transport // inferred
9276 // cell wall
166 // nucleotide




msbA /// fused lipid transporter subunits of
from electronic annotation
(sensu
binding // inferred




ABC superfamily: membrane
/// 6869 // lipid transport //
Proteobacteria) //
from electronic




component/ATP-binding component
inferred from electronic
inferred from
annotation /// 5524





annotation
electronic annotation
// ATP binding //






/// 16020 //
inferred from






membrane // inferred
electronic






from electronic
annotation /// 16887






annotation /// 16021
// ATPase activity //






// integral to
inferred from






membrane // inferred
electronic






from electronic
annotation /// 17111






annotation
// nucleoside-







triphosphatase







activity // inferred







from electronic







annotation


narI
narI
Respiratory nitrate reductase 1 gamma
6118 // electron transport //
9325 // nitrate
5506 // iron ion




chain /// nitrate reductase 1, gamma
inferred from electronic
reductase complex //
binding // inferred




(cytochrome b(NR)) subunit
annotation /// 6810 //
inferred from
from electronic





transport // inferred from
electronic annotation
annotation /// 8940





electronic annotation ///
/// 16020 //
// nitrate reductase





42128 // nitrate assimilation
membrane // inferred
activity // inferred





// inferred from electronic
from electronic
from electronic





annotation
annotation /// 16021
annotation /// 16491






// integral to
// oxidoreductase






membrane // inferred
activity // inferred






from electronic
from electronic






annotation
annotation /// 46872







// metal ion binding







// inferred from







electronic







annotation


oppC
oppC
Oligopeptide transport system permease
6810 // transport // inferred
16020 // membrane //
5215 // transporter




protein oppC /// oligopeptide transporter
from electronic annotation
inferred from
activity // inferred




subunit
/// 6857 // oligopeptide
electronic annotation
from electronic





transport // inferred from
/// 16021 // integral
annotation /// 15198





electronic annotation ///
to membrane //
// oligopeptide





15031 // protein transport //
inferred from
transporter activity





inferred from electronic
electronic annotation
// inferred from





annotation

electronic







annotation


yibQ
yibQ
Hypothetical protein yibQ precursor ///







predicted polysaccharide deacetylase


pheP
pheP
Phenylalanine-specific permease ///
6810 // transport // inferred
16020 // membrane //
5279 // amino acid-




phenylalanine transporter
from electronic annotation
inferred from
polyamine





/// 6865 // amino acid
electronic annotation
transporter activity





transport // inferred from
/// 16021 // integral
// inferred from





electronic annotation
to membrane //
electronic






inferred from
annotation /// 15171






electronic annotation
// amino acid







transporter activity







// inferred from







electronic







annotation


16-3B0


emrD
emrD
Multidrug resistance protein D ///
6810 // transport // inferred
16020 // membrane //
5215 // transporter




multidrug efflux system protein
from electronic annotation
inferred from
activity // inferred





/// 15893 // drug transport //
electronic annotation
from electronic





inferred from electronic
/// 16021 // integral
annotation /// 15238





annotation
to membrane //
// drug transporter






inferred from
activity // inferred






electronic annotation
from electronic







annotation


ydeZ
ydeZ ///
AI2 transporter
6810 // transport // inferred
16020 // membrane //
5215 // transporter



lsrD

from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


280#1
ddpX
D-ala-D-ala dipeptidase, Zn-dependent
6508 // proteolysis
5618 // cell wall
8233 // peptidase







activity /// 8237 //







metallopeptidase







activity /// 16787 //







hydrolase activity







/// 16805 //







dipeptidase activity


ybfC
ybfC
hypothetical protein





ydcD
ydcD
hypothetical protein





ygjR
ygjR
Hypothetical oxidoreductase ygjR ///
6118 // electron transport //
16020 // membrane //
16491 //




predicted NAD(P)-binding dehydrogenase
inferred from electronic
inferred from
oxidoreductase





annotation /// 8152 //
electronic annotation
activity // inferred





metabolism // inferred from
/// 16021 // integral
from electronic





electronic annotation
to membrane //
annotation






inferred from






electronic annotation


yehY
yehY
Hypothetical ABC transporter permease
6810 // transport // inferred
16020 // membrane //
5215 // transporter




protein yehY /// predicted transporter
from electronic annotation
inferred from
activity // inferred




subunit: membrane component of ABC

electronic annotation
from electronic




superfamily

/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


ppx
ppx
Exopolyphosphatase

16020 // membrane //
287 // magnesium






inferred from
ion binding //






electronic annotation
inferred from







electronic







annotation /// 4309







//







exopolyphosphatase







activity // inferred







from electronic







annotation /// 16787







// hydrolase activity







// inferred from







electronic







annotation


nagE
nagE
PTS system, N-acetylglucosamine-specific
6810 // transport // inferred
16020 // membrane //
5351 // sugar porter




IIABC component
from electronic annotation
inferred from
activity // inferred





/// 9401 //
electronic annotation
from electronic





phosphoenolpyruvate-
/// 16021 // integral
annotation /// 8982





dependent sugar
to membrane //
// protein-N(PI)-





phosphotransferase system //
inferred from
phosphohistidine-





inferred from electronic
electronic annotation
sugar





annotation
/// 19866 // organelle
phosphotransferase






inner membrane //
activity // inferred






inferred from
from electronic






electronic annotation
annotation


kch
kch
Putative potassium channel protein ///
6810 // transport // inferred
16020 // membrane //
5216 // ion channel




voltage-gated potassium channel
from electronic annotation
inferred from
activity // inferred





/// 6811 // ion transport //
electronic annotation
from electronic





inferred from electronic
/// 16021 // integral
annotation





annotation /// 6813 //
to membrane //





potassium ion transport //
inferred from





inferred from electronic
electronic annotation





annotation


yjeM
yjeM
Hypothetical transporter yjeM /// predicted
6810 // transport // inferred
16020 // membrane //
5279 // amino acid-




transporter
from electronic annotation
inferred from
polyamine





/// 6865 // amino acid
electronic annotation
transporter activity





transport // inferred from
/// 16021 // integral
// inferred from





electronic annotation
to membrane //
electronic






inferred from
annotation






electronic annotation


ybfB
ybfB
predicted inner membrane protein

16020 // membrane //







inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


279#6
ddpC
D-ala-D-ala transporter subunit
6810 // transport
16020 // membrane
5215 // transporter






/// 16021 // integral
activity






to membrane


aqpZ
aqpZ
aquaporin Z
6810 // transport // inferred
16020 // membrane //
5215 // transporter





from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation /// 15250






to membrane //
// water channel






inferred from
activity // inferred






electronic annotation
from electronic







annotation


yhjX
yhjX
Hypothetical protein yhjX
6810 // transport // inferred
16020 // membrane //
5215 // transporter





from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation /// 15297






to membrane //
// antiporter activity






inferred from
// inferred from






electronic annotation
electronic







annotation


malX
malX
PTS system, maltose and glucose-specific
6810 // transport // inferred
16020 // membrane //
5351 // sugar porter




IIABC component /// fused maltose and
from electronic annotation
inferred from
activity // inferred




glucose-specific PTS enzymes: IIB
/// 9401 //
electronic annotation
from electronic




component -! IIC component
phosphoenolpyruvate-
/// 16021 // integral
annotation /// 8982





dependent sugar
to membrane //
// protein-N(PI)-





phosphotransferase system //
inferred from
phosphohistidine-





inferred from electronic
electronic annotation
sugar





annotation

phosphotransferase







activity // inferred







from electronic







annotation /// 16740







// transferase







activity // inferred







from electronic







annotati


ycbM
ycbM ///
Putative aliphatic sulfonates transport
6810 // transport // inferred
16020 // membrane //
5215 // transporter



ssuC
permease protein ssuC /// alkanesulfonate
from electronic annotation
inferred from
activity // inferred




transporter subunit

electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


narU
narU
Nitrite extrusion protein 2 /// nitrate/nitrite
6810 // transport // inferred
16020 // membrane //
15103 // inorganic




transporter
from electronic annotation
inferred from
anion transporter





/// 15698 // inorganic anion
electronic annotation
activity // inferred





transport // inferred from
/// 16021 // integral
from electronic





electronic annotation ///
to membrane //
annotation





42128 // nitrate assimilation
inferred from





// inferred from electronic
electronic annotation





annotation


lpxC
lpxC
UDP-3-O-[3-hydroxymyristoyl] N-
8610 // lipid biosynthesis //

8759 // UDP-3-O-




acetylglucosamine deacetylase
inferred from electronic

[3-





annotation /// 9245 // lipid A

hydroxymyristoyl]





biosynthesis // inferred from

N-





electronic annotation

acetylglucosamine







deacetylase activity







// inferred from







electronic







annotation /// 16787







// hydrolase activity







// inferred from







electronic







annotation


secY/prlA
secY ///
preprotein translocase SecY /// protein
6605 // protein targeting //
16020 // membrane //
15450 // protein



prlA
translocase subunit SecY
inferred from electronic
inferred from
translocase activity





annotation /// 6810 //
electronic annotation
// inferred from





transport // inferred from
/// 16021 // integral
electronic





electronic annotation ///
to membrane //
annotation





9306 // protein secretion //
inferred from





inferred from electronic
electronic annotation





annotation /// 15031 //





protein transport // inferred





from electronic annotation


(yhcP)
yhcP
Hypothetical protein yhcP /// p-
6810 // transport // inferred
16020 // membrane //





hydroxybenzoic acid efflux system
from electronic annotation
inferred from




component

electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


phsE
phsE ///
Penicillin-binding protein 6B precursor ///
6508 // proteolysis // inferred
5618 // cell wall //
4180 //



dacD
D-alanyl-D-alanine carboxypeptidase
from electronic annotation
inferred from
carboxypeptidase




(penicillin-binding protein 6b)
/// 8360 // regulation of cell
electronic annotation
activity // inferred





shape // inferred from
/// 16020 //
from electronic





electronic annotation ///
membrane // inferred
annotation /// 8233





9252 // peptidoglycan
from electronic
// peptidase activity





biosynthesis // inferred from
annotation
// inferred from





electronic annotation

electronic







annotation /// 9002







// serine-type D-







Ala-D-Ala







carboxypeptidase







activity // inferred







from electronic







annotation /// 16787/


hemY
hemY
predicted protoheme IX synthesis protein
6779 // porphyrin
16020 // membrane //






biosynthesis // inferred from
inferred from





electronic annotation
electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


yciS
yciS
Hypothetical protein yciS /// conserved

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


malZ
malZ
Maltodextrin glucosidase
5975 // carbohydrate

3824 // catalytic





metabolism // inferred from

activity // inferred





electronic annotation

from electronic







annotation /// 4558







// alpha-glucosidase







activity // inferred







from electronic







annotation /// 16787







// hydrolase activity







// inferred from







electronic







annotation /// 16798







// hydrolase activity,







acting


ymdD
ymdD ///
glucans biosynthesis protein
9250 // glucan biosynthesis
16020 // membrane //
8415 //



mdoC

// inferred from electronic
inferred from
acyltransferase





annotation
electronic annotation
activity // inferred






/// 16021 // integral
from electronic






to membrane //
annotation /// 16740






inferred from
// transferase






electronic annotation
activity // inferred







from electronic







annotation /// 16741







// transferase







activity, transferring







one-carbon groups //







inferred from







electronic







annotation /// 1


316#4
rsxA
hypothetical protein





rfaB
rfaB
UDP-D-
9058 // biosynthesis //

16740 // transferase




galactose:(glucosyl)lipopolysaccharide-
inferred from electronic

activity // inferred




1,6-D-galactosyltransferase
annotation /// 9103 //

from electronic





lipopolysaccharide

annotation /// 16757





biosynthesis // inferred from

// transferase





electronic annotation

activity, transferring







glycosyl groups //







inferred from







electronic







annotation


emrB
emrB
multidrug efflux system protein
6810 // transport // inferred
16020 // membrane //
5215 // transporter





from electronic annotation
inferred from
activity // inferred





/// 46677 // response to
electronic annotation
from electronic





antibiotic // inferred from
/// 16021 // integral
annotation





electronic annotation
to membrane //






inferred from






electronic annotation


356#7
yegJ
hypothetical protein





fsr
fsr
Fosmidomycin resistance protein ///
6810 // transport // inferred
16020 // membrane //
5215 // transporter




predicted fosmidomycin efflux system
from electronic annotation
inferred from
activity // inferred





/// 46677 // response to
electronic annotation
from electronic





antibiotic // inferred from
/// 16021 // integral
annotation





electronic annotation
to membrane //






inferred from






electronic annotation


yigF
yigF
conserved inner membrane protein





233#6
yceJ
Cytochrome b561 homolog 2 /// predicted
6118 // electron transport ///
16020 // membrane
5506 // iron ion




cytochrome b561
6810 // transport
/// 16021 // integral
binding /// 46872 //






to membrane
metal ion binding


331#2
yeaE
Hypothetical protein yeaE


16491 //







oxidoreductase







activity


mrdB
mrdB
Rod shape-determining protein rodA
7049 // cell cycle // inferred
9276 // cell wall






from electronic annotation
(sensu





/// 8360 // regulation of cell
Proteobacteria) //





shape // inferred from
inferred from





electronic annotation
electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


thiL
thiL
thiamine monophosphate kinase
9228 // thiamin biosynthesis

9030 // thiamin





// inferred from electronic

phosphate kinase





annotation

activity // inferred







from electronic







annotation /// 16301







// kinase activity //







inferred from







electronic







annotation /// 16740







// transferase







activity // inferred







from electronic







annotation


yphD
yphD
predicted sugar transporter subunit:
6810 // transport // inferred
16020 // membrane //
5215 // transporter




membrane component of ABC superfamily
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


fabZ
fabZ
(3R)-hydroxymyristoyl ACP dehydratase
6633 // fatty acid
5737 // cytoplasm //
16829 // lyase





biosynthesis // inferred from
inferred from
activity // inferred





electronic annotation ///
electronic annotation
from electronic





8610 // lipid biosynthesis //

annotation /// 16836





inferred from electronic

// hydro-lyase





annotation /// 9245 // lipid A

activity // inferred





biosynthesis // inferred from

from electronic





electronic annotation

annotation


yoaA
yoaA
conserved protein with nucleoside
6139 // nucleobase,

166 // nucleotide




triphosphate hydrolase domain
nucleoside, nucleotide and

binding // inferred





nucleic acid metabolism //

from electronic





inferred from electronic

annotation /// 3676





annotation

// nucleic acid







binding // inferred







from electronic







annotation /// 3677







// DNA binding //







inferred from







electronic







annotation /// 4386







// helicase activity //







inferred from







electronic







annotation


yfjY
yfjY
CP4-57 prophage; predicted DNA repair
6281 // DNA repair //






protein
inferred from electronic





annotation


nrfE
nrfE
heme lyase (NrfEFG) for insertion of heme
6461 // protein complex
16020 // membrane //
15232 // heme




into c552, subunit NrfE
assembly // inferred from
inferred from
transporter activity





electronic annotation ///
electronic annotation
// inferred from





8535 // cytochrome c
/// 16021 // integral
electronic





oxidase complex assembly //
to membrane //
annotation





inferred from electronic
inferred from





annotation /// 15886 // heme
electronic annotation





transport // inferred from





electronic annotation ///





17004 // cytochrome com


udk
udk
uridine kinase
8655 // pyrimidine salvage //

166 // nucleotide





inferred from electronic

binding // inferred





annotation /// 9058 //

from electronic





biosynthesis // inferred from

annotation /// 4849





electronic annotation

// uridine kinase







activity // inferred







from electronic







annotation /// 5524







// ATP binding //







inferred from







electronic







annotation /// 16301







// kinase activity //







inferred from







electronic







annotation


yhhL
yhhL
Hypothetical protein yhhL /// conserved







inner membrane protein


JW1949
yedS_3
Pseudo





sucB
sucB
dihydrolipoamide acetyltransferase
6099 // tricarboxylic acid
45252 // oxoglutarate
4149 //





cycle // inferred from
dehydrogenase
dihydrolipoyllysine-





electronic annotation ///
complex // inferred
residue





8152 // metabolism //
from electronic
succinyltransferase





inferred from electronic
annotation
activity // inferred





annotation

from electronic







annotation /// 5515







// protein binding //







inferred from







electronic







annotation /// 8415







// acyltransferase







activity // inferred







from electronic







annotation /// 1674


aceF
aceF
dihydrolipoamide acetyltransferase
6096 // glycolysis // inferred
45254 // pyruvate
5515 // protein





from electronic annotation
dehydrogenase
binding // inferred





/// 8152 // metabolism //
complex // inferred
from physical





inferred from electronic
from electronic
interaction /// 4742





annotation
annotation
//







dihydrolipoyllysine-







residue







acetyltransferase







activity // inferred







from electronic







annotation /// 5515







// protein binding //







inferred from







electronic







annotation /// 8415







// acyltran


yaiV
yaiV
Hypothetical protein yaiV /// predicted
6355 // regulation of
5622 // intracellular
3700 // transcription




DNA-binding transcriptional regulator
transcription, DNA-
// inferred from
factor activity //





dependent // inferred from
electronic annotation
inferred from





electronic annotation

electronic







annotation


yccY
yccY ///
phosphotyrosine-protein phosphatase
6470 // protein amino acid

4721 //



etp

dephosphorylation // inferred

phosphoprotein





from electronic annotation

phosphatase activity







// inferred from







electronic







annotation /// 4725







// protein tyrosine







phosphatase activity







// inferred from







electronic







annotation /// 16787







// hydrolase activity







// inferred from







electronic







annotation


yhaO
yhaO
predicted transporter
6810 // transport // inferred
16020 // membrane //
5215 // transporter





from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


yhiP
yhiP
Hypothetical transporter yhiP /// predicted
6810 // transport // inferred
16020 // membrane //
5215 // transporter




transporter
from electronic annotation
inferred from
activity // inferred





/// 6857 // oligopeptide
electronic annotation
from electronic





transport // inferred from
/// 16021 // integral
annotation





electronic annotation
to membrane //






inferred from






electronic annotation


yaaH
yaaH
Hypothetical protein yaaH /// conserved

16020 // membrane //





inner membrane protein associated with

inferred from




acetate transport

electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


oppF
oppF
Oligopeptide transport ATP-binding
6810 // transport // inferred
16020 // membrane //
166 // nucleotide




protein oppF /// oligopeptide transporter
from electronic annotation
inferred from
binding // inferred




subunit
/// 6857 // oligopeptide
electronic annotation
from electronic





transport // inferred from

annotation /// 5524





electronic annotation ///

// ATP binding //





15031 // protein transport //

inferred from





inferred from electronic

electronic





annotation /// 15833 //

annotation /// 15197





peptide transport // inferred

// peptide





from electronic annotation

transporter activity







// inferred from







electronic







annotation /// 15198







// oligopeptide







transporter activity


pnuC
pnuC
Protein pnuC /// predicted nicotinamide
6810 // transport // inferred
16020 // membrane //





mononucleotide transporter
from electronic annotation
inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


ansP
ansP
L-asparagine permease /// L-asparagine
6810 // transport // inferred
16020 // membrane //
5279 // amino acid-




transporter
from electronic annotation
inferred from
polyamine





/// 6865 // amino acid
electronic annotation
transporter activity





transport // inferred from
/// 16021 // integral
// inferred from





electronic annotation
to membrane //
electronic






inferred from
annotation /// 15171






electronic annotation
// amino acid







transporter activity







// inferred from







electronic







annotation


cybB
cybB
Cytochrome b561
6118 // electron transport //
16020 // membrane //
5506 // iron ion





inferred from electronic
inferred from
binding // inferred





annotation /// 6810 //
electronic annotation
from electronic





transport // inferred from
/// 16021 // integral
annotation /// 46872





electronic annotation
to membrane //
// metal ion binding






inferred from
// inferred from






electronic annotation
electronic







annotation


yddH
yddH
Hypothetical protein yddH
6118 // electron transport //

16491 //





inferred from electronic

oxidoreductase





annotation

activity // inferred







from electronic







annotation


sfsA
sfsA
sugar fermentation stimulation protein ///


3677 // DNA




sugar fermentation stimulation protein A


binding // inferred







from electronic







annotation


slyX
slyX
hypothetical protein





dinI
dinI
DNA-damage-inducible protein I /// DNA
6281 // DNA repair //






damage-inducible protein I
inferred from electronic





annotation /// 6974 //





response to DNA damage





stimulus // inferred from





electronic annotation ///





9432 // SOS response //





inferred from electronic





annotation


ynjC
ynjC
fused transporter subunits of ABC
6810 // transport // inferred
16020 // membrane //
5215 // transporter




superfamily: membrane components
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


411#1
yfdG
CPS-53 (KpLE1) prophage; bactoprenol-
271 // polysaccharide
16020 // membrane





linked glucose translocase (flippase)
biosynthesis /// 6810 //
/// 16021 // integral





transport
to membrane


yjgT
yjgT ///
Gnt-II system L-idonate transporter /// L-
6810 // transport // inferred
16020 // membrane //
5351 // sugar porter



idnT
idonate and D-gluconate transporter
from electronic annotation
inferred from
activity // inferred





/// 15725 // gluconate
electronic annotation
from electronic





transport // inferred from
/// 16021 // integral
annotation /// 15128





electronic annotation ///
to membrane //
// gluconate





19521 // D-gluconate
inferred from
transporter activity





metabolism // inferred from
electronic annotation
// inferred from





electronic annotation

electronic







annotation


yheG
yheG
Probable general secretion pathway protein E
6810 // transport // inferred
5622 // intracellular
166 // nucleotide





from electronic annotation
// inferred from
binding // inferred





/// 15628 // type II protein
electronic annotation
from electronic





secretion system // inferred
/// 15627 // type II
annotation /// 5524





from electronic annotation
protein secretion
// ATP binding //






system complex //
inferred from






inferred from
electronic






electronic annotation
annotation /// 8565







// protein transporter







activity // inferred







from electronic







annotation /// 17111







// nucleoside-







triphosphatase







activity


dgt
dgt
deoxyguanosinetriphosphate
46039 // GTP metabolism //

287 // magnesium




triphosphohydrolase
inferred from electronic

ion binding //





annotation

inferred from







electronic







annotation /// 3824







// catalytic activity //







inferred from







electronic







annotation /// 8832







// dGTPase activity







// inferred from







electronic







annotation /// 16787







// hydrolase activity







// inferred from


folK
folK
2-amino-4-hydroxy-6-
9396 // folic acid and

3848 // 2-amino-4-




hydroxymethyldihydropteridine
derivative biosynthesis //

hydroxy-6-




pyrophosphokinase
inferred from electronic

hydroxymethyldihydropteridine





annotation

diphosphokinase







activity // inferred







from electronic







annotation /// 16301







// kinase activity //







inferred from







electronic







annotation /// 16740







// transferase







activity // inferred







from electronic







annotation


gppA
gppA ///
Guanosine-5′-triphosphate,3′-diphosphate


8894 // guanosine-



gpp
pyrophosphatase /// guanosine


5′-triphosphate,3′-




pentaphosphatase/exopolyphosphatase


diphosphate







diphosphatase







activity // inferred







from electronic







annotation /// 16787







// hydrolase activity







// inferred from







electronic







annotation


glnD
glnD
PII uridylyl-transferase
6807 // nitrogen compound

3824 // catalytic





metabolism // inferred from

activity // inferred





electronic annotation ///

from electronic





8152 // metabolism //

annotation /// 8773





inferred from electronic

// [protein-PII]





annotation /// 9399 //

uridylyltransferase





nitrogen fixation // inferred

activity // inferred





from electronic annotation

from electronic







annotation /// 16597







// amino acid







binding // inferred







from electronic







annotation /// 16740







// transferase


yhbX
yhbX
Outer-membrane protein yhbX precursor
8152 // metabolism //
16020 // membrane //
8484 // sulfuric ester




/// predicted hydrolase, inner membrane
inferred from electronic
inferred from
hydrolase activity //





annotation
electronic annotation
inferred from






/// 16021 // integral
electronic






to membrane //
annotation






inferred from






electronic annotation


ygjQ
ygjQ
Hypothetical protein ygjQ





323#1
ydiV
Hypothetical protein ydiV /// hypothetical







protein


cydB
cydB
Cytochrome D ubiquinol oxidase subunit
6118 // electron transport //
16020 // membrane //
5506 // iron ion




II /// cytochrome d terminal oxidase,
inferred from electronic
inferred from
binding // inferred




subunit II
annotation /// 6810 //
electronic annotation
from electronic





transport // inferred from
/// 16021 // integral
annotation /// 16491





electronic annotation
to membrane //
// oxidoreductase






inferred from
activity // inferred






electronic annotation
from electronic







annotation /// 46872







// metal ion binding







// inferred from







electronic







annotation


ybhA
ybhA
Hypothetical protein ybhA /// predicted
6812 // cation transport //
16020 // membrane //
287 // magnesium




hydrolase
inferred from electronic
inferred from
ion binding //





annotation /// 8152 //
electronic annotation
inferred from





metabolism // inferred from

electronic





electronic annotation

annotation /// 3824







// catalytic activity //







inferred from







electronic







annotation /// 5524







// ATP binding //







inferred from







electronic







annotation /// 15662







// ATPase activity,







coupled to







transmembrane


yibL
yibL
hypothetical protein





yifE
yifE
Protein yifE /// hypothetical protein





ygfF
ygfF
predicted NAD(P)-binding oxidoreductase
8152 // metabolism //

5515 // protein




with NAD(P)-binding Rossmann-fold
inferred from electronic

binding // inferred




domain
annotation

from physical







interaction /// 16491







// oxidoreductase







activity // inferred







from electronic







annotation


rffG
rffG
dTDP-glucose 4,6-dehydratase
9103 // lipopolysaccharide

3824 // catalytic





biosynthesis // inferred from

activity // inferred





electronic annotation ///

from electronic





9225 // nucleotide-sugar

annotation /// 8460





metabolism // inferred from

// dTDP-glucose





electronic annotation ///

4,6-dehydratase





44237 // cellular metabolism

activity // inferred





// inferred from electronic

from electronic





annotation

annotation /// 16829







// lyase activity //







inferred from







electronic







annotation /// 50662







// coenzyme binding







//


yeaS
yeaS
Hypothetical protein yeaS /// neutral
6865 // amino acid transport
16020 // membrane //
5293 // lysine




amino-acid efflux system
// inferred from electronic
inferred from
permease activity //





annotation
electronic annotation
inferred from






/// 16021 // integral
electronic






to membrane //
annotation






inferred from






electronic annotation


yaiM
yaiM ///
Hypothetical protein yaiM /// predicted


4759 // serine



frmB
esterase


esterase activity //







inferred from







electronic







annotation /// 16787







// hydrolase activity







// inferred from







electronic







annotation /// 4091







// carboxylesterase







activity // inferred







from electronic







annotation


ygeD
ygeD
Hypothetical protein ygeD /// predicted

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


yjhB
yjhB
KpLE2 phage-like element; predicted
6810 // transport // inferred
16020 // membrane //
5215 // transporter




transporter
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


codB
codB
Cytosine permease /// cytosine transporter
6810 // transport // inferred
16020 // membrane //
15205 // nucleobase





from electronic annotation
inferred from
transporter activity





/// 15931 // nucleobase,
electronic annotation
// inferred from





nucleoside, nucleotide and
/// 16021 // integral
electronic





nucleic acid transport //
to membrane //
annotation





inferred from electronic
inferred from





annotation /// 19858 //
electronic annotation





cytosine metabolism //





inferred from electronic





annotation


rfaL
rfaL
O-antigen ligase
9103 // lipopolysaccharide
16020 // membrane //
16874 // ligase





biosynthesis // inferred from
inferred from
activity // inferred





electronic annotation
electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


yiaQ
yiaQ ///
Probable hexulose-6-phosphate synthase ///
5975 // carbohydrate

5515 // protein



sgbH
3-keto-L-gulonate 6-phosphate
metabolism // inferred from

binding // inferred




decarboxylase
electronic annotation ///

from physical





6207 // ‘de novo’ pyrimidine

interaction /// 287 //





base biosynthesis // inferred

magnesium ion





from electronic annotation

binding // inferred







from electronic







annotation /// 4590







// orotidine-5′-







phosphate







decarboxylase







activity // inferred







from electronic







annotation /// 16829







// lyase activ


potC
potC
spermidine/putrescine ABC transporter
6810 // transport // inferred
16020 // membrane //
5215 // transporter




membrane component ///
from electronic annotation
inferred from
activity // inferred




spermidine/putrescine ABC transporter

electronic annotation
from electronic




membrane protein

/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


secB
secB
export protein SecB
6457 // protein folding //

51082 // unfolded





inferred from electronic

protein binding //





annotation /// 6605 // protein

inferred from





targeting // inferred from

electronic





electronic annotation ///

annotation





6810 // transport // inferred





from electronic annotation





/// 15031 // protein transport





// inferred from electronic





annotation


murG
murG
N-acetylglucosaminyl transferase
5975 // carbohydrate
5618 // cell wall //
16740 // transferase





metabolism // inferred from
inferred from
activity // inferred





electronic annotation ///
electronic annotation
from electronic





7049 // cell cycle // inferred
/// 9276 // cell wall
annotation /// 16757





from electronic annotation
(sensu
// transferase





/// 8360 // regulation of cell
Proteobacteria) //
activity, transferring





shape // inferred from
inferred from
glycosyl groups //





electronic annotation ///
electronic annotation
inferred from





9252 // peptidoglycan
/// 16020 //
electronic





biosynthesis // inferred from
membrane // inferred
annotation /// 16758





electronic annotation
from electronic
// transferase






annotation
activity, transferring







hexosyl groups //







inferred from







electronic







annotation


ydhV
ydhV
Hypothetical protein ydhV /// predicted
6118 // electron transport //

16491 //




oxidoreductase
inferred from electronic

oxidoreductase





annotation

activity // inferred







from electronic







annotation /// 16730







// oxidoreductase







activity, acting on







iron-sulfur proteins







as donors // inferred







from electronic







annotation


putP
putP
Sodium/proline symporter ///
6810 // transport // inferred
16020 // membrane //
5215 // transporter




proline:sodium symporter
from electronic annotation
inferred from
activity // inferred





/// 6811 // ion transport //
electronic annotation
from electronic





inferred from electronic
/// 16021 // integral
annotation /// 5298





annotation /// 6814 // sodium
to membrane //
// proline:sodium





ion transport // inferred from
inferred from
symporter activity //





electronic annotation ///
electronic annotation
inferred from





6865 // amino acid transport

electronic





// inferred from electronic

annotation /// 15171





annotation

// amino acid







transporter activity







// inferred from







electronic







annotation /// 15293







// sympo


yiaL
yiaL
Hypothetical protein yiaL





queA
queA
S-adenosylmethionine:tRNA
8616 // queuosine

3824 // catalytic




ribosyltransferase-isomerase
biosynthesis // inferred from

activity // inferred





electronic annotation

from electronic







annotation /// 16740







// transferase







activity // inferred







from electronic







annotation /// 16853







// isomerase activity







// inferred from







electronic







annotation


yhaH
yhaH
Hypothetical protein yhaH /// predicted

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


cobU
cobU
adenosylcobinamide kinase ///
6779 // porphyrin

166 // nucleotide




adenosylcobinamide
biosynthesis // inferred from

binding // inferred




kinase/adenosylcobinamide-phosphate
electronic annotation ///

from electronic




guanylyltransferase
9236 // cobalamin

annotation /// 3824





biosynthesis // inferred from

// catalytic activity //





electronic annotation

inferred from







electronic







annotation /// 5524







// ATP binding //







inferred from







electronic







annotation /// 5525







// GTP binding //







inferred from







electronic







annotation


23-12A0


yadQ
yadQ
chloride channel protein
6810 // transport // inferred
16020 // membrane //
5247 // voltage-





from electronic annotation
inferred from
gated chloride





/// 6811 // ion transport //
electronic annotation
channel activity //





inferred from electronic
/// 16021 // integral
inferred from





annotation /// 6821 //
to membrane //
electronic





chloride transport // inferred
inferred from
annotation /// 15297





from electronic annotation
electronic annotation
// antiporter activity







// inferred from







electronic







annotation /// 31404







// chloride ion







binding // inferred







from electronic







annotation


yciQ
yciQ
predicted inner membrane protein





tauB
tauB
Taurine transport ATP-binding protein
6810 // transport // inferred
16020 // membrane //
166 // nucleotide




tauB /// taurine transporter subunit
from electronic annotation
inferred from
binding // inferred






electronic annotation
from electronic







annotation /// 5524







// ATP binding //







inferred from







electronic







annotation /// 15411







// taurine-







transporting ATPase







activity // inferred







from electronic







annotation /// 16787







// hydrolase activity







// inferred from







electronic







annotation


yagG
yagG
CP4-6 prophage; predicted sugar
6810 // transport // inferred
16020 // membrane //
5215 // transporter




transporter
from electronic annotation
inferred from
activity // inferred





/// 6814 // sodium ion
electronic annotation
from electronic





transport // inferred from
/// 16021 // integral
annotation /// 15293





electronic annotation
to membrane //
// symporter activity






inferred from
// inferred from






electronic annotation
electronic







annotation


lipA
lipA
lipoyl synthase
9107 // lipoate biosynthesis

3824 // catalytic





// inferred from electronic

activity // inferred





annotation

from electronic







annotation /// 5506







// iron ion binding //







inferred from







electronic







annotation /// 16740







// transferase







activity // inferred







from electronic







annotation /// 16783







// sulfurtransferase







activity // inferred







from electronic







annotation


yhcO
yhcO
Hypothetical protein yhcO /// predicted







barnase inhibitor


maoC
maoC
fused aldehyde dehydrogenase/enoyl-CoA
8152 // metabolism //

16491 //




hydratase
inferred from electronic

oxidoreductase





annotation

activity // inferred







from electronic







annotation


nfrB
nfrB
Bacteriophage N4 adsorption protein B ///
6810 // transport // inferred
16020 // membrane //
5524 // ATP binding




bacteriophage N4 receptor, inner
from electronic annotation
inferred from
// inferred from




membrane subunit
/// 9597 // detection of virus
electronic annotation
electronic





// inferred from electronic
/// 16021 // integral
annotation





annotation /// 46718 // entry
to membrane //





of virus into host cell //
inferred from





inferred from electronic
electronic annotation





annotation


yajR
yajR
Hypothetical transport protein yajR ///
6810 // transport // inferred
16020 // membrane //
5215 // transporter




predicted transporter
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


trkH
trkH
Trk system potassium uptake protein trkH
6810 // transport // inferred
16020 // membrane //
8324 // cation




/// potassium transporter
from electronic annotation
inferred from
transporter activity





/// 6811 // ion transport //
electronic annotation
// inferred from





inferred from electronic
/// 16021 // integral
electronic





annotation /// 6812 // cation
to membrane //
annotation /// 30955





transport // inferred from
inferred from
// potassium ion





electronic annotation ///
electronic annotation
binding // inferred





6813 // potassium ion

from electronic





transport // inferred from

annotation





electronic a


exuR
exuR
Exu regulon transcriptional regulator ///
6350 // transcription //
5622 // intracellular
3677 // DNA




DNA-binding transcriptional repressor
inferred from electronic
// inferred from
binding // inferred





annotation /// 6355 //
electronic annotation
from electronic





regulation of transcription,

annotation /// 3700





DNA-dependent // inferred

// transcription





from electronic annotation

factor activity //







inferred from







electronic







annotation


glnQ
glnQ
glutamine ABC transporter ATP-binding
6810 // transport // inferred
16020 // membrane //
166 // nucleotide




component /// glutamine ABC transporter
from electronic annotation
inferred from
binding // inferred




ATP-binding protein
/// 6865 // amino acid
electronic annotation
from electronic





transport // inferred from

annotation /// 5524





electronic annotation

// ATP binding //







inferred from







electronic







annotation /// 15171







// amino acid







transporter activity







// inferred from







electronic







annotation /// 16887







// ATPase activity //







inferred from







electronic







annotation r


yafJ
yafJ
Hypothetical protein yafJ /// predicted
8152 // metabolism //

16740 // transferase




amidotransfease
inferred from electronic

activity // inferred





annotation

from electronic







annotation


ydeF
ydeF ///
Hypothetical protein ydeE /// predicted
6810 // transport // inferred
16020 // membrane //
5215 // transporter



ydeE
transporter
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic






/// 16021 // integral
annotation






to membrane //






inferred from






electronic annotation


yejF
yejF
Hypothetical ABC transporter ATP-
6810 // transport // inferred
16020 // membrane //
166 // nucleotide




binding protein yejF /// fused predicted
from electronic annotation
inferred from
binding // inferred




oligopeptide transporter subunits of ABC
/// 6857 // oligopeptide
electronic annotation
from electronic




superfamilly: ATP-binding components
transport // inferred from

annotation /// 5524





electronic annotation ///

// ATP binding //





15031 // protein transport //

inferred from





inferred from electronic

electronic





annotation

annotation /// 16887







// ATPase activity //







inferred from







electronic







annotation /// 17111







// nucleoside-







triphosphatase







activity // inferred







from electronic







annotation


yheU
yheU
hypothetical protein





greA
greA
transcription elongation factor GreA
6350 // transcription //

3677 // DNA





inferred from electronic

binding // inferred





annotation /// 6355 //

from electronic





regulation of transcription,

annotation /// 3711





DNA-dependent // inferred

// transcriptional





from electronic annotation

elongation regulator







activity // inferred







from electronic







annotation


yagM
yagM
CP4-6 prophage; predicted protein





allP
allP ///
Putative allantoin permease /// predicted
6144 // purine base
16020 // membrane //
15205 // nucleobase



ybbW
allantoin transporter
metabolism // inferred from
inferred from
transporter activity





electronic annotation ///
electronic annotation
// inferred from





6810 // transport // inferred
/// 16021 // integral
electronic





from electronic annotation
to membrane //
annotation





/// 15931 // nucleobase,
inferred from





nucleoside, nucleotide and
electronic annotation





nucleic acid transport //





inferred from electronic





annotation


yghK
yghK
Glycolate permease glcA /// glycolate
6810 // transport // inferred
16020 // membrane //
15129 // lactate




transporter
from electronic annotation
inferred from
transporter activity





/// 15727 // lactate transport
electronic annotation
// inferred from





// inferred from electronic
/// 16021 // integral
electronic





annotation
to membrane //
annotation






inferred from






electronic annotation


yjfP
yjfP
Hypothetical protein yjfP /// predicted


16787 // hydrolase




hydrolase


activity // inferred







from electronic







annotation


409#5
yfcP
Hypothetical fimbrial-like protein yfcP
7155 // cell adhesion
9289 // fimbrium





precursor


yefI
yefI ///
lipopolysaccharide biosynthesis protein
9058 // biosynthesis //

16740 // transferase



wbbK

inferred from electronic

activity // inferred





annotation /// 9103 //

from electronic





lipopolysaccharide

annotation





biosynthesis // inferred from





electronic annotation


ydbD
ydbD
hypothetical protein





214#3


yhiN
yhiN
Hypothetical protein yhiN /// predicted
6118 // electron transport //

16491 //




oxidoreductase with FAD/NAD(P)-
inferred from electronic

oxidoreductase




binding domain
annotation

activity // inferred







from electronic







annotation


mutT
mutT
Mutator mutT protein /// nucleoside
6260 // DNA replication //

287 // magnesium




triphosphate pyrophosphohydrolase,
inferred from electronic

ion binding //




marked preference for dGTP
annotation /// 6281 // DNA

inferred from





repair // inferred from

electronic





electronic annotation ///

annotation /// 4452





6974 // response to DNA

// isopentenyl-





damage stimulus // inferred

phosphate delta-





from electronic annotation

isomerase activity //





/// 8299 // isoprenoid

inferred from





biosynthesis // inferred from

electronic





electronic annotation

annotation /// 8413







// 8-oxo-7,8-







dihydroguanine







triphosphatase







activity // inferred







from electroni


virK
virK
hypothetical protein





ompC
ompC
Outer membrane protein C precursor
6810 // transport // inferred
16020 // membrane //
5215 // transporter





from electronic annotation
inferred from
activity // inferred





/// 6811 // ion transport //
electronic annotation
from electronic





inferred from electronic
/// 16021 // integral
annotation /// 15288





annotation /// 9597 //
to membrane //
// porin activity //





detection of virus // inferred
inferred from
inferred from





from electronic annotation
electronic annotation
electronic





/// 46718 // entry of virus
/// 19867 // outer
annotation





into host cell // inferred from
membrane // inferred





electronic annotation
from electronic






annotation


yghT
yghT
Hypothetical ATP-binding protein yghT ///


166 // nucleotide




predicted protein with nucleoside


binding // inferred




triphosphate hydrolase domain


from electronic







annotation /// 5524







// ATP binding //







inferred from







electronic







annotation


yohG
yohG
Putative channel/filament proteins ///
6810 // transport // inferred
16020 // membrane //
5215 // transporter




predicted outer membrane protein
from electronic annotation
inferred from
activity // inferred





/// 46677 // response to
electronic annotation
from electronic





antibiotic // inferred from
/// 19867 // outer
annotation /// 8289





electronic annotation
membrane // inferred
// lipid binding //






from electronic
inferred from






annotation
electronic







annotation


ebgA
ebgA
Evolved beta-galactosidase alpha-subunit
5975 // carbohydrate
9341 // beta-
3824 // catalytic





metabolism // inferred from
galactosidase
activity // inferred





electronic annotation
complex // inferred
from electronic






from electronic
annotation /// 4553






annotation
// hydrolase activity,







hydrolyzing O-







glycosyl compounds







// inferred from







electronic







annotation /// 4565







// beta-galactosidase







activity // inferred







from electronic







annotation /// 16


yjfF
yjfF
Hypothetical ABC transporter permease
6810 // transport // inferred
16020 // membrane //
5215 // transporter




protein yjfF
from electronic annotation
inferred from
activity // inferred






electronic annotation
from electronic







annotation


452#13
ygcH
hypothetical protein





yphG
yphG
Hypothetical protein yphG


5488 // binding //







inferred from







electronic







annotation


ynaJ
ynaJ
Hypothetical protein ynaJ /// predicted

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


sucD
sucD
succinyl-CoA synthetase alpha subunit ///
6099 // tricarboxylic acid

166 // nucleotide




succinyl-CoA synthetase subunit alpha
cycle // inferred from

binding // inferred





electronic annotation ///

from electronic





8152 // metabolism //

annotation /// 3824





inferred from electronic

// catalytic activity //





annotation

inferred from







electronic







annotation /// 4775







// succinate-CoA







ligase (ADP-







forming) activity //







inferred from







electronic







annotation /// 5524







// ATP binding/


prtC
prtC
hypothetical protein





yhdT
yhdT
Hypothetical protein yhdT /// conserved

16020 // membrane //





inner membrane protein

inferred from






electronic annotation






/// 16021 // integral






to membrane //






inferred from






electronic annotation


(yhiW)
yhiW
Hypothetical transcriptional regulator
6350 // transcription //
5622 // intracellular
3677 // DNA




yhiW
inferred from electronic
// inferred from
binding // inferred





annotation /// 6355 //
electronic annotation
from electronic





regulation of transcription,

annotation /// 3700





DNA-dependent // inferred

// transcription





from electronic annotation

factor activity //





/// 45449 // regulation of

inferred from





transcription // inferred from

electronic





electronic annotation

annotation /// 43565







// sequence-specific







DNA binding //







inferred from







electronic







annotation


610#6.1
















TABLE 3







SAM ANALYSIS OF CROHN'S DISEASE (CD) VERSUS ULCERATIVE COLITIS (UC) (see FIG. 3B)












SPOT
PROTEIN
NAME
GO BP
GO CC
GO MF










51 Highly immunogenic proteins in UC












secF
secF
protein export protein SecF
6605 // protein targeting //
9276 // cell wall
8565 // protein





inferred from electronic
(sensu
transporter activity //





annotation /// 6810 //
Proteobacteria) //
inferred from





transport // inferred from
inferred from
electronic annotation





electronic annotation ///
electronic
/// 15450 // protein





6886 // intracellular
annotation ///
translocase activity //





protein transport // inferred
15627 // type II
inferred from





from electronic annotation
protein secretion
electronic annotation





/// 15031 // protein
system complex //





transport // inferred from
inferred from





electronic annotation
electronic






annotation ///






16020 // membrane






// inferred from






electronic






annotation ///






16021 // integral


427#1
yfgF
Hypothetical protein yfgF /// predicted inner membrane

16020 // membrane





protein

/// 16021 // integral






to membrane


yojI
yojI
Hypothetical ABC transporter ATP-binding protein yojI ///
6810 // transport // inferred
16020 // membrane
166 // nucleotide




fused predicted multidrug transport subunits of ABC
from electronic annotation
// inferred from
binding // inferred




superfamily: membrane component/ATP-binding component
/// 15833 // peptide
electronic
from electronic





transport // inferred from
annotation ///
annotation /// 5524 //





electronic annotation ///
16021 // integral to
ATP binding //





46677 // response to
membrane //
inferred from





antibiotic // inferred from
inferred from
electronic annotation





electronic annotation
electronic
/// 15197 // peptide






annotation
transporter activity //







inferred from







electronic annotation







/// 16887 // ATPase







activity // inferred







from


emrY
emrY
Multidrug resistance protein Y /// predicted multidrug
6810 // transport // inferred
16020 // membrane
5215 // transporter




efflux system
from electronic annotation
// inferred from
activity // inferred





/// 6306 // DNA
electronic
from electronic





methylation // inferred
annotation ///
annotation /// 3677 //





from electronic annotation
16021 // integral to
DNA binding //






membrane //
inferred from






inferred from
electronic annotation






electronic






annotation


trkG
trkG
Rac prophage; potassium transporter subunit
6810 // transport // inferred
16020 // membrane
8324 // cation





from electronic annotation
// inferred from
transporter activity //





/// 6811 // ion transport //
electronic
inferred from





inferred from electronic
annotation ///
electronic annotation





annotation /// 6812 //
16021 // integral to
/// 30955 // potassium





cation transport // inferred
membrane //
ion binding //





from electronic annotation
inferred from
inferred from





/// 6813 // potassium ion
electronic
electronic annotation





transport // inferred from
annotation





electronic a


yhiN
yhiN
Hypothetical protein yhiN /// predicted oxidoreductase
6118 // electron transport //

16491 //


23-12A0

with FAD/NAD(P)-binding domain
inferred from electronic

oxidoreductase





annotation

activity // inferred







from electronic







annotation


ydaA
ydaA /// uspE
Protein ydaA /// stress-induced protein
6950 // response to stress //







inferred from electronic





annotation


nagE
nagE
PTS system, N-acetylglucosamine-specific IIABC
6810 // transport // inferred
16020 // membrane
5351 // sugar porter




component
from electronic annotation
// inferred from
activity // inferred





/// 9401 //
electronic
from electronic





phosphoenolpyruvate-
annotation ///
annotation /// 8982 //





dependent sugar
16021 // integral to
protein-N(PI)-





phosphotransferase system
membrane //
phosphohistidine-





// inferred from electronic
inferred from
sugar





annotation
electronic
phosphotransferase






annotation ///
activity // inferred






19866 // organelle
from electronic






inner membrane //
annotation






inferred from






electronic






annotation


ydhV
ydhV
Hypothetical protein ydhV /// predicted oxidoreductase
6118 // electron transport //

16491 //





inferred from electronic

oxidoreductase





annotation

activity // inferred







from electronic







annotation /// 16730







// oxidoreductase







activity, acting on







iron-sulfur proteins







as donors // inferred







from electronic







annotation


maoC
maoC
fused aldehyde dehydrogenase/enoyl-CoA hydratase
8152 // metabolism //

16491 //





inferred from electronic

oxidoreductase





annotation

activity // inferred







from electronic







annotation


yaiM
yaiM /// frmB
Hypothetical protein yaiM /// predicted esterase


4759 // serine







esterase activity //







inferred from







electronic annotation







/// 16787 // hydrolase







activity // inferred







from electronic







annotation /// 4091 //







carboxylesterase







activity // inferred







from electronic







annotation


yeiO
yeiO /// setB
Sugar efflux transporter B /// lactose/glucose
6810 // transport // inferred
16020 // membrane
5215 // transporter




efflux system
from electronic annotation
// inferred from
activity // inferred





/// 8643 // carbohydrate
electronic
from electronic





transport // inferred from
annotation ///
annotation /// 5351 //





electronic annotation
16021 // integral to
sugar porter activity






membrane //
// inferred from






inferred from
electronic annotation






electronic
/// 15542 // sugar






annotation
efflux transporter







activity // inferred







from electronic







annotation


yphD
yphD
predicted sugar transporter subunit:
6810 // transport // inferred
16020 // membrane
5215 // transporter




membrane component of ABC superfamily
from electronic annotation
// inferred from
activity // inferred






electronic
from electronic






annotation ///
annotation






16021 // integral to






membrane //






inferred from






electronic






annotation


narI
narI
Respiratory nitrate reductase 1 gamma
6118 // electron transport //
9325 // nitrate
5506 // iron ion




chain /// nitrate reductase 1, gamma
inferred from electronic
reductase complex
binding // inferred




(cytochrome b(NR)) subunit
annotation /// 6810 //
// inferred from
from electronic





transport // inferred from
electronic
annotation /// 8940 //





electronic annotation ///
annotation ///
nitrate reductase





42128 // nitrate
16020 // membrane
activity // inferred





assimilation // inferred
// inferred from
from electronic





from electronic annotation
electronic
annotation /// 16491






annotation ///
// oxidoreductase






16021 // integral to
activity // inferred






membrane //
from electronic






inferred from
annotation /// 46872






electronic
// metal ion binding //






annotation
inf


ycbM
ycbM /// ssuC
Putative aliphatic sulfonates transport permease
6810 // transport // inferred
16020 // membrane
5215 // transporter




protein ssuC /// alkanesulfonate transporter subunit
from electronic annotation
// inferred from
activity // inferred






electronic
from electronic






annotation ///
annotation






16021 // integral to






membrane //






inferred from






electronic






annotation


yafJ
yafJ
Hypothetical protein yafJ /// predicted
8152 // metabolism //

16740 // transferase




amidotransfease
inferred from electronic

activity // inferred





annotation

from electronic







annotation


lueO
lueO
hypothetical protein





ybbC
ybbC
hypothetical protein





(yeeF)
yeeF
Hypothetical transport protein yeeF /// predicted
6810 // transport // inferred
16020 // membrane
5279 // amino acid-




amino-acid transporter
from electronic annotation
// inferred from
polyamine





/// 6865 // amino acid
electronic
transporter activity //





transport // inferred from
annotation ///
inferred from





electronic annotation
16021 // integral to
electronic annotation






membrane //
/// 15171 // amino






inferred from
acid transporter






electronic
activity // inferred






annotation
from electronic







annotation


mhpF
mhpF
acetaldehyde dehydrogenase
6520 // amino acid
5737 // cytoplasm //
8774 // acetaldehyde





metabolism // inferred
inferred from
dehydrogenase





from electronic annotation
electronic
(acetylating) activity





/// 15976 // carbon
annotation
// inferred from





utilization // inferred from

electronic annotation





electronic annotation ///

/// 16491 //





19439 // aromatic

oxidoreductase





compound catabolism //

activity // inferred





inferred from electronic

from electronic





annotation

annotation /// 16620







// oxidoreductase







activity, acting on the







aldehyde or oxo







group of donor


rfaB
rfaB
UDP-D-galactose:(glucosyl)lipopolysaccharide-
9058 // biosynthesis //

16740 // transferase




1,6-D-galactosyltransferase
inferred from electronic

activity // inferred





annotation /// 9103 //

from electronic





lipopolysaccharide

annotation /// 16757





biosynthesis // inferred

// transferase activity,





from electronic annotation

transferring glycosyl







groups // inferred







from electronic







annotation


yciD
yciD /// ompW
Outer membrane protein W precursor /// outer membrane

16020 // membrane





protein W

// inferred from






electronic






annotation ///






19867 // outer






membrane //






inferred from






electronic






annotation


dinI
dinI
DNA-damage-inducible protein I /// DNA damage-inducible
6281 // DNA repair //






protein I
inferred from electronic





annotation /// 6974 //





response to DNA damage





stimulus // inferred from





electronic annotation ///





9432 // SOS response //





inferred from electronic





annotation


yjgT
yjgT /// idnT
Gnt-II system L-idonate transporter /// L-
6810 // transport // inferred
16020 // membrane
5351 // sugar porter




idonate and D-gluconate transporter
from electronic annotation
// inferred from
activity // inferred





/// 15725 // gluconate
electronic
from electronic





transport // inferred from
annotation ///
annotation /// 15128





electronic annotation ///
16021 // integral to
// gluconate





19521 // D-gluconate
membrane //
transporter activity //





metabolism // inferred
inferred from
inferred from





from electronic annotation
electronic
electronic annotation






annotation


yjiJ
yjiJ
Hypothetical protein yjiJ /// predicted inner membrane

16020 // membrane





protein

// inferred from






electronic






annotation ///






16021 // integral to






membrane //






inferred from






electronic






annotation


fsr
fsr
Fosmidomycin resistance protein /// predicted fosmidomycin
6810 // transport // inferred
16020 // membrane
5215 // transporter




efflux system
from electronic annotation
// inferred from
activity // inferred





/// 46677 // response to
electronic
from electronic





antibiotic // inferred from
annotation ///
annotation





electronic annotation
16021 // integral to






membrane //






inferred from






electronic






annotation


nac
nac
Nitrogen assimilation Regulatory protein nac /// DNA-binding
6350 // transcription //

3677 // DNA binding




transcriptional dual regulator of nitrogen assimilation
inferred from electronic

// inferred from





annotation /// 6355 //

electronic annotation





regulation of transcription,

/// 3700 //





DNA-dependent // inferred

transcription factor





from electronic annotation

activity // inferred





/// 42128 // nitrate

from electronic





assimilation // inferred

annotation





from electronic annotation


msbA
msbA
Probable transport ATP-binding protein msbA /// fused
6810 // transport // inferred
9276 // cell wall
166 // nucleotide




lipid transporter subunits of ABC superfamily: membrane
from electronic annotation
(sensu
binding // inferred




component/ATP-binding component
/// 6869 // lipid transport //
Proteobacteria) //
from electronic





inferred from electronic
inferred from
annotation /// 5524 //





annotation
electronic
ATP binding //






annotation ///
inferred from






16020 // membrane
electronic annotation






// inferred from
/// 16887 // ATPase






electronic
activity // inferred






annotation ///
from electronic






16021 // integral to
annotation /// 17111






membrane //
// nucleoside-






inferred from
triphosphatase






electronic
activity // inferred






annotation


modC
modC
Molybdenum transport ATP-binding protein modC /// molybdate
6810 // transport // inferred
9276 // cell wall
166 // nucleotide




transporter subunit
from electronic annotation
(sensu
binding // inferred





/// 15689 // molybdate ion
Proteobacteria) //
from electronic





transport // inferred from
inferred from
annotation /// 5524 //





electronic annotation
electronic
ATP binding //






annotation ///
inferred from






16020 // membrane
electronic annotation






// inferred from
/// 15098 //






electronic
molybdate ion






annotation
transporter activity //







inferred from







electronic annotation







/// 15412 //







molybdate-







transporting ATPa


kch
kch
Putative potassium channel protein /// voltage-gated potassium channel
6810 // transport // inferred
16020 // membrane
5216 // ion channel





from electronic annotation
// inferred from
activity // inferred





/// 6811 // ion transport //
electronic
from electronic





inferred from electronic
annotation ///
annotation





annotation /// 6813 //
16021 // integral to





potassium ion transport //
membrane //





inferred from electronic
inferred from





annotation
electronic






annotation


prtC
prtC
hypothetical protein





yjfH
yjfH /// rlmB
Hypothetical tRNA/rRNA
6364 // rRNA processing //

3723 // RNA binding




methyltransferase yjfH /// 23S rRNA (Gm2251)-methyltransferase
inferred from electronic

// inferred from





annotation /// 6396 // RNA

electronic annotation





processing // inferred from

/// 8168 //





electronic annotation ///

methyltransferase





9451 // RNA modification

activity // inferred





// inferred from electronic

from electronic





annotation

annotation /// 8173 //







RNA







methyltransferase







activity // inferred







from electronic







annotation /// 16740







// transferase activity







//


glnD
glnD
PII uridylyl-transferase
6807 // nitrogen compound

3824 // catalytic





metabolism // inferred

activity // inferred





from electronic annotation

from electronic





/// 8152 // metabolism //

annotation /// 8773 //





inferred from electronic

[protein-PII]





annotation /// 9399 //

uridylyltransferase





nitrogen fixation // inferred

activity // inferred





from electronic annotation

from electronic







annotation /// 16597







// amino acid binding







// inferred from







electronic annotation







/// 16740 //







transferase


yhjC
yhjC
Hypothetical transcriptional regulator
6350 // transcription //

3677 // DNA binding




yhjC /// predicted DNA-binding transcriptional regulator
inferred from electronic

// inferred from





annotation /// 6355 //

electronic annotation





regulation of transcription,

/// 3700 //





DNA-dependent // inferred

transcription factor





from electronic annotation

activity // inferred







from electronic







annotation


prpE
prpE
predicted propionyl-CoA synthetase with ATPase domain
8152 // metabolism //

3824 // catalytic





inferred from electronic

activity // inferred





annotation /// 19629 //

from electronic





propionate catabolism, 2-

annotation /// 16874





methylcitrate cycle //

// ligase activity //





inferred from electronic

inferred from





annotation

electronic annotation







/// 50218 //







propionate-CoA







ligase activity //







inferred from







electronic annotation


279#6
ddpC
D-ala-D-ala transporter subunit
6810 // transport
16020 // membrane
5215 // transporter






/// 16021 // integral
activity






to membrane


ygjR
ygjR
Hypothetical oxidoreductase ygjR /// predicted NAD(P)-binding
6118 // electron transport //
16020 // membrane
16491 //




dehydrogenase
inferred from electronic
// inferred from
oxidoreductase





annotation /// 8152 //
electronic
activity // inferred





metabolism // inferred
annotation ///
from electronic





from electronic annotation
16021 // integral to
annotation






membrane //






inferred from






electronic






annotation


ppx
ppx
Exopolyphosphatase

16020 // membrane
287 // magnesium ion






// inferred from
binding // inferred






electronic
from electronic






annotation
annotation /// 4309 //







exopolyphosphatase







activity // inferred







from electronic







annotation /// 16787







// hydrolase activity //







inferred from







electronic annotation


yefI
yefI /// wbbK
lipopolysaccharide biosynthesis protein
9058 // biosynthesis //

16740 // transferase





inferred from electronic

activity // inferred





annotation /// 9103 //

from electronic





lipopolysaccharide

annotation





biosynthesis // inferred





from electronic annotation


mesJ
mesJ
Putative cell cycle protein mesJ
8033 // tRNA processing //

166 // nucleotide





inferred from electronic

binding // inferred





annotation /// 16549 //

from electronic





tRNA editing // inferred

annotation /// 5524 //





from electronic annotation

ATP binding //







inferred from







electronic annotation







/// 16874 // ligase







activity // inferred







from electronic







annotation /// 16879







// ligase activity,







forming carbon-







nitrogen bonds


secY/prlA
secY /// prlA
preprotein translocase SecY /// protein translocase subunit SecY
6605 // protein targeting //
16020 // membrane
15450 // protein





inferred from electronic
// inferred from
translocase activity //





annotation /// 6810 //
electronic
inferred from





transport // inferred from
annotation ///
electronic annotation





electronic annotation ///
16021 // integral to





9306 // protein secretion //
membrane //





inferred from electronic
inferred from





annotation /// 15031 //
electronic





protein transport // inferred
annotation





from electronic annotation


yejA
yejA
Hypothetical protein yejA precursor /// predicted oligopeptide transporter
6810 // transport // inferred

5215 // transporter




subunit
from electronic annotation

activity // inferred







from electronic







annotation


dgkA
dgkA
Diacylglycerol kinase
8654 // phospholipid
16020 // membrane
4143 // diacylglycerol





biosynthesis // inferred
// inferred from
kinase activity //





from electronic annotation
electronic
inferred from






annotation ///
electronic annotation






16021 // integral to
/// 16301 // kinase






membrane //
activity // inferred






inferred from
from electronic






electronic
annotation /// 16740






annotation
// transferase activity







// inferred from







electronic annotation


ygcE
ygcE
Hypothetical sugar kinase ygcE
5975 // carbohydrate

16301 // kinase





metabolism // inferred

activity // inferred





from electronic annotation

from electronic







annotation


focA
focA
F1C major fimbrial subunit precursor
7155 // cell adhesion //
9289 // fimbrium //






inferred from electronic
inferred from





annotation
electronic






annotation


(rtn)
rtn
Rtn protein





ydbD
ydbD
hypothetical protein





211#11


ygfQ
ygfQ
predicted transporter
6810 // transport // inferred
16020 // membrane
166 // nucleotide





from electronic annotation
// inferred from
binding // inferred






electronic
from electronic






annotation
annotation /// 5215 //







transporter activity //







inferred from







electronic annotation







/// 5524 // ATP







binding // inferred







from electronic







annotation


folK
folK
2-amino-4-hydroxy-6-hydroxymethyldihydropteridine
9396 // folic acid and

3848 // 2-amino-4-




pyrophosphokinase
derivative biosynthesis //

hydroxy-6-





inferred from electronic

hydroxymethyldihydropteridine





annotation

diphosphokinase







activity // inferred







from electronic







annotation /// 16301







// kinase activity //







inferred from







electronic annotation







/// 16740 //







transferase activity //







inferred from







electronic ann







137 Highly immunogenic response proteins in CD












frvX
frvX
predicted endo-1,4-beta-


16787 //




glucanase


hydrolase







activity //







inferred from







electronic







annotation


LDR-D


416#1


sugE
sugE
SugE protein
6810 // transport // inferred from
16020 // membrane // inferred






electronic annotation
from electronic annotation ///






16021 // integral to






membrane // inferred from






electronic annotation


dinD
dinD
DNA-damage-inducible protein





fecB
fecB
KpLE2 phage-like element; iron-dicitrate transporter subunit
6810 // transport // inferred from
42597 // periplasmic space //
5381 // iron





electronic annotation /// 6811 //
inferred from electronic
ion transporter





ion transport // inferred from
annotation
activity //





electronic annotation /// 6826 //

inferred from





iron ion transport // inferred from

electronic





electronic annotation /// 6827 //

annotation ///





high affinity iron ion transport //

5506 // iron





inferred from electronic annotation

ion binding //







inferred from







electronic







annotation


fliA
fliA
flagellar biosynthesis sigma factor FliA /// flagellar
6350 // transcription // inferred

3677 // DNA




biosynthesis sigma factor
from electronic annotation /// 6352

binding //





// transcription initiation // inferred

inferred from





from electronic annotation /// 6355

electronic





// regulation of transcription,

annotation ///





DNA-dependent // inferred from

3700 //





electronic annotation

transcription







factor activity







// inferred







from







electronic







annotation ///







3899 // DNA-







directed RNA







polymerase







activity //







inferred from







electronic







annotation ///







16740 //







transferase ac


yjhA
yjhA
Hypothetical protein yjhA precursor /// N-acetylnuraminic
6810 // transport // inferred from
16020 // membrane // inferred
5351 // sugar




acid outer membrane channel protein
electronic annotation /// 6811 //
from electronic annotation ///
porter activity





ion transport // inferred from
16021 // integral to
// inferred





electronic annotation
membrane // inferred from
from






electronic annotation ///
electronic






19867 // outer membrane //
annotation ///






inferred from electronic
15288 // porin






annotation
activity //







inferred from







electronic







annotation


(thiS)
thiS
sulfur carrier protein ThiS
6790 // sulfur metabolism //







inferred from electronic annotation


mcrD
mcrD /// yjiV
hypothetical protein





ygbA
ygbA
Hypothetical protein ygbA /// hypothetical protein





LDR-ABC


slyD
slyD
FKBP-type peptidyl-prolyl cis-trans isomerase slyD /// FKBP-
6457 // protein folding // inferred

5515 //




type peptidyl prolyl cis-trans isomerase (rotamase)
from electronic annotation

protein







binding //







inferred from







physical







interaction ///







3755 //







peptidyl-







prolyl cis-







trans







isomerase







activity //







inferred from







electronic







annotation ///







5507 // copper







ion binding //







inferred from







electronic







annotation ///







8270 // zinc







ion bindin


yliG
yliG
Hypothetical protein yliG /// predicted SAM-dependent


3824 //




methyltransferase


catalytic







activity //







inferred from







electronic







annotation ///







5506 // iron







ion binding //







inferred from







electronic







annotation ///







46872 // metal







ion binding //







inferred from







electronic







annotation ///







51536 // iron-







sulfur cluster







binding //







inferre


yfiD
yfiD
Protein yfiD /// pyruvate formate lyase subunit
8152 // metabolism // inferred

3824 //





from electronic annotation

catalytic







activity //







inferred from







electronic







annotation


ycfF
ycfF /// hinT
HIT-like protein ycfF /// purine nucleoside phosphoramidase





metJ
metJ
transcriptional repressor protein
6350 // transcription // inferred

3677 // DNA




MetJ
from electronic annotation /// 6355

binding //





// regulation of transcription,

inferred from





DNA-dependent // inferred from

electronic





electronic annotation /// 6555 //

annotation ///





methionine metabolism // inferred

3700 //





from electronic annotation /// 8652

transcription





// amino acid bios

factor activity







// inferred







from







electronic







annotation ///







16564 //







transcriptional







repressor







activity //







inferred from







electronic







annotation


yicC
yicC
Protein yicC





fecR
fecR
KpLE2 phage-like element; transmembrane signal
6810 // transport // inferred from
42597 // periplasmic space //
5506 // iron




transducer for ferric citrate transport
electronic annotation /// 6811 //
inferred from electronic
ion binding //





ion transport // inferred from
annotation
inferred from





electronic annotation /// 6826 //

electronic





iron ion transport // inferred from

annotation





electronic annotation /// 50896 //





response to stimulus // inferred





from electronic a


rpsR
rpsR
30S ribosomal protein S18
6412 // protein biosynthesis //
5622 // intracellular //
3723 // RNA





inferred from electronic annotation
inferred from electronic
binding //






annotation /// 5840 //
inferred from






ribosome // inferred from
electronic






electronic annotation ///
annotation ///






30529 // ribonucleoprotein
3735 //






complex // inferred from
structural






electronic annotation
constituent of







ribosome //







inferred from







electronic







annotation ///







19843 //







rRNA binding







// inferred







from







electronic







annotation


ycgN
ycgN
hypothetical protein





448#2
norV
pseudo /// anaerobic nitric oxide reductase flavorubredoxin
6118 // electron transport /// 6810

5506 // iron





// transport

ion binding







/// 10181 //







FMN binding







/// 16491 //







oxidoreductase







activity ///







46872 // metal







ion binding


rbsB
rbsB
D-ribose-binding periplasmic protein precursor /// D-ribose
6810 // transport // inferred from
42597 // periplasmic space //
5351 // sugar




transporter subunit
electronic annotation /// 6935 //
inferred from electronic
porter activity





chemotaxis // inferred from
annotation
// inferred





electronic annotation

from







electronic







annotation ///







15407 //







monosaccharide-







transporting







ATPase







activity //







inferred from







electronic







annotation ///







16787 //







hydrolase







activity //







inferred from







electronic







annotation


argB
argB
acetylglutamate kinase
6526 // arginine biosynthesis //
5737 // cytoplasm // inferred
166 //





inferred from electronic annotation
from electronic annotation
nucleotide





/// 8652 // amino acid biosynthesis

binding //





// inferred from electronic

inferred from





annotation /// 9085 // lysine

electronic





biosynthesis // inferred from

annotation ///





electronic annotation

3991 //







acetylglutamate







kinase







activity //







inferred from







electronic







annotation ///







5524 // ATP







binding //







inferred from







electronic







annotation ///







16301 //







kinase activity







// inferred fro


hoxK
hoxK
hypothetical protein





yfhD
yfhD
Hypothetical protein yfhD /// predicted transglycosylase
6810 // transport // inferred from
30288 // periplasmic space
5215 //





electronic annotation
(sensu Proteobacteria) //
transporter






inferred from electronic
activity //






annotation
inferred from







electronic







annotation


yjgF
yjgF
Protein yjgF





fadB
fadB
3-hydroxyacyl-CoA dehydrogenase /// fused 3-
6629 // lipid metabolism // inferred
16507 // fatty acid beta-
3824 //




hydroxybutyryl-CoA epimerase/delta(3)-cis-delta(2)-
from electronic annotation /// 6631
oxidation multienzyme
catalytic




trans-enoyl-CoA isomerase/enoyl-CoA
// fatty acid metabolism // inferred
complex // inferred from
activity //




hydratase/3-hydroxyacyl-CoA dehydrogenase
from electronic annotation /// 8152
electronic annotation
inferred from





// metabolism // inferred from

electronic





electronic annotation /// 9062 //

annotation ///





fatty acid catabolism // inferred

3857 // 3-





from elect

hydroxyacyl-







CoA







dehydrogenase







activity //







inferred from







electronic







annotation ///







4165 //







dodecenoyl-







CoA delta-







isomerase







activity //







inferred from







electronic







annotation ///







43


(phnE)
phnE
membrane channel protein
6810 // transport // inferred from
5887 // integral to plasma
5215 //




component of Pn transporter
electronic annotation /// 15716 //
membrane // inferred from
transporter





phosphonate transport // inferred
electronic annotation ///
activity //





from electronic annotation
16020 // membrane // inferred
inferred from






from electronic annotation ///
electronic






16021 // integral to
annotation ///






membrane // inferred from
15604 //






electronic annotation
phosphonate







transporter







activity //







inferred from







electronic







annotation


gabD
gabD
Succinate-semialdehyde
8152 // metabolism // inferred

9013 //




dehydrogenase [NADP+] /// succinate-semialdehyde
from electronic annotation

succinate-




dehydrogenase I, NADP-dependent


semialdehyde







dehydrogenase







[NAD(P)+]







activity //







inferred from







electronic







annotation ///







16491 //







oxidoreductase







activity //







inferred from







electronic







annotation ///







16620 //







oxidoreductase







activity,







acting on the







aldehyde or







oxo group of


rbfA
rbfA
ribosome-binding factor A
6364 // rRNA processing //







inferred from electronic annotation


rpsL
rpsL
30S ribosomal protein S12
6412 // protein biosynthesis //
5622 // intracellular //
49 // tRNA





inferred from electronic annotation
inferred from electronic
binding //





/// 46677 // response to antibiotic //
annotation /// 5840 //
inferred from





inferred from electronic annotation
ribosome // inferred from
electronic






electronic annotation ///
annotation ///






15935 // small ribosomal
3676 //






subunit // inferred from
nucleic acid






electronic annotation ///
binding //






30529 // ribonucleoprotein
inferred from






complex // inferred from
electronic






electronic annotation
annotation ///







3723 // RNA







binding //







inferred from







electronic







annotation ///







3735 //







structural







constituent of







ribosome //







inferred fro


yjbL
yjbL
hypothetical protein





nuoE
nuoE
ATP synthase subunit E /// NADH dehydrogenase subunit E
6120 // mitochondrial electron

5506 // iron





transport, NADH to ubiquinone //

ion binding //





inferred from electronic annotation

inferred from







electronic







annotation ///







8137 //







NADH







dehydrogenase







(ubiquinone)







activity //







inferred from







electronic







annotation ///







16491 //







oxidoreductase







activity //







inferred from







electronic







annotation ///







46872 // metal







ion


(gntU)
gntU
Low-affinity gluconate transport permease protein,
6810 // transport // inferred from
16020 // membrane // inferred
5351 // sugar




interrupted /// gluconate transporter, low affinity GNT 1
electronic annotation /// 15725 //
from electronic annotation ///
porter activity




system
gluconate transport // inferred
16021 // integral to
// inferred





from electronic annotation ///
membrane // inferred from
from





19521 // D-gluconate metabolism
electronic annotation
electronic





// inferred from electronic

annotation ///





annotation

15128 //







gluconate







transporter







activity //







inferred from







electronic







annotation


rplP
rplP
50S ribosomal protein L16
6412 // protein biosynthesis //
5622 // intracellular //
49 // tRNA





inferred from electronic annotation
inferred from electronic
binding //






annotation /// 5840 //
inferred from






ribosome // inferred from
electronic






electronic annotation ///
annotation ///






30529 // ribonucleoprotein
3723 // RNA






complex // inferred from
binding //






electronic annotation
inferred from







electronic







annotation ///







3735 //







structural







constituent of







ribosome //







inferred from







electronic







annotation ///







19843 //







rRNA binding







// inferred







from elect


ubiG
ubiG
3-demethylubiquinone-9 3-methyltransferase
6744 // ubiquinone biosynthesis //

5515 //





inferred from electronic annotation

protein







binding //







inferred from







physical







interaction ///







8168 //







methyltransferase







activity //







inferred from







electronic







annotation ///







8425 // 2-







polyprenyl-6-







methoxy-1,4-







benzoquinone







methyltransferase







activity //







inferred from







electronic







annotati


phnG
phnG
PhnG protein /// carbon-phosphorus lyase complex
15716 // phosphonate transport //






subunit
inferred from electronic annotation





/// 19634 // phosphonate





metabolism // inferred from





electronic annotation


mviN
mviN
Virulence factor mviN homolog /// predicted inner membrane
9405 // pathogenesis // inferred
16020 // membrane // inferred





protein
from electronic annotation
from electronic annotation ///






16021 // integral to






membrane // inferred from






electronic annotation


ydfA
ydfA
Hypothetical protein ydfB ///







Hypothetical protein ydfA ///




Qin prophage; predicted protein


rpmJ
rpmJ
50S ribosomal protein L36
6412 // protein biosynthesis //
5622 // intracellular //
3735 //





inferred from electronic annotation
inferred from electronic
structural






annotation /// 5840 //
constituent of






ribosome // inferred from
ribosome //






electronic annotation ///
inferred from






30529 // ribonucleoprotein
electronic






complex // inferred from
annotation






electronic annotation


yihD
yihD
Protein yihD /// hypothetical







protein


ydfR
ydfR
Hypothetical protein ydfR





baiF
baiF
hypothetical protein





ygbO
ygbO
tRNA pseudouridine synthase D
8033 // tRNA processing //

4730 //





inferred from electronic annotation

pseudouridylate





/// 31119 // tRNA pseudouridine

synthase





synthesis // inferred from

activity //





electronic annotation

inferred from







electronic







annotation ///







16439 //







tRNA-







pseudouridine







synthase







activity //







inferred from







electronic







annotation ///







16853 //







isomerase







activity //







inferred from







electronic







annotation


yfeC
yfeC
Hypothetical protein yfeC ///







predicted DNA-binding transcriptional regulator


basS
basS
Sensor protein basS/pmrB ///
160 // two-component signal
16020 // membrane // inferred
155 // two-




sensory histidine kinase in two-component regulatory system
transduction system
from electronic annotation ///
component




with BasR
(phosphorelay) // inferred from
16021 // integral to
sensor activity





electronic annotation /// 6468 //
membrane // inferred from
// inferred





protein amino acid
electronic annotation
from





phosphorylation // inferred from

electronic





electronic annotation /// 7165 //

annotation ///





signal transduction // inferred from

4871 // signal





electronic annotation

transducer







activity //







inferred from







electronic







annotation ///







5524 // ATP







binding //







inferred from







electronic







annotation ///







16301 //







kinase activity







// inferr


hisS
hisS
histidyl-tRNA synthetase
6412 // protein biosynthesis //
5737 // cytoplasm // inferred
166 //





inferred from electronic annotation
from electronic annotation
nucleotide





/// 6418 // tRNA aminoacylation

binding //





for protein translation // inferred

inferred from





from electronic annotation /// 6427

electronic





// histidyl-tRNA aminoacylation //

annotation ///





inferred from electronic annotation

4812 //







aminoacyl-







tRNA ligase







activity //







inferred from







electronic







annotation ///







4821 //







histidine-







tRNA ligase







activity //







inferred from







electronic







annotation ///







5524 // ATP







binding //


rpiB
rpiB
ribose-5-phosphate isomerase B
5975 // carbohydrate metabolism //

5515 //





inferred from electronic annotation

protein





/// 6098 // pentose-phosphate shunt

binding //





// inferred from electronic

inferred from





annotation

physical







interaction ///







4751 // ribose-







5-phosphate







isomerase







activity //







inferred from







electronic







annotation ///







16853 //







isomerase







activity //







inferred from







electronic







annotation


yhcJ
yhcJ /// nanE
Hypothetical protein yhcJ /// predicted N-acetylmannosamine-6-P
5975 // carbohydrate metabolism //

5515 //




epimerase
inferred from electronic annotation

protein





/// 6051 // N-acetylmannosamine

binding //





metabolism // inferred from

inferred from





electronic annotation

physical







interaction ///







16853 //







isomerase







activity //







inferred from







electronic







annotation ///







16857 //







racemase and







epimerase







activity,







acting on







carbohydrates







and







derivatives //







inferred from







electronic







annotati


atpB
atpB
ATP synthase subunit A /// F0F1 ATP synthase subunit A
6810 // transport // inferred from
16020 // membrane // inferred
15078 //





electronic annotation /// 6811 //
from electronic annotation ///
hydrogen ion





ion transport // inferred from
16021 // integral to
transporter





electronic annotation /// 15992 //
membrane // inferred from
activity //





proton transport // inferred from
electronic annotation ///
inferred from





electronic annotation
16469 // proton-transporting
electronic






two-sector ATPase complex
annotation ///






// inferred from electronic
16820 //






annotation /// 45263 //
hydrolase






proton-transpo
activity,







acting on acid







anhydrides,







catalyzing







transmembrane







movement







of substances







// inferred







from







electronic







annotation ///







16787 //







hydrolase act


yijF
yijF
Hypothetical protein yijF precursor /// hypothetical protein





clpX
clpX
ATP-dependent protease ATP-
6457 // protein folding // inferred

166 //




binding subunit
from electronic annotation ///

nucleotide





15031 // protein transport //

binding //





inferred from electronic annotation

inferred from





/// 19538 // protein metabolism //

electronic





inferred from electronic annotation

annotation ///





/// 6986 // response to unfolded

5515 //





protein // inferre

protein







binding //







inferred from







electronic







annotation ///







5524 // ATP







binding //







inferred from







electronic







annotation ///







8270 // zinc







ion binding //







inferred from







electronic ann


aspA
aspA
Aspartate ammonia-lyase
6531 // aspartate metabolism //

5515 //





inferred from electronic annotation

protein







binding //







inferred from







physical







interaction ///







3824 //







catalytic







activity //







inferred from







electronic







annotation ///







8797 //







aspartate







ammonia-







lyase activity







// inferred







from







electronic







annotation ///







16829 // lyase







activity //







inferred


yigK
yigK /// rhtB
Homoserine/homoserine lactone
6810 // transport // inferred from
16020 // membrane // inferred
5293 // lysine




efflux protein /// neutral amino-
electronic annotation /// 6865 //
from electronic annotation ///
permease




acid efflux system
amino acid transport // inferred
16021 // integral to
activity //





from electronic annotation
membrane // inferred from
inferred from






electronic annotation
electronic







annotation


yjhO
yjhO /// sgcX
KpLE2 phage-like element;


16787 //




predicted endoglucanase with Zn-dependent exopeptidase


hydrolase




domain


activity //







inferred from







electronic







annotation


cysD
cysD
sulfate adenylyltransferase
103 // sulfate assimilation //

4781 // sulfate




subunit 2
inferred from electronic annotation

adenylyltransferase





/// 8152 // metabolism // inferred

(ATP)





from electronic annotation /// 8652

activity //





// amino acid biosynthesis //

inferred from





inferred from electronic annotation

electronic





/// 19344 // cysteine biosynthesis //

annotation ///





inferred from electronic annotation

16740 //







transferase







activity //







inferred from







electronic







annotation ///







16779 //







nucleotidyltransferase







activity //







inferred from







electronic







annotation


yjbR
yjbR
Protein yjbR /// hypothetical







protein


yiiF
yiiF
hypothetical protein





cysW
cysW
sulfate/thiosulfate transporter
6810 // transport // inferred from
9276 // cell wall (sensu
5215 //




subunit
electronic annotation /// 8272 //
Proteobacteria) // inferred
transporter





sulfate transport // inferred from
from electronic annotation ///
activity //





electronic annotation
16020 // membrane // inferred
inferred from






from electronic annotation ///
electronic






16021 // integral to
annotation ///






membrane // inferred from
15116 //






electronic annotation
sulfate







transporter







activity //







inferred from







electronic







annotation ///







15563 //







uptake







permease







activity //







inferred from







electronic







annotation


221#15


rpsG
rpsG
30S ribosomal protein S7
6412 // protein biosynthesis //
5622 // intracellular //
49 // tRNA





inferred from electronic annotation
inferred from electronic
binding //






annotation /// 5840 //
inferred from






ribosome // inferred from
electronic






electronic annotation ///
annotation ///






15935 // small ribosomal
3723 // RNA






subunit // inferred from
binding //






electronic annotation ///
inferred from






30529 // ribonucleoprotein
electronic






complex // inferred from
annotation ///






electronic annotation
3735 //







structural







constituent of







ribosome //







inferred from







electronic







annotation ///







19843 //







rRNA binding







// inferred







from elect


yhfR
yhfR /// frlR
predicted DNA-binding
6350 // transcription // inferred
5622 // intracellular //
5515 //




transcriptional regulator
from electronic annotation /// 6355
inferred from electronic
protein





// regulation of transcription,
annotation
binding //





DNA-dependent // inferred from

inferred from





electronic annotation /// 45449 //

physical





regulation of transcription //

interaction ///





inferred from electronic annotation

3677 // DNA







binding //







inferred from







electronic







annotation ///







3700 //







transcription







factor activity







// inferred







from







electronic







annotation ///







30528 //







transcription







regulator







activity //


flxA
flxA
Qin prophage; predicted protein





agaZ
agaZ /// kbaZ
Putative tagatose 6-phosphate
19402 // galactitol metabolism //

9024 //




kinase agaZ /// tagatose 6-phosphate aldolase 1, kbaZ subunit
inferred from electronic annotation

tagatose-6-







phosphate







kinase activity







// inferred







from







electronic







annotation ///







16301 //







kinase activity







// inferred







from







electronic







annotation ///







16740 //







transferase







activity //







inferred from







electronic







annotation


(ycfA)
tfaE
e14 prophage; predicted tail fiber assembly protein





yhfV
yhfV
Phosphotriesterase homology
9056 // catabolism // inferred from

8270 // zinc




protein
electronic annotation

ion binding //







inferred from







electronic







annotation ///







16788 //







hydrolase







activity,







acting on ester







bonds //







inferred from







electronic







annotation


yieJ
yieJ /// cbrC
hypothetical protein





yjbQ
yjbQ
Hypothetical protein yjbQ ///







hypothetical protein


ptsN
ptsN
Nitrogen regulatory IIA protein /// sugar-specific enzyme IIA
6810 // transport // inferred from

5351 // sugar




component of PTS
electronic annotation /// 9401 //

porter activity





phosphoenolpyruvate-dependent

// inferred





sugar phosphotransferase system //

from





inferred from electronic annotation

electronic







annotation ///







8982 //







protein-N(PI)-







phosphohistidine-







sugar







phosphotransferase







activity







// inferred







from







electronic







annotation ///







16740 //







transferase







activity //







inferred from







electronic







annotati


ygbD
ygbD
nitric oxide reductase
6118 // electron transport //

15036 //





inferred from electronic annotation

disulfide







oxidoreductase







activity //







inferred from







electronic







annotation ///







16491 //







oxidoreductase







activity //







inferred from







electronic







annotation ///







16731 //







oxidoreductase







activity,







acting on







iron-sulfur







proteins as







donors, NAD







or NADP as







ac


fimB
fimB
Type 1 fimbriae Regulatory
6310 // DNA recombination //
9289 // fimbrium // inferred
3677 // DNA




protein fimB /// tyrosine recombinase/inversion of on/off
inferred from electronic annotation
from electronic annotation
binding //




regulator of fimA
/// 6313 // transposition, DNA-

inferred from





mediated // inferred from

electronic





electronic annotation /// 6350 //

annotation





transcription // inferred from





electronic annotation /// 6355 //





regulation of transcription, DNA-d


lasT
lasT /// yjtD
Hypothetical tRNA/rRNA
6396 // RNA processing // inferred

3723 // RNA




methyltransferase lasT /// predicted rRNA methyltransferase
from electronic annotation

binding //







inferred from







electronic







annotation ///







8168 //







methyltransferase







activity //







inferred from







electronic







annotation ///







8173 // RNA







methyltransferase







activity //







inferred from







electronic







annotation ///







16740 //







transferase







activity //


ivbL
ivbL
IlvBN operon leader peptide ///
8652 // amino acid biosynthesis //






ilvB operon leader peptide
inferred from electronic annotation





/// 9082 // branched chain family





amino acid biosynthesis // inferred





from electronic annotation


gst
gst
Glutathione S-transferase /// glutathionine S-transferase


4364 //







glutathione







transferase







activity //







inferred from







electronic







annotation ///







16740 //







transferase







activity //







inferred from







electronic







annotation


phbA
phbA
hypothetical protein





phnM
phnM
PhnM protein


16787 //







hydrolase







activity //







inferred from







electronic







annotation


yhgI
yhgI /// gntY
Protein yhgI /// predicted







gluconate transport associated




protein


trmD
trmD
tRNA (guanine-N(1)-)-
6400 // tRNA modification //

3723 // RNA




methyltransferase
inferred from electronic annotation

binding //





/// 8033 // tRNA processing //

inferred from





inferred from electronic annotation

electronic







annotation ///







8168 //







methyltransferase







activity //







inferred from







electronic







annotation ///







8175 // tRNA







methyltransferase







activity //







inferred from







electronic







annotation ///







9019 // tRNA







(guanine-N1-)-







meth


grxA
grxA
Glutaredoxin 1 /// glutaredoxin
6118 // electron transport //

5515 //




1, redox coenzyme for
inferred from electronic annotation

protein




ribonucleotide reductase
/// 6810 // transport // inferred

binding //




(RNR1a)
from electronic annotation /// 9263

inferred from





// deoxyribonucleotide

physical





biosynthesis // inferred from

interaction ///





electronic annotation /// 45454 //

9055 //





cell redox homeostasis // infer

electron







carrier activity







// inferred







from







electronic







annotation ///







15035 //







protein







disulfide







oxidoreductase







activity //







inferred from







electronic







annotation


gcpE
gcpE /// ispG
4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase /// 4-
8299 // isoprenoid biosynthesis //

5506 // iron




hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
inferred from electronic annotation

ion binding //





/// 16114 // terpenoid biosynthesis

inferred from





// inferred from electronic

electronic





annotation

annotation ///







16491 //







oxidoreductase







activity //







inferred from







electronic







annotation ///







16728 //







oxidoreductase







activity,







acting on CH2







groups,







disulfide as







acceptor //







inferred from







electronic







annot


ycdB
ycdB
hypothetical protein





yfiC
yfiC
Hypothetical protein yfiC /// predicted S-adenosyl-L-


8168 //




methionine-dependent methyltransferase


methyltransferase







activity //







inferred from







electronic







annotation ///







16740 //







transferase







activity //







inferred from







electronic







annotation


(rcsC)
rcsC
Sensor protein rcsC /// hybrid
160 // two-component signal
16020 // membrane // inferred
155 // two-




sensory kinase in two-
transduction system
from electronic annotation ///
component




component regulatory system with RcsB and YojN
(phosphorelay) // inferred from
16021 // integral to
sensor activity





electronic annotation /// 6355 //
membrane // inferred from
// inferred





regulation of transcription, DNA-
electronic annotation ///
from





dependent // inferred from
30113 // capsule (sensu
electronic





electronic annotation /// 6468 //
Bacteria) // inferred from
annotation ///





protein amino acid
electronic annotation
156 // two-





phosphorylation // inferred from

component





electronic annotation

response







regulator







activity //







inferred from







electronic







annotation ///







4871 // signal







transducer







activity //







inferred from







electronic







annotation ///







5524


yedF
yedF
Hypothetical protein yedF ///







hypothetical protein


yfjI
yfjI
CP4-57 prophage; predicted







protein


yhdN
yhdN
Hypothetical protein yhdN ///







hypothetical protein


phnB
phnB
PhnB protein /// hypothetical







protein


(yfjV)
yfjV
CP4-57 prophage; predicted
46685 // response to arsenic //
16020 // membrane // inferred
15105 //




protein
inferred from electronic annotation
from electronic annotation ///
arsenite






16021 // integral to
transporter






membrane // inferred from
activity //






electronic annotation
inferred from







electronic







annotation


ydhL
ydhL
Hypothetical protein ydhL







precursor


yhaN
yhaN
hypothetical protein ///

16020 // membrane // inferred





hypothetical protein

from electronic annotation ///






16021 // integral to






membrane // inferred from






electronic annotation


slyA
slyA
transcriptional regulator SlyA
6350 // transcription // inferred
5622 // intracellular //
3677 // DNA





from electronic annotation /// 6355
inferred from electronic
binding //





// regulation of transcription,
annotation
inferred from





DNA-dependent // inferred from

electronic





electronic annotation /// 9405 //

annotation ///





pathogenesis // inferred from

3700 //





electronic annotation

transcription







factor activity







// inferred







from







electronic







annotation


411#4
yfdK
Hypothetical protein yfdK





633#5


folP
folP
Dihydropteroate synthase ///
9396 // folic acid and derivative

4156 //




7,8-dihydropteroate synthase
biosynthesis // inferred from

dihydropteroate





electronic annotation /// 46656 //

synthase





folic acid biosynthesis // inferred

activity //





from electronic annotation ///

inferred from





46677 // response to antibiotic //

electronic





inferred from electronic annotation

annotation ///







16740 //







transferase







activity //







inferred from







electronic







annotation


336#6


glvB
glvB
arbutin specific enzyme IIB
6810 // transport // inferred from
16020 // membrane // inferred
5351 // sugar




component of PTS
electronic annotation /// 9401 //
from electronic annotation ///
porter activity





phosphoenolpyruvate-dependent
16021 // integral to
// inferred





sugar phosphotransferase system //
membrane // inferred from
from





inferred from electronic annotation
electronic annotation
electronic







annotation ///







8982 //







protein-N(PI)-







phosphohistidine-







sugar







phosphotransferase







activity







// inferred







from







electronic







annotation ///







16740 //







transferase







activity //







inferred from







electronic







annotati


hflX
hflX
GTP-binding protein hflX ///
7264 // small GTPase mediated
5622 // intracellular //
166 //




predicted GTPase
signal transduction // inferred from
inferred from electronic
nucleotide





electronic annotation
annotation
binding //







inferred from







electronic







annotation ///







5525 // GTP







binding //







inferred from







electronic







annotation ///







8233 //







peptidase







activity //







inferred from







electronic







annotation


hemX
hemX
Putative uroporphyrin-III C-
6779 // porphyrin biosynthesis //
16020 // membrane // inferred
4851 //




methyltransferase /// predicted uroporphyrinogen III methylase
inferred from electronic annotation
from electronic annotation ///
uroporphyrin-






16021 // integral to
III C-






membrane // inferred from
methyltransferase






electronic annotation
activity //







inferred from







electronic







annotation ///







8168 //







methyltransferase







activity //







inferred from







electronic







annotation ///







16740 //







transferase







activity //







inferred from







electronic







annotation


yceD
yceD
Hypothetical protein yceD ///







hypothetical protein


ptpS
ptpS /// ygcM
Putative 6-pyruvoyl tetrahydrobiopterin synthase ///
6729 // tetrahydrobiopterin

3874 // 6-




6-pyruvoyl tetrahydrobiopterin synthase (PTPS)
biosynthesis // inferred from

pyruvoyltetra





electronic annotation

hydropterin







synthase







activity //







inferred from







electronic







annotation ///







8270 // zinc







ion binding //







inferred from







electronic







annotation ///







16829 // lyase







activity //







inferred from







electronic







annotation ///







46872 // metal







ion bindi


ibpB
ibpB
16 kDa heat shock protein B /// heat shock chaperone
6457 // protein folding // inferred

5515 //





from electronic annotation /// 6986

protein





// response to unfolded protein //

binding //





inferred from electronic annotation

inferred from





/// 50821 // protein stabilization //

physical





inferred from electronic annotation

interaction ///







51082 //







unfolded







protein







binding //







inferred from







electronic







annotation


ecnB
ecnB
Putative toxin of osmotically
9636 // response to toxin //
16020 // membrane // inferred





regulated toxin-antitoxin system
inferred from electronic annotation
from electronic annotation




associated with programmed




cell death /// entericidin B




membrane lipoprotein


aceB
aceB
malate synthase
6097 // glyoxylate cycle // inferred

4474 // malate





from electronic annotation /// 6099

synthase





// tricarboxylic acid cycle //

activity //





inferred from electronic annotation

inferred from







electronic







annotation ///







16829 // lyase







activity //







inferred from







electronic







annotation


yehR
yehR
Hypothetical lipoprotein yehR precursor /// hypothetical protein

16020 // membrane // inferred







from electronic annotation


hsdM
hsdM
DNA methylase M
6306 // DNA methylation //

5515 //





inferred from electronic annotation

protein





/// 9307 // DNA restriction-

binding //





modification system // inferred

inferred from





from electronic annotation

physical







interaction ///







3677 // DNA







binding //







inferred from







electronic







annotation ///







8168 //







methyltransferase







activity //







inferred from







electronic







annotation ///







8170 // N-







methyltransferase







activity //







inferred


yrbB
yrbB
Hypothetical protein yrbB ///







hypothetical protein


hypC
hypC
Hydrogenase isoenzyme


3676 //




formation protein hypC ///


nucleic acid




protein required for maturation


binding //




of hydrogenases 1 and 3


inferred from







electronic







annotation


vacJ
vacJ
VacJ lipoprotein precursor /// predicted lipoprotein

16020 // membrane // inferred







from electronic annotation ///






19867 // outer membrane //






inferred from electronic






annotation


405#2


ydfD
ydfD
Hypothetical protein ydfD





rpsK
rpsK
30S ribosomal protein S11
6412 // protein biosynthesis //
5622 // intracellular //
3723 // RNA





inferred from electronic annotation
inferred from electronic
binding //






annotation /// 5840 //
inferred from






ribosome // inferred from
electronic






electronic annotation ///
annotation ///






30529 // ribonucleoprotein
3735 //






complex // inferred from
structural






electronic annotation
constituent of







ribosome //







inferred from







electronic







annotation ///







19843 //







rRNA binding







// inferred







from







electronic







annotation


yieF
yieF
Hypothetical protein yieF ///


16491 //




chromate reductase, Class I,


oxidoreductase




flavoprotein


activity //







inferred from







electronic







annotation


sixA
slp
Outer membrane protein slp

16020 // membrane // inferred





precursor /// outer membrane lipoprotein

from electronic annotation ///






19867 // outer membrane //






inferred from electronic






annotation


yijD
yijD
Hypothetical protein yijD /// conserved inner membrane

16020 // membrane // inferred





protein

from electronic annotation ///






16021 // integral to






membrane // inferred from






electronic annotation


fliS
fliS
flagellar protein FliS
9296 // flagellum biogenesis //
9288 // flagellum (sensu






inferred from electronic annotation
Bacteria) // inferred from






electronic annotation ///






19861 // flagellum // inferred






from electronic annotation


hycA
hycA
Formate hydrogenlyase
6350 // transcription // inferred

16829 // lyase




Regulatory protein hycA ///
from electronic annotation /// 6355

activity //




regulator of the transcriptional
// regulation of transcription,

inferred from




regulator FhlA
DNA-dependent // inferred from

electronic





electronic annotation

annotation


wcaA
wcaA
Putative colanic acid biosynthesis glycosyl
9103 // lipopolysaccharide

16740 //




transferase wcaA /// predicted glycosyl transferase
biosynthesis // inferred from

transferase





electronic annotation

activity //







inferred from







electronic







annotation


yhaC
yhaC
hypothetical protein





348#4
yoeE
hypothetical protein





yjhF
yjhF
KpLE2 phage-like element;
6810 // transport // inferred from
16020 // membrane // inferred
15128 //




predicted transporter
electronic annotation /// 15725 //
from electronic annotation ///
gluconate





gluconate transport // inferred
16021 // integral to
transporter





from electronic annotation
membrane // inferred from
activity //






electronic annotation
inferred from







electronic







annotation


recN
recN
DNA repair protein recN /// recombination and repair protein
6281 // DNA repair // inferred
5694 // chromosome //
5515 //





from electronic annotation /// 6310
inferred from electronic
protein





// DNA recombination // inferred
annotation /// 16020 //
binding //





from electronic annotation /// 6974
membrane // inferred from
inferred from





// response to DNA damage
electronic annotation
physical





stimulus // inferred from electronic

interaction ///





annotation /// 51276 //

166 //





chromosome organization and

nucleotide





biog

binding //







inferred from







electronic







annotation ///







5524 // ATP







binding //







inferred from







electronic







annotation


lldR
lldR
Putative L-lactate dehydrogenase operon
6350 // transcription // inferred
5622 // intracellular //
3677 // DNA




Regulatory protein
from electronic annotation /// 6355
inferred from electronic
binding //





// regulation of transcription,
annotation
inferred from





DNA-dependent // inferred from

electronic





electronic annotation

annotation ///







3700 //







transcription







factor activity







// inferred







from







electronic







annotation


yihA
yihA
GTP-binding protein
917 // barrier septum formation //
5622 // intracellular //
166 //





inferred from electronic annotation
inferred from electronic
nucleotide





/// 7049 // cell cycle // inferred
annotation
binding //





from electronic annotation ///

inferred from





51301 // cell division // inferred

electronic





from electronic annotation

annotation ///







5525 // GTP







binding //







inferred from







electronic







annotation


ydiL
ydiL
Hypothetical protein ydiL ///







hypothetical protein


tdcB
tdcB
threonine dehydratase
6520 // amino acid metabolism //

3824 //





inferred from electronic annotation

catalytic





/// 8152 // metabolism // inferred

activity //





from electronic annotation

inferred from







electronic







annotation ///







4794 //







threonine







ammonia-







lyase activity







// inferred







from







electronic







annotation ///







16829 // lyase







activity //







inferred from







electronic







annotation


umuD
umuD
UmuD protein /// DNA polymerase V, subunit D
6280 // mutagenesis // inferred

3677 // DNA





from electronic annotation /// 6281

binding //





// DNA repair // inferred from

inferred from





electronic annotation /// 6508 //

electronic





proteolysis // inferred from

annotation ///





electronic annotation /// 6974 //

4252 // serine-





response to DNA damage stimulus

type





// inferred from electronic

endopeptidase





annotation

activity //







inferred from







electronic







annotation ///







8233 //







peptidase







activity //







inferred from







electronic







annotation ///







8236 // serine-







type peptidase







activi


rplQ
rplQ
50S ribosomal protein L17
6412 // protein biosynthesis //
5622 // intracellular //
3735 //





inferred from electronic annotation
inferred from electronic
structural






annotation /// 5840 //
constituent of






ribosome // inferred from
ribosome //






electronic annotation ///
inferred from






30529 // ribonucleoprotein
electronic






complex // inferred from
annotation






electronic annotation


yjbI
yjbI
hypothetical protein





infC
infC
Translation initiation factor IF-3
6412 // protein biosynthesis //

3743 //





inferred from electronic annotation

translation





/// 6413 // translational initiation //

initiation





inferred from electronic annotation

factor activity





/// 6417 // regulation of protein

// inferred





biosynthesis // inferred from

from





electronic annotation /// 6445 //

electronic





regulation of

annotation ///







3723 // RNA







binding //







inferred from







electronic







annotation


aroF
aroF
3-deoxy-7-phosphoheptulonate synthase /// 3-deoxy-D-arabino-
8652 // amino acid biosynthesis //

3849 // 3-




heptulosonate-7-phosphate synthase, tyrosine-repressible
inferred from electronic annotation

deoxy-7-





/// 9058 // biosynthesis // inferred

phosphoheptulonate





from electronic annotation /// 9073

synthase





// aromatic amino acid family

activity //





biosynthesis // inferred from

inferred from





electronic annotation

electronic







annotation ///







16740 //







transferase







activity //







inferred from







electronic







annotation ///







16829 // lyase







activity //







inferred from







electronic







annotation


yjiA
yjiA
Hypothetical protein yjiA /// predicted GTPase


166 //







nucleotide







binding //







inferred from







electronic







annotation ///







5525 // GTP







binding //







inferred from







electronic







annotation


yohL
yohL
Hypothetical protein yohL ///







hypothetical protein


ftn
ftn
Ferritin 1 /// ferritin iron storage
6826 // iron ion transport //

4322 //




protein (cytoplasmic)
inferred from electronic annotation

ferroxidase





/// 6879 // iron ion homeostasis //

activity //





inferred from electronic annotation

inferred from







electronic







annotation ///







5488 //







binding //







inferred from







electronic







annotation ///







5506 // iron







ion binding //







inferred from







electronic







annotation ///







8199 // ferric







iron binding //







inferred from







electronic


cysM
cysM
Cysteine synthase B /// cysteine synthase B (O-acetylserine
6535 // cysteine biosynthesis from

3824 //




sulfhydrolase B)
serine // inferred from electronic

catalytic





annotation /// 8152 // metabolism

activity //





// inferred from electronic

inferred from





annotation /// 8652 // amino acid

electronic





biosynthesis // inferred from

annotation ///





electronic annotation /// 19344 //

4124 //





cysteine biosynthesis //

cysteine







synthase







activity //







inferred from







electronic







annotation ///







16740 //







transferase







activity //







inferred from







electronic







annotation ///







16787 //







hydrolase







activity // inf


aroK
aroK
shikimate kinase I
8652 // amino acid biosynthesis //

166 //





inferred from electronic annotation

nucleotide





/// 9073 // aromatic amino acid

binding //





family biosynthesis // inferred

inferred from





from electronic annotation ///

electronic





16089 // aromatic amino acid

annotation ///





family biosynthesis, shikimate

287 //





pathway // inferred from electronic

magnesium





annotation

ion binding //







inferred from







electronic







annotation ///







4765 //







shikimate







kinase activity







// inferred







from







electronic







annotation ///







5524 // ATP







binding //







inferred from


eno
eno
phosphopyruvate hydratase
6096 // glycolysis // inferred from
15 // phosphopyruvate
287 //





electronic annotation
hydratase complex // inferred
magnesium






from electronic annotation
ion binding //







inferred from







electronic







annotation ///







4634 //







phosphopyruvate







hydratase







activity //







inferred from







electronic







annotation ///







16829 // lyase







activity //







inferred from







electronic







annotation ///







46872 // metal







ion binding //
















TABLE 4





SAM ANALYSIS OF HEALTHY CONTROLS (HC) VERSUS


ULCERATIVE COLITIS (UC) (see FIG. 3C)


















SPOT
PROTEIN
NAME
GO BP










6 Highly immunogenic proteins in UC










era
era
GTP-binding protein Era
50875 // cellular





physiological process //





inferred from electronic





annotation


purK
purK
phosphoribosylaminoimidazole
6164 // purine nucleotide




carboxylase
biosynthesis // inferred





from electronic





annotation /// 6189 // ‘de





novo’ IMP biosynthesis //





inferred from electronic





annotation


cadA
cadA
Lysine decarboxylase, inducible
6520 // amino acid




/// lysine decarboxylase 1
metabolism // inferred





from electronic





annotation


nrfF
nrfF
Formate-dependent nitrite





reductase complex nrfF subunit





precursor /// heme lyase





(NrfEFG) for insertion of heme





into c552, subunit NrfF



murA
murA
UDP-N-acetylglucosamine 1-
7049 // cell cycle //




carboxyvinyltransferase
inferred from electronic





annotation /// 8360 //





regulation of cell shape //





inferred from electronic





annotation /// 9252 //


tpiA
tpiA
triosephosphate isomerase
peptidoglycan





biosynthesis // inferred





from electronic





annotation /// 19277 //





UDP-N-





acetylgalactosamine bios





6094 // gluconeogenesis





// inferred from electronic





annotation /// 6096 //





glycolysis // inferred





from electronic





annotation /// 6098 //





pentose-phosphate shunt





// inferred from electronic





annotation /// 6633 //





fatty acid biosynthesis //





inferred from electronic





annotation







27 Highly immunogenic proteins in HC










yphA
yphA
Hypothetical protein yphA





/// predicted inner membrane





protein



pssR
pssR /// yifA /// hdfR
transcriptional regulator
6350 // transcription // inferred




HdfR /// transcriptional
from electronic annotation ///




regulator HdfR
6355 // regulation of





transcription, DNA-dependent //





inferred from electronic





annotation /// 45892 // negative





regulation of transcription,





DNA-dependent // inferred





from electronic annotat


yhdN
yhdN
Hypothetical protein yhdN





/// hypothetical protein



rplO
rplO
50S ribosomal protein L15
6412 // protein biosynthesis //





inferred from electronic





annotation


420#7
ypeA
putative acetyltransferase



yehK
yehK
hypothetical protein



yihG
yihG
Hypothetical protein yihG ///
8152 // metabolism // inferred




predicted endonuclease
from electronic annotation


sucB
sucB
dihydrolipoamide
6099 // tricarboxylic acid cycle




acetyltransferase
// inferred from electronic





annotation /// 8152 //





metabolism // inferred from





electronic annotation


yggH
yggH
tRNA (guanine-N(7)-)-
6400 // tRNA modification //




methyltransferase ///
inferred from electronic




tRNA(m7G46)-
annotation /// 8033 // tRNA




methyltransferase
processing // inferred from





electronic annotation


rpsK
rpsK
30S ribosomal protein S11
6412 // protein biosynthesis //





inferred from electronic





annotation


fadA
fadA
acetyl-CoA acetyltransferase
6629 // lipid metabolism //





inferred from electronic





annotation /// 6631 // fatty acid





metabolism // inferred from





electronic annotation /// 16042





// lipid catabolism // inferred





from electronic annotation


ydfO
ydfO
Hypothetical protein ydfO ///





Qin prophage; predicted





protein



yjhA
yjhA
Hypothetical protein yjhA
6810 // transport // inferred




precursor /// N-
from electronic annotation ///




acetylnuraminic acid outer
6811 // ion transport // inferred




membrane channel protein
from electronic annotation


yheU
yheU
hypothetical protein



rpsL
rpsL
30S ribosomal protein S12
6412 // protein biosynthesis //





inferred from electronic





annotation /// 46677 // response





to antibiotic // inferred from





electronic annotation


yibQ
yibQ
Hypothetical protein yibQ





precursor /// predicted





polysaccharide deacetylase



ycfF
ycfF /// hinT
HIT-like protein ycfF ///





purine nucleoside





phosphoramidase



yzgL
yzgL
hypothetical protein



yjfY
yjfY
hypothetical protein



316#4
rsxA
hypothetical protein



yneC
yneC
hypothetical protein



yneG
yneG
Hypothetical protein yneG





/// hypothetical protein



fabH
fabH
3-oxoacyl-(acyl carrier
6633 // fatty acid biosynthesis //




protein) synthase
inferred from electronic




ribonuclease activity
annotation /// 8610 // lipid





biosynthesis // inferred from





electronic annotation


menG
menG
regulator protein RraA



sixA
slp
Outer membrane protein slp





precursor /// outer membrane





lipoprotein



yhdM
yhdM /// zntR
zinc-responsive
6350 // transcription // inferred




transcriptional regulator
from electronic annotation ///





6355 // regulation of





transcription, DNA-dependent //





inferred from electronic





annotation


ptsH
ptsH
Phosphocarrier protein HPr
6810 // transport // inferred




/// phosphohistidinoprotein-
from electronic annotation ///




hexose phosphotransferase
9401 // phosphoenolpyruvate-




component of PTS system
dependent sugar




(Hpr)
phosphotransferase system //





inferred from electronic





annotation














SPOT
GO CC
GO MF












6 Highly immunogenic proteins in UC











era
5622 // intracellular // inferred
166 // nucleotide binding // inferred




from electronic annotation ///
from electronic annotation /// 3676 //




16020 // membrane // inferred
nucleic acid binding // inferred from




from electronic annotation
electronic annotation /// 3723 //





RNA binding // inferred from





electronic annotation /// 5525 // GTP





binding // inferred from electronic





annotation



purK
9320 //
166 // nucleotide binding // inferred




phosphoribosylaminoimidazole
from electronic annotation /// 3824 //




carboxylase complex // inferred
catalytic activity // inferred from




from electronic annotation
electronic annotation /// 4638 //





phosphoribosylaminoimidazole





carboxylase activity // inferred from





electronic annotation /// 5524 // ATP





bin



cadA
5737 // cytoplasm // inferred
5515 // protein binding // inferred




from electronic annotation
from physical interaction /// 3824 //





catalytic activity // inferred from





electronic annotation /// 8923 //





lysine decarboxylase activity //





inferred from electronic annotation





/// 16829 // lyase activity // inferred





from electronic annotation



nrfF
42597 // periplasmic space //
5506 // iron ion binding // inferred




inferred from electronic
from electronic annotation /// 46872




annotation
// metal ion binding // inferred from





electronic annotation



murA
5618 // cell wall // inferred from
8760 // UDP-N-acetylglucosamine




electronic annotation
1-carboxyvinyltransferase activity //





inferred from electronic annotation





/// 16740 // transferase activity //





inferred from electronic annotation



tpiA

4807 // triose-phosphate isomerase





activity // inferred from electronic





annotation /// 16853 // isomerase





activity // inferred from electronic





annotation









27 Highly immunogenic proteins in HC











yphA
16020 // membrane //





inferred from electronic





annotation /// 16021 //





integral to membrane //





inferred from electronic





annotation




pssR

3677 // DNA binding //





inferred from electronic





annotation /// 3700 //





transcription factor activity //





inferred from electronic





annotation



yhdN





rplO
5622 // intracellular //
3723 // RNA binding //




inferred from electronic
inferred from electronic




annotation /// 5840 //
annotation /// 3735 //




ribosome // inferred from
structural constituent of




electronic annotation ///
ribosome // inferred from




15934 // large ribosomal
electronic annotation ///




subunit // inferred from
19843 // rRNA binding //




electronic annotation ///
inferred from electronic




30529 // ribonucleoprotein
annotation




complex // inferred from





electronic annotation




420#7

8080 // N-acetyltransferase





activity /// 8415 //





acyltransferase activity ///





16740 // transferase activity





/// 16747 // transferase





activity, transferring groups





other than amino-acyl groups



yehK





yihG
16020 // membrane //
8415 // acyltransferase




inferred from electronic
activity // inferred from




annotation /// 16021 //
electronic annotation ///




integral to membrane //
16740 // transferase activity //




inferred from electronic
inferred from electronic




annotation
annotation /// 4519 //





endonuclease activity //





inferred from electronic





annotation



sucB
45252 // oxoglutarate
4149 // dihydrolipoyllysine-




dehydrogenase complex //
residue succinyltransferase




inferred from electronic
activity // inferred from




annotation
electronic annotation /// 5515





// protein binding // inferred





from electronic annotation ///





8415 // acyltransferase





activity // inferred from





electronic annotation /// 1674



yggH

8168 // methyltransferase





activity // inferred from





electronic annotation /// 8176





// tRNA (guanine-N7-)-





methyltransferase activity //





inferred from electronic





annotation /// 16740 //





transferase activity // inferred





from electronic annotation



rpsK
5622 // intracellular //
3723 // RNA binding //




inferred from electronic
inferred from electronic




annotation /// 5840 //
annotation /// 3735 //




ribosome // inferred from
structural constituent of




electronic annotation ///
ribosome // inferred from




30529 // ribonucleoprotein
electronic annotation ///




complex // inferred from
19843 // rRNA binding //




electronic annotation
inferred from electronic





annotation



fadA

3988 // acetyl-CoA C-





acyltransferase activity //





inferred from electronic





annotation /// 8415 //





acyltransferase activity //





inferred from electronic





annotation /// 16740 //





transferase activity // inferred





from electronic annotation



ydfO





yjhA
16020 // membrane //
5351 // sugar porter activity //




inferred from electronic
inferred from electronic




annotation /// 16021 //
annotation /// 15288 // porin




integral to membrane //
activity // inferred from




inferred from electronic
electronic annotation




annotation /// 19867 // outer





membrane // inferred from





electronic annotation




yheU





rpsL
5622 // intracellular //
49 // tRNA binding // inferred




inferred from electronic
from electronic annotation ///




annotation /// 5840 //
3676 // nucleic acid binding //




ribosome // inferred from
inferred from electronic




electronic annotation ///
annotation /// 3723 // RNA




15935 // small ribosomal
binding // inferred from




subunit // inferred from
electronic annotation /// 3735




electronic annotation ///
// structural constituent of




30529 // ribonucleoprotein
ribosome // inferred fro




complex // inferred from el




yibQ





ycfF





yzgL





yjfY





316#4





yneC





yneG





fabH

3824 // catalytic activity //





inferred from electronic





annotation /// 4315 // 3-





oxoacyl-[acyl-carrier protein]





synthase activity // inferred





from electronic annotation ///





8415 // acyltransferase





activity // inferred from





electronic annotation ///





16740 /



menG





sixA
16020 // membrane //





inferred from electronic





annotation /// 19867 // outer





membrane // inferred from





electronic annotation




yhdM
5622 // intracellular //
3677 // DNA binding //




inferred from electronic
inferred from electronic




annotation
annotation /// 3700 //





transcription factor activity //





inferred from electronic





annotation /// 8270 // zinc ion





binding // inferred from





electronic annotation ///





46872 // metal ion binding //





inferred from



ptsH

5515 // protein binding //





inferred from physical





interaction /// 5351 // sugar





porter activity // inferred from





electronic annotation










As shown in the Venn diagram in FIG. 3D, the immunogenic responses to 417 proteins were found to be different between healthy control and Crohn's Disease or ulcerative colitis. Of these 417 proteins, 169 proteins were identified as highly immunogenic in healthy control, 186 proteins are highly immunogenic in Crohn's Disease and only 19 in ulcerative colitis. 44 proteins were highly immunogenic in both healthy control and inflammatory bowel disease (Crohn's Disease or ulcerative colitis). Among these 44 proteins, six overlap between healthy control and Crohn's Disease and 38 overlap between healthy control and ulcerative colitis. A full list of the immunogenic E. coli proteins in FIG. 3D can be found in Table 5.









TABLE 5







List of differentially immunogenic proteins among


Healthy Control (HC), Crohn's Disease (CD) and


ulcerative colitis (UC), as illustrated in FIG. 3A














PROTEIN SPOT
HC
CD
UC
Legend







rpsK




Highly immunogenic



rpsL








sixA








ycfF








yhdN








yjhA








era








(yeeF)








211#11








23-12A0








279#6








427#1








dgkA








dinI








emrY








focA








folK








fsr








glnD








kch








maoC








msbA








nac








nagE








narI








ppx








prtC








rfaB








secF








secY/prlA








trkG








yafJ








yaiM








ybbC








ycbM








ydaA








ydbD








ydhV








yefI








yeiO








ygjR








yhiN








yjgT








yojI








yphD








(yhcP)








(yhhT)








(yhiW)








16-3B0








214#3








233#6








273#6








280#1








316#4








321#3








323#1








331#2








356#7








406#7








409#5








411#1








420#7








452#13








610#6.1








aceF








allP








ansP








aqpZ








atoE








brnQ








celD








cobU








codB








cybB








cydB








cydC








dgt








dnaQ








ebgA








emrB








emrD








exuR








fabH








fabZ








fadA








fepD








flhD








glnQ








glpF








gppA








greA








hemY








JW0438








JW1949








lipA








lpxC








malX








malZ








menG








mrdB








murG








mutT








narU








nfrB








nrfE








ompC








oppC








oppF








pbuX








pheP








phsE








pnuC








potC








pssR








ptsH








putP








queA








rfaL








rffG








rocE








rplO








sdhD








secB








sfsA








slyX








sucB








sucD








tauB








thiL








trkH








udk








uidB








virK








yaaH








yabK








yadQ








yaeG








yagG








yagM








yaiV








yajR








ybaN








ybdS








ybfB








ybfC








ybgE








ybhA








ybhL








ybhM








ybhN








ybhR








ycaD








yccY








ycdG








yciQ








yciR








yciS








ydcD








yddH








ydeF








ydeZ








ydfO








ydjS








ydjZ








yeaS








yehK








yehY








yejF








yfiY








ygeD








ygfF








yggH








yghK








yghT








ygjQ








yhaH








yhaO








yhbX








yhcO








yhdM








yhdT








yheG








yheU








yhfU








yhhL








yhhS








yhiP








yhiQ








yhjX








yiaL








yiaQ








yibL








yibQ








yicO








yidY








yifE








yigF








yihG








yjeM








yjfF








yjfP








yjfY








yjhB








ymdD








ynaJ








yneC








yneG








ynjC








yoaA








yohG








yphA








yphG








yqcE








yzgL








(gntU)








(phnE)








(rcsC)








(thiS)








(ycfA)








(yfjV)








221#15








267#6








304#1








319#17








336#6








348#4








405#2








411#4








416#1








430#8








445#15








448#2








633#5








aceB








agaZ








aidA








argB








argC








aroF








aroK








aspA








atpB








baiF








basS








cedA








citB








citG








clpX








cysD








cysJ








cysM








cysW








dgxA








dicC








dinD








ecnB








eno








fadB








fba








fdhE








fecB








fecR








fimB








fimC








fliA








fliS








flxA








folP








frvX








ftn








fumB








gabD








galR








gcpE








glvB








grxA








grxC








gst








hemX








hflX








hisS








hofH








hoxK








hsdM








hycA








hycF








hypC








ibpB








infC








ivbL








lasT








LDR-ABC








LDR-D








lldR








mcrD








metB








metJ








mltB








mviN








narY








nuoE








phbA








phnB








phnG








phnM








ppdB








ptpS








ptsN








purM








radC








rbfA








rbsB








recN








rffD








rpiB








rplP








rplQ








rplT








rpmJ








rpsG








rpsR








selD








slyA








slyD








ssi6








sugE








tdcB








thiF








torA








trmD








ubiG








umuD








vacJ








wcaA








ybbA








ybbQ








ycbF








ycdB








yceD








ycgN








ydfA








ydfD








ydfR








ydhL








ydiL








yedF








yehR








yejG








yejO








yfeC








yfhD








yfiC








yfiD








yfjI








yfjQ








ygbA








ygbD








ygbO








ygcQ








ygeW








ygfY








yhaA








yhaC








yhaN








yhcI








yhcJ








yhfR








yhfV








yhgH








yhgI








yicC








yieF








yieJ








yigK








yihA








yihD








yihK








yiiF








yijD








yijF








yjaI








yjbI








yjbL








yjbQ








yjbR








yjcS








yjeB








yjeJ








yjgF








yjhC








yjhE








yjhF








yjhO








yjiA








yliG








ymfE








yohL








yphC








yrbB








(rtn)








cadA








lueO








mesJ








mhpF








modC








murA








nrfF








prpE








purK








tpiA








yciD








yejA








ygcE








ygfQ








yhjC








yjfH








yjiJ








Legend


✓ Highly immunogenic







This demonstrates that ulcerative colitis and healthy control subjects share more common immunogenic profiles than Crohn's Disease and healthy control. In general, these results indicate that much of the global immunogenic profiles of sera samples were systematically correlated with either healthy controls or IBD phenotypes and that sample class can be distinguished based on the sample's immunogenic profile.


Example 3: Protein Functional Enrichment Analysis

To delineate the immunogenic signatures of the healthy controls and IBD subtypes the differentially immunogenic proteins were assigned to functional groups based on classification by Gene Ontology (www.geneontology.org). Functional grouping of the 417 proteins were assigned by querying EcoCyc (http://ecocyc.org/) and KEGG (www.genome.ad.jp/kegg) databases, as well as cross-checked with Affymetrix E. coli Genome Array annotation file (www.affymetrix.com). 338 of these 417 proteins were assigned to at least one gene ontology (GO) term, and 78 hypothetical proteins have unknown annotations. The enrichment analysis was focussed on five GO cellular component terms (membrane, cell wall, intracellular, macromolecular complex, periplasmic space and cell projection). To assess whether the selected differentially immunogenic proteins were enriched in one of the GO terms, the hypergeometric statistical test was used to compute the probability of the number of proteins in each cellular component appearing by chance within the proteins highly immunogenic in healthy control (169), Crohn's Disease (185) and ulcerative colitis (18). FIG. 4 summarizes the enrichment analysis of these proteins that are immunogenic in healthy control and Crohn's Disease or ulcerative colitis. Antibodies against membrane proteins are highly enriched in healthy control samples (p<0.0001). Interestingly, antibodies against intracelluar and macromolecular complex proteins are highly enriched in Crohn's Disease samples (p<0.05), while those against cell wall proteins are highly enriched in ulcerative colitis samples (p<0.05). Although 12% proteins that were found to be highly immunogenic in Crohn's Disease samples were located in periplasmic space, their enrichment was not statistically significant (p=0.064) for this IBD subtype. Proteins located in cell projection term are not enriched in either healthy controls or IBD subtypes.


Example 4: Machine Learning Analysis

Next, optimal classifiers were constructed from the immunogenic response profiles to differentiate healthy control from the IBD subtypes (Crohn's Disease and ulcerative colitis), as well as to differentiate Crohn's Disease from ulcerative colitis. Upon successful construction of these classifiers, the classification rules may result in the discovery of new robust biomarkers. k-TSP, a novel machine learning method was employed to discover simple decision rules classifiers from the immunogenic response profiles. The three top scoring pairs were identified as classifiers to differentiate healthy control samples from Crohn's Disease samples as follows:


1. If a subject sample shows greater immunogenic reactivity to era than to ybaN then the subject is identified as likely having Crohn's Disease, or else as UC.


2. If a subject sample shows greater immunogenic reactivity to yhgN than to focA then the subject is identified as likely having Crohn's Disease or as a healthy control; and


3. If a subject sample shows greater immunogenic reactivity to gabT than ycdG then the subject is identified as likely having Crohn's Disease (see representative examples of actual images of immuno-reactive protein spots in FIG. 2). If all three pairs identify the subject as having Crohn's disease then the sample is classified as a Crohn's Disease sample. FIG. 5A depicts the protein spot ratios for this classifier that separate the data between the two phenotypes where yellow represents a vote for Crohn's Disease and blue represents a vote for healthy control. Using the k-TSP classifier, 36 out of 39 healthy control and 62 out of 64 Crohn's Disease samples are correctly classified, with an estimated ten-fold cross-validation accuracy of 86±4% (p<0.01). For distinguishing healthy control from ulcerative colitis samples, the k-TSP algorithm identifies eleven feature pairs (FIG. 5B) with an estimated ten-fold crossvalidation accuracy of 66±5% (p<0.04). A single feature pair of k-TSP classifier was identified for differentiating Chrohn's disease from ulcerative colitis: If the sample has greater immunogenic reactivity to frvX than to yidX then the subject is identified as having ulcerative colitis as illustrated in FIG. 5C (see representative examples of actual images of immuno-reactive protein spots in FIG. 5D). This classifier has an estimated ten-fold cross-validation accuracy of 80±2% (p<0.1).


The performance of k-TSP was also compared with SVM and kNN, two other commonly used learning algorithms, for each of the classification problems based on five runs of ten-fold crossvalidation. Table 6 displays the results of ten-fold cross-validation for each of the three classifiers.









TABLE 6





Estimated ten-fold cross-validation classification rates of IBD


using the three described classification methods. The reported


rates are given in percentages and are the mean performance


on all five runs of ten-fold cross validation ± the standard


deviation. In parenthesis are the numbers of samples in each


subtype used for classification. Sp = specificity, Sn =


sensitivity, PPV = positive predictive value,


NPV = negative predictive value.







Healthy control (HC) (39) vs CD (66)













Accuracy


PPV
NPV


Method
(%)
Sp (HC) (%)
Sn (CD) (%)
(CD) (%)
(HC) (%)





k-TSP
86 ± 4
81 ± 5
89 ± 3
89
81


SVM
70 ± 2
66 ± 1
73 ± 2
79
59


kNN
63 ± 3
47 ± 7
73 ± 6
70
50










Healthy control (HC) (39) vs UC (29)













Accuracy


PPV
NPV


Method
(%)
Sp (HC) (%)
Sn (UC) (%)
(UC) (%)
(HC) (%)





k-TSP
6.6 ± 5 
69 ± 5
61 ± 7 
59
70


SVM
62 ± 5
58 ± 1
68 ± 12
55
71


kNN
60 ± 6
57 ± 2
64 ± 12
53
68










CD (66) vs UC (29)













Accuracy


PPV
NPV


Method
(%)
Sp (CD) (%)
Sn (UC) (%)
(CD) (%)
(UC) (%)





k-TSP
80 ± 2
84 ± 1
70 ± 6
86
66


SVM
78 ± 3
82 ± 2
69 ± 9
86
63


kNN
78 ± 3
78 ± 4
61 ± 2
82
55










As demonstrated in Table 6, based on cross-validation, k-TSP performance meets or exceeds the performance of kNN and SVM for these classification problems. Because the cross-validation structure allowed each classifier to test the same subsets of data as described in the methods section, the performance of the three classifiers can be directly compared and tested for statistical significance by a simple student's t-test. The healthy control vs. CD k-TSP classifier outperformed the other methods in total classification performance (p<0.001). For the remaining two classification problems, the k-TSP classifiers achieved nominally better, but not statistically significant in classification accuracy when compared to SVM and kNN classifiers. From this study, k-TSP was found to perform much better than SVM and kNN in separating healthy control from Crohn's Disease. In addition, the ordering of the expression values within profiles were utilized in the k-TSP decision rules, therefore, the classifier is invariant to data preprocessing (28). FIGS. 6A and 6B show that on their own, the immunogenic responses to era and ybaN (the top scoring pair in the healthy control vs CD k-TSP classifier) do not allow for class separation of the data; no threshold level would clearly separate healthy control from Crohn's Disease. However, the ratio of the two features (top-scoring pair ratio) results in clear separation in the data lending well to classification (FIG. 6C). Similar results are true when scatter plot analysis was done for the other two TSP pairs from the healthy control vs Crohn's Disease classifier (yhgN vs focA and gabT vs ycdG, respectively). This represents an advantage of k-TSP over other learning methods where interpreting the decision rules are easy and can facilitate follow-up study. It is important to note that SAM identified era as the second best individual marker for up regulation in CD, thus it appears that individual markers will not work well for classification and explains why KNN and SVM fail to match the performance of k-TSP as the relative feature levels within samples appear to be much more robust then the absolute feature levels across samples


Example 5: Robustness of the k-TSP Classifiers

To determine that class imbalance did not greatly affect the classification results, an additional analysis was performed where samples were randomly discarded from a class with greater total number of samples in order to equalize the class sizes. 10-fold cross validation was performed as described. The process was then repeated by discarding a different random set of samples.


Table 7 (below) shows the performance of each classifier given class balance in the training set.









TABLE 7







Permutation statistics for each pair of biomarkers. Top scoring


pairs used for each classifier and the percentage of surrogate


classifiers in which those pairs appear during 10-fold


cross validation (mean ± standard deviation, p-value).










Accuracy (%)













k-TSP
10-fold
10-fold CV
Estimated
Features in
% feature


Classifier
CV
Permutation
p-value
k-TSP classifier
appearance





HC vs. CD
86 ± 3
50 ± 8
p < 0.01
era > ybaN = CD
90






yhgN > focA = CD
84






gabT > ycdG = CD
64


HC vs. UC
66 ± 5
51 ± 9
p < 0.05
relE > cysE/WcaB = UC
80






pyrI > yigK = UC
42






lnt > yniO = UC
36






ftsE > pssR = UC
36






yhgN > yhfG = UC
20






yafN > dsbB = UC
20






yihI > yabK = UC
26






421#15 > yhdN = UC
24






hisP > rplO = UC
16






cml > nuoM = UC
14






yieC > nuoI = UC
12


UC vs. CD
80 ± 2
52 ± 6
p < 0.01
yidX > frvX = UC
88










These results demonstrate that k-TSP outperforms SVM and kNN in most instances whether or not the class size is balanced, further supporting the data presented in Table 5.


Next, to determine the significance of each classifier, a permutation test was performed by randomly shuffling the class labels while maintaining the same number of samples in each class. 10-fold cross validation is carried out to yield a classification rate for the permutation set. 100 permutations were performed in order to get a null distribution of expected classification rates by chance. The classification rate from the un-permuted data is then compared to the null distribution to determine significance. Table 7 shows the permutation test results for all the classification problems. For the k-TSP classifiers trained to differentiate between healthy control and Crohn's Disease samples as well as Crohn's Disease and ulcerative colitis samples, no permuted set achieved classification rates equal or superior to the original data out of 100 permutations. Thus, these classifiers were estimated to be significant at the p<0.01 level. The k-TSP classifier built to differentiate healthy control and ulcerative colitis had 4/100 permutations achieve rates that matched or exceeded the original classifier, thus this classifier is near the typical significance threshold at p<0.05.


Finally, to gauge the robustness of the classification rules discovered by the k-TSP method, the surrogate classifiers created during the ten-fold cross validation procedure were inspected. Every loop of cross validation creates a separate classifier used to predict the left out sample classes, these are called surrogate classifiers. Thus, for each problem of interest that was performed ten fold cross-validation in Table 7, there were 50 classifiers to inspect (10 for each of the 5 runs). The percentage of the time that the rule from the final k-TSP classifier showed up in the 50 surrogate classifiers was an indicator of the robustness of that rule. Table 7 shows that the pairs that show up in the healthy control vs. Crohn's Disease classifier as well as the ulcerative colitis vs. Crohn's Disease classifier are fairly robust while the pairs in the healthy control vs. ulcerative colitis classifier are not. Along with the permutation testing, this indicates that the healthy control vs. Crohn's Disease and ulcerative colitis vs. Crohn's Disease classifier should perform well in independent testing while the healthy control vs. ulcerative colitis classifier may not.


Example 6: Stratifying Crohn's Disease Subtypes and Risk for Surgery

Certain antibody-based serological biomarkers (such as pANCA and ASCA) have shown promise in risk stratifying patients prior to instituting medical therapy or embarking on surgery. As an example, the presence of pANCA has been associated with the development of acute and chronic pouchitis after colectomy with ileal pouchanal anastamosis. Similarly, the presence of high titers of ASCA has been found to predict the occurrence of pouch complications and a more complicated disease course in Crohn's disease. To evaluate whether the new biomarkers identified can be used to stratify Crohn's Disease and ulcerative colitis subtypes or risk for surgery, the Vienna classification was used to subtype patients with Crohn's Disease into the following behavior subtypes (Table 1): penetrating/fistulizing, stricturing, penetrating/structuring and non-penetrating non-stricturing. Patients with ulcerative colitis were divided into those with left sided disease (inflammation extending no further than the splenic flexure). Pancolitis was considered to be continuous inflammation from the rectum extending beyond the splenic flexure. Due to the small sample sizes for each disease type, k-TSP analysis using the newly identified biomarkers was unable to stratify subtypes of Crohn's Disease or ulcerative colitis, or risk for surgery. When larger sample sizes are available, it is expected that at least those biomarkers listed herein will be useful for identifying subjects in need of surgery. In particular, pairs and sets of biomarkers delineated in Tables 2-5, 7, and FIG. 5 are useful alone or in combination with existing biomarkers to identify subjects that could benefit from surgery.


Example 7: OmpC and fliC, Two of the Known Serological Markers, Performed Poorly

Although anti-OmpC and anti-Cbir (fliC) have been recently considered two new IBD serological biomarkers, these markers were not identified in our screening of the E. coli K12 proteome. Scatter plot (FIG. 7) analysis of E. coli ompC and fliC demonstrates that neither allows for class separation between control vs Crohn's Disease vs ulcerative colitis; no threshold level would clearly separate the data.


Protein microarrays have been demonstrated to be a powerful tool to identify biomarkers. The results reported herein provide the first study to identify serological biomarkers in human autoimmune diseases using a protein chip of whole prokaryotic proteome. The significance of this study is three-fold: First, it presents here the first proof of principle for the feasibility of application of high density protein microarray/chip technology in the discovery of novel serological IBD biomarkers. This study can serve as an example of similar proteomic approaches for hunting serological biomarkers for other immune-related diseases, such as autoimmune disorders. Second, this is the first examination of human immune responses to the entire proteome of a microbial species under normal or any disease condition. It is surprising to learn that human circulating antibodies can recognize more than 400 E. coli proteins (FIG. 3D). Since it has been demonstrated that defective intestinal barrier function plays a central role in the pathogenesis of Crohn's Disease, it is conceivable that in patients with Crohn's Disease commensal bacteria or their products could more readily penetrate intestinal epithelia. Therefore, it is less surprising that 185 of the E. coli proteins were recognized by sera from Crohn's Disease patients (FIG. 3D). However, it remains a mystery why there are a large number (185) of immunogenic E coli proteins that are specific in healthy controls while only 18 immunogenic proteins are found to be specific to ulcerative colitis. Third, this study identified a set of novel serological biomarkers that have >80% overall accuracy and sensitivity in differentiating CD from healthy control or ulcerative colitis.


An intriguing observation in this study is the difference in the immunogenicity of surface/membrane vs intracellular proteins in healthy control vs CD patients. Approximately 85% of the highly immunogenic proteins were either cell wall proteins or membrane proteins in healthy control, compared to only ˜37% of the top immunogenic proteins in Crohn's Disease patients (FIG. 4, FIG. 5 and Tables 2-5, and 7). Conversely, ˜30% of top immunogenic proteins in Crohn's Disease patients are intracellular proteins compared to only ˜7% in healthy control (FIG. 4, FIG. 5 and Tables 2-5, and 7). Furthermore, there is no overlap among the top immunogenic E. coli surface/membrane proteins among the three distinct populations (healthy control, Crohn's Disease and ulcerative colitis, see FIG. 3D). This suggests that the host immunological response to E. coli is drastically different between healthy control and CD patients. The mechanism of having these immunogenic differences is not clear at this moment. It is likely that in immunologically healthy hosts where E. coli are largely confined to the luminal side of the gut due to intestinal epithelial barrier, surface and membrane proteins of E. coli might be theprimary antigens that are more accessible to the immune system, compared to intracellular proteins. In this case, immune system has adapted to the presence of luminal E. coli. In contrast, in Crohn's Disease patients, a disrupted or compromised intestinal barrier may lead to the bacterium or its products crossing the gut luminal barrier. If the whole E. coli invades into the lamina propria, it will mostly likely be lysed by host immune system. Subsequently, E. coli components such as intracellular proteins that wpi;d otherwise not be seen by the intestinal immune system in the lamina propria are presented by antigen-presenting cells (such as macrophages or dendritic cells).


This may dramatically alter the previously adapted immune system that is only used to the luminally exposed E. coli, resulting in an overwhelming production of antibodies against these intracellular E. coli proteins. The consequences of these immune responses include recruitment of various inflammatory immune cells such as neutrophils, dendritic cells, and lymphocytes to lamina propria or between colonic epithelial cells, leading to dysregulated mucosal inflammation. This may also explain why there are only 6 overlapping proteins among 354 top immunogenic proteins recognized by healthy control and Crohn's Disease patients (FIG. 3D).


None of the serum antibody biomarkers that are identified here for discriminating Crohn's Disease from healthy control or ulcerative colitis have been previously described. Although most of the antigens (E. coli proteins) responsible for generation of these marker antibodies have not been well characterized, their identity and function can be predicted based on their sequence information. Among the proteins in the k-TSP classifier—era, ybaN, yhgN, focA, gabT and ycdG (FIG. 5A)—for discriminating CD from healthy control, era is a GTP binding protein that involves in the binding of GTP and nucleotide of cell cycle and can be found in intracellular membrane. In this study, an increased immunogenic response to era is associated with Crohn's Disease, identified by both SAM and k-TSP analyses. YbaN is predicted as a conserved inner membrane protein with unknown function. YhgN is predicted as an inner six transmembrane domains protein where the C-terminus is located in the periplasm (36). YcdG (also called rutG) is another predicted transmembrane with eleven helices; the C-terminus of the protein is located on the cytoplasmic side of the inner membrane (36). This protein is predicted to be involved in the pyrimidine utilization in E. coli where it may function as a proton-driven uracil uptake system (37). FocA, an inner membrane protein, is a putative formate transporter that may involve in both formate uptake and efflux. Disruption of the focA gene confers resistance to hypophosphite, a toxic formate analogue (38). GabT, 4-aminobutyrate aminotransferase, is a well characterized protein and acts as the initial enzyme of the 4-aminobutyrate (GABA) degradation pathway in E. coli (39). Among the pair of proteins (frvX and yidX) that were identified to be discriminatory between CD and ulcerative colitis, frvX is a important protein in fructose-specific PEP-dependent sugar phosphotransferase system (40); and yidX is a predicted lipoprotein, the function of which is currently unknown.


Like all previously identified serological (antibody) biomarkers, including p-ANCA, ASCA, anti-OmpC, and anti-I2 and anti-Cbir, the pathological or functional consequences of having these newly identified circulating antibodies is unclear.


The newly identified biomarkers by k-TSP analysis have a particular impressive ˜86% accuracy in differentiating CD from healthy control, with a specificity of ˜81% and a sensitivity of ˜89% (Table 6). In addition, k-TSP analysis yields an accuracy of ˜80% in differentiating CD and ulcerative colitis, with a sensitivity of ˜84% and specificity of ˜70% (Table 6). These demonstrate that the sensitivity and specificity of these novel serological markers are comparable to those of combination of the multiple best-characterized IBD biomarkers (ASCA, pANCA, anti-OmpC, and anti-Cbir) (41, 42). More importantly, an identical performance can be achieved by using only the top 3 pairs of E. coli proteins for discriminating healthy controls vs CD, and one top pair of proteins for differentiating CD vs ulcerative colitis (FIGS. 2 &5 and Tables 2 &3).


These data provide a critical feasibility for 1) validation study using additional larger cohorts of IBD patients and controls and 2) future development of novel assay kits for diagnosis of CD and ulcerative colitis. However, it is necessary to point out that our current approach screening E. coli protein array is not suitable for identifying serological biomarkers in differentiating ulcerative colitis from healthy control (only ˜66% accuracy) (Tables 5 and 6). Importantly, OmpC, an E. coli antigen for one of the widely studied current serological biomarker (anti-OmpC), was not picked up in our screen (FIG. 7A). Similarly, fliC, an E. coli flagellin protein equivalent the Salmonella flagellin (which is the antigen for anti-Cbir, another widely studied anti-bacterial antibody) did not show up in our analysis (FIG. 7A). These data would suggest that anti-OmpC and at least the antibody against E. coli fliC are not robust serological biomarkers for IBD. In conclusion, we have presented here the first demonstration that using protein array to screen circulating disease-specific antibodies is a robust, effective and high throughput approach for discovery of novel biomarkers of IBD. This approach can be readily applied to screen serological biomarkers of various autoimmune diseases and/or even infectious diseases.


The results reported above were obtained using the following methods and materials.


Patients and Serum Acquisition.


Serum was obtained from 134 subjects in accordance with the policy of the Johns Hopkins Hospital Institutional Review Board. Sixty six patients had the diagnosis of Crohn's disease (CD), 29 patients were diagnosed with ulcerative colitis (UC), and 39 subjects were non-IBD healthy controls (HC). The healthy controls and IBD patients were similar in age and sex distribution. The demographic and clinical characteristics of the patients are summarized in Table 1. Clinical information was abstracted from the written and electronic medical records. The diagnosis of CD and ulcerative colitis was established by standard clinical, radiographic, endoscopic and histological criteria. Patients were classified as having CD based on the typical findings of skip lesions, deep linear or serpiginous ulcerations, cobblestoning, multiple noncaseating granulomas, transmural inflammation, small bowel involvement, structuring disease or presence of fistulilizing disease. The diagnosis of ulcerative colitis was considered if the colonic inflammation involved the rectum with or without proximal extension. The inflammation had to be continuous and be limited to the mucosa. There were no patients with proctitis enrolled in this study. The healthy controls consisted of individual undergone colon cancer screening or other non-IBD GI diseases or any other immune diseases. The serum samples were obtained at the time of initial outpatient encounter, at the time of an endoscopy or during hospitalization. The blood was collected into a serum separator tube (Red top tube, BD Vacutainer) and spun down within 60 minutes of collection. Serum was removed, aliquoted, and stored in multiple at −80° C. until assayed.


Fabrication of E. coli Proteome Chips.


To facilitate the analysis of protein function in the bacterial proteomes, we have constructed a protein chip that essentially covers the entire proteome of the E. coli K 12 strain (Chen (2008) Nat. Methods 5, 69-74). Briefly, 4,256 E. coli proteins were first purified using an ORF collection kindly provided by Dr. Mori and colleagues (26). E. coli cells first were grown overnight at 37 C in 2×LB media containing 30 μg/ml chloramphenicol in a 96-well format and allowed to grow for overnight. The overnight cultures were diluted to a final OD600 of ˜0.1. After the cells were grown for ˜3 hrs at 37 C, and protein expression were induced with 1 mM isopropyl β-D-thiogalactoside (IPTG) for ˜3.5 hrs. The liquid cultures were then harvested by centrifuge of 3500 rpm for 5 min at 4° C. The pellets were stored at −80° C. for future protein purification.


To purify the fusion proteins, the frozen cell pellets were re-suspended in phosphate lysis buffer, containing 300 mM NaCl, 20 mM imidazole, CelLytic B, Lysozyme (1 mg/mL), Benzonase (50 units/ml), proteinase inhibitor cocktail, and PMSF (1 mM). Along with Ni-NTA beads, the mixtures were incubated for 1.5 h at 4° C. After mixing, the resin-protein complexes were washed 3 times with Wash buffer I (50 mM NaH2PO4 with 300 mM NaCl, 10% glycerol, 20 mM imidazole, 0.01% Triton X-100, at pH 8) and 3 times with Wash buffer II (50 mM NaH2PO4 with 150 mM NaCl, 25% glycerol, 20 mM imidazole, 0.01% Triton X-100, at pH 8). Finally, the fusion protein was eluted with elution buffer (50 mM NaH2PO4/150 mM NaCl/25% glycerol/250 mM imidazole/0.01% Triton X-100, pH 7.5). All purified proteins were printed in duplicate onto FullMoon slides using a ChipWriter Pro (Bio-Rad) in a humidity-controlled chamber in a cold room (25).


Screen of E. coli Proteome Chip for Anti-E. coli Antibodies.


The entire screening process, except for the washing steps as specified, was done at room temperature. E. coli protein chips stored at −80 C were thawed at room temperature (22 C) and blocked in Superblock Blocking Buffer (Pierce) for one hour. The patient's serum was diluted (1:1000) with blocking buffer in a total volume of 3 ml. The diluted serum was then applied to the chip entirely covering the surface. After 1 hour incubation with gentle shaking on a rocker, the chip was rinsed once with 4 ml of Tris-buffered saline (TBS) with 0.05% Tween 20 (TBS-T). The chip was then soaked in 4 ml TBS-T, placed in a water bath and washed for 10 min at 50 C with gentle horizontal agitation. This washing step was repeated twice. The chip was then cooled to room temperature. After removal of TBS-T, the chip was incubated for 1 h with the secondary antibody, a Cy3-labeled donkey anti-human IgA, G, and M (Jackson ImmunoLab) diluted at 1:400 in 3 mL Superblock Blocking Buffer. The chip was then washed at 50 C in the same fashion as previously stated. After the final wash, the chip was rinsed in sterile water briefly, and quickly spun at 2000 rpm until dry prior to scanning. The chips were scanned with a GenePix array scanner (GenePix Pro 6.0 or GenePix 4200AL, Molecular Devices, PA) at wavelength of 536 nm. To achieve the best signal-to-noise ratio, many washing conditions with different stringencies had been tested, including increase of salt (0.5 or 1 M NaCl), addition of SDS (0.05 or 0.1%), change of washing temperature (22, 37, 40, or 50 C), and/or various combination of conditions described above. The washing condition described here gave best results among all conditions tested.


Protein Array Data Preprocessing.


Each quantified sample array image was exported from Genepix (Molecular Devices, CA) as a text file for preprocessing. The goal of preprocessing is to yield a feature of interest from each protein spot in the array that minimizes technical variability and maximizes the signal of interest. The ratio of the mean signal over the mean background signal for each protein spot was determined to be the best method of preprocessing. This method has the advantage that all features are normalized to their background signals. Thus, if a protein spot signal is artificially high due to an artifact on the slide the ratio will account for it. Furthermore this preprocessing method also normalizes the features across all arrays, as the ratio is a standardized metric. The ratio represents the fold change of the signal above background and can be interpreted as the degree of host serum reactivity to each spotted protein.


Univariate Significance Testing.


Significance Analysis for Microarrays (SAM) (27) was used to determine proteins to which healthy control, CD, and ulcerative colitis groups of samples show a statistically significant immunogenic response. We used stringent criteria in the SAM analysis and only called a protein as significant with at least 1.5 fold change differences between two phenotypes at 0% False Discovery Rate in 500 permutations.


Supervised Learning Algorithms.


To construct the classifier in this study, we employed three supervised learning methods. The algorithms implemented were k-Nearest Neighbors (kNN) (27), Support Vector Machines (SVM), and the k-Top Scoring Pairs Algorithm (k-TSP) (28). The k-TSP was implemented using a publicly available executable program developed at the Institute for Computational Medicine of Johns Hopkins University (Tan (2005) Bioinformatics. 21, 3896-3904). SVM and kNN were implemented using the R statistical programming language, packages: e1071 and class for SVM and kNN, respectively.


Feature Selection.


For kNN and SVM learning methods, SAM was applied to the training set for feature selection before the classifiers were trained on that data. The features selected in SAM were those that were found to be significant with a false discovery rate of zero. The k-TSP algorithm does not require feature reduction as it intrinsically selects the top scoring features. Parameters such as the number of nearest neighbors for kNN and the number of top scoring pairs for k-TSP were selected based on leave one out cross-validation performance on the training set. A script was written in Matlab to perform the cross-validation scheme and call executables for the learning algorithms.


Statistical Analyses.


We used the open source statistical software R to perform the statistical analyses in this study. P-value<0.05 was regarded as significant.


Other Embodiments

From the foregoing description, it will be apparent that variations and modifications may be made to the invention described herein to adopt it to various usages and conditions. Such embodiments are also within the scope of the following claims.


The recitation of a listing of elements in any definition of a variable herein includes definitions of that variable as any single element or combination (or subcombination) of listed elements. The recitation of an embodiment herein includes that embodiment as any single embodiment or in combination with any other embodiments or portions thereof.


All patents and publications mentioned in this specification are herein incorporated by reference to the same extent as if each independent patent and publication was specifically and individually indicated to be incorporated by reference.

Claims
  • 1. A method for monitoring the condition of a subject having Crohn's disease, the method comprising: (a) contacting a microarray comprising a set of E. coli polypeptides comprising era, ybaN, yhgN, focA, gabT and ycdG, with sera derived from the subject; and(b) detecting in the subject's sera: (i) immunogenic reactivity to era relative to immunogenic reactivity to ybaN;(ii) immunogenic reactivity to yhgN relative to immunogenic reactivity to focA; and(iii) immunogenic reactivity to gabT relative to immunogenic reactivity to ycdG,wherein a reduction in the immunogenic reactivity of (i), (ii) and/or (iii) compared to the immunogenic reactivity of (i), (ii) and/or (iii) from an earlier time point identifies an improvement in the subject's condition, and an increase in the immunogenic reactivity of (i), (ii) and/or (iii) compared to the immunogenic reactivity of (i), (ii) and/or (iii) from the earlier time point identifies a worsening in the subject's condition, andwherein the microarray further comprises a set of E. coli polypeptides selected from the group consisting of rpsK, rpsL, sixA, ycfF, yhdN, yjhA, gntU, phnE, rcsC, thiS, ycfA, yfjV, aceB, agaZ, aidA, argB, argC, aroF, aroK, aspA, atpB, baiF, basS, cedA, citB, citG, clpX, cysD, cysJ, cysM, cysW, dgxA, dicC, dinD, ecnB, eno, fadB, fba, fdhE, fecB, fecR, fimB, fimC, fliA, fliS, flxA, folP, frvX, ftn, fumB, gabD, galR, gcpE, glvB, grxA, grxC, gst, hemX, hflX, hisS, hofH, hoxK, hsdM, hycA, hycF, hypC, ibpB, infC, ivbL, lasT, LDR-ABC, LDR-D, lldR, mcrD, metB, metJ, mltB, mviN, narY, nuoE, phbA, phnB, phnG, phnM, ppdB, ptpS, ptsN, purM, radC, rbfA, rbsB, recN, rffD, rpiB, rplP, rplQ, rplT, rpmJ, rpsG, rpsR, selD, slyA, slyD, ssi6, sugE, tdcB, thiF, torA, trmD, ubiG, umuD, vacJ, wcaA, ybbA, ybbQ, ycbF, ycdB, yceD, ycgN, ydfA, ydfD, ydfR, ydhL, ydiL, yedF, yehR, yejG, yejO, yfeC, yfhD, yfiC, yfiD, yfjI, yfjQ, ygbA, ygbD, ygbO, ygcQ, ygeW, ygfY, yhaA, yhaC, yhaN, yhcI, yhcJ, yhfR, yhfV, yhgH, yhgI, yicC, yieF, yieJ, yigK, yihA, yihD, yihK, yiiF, yijD, yijF, yjaI, yjbI, yjbL, yjbQ, yjbR, yjcS, yjeB, yjeJ, yjgF, yjhC, yjhE, yjhF, yjhO, yjiA, yliG, ymfE, yohL, yphC, yrbB, yhcP, yhhT, yhiW, aceF, allP, ansP, aqpZ, atoE, brnQ, celD, cobU, codB, cybB, cydB, cydC, dgt, dnaQ, ebgA, emrB, emrD, exuR, fabH, fabZ, fadA, fepD, flhD, glnQ, glpF, gppA, greA, hemY, JW0438, JW1949, lipA, lpxC, malX, malZ, menG, mrdB, murG, mutT, narU, nfrB, nrfE, ompC, oppC, oppF, pbuX, pheP, phsE, pnuC, potC, pssR, ptsH, putP, queA, rfaL, rffG, rocE, rplO, sdhD, secB, sfsA, slyX, sucB, sucD, tauB, thiL, trkH, udk, uidB, virK, yaaH, yabK, yadQ, yaeG, yagG, yagM, yaiV, yajR, ybdS, ybfB, ybfC, ybgE, ybhA, ybhL, ybhM, ybhN, ybhR, ycaD, yccY, yciQ, yciR, yciS, ydcD, yddH, ydeF, ydeZ, ydfO, ydjS, ydjZ, yeaS, yehK, yehY, yejF, yfjY, ygeD, ygfF, yggH, yghK, yghT, ygj Q, yhaH, yhaO, yhbX, yhcO, yhdM, yhdT, yheG, yheU, yhfU, yhhL, yhhS, yhiP, yhiQ, yhjX, yiaL, yiaQ, yibL, yibQ, yicO, yidY, yifE, yigF, yihG, yjeM, yjfF, yjfP, yjfY, yjhB, ymdD, ynaJ, yneC, yneG, ynjC, yoaA, yohG, yphA, yphG, yqcE, and yzgL, or fragments thereof.
  • 2. A method for monitoring the condition of a subject having Crohn's disease, the method comprising: (a) contacting a microarray comprising a set of E. coli polypeptides comprising era, ybaN, yhgN, focA, gabT and ycdG, with sera derived from the subject; and(b) detecting in the subject's sera: (i) immunogenic reactivity to era relative to immunogenic reactivity to ybaN;(ii) immunogenic reactivity to yhgN relative to immunogenic reactivity to focA; and(iii) immunogenic reactivity to gabT relative to immunogenic reactivity to ycdG,
  • 3. A method for monitoring the condition of a subject having Crohn's disease, the method comprising: (a) contacting a microarray comprising a set of E. coli polypeptides comprising era, ybaN, yhgN, focA, gabT and ycdG, with sera derived from the subject; and(b) detecting in the subject's sera: (i) immunogenic reactivity to era relative to immunogenic reactivity to ybaN;(ii) immunogenic reactivity to yhgN relative to immunogenic reactivity to focA; and(iii) immunogenic reactivity to gabT relative to immunogenic reactivity to ycdG,
CROSS-REFERENCE TO RELATED APPLICATION

This application is a continuation of U.S. application Ser. No. 13/140,702, filed Jun. 17, 2011, which is a 35 U.S.C. § 371 U.S. national entry of International Application PCT/US2009/006647 (WO 2010/077323) having an International filing date of Dec. 17, 2009 which claims the benefit of the following U.S. Provisional Application No. 61/138,309, filed Dec. 17, 2008, the entire contents of which are incorporated herein by reference.

STATEMENT OF RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH

This invention was made with government support under grant number DK077064 awarded by the National Institutes of Health. The government has certain rights in this invention.

US Referenced Citations (2)
Number Name Date Kind
20100004213 Abbas et al. Jan 2010 A1
20110111458 Masuda et al. May 2011 A1
Non-Patent Literature Citations (8)
Entry
Bacarese-Hamilton et al. (2002) “Protein Microarrays: From Serodiagnosis to Whole Proteome Scale Analysis of the Immune Response Against Pathogenic Microorganisms” Biotechniques 33:S24-S29.
Morris (1990) “Book Review: ELISA in Clinical Microbiology Laboratory” Journal of Infection 21(2):230.
Ferrante et al (2007) “New serological markers in inflammatory bowel disease are associated with complicated disease behaviour” Gut 56:1394-1403.
Velayos, et al., (2007) “Positioning biologic therapy for Crohn's disease and ulcerative colitis”, Current Gastroenterology Reports 9(6): 521-527.
Chien-Sheng Chen, et al., Nature Methods, Jan. 2008, vol. 5(1), pp. 69-74.
Chuanmin Uran et al., Anal. Chem., 2002, vol. 74, pp. 4814-4820.
David L. Stokes et al., Fresenius J Anal chem. 2001, vol. 369, pp. 295-301.
D. Huw Davies, et al., PNAS, 2005, vol. 102(3), pp. 547-552.
Related Publications (1)
Number Date Country
20160370381 A1 Dec 2016 US
Provisional Applications (1)
Number Date Country
61138309 Dec 2008 US
Continuations (1)
Number Date Country
Parent 13140702 US
Child 15160204 US