Claims
- 1. A computer-based method for providing problem determination and error recovery features to a computing environment, the method comprising:
receiving information regarding a status of the computing environment; identifying at least one applicable rule from a knowledge base of rules, wherein the at least one applicable rule is applicable to the status of the computing environment; and applying the at least one applicable rule to obtain a result, wherein the knowledge base of rules includes one of a logging logic rule specifying that particular events should be logged by system components under particular circumstances, a problem determination logic rule specifying that a presence of particular information contained within event logs indicates a particular problem, and an error recovery logic rule specifying that a particular problem implies a particular solution to the particular problem should be followed.
- 2. The method of claim 1, wherein knowledge base of rules includes at least one logging logic rule and the result is a logging policy specifying how particular events should be logged by system components.
- 3. The method of claim 2, wherein the logging policy includes an identification of the system components to perform logging of the particular events.
- 4. The method of claim 2, wherein the logging policy includes specified conditions under which the particular events are to be logged.
- 5. The method of claim 2, wherein the logging policy includes a specified level of detail at which the particular events are to be logged.
- 6. The method of claim 1, wherein the knowledge base of rules includes at least one problem determination logic rule and the result is a problem diagnosis.
- 7. The method of claim 6, wherein the at least one problem determination logic rule correlates particular information contained within event logs to problems using a statistical test.
- 8. The method of claim 6, wherein the at least one problem determination logic rule correlates particular information contained within event logs to problems using a machine learning algorithm.
- 9. The method of claim 1, wherein the knowledge base of rules includes at least one error recovery logic rule and the result is a course of action to follow in resolving a problem.
- 10. The method of claim 9, further comprising:
following the course of action to resolve the problem; and in response to following the course of action, determining a degree of success of the course of action.
- 11. The method of claim 10, further comprising:
prioritizing the rules in the knowledge base in response to the degree of success of the course of action.
- 12. The method of claim 1, further comprising:
applying a machine learning algorithm to add additional rules to the knowledge base.
- 13. The method of claim 12, wherein the machine learning algorithm is an inductive logic programming algorithm.
- 14. The method of claim 1, further comprising:
determining a degree of relevance of a rule in the knowledge base; and in response to a determination that the rule has a low degree of relevance, removing the rule.
- 15. The method of claim 1, further comprising:
determining a degree of relevance of a rule in the knowledge base; and in response to a determination that the rule has a low degree of relevance, assigning the rule a low priority in the knowledge base.
- 16. The method of claim 1, further comprising:
applying a machine learning algorithm to modify rules within the knowledge base.
- 17. The method of claim 16, wherein the machine learning algorithm is a supervised learning algorithm.
- 18. The method of claim 17, wherein the supervised learning algorithm is one of a neural network, a Bayesian network, and Support Vector Machines.
- 19. The method of claim 16, wherein the machine learning algorithm is an unsupervised learning algorithm.
- 20. The method of claim 19, wherein the unsupervised learning algorithm is one of k-means clustering, hierarchical clustering, and principal component analysis.
- 21. The method of claim 1, wherein receiving the information regarding the status of the computing environment, identifying the at least one applicable rule, and applying the at least one applicable rule are performed in response to a request from a client.
- 22. A computer program product in a computer-readable medium comprising functional descriptive material that, when executed by a computer, enables the computer to perform acts including:
receiving information regarding a status of the computing environment; identifying at least one applicable rule from a knowledge base of rules, wherein the at least one applicable rule is applicable to the status of the computing environment; and applying the at least one applicable rule to obtain a result, wherein the knowledge base of rules includes one of a logging logic rule specifying that particular events should be logged by system components under particular circumstances, a problem determination logic rule specifying that a presence of particular information contained within event logs indicates a particular problem, and an error recovery logic rule specifying that a particular problem implies a particular solution to the particular problem should be followed.
- 23. The computer program product of claim 22, wherein knowledge base of rules includes at least one logging logic rule and the result is a logging policy specifying how particular events should be logged by system components.
- 24. The computer program product of claim 23, wherein the logging policy includes an identification of the system components to perform logging of the particular events.
- 25. The computer program product of claim 23, wherein the logging policy includes specified conditions under which the particular events are to be logged.
- 26. The computer program product of claim 23, wherein the logging policy includes a specified level of detail at which the particular events are to be logged.
- 27. The computer program product of claim 22, wherein the knowledge base of rules includes at least one problem determination logic rule and the result is a problem diagnosis.
- 28. The computer program product of claim 27, wherein the at least one problem determination logic rule correlates particular information contained within event logs to problems using a statistical test.
- 29. The computer program product of claim 27, wherein the at least one problem determination logic rule correlates particular information contained within event logs to problems using a machine learning algorithm.
- 30. The computer program product of claim 22, wherein the knowledge base of rules includes at least one error recovery logic rule and the result is a course of action to follow in resolving a problem.
- 31. The computer program product of claim 30, comprising additional functional descriptive material that, when executed by the computer, enables the computer to perform additional acts including:
following the course of action to resolve the problem; and in response to following the course of action, determining a degree of success of the course of action.
- 32. The computer program product of claim 31, comprising additional functional descriptive material that, when executed by the computer, enables the computer to perform additional acts including:
prioritizing the rules in the knowledge base in response to the degree of success of the course of action.
- 33. The computer program product of claim 22, comprising additional functional descriptive material that, when executed by the computer, enables the computer to perform additional acts including:
applying a machine learning algorithm to add additional rules to the knowledge base.
- 34. The computer program product of claim 33, wherein the machine learning algorithm is an inductive logic programming algorithm.
- 35. The computer program product of claim 22, comprising additional functional descriptive material that, when executed by the computer, enables the computer to perform additional acts including:
determining a degree of relevance of a rule in the knowledge base; and in response to a determination that the rule has a low degree of relevance, removing the rule.
- 36. The computer program product of claim 22, comprising additional functional descriptive material that, when executed by the computer, enables the computer to perform additional acts including:
determining a degree of relevance of a rule in the knowledge base; and in response to a determination that the rule has a low degree of relevance, assigning the rule a low priority in the knowledge base.
- 37. The computer program product of claim 22, comprising additional functional descriptive material that, when executed by the computer, enables the computer to perform additional acts including:
applying a machine learning algorithm to modify rules within the knowledge base.
- 38. The computer program product of claim 37, wherein the machine learning algorithm is a supervised learning algorithm.
- 39. The computer program product of claim 38, wherein the supervised learning algorithm is one of a neural network, a Bayesian network, and Support Vector Cachines.
- 40. The computer program product of claim 37, wherein the machine learning algorithm is an unsupervised learning algorithm.
- 41. The computer program product of claim 40, wherein the unsupervised learning algorithm is one of k-means clustering, hierarchical clustering, and principal component analysis.
- 42. The computer program product of claim 22, wherein receiving the information regarding the status of the computing environment, identifying the at least one applicable rule, and applying the at least one applicable rule are performed in response to a request from a client.
- 43. A data processing system comprising:
means for receiving information regarding a status of the computing environment; means for identifying at least one applicable rule from a knowledge base of rules, wherein the at least one applicable rule is applicable to the status of the computing environment; and means for applying the at least one applicable rule to obtain a result, wherein the knowledge base of rules includes one of a logging logic rule specifying that particular events should be logged by system components under particular circumstances, a problem determination logic rule specifying that a presence of particular information contained within event logs indicates a particular problem, and an error recovery logic rule specifying that a particular problem implies a particular solution to the particular problem should be followed.
- 44. The data processing system of claim 43, wherein knowledge base of rules includes at least one logging logic rule and the result is a logging policy specifying how particular events should be logged by system components.
- 45. The data processing system of claim 44, wherein the logging policy includes an identification of the system components to perform logging of the particular events.
- 46. The data processing system of claim 44, wherein the logging policy includes specified conditions under which the particular events are to be logged.
- 47. The data processing system of claim 44, wherein the logging policy includes a specified level of detail at which the particular events are to be logged.
- 48. The data processing system of claim 43, wherein the knowledge base of rules includes at least one problem determination logic rule and the result is a problem diagnosis.
- 49. The data processing system of claim 48, wherein the at least one problem determination logic rule correlates particular information contained within event logs to problems using a statistical test.
- 50. The data processing system of claim 48, wherein the at least one problem determination logic rule correlates particular information contained within event logs to problems using a machine learning algorithm.
- 51. The data processing system of claim 43, wherein the knowledge base of rules includes at least one error recovery logic rule and the result is a course of action to follow in resolving a problem.
- 52. The data processing system of claim 51, further comprising:
means for following the course of action to resolve the problem; and means, responsive to following the course of action, for determining a degree of success of the course of action.
- 53. The data processing system of claim 52, further comprising:
means for prioritizing the rules in the knowledge base in response to the degree of success of the course of action.
- 54. The data processing system of claim 43, further comprising:
means for applying a machine learning algorithm to add additional rules to the knowledge base.
- 55. The data processing system of claim 54, wherein the machine learning algorithm is an inductive logic programming algorithm.
- 56. The data processing system of claim 43, further comprising:
means for determining a degree of relevance of a rule in the knowledge base; and means, responsive to a determination that the rule has a low degree of relevance, for removing the rule.
- 57. The data processing system of claim 43, further comprising:
means for determining a degree of relevance of a rule in the knowledge base; and means, responsive to a determination that the rule has a low degree of relevance, for assigning the rule a low priority in the knowledge base.
- 58. The data processing system of claim 43, further comprising:
means for applying a machine learning algorithm to modify rules within the knowledge base.
- 59. The data processing system of claim 58, wherein the machine learning algorithm is a supervised learning algorithm.
- 60. The data processing system of claim 59, wherein the supervised learning algorithm is one of a neural network, a Bayesian network, and Support Vector Machines.
- 61. The data processing system of claim 58, wherein the machine learning algorithm is an unsupervised learning algorithm.
- 62. The data processing system of claim 61, wherein the unsupervised learning algorithm is one of k-means clustering, hierarchical clustering, and principal component analysis.
- 63. The data processing system of claim 43, wherein receiving the information regarding the status of the computing environment, identifying the at least one applicable rule, and applying the at least one applicable rule are performed in response to a request from a client.
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] The present invention is related to the following applications entitled: “Method and Apparatus for Publishing and Monitoring Entities Providing Services in a Distributed Data Processing System”, Ser. No. ______, attorney docket no. YOR920020173US1; “Method and Apparatus for Automatic Updating and Testing of Software”, Ser. No. ______, attorney docket no. YOR920020174US1; “Composition Service for Autonomic Computing”, Ser. No. ______, attorney docket no. YOR920020176US1; and “Self-Configuring Computing System”, Ser. NO. ______, attorney docket no. YOR920020181US1; all filed even date hereof, assigned to the same assignee, and incorporated herein by reference.