Claims
- 1. A method comprising:
comparing intensity of each of a plurality of pixels in a source video frame with intensity of one or more corresponding pixels in a reference video frame to calculate one or more intensity differences for each of the plurality of pixels; and determining whether said each of the plurality of pixels in the source video frame is a foreground pixel based on the one or more intensity differences and a plurality of threshold sets.
- 2. The method of claim 1 wherein the reference video frame is a background mosaic transformed based on background motion information.
- 3. The method of claim 1 further comprising:
for each of the plurality of pixels in the source frame, defining a window in the reference frame that is centered around a pixel corresponding to said each of the plurality of pixels in the source frame, the window being of a predefined size and containing the one or more corresponding pixels participating in intensity comparison with said each of the plurality of pixels in the source frame.
- 4. The method of claim 1 wherein the plurality of threshold sets comprises a first set of thresholds allowing a larger intensity difference, a third set of thresholds allowing a smaller intensity difference, and a second set of thresholds allowing an intensity difference in between the larger intensity difference and the smaller intensity difference.
- 5. The method of claim 4 wherein determining whether said each of the plurality of pixels in the source video frame is a foreground pixel comprises:
generating a first thresholding result using the first set of thresholds; generating a second thresholding result using the second set of thresholds; generating a third thresholding result using the second thresholding result and the third set of thresholds; and combining the third thresholding result with the first thresholding result to provide foreground/background segmentation of the source video frame.
- 6. The method of claim 5 wherein generating the first thresholding result comprises:
classifying each of the plurality of pixels in the source video frame as any one of a background pixel and a foreground pixels by comparing the one or more intensity differences associated with said each of the plurality of pixels with the first set of thresholds; forming a first binary image using the foreground/background classification; and identifying one or more foreground candidate regions in the first binary image, the one or more foreground candidate regions representing the first thresholding result.
- 7. The method of claim 6 wherein classifying each of the plurality of pixels in the source video frame as any one of a background pixel and a foreground pixels includes:
classifying each of the plurality of pixels in the source video frame as a background pixel if at least one of the one or more intensity differences is below the first set of thresholds; and classifying each of the plurality of pixels in the source video frame as a foreground pixel if none of the one or more intensity differences is below the first set of thresholds.
- 8. The method of claim 6 further comprising:
prior to identifying one or more foreground candidate regions in the first binary image, subjecting the first binary image to morphological filtering to effect cleanup of noise.
- 9. The method of claim 6 wherein generating the second thresholding result comprises:
classifying each of the plurality of pixels in the source video frame as any one of a background pixel and a foreground pixel by comparing the one or more intensity differences associated with said each of the plurality of pixels with the second set of thresholds; forming a second binary image using the foreground/background classification with the second set of thresholds; and subjecting the second binary image to morphological dilation to define one or more neighborhood areas for one or more foreground regions in the source video frame, the one or more neighborhood areas representing the second thresholding result.
- 10. The method of claim 9 further comprising:
prior to subjecting the second binary image to morphological dilation, subjecting the second binary image to morphological filtering to effect cleanup of noise.
- 11. The method of claim 9 wherein generating the third thresholding result comprises:
classifying pixels in the one or more neighborhood areas as foreground/background pixels by comparing intensity differences associated with the pixels in the one or more neighborhood areas with the third set of thresholds; and forming a third binary image using the foreground/background classification with the third set of thresholds, foreground pixels in the third binary image representing the third thresholding result.
- 12. The method of claim 11 wherein combining the third thresholding result with the first thresholding result comprises:
determining which pixels in the third thresholding result can be connected to the one or more foreground candidate regions in the first thresholding result to provide final foreground/background classification of pixels in the source video image; forming a final binary image using the final foreground/background classification; and performing morphological smoothing of the final binary image to provide clean foreground/background segmentation of the source video image.
- 13. A computer readable medium that provides instructions, which when executed on a processor cause the processor to perform a method comprising:
comparing intensity of each of a plurality of pixels in a source video frame with intensity of one or more corresponding pixels in a reference video frame to calculate one or more intensity differences for each of the plurality of pixels; and determining whether said each of the plurality of pixels in the source video frame is a foreground pixel based on the one or more intensity differences and a plurality of threshold sets.
- 14. The computer readable medium of claim 13 wherein the reference video frame is a background mosaic transformed based on background motion information.
- 15. The computer readable medium of claim 13 wherein the plurality of threshold sets comprises a first set of thresholds allowing a larger intensity difference, a third set of thresholds allowing a smaller intensity difference, and a second set of thresholds allowing an intensity difference in between the larger intensity difference and the smaller intensity difference.
- 16. A computerized system comprising:
a memory; and at least one processor coupled to the memory, the at least one processor executing a set of instructions which cause the at least one processor to compare intensity of each of a plurality of pixels in a source video frame with intensity of one or more corresponding pixels in a reference video frame to calculate one or more intensity differences for each of the plurality of pixels; and determine whether said each of the plurality of pixels in the source video frame is a foreground pixel based on the one or more intensity differences and a plurality of threshold sets.
- 17. The system of claim 16 wherein the reference video frame is a background mosaic transformed based on background motion information.
- 18. The system of claim 16 wherein the plurality of threshold sets comprises a first set of thresholds allowing a larger intensity difference, a third set of thresholds allowing a smaller intensity difference, and a second set of thresholds allowing an intensity difference in between the larger intensity difference and the smaller intensity difference.
- 19. An apparatus comprising:
an intensity difference calculator to compare intensity of each of a plurality of pixels in a source video frame with intensity of one or more corresponding pixels in a reference video frame to calculate one or more intensity differences for each of the plurality of pixels; and a multi-thresholding processor to determine whether said each of the plurality of pixels in the source video frame is a foreground pixel based on the one or more intensity differences and a plurality of threshold sets.
- 20. The apparatus of claim 19 wherein the reference video frame is a background mosaic transformed based on background motion information.
- 21. The apparatus of claim 19 further comprising a window designator to define, for each of the plurality of pixels in the source frame, a window in the reference frame that is centered around a pixel corresponding to said each of the plurality of pixels in the source frame, the window being of a predefined size and containing the one or more corresponding pixels participating in intensity comparison with said each of the plurality of pixels in the source frame.
- 22. The apparatus of claim 19 wherein the plurality of threshold sets comprises a first set of thresholds allowing a larger intensity difference, a third set of thresholds allowing a smaller intensity difference, and a second set of thresholds allowing an intensity difference in between the larger intensity difference and the smaller intensity difference.
- 23. The apparatus of claim 22 wherein the multi-thresholding processor is to determine whether said each of the plurality of pixels in the source video frame is a foreground pixel by
generating a first thresholding result using the first set of thresholds, generating a second thresholding result using the second set of thresholds, generating a third thresholding result using the second thresholding result and the third set of thresholds, and combining the third thresholding result with the first thresholding result to provide foreground/background segmentation of the source video frame.
- 24. The apparatus of claim 23 wherein the multi-thresholding processor is to generate the first thresholding result by
classifying each of the plurality of pixels in the source video frame as any one of a background pixel and a foreground pixels by comparing the one or more intensity differences associated with said each of the plurality of pixels with the first set of thresholds, forming a first binary image using the foreground/background classification, and identifying one or more foreground candidate regions in the first binary image, the one or more foreground candidate regions representing the first thresholding result.
- 25. The apparatus of claim 24 wherein the multi-thresholding processor is to classify each of the plurality of pixels in the source video frame as any one of a background pixel and a foreground pixels by
classifying each of the plurality of pixels in the source video frame as a background pixel if at least one of the one or more intensity differences is below the first set of thresholds, and classifying each of the plurality of pixels in the source video frame as a foreground pixel if none of the one or more intensity differences is below the first set of thresholds.
- 26. The apparatus of claim 24 wherein the multi-thresholding processor is to subject the first binary image to morphological filtering to effect cleanup of noise prior to identifying one or more foreground candidate regions in the first binary image.
- 27. The apparatus of claim 24 wherein the multi-thresholding processor is to generating the second thresholding result by
classifying each of the plurality of pixels in the source video frame as any one of a background pixel and a foreground pixel by comparing the one or more intensity differences associated with said each of the plurality of pixels with the second set of thresholds, forming a second binary image using the foreground/background classification with the second set of thresholds, and subjecting the second binary image to morphological dilation to define one or more neighborhood areas for one or more foreground regions in the source video frame, the one or more neighborhood areas representing the second thresholding result.
- 28. The apparatus of claim 27 wherein the multi-thresholding processor is to subject the second binary image to morphological filtering to effect cleanup of noise prior to subjecting the second binary image to morphological dilation.
- 29. The apparatus of claim 24 wherein the multi-thresholding processor is to generate the third thresholding result by
classifying pixels in the one or more neighborhood areas as foreground/background pixels by comparing intensity differences associated with the pixels in the one or more neighborhood areas with the third set of thresholds, and forming a third binary image using the foreground/background classification with the third set of thresholds, foreground pixels in the third binary image representing the third thresholding result.
- 30. The apparatus of claim 29 wherein the multi-thresholding processor is to combine the third thresholding result with the first thresholding result by
determining which pixels in the third thresholding result can be connected to the one or more foreground candidate regions in the first thresholding result to provide final foreground/background classification of pixels in the source video image, forming a final binary image using the final foreground/background classification, and performing morphological smoothing of the final binary image to provide clean foreground/background segmentation of the source video image.
- 31. An apparatus comprising:
means for comparing intensity of each of a plurality of pixels in a source video frame with intensity of one or more corresponding pixels in a reference video frame to calculate one or more intensity differences for each of the plurality of pixels; and means for determining whether said each of the plurality of pixels in the source video frame is a foreground pixel based on the one or more intensity differences and a plurality of threshold sets.
- 32. The apparatus of claim 31 wherein the reference video frame is a background mosaic transformed based on background motion information.
- 33. The apparatus of claim 31 wherein the plurality of threshold sets comprises a first set of thresholds allowing a larger intensity difference, a third set of thresholds allowing a smaller intensity difference, and a second set of thresholds allowing an intensity difference in between the larger intensity difference and the smaller intensity difference.
RELATED APPLICATIONS
[0001] This application is related to and claims the benefit of U.S. Provisional Patent application serial No. 60/340,203 filed Dec. 12, 2001, and U.S. Provisional Patent application serial No. 60/340,204 filed Dec. 12, 2001, which are hereby incorporated by reference.