Document analysis by extracting the geometrical structure