This disclosure relates to estimating a surface area and/or volume of a body or a body part of a subject, for example for use in a treatment operation that is to be performed on the body or the body part, and in particular to a computer-implemented method, apparatus and computer program product for estimating a surface area and/or volume of a body or a body part of a subject.
In adult humans the body or skin surface area (BSA) of a subject can be anywhere in a wide range, typically from about 1.4 to 2.6 square meters (m²). Body volume (BV) of a subject can likewise vary over a wide range.
The use of three-dimensional (3D) human body shape measurements or models has the potential to change the way that subjects interact with the world in a wide variety of ways. Applications of this technology could be helpful in several fields such as healthcare, online shopping and the textile industry. For example, in the healthcare domain, knowledge of the 3D body shape can help in the assessment of the Psoriasis Area and Severity Index (PASI), or in dosing chemotherapy or other drugs according to the BSA.
Clearly for these applications it is desirable for the estimation of the BSA, BV and body shape prediction to be as accurate as possible. One of the most accurate techniques for estimating BSA or BV or estimating the surface area and/or volume of a particular body part uses full body 3D scanning of the subject and processing to identify different body parts. However, the technology required for full body 3D scanning is expensive and typically not available in, for example, domestic environments, and therefore impacts the scenarios in which BSA, BV or body part surface area and/or volume can potentially be used.
It is therefore desirable to be able to estimate BSA, BV or the surface area and/or volume of particular body parts in a more cost effective and simple way, for example using information that is readily available to a subject and/or obtainable by the subject.
Some techniques exist in which a 3D model of the body of a subject or of a particular body part of a subject can be determined from available metadata, such as age, gender, volume, blood pressure, number of children, body mass index (BMI), etc., and body measurements, such as height, leg length, arm circumference, etc., although often these techniques do not provide surface area and/or volume values that are sufficiently accurate for many applications. One example of such a technique is found in “Estimating 3D human shapes from measurements” by S. Wuhrer and C. Shu, Mach. Vis. Appl., vol. 24, no. 6, pp. 1133-1147, 2013.
Therefore there is a need for improvements in estimating a surface area and/or volume of a body part or body of a subject without requiring a 3D body scan or 3D body part scan of the subject.
According to a first specific aspect, there is provided a computer-implemented method for estimating a surface area and/or a volume of a body or a body part of a subject. The method comprises obtaining at least one image, wherein the at least one image includes a face of the subject; processing the at least one image to determine values for one or more facial image parameters for the face of the subject; determining values for one or more characteristics of the subject, wherein the one or more characteristics comprises one or more of age of the subject, weight of the subject, height of the subject and gender of the subject; using a facial parametric model and the determined values for the one or more facial image parameters to determine values for one or more facial shape parameters for the face of the subject, wherein the facial parametric model relates specific values for one or more facial image parameters to a respective 3D representation of a face having respective values for the one or more facial shape parameters; using a prediction model with the determined values for the one or more characteristics and the determined values of the one or more facial shape parameters to predict a 3D representation of the full body of the subject; and analyzing the predicted 3D representation of the full body of the subject to estimate the surface area and/or the volume of the body or body part of the subject. Thus, the surface area and/or volume of the body or body part(s) can be determined just using one or more images of the subject and one or more of the age, weight, height and gender.
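Purely by way of illustration (and not forming part of the claimed method), the following Python sketch shows how the steps of this aspect could be composed in software; every helper name (extract_facial_image_parameters, the model objects and the mesh methods) is a hypothetical placeholder, not an API defined by this disclosure.

    import numpy as np

    def estimate_area_and_volume(image, characteristics,
                                 facial_model, prediction_model, body_part=None):
        # determine facial image parameters (e.g. landmark points) from the image
        facial_image_params = extract_facial_image_parameters(image)  # hypothetical

        # facial parametric model: facial image parameters -> facial shape parameters
        facial_shape_params = facial_model.fit(facial_image_params)   # hypothetical

        # prediction model: characteristics (age, weight, height, gender) plus
        # facial shape parameters -> 3D representation (mesh) of the full body
        x = np.concatenate([characteristics, facial_shape_params])
        body_mesh = prediction_model.predict(x)                       # hypothetical

        # analyze the predicted mesh, optionally restricted to a body part
        if body_part is not None:
            body_mesh = body_mesh.select(body_part)                   # hypothetical
        return body_mesh.surface_area(), body_mesh.volume()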
In some embodiments, the facial parametric model is a linear model. In some embodiments, the prediction model is a non-linear model, e.g. a non-linear regression model, such as a cubic polynomial.
In some embodiments, the step of using the prediction model to predict the 3D representation of the full body comprises using the prediction model to predict values of one or more body shape parameters from the determined values for the one or more characteristics and the determined values of the one or more facial shape parameters; and using a body parametric model and the predicted values of the one or more body shape parameters to predict the 3D representation of the full body, wherein the body parametric model relates specific values for the one or more body shape parameters to a respective 3D representation of a body. In these embodiments, the body parametric model can be a linear model.
In some embodiments, the method further comprises determining the prediction model from a population dataset, wherein the population dataset comprises 3D scans of a plurality of test subjects and values for the one or more characteristics for each of the test subjects, and the prediction model is determined by: registering a body parametric model to each of the 3D scans, wherein the body parametric model relates a 3D representation of a body to specific values of one or more body shape parameters; determining values of the one or more body shape parameters for each of the registered body parametric models; registering a facial parametric model to each of the 3D scans, wherein the facial parametric model relates a 3D representation of a face to specific values of one or more facial shape parameters; determining values of the one or more facial shape parameters for each of the registered facial parametric models; and forming the prediction model from the determined values of the one or more body shape parameters, determined values of the one or more facial shape parameters and values for the one or more characteristics for each of the test subjects.
In some embodiments, the prediction model is specific to the body part of the subject for which the surface area and/or the volume is to be estimated, wherein the prediction model predicts the 3D representation of the full body based on determined values of the one or more facial shape parameters and a respective subset of the one or more characteristics. In this way the prediction model can be customized to the body part(s) of interest, requiring values for only some of the characteristics.
In some embodiments, the step of forming the prediction model comprises: forming a plurality of candidate prediction models, wherein each candidate prediction model uses a respective subset of the one or more characteristics; evaluating an accuracy of each of the candidate prediction models in predicting the 3D representation of the full body or a body part or body parts; and forming the prediction model as a candidate prediction model that provides one of: a highest accuracy of the candidate prediction models, and/or a sufficient accuracy with a minimum number of characteristics.
In some embodiments, the method further comprises: receiving an indication of the body or the body part of the subject for which the surface area and/or the volume is to be estimated; wherein the step of analyzing comprises analyzing the predicted 3D representation of the full body of the subject to estimate the surface area and/or the volume of the indicated body or body part of the subject.
In these embodiments, the method can further comprise: requesting an input indicating the body or body part of the subject for which the surface area and/or the volume is to be estimated.
In these embodiments, the indication can be received as an input corresponding to a body part or body parts displayed on a user interface.
In some embodiments, the method can further comprise: requesting an input indicating the values of the respective subset of the one or more characteristics used by the prediction model for the indicated body part.
In some embodiments, the step of forming the prediction model can be performed after receiving the indication. In this way, the user is able to indicate any desired body part(s) or combination of body part(s) for which the surface area and/or volume is to be estimated, and a prediction model that is suitable for modelling that/those body part(s) can be determined.
In some embodiments, the body part of the subject for which the surface area and/or the volume is to be estimated comprises a plurality of non-contiguous body parts.
In some embodiments, the step of determining values for the one or more characteristics comprises processing the at least one image to determine the values for one or more of the characteristics. This has the advantage that it is not necessary for the values of the one or more characteristics to be manually input by a user or the subject.
In alternative embodiments, the step of determining values for the one or more characteristics comprises receiving an input from the subject indicating the values for one or more of the one or more characteristics.
In some embodiments, the one or more characteristics of the subject is a plurality of characteristics of the subject comprising two or more of age, weight, height and gender.
In some embodiments, the body part is one or more of an arm, both arms, a leg, both legs, a foot, both feet, a hand, both hands, a palm of a hand, both palms, a breast or both breasts, waist, hips, chest, torso, abdomen, and back.
According to a second aspect, there is provided a computer-implemented method of providing feedback on a treatment operation by a treatment device. The method comprises: estimating the surface area and/or volume of a body or a body part of a subject that is to be treated in a treatment operation using the treatment device according to the first aspect or any embodiment thereof; and using the estimated surface area and/or volume to determine feedback on the treatment operation.
In some embodiments, the feedback is any one of: feedback on progress of the treatment operation; a number of treatments required to treat the body or body part in the treatment operation; a number of treatments remaining required to treat the body or body part in the treatment operation; guidance to a user of the treatment device to treat the body or body part.
According to a third aspect, there is provided a computer program product comprising a computer readable medium having computer readable code embodied therein, the computer readable code being configured such that, on execution by a suitable computer or processor, the computer or processor is caused to perform the method according to the first aspect, the second aspect, or any embodiments thereof.
According to a fourth aspect, there is provided an apparatus for estimating a surface area and/or a volume of a body or a body part of a subject. The apparatus comprises a processing unit configured to obtain at least one image from an imaging unit, wherein the at least one image includes a face of the subject; process the at least one image to determine values for one or more facial image parameters for the face of the subject; determine values for one or more characteristics of the subject, wherein the one or more characteristics comprises one or more of age of the subject, weight of the subject, height of the subject and gender of the subject; use a facial parametric model and the determined values for the one or more facial image parameters to determine values for one or more facial shape parameters for the face of the subject, wherein the facial parametric model relates specific values for one or more facial image parameters to a respective 3D representation of a face having respective values for the one or more facial shape parameters; use a prediction model with the determined values for the one or more characteristics and the determined values of the one or more facial shape parameters to predict a 3D representation of the full body of the subject; and analyze the predicted 3D representation of the full body of the subject to estimate the surface area and/or the volume of the body or body part of the subject. Thus, the surface area and/or volume of the body or body part(s) can be determined just using one or more images of the subject and one or more of the age, weight, height and gender.
In some embodiments, the facial parametric model is a linear model. In some embodiments, the prediction model is a non-linear model, e.g. a non-linear regression model, such as a cubic polynomial.
In some embodiments, the processing unit is configured to use the prediction model to predict values of one or more body shape parameters from the determined values for the one or more characteristics and the determined values of the one or more facial shape parameters; and use a body parametric model and the predicted values of the one or more body shape parameters to predict the 3D representation of the full body, wherein the body parametric model relates specific values for the one or more body shape parameters to a respective 3D representation of a body. In these embodiments, the body parametric model can be a linear model.
In some embodiments, the processing unit is further configured to determine the prediction model from a population dataset, wherein the population dataset comprises 3D scans of a plurality of test subjects and values for the one or more characteristics for each of the test subjects, and the processing unit is configured to determine the prediction model by: registering a body parametric model to each of the 3D scans, wherein the body parametric model relates a 3D representation of a body to specific values of one or more body shape parameters; determining values of the one or more body shape parameters for each of the registered body parametric models; registering a facial parametric model to each of the 3D scans, wherein the facial parametric model relates a 3D representation of a face to specific values of one or more facial shape parameters; determining values of the one or more facial shape parameters for each of the registered facial parametric models; and forming the prediction model from the determined values of the one or more body shape parameters, determined values of the one or more facial shape parameters and values for the one or more characteristics for each of the test subjects.
In some embodiments, the prediction model is specific to the body part of the subject for which the surface area and/or the volume is to be estimated, wherein the prediction model predicts the 3D representation of the full body based on determined values of the one or more facial shape parameters and a respective subset of the one or more characteristics. In this way the prediction model can be customized to the body part(s) of interest, requiring values for only some of the characteristics.
In some embodiments, the processing unit is configured to form the prediction model by: forming a plurality of candidate prediction models, wherein each candidate prediction model uses a respective subset of the one or more characteristics; evaluating an accuracy of each of the candidate prediction models in predicting the 3D representation of the full body or a body part or body parts; and forming the prediction model as a candidate prediction model that provides one of: a highest accuracy of the candidate prediction models, and/or a sufficient accuracy with a minimum number of characteristics.
In some embodiments, the processing unit is further configured to: receive an indication of the body or the body part of the subject for which the surface area and/or the volume is to be estimated; wherein the processing unit is configured to analyze the predicted 3D representation of the full body of the subject to estimate the surface area and/or the volume of the indicated body or body part of the subject.
In these embodiments, the processing unit is further configured to: request an input indicating the body or body part of the subject for which the surface area and/or the volume is to be estimated.
In these embodiments, the indication can be received as an input corresponding to a body part or body parts displayed on a user interface.
In some embodiments, the processing unit is further configured to: request an input indicating the values of the respective subset of the one or more characteristics used by the prediction model for the indicated body part.
In some embodiments, the processing unit is configured to form the prediction model after receiving the indication. In this way, the user is able to indicate any desired body part(s) or combination of body part(s) for which the surface area and/or volume is to be estimated, and a prediction model that is suitable for modelling that/those body part(s) can be determined.
In some embodiments, the body part of the subject for which the surface area and/or the volume is to be estimated comprises a plurality of non-contiguous body parts.
In some embodiments, the processing unit is configured to determine values for the one or more characteristics by processing the at least one image to determine the values for one or more of the characteristics. This has the advantage that it is not necessary for the values of the one or more characteristics to be manually input by a user or the subject.
In alternative embodiments, the processing unit is configured to determine values for the one or more characteristics by receiving an input from the subject indicating the values for one or more of the one or more characteristics.
In some embodiments, the one or more characteristics of the subject is a plurality of characteristics of the subject comprising two or more of age, weight, height and gender.
In some embodiments, the body part is one or more of an arm, both arms, a leg, both legs, a foot, both feet, a hand, both hands, a palm of a hand, both palms, a breast or both breasts, waist, hips, chest, torso, abdomen, and back.
According to a fifth aspect, there is provided an apparatus for providing feedback on a treatment operation by a treatment device. The apparatus comprises a processing unit configured to: estimate the surface area and/or volume of a body or a body part of a subject that is to be treated in a treatment operation using the treatment device according to the fourth aspect or any embodiment thereof; and use the estimated surface area and/or volume to determine feedback on the treatment operation.
In some embodiments, the feedback is any one of: feedback on progress of the treatment operation; a number of treatments required to treat the body or body part in the treatment operation; a number of treatments remaining required to treat the body or body part in the treatment operation; guidance to a user of the treatment device to treat the body or body part.
According to a sixth aspect, there is provided a system, comprising an apparatus according to the fourth aspect, fifth aspect, or any embodiment thereof; and an imaging unit for obtaining at least one image.
These and other aspects will be apparent from and elucidated with reference to the embodiment(s) described hereinafter.
Exemplary embodiments will now be described, by way of example only, with reference to the accompanying drawings.
As noted above, there are various applications for which information on the (total) body surface area (BSA), body volume (BV) or surface area and/or volume of a body part (e.g. a leg, both legs, arm, both arms, chest, back, torso, etc.) is useful. However, in many of these applications it is desirable to be able to obtain reliable estimates of body/skin surface area/volume without having to perform full body 3D scanning of the subject or the relevant body part(s), since the technology required for full body 3D scanning is expensive, both in terms of cost and processing resources, and typically not available in domestic environments.
One of the problems in the efficient processing of full body 3D models is the high volume of data. The cost and volume of the data required can be significantly reduced by the system ‘learning’ a statistical representation of the ‘generic’ human shape space, which means that only sparse data about the subject combined with the learned space is needed to reconstruct a full body 3D model of the subject instead of a dense representation of the subject themselves. The techniques described herein provide for estimates of the BSA, BV, surface area and/or volume of a body part(s) for a subject to be obtained using a statistical representation of a generic human shape space, and limited, but easily obtainable, additional information about the subject. In particular, it has been found that features of the shape of the face of the subject (which can be observed or inferred from an image of the face of the subject), in combination with values of one or more characteristics of the subject, such as age, weight, height and/or gender, can provide sufficiently reliable estimates of the BSA, BV, surface area and/or volume of a body part(s) for a subject.
The imaging unit 6 may include or be any suitable component for capturing an image, for example a charge-coupled device (CCD) and may include one or more lenses and/or mirrors. In some embodiments, the imaging unit 6 is the camera, or one of the cameras, in an electronic device such as a smartphone, a smartwatch, a tablet computer, a laptop, a digital camera or a smart mirror. The imaging unit 6 is able to output an obtained image, or series of obtained images to the apparatus 4.
The apparatus 4 comprises a processing unit 8 that is for estimating the surface area and/or volume of a body or particular body part according to the techniques described herein. The processing unit 8 can be implemented in numerous ways, with software and/or hardware, to perform the various functions described herein. The processing unit 8 may comprise one or more microprocessors or digital signal processors (DSPs) that may be programmed using software or computer program code to perform the required functions and/or to control components of the processing unit 8 to effect the required functions. The processing unit 8 may be implemented as a combination of dedicated hardware to perform some functions (e.g. amplifiers, pre-amplifiers, analog-to-digital convertors (ADCs) and/or digital-to-analog convertors (DACs)) and a processor (e.g., one or more programmed microprocessors, controllers, microcontrollers, DSPs and associated circuitry) to perform other functions. Examples of components that may be employed in various embodiments of the present disclosure include, but are not limited to, conventional microprocessors, DSPs, application specific integrated circuits (ASICs), and field-programmable gate arrays (FPGAs).
The processing unit 8 is connected to a memory unit 10 (in alternative embodiments the memory unit 10 can be part of the processing unit 8), and the memory unit 10 can store data, information and/or signals for use by the processing unit 8 in estimating the surface area and/or volume of a body or particular body part. For example the memory unit 10 can store one or more images of a subject. In some implementations the memory unit 10 stores computer-readable code that can be executed by the processing unit 8 so that the processing unit 8 performs one or more functions, including the operations described herein. The memory unit 10 can comprise any type of non-transitory machine-readable medium, such as cache or system memory including volatile and non-volatile computer memory such as random access memory (RAM), static RAM (SRAM), dynamic RAM (DRAM), read-only memory (ROM), programmable ROM (PROM), erasable PROM (EPROM) and electrically erasable PROM (EEPROM), implemented in the form of a memory chip, an optical disk (such as a compact disc (CD), a digital versatile disc (DVD) or a Blu-Ray disc), a hard disk, a tape storage solution, or a solid state device, including a memory stick, a solid state drive (SSD), a memory card, etc.
The apparatus 4 can also include interface circuitry 12. The interface circuitry 12 can enable the apparatus 4 to communicate with the imaging unit 6 (e.g. to receive images obtained by the imaging unit 6). In embodiments where the apparatus 4, or the functionality provided by the apparatus 4, is part of or integral with the imaging unit 6, the interface circuitry 12 in the apparatus 4 may be omitted.
The interface circuitry 12 in the apparatus 4 is for enabling a data connection to and/or data/information exchange with another device/apparatus/unit, including the imaging unit 6, if present in the system 2. The connection may be direct or indirect (e.g. via the Internet), and thus the interface circuitry 12 can enable a connection between the apparatus 4 and a network, such as the Internet, or directly between the apparatus 4 and another device/apparatus/unit, via any desirable wired or wireless communication protocol. For example, the interface circuitry 12 can operate using WiFi, Bluetooth, Zigbee, or any cellular communication protocol (including but not limited to Global System for Mobile Communications (GSM), Universal Mobile Telecommunications System (UMTS), Long Term Evolution (LTE), LTE-Advanced, etc.). In the case of a wireless connection, the interface circuitry 12 (and thus the apparatus 4) may include one or more suitable antennas for transmitting/receiving over a transmission medium (e.g. the air). Alternatively, in the case of a wireless connection, the interface circuitry 12 may include means (e.g. a connector or plug) to enable the interface circuitry 12 to be connected to one or more suitable antennas external to the apparatus 4 for transmitting/receiving over a transmission medium (e.g. the air). The interface circuitry 12 is connected to the processing unit 8 to enable information or data received by the interface circuitry 12 to be provided to the processing unit 8, and/or to enable information or data from the processing unit 8 (e.g. an estimate of the surface area of the body or a body part) to be transmitted by the interface circuitry 12.
In some embodiments, the apparatus 4 comprises a user interface 14 that includes one or more components that enables a user of apparatus 4 (e.g. the subject) to input information, data and/or commands into the apparatus 4, such as an indication of the part of the body that the surface area and/or volume should be estimated for, and/or enables the apparatus 4 to output information or data to the user of the apparatus 4, for example the estimate of the surface area and/or volume of the body or body part. The user interface 14 can comprise any suitable input component(s), including but not limited to a keyboard, keypad, one or more buttons, switches or dials, a mouse, a track pad, a touchscreen, a stylus, a camera, a microphone, etc., and the user interface 14 can comprise any suitable output component(s), including but not limited to a display screen, one or more lights or light elements, one or more loudspeakers, a vibrating element, etc.
The apparatus 4 can be any type of electronic device or computing device. In some implementations, the apparatus 4 can be, or be part of, a laptop, a tablet, a computer, a smartphone, a smartwatch, a smart mirror, etc., or other type of electronic device that can be present or used in a home or care environment of the subject/user. In other implementations, the apparatus 4 is an apparatus that is remote from the subject/user, and remote from the home or care environment of the subject/user. For example, the apparatus 4 can be a server, for example a server in a data center (also referred to as being ‘in the cloud’).
As noted above, the techniques described herein provide for estimates of the BSA, BV, surface area and/or volume of a body part(s) for a subject to be obtained using a statistical representation of a generic human shape space, in particular a body parametric model, along with features of the shape of the face of the subject (which can be observed or inferred from an image of the face of the subject) and values of one or more characteristics of the subject, such as age, weight, height and/or gender. In particular, one or more images of the face of the subject are processed to determine values for one or more facial image parameters, and a facial parametric model is used to determine one or more facial shape parameters for the subject from the facial image parameters determined from the image(s). The facial shape parameter(s) are input into a prediction model along with the values of the one or more characteristics to determine a 3D representation of the full body. The prediction model predicts values for one or more body shape parameters from the specific values for the one or more characteristics and specific values for the one or more facial shape parameters. A body parametric model (which can be part of the prediction model), is used to determine a 3D representation of the full body of the subject based on the determined values for the one or more body shape parameters. The surface area and/or volume of the full body and/or one or more body parts can be determined from the 3D representation of the full body.
In some preferred embodiments, the techniques described herein enable a body part or body parts of interest to be selected (e.g. by a user using a graphical user interface), and the surface area and/or volume of that/those selected body part(s) to be estimated. In some further embodiments, since the body part or body parts of interest can be selected dynamically by a user, a prediction model can be determined following the selection that is specific to estimating the surface area and/or volume of the selected body part(s) and that receives specific one(s) of the characteristics as inputs.
The following description provides some further details of a facial parametric model, body parametric model and prediction model that can be used in embodiments of the techniques described herein, and in particular indicates how an exemplary facial parametric model and an exemplary body parametric model can be derived from a population dataset that includes data (3D scans) for a population of test subjects, and the effectiveness of these exemplary models in estimating surface area of a full body or a body part based on values of various input parameters. Those skilled in the art will appreciate that the techniques and information provided below can be readily adapted to estimate volume of a full body or body part based on values of various input parameters. In particular, the surface area is estimated from a 3D representation of the full body of the subject of interest, and it is also possible to estimate volume from such a 3D representation.
The facial parametric model and body parametric model can be developed according to the methods described in “The space of human body shapes: reconstruction and parameterization from range scans” (by B. Allen, B. Curless, and Z. Popovic, ACM Transactions on Graphics, 2003) and “A morphable model for the synthesis of 3D faces” (by V. Blanz and T. Vetter, Proceedings of the 26th annual conference on Computer graphics and interactive techniques, 1999).
A facial parametric model can be derived using 3D scans of a large number (e.g. 3000) of different faces (i.e. faces of different shapes, sizes, ages, genders, weights, expressions, etc.). A body parametric model can be derived using 3D scans of a large number (e.g. 4000) of full bodies (including the face). In the present example, the 3D scans were of test subjects that were standing in a ‘tree’ position (i.e. standing upright with arms out from the body). Databases containing suitable population datasets (i.e. 3D scans of faces and full bodies) are available to those skilled in the art, for example the ‘Size China Dataset’ described in “Measuring Chinese heads and faces”, by R. Ball and J. Molenbroek, Proceedings of the 9th International Congress of Physiological Anthropology, Human diversity: design for life, 2008, and the CAESAR dataset described in “The CAESAR project: a 3-D surface anthropometry survey” by K. Robinette, H. Daanen, and E. Paquet, Second International Conference on 3-D Digital Imaging and Modeling (Cat. No.PR00062), 1999.
Briefly, respective template meshes for the body and face are registered into the 3D scans of the bodies and faces respectively to form registered models, and the registered models are encoded into the selected body parametric model parameters and facial parametric model parameters. A mesh represents, in a discrete form, the surface of the template or the 3D scans.
Registration—In order to register every face and full body template mesh, non-rigid registration techniques can be used. Such techniques are known from, for example, “Registration of 3d point clouds and meshes: A survey from rigid to Nonrigid” (by G. K. Tam, Z. Q. Cheng, Y. K. Lai, F. C. Langbein, Y. Liu, D. Marshall, R. R. Martin, X. F. Sun, and P. L. Rosin, IEEE Transactions on Visualization and Computer Graphics, 2013), “A survey on shape correspondence” (by O. van Kaick, H. Zhang, G. Hamarneh, and D. Cohen-Or, Eurographics Symposium on Geometry Processing, 2011) and “On Computing Mapping of 3D Objects” (by X. Li and S. S. Iyengar, ACM Computing Surveys, 2014).
A template mesh with $N_P \approx 53000$ vertices can be used for the body, as shown in the accompanying drawings.
Both template meshes can then be used to register the full population dataset that includes the 3D full body scans. The quality of the registration can be assessed via visual inspection and other measures (e.g. as described in the paper "Registration of 3d point clouds and meshes: A survey from rigid to Nonrigid"). For around $N \approx 3750$ full body 3D scans, both registrations show low fit error (e.g. below 0.5 millimeters (mm) Root Mean Squared Error (RMSE) surface distance for the registration of the facial mesh, and below 1.0 mm RMSE for the registration of the full body).
Registration leads to a representation of each test subject in the population dataset as two morphed template meshes. Let $v_{i,j}^r \in \mathbb{R}^3$ be the full body morphed coordinates of vertex $j \in [1, N_P]$ for population subject $i \in [1, N]$. Furthermore, the morphed coordinates of all vertices of scan $i \in [1, N]$ can be written as a single flattened vector, stacking all vertices' coordinates together, as

$$p_i^r = (v_{i,1}^r, v_{i,2}^r, \ldots, v_{i,N_P}^r)' \in \mathbb{R}^{3N_P} \quad (1)$$

where $i$ is the test subject, $r$ denotes registered data rather than the (raw) subject data, $N_P$ represents the total number of vertices for the body template and $p_i^r$ represents all vertices of test subject $i$. Collecting all participants into a rectangular matrix results in

$$P^r = (p_1^r; p_2^r; \ldots; p_N^r)' \in \mathbb{R}^{N \times 3N_P} \quad (2)$$

where $P^r$ is a matrix including all test subjects and all vertices for the full body. In the same way, the face representation is defined as

$$Q^r = (q_1^r; q_2^r; \ldots; q_N^r)' \in \mathbb{R}^{N \times 3N_Q} \quad (3)$$

where $Q^r$ is a matrix including all test subjects and all vertices for the face and $N_Q$ represents the total number of vertices for the face template.
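As a concrete illustration of the stacking in equations (1)-(3), the following NumPy sketch flattens registered meshes into the matrix $P^r$; toy sizes stand in for the real dataset sizes.

    import numpy as np

    # toy sizes; the example dataset uses N ~ 3750 subjects, N_P ~ 53000 vertices
    N, NP = 5, 100
    registered = np.zeros((N, NP, 3))      # stand-in for the N morphed meshes

    # equation (1): each subject's vertices become one flattened row vector;
    # equation (2): stacking the rows gives the N x 3*N_P matrix P^r
    P_r = registered.reshape(N, 3 * NP)
    assert P_r.shape == (N, 3 * NP)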
Parametric spaces—The registered meshes can be parametrized with Principal Component Analysis (PCA) transformation, using a plurality of (e.g. 200) Eigenvectors for the body and a plurality (e.g. 180) Eigenvectors for the face. Those skilled in the art will appreciate that different numbers of Eigenvectors can be used. The PCA transformation can be written in matrix form as
$$P^r = \bar{P}^r + Y D' + E^r \quad (4)$$

where $D \in \mathbb{R}^{3N_P \times 200}$ is the matrix of Eigenvectors, $Y \in \mathbb{R}^{N \times 200}$ contains the PCA coefficient vectors of the test subjects, $\bar{P}^r$ is the mean shape over the population and $E^r$ is the residual error. Discarding the residual error gives the approximation

$$P^r \approx \bar{P}^r + Y D' \quad (6)$$
The transformation in equation (6) gives a compact representation of the 53000 3D vectors of vertex coordinates in $P^r$ with the 200-dimensional PCA coefficient vectors $Y$. In the same way, the PCA transformation is applied to the registered facial meshes:
$$Q^r \approx \bar{Q}^r + X_Q D_Q' \quad (7)$$

where $X_Q \in \mathbb{R}^{N \times 180}$ contains the facial PCA coefficient vectors and $D_Q$ is the corresponding matrix of facial Eigenvectors.
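A minimal sketch of the PCA parametrization in equations (4)-(7), using NumPy's SVD; toy sizes stand in for the real dataset, and 200/180 are the Eigenvector counts given above.

    import numpy as np

    def pca_fit(M, k):
        # fit a k-component PCA model to the rows of M (one row per subject)
        mean = M.mean(axis=0)
        # rows of Vt are Eigenvectors of the covariance; keep the first k
        _, _, Vt = np.linalg.svd(M - mean, full_matrices=False)
        D = Vt[:k].T                     # Eigenvector matrix D
        return mean, D

    def pca_encode(M, mean, D):
        return (M - mean) @ D            # coefficients Y, as in equation (4)

    def pca_decode(Y, mean, D):
        return mean + Y @ D.T            # approximation in equation (6)

    # toy data: 50 subjects, 300 coordinates (real sizes: N ~ 3750, 3*N_P ~ 159000)
    P_r = np.random.rand(50, 300)
    mean, D = pca_fit(P_r, k=20)         # k = 200 for the body, k = 180 for the face
    Y = pca_encode(P_r, mean, D)
    P_hat = pca_decode(Y, mean, D)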
Prediction model—This section explains how the body shape coefficients $Y$ are predicted using the subject's features, denoted as $X_F \in \mathbb{R}^{N \times (N_F + 1)}$, where $N_F$ is the number of features, or using the augmented features $X_G \in \mathbb{R}^{N \times (N_G + 1)}$ obtained by forming all interactions of the features up to third order, so that

$$N_G = \binom{N_F + 3}{3} - 1 \quad (8)$$

which, in the case when the (binary) gender feature is included (powers of gender collapse onto gender itself, removing $N_F + 1$ degenerate terms), becomes

$$N_G = \binom{N_F + 3}{3} - 1 - (N_F + 1) \quad (9)$$
Equations (8) and (9) are given for completeness but are not needed to understand the rest of the techniques described herein; the counts can equally be obtained by algorithms that simply enumerate the combinations, or by standard combinatorial techniques such as 'stars and bars'. To facilitate the notation, the constant term is included in both $X_F$ and $X_G$, but it is not counted in $N_F$ and $N_G$.
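The following self-contained Python check takes the enumeration route mentioned above and reproduces the counts quoted later in this description, $N_G = 441$ for $N_F = 12$ and $N_G = 29$ for $N_F = 4$.

    from itertools import combinations_with_replacement

    def count_regressors(n_features, gender_index=None, max_degree=3):
        # Count distinct non-constant monomials of degree <= max_degree.
        # If gender_index marks a binary feature, gender**k == gender, so any
        # repeated gender factor collapses and the duplicates are removed.
        monomials = set()
        for degree in range(1, max_degree + 1):
            for combo in combinations_with_replacement(range(n_features), degree):
                if gender_index is not None and gender_index in combo:
                    others = tuple(c for c in combo if c != gender_index)
                    combo = others + (gender_index,)   # keep one gender factor
                monomials.add(tuple(sorted(combo)))
        return len(monomials)

    print(count_regressors(12, gender_index=0))  # 441, as given by equation (9)
    print(count_regressors(4, gender_index=0))   # 29
    print(count_regressors(12))                  # 454, equation (8) with N_F = 12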
Then, multi-linear regression is performed for the body coefficients $Y$:

$$Y = X B + \varepsilon \quad (10)$$

with four settings of the independent variable $X$, with and without interactions and with and without face coefficients:

(a) $X = X_F \in \mathbb{R}^{N \times (N_F + 1)}$;

(b) $X = X_G \in \mathbb{R}^{N \times (N_G + 1)}$;

(c) $X = [X_F, X_Q] \in \mathbb{R}^{N \times (N_F + 1 + 180)}$;

(d) $X = [X_G, X_Q] \in \mathbb{R}^{N \times (N_G + 1 + 180)}$.
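As an illustrative sketch only, equation (10) can be fitted by ordinary least squares; the toy shapes below stand in for real $X_G$, $X_Q$ and $Y$ matrices built as described above.

    import numpy as np

    N = 200                                  # toy subject count
    X_G = np.random.rand(N, 30)              # augmented features incl. constant
    X_Q = np.random.rand(N, 180)             # facial PCA coefficients
    Y = np.random.rand(N, 200)               # body PCA coefficients

    X = np.hstack([X_G, X_Q])                # setting (d): X = [X_G, X_Q]
    # least squares estimate of B in Y = XB + eps
    B_hat, *_ = np.linalg.lstsq(X, Y, rcond=None)
    Y_hat = X @ B_hat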
Next, the predictions of specific body parts are evaluated, using the segmentation masks shown in the accompanying drawings. For each mask $m$, the regression restricted to the masked vertices can be written as
$$Y \Sigma_m = X B \Sigma_m + \varepsilon \Sigma_m \quad (15)$$

where $\Sigma_m = D' I_m D \in \mathbb{R}^{200 \times 200}$ and $I_m$ is a diagonal matrix that selects the vertex coordinates belonging to mask $m$. The least mean square estimate of $B$ in the above equation is

$$\hat{B}_m = ((X'X)^{-1} X' Y \Sigma_m) \Sigma_m^{-1} \quad (16)$$

for each mask $m$.
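A minimal NumPy sketch of equation (16), assuming $X$, $Y$ and $D$ from the sketches above and a boolean per-coordinate mask; the pseudo-inverse standing in for $\Sigma_m^{-1}$ is an implementation choice for ill-conditioned masks, not part of the source formula.

    import numpy as np

    def masked_lms_estimate(X, Y, D, coord_mask):
        # Equation (16): least mean square estimate of B for one mask.
        # X: (N, K) regressors; Y: (N, 200) body coefficients;
        # D: (3*NP, 200) Eigenvectors; coord_mask: (3*NP,) boolean.
        D_m = D[coord_mask]                              # rows selected by I_m
        Sigma_m = D_m.T @ D_m                            # D' I_m D, (200, 200)
        beta = np.linalg.solve(X.T @ X, X.T @ Y)         # (X'X)^{-1} X' Y
        return beta @ Sigma_m @ np.linalg.pinv(Sigma_m)  # ((X'X)^-1 X'Y Sigma_m) Sigma_m^-1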
Fitness measures—For each model and mask, a leave-one-out cross validation was performed on the $N$ test subjects. In other words, the estimation of $\hat{B}$ has been carried out every time, leaving out the test subject to predict. Once the predicted body coefficients $\hat{Y}$ are computed, it is necessary to convert back (decode), using the PCA transformation in equation (6), to reach the predicted vertices $\hat{P}$ as

$$\hat{P} = \bar{P}^r + \hat{Y} D' = \bar{P}^r + X \hat{B} D' \quad (17)$$
To evaluate the prediction, the predicted $\hat{P}(i,:)$ is first aligned to the original coordinates $P(i,:)$ for every $i \in [1, N]$ with weighted Procrustes alignment (as described in "Principal Warps: Thin-Plate Splines and the Decomposition of Deformations" by F. L. Bookstein, IEEE Transactions on Pattern Analysis and Machine Intelligence, 1989), and then the vertex-wise RMSE is computed over all test subjects for each vertex $v_{i,j}$ versus its predicted position $\hat{v}_{i,j}$:

$$\mathrm{RMSE}_j = \sqrt{\frac{1}{N} \sum_{i=1}^{N} \lVert v_{i,j} - \hat{v}_{i,j} \rVert^2}$$
As a final measure of fitness for the masks, the mean absolute error $E$ is used over all vertices of the mask:

$$E = \frac{1}{N N_m} \sum_{i=1}^{N} \sum_{j \in m} \lVert v_{i,j} - \hat{v}_{i,j} \rVert$$

where $N_m$ is the number of vertices in mask $m$.
The above error measure also penalizes misplacement of the body part points on the surface and therefore can be considered more accurate.
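In NumPy, the two fitness measures can be sketched as follows; V and V_hat are assumed to be (N, NP, 3) arrays of original and predicted vertex positions, already Procrustes-aligned, and vertex_mask a boolean vertex selector.

    import numpy as np

    def vertex_rmse(V, V_hat):
        # per-vertex RMSE over all N test subjects
        return np.sqrt(np.mean(np.sum((V - V_hat) ** 2, axis=-1), axis=0))

    def mask_mae(V, V_hat, vertex_mask):
        # mean absolute vertex error E (in the units of V, e.g. mm) for a mask
        d = np.linalg.norm(V[:, vertex_mask] - V_hat[:, vertex_mask], axis=-1)
        return d.mean()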
Results—Two groups of features, with twelve features in total, are evaluated; they are listed in Table 1 below.
The first group is composed of reported gender, age and weight (without clothes), all acquired in the paper "The CAESAR project: a 3-D surface anthropometry survey" mentioned above. The second group includes parametric measurements that were computed from the registered body meshes: the height, computed from head to floor; the upper body height, from the head to the highest touchable point of the pelvis; the arm length, as the distance from the acromion (shoulder) to the distal end of the middle finger; the leg length, from crotch to floor; the waist circumference, taken at the midpoint between the lower margin of the last palpable rib and the top of the iliac crest; the hips circumference, taken at the most prominent point, over the major trochanters, at the level of the maximum relief of the gluteal muscles; the arm circumference, taken at the midpoint of the total length of the arm, between acromion and olecranon; the leg (quadriceps) circumference, taken at the midpoint of the total length of the thigh; and the neck circumference, taken at the midpoint of the total length of the neck.
The correlation matrix of all the features is presented in Table 2 below.
The importance of each feature is assessed by performing a search over all possible combinations of the set $X_F$, resulting in $2^{12} = 4096$ possible subsets of features. For the empty subset, the error is computed with respect to the average of the population dataset. For each subset, four different feature designs are compared: $X_F$ (e.g. age, gender, height and weight), $X_G$ (the set of augmented features determined according to equations (8) and (9) above; e.g. if $X_F$ = {age, weight}, then $X_G$ = {age, age², age³, age·weight, age·weight², age²·weight, weight, weight², weight³}), $[X_F, X_Q]$ (e.g. age, gender, height and weight, plus the facial coefficients $X_Q$) and $[X_G, X_Q]$ ($X_G$ plus the facial coefficients $X_Q$). The maximum number of features reached by models without the face is $N_G$ minus all degenerate combinations of the gender. In this example, the maximum number of features is $N_F = 12$, and the number of combinations of gender of second order and above is $N_F + 1$. Hence equation (9) provides that the maximum number of regressors, when using interactions, is $N_G = 441$. Considering instead the example with age, gender, weight and height, where $N_F = 4$, $N_G = 29$.
In the following, the mean absolute vertex error $E$ (in mm) for the full body mask (without arms) is presented, along with the errors $E$ for all the remaining four body parts represented by the masks in the accompanying drawings.
Full body mask without arms—The errors $E$ for this mask are presented in the table in the accompanying drawings.
In order to evaluate the significance of adding the face shape, consider the model with $X = [X_G, X_Q]$, where $X_G$ is augmented from $X_F$ = [age, gender, weight, height]. This model has an error of 15.91 mm, which is better than the error of the model with the $N_F = 4$ best predictors without the face parameters. This indicates that the face shape can be used instead of detailed parametric measurements. Thus, for example, the face coefficients combined with the age, gender and weight features give a lower error than the prediction using the waist, hip circumference and leg length features.
Other body parts—The errors for the remaining body part masks are presented in the tables in the accompanying drawings.
For both the waist and hips masks, the best performing feature is the hip circumference, registering an error E of 12.59 mm for the waist surface area and 11.70 mm for the hip surface area. The lowest error reached using all features for the waist mask is 8.59 mm, whereas the hip mask achieved a minimum error of 8.00 mm.
For the breast mask, the results are shown in the accompanying drawings.
Finally, the errors registered for the leg mask are shown in the accompanying drawings.
Overall, the use of the face features improves the estimation most for the hips band, where the reduction for the best 12 models is 10.45% (0.99 mm). For the waist mask the average reduction is 9.71% (0.98 mm) and for the full body the drop is 8.12% (1.33 mm). Finally, the reduction for the legs is 7.32% (0.84 mm), and the face achieves the lowest reduction for the breasts area, with 7.14% (0.54 mm).
Estimating surface area and/or volume—This part of the description relates to the techniques for estimating the surface area and/or volume of a body part, or of the full body, that make use of the prediction model described above that predicts body shape coefficients based on the face shape space of the subject and one or more of the characteristics age, gender, weight and height.
The flow chart in the accompanying drawings illustrates an exemplary method of determining a prediction model for use in estimating the surface area and/or the volume of a body or a body part of a subject.
In a first step, step 101, a body parametric model (e.g. in the form of an avatar) is registered to a set of 3D body scans (e.g. contained in a population dataset that includes 3D body scans for a number of different test subjects) that includes the body part(s) of interest (e.g. the arms, legs, full body, etc.). Step 101 can be performed, for example, using the techniques described in the paper "The space of human body shapes: reconstruction and parameterization from range scans" mentioned above, or in "Exploring the space of human body shapes: Data-driven synthesis under anthropometric control" by Allen, B., Curless, B., and Popović, Z., SAE International Proc. Digital Human Modeling for Design and Engineering Conference 2004. The body parametric model relates values of one or more body shape parameters to a 3D representation of the full body (or alternatively to a specific body part or parts). The body parametric model can be a linear model. Step 101 corresponds to the 'Registration' section above.
In step 103, each registered avatar (i.e. the body parametric model adapted to a 3D scan of a particular test subject) is converted to a plurality of body shape parameters (e.g. 200 shape parameters). These body shape parameters can encode the 3D body surface with millimeter accuracy, e.g. corresponding to PCA coefficients, and thus there is a set of values of body shape parameters for each of the test subjects. Step 103 can be performed as described in “The space of human body shapes: reconstruction and parameterization from range scans” and “Exploring the space of human body shapes: Data-driven synthesis under anthropometric control”. Step 103 corresponds to the ‘Parametric spaces’ section above.
In step 105, a facial parametric model (e.g. in the form of an avatar for the face) is registered to a set of 3D face scans (e.g. that are in a population dataset). Step 105 can be performed, for example, using the techniques described in "The space of human body shapes: reconstruction and parameterization from range scans" and "Exploring the space of human body shapes: Data-driven synthesis under anthropometric control". The facial parametric model relates values of one or more facial shape parameters to a 3D representation of the face. The facial parametric model can be a linear model. Step 105 corresponds to the 'Registration' section above.
In step 107, each registered facial avatar (i.e. the facial parametric model adapted to a 3D scan of a particular test subject) is converted to a plurality of facial shape parameters (e.g. 180 facial shape parameters). These facial shape parameters can encode the 3D facial surface with millimeter accuracy, e.g. corresponding to PCA coefficients, and thus there is a set of values of facial shape parameters for each of the test subjects. Step 107 can be performed as described in “The space of human body shapes: reconstruction and parameterization from range scans” and “Exploring the space of human body shapes: Data-driven synthesis under anthropometric control”. Step 107 corresponds to the ‘Parametric spaces’ section above.
In step 109, a prediction model that is to be used to predict a shape representation of each subject (in the form of values for one or more body shape parameters) is built, trained or formed as a function of one or more characteristics and facial shape parameters. The trained or formed prediction model is to receive values of facial shape parameters for a subject and values of the one or more characteristics for the subject as input, and predict values of one or more body shape parameters for the subject (or, where the prediction model includes a body parametric model, predict the 3D representation of the full body or body part by inputting the predicted values of the body shape parameter(s) into the body parametric model). The prediction model is built, trained or formed from the sets of body shape parameters obtained in step 103, the sets of facial shape parameters obtained in step 107, and values of the one or more characteristics for the test subjects in the population dataset.
The prediction model can be a non-linear model. For example the prediction model can be a regression model, such as a cubic polynomial. Step 109 corresponds to the ‘Prediction model’ section above.
The one or more characteristics that the prediction model is a function of can be any one or more of age of the subject, gender of the subject, weight of the subject and height of the subject. In some embodiments, in addition to the one or more of age, gender, weight and height, the characteristics can include one or more other measurements of the body of the subject, such as leg length, leg (quadriceps) circumference, waist circumference, hips circumference, arm length, arm circumference, upper body height, and neck circumference (all as shown in the accompanying drawings).
In some embodiments, in step 109 a plurality of candidate prediction models can be formed, with each candidate prediction model using a respective subset of the one or more characteristics. For example, a first candidate prediction model can be formed that uses age, gender and height, a second candidate prediction model can be formed that uses gender and weight, a third candidate prediction model can be formed that just uses height, etc. The accuracy (i.e. error) of each of the candidate prediction models in predicting the 3D representation of the full body or a body part or body parts of interest is then evaluated, and the prediction model is formed as the candidate prediction model that provides the highest accuracy of the candidate prediction models, or a sufficient accuracy with a minimum number of characteristics. A candidate-model selection of this kind is sketched below.
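A minimal sketch of this candidate-model search; the fit and error callables are illustrative placeholders that would wrap the least-squares fitting and error-measure sketches given earlier.

    from itertools import combinations

    def select_prediction_model(characteristics, fit, error, sufficient_error=None):
        # Search all subsets of the characteristics for the best candidate model.
        # fit(subset) trains a candidate model on that subset of characteristics;
        # error(model) returns its mean absolute vertex error E (lower is better).
        candidates = []
        for k in range(len(characteristics) + 1):
            for subset in combinations(characteristics, k):
                model = fit(subset)
                candidates.append((error(model), len(subset), subset, model))
        if sufficient_error is not None:
            # smallest subset whose error is acceptable, ties broken by error
            ok = [c for c in candidates if c[0] <= sufficient_error]
            if ok:
                return min(ok, key=lambda c: (c[1], c[0]))
        return min(candidates, key=lambda c: c[0])   # highest accuracy overall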
The flow chart in the accompanying drawings illustrates an exemplary method of estimating the surface area and/or the volume of a body or a body part of a subject.
In a first step, step 111, at least one image of the subject is obtained. The at least one image should include (i.e. show) the face of the subject. In some embodiments a plurality of images can be obtained, or a video sequence of the subject can be obtained (where frames of the video sequence correspond to images). Preferably the at least one image was taken from the front of the subject (i.e. the image(s) show the face ‘front-on’). However, it is possible for a plurality of images to be obtained that show the face of the subject from different directions (e.g. an image that shows the left side of the face, an image that shows the right side, etc.). In step 111 the image(s) can be obtained from the imaging unit 6, i.e. the imaging unit 6 is used to capture the image(s) of the face of the subject, and the image(s) are provided to the processing unit 8. Alternatively, in step 111 the processing unit 8 can obtain the image(s) from the memory unit 10, for example if the image(s) were obtained previously by the imaging unit 6 or by another imaging unit/device. In some embodiments, the image(s) can be so-called ‘selfies’ that are obtained by the subject using the imaging unit 6, their smartphone, or other electronic device with a user-facing imaging unit. The one or more image(s) may be color images or monochrome images.
In step 113, the processing unit 8 processes the at least one image to determine values for one or more facial image parameters for the face of the subject. The values of the one or more facial image parameters represent the 3D geometry of the subject's face.
In this step, an image of the subject's face (which can be a single image or a frame in a video sequence) can be processed to extract a set of two-dimensional (2D) points that represent the facial geometry. The paper "Supervised Descent Method and its Applications to Face Alignment" by Xuehan Xiong and Fernando De la Torre (The Robotics Institute, Carnegie Mellon University, Pittsburgh, Pa.) provides an exemplary technique that can be used to extract 2D points representing facial geometry from an image of a face. These 2D points can then be used to compute a set of 3D points, to provide an avatar representing the face of the subject. The use of the 2D points in this way is described in WO 2017/085075.
Alternatively, in step 113, a video sequence including the subject's face can be processed to extract a set of 3D points that represent the facial geometry. These 3D points provide an avatar representing the face of the subject. This step can use the techniques described in "Dense 3D Face Alignment from 2D Videos in Real-Time" by László A. Jeni (Robotics Institute, Carnegie Mellon University, Pittsburgh, Pa., USA), Jeffrey F. Cohn (Robotics Institute and Department of Psychology, University of Pittsburgh, Pittsburgh, Pa., USA) and Takeo Kanade (Robotics Institute).
In step 115 (which can be performed before step 111, before 113, or after step 113), values for one or more characteristics of the subject are determined. The one or more characteristics comprises one or more of age, weight, height and gender of the subject. Step 115 can comprise the subject or another person manually entering the value(s) for the one or more characteristics into the apparatus 4, for example using the user interface 14. For example, the subject or another person can manually enter any one or more of the age, weight, height and gender of the subject. Alternatively, step 115 can comprise obtaining or retrieving the value(s) for the one or more characteristics from memory unit 10, for example if the value(s) have previously been stored in the memory unit 10, or from another database, e.g. health records for the subject. Alternatively, step 115 can comprise determining the value(s) for the one or more characteristics by analyzing the image(s) obtained in step 111. For example, the image(s) can be processed to estimate the age of the subject, estimate the weight of the subject (although it will be appreciated that the image(s) should preferably show the subject's face and at least part of their body), estimate the gender of the subject and estimate the height of the subject (although it will be appreciated that the image(s) should show the subject's body). Those skilled in the art of image processing techniques will be aware of suitable techniques that can be used to process image(s) or a video sequence to determine any of age, gender, weight and height. It will be appreciated that in some embodiments where values for a plurality of characteristics are required, a combination of the above embodiments can be used, e.g. a value for one or more of the characteristics (e.g. height) can be input by the subject or another user, a value for another one of the characteristics (e.g. age, or a date of birth) can be stored in the memory unit 10, and/or a value for another one of the characteristics (e.g. gender) can be determined from the image(s) or video sequence.
The specific characteristics for which values are determined in step 115 can depend on the requirements of the prediction model that is to be used in subsequent steps to predict the body shape parameters. For example, in some implementations the prediction model may require values for all four characteristics, whereas in other implementations the prediction model may require values of two specific characteristics, e.g. height and age. As noted further below, in some embodiments the characteristic(s) used by the prediction model can depend on the body part(s) for which the surface area and/or volume is to be estimated.
In step 117, a facial parametric model and the values for the one or more facial image parameters extracted from the image(s) in step 113 are used to determine values for one or more facial shape parameters for the face of the subject. The facial parametric model relates specific values for one or more facial image parameters to a respective 3D representation of a face having respective values for the one or more facial shape parameters. The facial shape parameters can be, for example, PCA coefficients. The facial parametric model can be a linear model, as described above. It should be noted that some types of facial image parameters can be considered as facial shape parameters, and the facial parametric model may not be required to relate all values of facial image parameters to facial shape parameters in step 117.
In particular embodiments of step 117, the facial parametric model is registered to the image(s) of the face of the subject as represented by the one or more facial image parameters determined in step 113, and the registered facial parametric model is converted into the one or more facial shape parameters.
In step 119, a 3D representation of the full body of the subject is predicted using the prediction model (e.g. as determined in step 109), the values for the one or more characteristics determined in step 115 and the values of the one or more facial shape parameters determined in step 117. The 3D representation of the full body predicted in step 119 indicates the surface shape of the full body.
As noted above, in some embodiments the prediction model uses values of facial shape parameters for a subject and values of the one or more characteristics for the subject as inputs, and predicts values of one or more body shape parameters for the subject. In these embodiments, the values for the one or more body shape parameters are used to predict a 3D representation of the full body using a body parametric model (e.g. as described above with reference to step 101/103) that relates a 3D representation to values of body shape parameters.
In alternative embodiments, the prediction model can also include a body parametric model, in which case the prediction model determines the 3D representation of the full body by inputting the predicted values of the body shape parameter(s) into the body parametric model.
In step 121, the predicted 3D representation of the full body of the subject is analyzed to estimate the surface area and/or volume of the body or body part of the subject (as required). In particular, the surface of the 3D representation will be made up of a (large) plurality of polygons (e.g. triangles obtained by the triangulation of the average vertex positions in the parametric body model), and the surface area of the body or body part can be estimated by summing the areas of the polygons making up that body or body part. Likewise, the 3D representation can be made up of a large plurality of voxels (e.g. tetrahedrons obtained by the tetrahedralisation of the average vertex positions in the parametric body model), and the volume of the body or body part can be estimated by summing the volumes of the voxels making up that body or body part.
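For a closed triangulated mesh, both quantities reduce to simple sums, as in this minimal NumPy sketch; the signed-tetrahedron form of the volume sum is a standard equivalent of the voxel summation described above, not a formula taken from this disclosure.

    import numpy as np

    def mesh_surface_area(vertices, triangles):
        # sum of triangle areas; triangles is an (M, 3) array of vertex indices
        a, b, c = (vertices[triangles[:, k]] for k in range(3))
        return 0.5 * np.linalg.norm(np.cross(b - a, c - a), axis=1).sum()

    def mesh_volume(vertices, triangles):
        # volume of a closed mesh via signed tetrahedra against the origin
        a, b, c = (vertices[triangles[:, k]] for k in range(3))
        return abs(np.einsum('ij,ij->i', a, np.cross(b, c)).sum()) / 6.0

    # a body-part estimate simply restricts the sums to that part's polygons,
    # e.g. mesh_surface_area(vertices, triangles[leg_mask])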
Once determined, the estimated surface area and/or volume of the body or body part can be used in one or more of a number of different ways.
In some embodiments, the estimated surface area and/or volume can be output to the subject or another user via the user interface 14. For example the estimated surface area and/or volume can be displayed on a display screen, output as audio via a loudspeaker, etc.
In some embodiments, the estimated surface area and/or volume can be communicated from the apparatus 4 to another electronic device (e.g. via the interface circuitry 12) where it can be stored (e.g. in a health record database) or used by the electronic device.
In some embodiments, the estimated surface area and/or volume can be used or applied in a particular field, such as healthcare (including personal care), online shopping and the textile industry. In the latter two examples, the estimated surface area and/or volume can be used to select items of clothing that are a suitable size for the subject.
In the healthcare domain, the estimated surface area can be used to assess the Psoriasis Area and Severity Index (PASI) in a subject with Psoriasis. The estimated surface area and/or volume can also or alternatively be used to determine a drug dosage, such as a chemotherapy dosage.
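For example, BSA-based dosing typically prescribes a dose per square meter of body surface, so the estimated BSA scales the dose directly; a trivial sketch with purely illustrative values:

    def bsa_dose_mg(estimated_bsa_m2, dose_mg_per_m2):
        """Dose scaled by the estimated body surface area."""
        return estimated_bsa_m2 * dose_mg_per_m2

    # e.g. a 100 mg/m2 regimen and an estimated BSA of 1.8 m2 give a 180 mg dose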
In the personal care domain, a device can be used to perform a treatment operation on the subject. Many types of device are available that can be used on a body of a subject to provide a treatment operation to the body or a personal care operation on the body. For example, there are devices for the removal of unwanted hairs using various techniques such as shaving, electrolysis, plucking, laser and light therapies (known as photoepilation) and injection of therapeutic anti-androgens. Other types of dermatological treatments, including hair growth reduction and treating acne, can also use light-based technologies. Devices can also be used for providing a massage to the subject, for providing a nail treatment, for providing physiotherapy, for applying patches to the subject (e.g. electrocardiogram electrodes, etc.). Another type of device is an ultrasound probe.
For any of these types of device or treatment operation, it may be useful to know the surface area and/or volume of the body or relevant body part, as this can be used to provide feedback on a treatment operation by a treatment device. Thus, a method of providing feedback is provided in which the surface area and/or volume of the full body or body part of interest is determined using the above method(s), and the estimated surface area and/or volume is used to determine feedback on the treatment operation. For example, knowledge of the surface area of a leg would enable the computation of the average number of light pulses/flashes that is required for epilating the whole leg. This is particularly useful for applications where it is difficult to monitor which areas of the skin have been treated (e.g. in photoepilation the treated areas are invisible to the human eye), and the information can be used to provide the subject or user with (real-time) feedback on the treatment progress. Thus, in some embodiments the processing unit 8 determines a number of treatments (e.g. light pulses/flashes) required to treat a particular body part (or the whole body) based on the estimated surface area and the surface area treated per treatment (e.g. area treated per pulse/flash). The number of treatments may also take into account an amount of overlap between treatments (e.g. the area of one treatment may overlap with a previous treatment area by a certain amount, e.g. 20%). In these embodiments, the processing unit 8 can use the determined number of treatments required for treating the body part (or whole body), along with information on the number of treatments already administered, to indicate to the subject or user how many treatments (e.g. flashes/pulses) are required to complete the treatment operation. The processing unit 8 can present this information to the subject or user in any desired form, e.g. a number of treatments remaining, a percentage completion, etc. This information, including the information about the number of treatments performed and/or the amount of the body part/body treated, can be stored in a treatment record for the subject.
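The feedback computation described above could, for example, take the following form (a sketch with illustrative names; the 20% overlap is just the example figure mentioned above):

    import math

    def flashes_required(area_cm2, area_per_flash_cm2, overlap_fraction=0.2):
        """Number of flashes needed to cover the estimated area, where each
        flash overlaps the previously treated area by overlap_fraction."""
        net_area_per_flash = area_per_flash_cm2 * (1.0 - overlap_fraction)
        return math.ceil(area_cm2 / net_area_per_flash)

    def treatment_progress(flashes_done, flashes_needed):
        """Feedback values: flashes remaining and percentage completion."""
        remaining = max(flashes_needed - flashes_done, 0)
        percent_complete = min(100.0 * flashes_done / flashes_needed, 100.0)
        return remaining, percent_complete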
In addition to feedback on the treatment progress, so-called ‘smart scheduling’ can be used, in which the surface area of specific body parts is known and the subject or other user of the treatment device is guided through a certain treatment scheme based on factors such as the available treatment time (for instance set by the subject or the user) and/or the charge level of the flash device.
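One possible (greedy, purely illustrative) realization of such scheduling, reusing flashes_required from the sketch above and treating the device charge as a budget of flashes:

    def schedule_parts(part_areas_cm2, area_per_flash_cm2, flashes_available):
        """Select body parts, in the user's priority order, that fit within
        the remaining flash budget (e.g. derived from the device charge)."""
        plan, budget = [], flashes_available
        for part, area in part_areas_cm2.items():
            needed = flashes_required(area, area_per_flash_cm2)
            if needed <= budget:
                plan.append((part, needed))
                budget -= needed
        return plan

    # e.g. schedule_parts({"left_leg": 2000.0, "right_leg": 2000.0}, 4.0, 700)
    # schedules only the left leg (625 flashes) within a 700-flash budget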
In some embodiments, the method of estimating the surface area and/or volume can be adapted depending on the body part(s) for which the surface area and/or volume is to be estimated. As noted above with respect to the performance measures for the different combinations of characteristics, the accuracy of the predicted 3D representation of a given body part depends on which characteristics are used as inputs to the prediction model, and so the characteristics determined in step 115 can be selected accordingly.
For example, if it is desired to only use two characteristics in step 115 (and subsequent steps), then if the surface area and/or volume of the legs or full body (excluding arms) is to be estimated, step 115 may comprise determining the weight and height, as these provide the most accurate two-characteristic result for those two masks. If, instead, the surface area and/or volume of the hips, waist or breasts is to be estimated, then step 115 may comprise determining the gender and weight. As another example, it may be determined that a particular combination of characteristics is sub-optimal compared to another combination, in which case it may be better to use that other combination instead. For instance, for the hips, weight alone provides a more accurate 3D representation than age, gender and height combined, in which case it is preferable to use weight as the characteristic for that prediction model rather than the other three characteristics, or at least to make sure that weight is included in the combination of characteristics.
Therefore, depending on the part of the body of the subject that is to be evaluated, it is possible to use a selected one or more of the characteristics from the set of age, gender, weight and height in combination with the facial shape parameters to achieve an output of sufficient accuracy, or to minimize the amount of information that needs to be manually input, retrieved from another information source and/or determined from the image(s).
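This selection can be as simple as a lookup from the requested body part (mask) to the characteristics to determine in step 115. The mapping below merely encodes the examples given above and is illustrative; in practice it would be derived from the measured performance of the trained prediction models:

    BEST_CHARACTERISTICS = {
        "legs":                ("weight", "height"),
        "full_body_excl_arms": ("weight", "height"),
        "hips":                ("gender", "weight"),
        "waist":               ("gender", "weight"),
        "breasts":             ("gender", "weight"),
    }

    def characteristics_for(body_part):
        """Characteristics to request for the given part, falling back to
        all four characteristics when no specific subset is known."""
        return BEST_CHARACTERISTICS.get(
            body_part, ("age", "gender", "weight", "height"))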
As noted above, in some embodiments the user or subject can dynamically select the part or parts of the body for which the surface area and/or volume is to be estimated, e.g. by highlighting or selecting one or more portions or parts of a body on a user interface 14. In this case an ‘optimum’ set of input characteristics to use for estimating the 3D representation of that/those body part(s) may not be known in advance. Therefore, in these embodiments, before proceeding with the rest of the method, the combination of characteristics that provides the most accurate 3D representation for the selected body part(s) can first be identified, e.g. from the performance measures for the different combinations of characteristics, and values for that combination of characteristics can then be determined in step 115.
In some embodiments, the accuracy of the facial shape parameter(s) determined in step 117 can be improved by also using the values of the one or more characteristics determined in step 115 to determine the values of the one or more facial shape parameters. In these embodiments, a facial prediction model can be determined (e.g. in training step 107) that relates values of the one or more facial image parameters and values of the one or more characteristics to values of the one or more facial shape parameters, and this facial prediction model can then be used in step 117.
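Under the linear-model assumption used in the earlier sketches, this variant simply augments the input of the fit; F stands in for a learned facial prediction model, and the names are again illustrative:

    import numpy as np

    def predict_facial_shape_params(facial_image_params, characteristics, F):
        """Facial prediction model variant: facial shape parameters predicted
        from both the facial image parameters and the characteristics."""
        x = np.concatenate([facial_image_params, characteristics])
        return F @ x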
Therefore, there are provided improved techniques for estimating the surface area and/or volume of a body part or body of a subject without requiring a 3D body scan or 3D body part scan of the subject. Instead, the disclosed techniques require only an image of the face of the subject and limited (but easily obtainable) information on the subject, such as age, gender, weight and/or height.
Variations to the disclosed embodiments can be understood and effected by those skilled in the art in practicing the principles and techniques described herein, from a study of the drawings, the disclosure and the appended claims. In the claims, the word “comprising” does not exclude other elements or steps, and the indefinite article “a” or “an” does not exclude a plurality. A single processor or other unit may fulfil the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage. A computer program may be stored or distributed on a suitable medium, such as an optical storage medium or a solid-state medium supplied together with or as part of other hardware, but may also be distributed in other forms, such as via the Internet or other wired or wireless telecommunication systems. Any reference signs in the claims should not be construed as limiting the scope.
Priority: Application No. 19176080.0, filed May 2019, EP (regional).
International filing: PCT/EP2020/064044, filed 5/20/2021 (WO).