INTRODUCTION
Obesity is defined as an excess of total body fat and is a risk factor for chronic diseases like type-2 diabetes, cancer and cardiovascular diseases, which are the leading causes of death in adults worldwide 1. Childhood obesity is one of the most relevant problems of public health in this century, growing at alarming rates, especially in low and middle-income countries 2. Mean body mass index (BMI) and the prevalence of obesity have increased worldwide in children and adolescents from 1975 to 2016, with a global increase of 0.32 kg/m2 per decade in girls and 0.40 kg/m2 in boys, from an age-standardized mean BMI of 17.2 kg/m2 in girls and 16.8 kg/m2 in boys to 18.6 kg/m2 and 18.5 kg/m2 in 2016, respectively 3. Obesity rates in children and adolescents increased from less than 1% in 1975 to almost 6% in girls and 8% in boys in 2016 3.
More than 20% (approximately 42.5 million) of Latin American children aged 0 to 19 years are overweight or obese. Among children under five years old, excess weight and obesity are widespread, particularly in the Southern Cone and in Mexico, reaching figures of around 10%. In older children, the prevalence of excess weight and obesity exceed 25% of the population in some countries, like Chile, Ecuador and Mexico 4. It is important to consider that children with obesity are prone to obesity in adulthood and more likely to develop non-communicable diseases at younger ages 5.
There are many techniques to determine excess body fat (dual-energy X-ray absorptiometry, underwater weighing, air displacement plethysmography, magnetic resonance imaging and computed tomography) 6. Anthropometry is a relatively quick, simple, and inexpensive method of nutritional assessment that can provide information about fat mass and fat free mass composition 7.
Body weight and height are used to calculate BMI, the most common index used to identify obesity in children 8. Both weight and height are easy measurable parameters. However, in growing children, age and sex need to be considered along with BMI 9. BMI provides information about total body mass; however, it does not give information about body compartments. To overcome the limitations of BMI in clinical practice, other measurements could be taken 11. Waist circumference (WC) is a good indicator to assess abdominal obesity 12 and it is an important measurement to be taken, as cardio-metabolic risk factors are more prevalent in children and adolescents with abdominal obesity than those with excess weight or general obesity 13. The WC measurement has been difficult to standardize worldwide because there is no consensus on the location of the measurement. There are different protocols to measure WC: the World Health Organization (WHO) recommends to measure it in the midpoint between the top of the iliac crest and the lower margin of the last palpable rib in the mid axillary line 14, the Centers for Disease Control and prevention (CDC) recommend to measure it at the uppermost lateral border of the hip crest (ilium) 15 and certain studies have measured it at the umbilical level 16. In subjects with excess abdominal fat, this measurement becomes difficult at any location and can lead to errors. Skinfold thickness provides information about subcutaneous fat. The measurements are difficult to standardize because the size of skinfolds may vary according to the duration and level of compression during the measurement, and the level of tissue hydration 17. Obesity may influence the reliability of skinfold measurements, especially in cases in which skinfold size approaches the upper limit of the measurement range of the caliper 18. In their review of anthropometric measurement error, Ulijaszek and Kerr found that skinfold measurements were the most imprecise 10.
The accuracy and usefulness of the anthropometric method greatly rely on the quality of the measurements. Reliability assessment is a direct indicator of data quality 19. When anthropometric mesurements are taken, the measurements may vary due to different personal skills or inconsistencies in adherence to study protocols among fieldworkers 20. However, variations due to measurement error, technical error or personal error can be controlled to some extent at the time anthropometrical studies are performed 21.
The most common method to estimate error when taking anthropometric measurements is through the technical error of measurement (TEM), which is an accurate index that estimates the quality of the measurements taken. From an evaluation viewpoint, a low variability between repeated measurements on the same subject by one (intra-observer differences) or by two or more (inter-observer differences) observers is considered as high reliability 20.
There is information on anthropometric measurement error in epidemiological studies in children and adolescents, but mainly in North America and Europe. However, this information is not usually reported in studies from South America.
Therefore, the aim of this study was to describe the intra-observer reliability of ten anthropometric measurements (weight, height; neck, arm, waist and hip circumferences; and triceps, subscapular, bicep and suprailiac skinfolds) in the South American Youth/Child Cardiovascular and Environmental (SAYCARE) feasibility study, in children and adolescents.
MATERIAL AND METHODs
SAYCARE is an observational multicenter, feasibility study that has been designed to develop valid and reliable indicators to investigate health-related behavior and nutritional status in children and adolescents. The study was carried out in seven centers from six countries of South America: Buenos Aires (Argentina), Lima (Peru), Medellin (Colombia), Montevideo (Uruguay), Santiago (Chile), and São Paulo and Teresina (Brazil). The first goal of the SAYCARE study was to assess the feasibility and reliability of the questionnaires and the anthropometric measurements. The fieldwork was carried out during the academic years of 2015 and 2016, and different number of anthropometrists participated in each center (Medellin 2, Santiago 2, Lima 3, Montevideo 4, Buenos Aires 4, Teresina 4 and San Pablo 5)
The subjects were selected in each city, stratified by age into two groups: children (3-10 years) and adolescents (11-18 years), and by type of school into two groups: public and private. Each sex was represented by 50% of participants. The sample size was calculated based on the experience of other multicenter studies in which feasibility pilot studies were previously conducted as well as the reliability and validity of the methods were evaluated 11,12,19. This sample size was increased by 20% in order to take potential drop-outs into account. The target sample size was 240 children and 240 adolescents from each city 22. Data from Buenos Aires was not included in this anthropometric analysis because only a single measurement was available.
The project followed the ethical standards of the Declaration of Helsinki and Good Clinical Practice recommendations. The study was approved by the Ethics Committee of each participating center. The volunteer and/or guardian provided fully-informed written consent to participate.
STANDARDIZATION PROCESS
Prior to the study, fieldworkers from the seven centers participated in central training that took place in March 2015 at the Federal University of Piauí in Teresina (Brazil). The aim of the training was to ensure the application of standardized procedures to measure weight, height, circumferences (neck, arm, waist and hip) and skinfolds (triceps, subscapular, biceps and suprailiac). Theoretical and practical training was implemented, and thereafter an assessment of intra- and inter-observer reliability was performed. As described below, intra-observer (twice) and inter-observer (once) reliability was assessed.
ANTHROPOMETRIC MEASUREMENTS
The guidelines for each measurement (according to the reference manual of anthropometric standardization of the WHO) 22 were laid down in the standard operation procedures manual and were available to all staff members. Anthropometric measurements were taken at the right side of the body two times, non-consecutively. In case of an error of 5% or higher between the first and the second measurement, a third measurement was taken. All measurements were taken in underwear or with as few clothes as possible and without shoes, as follows 14,23,24.
Weight was measured to the nearest 0.1 kg by using a digital scale (WISO W801, Barreiro, Brazil). Height was measured by using a portable stadiometer (Cardiomed WSC, Paraná, Brazil) to the nearest 0.1 cm, with the subject barefoot and head in the Frankfort plane.
The arm circumference was measured with the subject standing relaxed with his or her side to the observer, and the arm hanging freely to the side; the tape was passed around the arm at the level of the midpoint of the upper arm (the midpoint between the posterior border of the acromion process to the tip of the olecranon process). The waist circumference was measured midway between the lowest rib margin and the iliac crest, feet positioned close together, at the end of a gentle expiration. The hip circumference measurement was taken at the point yielding the maximum circumference over the buttocks, feet positioned close together, with the tape held on a horizontal plane. The neck circumference measurement was taken with the tape placed around the neck above the larynx and perpendicular to the axis of the neck. All the circumferences were measured to the nearest 0.1 cm using a flexible, non-elastic tape (Cardiomed WSC, Paraná, Brazil). The subject was in a standing position. Skinfold thicknesses were measured to the nearest 1mm by using a caliper (Lange, CA, USA) at the following sites: triceps, halfway between the acromion process and the olecranon process; biceps, at the same level as the triceps skinfold and directly above the center of the cubital fossa; subscapular, about 20 mm below the tip of the scapula and 45 degrees to the lateral side of the body; suprailiac, about 20 mm above the iliac crest and 20 mm toward the medial line.
The WHO cut-off points were used to characterize the population according to their nutritional status 25,26.
STATISTICAL ANALYSIS
The technical error of measurement (TEM) was calculated as the square root of the measurement error variance. TEM is the most commonly-used measure of precision and was calculated with the following formula:
where "d" is the difference between measurements and "N" is the number of individuals measured. The lower the TEM obtained, the better the precision of the appraiser to take the measurement 10.
The coefficient of reliability ® is another approach to obtain comparability of anthropometric measurement error in population studies. It shows the proportion of between-subject variance free from measurement error. In this case, we used R as a percentage (R%), which was calculated by the following formula:
where SD2 is the total intra-subject variance for the study, including measurement error. An R% value of 100% means that there was no measurement error. The higher the R% values, the greater the measurement precision. An R% above 95% means that the variance was due to factors not related to measurement error. Where R is around 99%, such error is unlikely 10.
All statistical analyses were performed by using the Predictive Analytic Software (PASW 18.0 for Windows; PASW Chicago, IL).
RESULTS
Anthropometry was the part of the SAYCARE study that presented the highest compliance (728 children and 391 adolescents had double anthropometry versus 237 and 258, respectively, who completed all the questionnaires twice). Table I summarizes the characteristics of the population with anthropometric data.
The intra-observer TEM and reliability for each anthropometric measurement, both in children and adolescents from the six cities of South America, are shown in Table II and Table III, respectively.
BMI: body mass index; WC: waist circumference.
*Height/age < -2 SD (95% CI).
†BMI/Age > +2 SD up to five years old and BMI/age > +1 SD above five years old (95% CI).
The TEM for weight ranged between 0 kg (Medellin) and 0.2 kg (San Pablo) in children and from a minimum of 0.03 kg (Santiago) and a maximum of 0.89 (Lima) in adolescents. The TEM for height ranged between 0.03 cm (Lima) and 0.93 cm (San Pablo) in children and 0.05 cm (Santiago) and 1.25 cm (Lima) in adolescents. Reliability was above 99% in both measurements, except for adolescent's height from Lima, which was 98.6%.
The error in circumferences ranged between 0.03 cm (neck, waist, hip) and 1.8 cm (hip) in children and between 0.05 cm (neck) and 0.64 cm (waist) in adolescents. Reliability in children´s circumferences was above 99% for all measurements, except for the arm circumference, which was 98.4% and 98.8% in Teresina and Montevideo respectively, and for the neck circumference, which was 96.6% in Montevideo, 96.8% in Teresina and 98.5% in Santiago. In adolescents' circumference measurements, reliability was above 99% for almost all of them. The exceptions were the arm circumference, which was 97.9% in Teresina, 98.3% in Montevideo and 98.7% in Medellin, and the neck circumference, which was 98.8% in Montevideo.
TEM in skinfold measurement in children ranged between 0.07 mm (subscapular) and 1.40 mm (triceps) and in adolescents between 0.08 mm (triceps and suprailiac) and 2.13 mm (triceps). Reliability for skinfolds was above 95% in all skinfolds, except for the biceps in children from San Pablo (88.5%) and Medellin (88.9%) and triceps in children (93%) and adolescents (89%) from Medellin.
DISCUSSION
The SAYCARE feasibility study was carried out to determine the reliability of several indicators that will be used in the final complete study to investigate risk factors for cardiovascular diseases in children and adolescents from seven cities of South America. Anthropometric measurements are among these indicators and they are very useful in population studies to determine the presence of obesity or body composition abnormalities. As excess weight and obesity are risk factors for cardiovascular diseases 12, it is very important to establish 17 the reliability of body fat and fat distribution measurements. The most important result of our study is that the field work researchers obtained a very good intra-observer agreement.
In this study, for the vast majority of measurements and cities, R values were higher than 95%, in most measurements reaching values above 98%. The lowest R values were obtained for the triceps skinfold, both in children (95.6%) and adolescents (97.9%), although they are above the optimal values in both cases. Skinfold thicknesses are particularly difficult to measure, as this involves separating the subcutaneous adipose tissue from the underlying muscle, something that becomes even more difficult in overweight individuals 27. Skinfold thicknesses can be affected by compression during measurement but also by the biological characteristics of the individual being measured and anatomical differences in compressibility that vary with age, gender and recent weight loss 17.
Reliability was higher for circumferences than for skinfolds, both in children and adolescents. Analyzing the reliability of both measurements in children and adolescents, for circumferences, reliability was higher in adolescents than in children, and for skinfolds, the opposite occurs. The TEM for the circumferences measured was below 0.5 cm (except for children's hip that was 0.99 cm). The TEM for the skinfolds was below 1 mm, except for children's tricipital skinfold, which was 1.09 mm.
The measurement of error in anthropometry is not always analyzed in epidemiological studies. In a review by Ulijaszek and Kerr 10, results were compiled on intra-observer reliability; for circumferences it ranged from 85 to 95% and for the skinfolds from 89 to 97%. Weight and height had a reliability of 98%.
There are few studies measuring anthropometry and involving several countries in Latin America, and even less that have studied the reliability of the measurements taken. One of them is the Hispanic Health and Nutrition Examination Survey (HHANES 1982-1984) 28, which collected data to determine growth patterns for Hispanic children living in the United States. They determined the intra-observer reliability of some anthropometric measurements in a sample of children aged above 12 years. Considering the reliability of the common measurements taken in both studies (in the case of SAYCARE, taking into account the reliability of adolescents): height, arm circumference, and tricipital, subscapular and suprailiac skinfolds, the reliability was higher in most of the SAYCARE measurements: in arm circumference, it was 1.9% higher (99.7% vs 97.8%), in the subscapular skinfold it was 3.4% higher (98.8% vs 95.4%), and in the suprailiac skinfold it was 10.5% higher (99.7% vs 89.2%). The tricipital skinfold reliability was the same in both cases (97.9%) and for height the SAYCARE study reliability was 0.5% lower (99.5% vs 100%).
In Europe, there have been more recent epidemiological studies that analyzed the measurement errors in anthropometry in children and adolescents: Toybox (infants, six countries) 20, IDEFICS (infants and children, six countries) 18 and HELENA (adolescents, ten cities) 17. Height and weight reliability in SAYCARE children were very similar to Toybox and IDEFICS studies (above 99%), but for waist circumference, the reliability was better in SAYCARE children (99.8%) than in Toybox (96.7%). Toybox had a second training session for waist circumference measurement due to low reliability in the first. In the second training session, reliability was better but still lower than our study or the IDEFICS study. On comparing reliability of the other circumference measurements, we found that, for SAYCARE children, it was better than for IDEFICS children, except for arm circumference, which was lower (98.3% vs 99.3%). Something similar occurs with skinfolds reliability, which was better in SAYCARE children than in IDEFICS children, except for the tricipital skinfold that was lower in our study (95.6% vs 99.4%). Reliability in adolescent measurements was better in our study than in the HELENA study, both for circumferences (> 99% vs > 97%) and for skinfold thicknesses (> 97% vs > 95%). In all the previous studies, reliability results were optimal or at least acceptable (> 90%) for epidemiological studies in children and adolescents.
In summary, the intra-observer reliability of the anthropometric measurements of our study can be considered as very good. However, due to involuntary difficulties, we could not determine the margin of inter-observer error in this feasibility study, but it will be determined prior to the complete final study.
CONCLUSIONS
We can conclude that the training workshop carried out with the fieldworkers before taking the measurements in the respective countries gave good results in the subsequent feasibility study. The reliability of the measurements taken in our study was in most cases the same or higher than the reliability shown in European or American studies.