Identification of predictive factors of the degree of adherence to the Mediterranean diet through machine-learning techniques


Alba Arceo-Vilas1,*, Carlos Fernandez-Lozano2,3,*, Salvador Pita1,†,
Sonia Pértega-Díaz1 and Alejandro Pazos2,3

1 Clinical Epidemiology and Biostatistics Research Group, Instituto de Investigación Biomédica de
A Coruña (INIBIC), Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS,
Universidade da Coruña, A Coruña, Spain

2 Department of Computer Science and Information Technologies, Faculty of Computer Science,
CITIC-Research Center of Information and Communication Technologies, Universidade da
Coruña, A Coruña, Spain

3 Grupo de Redes de Neuronas Artificiales y Sistemas Adaptativos. Imagen Médica y Diagnóstico
Radiológico (RNASA-IMEDIR). Instituto de Investigación Biomédica de A Coruña (INIBIC).
Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS, Universidade da
Coruña, A Coruña, Spain

* These authors contributed equally to this work.
† Deceased author.

ABSTRACT
Food consumption patterns have changed in recent years, resulting in
serious health problems. Studies based on the evaluation of nutritional
status have determined that the adoption of a food pattern based primarily on a
Mediterranean diet (MD) has a preventive role, as well as the ability to mitigate the
negative effects of certain pathologies. A group of more than 500 adults aged
over 40 years from our cohort in Northwestern Spain was surveyed. Under our
experimental design, 10 experiments were run with four different machine-learning
algorithms and the predictive factors most relevant to adherence to the MD were
identified. A feature selection approach was explored and, under a null hypothesis
test, it was concluded that only 16 measures were of relevance, suggesting the
strength of this observational study. Our findings indicate that the following factors
have the highest predictive value in terms of the degree of adherence to the MD: basal
metabolic rate, mini nutritional assessment questionnaire total score, weight, height,
bone density, waist-hip ratio, smoking habits, age, EDI-OD, circumference of the
arm, activity metabolism, subscapular skinfold, subscapular circumference in cm,
circumference of the waist, circumference of the calf and brachial area.

Subjects Bioinformatics, Artificial Intelligence, Data Mining and Machine Learning
Keywords Feature selection, Nutritional status, Machine learning, Mediterranean diet,
Support vector machines, Nutrition disorders

INTRODUCTION
Economic development, urbanisation and industrialisation worldwide have changed
individuals’ eating habits and lifestyles (smoking, excessive consumption of
alcohol, sedentary lifestyle and stress), leading to a nutritional transition whose principal
cost for the health sector is the appearance of non-transmissible chronic diseases.

How to cite this article Arceo-Vilas A, Fernandez-Lozano C, Pita S, Pértega-Díaz S, Pazos A. 2020. Identification of predictive factors of
the degree of adherence to the Mediterranean diet through machine-learning techniques. PeerJ Comput. Sci. 6:e287 DOI 10.7717/peerj-cs.287

Submitted 3 July 2019
Accepted 6 July 2020
Published 27 July 2020

Corresponding author
Carlos Fernandez-Lozano,
carlos.fernandez@udc.es

Academic editor
Yu-Dong Zhang

Additional Information and
Declarations can be found on
page 13

DOI 10.7717/peerj-cs.287

Copyright
2020 Arceo-Vilas et al.

Distributed under
Creative Commons CC-BY 4.0


A consequence of the alteration of dietary patterns is what has been called ‘epidemic
obesity’, defined by the World Health Organisation (WHO) as the first non-viral epidemic
of the 21st century, with 500 million obese people worldwide (Finucane et al., 2011;
Krzysztoszek, Wierzejska & Zielińska, 2015) and affecting more than 50% of the adult
population in Spain (López-Sobaler et al., 2016; Anta et al., 2013; Rodriguez Rodriguez
et al., 2011).

The assessment of the nutritional status of a population is one of the best indicators of
its health status. The methodology must include three
important aspects: a global assessment, a study of the dimension and a study of body
composition (Ravasco, Anderson & Mardones, 2010). With adequate interpretation of the
findings, appropriate therapeutic measures should be taken to correct deviations from
normality.

In the context of nutrition and public health, the Mediterranean diet (MD) has been
forged over the centuries. It is characterised by cereals, olive oil, a low intake of saturated fats and
meat, moderate consumption of dairy and a regular and moderate intake of wine,
and it is a lifestyle in accordance with the geographic, climatological, orographic, cultural and
environmental conditions of the countries and regions that surround the
Mediterranean Sea (Pérez & Aranceta, 2011).

There is an increasing interest in the study of the preventive role of MD and also as a
treatment for various pathologies associated with chronic inflammation, such as metabolic
syndrome, diabetes mellitus, cardiovascular disease (CVD), neurodegenerative diseases,
breast cancer and psycho-organic deterioration, leading to greater longevity and better
quality of life (Dussaillant et al., 2016; Chrysohoou et al., 2004; Trichopoulou, 2004;
Serra-Majem, Roman & Estruch, 2006; Estruch et al., 2013; Sofi et al., 2014; Della Camera
et al., 2017). Moreover, MD has also been identified as a potential
element contributing to the prevention of breast cancer (Shapira, 2017) or in patients
carrying the BRCA mutation (Bruno et al., 2017). In 2010, UNESCO declared this diet an
Intangible Cultural Heritage of Humanity (UNESCO, 2010).

Numerous studies have been published over the past decades, showing the relationship
between MD intake and CVD (Martínez-González et al., 2015; Widmer et al., 2015),
and meta-analyses that relate it to general health status (Sofi et al., 2014). In the Greek
cohort EPIC (European Prospective Investigation into Cancer and Nutrition Study) a
2-point increase in adherence to this diet was associated with a 33% reduction in CVD
mortality (Sofi et al., 2014). Additionally, the analysis of a sub-cohort of 2,700 individuals
over 60 years old with a history of myocardial infarction showed that a greater adherence
to MD was associated with an 18% drop in overall mortality (Lack et al., 2003). Other studies have
confirmed these associations, including the follow-up of a Spanish cohort of 13,600 adults
with coronary heart disease. After 5 years, it was observed that two points of increase in
adherence to MD were associated with a 26% decrease in coronary risk (Trichopoulou
et al., 2007).

Eating disorders are linked to a distorted perception of one’s own body image, as well as
to body dissatisfaction. The importance of a study on body dissatisfaction is due to the fact
that recent investigations have confirmed that alterations in body image have a causal

Arceo-Vilas et al. (2020), PeerJ Comput. Sci., DOI 10.7717/peerj-cs.287 2/21

participation in an eating disorder, rather than being secondary to it (Míguez Bernárdez
et al., 2011). Body image is considered a qualitative approximation to the nutritional status
of the individual (Sámano et al., 2015) and can be determining for their nutritional
management (Martínez-González et al., 2011).

Biomedicine has been one of the main fields of application of Machine-Learning (ML)
techniques since their origins, with previously published studies in related areas
such as biomedical imaging (Fernandez-Lozano et al., 2016b), characterisation of different
types of carcinomas (Kim et al., 2017), measurement of activity in genetic networks
(Hu et al., 2016), deformable models for image comparison (Rodriguez et al., 2014), and gene
selection and classification of microarray data (Díaz-Uriarte & De Andrés, 2006), to name
a few.

Moreover, due to the great versatility of ML techniques, they have been used in a
wide variety of application areas, to discover hidden patterns in the datasets: identification
and authentication of tequilas (Pérez-Caballero et al., 2017), wearable sensor data
fusion (Kanjo, Younis & Sherkat, 2018), predicting the outcomes of organic reactions
(Skoraczyñski et al., 2017), animal behaviour detection (Pons, Jaen & Catala, 2017) or
to measure the visual complexity of images (Machado et al., 2015). In particular, ML
techniques have proven to be able to uncover unimaginable relationships in very diverse
fields of application, such as image or voice recognition, sentiment analysis or language
translation (Li, Li & Wu, 2015; De Viñaspre & Oronoz, 2015).

The main objective of this work is the development of ML models for the
prediction of the degree of adherence to the MD. To this end, information on different
anthropometric and socio-demographic variables, nutritional status and self-perception
of body image is used in order to identify which of the variables have a greater influence
and are key in the adherence to a healthy diet such as MD, allowing our patients to
improve their quality of life and to reduce the negative effects of well-known and related
diseases.

Taking into account all of the above, the experimental methodology proposed in
the development of this study is based on the collection and generation of data to be
analysed with our cohort in Galicia (Spain), as well as on the use of ML techniques.
The purpose is to extract and explain the underlying information in the data and
determine which of these variables are the most important to classify people as having
either a good or poor adherence to the MD. As mentioned before, there are several
health benefits related to this particular food diet, especially for: chronic inflammation,
metabolic syndrome, diabetes mellitus, CVD, neurodegenerative diseases, cancer and
psycho-organic deterioration, moreover leading to greater longevity and better quality of
life. Thus, this study is relevant for understanding how to measure the degree of adherence,
in order to ensure the aforementioned benefits.

The structure of the article is as follows: the Materials and methods section presents the
subjects and the variables measured for each of them. Next, the machine
learning and feature selection techniques are described, along with the experimental design
followed to ensure that the results are reproducible and representative of the studied
problem. In the next section, the results are presented and discussed, and the final section
of the article includes the conclusions of the work.

MATERIALS AND METHODS
The present study was structured as follows. Initially, a population from our cohort was
selected to carry out the study; the population was grouped into two categories: with
high and low degree of adherence to the MD. Once the study population
had been identified, the information was collected from each of
the users of the health system. The type of study carried out is described below,
the sample size is justified and all measurements collected are explained in
detail. Once the dataset was generated, it was analysed with four different ML techniques
and a feature selection phase was applied for dimensionality reduction.

Population and data description
This is an observational prevalence study, conducted in Northwestern Spain (municipality
of Cambre, A Coruña, Spain), which included a randomly selected population aged
40 years and over. The sampling population consisted of individuals residing in Cambre,
identified through the National Health System card census. In Spain, the National
Health System has universal coverage, and almost all Spanish citizens are beneficiaries of
public healthcare services.

The sample size was calculated taking into account the total population of the
municipality (n = 12,446). After stratification by age and gender, n = 503 persons
were selected to participate in the study. Sample size was estimated using the single
proportion formula, with a 95% confidence level. A sample size of n = 503 subjects
was estimated based on an adherence to MD rate of 50%. Precision was set at 4.3% and
percentage of losses at 10%. Population data is shown in Table 1.
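
As a rough illustration of the single proportion formula mentioned above, the following Python sketch computes a sample size for a finite population. The finite-population form of the formula, the value z = 1.96 for 95% confidence and the way the 10% losses are applied are assumptions of this sketch, not details stated in the text, so the output should be expected to land near the reported n = 503 rather than reproduce it exactly.

```python
import math

def sample_size_single_proportion(population, p=0.5, precision=0.043, z=1.96):
    """Sample size to estimate one proportion in a finite population.

    Assumed form: n = N z^2 p(1-p) / (d^2 (N-1) + z^2 p(1-p)).
    """
    variance_term = z ** 2 * p * (1 - p)
    n = population * variance_term / (precision ** 2 * (population - 1) + variance_term)
    return math.ceil(n)

def inflate_for_losses(n, loss_rate=0.10):
    """People to invite so that about n remain after the expected losses (assumed adjustment)."""
    return math.ceil(n / (1 - loss_rate))

# Figures taken from the text: N = 12,446, expected adherence 50%, precision 4.3%.
n_required = sample_size_single_proportion(12446)
print(n_required, inflate_for_losses(n_required))
```

The choice p = 0.5 maximises p(1 - p), so it gives the most conservative (largest) sample size for a given precision.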

Table 1 Population data of the municipality of Cambre (A Coruña) for the year 2012 and sample
data according to age and sex.

Age groups           Population                              Sample
                     Total   Men            Women            Total  Men          Women
40–44                2,465   1,202 (26.9%)  1,263 (27.8%)    33     19 (12.9%)   14 (13.2%)
45–49                2,231   1,110 (24.8%)  1,121 (24.7%)    85     52 (35.4%)   33 (31.1%)
50–54                1,763   857 (19.2%)    906 (19.9%)      54     32 (21.8%)   22 (20.8%)
55–59                1,383   702 (15.7%)    681 (14.9%)      33     18 (12.3%)   15 (14.1%)
60–64                1,170   598 (13.4%)    572 (12.6%)      48     26 (17.7%)   22 (20.8%)
Total (40–64)        9,012   4,469          4,543            253    147          106
65–69                1,027   497 (33%)      530 (27.5%)      94     57 (38%)     37 (37%)
70–74                688     337 (22.4%)    351 (18.2%)      77     46 (30.7%)   31 (31%)
75–79                777     326 (21.6%)    451 (23.4%)      46     28 (18.7%)   18 (18%)
80–84                511     198 (13.1%)    313 (16.2%)      24     12 (8%)      12 (12%)
85 and more          431     148 (9.8%)     283 (14.7%)      9      7 (4.7%)     2 (2%)
Total (65 and more)  3,434   1,506          1,928            250    150          100

A personal interview was arranged with each individual. After obtaining their
permission and written consent, a trained nurse measured the
anthropometric variables and collected the data necessary to complete the
questionnaires. Patients who could not go to the health centre due to personal or
travel reasons, and those who suffered from a cognitive impairment that made it
impossible for them to take part in the study, were excluded. The study received written
approval from the Regional Ethics Committee for Clinical Research (code 2012/390 CEIC
Galicia).

The information described below was collected from each selected subject.
Socio-demographic variables: age, gender, level of education, marital status and
cohabitation. Prevalence of arterial hypertension and smoking: the systolic and diastolic
blood pressure of each patient was recorded at the beginning and at the end of the visit,
obtaining the prevalence of arterial hypertension; the smoking habit was recorded
according to self-reported information. Anthropometric variables: the anthropometric
parameters allow us to know the state of the protein and caloric reserves, besides providing
guidance to the health professional about the consequences of the imbalances in these
reserves.

All measurements were made during the same session, to avoid variations in the
environmental or biological conditions. For the measurement of weight and height, the
person was barefoot and wearing light clothing; an MB-201T plus Asimed scale-rod was
used with an accuracy of 100 grams (weight) and 1 mm (height). BMI was obtained by
means of the ratio $\mathrm{BMI} = \dfrac{\text{weight (kg)}}{\text{height}^2\ (\text{m}^2)}$, and grouped according to the WHO classification:
BMI < 18.5 kg/m²: low weight; 18.5–24.99 kg/m²: normal weight; 25–29.99 kg/m²: overweight;
and ≥ 30 kg/m²: obesity.

The waist and hip circumferences were measured with an inelastic tape measure with the
patient standing upright, the abdomen relaxed, the upper limbs hanging at the sides,
and with the feet and knees joined together. The waist circumference was measured by
taking the mid-point between the lower costal margins and the iliac crests. It is
considered a risk factor for CVD when it is wider than 80 cm in women and wider than
94 cm in men, and a very high risk if it exceeds 88 cm and 102 cm, respectively (Alberti
et al., 2009).
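
The BMI ratio and the WHO grouping described above can be sketched in a few lines of Python. The function names are hypothetical, and treating the published ranges as half-open intervals (so that a value such as 24.995 still falls in a class) is an assumption of this sketch.

```python
def bmi(weight_kg, height_m):
    """Body mass index: weight in kilograms divided by squared height in metres."""
    return weight_kg / height_m ** 2

def who_bmi_category(value):
    """WHO grouping as listed in the text, with ranges closed on the left."""
    if value < 18.5:
        return "low weight"
    if value < 25:
        return "normal weight"
    if value < 30:
        return "overweight"
    return "obesity"

# Example with made-up measurements.
print(who_bmi_category(bmi(70.0, 1.75)))
```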

The hip circumference was measured as the maximum circumference around the
buttocks. Based on these two values, the waist-hip ratio was calculated and assessed using the cut-off
points proposed by the WHO, under which normal values reach 0.8 in women and 1 in
men; higher values indicate abdominal visceral obesity, which is associated with
increased cardiovascular risk (Jover, 1997).
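
A minimal sketch of how the waist circumference thresholds and the waist-hip ratio cut-offs quoted in the text might be applied is given below. The function names, the sex labels and the category labels are hypothetical; only the numeric thresholds come from the text.

```python
# Waist thresholds in cm as quoted in the text: (increased risk, very high risk).
WAIST_THRESHOLDS = {"female": (80, 88), "male": (94, 102)}
# Waist-hip ratio cut-offs as quoted in the text.
WHR_CUTOFFS = {"female": 0.8, "male": 1.0}

def waist_risk(waist_cm, sex):
    """Cardiovascular risk label derived from waist circumference alone."""
    increased, very_high = WAIST_THRESHOLDS[sex]
    if waist_cm > very_high:
        return "very high"
    if waist_cm > increased:
        return "increased"
    return "not increased"

def waist_hip_ratio(waist_cm, hip_cm):
    return waist_cm / hip_cm

def abdominal_obesity(ratio, sex):
    """True when the ratio is above the cut-off for the given sex."""
    return ratio > WHR_CUTOFFS[sex]

print(waist_risk(85, "female"), abdominal_obesity(waist_hip_ratio(90, 100), "male"))
```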

The calf circumference was measured in the widest section of the ankle-knee distance
(cuff area) showing a good correlation with fat-free mass and muscle strength (Rolland
et al., 2003; Barbosa Murillo et al., 2007; Bonnefoy et al., 2002). The measurement was
carried out with an inextensible tape measure in cm.

The subscapular skinfold measures truncal obesity. The measurement is made one
centimetre below the lower angle of the scapula, following the natural furrow of the skin.
The scapula protrudes when the arm is carefully placed behind the back, and the lower
angle can be located this way. The fold is measured diagonally, at an angle
of 45°, to ensure the correct thickness measurement. The plicometer forceps should be
applied 1 cm in the inferolateral position to the thumb and finger that lift the fold.

To assess the amount of subcutaneous adipose tissue, the skin folds were measured in
millimetres in the tricipital, bicipital, subscapular and suprailiac areas. A digital calliper
Trim metre was used, including a double layer of skin and underlying adipose tissue,
always avoiding the muscle. The tricipital skinfold was measured longitudinally, at the
back of the non-dominant upper limb, at the midpoint between acromion and olecranon,
with the limb relaxed, parallel to the axis of the arm; the bicipital fold was measured at the
same point as the tricipital, but on the under arm.

The circumference of the arm was measured with an inextensible anthropometric tape
measure in cm. The measurement was taken at the midpoint of the non-dominant arm,
in the same place where the tricipital skinfold was measured and without compression
with the anthropometric tape.

Once the data of the different measurements were obtained, the mid-arm muscle
circumference was calculated, which reflects the skeletal muscle mass of the patients (protein
compartment) and is expressed in cm. The arm muscle area, which indicates the
muscle compartment, was derived from the brachial circumference and tricipital skinfold
measurements. The fat area of the arm, which indicates the patient’s fatty compartment, was derived from
the total brachial area and the muscular area of the arm. The Adipose Muscular Index,
which evaluates the nutritional status from the adipose and muscular areas of the arm, was
also calculated, being essentially applied in the assessment of obesity.
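
The text does not state which formulas were used for these derived arm indices. The sketch below assumes the common geometric approximations that treat the arm and its muscle core as concentric cylinders; the function name and dictionary keys are hypothetical, and the Adipose Muscular Index is left out because its definition is not given.

```python
import math

def arm_indices(arm_circumference_cm, triceps_skinfold_mm):
    """Derived arm measures under the assumed cylinder approximation."""
    skinfold_cm = triceps_skinfold_mm / 10.0
    # Mid-arm muscle circumference: circumference minus pi times the skinfold.
    muscle_circumference = arm_circumference_cm - math.pi * skinfold_cm
    # Area of a circle with circumference c is c^2 / (4 pi).
    total_area = arm_circumference_cm ** 2 / (4 * math.pi)
    muscle_area = muscle_circumference ** 2 / (4 * math.pi)
    return {
        "muscle_circumference_cm": muscle_circumference,
        "total_area_cm2": total_area,
        "muscle_area_cm2": muscle_area,
        "fat_area_cm2": total_area - muscle_area,
    }

# Example with made-up measurements: 30 cm circumference, 15 mm skinfold.
print(arm_indices(30.0, 15.0))
```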

For the determination of body fat percentage by electrical bio-impedance, a Beurer
BG55 bio-impedance meter was used, with a maximum capacity of 150 kg and a
precision of 0.1% for body fat, body water and muscle percentage, and 100 g for body
weight, according to the information provided by the manufacturer.

These methods are based on physical principles, such as the different ability of
conduction or resistance that the tissues show to the passage of an electric current, with
greater conductivity of the lean tissues than the fatty ones (Norman et al., 2007). Thus,
by means of bio-impedance, the following values were obtained: weight, fat mass, liquid
mass, muscle mass, bone density, basal metabolic rate (BMR) and activity metabolism.
Data on socio-demographic variables were collected, such as age, gender (male/female) and cohabitation
(with whom they live), and the prevalence of current smokers, ex-smokers (patients who stopped smoking
more than 12 months before entering the study) and non-smokers was estimated.
Additionally, blood pressure was recorded.

Adherence to the MD
Consumption of a characteristic food pattern of MD is associated with numerous health
benefits. These benefits are attributed to bioactive compounds that exert synergistic effects
and decrease the risk for development of chronic diseases.

In order to assess the quality of dietary habits (adherence to a Mediterranean dietary
pattern), the MD adherence test was used (Estruch et al., 2018). It is a questionnaire
consisting of fourteen quick questions that allow us to understand whether participants’
usual diet can be considered as following the parameters of the MD. Each question
answered affirmatively adds a point. It is considered that a person correctly follows the MD
when their score is equal to or greater than nine points.
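
The scoring rule just described is simple enough to sketch directly. The function name and the "high"/"low" labels are hypothetical; the fourteen items, one point per affirmative answer and the threshold of nine points come from the text.

```python
def md_adherence(answers, threshold=9):
    """Score a 14-item questionnaire: one point per affirmative answer."""
    if len(answers) != 14:
        raise ValueError("the questionnaire has fourteen items")
    score = sum(1 for answer in answers if answer)
    label = "high" if score >= threshold else "low"
    return score, label

# Example: nine affirmative answers reach the threshold.
print(md_adherence([True] * 9 + [False] * 5))
```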

Nutritional status was assessed using the Mini Nutritional
Assessment (MNA) questionnaire (Estruch et al., 2018). It is a validated method which,
through eighteen short questions, evaluates anthropometric measures, dietary habits,
lifestyle, pharmacological treatments and mobility, and performs a subjective evaluation of
health and nutritional status. The maximum value of the MNA scale is thirty points: a score <17
is considered malnutrition, a score between 17 and 23.5 indicates a risk of malnutrition, and
well-nourished subjects obtain scores of 24 points and higher.
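
The MNA cut-offs can be written as a small classification function. This is only an illustrative sketch: the function name and labels are hypothetical, and it assumes that scores move in half-point steps so that no value falls between 23.5 and 24.

```python
def mna_category(score):
    """Nutritional status from the MNA total score (0 to 30 points)."""
    if not 0 <= score <= 30:
        raise ValueError("MNA total score must lie between 0 and 30")
    if score < 17:
        return "malnutrition"
    if score <= 23.5:
        return "risk of malnutrition"
    return "well nourished"

print(mna_category(22.5))
```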

A measure of subjective weight is included by asking: ‘I consider that my weight is:
(A) higher than normal, (B) normal, (C) lower than normal’, following the model
proposed by Espina et al. (2001). Based on the answer, the population is classified
into three groups: ‘fairly subjective weight’ for those who believe they are at an ideal weight, ‘more
subjective kilograms’ for those who believe that they are overweight and ‘less subjective
kilograms’ for those who think they weigh less than they should.

Two of the eleven sub-scales of Garner’s Eating Disorder Inventory (EDI-2) (Garner,
1998) were used to study body image: Body Dissatisfaction (EDI-IC) and Obsession for
Thinness (EDI-OD), as they evaluate aspects directly related to perceptual alterations.
The body dissatisfaction sub-scale (EDI-IC) measures the dissatisfaction of the subject
with the general shape of their body or with those parts of the body that most concern
those with eating disorders (stomach, hips, thighs, buttocks). The thinness obsession
sub-scale (EDI-OD) measures concern about weight, diets and fear of getting fat.

This questionnaire was validated in Spain by Corral et al. (1998). The fourteen items of
these two sub-scales were mixed in the questionnaire to prevent the subjects from guessing the
construct being evaluated. All items were answered and scored according to the
form proposed in the questionnaire manual. The MD Adherence test was used to
determine the degree of adherence to the MD; it is a short, specific questionnaire of
fourteen items validated for the Spanish population and used by the MD Prevention Group
(PREDIMED) (Martínez-González et al., 2015).

Machine learning and statistical analysis
The authors tested different ML techniques for solving this problem, using cross-validation
techniques to avoid over-training while ensuring that the generalisation capability of the
model is the best possible, as well as different runs of the experiments to check the
behaviour of the techniques. Thus, all experiments were repeated ten times to check the
stability of the results, and the observed deviation between the experiments was small,
as shown in the results section. In particular, a tenfold cross-validation was used to divide
the dataset into ten random partitions, nine of which were used to train and one to validate
the results, each time taking a different subset for validation.
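
The repeated tenfold scheme described above can be sketched with the Python standard library alone. This is an illustration, not the authors' R/mlr code: the function names are hypothetical, the partitions are plain random splits (whether the original splits were stratified by class is not stated), and the seeds are arbitrary.

```python
import random

def kfold_splits(n_samples, k=10, seed=0):
    """Yield (training, validation) index lists for one k-fold partition."""
    rng = random.Random(seed)
    indices = list(range(n_samples))
    rng.shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        training = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield training, folds[i]

def repeated_kfold(n_samples, k=10, repeats=10, seed=0):
    """Repeat the k-fold partition with a different shuffle on every run."""
    for run in range(repeats):
        for training, validation in kfold_splits(n_samples, k, seed + run):
            yield run, training, validation

# Ten runs of tenfold cross-validation over the 503 subjects: 100 train/validation pairs.
print(sum(1 for _ in repeated_kfold(503)))
```

Within one run every subject appears in exactly one validation fold, so each example is used for validation once and for training nine times.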

In order to compare the performance of the ML techniques, the Area Under the
Receiver Operating Characteristic Curve (AUROC) was used. This is a combined
measurement which, besides being independent of the threshold used, includes both Type
I and Type II errors, ensuring that it is not conditioned by differences in the total number of
cases of each class (Fawcett, 2006).
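
One way to see why AUROC is threshold-independent is its rank interpretation: it equals the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one, with ties counted as one half. The sketch below implements that definition directly; the function name and the toy data are hypothetical.

```python
def auroc(labels, scores):
    """AUROC as the fraction of (positive, negative) pairs ranked correctly."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5  # ties count as half a win
    return wins / (len(positives) * len(negatives))

# Toy example: 1 = high adherence, 0 = low adherence.
print(auroc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
```

Because only the ordering of the scores matters, any strictly increasing transformation of the scores leaves the value unchanged, and the number of cases in each class only enters through the normalisation.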

An experimental design was employed (Fernandez-Lozano et al., 2016a) that allowed us to
divide the data using a cross-validation technique, which ensured that the performance
results obtained, as mentioned above, were not biased, that is, not over-adjusted to
the data. It also allowed us to identify which hyper-parameters were most
suitable for finding the best model with each ML technique, according to its particular
hyper-parameter configuration. To this end, the programming environment R (R Core
Team, 2016) and the package mlr (Bischl et al., 2016) were used, which also allowed us
to implement the considered experimental design. In addition, another objective
of this study was to find as few variables as possible that would yield a
performance value as high as possible, preferably at least equal to that obtained using
all available variables. This is basically a feature selection approach, whose main aims
are the following: to avoid overfitting and improve model performance, to provide faster and
more cost-effective models, and to gain a deeper insight into the underlying
processes that generated the data (Saeys, Inza & Larrañaga, 2007).
There are three approaches in ML to perform this process (filter, wrapper and embedded
methods), and a filter approach was chosen for its speed and independence of the classifier (Saeys,
Inza & Larrañaga, 2007). In general, performing this feature selection process helps to
reduce the noise inherently present in such datasets.
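
A filter approach scores each variable on its own, without consulting any classifier, and keeps the highest-ranked ones. The sketch below illustrates the idea with an absolute Welch t statistic between the two adherence groups; the choice of this statistic, the function names and the toy data are assumptions of the sketch, since the specific filter is not named in this part of the text.

```python
import statistics

def welch_t(group_a, group_b):
    """Absolute Welch t statistic between two samples of one variable."""
    standard_error = (statistics.variance(group_a) / len(group_a)
                      + statistics.variance(group_b) / len(group_b)) ** 0.5
    if standard_error == 0:
        return 0.0
    return abs(statistics.mean(group_a) - statistics.mean(group_b)) / standard_error

def rank_features(rows, labels, names):
    """Return the feature names ordered from most to least discriminative."""
    scores = {}
    for column, name in enumerate(names):
        high = [row[column] for row, y in zip(rows, labels) if y == 1]
        low = [row[column] for row, y in zip(rows, labels) if y == 0]
        scores[name] = welch_t(high, low)
    return sorted(scores, key=scores.get, reverse=True)

# Toy data: feature "a" separates the groups, feature "b" does not.
rows = [[1.0, 5.0], [1.2, 4.0], [0.9, 6.0], [3.0, 5.5], [3.1, 4.5], [2.9, 5.0]]
print(rank_features(rows, [0, 0, 0, 1, 1, 1], ["a", "b"]))
```

Because the ranking is computed once and reused by every classifier, a filter is fast and classifier-independent, which matches the two reasons given in the text for choosing it.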

The final step of our experimental analysis was a null hypothesis test for choosing the
best model, in order to determine whether the performance of a particular ML technique is
statistically better than that of the others. In our case, as there were more than two repeated
measures, an ANOVA or a Friedman test had to be considered. In particular, three
different conditions should be checked: normality, independence and homoscedasticity.
If the results fulfil the three conditions, a parametric test is applicable and ANOVA
should be used; otherwise, its non-parametric counterpart, the Friedman test, should be used.
Finally, a post hoc procedure had to be used in order to correct the p-values for multiple
testing.
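
For the non-parametric branch, the Friedman statistic can be computed from the ranks of the techniques within each repetition. The sketch below covers only that statistic, without tie correction, p-value or post hoc step; the function names and the example AUROC values are hypothetical.

```python
def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = mean_rank
        start = end + 1
    return ranks

def friedman_statistic(results):
    """results[i][j] is the score of technique j in repetition i."""
    n, k = len(results), len(results[0])
    rank_sums = [0.0] * k
    for row in results:
        for j, rank in enumerate(average_ranks(row)):
            rank_sums[j] += rank
    return 12.0 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3.0 * n * (k + 1)

# Three repetitions of three techniques (made-up AUROC values).
print(friedman_statistic([[0.90, 0.80, 0.70], [0.85, 0.80, 0.60], [0.95, 0.90, 0.70]]))
```

Under the null hypothesis that all techniques perform equally, this statistic is approximately chi-square distributed with k - 1 degrees of freedom.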

Machine learning techniques for classification problems
A large number of experiments were carried out in an attempt to identify the best ML
model able to solve the problem and to ensure that the results are reproducible, real and
obtained under equal conditions. In addition, the search space of the
best possible parameters was explored in the same way for each technique, so that all techniques
had the same possibilities of exploration across the same subsets of data, avoiding the
overfitting that could otherwise occur. In particular, the following well-known state-of-the-art
techniques were implemented: Random Forest (RF) (Breiman, 2001), Support Vector
Machines (SVM) (Cortes & Vapnik, 1995; Vapnik, 1995), Elastic Net (ENET) (Tibshirani,
1994; Zou & Hastie, 2005) and weighted k-Nearest Neighbours (KNN) (Hechenbichler &
Schliep, 2004).

Random Forest (Breiman, 2001) is a state-of-the-art ML technique that has been used in
multiple domains with good results. One of its main strengths is that the results obtained
are very easy to understand; it is based on very simple concepts and, in general,
good results are obtained even when it is applied with little experience in the tuning of
hyper-parameters. This technique combines multiple decision trees, each of
them trained over a bootstrapped subset of the data. In this way, RF combines the
individual predictions of the decision trees into a global prediction that, in general, is more
successful than any of the individual ones. Of all the possible variables in the dataset, a number
are randomly selected (with replacement), and a number of trees are constructed
from the set of examples used for the training phase, restricted to the previously
selected subset. For classification problems, it is recommended to use the square
root of the total number of variables in the dataset. To explore the
solution space in the best way possible, our experiments used a parameter domain
adjusted by a grid search in which, for a fixed number of trees (1,000), the number of
randomly selected variables was explored from 2 to 6. In addition, values from 1 to 4
were explored for the size of the terminal nodes of the tree.
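
The Random Forest search space described above is small enough to enumerate explicitly. The sketch below only builds the grid; the dictionary keys are hypothetical names rather than the parameter names of any particular library, and the figure of 38 variables is taken from the Results section.

```python
import itertools
import math

N_FEATURES = 38  # number of variables reported in the Results section

# Square-root rule of thumb for classification, rounded down.
default_n_vars = math.isqrt(N_FEATURES)

# Grid: 1,000 trees, 2 to 6 randomly selected variables, terminal node size 1 to 4.
rf_grid = [
    {"n_trees": 1000, "n_vars": n_vars, "node_size": node_size}
    for n_vars, node_size in itertools.product(range(2, 7), range(1, 5))
]

print(default_n_vars, len(rf_grid))
```

With 38 variables the square-root rule gives roughly 6, so the explored range of 2 to 6 includes the recommended default.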

Support Vector Machines (Cortes & Vapnik, 1995; Vapnik, 1995) is also one of the ML
techniques that has been commonly used in different domains in recent years
with good results. In fact, along with RF, it is one of the algorithms considered
state-of-the-art, it is easy to understand, and the results obtained are verifiable. In the problem
addressed in this study, the main objective of SVM is to find the hyperplane that
best separates the examples with high and low degree of adherence to the MD and, at
the same time, to maximise the distance of separation between the examples of both classes and the
hyperplane. That is, it attempts to find the separating hyperplane that generalises in the
best possible way (Burges, 1998). To achieve this goal, SVM introduces a particular
mathematical concept known as the kernel: a mathematical function that maps
the input space into a higher-dimensional one, which is used to transform a
problem that is not linearly separable into one that is. There are different kernel
functions, which in general can be interpreted as a measure of similarity between two
objects (60), and one of the most used is the Gaussian Radial Basis Function (RBF), because basically any
surface can be obtained with this function (61). In this case, the domain of the parameters
used to search for the best model consists of a grid search over two different parameters.
The first one (parameter C) is directly related to the model and is used as a balance between
the classification errors and the simplicity of the decision surface, while the second
(the gamma parameter) is the free parameter of the Gaussian function; SVM
is very sensitive to changes in this parameter. For both parameters, and according to the
usual practice, values were evaluated in powers of two with exponents between −12 and 12. To better
understand this technique, the following reading materials are recommended: Burges
(1998), Vert, Tsuda & Schölkopf (2004) and Cristianini & Shawe-Taylor (2000).
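
To make the kernel idea and the search grid concrete, the sketch below defines the Gaussian RBF kernel as a similarity measure and enumerates a grid of C and gamma values as powers of two. Using every integer exponent between −12 and 12 is an assumption of this sketch, as the step between exponents is not stated in the text; the function and variable names are hypothetical.

```python
import itertools
import math

def rbf_kernel(x, y, gamma):
    """Gaussian RBF kernel: 1 for identical points, decaying towards 0 with distance."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)

# Assumed grid: every integer exponent from -12 to 12 for both parameters.
EXPONENTS = range(-12, 13)
svm_grid = [
    {"C": 2.0 ** c, "gamma": 2.0 ** g}
    for c, g in itertools.product(EXPONENTS, EXPONENTS)
]

print(len(svm_grid), rbf_kernel([0.0, 0.0], [1.0, 0.0], gamma=1.0))
```

A large gamma makes the similarity drop quickly with distance, which produces a more flexible decision surface; this is one way to read the sensitivity to gamma noted in the text.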

Elastic Net (Tibshirani, 1994; Zou & Hastie, 2005) is based on lasso (penalised least
squares method) and was specifically developed to solve some of the limitations
encountered for this technique (56). On the one hand, a grid search was performed on two

Arceo-Vilas et al. (2020), PeerJ Comput. Sci., DOI 10.7717/peerj-cs.287 9/21

http://dx.doi.org/10.7717/peerj-cs.287
https://peerj.com/computer-science/


different parameters: the alpha penalty parameter, which takes values in the range 0–1,
was searched over 0, 0.15, 0.25, 0.35, 0.5, 0.65, 0.75, 0.85 and 1. On the other hand,
the lambda parameter was searched, as recommended by the authors of the technique, over
powers of ten less than or equal to one; in particular, the following values were used:
0.0001, 0.001, 0.01, 0.1 and 1.
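
To make the role of the two parameters concrete, the sketch below evaluates the elastic net penalty in the parametrisation commonly used by the glmnet package, lambda * (alpha * ||beta||_1 + (1 - alpha) / 2 * ||beta||_2^2), and enumerates the grid listed above. The function name and the example coefficient vector are hypothetical, and the snippet only illustrates the penalty term, not the fitting procedure.

```python
def elastic_net_penalty(beta, alpha, lam):
    """Penalty added to the loss: alpha = 1 gives lasso, alpha = 0 gives ridge."""
    l1_norm = sum(abs(b) for b in beta)
    l2_norm_squared = sum(b * b for b in beta)
    return lam * (alpha * l1_norm + 0.5 * (1.0 - alpha) * l2_norm_squared)


# Values of the two parameters as listed in the text.
alphas = [0, 0.15, 0.25, 0.35, 0.5, 0.65, 0.75, 0.85, 1]
lambdas = [0.0001, 0.001, 0.01, 0.1, 1]
grid = [(a, l) for a in alphas for l in lambdas]

example_beta = [1.0, -2.0]  # hypothetical coefficients
lasso_penalty = elastic_net_penalty(example_beta, alpha=1, lam=1)
ridge_penalty = elastic_net_penalty(example_beta, alpha=0, lam=1)
print(len(grid), lasso_penalty, ridge_penalty)
```

Intermediate values of alpha blend the two extremes, which is why the search covers the whole 0–1 range rather than only its end points.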

Finally, a simple KNN (Hechenbichler & Schliep, 2004) assigns an unclassified example,
through a decision rule, to the class that occurs most frequently among its k most
similar neighbouring examples. In the weighted variant, the neighbours of each example
are identified according to the Minkowski distance, and the class with the maximum
accumulated kernel density is assigned (Hechenbichler & Schliep, 2004; Samworth, 2012).
In particular, values of k less than or equal to nine were used. Therefore,
this particular and improved implementation of KNN uses kernel functions to measure
the degree of similarity between examples, as previously mentioned in the case of the SVMs.
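
The weighted voting idea can be sketched in a few lines of Python. This is a simplified illustration rather than the implementation used in the study: the triangular kernel, the scaling of distances by the (k + 1)-th neighbour and the toy data are assumptions chosen for the example.

```python
from collections import defaultdict


def minkowski(x, z, p=2):
    """Minkowski distance of order p (p = 2 is the Euclidean distance)."""
    return sum(abs(a - b) ** p for a, b in zip(x, z)) ** (1.0 / p)


def weighted_knn_predict(train_x, train_y, query, k=3, p=2):
    """Return the class whose k nearest neighbours accumulate the largest kernel weight."""
    dists = sorted((minkowski(x, query, p), y) for x, y in zip(train_x, train_y))
    # Scale distances into [0, 1] using the (k + 1)-th neighbour when available.
    scale = dists[k][0] if len(dists) > k else dists[-1][0]
    scale = scale or 1.0
    votes = defaultdict(float)
    for distance, label in dists[:k]:
        u = min(distance / scale, 1.0)
        votes[label] += 1.0 - u  # triangular kernel: closer neighbours weigh more
    return max(votes, key=votes.get)


# Toy data with two well-separated groups (hypothetical values).
train_x = [[0.0, 0.0], [0.0, 1.0], [5.0, 5.0], [6.0, 5.0], [5.0, 6.0]]
train_y = ["low", "low", "high", "high", "high"]
prediction_low = weighted_knn_predict(train_x, train_y, [0.2, 0.1], k=3)
prediction_high = weighted_knn_predict(train_x, train_y, [5.5, 5.2], k=3)
print(prediction_low, prediction_high)
```

In the first query one of the three neighbours belongs to the other class, but it is so distant that its kernel weight is close to zero, which is the behaviour that distinguishes the weighted variant from a plain majority vote.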

RESULTS
The dataset comprises 38 variables that characterise the underlying differences
between high and low adherence to the MD. The data were standardised
using the z-score formula to have a mean equal to zero and a standard deviation equal
to one. Four different ML techniques were used to verify the results obtained, in an attempt
to identify the best-performing technique. Initially, the complete set of study variables
was analysed. Figs. 1A and 1B show that the techniques behave in a fairly stable
manner in the prediction. Even an a priori simple technique such as KNN obtains the
best results of the entire experimental phase,
indicating that almost all variables contain relevant information. In any case, in order
to understand whether noise or contradictory or correlated information may
be hindering the learning process of the algorithms, a dimensionality reduction phase
was then carried out.
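
For reference, the z-score standardisation mentioned above subtracts the mean of a variable and divides by its standard deviation. The short Python sketch below applies it to one hypothetical variable; the use of the sample standard deviation and the example values are assumptions.

```python
import statistics


def z_score(values):
    """Standardise a variable to mean 0 and standard deviation 1."""
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)  # sample standard deviation (assumption)
    return [(v - mean) / sd for v in values]


weights_kg = [60.0, 70.0, 80.0]  # hypothetical measurements
z = z_score(weights_kg)
print(z)
```

Standardising each variable in this way puts measurements with different units on a common scale, which matters for distance-based and kernel-based methods such as KNN and SVM.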

Additionally, a feature selection process was carried out to reduce the number of
variables as much as possible, so that the results remain statistically similar to,
if not better than, those obtained using all variables. Our approach is a filter
feature selection that uses a t-test to quantify the association between each feature and the
class (high or low adherence to the MD) before the training process. Three subsets of
4, 16 and 32 features were evaluated, taken from the original ordering according to the highest
p-value from the t-test. The average AUROC results of the ten 10-fold
cross-validation experiments are shown in Fig. 1.
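
A filter of this kind scores every feature independently of the classifier and keeps the first few of the resulting ordering. The Python sketch below is a minimal illustration under stated assumptions: it uses Welch's form of the t statistic and orders features by its absolute value, which is one common convention and not necessarily the exact variant or ordering used by the authors; the toy data are hypothetical.

```python
import math
import statistics


def welch_t(a, b):
    """Two-sample t statistic that does not assume equal variances."""
    var_a, var_b = statistics.variance(a), statistics.variance(b)
    standard_error = math.sqrt(var_a / len(a) + var_b / len(b))
    return (statistics.fmean(a) - statistics.fmean(b)) / standard_error


def rank_features(rows, labels):
    """Return feature indices ordered from most to least discriminative."""
    scores = []
    for j in range(len(rows[0])):
        high = [row[j] for row, y in zip(rows, labels) if y == 1]
        low = [row[j] for row, y in zip(rows, labels) if y == 0]
        scores.append((abs(welch_t(high, low)), j))
    return [j for _, j in sorted(scores, reverse=True)]


# Toy data: three features, six subjects, label 1 = high adherence (hypothetical).
rows = [
    [1.0, 10.0, 3.0], [2.0, 11.0, 4.0], [1.5, 10.5, 5.0],
    [1.2, 1.0, 2.0], [1.8, 2.0, 3.0], [1.6, 1.5, 4.0],
]
labels = [1, 1, 1, 0, 0, 0]
ranking = rank_features(rows, labels)
top_two = ranking[:2]
print(ranking, top_two)
```

Because the ranking is computed before any model is trained, the same ordered list can be truncated at 4, 16 or 32 features and passed to each of the four learning algorithms.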

As the number of features increases, there is a clear upward trend in performance,
and the AUROC results obtained with 16 and 32 features are very close to those
obtained with the full dataset. In any case, it should be checked whether the
differences between the subsets of 16 and 32 variables and the
full dataset are statistically significant, to ensure that the subset with fewer features is statistically the best option.
Finally, as shown in Fig. 1A, SVM is the best model in three out of the four datasets, and
reaches values close to 0.94 in AUROC.
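
The evaluation protocol behind these figures, ten repetitions of 10-fold cross-validation scored with AUROC, can be sketched as follows. The sketch is illustrative only: the function names are hypothetical, the folds are not stratified by class (an assumption), and the AUROC is computed from its pairwise definition rather than with a library routine.

```python
import random


def repeated_kfold(n_samples, k=10, repeats=10, seed=0):
    """Yield (repeat, train_idx, test_idx) for `repeats` runs of k-fold CV."""
    rng = random.Random(seed)
    for repeat in range(repeats):
        indices = list(range(n_samples))
        rng.shuffle(indices)  # a new random partition for every run
        folds = [indices[i::k] for i in range(k)]
        for held_out in range(k):
            test_idx = folds[held_out]
            train_idx = [i for f, fold in enumerate(folds)
                         if f != held_out for i in fold]
            yield repeat, train_idx, test_idx


def auroc(labels, scores):
    """Probability that a random positive is scored above a random negative."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


splits = list(repeated_kfold(n_samples=50))
example_auc = auroc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
print(len(splits), example_auc)
```

Each model is trained and scored once per split, and the scores are then summarised by their mean and by their spread across runs.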


However, as previously mentioned, a single mean measure is not enough and it is
necessary to analyse the behaviour of the models during the whole experimental phase and
to verify how stable they are, as shown in Fig. 1B.

Figure 1B shows that if the number of variables is very small (4), the models are
skewed and there is a higher variability in the performance because there is not enough
information in the data to find a good classification model. It is also important to note
that the results obtained with 16 and 32 features showed that this variability was
significantly reduced until reaching average and standard deviation values very similar
to those obtained using all the variables.

As observed in the two previous figures, the best results in AUROC were obtained using
SVM. The same results in accuracy are shown in Fig. 2.

Figure 1 Summary of the performance (AUROC) of the four ML techniques (RF, SVM, GLMNET and KNN) for each one of the subsets of
features (FS-4, FS-16, FS-32 and Full). (A) Average of the experiments for each size analysed and (B) boxplot of the results in order to check the behaviour of the techniques
through the learning process. Full-size DOI: 10.7717/peerj-cs.287/fig-1

To check whether the difference between the three winning models (SVM with 16, 32
and all variables) is significant or not, a null hypothesis test was applied. Following the
experimental methodology proposed in Fernandez-Lozano et al. (2016a), the normality
condition was checked with the Shapiro–Wilk test (Shapiro & Wilk, 1965), with a significance
level α = 0.05 and the null hypothesis that our results follow a normal distribution.
The null hypothesis was not rejected (W = 0.96179, p-value = 0.3438);
therefore, normality could be assumed for our results.

Figure 2 Summary of the average performance of the experiments: (A) Accuracy and (B) F-measure of the four ML techniques (RF, SVM,
GLMNET and KNN) for each one of the subsets of features (FS-4, FS-16, FS-32 and Full). Full-size DOI: 10.7717/peerj-cs.287/fig-2

Next, a Bartlett test (Bartlett, 1937) was performed, with a significance level α = 0.05
and with the null hypothesis that our data were homoscedastic. The test result indicates
that the null hypothesis should not be rejected, with a Bartlett's K-squared of 2.3128
with two degrees of freedom and p-value = 0.3146. The results of both tests indicate that a
parametric ANOVA test should be conducted, with a significance level α = 0.05 and
the null hypothesis that our results are statistically equal. The results of the ANOVA test
indicate that we fail to reject the null hypothesis and that the three ML models are statistically
equal, with an adjusted p-value = 0.1124. Consequently, the 16-feature model
(BMR, MNA total score, weight, height, bone density, waist-hip ratio, smoker,
age, EDI-OD, circumference of the arm, activity metabolism, subscapular skin fold,
subscapular circumference in cm, circumference of the waist, circumference of the calf,
brachial area) should be considered the best-performing one, since more than half of the
initial features, which are not relevant for the SVM, were removed.
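
The p-value reported for the Bartlett test can be checked by hand, because the statistic is referred to a chi-squared distribution and, for two degrees of freedom, the upper-tail probability has the closed form exp(-x / 2). The short Python sketch below performs that check; the variable names are illustrative.

```python
import math

k_squared = 2.3128       # Bartlett's K-squared reported in the text
degrees_of_freedom = 2   # three groups minus one

# For a chi-squared distribution with 2 degrees of freedom the survival
# function is exp(-x / 2), so no statistical library is needed here.
p_value = math.exp(-k_squared / 2.0)

alpha = 0.05
reject_null = p_value < alpha

print(f"p-value = {p_value:.4f}")
print("reject the null hypothesis:", reject_null)
```

The computed value agrees with the reported p-value of 0.3146, which is well above the 0.05 threshold, so the null hypothesis of the test is retained.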

DISCUSSION
To check whether our results are relevant and in accordance with what has been
previously published, the state-of-the-art articles on the topic were reviewed, in
an attempt to identify the variables most related to the degree of adherence to a MD.
The search led to previous studies that also found the variables identified by SVM to be
among the most important, in particular BMR (Cutillas et al., 2013; Careau, 2017; Srivastava et al.,
2017; Bonfanti et al., 2014), MNA total score (Farias et al., 2016; Zaragoza Martí et al.,
2015; Abreu-Reyes et al., 2017), weight and height (De la Montaña Miguélez et al., 2012;
Buckland, Bach-Faig & Majem, 2008; Ortega Anta & López Sobaler, 2014; Travé &
Castroviejo, 2011), bone density (Romero Pérez & Rivas Velasco, 2014; Savanelli et al.,
2017; Melaku et al., 2017; Štefan et al., 2017), waist-hip ratio (Downer et al., 2016; Estruch
et al., 2016; Bertoli et al., 2015) and whether the patient is a smoker (Zaragoza Martí et al., 2015;
Marventano et al., 2017; Grao-Cruces et al., 2015). Therefore, the results were corroborated,
supporting the ability of ML techniques to identify underlying patterns in the data.
According to the feature selection process, the remaining predictors are not relevant for all
the ML techniques.

CONCLUSIONS
The first model based on ML that was proposed for the prediction of the degree of
adherence to the MD depended on information related to different anthropometric
variables, socio-demographic variables, nutritional status and self-perception of body image.

Initially, experiments with four different ML methods were performed and feature
selection techniques were applied to reduce the dimensionality of the problem. SVM is the
best-performing model according to the experimental design after a null hypothesis test,
and our study found that using a feature selection approach, the number of features
could be drastically reduced to 16 (less than half of the initial number) achieving an
equivalent performance value in AUROC. The best model obtained was an SVM with
an RBF kernel as a decision function. The importance of each one of the predictors cannot
be studied because a nonlinear SVM is like a black box and the internal mapping function
is unknown. Furthermore, the weight vector cannot be explicitly computed.
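
The point about the weight vector can be illustrated with a small Python sketch. An SVM decision function can be written in dual form as f(x) = sum_i a_i K(s_i, x) + b over the support vectors s_i; with a linear kernel this collapses to an explicit weight vector w = sum_i a_i s_i, whereas the RBF kernel corresponds to an infinite-dimensional feature space and offers no such finite vector. All numbers below (support vectors, dual coefficients, bias, gamma) are hypothetical values chosen for the example, not quantities from the fitted model.

```python
import math

support_vectors = [[1.0, 2.0], [2.0, 0.5], [0.0, 1.0]]  # hypothetical
dual_coefs = [0.7, -0.4, -0.3]                          # a_i, hypothetical
bias = 0.1                                              # hypothetical


def linear_kernel(x, z):
    return sum(a * b for a, b in zip(x, z))


def rbf_kernel(x, z, gamma=0.5):
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))


def decision(x, kernel):
    """Dual-form decision value: weighted kernel similarities to the support vectors."""
    return sum(c * kernel(sv, x) for c, sv in zip(dual_coefs, support_vectors)) + bias


# Linear kernel: the dual form collapses to one weight per input feature.
w = [sum(c * sv[j] for c, sv in zip(dual_coefs, support_vectors)) for j in range(2)]

x = [0.5, 1.5]
primal = linear_kernel(w, x) + bias
print("linear, dual form:  ", decision(x, linear_kernel))
print("linear, with w:     ", primal)
print("RBF, dual form only:", decision(x, rbf_kernel))
```

For the linear kernel each entry of w can be read as the contribution of one input variable, which is exactly the per-predictor information that is unavailable for the RBF model.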

Finally, our results are in accordance with the findings of previous publications and have
primarily served to establish new factors related to the degree of adherence to the MD.

ADDITIONAL INFORMATION AND DECLARATIONS

Funding
This work is supported by the “Collaborative Project in Genomic Data Integration
(CICLOGEN)” PI17/01826 funded by the Carlos III Health Institute from the Spanish
National plan for Scientific and Technical Research and Innovation 2013–2016 and
the European Regional Development Funds (FEDER)—“A way to build Europe”.
This project was also supported by the General Directorate of Culture, Education and
University Management of Xunta de Galicia (Ref. ED431G/01, ED431D 2017/16), the
“Galician Network for Colorectal Cancer Research” (Ref. ED431D 2017/23), Competitive
Reference Groups (Ref. ED431C 2018/49) and the European Regional Development Funds
(FEDER)—“A way to build Europe”. The funders had no role in study design, data
collection and analysis, decision to publish, or preparation of the manuscript.

Grant Disclosures
The following grant information was disclosed by the authors:
Collaborative Project in Genomic Data Integration (CICLOGEN): PI17/01826.
Carlos III Health Institute from the Spanish National plan for Scientific and Technical
Research and Innovation 2013–2016.
European Regional Development Funds (FEDER)—“A way to build Europe”.
General Directorate of Culture, Education and University Management of Xunta de
Galicia: ED431G/01 and ED431D 2017/16.
Galician Network for Colorectal Cancer Research: ED431D 2017/23.
Competitive Reference Groups: ED431C 2018/49.
European Regional Development Funds (FEDER)—“A way to build Europe”.

Competing Interests
The authors declare that they have no competing interests.

Author Contributions
• Alba Arceo-Vilas performed the experiments, analysed the data, performed the
computation work, prepared figures and/or tables, authored or reviewed drafts of the
paper, and approved the final draft.

• Carlos Fernandez-Lozano conceived and designed the experiments, performed the
experiments, analysed the data, performed the computation work, prepared figures and/or
tables, authored or reviewed drafts of the paper, and approved the final draft.

• Salvador Pita conceived and designed the experiments, analysed the data, authored or
reviewed drafts of the paper, and approved the final draft.

• Sonia Pértega-Díaz conceived and designed the experiments, analysed the data, authored
or reviewed drafts of the paper, and approved the final draft.

• Alejandro Pazos conceived and designed the experiments, authored or reviewed drafts of
the paper, and approved the final draft.

Ethics
The following information was supplied relating to ethical approvals (i.e. approving body
and any reference numbers):

The study received written approval from the Regional Ethics Committee for Clinical
Research (code 2012/390 CEIC Galicia).


Data Availability
The following information was supplied regarding data availability:

Data is available at Figshare: Fernandez-Lozano, Carlos (2019): Identification of
predictive factors of the degree of adherence to the Mediterranean diet through
machine-learning techniques. figshare. Dataset. DOI 10.6084/m9.figshare.7628837.v2.

Supplemental Information
Supplemental information for this article can be found online at http://dx.doi.org/10.7717/
peerj-cs.287#supplemental-information.

REFERENCES
Abreu-Reyes JA, Álvarez-Luis D, Arteaga-Hernández V, Sánchez-Mendez M, Abreu-González R.
2017. Mediterranean diet adherence by patients with primary open angle glaucoma. Archivos de la
Sociedad Española de Oftalmología 92(8):353–358 DOI 10.1016/j.oftale.2017.02.006.

Alberti KGMM, Eckel RH, Grundy SM, Zimmet PZ, Cleeman JI, Donato KA, Fruchart JC,
James WPT, Loria CM, Smith SC. 2009. Harmonizing the metabolic syndrome: a joint interim
statement of the international diabetes federation task force on epidemiology and prevention;
national heart, lung, and blood institute; American heart association; world heart federation;
international. Circulation 120(16):1640–1645.

Anta RMO, Lopez-Solaber AM, Perez-Farinos N, Ortega Anta RM, López-Solaber AM,
Pérez-Farinós N. 2013. Associated factors of obesity in Spanish representative samples.
Nutricion Hospitalaria 28(5):56–62.

Buckland G, Bach-Faig A, Majem LS. 2008. Eficacia de la dieta mediterránea en la prevención de
la obesidad. Una revisión de la bibliografía. Revista Española de Obesidad 6(6):329–339.

Barbosa Murillo JAP, Rodríguez M NG, Hernández H De Valera YM, Hernández H RA,
Herrera M HA. 2007. Masa muscular, fuerza muscular y otros componentes de funcionalidad
en adultos mayores institucionalizados de la Gran Caracas-Venezuela. Nutricion Hospitalaria
22(5):578–583.

Bartlett MS. 1937. Properties of sufficiency and statistical tests. Proceedings of the Royal Society of
London: Series A, Mathematical and Physical Sciences 160(901):268–282.

Bertoli S, Leone A, Vignati L, Bedogni G, Martínez-González MÁ, Bes-Rastrollo M,
Spadafranca A, Vanzulli A, Battezzati A. 2015. Adherence to the Mediterranean diet is
inversely associated with visceral abdominal tissue in Caucasian subjects. Clinical Nutrition
34(6):1266–1272 DOI 10.1016/j.clnu.2015.10.003.

Bischl B, Lang M, Kotthoff L, Schiffner J, Richter J, Studerus E, Casalicchio G, Jones Z. 2016.
mlr: machine learning in R. Journal of Machine Learning Research 17(170):1–5.

Bonfanti N, Fernandez JM, Gomez-Delgado F, Perez-Jimenez F. 2014. Effect of two hypocaloric
diets and their combination with physical exercise on basal metabolic rate and body
composition. Nutricion Hospitalaria 29(3):635–643.

Bonnefoy M, Jauffret M, Kostka T, Jusot JF. 2002. Usefulness of calf circumference measurement
in assessing the nutritional state of hospitalized elderly people. Gerontology 48(3):162–169
DOI 10.1159/000052836.

Breiman L. 2001. Random forests. Machine Learning 45(1):5–32 DOI 10.1023/A:1010933404324.

Bruno E, Manoukian S, Venturelli E, Oliverio A, Rovera F, Iula G, Morelli D, Peissel B,
Azzolini J, Roveda E, Pasanisi P. 2017. Adherence to Mediterranean diet and metabolic
syndrome in BRCA mutation carriers. Integrative Cancer Therapies 17(1):153–160
DOI 10.1177/1534735417721015.

Burges CJC. 1998. A tutorial on support vector machines for pattern recognition. Data Mining and
Knowledge Discovery 2(2):121–167 DOI 10.1023/A:1009715923555.

Careau V. 2017. Energy intake, basal metabolic rate, and within-individual trade-offs in men and
women training for a half marathon: a reanalysis. Physiological and Biochemical Zoology
90(3):392–398 DOI 10.1086/691338.

Chrysohoou C, Panagiotakos DB, Pitsavos C, Das UN, Stefanadis C. 2004. Adherence to the
Mediterranean diet attenuates inflammation and coagulation process in healthy adults: the
ATTICA study. Journal of the American College of Cardiology 44(1):152–158
DOI 10.1016/j.jacc.2004.03.039.

Corral S, González M, Pereña J, Seisdedos N. 1998. Adaptación española del Inventario de
trastornos de la conducta alimentaria. Madrid: TEA.

Cortes C, Vapnik V. 1995. Support-vector networks. Machine Learning 20:273–297.

Cristianini N, Shawe-Taylor J. 2000. An introduction to support vector machines: and other kernel-
based learning methods. New York: Cambridge University Press.

Cutillas AB, Herrero E, De San Eustaquio A, Zamora S, Pérez-Llamas F. 2013. Prevalencia de
peso insuficiente, sobrepeso y obesidad, ingesta de energía y perfil calórico de la dieta de
estudiantes universitarios de la comunidad autónoma de la región de Murcia (España).
Nutricion Hospitalaria 28(3):683–689.

De la Montaña Miguélez J, Cobas N, Rodríguez M, Míguez Bernárdez M, Castro Sobrino L.
2012. Adherencia a la dieta mediterránea y su relación con el índice de masa corporal en
universitarios de Galicia. Nutrición Clínica y Dietética Hospitalaria 32(3):72–80.

De Viñaspre OP, Oronoz M. 2015. SNOMED CT in a language isolate: an algorithm for a
semiautomatic translation. BMC Medical Informatics and Decision Making 15(Suppl. 2):S5
DOI 10.1186/1472-6947-15-S2-S5.

Della Camera PA, Morselli S, Cito G, Tasso G, Cocci A, Laruccia N, Travaglini F, Del Fabbro D,
Mottola AR, Gacci M, Serni S, Carini M, Natali A. 2017. Sexual health, adherence to
Mediterranean diet, body weight, physical activity and mental state: factors correlated to each
other. Urologia Journal 84(4):221–225 DOI 10.5301/uj.5000255.

Downer MK, Gea A, Stampfer M, Sánchez-Tainta A, Corella D, Salas-Salvadó J, Ros E, Estruch
R, Fitó M, Gómez-Gracia E, Arós F, Fiol M, De-la Corte FJG, Serra-Majem L, Pinto X,
Basora J, Sorlí JV, Vinyoles E, Zazpe I, Martínez-González M-Á. 2016. Predictors of short-
and long-term adherence with a Mediterranean-type diet intervention: the PREDIMED
randomized trial. International Journal of Behavioral Nutrition and Physical Activity 13(1):67
DOI 10.1186/s12966-016-0394-6.

Dussaillant C, Echeverría G, Urquiaga I, Velasco N, Rigotti A. 2016.
Evidencia actual sobre los beneficios de la dieta mediterránea en salud. Revista Médica de
Chile 144:1044–1052.

Díaz-Uriarte R, De Andrés SA. 2006. Gene selection and classification of microarray data using
random forest. BMC Bioinformatics 7(1):3 DOI 10.1186/1471-2105-7-3.

Espina A, Ortego MA, De Alda IO, Yenes F, Aleman A. 2001. La imagen corporal en los
trastornos alimentarios. Body Shape in Eating Disorders 13(4):533–538.

Estruch R, Martínez-González MA, Corella D, Salas-Salvadó J, Fitó M, Chiva-Blanch G, Fiol M,
Gómez-Gracia E, Arós F, Lapetra J, Serra-Majem L, Pintó X, Buil-Cosiales P, Sorlí JV,
Muñoz MA, Basora-Gallisá J, Lamuela-Raventós RM, Serra-Mir M, Ros E. 2016. Effect of a
high-fat Mediterranean diet on bodyweight and waist circumference: a prespecified secondary
outcomes analysis of the PREDIMED randomised controlled trial. Lancet Diabetes &
Endocrinology 4(8):666–676 DOI 10.1016/S2213-8587(16)30085-7.

Estruch R, Ros E, Salas-Salvadó J, Covas M-I, Corella D, Arós F, Gómez-Gracia E,
Ruiz-Gutiérrez V, Fiol M, Lapetra J, Lamuela-Raventos RM, Serra-Majem L, Pintó X, Basora J,
Muñoz MA, Sorlí JV, Martínez JA, Martínez-González MA. 2013. Primary prevention of
cardiovascular disease with a Mediterranean diet. New England Journal of Medicine
368(14):130225030008006.

Farias G, Thieme RD, Teixeira LM, Heyde ME, Bettini S, Radominski R. 2016. Nutrición
Hospitalaria Trabajo Original. Nutricion Hospitalaria 33(5):1108–1115.

Fawcett T. 2006. An introduction to ROC analysis. Pattern Recognition Letters 27(8):861–874
DOI 10.1016/j.patrec.2005.10.010.

Fernandez-Lozano C, Gestal M, Munteanu CR, Dorado J, Pazos A. 2016a. A methodology for
the design of experiments in computational intelligence with multiple regression models. PeerJ
4(4):e2721 DOI 10.7717/peerj.2721.

Fernandez-Lozano C, Seoane J, Gestal M, Gaunt T, Dorado J, Pazos A, Campbell C. 2016b.
Texture analysis in gel electrophoresis images using an integrative kernel-based approach.
Scientific Reports 6(1):2064 DOI 10.1038/srep19256.

Finucane M, Stevens G, Cowan M, Danaei G, Lin JK, Paciorek CJ, Singh GM, Gutierrez HR,
Lu Y, Bahalim AN, Farzadfar F, Riley LM, Ezzati M. 2011. National, regional, and global
trends in body-mass index since 1980: systematic analysis of health examination surveys and
epidemiological studies with 960 country. Lancet 377(9765):557–567
DOI 10.1016/S0140-6736(10)62037-5.

Garner D. 1998. EDI-2: Inventario de Trastornos de la Conducta Alimentaria: Manual. Madrid: TEA.

Grao-Cruces A, Nuviala A, Fernandez-Martinez A, Martinez-Lopez E-J. 2015. Relationship of
physical activity and sedentarism with tobacco and alcohol consumption, and Mediterranean
diet in Spanish teenagers. Nutricion Hospitalaria 31(4):1693–1700.

Hechenbichler K, Schliep K. 2004. Weighted k-nearest-neighbor techniques and ordinal
classification. Collaborative Research Center 386, Discussion Paper 399.
DOI 10.5282/ubm/epub.1769.

Hu Z, Mao J-H, Curtis C, Huang G, Gu S, Heiser L, Lenburg ME, Korkola JE, Bayani N,
Samarajiwa S, Seoane JA, Dane MA, Esch A, Feiler HS, Wang NJ, Hardwicke MA,
Laquerre S, Jackson J, Wood KW, Weber B, Spellman PT, Aparicio S, Wooster R, Caldas C,
Gray JW. 2016. Genome co-amplification upregulates a mitotic gene network activity that
predicts outcome and response to mitotic protein inhibitors in breast cancer. Breast Cancer
Research 18(1):70 DOI 10.1186/s13058-016-0728-y.

Estruch R, Ros E, Salas-Salvadó J, Covas MI, Corella D, Arós F, Gómez-Gracia E,
Ruiz-Gutiérrez V, Lapetra J, Lamuela-Raventos RM, Serra-Majem L, Pintó X, Basora J,
Muñoz MA, Sorli JV, Martinez JA, Fitó M, Gea A, Hernan MA, Martinez-Gonzalez MA,
for the PREDIMED Study Investigators. 2018. Primary prevention of cardiovascular disease
with a mediterranean diet supplemented with extra-virgin olive oil or nuts. New England Journal
of Medicine 378(25):e34 DOI 10.1056/NEJMoa1800389.

Jover E. 1997. Índice cintura/cadera—obesidad y riesgo cardiovascular. Anales De Medicina
Interna 14:1–2.

Kanjo E, Younis E, Sherkat N. 2018. Towards unravelling the relationship between on-body,
environmental and emotion data using sensor information fusion approach. Information Fusion
40:18–31.



Kim J, Bowlby R, Mungall AJ, Robertson AG, Odze RD, Cherniack AD, Shih J,
Pedamallu CS, Cibulskis C, Dunford A, Meier SR, Kim J, Raphael BJ, Wu H-T, Wong AM,
Willis JE, Bass AJ, Derks S, Garman K, McCall SJ, Wiznerowicz M, Pantazi A, Parfenov M,
Thorsson V, Shmulevich I, Dhankani V, Miller M, Sakai R, Wang K, Schultz N, Shen R,
Arora A, Weinhold N, Sánchez-Vega F, Kelsen DP, Zhang J, Felau I, Demchok J, Rabkin CS,
Camargo MC, Zenklusen JC, Bowen J, Leraas K, Lichtenberg TM, Curtis C, Seoane JA,
Ojesina AI, Beer DG, Gulley ML, Pennathur A, Luketich JD, Zhou Z, Weisenberger DJ,
Akbani R, Lee J-S, Liu W, Mills GB, Zhang W, Reid BJ, Hinoue T, Laird PW, Shen H,
Piazuelo MB, Schneider BG, McLellan M, Taylor-Weiner A, Cibulskis C, Lawrence M,
Cibulskis K, Stewart C, Getz G, Lander E, Gabriel SB, Ding L, McLellan MD, Miller CA,
Appelbaum EL, Cordes MG, Fronick CC, Fulton LA, Mardis ER, Wilson RK, Schmidt HK,
Fulton RS, Ally A, Balasundaram M, Bowlby R, Carlsen R, Chuah E, Dhalla N, Holt RA,
Jones SJM, Kasaian K, Brooks D, Li HI, Ma Y, Marra MA, Mayo M, Moore RA, Mungall AJ,
Mungall KL, Robertson AG, Schein JE, Sipahimalani P, Tam A, Thiessen N, Wong T,
Cherniack AD, Shih J, Pedamallu CS, Beroukhim R, Bullman S, Cibulskis C, Murray BA,
Saksena G, Schumacher SE, Gabriel S, Meyerson M, Hadjipanayis A, Kucherlapati R,
Pantazi A, Parfenov M, Ren X, Park PJ, Lee S, Kucherlapati M, Yang L, Baylin SB,
Hoadley KA, Weisenberger DJ, Bootwalla MS, Lai PH, Van Den Berg DJ, Berrios M,
Holbrook A, Akbani R, Hwang J-E, Jang H-J, Liu W, Weinstein JN, Lee J-S, Lu Y, Sohn BH,
Mills G, Seth S, Protopopov A, Bristow CA, Mahadeshwar HS, Tang J, Song X, Zhang J,
Laird PW, Hinoue T, Shen H, Cho J, Defrietas T, Frazer S, Gehlenborg N, Heiman DI,
Lawrence MS, Lin P, Meier SR, Noble MS, Voet D, Zhang H, Kim J, Polak P, Saksena G,
Chin L, Getz G, Wong AM, Raphael BJ, Wu H-T, Lee S, Park PJ, Yang L, Thorsson V,
Bernard B, Iype L, Miller M, Reynolds SM, Shmulevich I, Dhankani V, Abeshouse A,
Arora A, Armenia J, Kundra R, Ladanyi M, Lehmann K-V, Gao J, Sander C, Schultz N,
Sánchez-Vega F, Shen R, Weinhold N, Chakravarty D, Zhang H, Radenbaugh A, Hegde A,
Akbani R, et al. 2017. Integrated genomic characterization of oesophageal carcinoma. Nature
541(7636):169–175.

Krzysztoszek J, Wierzejska E, Zielińska A. 2015. Obesity: an analysis of epidemiological and
prognostic research. Archives of Medical Science 11(1):24–33.

Lack G, Fox D, Northstone K, Golding J. 2003. Factors associated with the development of peanut
allergy in childhood. New England Journal of Medicine 348(11):977–985.

Li X, Li J, Wu Y. 2015. A global optimization approach to multi-polarity sentiment analysis.
PLOS ONE 10(4):1–18.

López-Sobaler AM, Aparicio A, Aranceta-Bartrina J, Gil Á, González-Gross M, Serra-Majem L,
Varela-Moreiras G, Ortega RM. 2016. Overweight and general and abdominal obesity in a
representative sample of spanish adults: findings from the ANIBES study. BioMed Research
International 2016:8341487.

Machado P, Romero J, Nadal M, Santos A, Correia J, Carballal A. 2015. Computerized measures
of visual complexity. Acta Psychologica 160:43–57 DOI 10.1016/j.actpsy.2015.06.005.

Martínez-González MA, García-López M, Bes-Rastrollo M, Toledo E, Martínez-Lapiscina EH,
Delgado-Rodriguez M, Vazquez Z, Benito S, Beunza JJ. 2011. Mediterranean diet and the
incidence of cardiovascular disease: a Spanish cohort. Nutrition, Metabolism and Cardiovascular
Diseases 21(4):237–244.

Martínez-González MA, Salas-Salvadó J, Estruch R, Corella D, Fitó M, Ros E. 2015. Benefits of
the Mediterranean diet: insights from the PREDIMED study. Progress in Cardiovascular
Diseases 58(1):50–60 DOI 10.1016/j.pcad.2015.04.003.



Marventano S, Godos J, Platania A, Galvano F, Mistretta A, Grosso G. 2017. Mediterranean diet
adherence in the Mediterranean healthy eating, aging and lifestyle (MEAL) study cohort.
International Journal of Food Sciences and Nutrition 69(1):1–8.

Melaku YA, Gill TK, Taylor AW, Adams R, Shi Z. 2017. Association between nutrient patterns
and bone mineral density among ageing adults. Clinical Nutrition ESPEN 22:97–106
DOI 10.1016/j.clnesp.2017.08.001.

Míguez Bernárdez M, De la Montaña Miguélez J, González Carnero J, González Rodríguez M.
2011. Concordancia entre la autopercepción de la imagen corporal y el estado nutricional en
universitarios de Orense. Nutricion Hospitalaria 26(3):472–479.

Norman K, Smoliner C, Valentini L, Lochs H, Pirlich M. 2007. Is bioelectrical impedance vector
analysis of value in the elderly with malnutrition and impaired functionality? Nutrition
23(7–8):564–569 DOI 10.1016/j.nut.2007.05.007.

Ortega Anta RM, López Sobaler AM. 2014. Primeras Jornadas UCM-ASEN Avances y
controversias en nutricion y salud. Nutrición Hospitalaria 30(2):21–28.

Pons P, Jaen J, Catala A. 2017. Assessing machine learning classifiers for the detection of animals’
behavior using depth-based tracking. Expert Systems with Applications 86:235–246
DOI 10.1016/j.eswa.2017.05.063.

Pérez C, Aranceta J. 2011. ¿Es posible la dieta Mediterránea en el siglo XXI? In: Alonso E,
Varela G, Silvestre D, eds. La dieta Mediterránea en el marco de la nutrición comunitaria: luces y
sombras. Madrid: Instituto Tomás Pascual Sanz, 147–162.

Pérez-Caballero G, Andrade J, Olmos P, Molina Y, Jiménez I, Durán J, Fernandez-Lozano C,
Miguel-Cruz F. 2017. Authentication of tequilas using pattern recognition and supervised
classification. TrAC—Trends in Analytical Chemistry 94:117–129.

R Core Team. 2016. R: a language and environment for statistical computing. Vienna: The R
Foundation for Statistical Computing. Available at http://www.R-project.org/.

Ravasco P, Anderson H, Mardones F. 2010. Métodos de valoración del estado nutricional.
Nutricion Hospitalaria 25(Suppl. 3):57–66.

Rodriguez A, Fernandez-Lozano C, Dorado J, Rabuñal JR. 2014. Two-dimensional gel
electrophoresis image registration using block-matching techniques and deformation models.
Analytical Biochemistry 454:53–59 DOI 10.1016/j.ab.2014.02.027.

Rodriguez Rodriguez E, Lopez Plaza B, Lopez Sobaler M, Ortega R. 2011. Prevalencia de
sobrepeso y obesidad en adultos españoles. Nutricion Hospitalaria 26(2):355–363.

Rolland Y, Lauwers-Cances V, Cournot M, Nourhashémi F, Reynish W, Rivière D, Vellas B,
Grandjean H. 2003. Sarcopenia, calf circumference, and physical function of elderly women: a
cross-sectional study. Journal of the American Geriatrics Society 51(8):1120–1124
DOI 10.1046/j.1532-5415.2003.51362.x.

Romero Pérez A, Rivas Velasco A. 2014. Adherence to Mediterranean diet and bone health.
Nutricion Hospitalaria 29(5):989–996.

Saeys Y, Inza I, Larrañaga P. 2007. A review of feature selection techniques in bioinformatics.
Bioinformatics 23(19):2507–2517 DOI 10.1093/bioinformatics/btm344.

Samworth RJ. 2012. Optimal weighted nearest neighbour classifiers. Annals of Statistics
40(5):2733–2763 DOI 10.1214/12-AOS1049.

Savanelli MC, Barrea L, Macchia PE, Savastano S, Falco A, Renzullo A, Scarano E, Nettore IC,
Colao A, Di Somma C. 2017. Preliminary results demonstrating the impact of Mediterranean
diet on bone health. Journal of Translational Medicine 15(1):81
DOI 10.1186/s12967-017-1184-x.

Serra-Majem L, Roman B, Estruch R. 2006. Scientific evidence of interventions using the
Mediterranean diet: a systematic review. Nutrition Reviews 64(2 Pt 2):S27–S47
DOI 10.1111/j.1753-4887.2006.tb00232.x.

Shapira N. 2017. The potential contribution of dietary factors to breast cancer prevention.
European Journal of Cancer Prevention 26(5):385–395 DOI 10.1097/CEJ.0000000000000406.

Shapiro SS, Wilk MB. 1965. An analysis of variance test for normality (complete samples).
Biometrika 52(3–4):591–611 DOI 10.1093/biomet/52.3-4.591.

Skoraczyński G, Dittwald P, Miasojedow B, Szymkuć S, Gajewska EP, Grzybowski BA,
Gambin A. 2017. Predicting the outcomes of organic reactions via machine learning: are current
descriptors sufficient? Scientific Reports 7(1):3582 DOI 10.1038/s41598-017-02303-0.

Sofi F, Macchi C, Abbate R, Gensini GF, Casini A. 2014. Mediterranean diet and health status: an
updated meta-analysis and a proposal for a literature-based adherence score. Public Health
Nutrition 17(12):2769–2782 DOI 10.1017/S1368980013003169.

Srivastava R, Batra A, Dhawan D, Bakhshi S. 2017. Association of energy intake and expenditure
with obesity: a cross-sectional study of 150 pediatric patients following treatment for leukemia.
Pediatric Hematology and Oncology 34(1):29–35 DOI 10.1080/08880018.2016.1272025.

Sámano R, Rodríguez-Ventura A, Sánchez-Jiménez B, Godínez E, Noriega A, Zelonka R,
Garza M, Nieto J. 2015. Satisfacción de la imagen corporal en adolescentes y adultos mexicanos
y su relación con la autopercepción corporal y el índice de masa corporal real. Nutrición
Hospitalaria 31(3):1082–1088.

Štefan L, Čule M, Milinović I, Sporiš G, Juranko D. 2017. The relationship between adherence to
the Mediterranean diet and body composition in Croatian university students. European Journal
of Integrative Medicine 13(Suppl. C):41–46.

Tibshirani R. 1994. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical
Society, Series B (Methodological) 58(1):267–288.

Travé TD, Castroviejo A. 2011. Adherencia a la dieta mediterránea en la población universitaria.
Nutrición Hospitalaria 26(3):602–608.

Trichopoulou A. 2004. Traditional Mediterranean diet and longevity in the elderly: a review.
Public Health Nutrition 7(7):943–947 DOI 10.1079/PHN2004558.

Trichopoulou A, Bamia C, Norat T, Overvad K, Schmidt EB, Tjonneland A, Halkjær J,
Clavel-Chapelon F, Vercambre M-N, Boutron-Ruault M-C, Linseisen J, Rohrmann S,
Boeing H, Weikert C, Benetou V, Psaltopoulou T, Orfanos P, Boffetta P, Masala G, Pala V,
Panico S, Tumino R, Sacerdote C, De-Mesquita HBB, Ocke MC, Peeters PH, Van der
Schouw YT, González C, Sanchez MJ, Chirlaque MD, Moreno C, Larrañaga N, Van Guelpen B,
Jansson J-H, Bingham S, Khaw K-T, Spencer EA, Key T, Riboli E, Trichopoulos D. 2007.
Modified Mediterranean diet and survival after myocardial infarction: the EPIC-Elderly study.
European Journal of Epidemiology 22(12):871–881 DOI 10.1007/s10654-007-9190-6.

UNESCO. 2010. The Mediterranean diet inscription on the representative list of the intangible
cultural heritage of humanity. Available at https://ich.unesco.org/doc/src/17331-EN.pdf.

Vapnik VN. 1995. The nature of statistical learning theory. New York: Springer New York, Inc.

Vert JP, Tsuda K, Schölkopf B. 2004. A primer on kernel methods. In: Kernel methods in
computational biology. London: MIT Press, 35–70.

Widmer RJ, Flammer AJ, Lerman LO, Lerman A. 2015. The Mediterranean diet, its components,
and cardiovascular disease. American Journal of Medicine 128(3):229–238
DOI 10.1016/j.amjmed.2014.10.014.

Zaragoza Martí A, Ferrer Cascales R, Cabañero Martínez MJ, Hurtado Sánchez JA,
Laguna Pérez A. 2015. Adherencia a la dieta mediterránea y su relación con el estado
nutricional en personas mayores. Nutrición Hospitalaria 31(4):1667–1674.

Zou H, Hastie T. 2005. Regularization and variable selection via the elastic net. Journal of the Royal
Statistical Society: Series B (Statistical Methodology) 67(2):301–320
DOI 10.1111/j.1467-9868.2005.00503.x.
