Highlight
Numerous studies have demonstrated the validity of Infodemiology (a methodology that combines information theory with the study of epidemiology) applied to topics such as flu outbreaks or venereal diseases, which allows us to infer that it can be used successfully in the field of public health for other topics of interest.
In this case, a technique for extracting information from the general interests of the population is applied to a disease, Metabolic Syndrome, never used before.
Thanks to the results obtained, public health researchers will have an added monitoring tool for active surveillance within the Metabolic Syndrome disease. This tool is of special interest for prevention campaigns developed by Public Health centres.
Introduction
Since 1988, when Reaven(1) first systematically described it, an abundance of research has advanced an understanding of the pathophysiology, epidemiology, prognostic implications, and therapeutic strategies related to the Metabolic Syndrome (MS).
The MS is a clinical entity with metabolic and hormonal alterations, characterized by a set of risk factors: abdominal obesity, insulin resistance, hypertension, and dyslipidemia. It is a complex disorder related to the endocrine system that, along with a sedentary lifestyle and unhealthy eating habits, affects many people around the world and greatly interferes with the quality of life and in the employment of this population(2),(3).
Thus, Niazi et al.(4) assessed the MS frequency and its associated risk factors in health workers, finding that, in addition, age, shiftwork and inactivity were associated to MS which was an important risk factor to be considered for employees’ public health. Furthermore, Yamaguchi et al.(5) pointed out the work stress impact on workers’ health, giving that the increase requirements in work throughout time was associated with a higher risk of MS.
A current publication shows that the number of people suffering from MS is gradually and constantly increasing. In this paper, it is claimed that this growth is occurring because of the campaigns that promote eating healthy and do the required exercise are not enough or not accomplish their purpose. Therefore, the authors request to carry out campaigns aimed at the public considering their language and concerns(6).
Facing this situation, the idea that the population provides data on their tastes, searching services including the search on their disease athwart the search information behaviour on Web has already been explored in recent years(7).
Google is an engine search that provides information to whatever person, which by means of the procured outcomes may readily access to the existed documents on the Network. Google, which does not need presentation, was founded in 1997, and it is considered the most search engine in the world with a market share that thoroughly overflows other search engines as Baidu o Yandex (the most used engines in China and Russia respectively)(8). Although, Google Trends (GT) is not Google’s best-known implement, is a freely accessible tool that reports the volume of searches conducted by the users worldwide to expound on how frequently a term is searched and in what places. The search data on the Internet may provide helpful information on the demeanour population patterns(9),(10).
In the health field, Eysenbach(11),(12) coined the term “infodemiology” (information + epidemiology) as an emerging set of public health information methods in order to analyse search behaviour, communication, and publication on the Internet. That is, observing and analysing the behaviour based on the Web to know the human demeanour with the purpose of foretelling, assessing, and even preventing the health-related problems that constantly arise in the quotidian life(13).
Nowadays, in a marked scenario by the growing usage of social networks and interactivity resources, its inclusion in digital spaces for health communication becomes a relevant issue. Over the past decade, using Web-based data on health public issues in which infodemiology has shown to be useful for assessing various human demeanour features.
As Orduña-Malea(14) pointed out, it may be used in newscasting tasks (forecasting the present, that is, foretelling values that are occurring at the same time data is generating and it may be used as well in forecasting tasks (predicting future trends). Thus, this tool may facilitate to know many issues i.e., the most searched services, which are the new trends, and what needs are users demanding. GT is the indicated tool for gathering information and it has already been used in a wide range of topics hitherto, with studies in the field of diabetes(15), obesity(16), hypertension(17), diet(18) and occupational health(19) among others.
Consequently, the aim of this paper was to study and relate the population interest athwart information search trends on Metabolic Syndrome (including Abdominal Obesity, Hypertension, Diabetes, Cholesterol and Triglycerides) with the Occupational Health (OH).
Methods
Data collection search
The information search data was procured by means of direct search, online access, and Google Trends (GT): https://trends.google.es/.
The scope was worldwide and in all categories. The study period was from 1 January 2004 to 30 September 2020. Searching and compiled data date were 10 October 2023.
To examine the evolution of the Relative Search Volume (RSV), the period to study was segmented into three periods: The first was from January 2004 (when GT provided the first data) to December 2009, the second from January 2010 to December 2016, and the third period was from January 2017 to September 2023.
Tool
Google Trends (GT) is free and open access that provides standardized statistics of Google Trends for different searches since January 1, 2004. It analyses the inquiries to delimit how many searches were carried out on a specific term in comparison with the total quantity of the conducted searches by users on Google for the same term and in the same period.
GT rules out terms with low volume search or duplicated searches performed by the same user in a short while.
Search topic
Searches were conducted on the following topics: ““Metabolic Syndrome”, “Hypertension”, “Diabetes”, “Central Obesity”, “Cholesterol”, “Triglycerides” and “Occupational Health”. The outcomes were taken from over the World and in all categories.
The results of the terms sharing the same concept in any language were obtained considering the use of the word as a “Topic” in this tool. (For example: if it is searched “Occupational Health”, the search includes results of the following topics “Job Security”, “Safety Committee” o “Health at Work”, among others).
Data collection and storage
The procured outcomes were downloaded in a standardized “CSV” (comma-separated values) format, which allowed them to be stored in a spreadsheet file. The quality control of this information was performed through double tables, amending the possible inconsistencies by consulting the original downloaded table.
Variables
Relative Search Volume (RSV): Result provided by Google Trends whose values are normalized on a scale from 0 (relative search volume less than 1% of the maximum volume) to 100 (relative search volume reaches its maximum). For instance, an RSV = 25 represents 25% of the highest observed search proportion during the period under study.
Trend: temporal behaviour and evolution of the searches carried out in a specific topic, long term.
Seasonality: periodic and foreseeable variation in a time series with a period less than or equal to one year.
Unfolding level by country: an indicator created by the “United Nations Development Programme” that measure the breakthrough degree of each country dealing with variables such as life expectancy, education, and per capita income. For its classification, the United Nations Statistics Division Website (http://data.un.org/) was consulted, which determines the three levels of development: developed, developing and least developed.
Data analysis
For quantitative data, its mean, and standard deviation (SD), median, maximum, and minimum were calculated. The Kolmogorov-Smirnov test (with Lilliefors correction) was used to verify the normality of the variables. The Kruskal Wallis test was employed to compare the medians amid groups, and the Wilcox test with Benjamini & Hochberg adjustment was used to study the association between groups.
The temporal evolution of search trends was analysed by means of regression analysis calculating the coefficient of determination (R2). Pearson correlation coefficient was employed to procure the relation amid quantitative variables.
The Kwiatkowski-Phillips-Schmidt-Shin (KPSS) test (tests the void hypothesis that an observable time series is seasonality against the alternative of a unit root) was used to verify seasonality.
The level of significance used in all the hypothesis tests was α ≤ 0.05, using asterisks to represent the strength of the association: * p-value < 0.05; ** p-value < 0.01; *** p-value < 0.001 for Kolmogorov-Smirnov test, Kruskal Wallis test and Wilcox test. * p-value > 0.05 for Kwiatkowski-Phillips-Schmidt-Shin (KPSS) test.
The R software version 4.2.2 with the Rstudio work suite version 2023.09.0 build 463 was used for data analysis.
Results
Relative Search Volume (RSV)
From the study of the procured data from GT it was verified that the Kolmogorov-Smirnov test discarded normality for all the subjects under study (“Metabolic Syndrome”, “Hypertension”, “Diabetes”, “Central Obesity”, “Cholesterol”, “Triglycerides” and “Occupational Health”: p < 0.05) (Table 2), thus, we worked with nonparametric population comparison tests. The RSV values for each topic can be found in Table 1 and the trends can be seen in Figure 1.
Table 1. Median and Pearson Determination Coefficient for RSV.

M = Median; PDC = Pearson Determination Coefficient
Statistical results correspond to Pearson Correlation Coefficient with his corresponding p-values:
*p-value < 0.05;
**p-value < 0.01;
***p-value < 0.001
Table 2. Statistical results and normality test.

Min = Minimum; SD = Standard Deviation; Max = Maximum; KS = Kolmogorov-Smirnov normality test
Statistical results correspond to Kolmogorov-Smirnov test with his corresponding p-values:
*p-value < 0.05;
**p-value < 0.01;
***p-value < 0.001
The relationship between MS and OH demonstrated a positive correlation (R = 0.56; p < 0.001). The correlation observed between MS and its related topics featured the following outcomes: regarding Hypertension (R = - 0.37; p < 0.001), Diabetes (R = -0.36; p < 0.001), Central Obesity (R = - 0.70; p < 0.001), Cholesterol (R = -0.08; p > 0.05), Triglycerides (R = - 0.40; p < 0.001).
Trends
From the RSV data searches, the search trend was obtained for the studied Topics (Figure 1), showing the median and the evolution (established by the coefficient of determination) for the time series, as well as for each of the three analysed periods. Specifically, for MS and OH, a linear decreasing pattern was observed in both cases.
Having segregated the data according to the studied period, it was possible to calculate using the Kruskal-Wallis test, whether the RSV demonstrated the same distribution (Figure 2). It has been noted that in all the analysed Topics and for all periods, significant differences were found, of lesser or greater intensity, except for two comparisons: the Hypertension, where the was no relationship between the first and second period, and the Central Obesity, where there was no relationship between the second and third period.
Seasonality
Throughout the years, the study seasonality in the searches was annually performed for the study period. Using the specific and non-parametric test for seasonality search (KPSS), it was observed that only the Topic Metabolic Syndrome featured seasonality. The situation was not observed in the searches with the other Topics (Figure 3).
Interest according to country
The interest in searches for the topics studied by country can be seen in Figure 4, which highlights the topic that featured the most searched topic. Neither of them the MS was the most searched topic.
To determine the possible differences in population interest, procured from RSV, according to the level of development of the country, following the classification of the United Nations Development Program, a group comparison analysis was carried out for each topic. For the OH topic no significant differences were found in the RSV, albeit there were meaningful differences on the SM Topic between developed and developing countries versus underdeveloped countries, see Figure 5.
Discussion
In this work, it has been verified that GT provides information on the population’s interest in disease data and permits the analysis of differences in the popularity of certain Topics and even between different countries over time. In this infodemiological ecological study, the global popularity of searches on MS and OH related topics was ranked among Google users. The analysis disclosed some results that deserve a detailed discussion.
Nevertheless, it should take into consideration that this is an ecological data analysis, and the findings might not be representative at the individual level(20),(21). For instance, RSV trends are at the population level and may not deduce that only the individuals who suffer from metabolic syndrome are those who indeed create all the search volume concerning this pathology.
Relative Search Volume (RSV) and Trends
The good correlation observed between the MS and OH Topics demonstrates a similar trend in both searches for this information, albeit there is a disparity in the correspondences amid MS and its component pathologies (Hypertension, Abdominal Obesity, Diabetes, Cholesterol and Triglycerides). This situation can also be seen in the RSV medians, which in all cases are higher than that calculated for MS. To these data, it should be added that while the ailments that make up the MS feature increasing trends in the search of information, both MS and OH demonstrated diminishing results (significantly lower in the current period).
Although the relationship between MS and OH is not coincidental(22), these observations might be because the general population are acquainted with the diseases that make up MS, while the ailments that compound MS and OH are not, or at least not to the degree of the other search Topics. This would justify the relation amid these two Topics with each other, but the different demeanour with the others. Thus, the low correlation observed between some terms might have been due to their different social awareness.
This assessment is according to Santacruz-Salazar et al work(23), on MS knowledge in overweight or obese patients, in which they concluded that these people had a dearth of knowledge of MS aspects in comparison with those who were overweight, and the participants recognized SM as symptomatic pathology, but not as a set of risk factors. Lo et al.(24), carried out similar research on the knowledge of MS in Chinese adults, in which study participants showed a poor understanding of this pathology. By the same token, another study demonstrated that education level has an impact on MS knowledge in elderly people, and, thus, the development of health education programs would enhance the prevention and self-control of SM(25).
Ayala-Aguirre et al.(26), in a recent study, pointed out that searching by using technical words as in the case of MS is not common in GT. The vast majority of the population does not search using technical terms; they tend to perform this type of search using general topics or frequently common words. That is to say, the diseases that make up MS are better known in isolation, and therefore, it is not startling that these diseases are more searched by the general population as well as they associate them with MS.
Seasonality
In most of the tests for searching seasonality, the null hypothesis is the existence of a unit root that would discard seasonality, notwithstanding, in the KPSS test we use the analysis of moving median over the time period of study to straightway test for seasonality, allowing the detection of time trends, even if we do not have constant mean signals over the entire time period of the study. In this way, this test is more robust when finding seasonality in a signal in which the trend is not gradually constant (recall the trends in figure 1). Moreover, it is the non-parametric version of the moving median study, so it is appropriate for our variables under study.
Occasionally, MS has been featured as an epidemic disease(27) as well as related to healthy lifestyles(28). Seasonal change in the incidence of the epidemiological disease is a common phenomenon. Although sometimes the mechanisms responsible for this incidence and the epidemiological aftermath of seasonality are not well understood(29).
On this matter, the possible variations in the search information were studied and was verified whether it demonstrated a temporal evolution (according to seasonality) in the form of sawtooth, as pointed out in a previous study(10). On the other hand, it was not possible to observe any milestone, as the special interest that responded to specific events (i.e., significant increase in the incidence of the disease, specific prevention campaigns and so on) since they generally result in a heightened interest of the population in the subject and thus, a considerable number of information searches(30). The detection of milestones (peaks of searches) is relevant information to demonstrate epidemiological behaviour as well as to verify the relationship with certain moments of increasing disease data(31),(32).
In this study, it was not observed that any Topic features an evolution in its RSV in the form of sawtooth nor it was possible to corroborate the existence of any notable milestone, even so moderate seasonality was found for the MS and Diabetes topics, however, it was not conclusive for the rest of the analysed topics.
Interest according to country
The RSV data obtained with GT facilitate to have an insight into the interest of the public, from different countries, in the topic under study. As it has been proved and according to previous work(21), search engine trends are a tool that may integrate into real-time the monitoring of the population’s health information needs.
From the procured outcomes, which were shown visually on the world map, it was found that in none of them was the MS the most searched topic. However, it should be noted that there were significant differences regarding information search on MS between developed and developing countries in contrast with the less developed countries. These results would be consistent with what we have seen previously when studying RSV and search trends in which it was observed that the education level influences knowledge about MS(33) and consequently, in the information searching interest.
Although the onset of MS was in the Western world, however, it has become a truly global problem(27). The main causes were increased urbanization, nutritional transition and reduced physical activity, these conditions gave rise to the prevalence of MS to increase swiftly in developing countries, leading to rising morbidity and derived mortality(34) which would justify the increased interest in information search on MS.
However, when analysing RSV´s, it should be taken into consideration that it is difficult to know the responsible relationship of outcomes by region regarding Google access (or any search engine) and these may be due to different reasons (appearance of news in the main media of a certain country, regional health campaigns focused on a certain disease and so on), and, remember that GT only provides results of searches that have high traffic(35).
On the other hand, no country was found where the RSV of the Topics under study was not reported. This circumstance, if it was found, is mainly due to two motives: the digital divide or the filtering of social content.
Regarding the digital divide, it is not only a technological problem, but also the differences that appear in having or not an Internet connection, or in the type of connection. Digital health services depend on sociodemographic and socioeconomic factors. The filtering of social content (censorship) befalls in countries for which Internet blocking is observed, censoring topics that contradict the accepted social rules of a country (pornography, gambling, alcohol, and drugs, LGTBI content and so on). These countries that are characterized by censoring online news and information are known as “enemies of the Internet”(36),(37),(38).
Limitations of this study
According to Johnson & Mehta(20), given that Google Trends does not provide real usage data and more accurate time intervals, decreases the forecasting capability. Besides, greater transparency is lacking, since there is no information on the specific methods and models that Google uses to calculate the RSV, and as it has been suggested in several publications(10),(21),(39), the outcomes procured using this tool might be swayed by media interest, mainly advertising campaigns, which may not exactly correspond to the interest of the general population.
Conclusion
Through their information searches, the whole population showed to have a dearth of knowledge of MS than of its component diseases. A relationship was found between the information searches carried out on MS and OH.
The study of information search trends may provide useful information on the population’s interest in the disease data, as well as would gradually allow the analysis of differences in popularity, or interest even between different countries.


















