Introduction
Green spaces have become an important field of study in urban health research focused on the relationship the between built environment and health outcomes. Several studies have stressed the beneficial impact of green spaces on the population's health by reducing harm (i.e., exposure to air pollution), restoring capacities (i.e., psychophysiological stress recovery), and building capacities (i.e., encouraging physical activity and facilitating social cohesion).1,2
Different categories of green spaces have been used in urban research: “green land use” usually refers to land areas in urban planning allocated for green use, such as vegetation or parks, and “urban parks” usually refers to green spaces within cities with vegetation and accessible for pedestrians.3 Urban parks have been studied as the main pathway between green spaces and physical activity within cities. However, a standard and operative definition of what constitutes an urban park is not yet established, although a set of characteristics have been proposed as those required to promote physical activity.3 The methods conducted to estimate and define the extension of urban parks have widely differed among studies: while some used processed cartographic layers of information of green land-uses provided by public institutions, other used satellite images to create their own layers or to estimate the level of greenness by performing complex operations like the NDVI.4 The former represent an easy-to-implement methodology with ready-to-use data, but with limited capacity to define what is considered as an urban park, while the use of satellite images allows a more flexible definition of urban park, but it implies a complex methodology using Geographic Information Systems (GIS). Different urban park definitions can yield different results in public health studies: while some found an association between urban parks and mental health using land-use layers, others found no association using obesity as the outcome variable.5,6 Alongside this complexity, the need of relying on secondary data could difficult these analysis, since each database was created to respond to a specific purpose and it could gather different information.
This paper aims to present an approach to create a comprehensive spatial database of urban parks for public health research purposes using the example of Madrid.
Methodology
Study setting
The proposed approach is for the study of urban parks using the example of Madrid, the capital of Spain, with 3,334,730 inhabitants, according to the National Statistics Institute for the 2020 year, and an area of 604,45km2.
Database selection
Two specific databases offering data on green spaces and/or urban parks in geospatial vector format (shapefile) were gathered: 1) the land uses reported in the General Urban Development Plan of the Regional Government (GUDP) of the year 1997 -land use layer-,7 and, 2) the green spaces for the year 2019 whose maintenance is the responsibility of the City Council -green space layer-, provided by the Sub-Directorate for the Conservation of Green Areas and Urban Trees. Both were collected by public institutions. Despite the year of the GUDP is 1997, this is the main tool to guide the urban development in the Spanish context, and some further updates until 2016 of the plan were included in the database.
Spatial database processing
The land-use layer comprises information on the land use distribution, in which the extent of the city is divided into polygons with different land-use (e.g., residential, industrial, etc.). This layer was filtered to keep those polygons labelled as any “green land use”. In contrast, the green space layer is a more specific database representing only those areas that constitute green spaces within the city. We used GIS to merge both layers into a new one -final layer- depicting all the areas classified as “green space” by the land-use or the green space layer. We identified some polygons in which both layers were overlapped. Both sources classify these polygons as “green space”, and we automatically included them in the final layer. However, we found polygons that were not coincident (i.e., areas depicted as “green space” in the land use layer, but not in the green space layer, and vice versa).
To solve this conflict, two researchers conducted a two-step analysis. Firstly, one of the researchers assessed, using photo-interpretation with Google Street View imagery, whether each conflictive polygon complied the following criteria:
As the GUDP might account as “green spaces” for some polygons that are not currently built, but it is expected to be converted into a green space in the future, the park should be already built and verifiable using Google Street View imagery.
Urban parks should be accessible for pedestrians, considered as the presence of at least one walking path around the perimeter or crossing the area.
The researcher classified the conflictive polygons into three categories depending on whether the polygon should be included, excluded, or to review. As the database information was previously validated by public institutions, we considered that only a double-check was needed for the conflictive polygons. Thus, the other researcher checked those polygons categorized as ‘excluded' or ‘to review', and classified them following the previous criteria. Discrepancies were discussed between the researchers, and the final decision was based on their agreement. Both databases were comprised of 8124 polygons, of which 545 (6.7%) were non-coincident, and 97 of these (17.8%) were finally included at the end of the process. This database had an area of 93.03km2.
Then, adjacent polygons were dissolved to be considered as a unique urban park. Subsequently, a polygon size filter was applied to remove those polygons smaller than 0.5ha, a threshold posed by the World Health Organization as the minimum size for a park to promote physical activity.3 The final area of the layer was 70.1km2, which supposes a 75.35% of the previous one.
At the end of the process, the final layer was composed of those polygons corresponding to urban parks of the city of Madrid, accessible for pedestrians and whose area is bigger than 0.5ha. Figure 1 illustrates the methodological design of this study.
All the spatial processing of databases was conducted using QGIS software (A Coruña 3.10 version).
Practical use
The final spatial database of green spaces could be used for several purposes. Although we focus on the relationship between urban parks and health, environmental studies have stressed the importance of accessibility and availability of parks in terms of environmental justice.8 Other researchers placed parks at the center of long-term urban dynamics such as gentrification and try to understand the potential side effects of these areas in terms of economic, social, and health impact.9 Although both databases came from a different public institution, we considered that this approach enabled us to obtain a more comprehensive database with information related to urban parks that could be replicable in other settings where unified databases for parks are not available.
Conclusions
The study of the impact of urban parks in different research areas has gained interest in the urban research field. A combination of different databases may be useful to gather comprehensive information on the location and characteristics of green spaces, thus overcoming the problems of definition and heterogeneity of parks databases, and allowing for a better understanding of the relationship of these urban assets with different outcomes in public health studies.