Spatial modeling in disease epidemic studies

By Avishek Majumder & Chandrika Kapagunta on October 3, 2017

Epidemiology of infectious diseases often requires geographical information. Both spatial and temporal factors of a population can affect disease spread. Spatial factors refer to the geographical or topological factors associated with a disease. Temporal factors mean time-bound progress of a disease in a population. These factors influence the virulence of the disease, patterns of prevalence and also future incidents. Temporal factors in disease epidemiology influence the progress of a disease in the population. They also offer clues on the source of the disease and its associated risk factors. However, spatial information of a disease can strongly predict disease prevalence. This is done by linking climate and environmental information to disease prevalence patterns. Statistical models like spatial models can determine risk factors, causality and predict future trends. Therefore, this article discusses the role of spatial modelling in disease epidemiology.

Use of spatial modelling in identifying the spatial structure of diseases

The study of geographical variations of a disease or risk factors is known as spatial epidemiology (Ostfeld, Glass, & Keesing, 2005). Several spatial methods and models have been adopted in epidemiology. Combination of spatial and temporal factors along with multilevel data is known as spatial modelling. It refers to statistical and mathematical modelling of spatial data related to diseases due to its ability to help link secondary factors to distribution trends in a population. Therefore, the underlying spatial structure of disease data can be identified and used to devise control strategies.

Need for spatial modelling in epidemiology

Spatial or geographical data is often stored in the form of a Geographical Information System (GIS) data. It provides information on the location of a disease event. The location is shared in the form of latitude or longitude coordinates and village or district level data. Distances between host-vector-pathogens influence the rate and extent of disease spread. Moreover, spatial information can also provide information on host-vector-pathogen interactions (Ostfeld et al., 2005). Therefore, spatial information is crucial in epidemiology.

Interaction data can also help in designing disease control strategies and predicting future epidemics. Here secondary factors like socioeconomic factors, environmental and climate factors are control factors (Martins & Rocha, 2012). Since spatial epidemiology data are often discrete, the spatial heterogeneity of an epidemic can be quantified (Lambin et al, 2010). Furthermore, using spatial modelling, locations of other factors (environment, pollutant, pathogen etc.) can be connected to disease incidents. Inference on possible risks to the disease can be drawn based on the findings. Finally, it enables isolation of factors contributing to this heterogeneity as well as predicting future disease trends.

Goals of spatial modeling in disease epidemiology (Martins & Rocha, 2012). — Goals of spatial modelling in disease epidemiology (Martins & Rocha, 2012).

Difference between spatial modelling and mechanistic models

Furthermore, infectious disease research adopted mechanistic models. In such models, spatial information was treated as constant. However, there is a need to represent spatial data of diseases as a variable. Dynamic influence on the mode and trends of disease spread aid this supposition (Linard & Tatem, 2012). Therefore, spatial modelling of disease epidemics, also known as spatial epidemiology came into being. Spatial epidemiology has been defined as “the study of spatial variation in disease risk or incidence” (Ostfeld et al., 2005). In spatial epidemiology, statistical modelling in the form of spatial or spatial modelling is an important tool. Furthermore, it helps identify influential factors and their relationship with disease prevalence in a population.

Spatial epidemic models

Several types of spatial models exist, depending on the purpose of the study. These are exploring data, inferring from data and predicting future disease patterns. These methods can also be grouped based on three spatial perspectives where disease information can be analysed and applied (Robb, Bauer, & Vena, 2016). This is represented in the figure below.

In traditional methods, disease data is quantified to understand the trends. In Acute event methods, the disease data is understood better during epidemics to create control strategies. Thus, both methods subsequently assist in applying knowledge at the community level.

Types of spatial methods based on epidemiological perspectives (Robb, Bauer, & Vena, 2016)

Statistical models under spatial modelling

Depending on the data scale and the purpose of analysis, Moise, Cunningham, & Inglis, (2015) have described several important methods. These are represented in the table below. Furthermore, Methods applied for describing or exploring data is different from those for predicting patterns.

No.	Type of Analysis Method	Purpose
Exploratory or descriptive analysis
	Global Patterns
	Quadrant count analysis	Description of disease features
	Nearest Neighbour analysis
	Ripley’s K-function
	Local Patterns
	Hot Spot Analysis	Mapping of Clusters
	Anselin’s local indicators of spatial association (LISA)
	Kulldorff’s Scan Statistic
Predictive Spatial Patterns
	Voronoi (Thiessen) polygons	Prediction of Spatial Patterns
	Inverse Distance Weighting (IDW)
	Kriging
	Kernel Density Estimation (KDE)
	Disease Mapping
	Disease Clustering
	Ecological Analysis

Types of spatial analysis methods using GIS information (Moise et al., 2015)

Standard techniques are available to analyse data. However, researchers devise new methods to analyse data better. This depends upon the purpose of the study and the limitations of existing models. As a result, researchers are continuously developing a large number of models.

A systematic review of spatial models developed

Different authors have conducted spatial epidemiological studies for infectious diseases. Various models and methods have been applied in the process. Some of them have been reviewed below.

No.	Model name	Disease under study	Geographic region	Variables studied	Results
1	Logistic Regression model (Kleinschmidt, Bagayoko, Clarke, Craig, & Le Sueu, 2000).	Malaria	Africa	Climatic, population and topographic.	Residual spatial patterns reduced, and residual spatial dependence modelled by kriging. The end predictions made sense from an entomological perspective.
2	Global K-Functions (Getis, Morrison, Gray, & Scott, 2003).	Dengue		Age, poverty, education and household income.	Adults clustered strongly within houses and weakly to a distance of 30 meters beyond the household.Clustering not detected beyond 10 meters for positive containers or pupae.
3	Spatial regression model in Bayes.X (Kazembe, 2007).	Malaria	Northern Malawi	Age, environmental factor, altitude, precipitation and soil water capacity.	Malaria risk increased with altitude, precipitation, and soil water capacity.
4	Fixed Effects Grouped Logistics Regression model and Bayesian Space-Time Multivariate model (Pullan et al., 2011).	Helminths	Kenya	The age range of sampled individuals, a method of diagnosis, environmental and covariates.	In 2009, high certainty that endemicity was below the 20% threshold, helminths prevalence decreased over time and approximately 2.8 million school-age children were living in endemic districts.
5	Spatial explicit model in R (Hagenlocher & Castro, 2015).	Malaria	United Republic of Tanzania	environmental, socioeconomic, demographic, biological, cultural, and political factors.	Malaria risk was higher in Mainland areas than in Zanzibar, which is a result of differences in both malaria entomological inoculation rate and prevailing vulnerabilities.
6	Agent-based model in SaTScan (Pizzitutti et al., 2015).	Malaria	Peruvian Amazon	Age , human, mosquitoes species and environmental factor- climate.	Able to reproduce the variations of the malaria monthly incidence over a period of 3 years and also the spatial heterogeneities of local scale malaria transmission.

A systematic review of spatial models in Infectious disease research

The table above shows that for a single disease (Malaria), different models are applicable since the aims are different. These models can be tested in different software. Each software has different properties and advantages. Also, each of these models has used different ranges of variables. These variables represent demographic profile, environmental factors, socioeconomic factors and disease features.

Spatial modelling for better strategy formulation

Overall, spatial modelling of infectious diseases can be conducted using different statistical models and methods. The methods depend on the purpose of the study and the data available. Future spatial studies of infectious diseases can gain from a wide range of GIS tools developed. More statistical tools are being continuously developed to ensure maximum capture of information from the available data. They highlight the underlying causes and relationships between elements of the data. Consequently, such information will help in a faster analysis of patterns and risks and hence contribute to the better design of control strategies.

References

Getis, A., Morrison, A. C., Gray, K., & Scott, T. W. (2003). Characteristics of the spatial pattern of the dengue vector, Aedes aegypti, in Iquitos, Peru. The American Journal of Tropical Medicine and Hygiene, 69(5), 494–505.
Hagenlocher, M., & Castro, M. C. (2015). Mapping malaria risk and vulnerability in the United Republic of Tanzania: a spatial explicit model. Population Health Metrics, 13(1), 2.
Kazembe, L. N. (2007). Spatial modelling and risk factors of malaria incidence in northern Malawi. Acta Tropica, 102(2), 126–137.
Kleinschmidt, I., Bagayoko, M., Clarke, G. P., Craig, M., & Le Sueu, r D. (2000). A spatial statistical approach to malaria mapping. International Journal of Epidemiology, 29(2), 355–361.
Lambin, E. F., Tran, A., Vanwambeke, S. O., Linard, C., & Soti, V. (2010). Pathogenic landscapes: Interactions between land, people, disease vectors, and their animal hosts. International Journal of Health Geographics, 27(9), 54.
Linard, C., & Tatem, A. J. (2012). Large-scale spatial population databases in infectious disease research. International Journal of Health Geographics, 11, 7.
Martins, H., & Rocha, J. G. (2012). Distributed Geospatial Data Management for Entomological and Epidemiological Studies. In L. Díaz, C. Granell, & J. Huerta (Eds.), Discovery of Geospatial Resources: Methodologies, Technologies, and Emergent Applications: Methodologies, Technologies, and Emergent Applications (pp. 220–240). Hershey PA: IGI Global.
Moise, I. K., Cunningham, M., & Inglis, A. (2015). Geospatial Analysis in Global Health M&E.
Ostfeld, R. S., Glass, G. E., & Keesing, F. (2005). Spatial epidemiology: An emerging (or re-emerging) discipline. Trends in Ecology & Evolution, 20(6), 328–336.
Pizzitutti, F., Pan, W., Barbieri, A., Miranda, J. J., Feingold, B., Guedes, G. R., … Mena, C. F. (2015). A validated agent-based model to study the spatial and temporal heterogeneities of malaria incidence in the rainforest environment. Malaria Journal, 14, 514.
Pullan, R. L., Gething, P. W., Smith, J. L., Mwandawiro, C. S., Sturrock, H. J., Gitonga, C. W., … Brooker, S. (2011). Spatial modelling of soil-transmitted helminth infections in Kenya: a disease control planning tool. PLoS Neglected Tropical Diseases, 5(2), 958. Retrieved from http://journals.plos.org/plosntds/article?id=10.1371/journal.pntd.0000958.
Robb, S. W., Bauer, S. E., & Vena, J. E. (2016). Integration of Different Epidemiological Perspecitives and Applications to Spatial Epidemiology. In A. B. Lawson, S. Banerjee, R. P. Haining, & M. D. Ugarte (Eds.), Handbook of Spatial Epidemiology (pp. 3–38). CRC Press.