The results showed that for predicting daily PM10 concentration in the spring, DC, RH, WD, CO, and O3 were the most important factors, respectively. The WD followed by O3, SSD, and RH had a greater role in predicting daily PM10 in the summer. The most important predictor variables of daily PM10 in the autumn were SO2, NO2, T, CO, and O3, respectively. DC had played the most important part in predicting daily PM10 in the winter, while other variables made a much smaller contribution. In general, the variable of DC in spring and winter, SO2 in autumn, and WD in summer had a greater impact in predicting the daily PM10, compared to other parameters.
Discussion
Particulate matter ≤ 10 μm (PM10), as one of the most important air pollutants, have adverse effects on public health and the environment Therefore, it is of great important to predict and identify the factors affecting their concentration changes in different regions. In this study, after removing the variables with low variance, Boruta algorithm was used to select the best predictive variables of PM10 seasonal changes in weather. According to the results of this study, in spring and autumn, the fraction of unique values was less than 10% for 4 variables of DC, Rain, TDF, and Wsmean, and a higher percentage for other variables. In summer, the lowest values of variance were observed in DC, Rain, and TDF variables, and in the winter season, it was only observed for the last two variables. Therefore, they were excluded, and other variables were used to introduce Boruta algorithm in each season separately. According to Boruta algorithm results, some variables were not confirmed to model and predict daily PM10 concentrations in the study seasons. Unconfirmed and tentative variables for predicting PM10 in spring were WD, NO2, SSD, and O3 while in autumn; the only variable was NO2 variable. For winter, NO2, SSD, and SO2 were not confirmed for predicting daily PM10, while for summer, all the variables were confirmed. Finally, the remaining parameters (marked in blue in Figure 4) were confirmed and used to predict the target variable in different seasons. In one study, Tmax, Tmin, WSmax, WDmax, evaporation, and rainfall were selected as independent variables for PM10 prediction in all the seasons in Isfahan 44. In addition to these parameters, maximum and minimum values of relative humidity and sunshine hours were chosen to predict seasonal PM10 in Ahvaz city 45. Most of the climate variables selected in the previous studies were confirmed in the current study and were largely consistent with the findings of this stage in the present study. Kliengchuay W, et al., found that some air pollutants such as NO2 and CO also had an effect on the daily concentration of PM10, and in the context of choosing these variables, it could be concluded that the findings of this study were consistent with the reports of these researchers 14. One of the reasons regarding the similarity of the selected parameters for predicting the PM10 in Isfahan and Ahvaz in all the seasons was that all the climatic variables entered the modeling process without performing collinear analysis. This was while this important issue was taken into account in the present study. For this purpose, Boruta algorithm was used separately to select predictor variables in each season.
In this study, the performance of RF and Xgboost models was evaluated based on R2, RMSE, MAE, and NSE. Compared to Xgboost model, RF showed a higher performance in predicting PM10 in all the seasons and was selected as the best predictive model. The adjusted R2 value in this model was almost 0.9 for all the seasons, indicating that confirmed variables using Boruta algorithm could explain almost 90 % variability of daily PM10 in different seasons. The successful application of the RF model for predicting PM10 was also proved by Mallet 29 and Tella Balogun 19, which confirmed the findings obtained from this study. The higher accuracy of RF for PM10 prediction could be due to the following strengths. This model was highly iterative in nature, which made bootstrap data points for robust and stable forecasts. There were user-friendly OpenSource R libraries, some of which were designed to handle large numbers of input variables. Furthermore, the model had an inherent high capacity to handle nonlinear and high-order interactions between predictors 46. In addition, RF approach had less restrictive assumptions and was more flexible 47.
In general, some variables such as CO, O3, RH, SO2, and Tmean had been selected to predict seasonal PM10 changes in Yazd city. This suggested that the changes in all the mentioned variables had an impact on pollution changes caused by PM10 in the weather during all the seasons. In addition to these parameters, the changes of some variables such as spring and winter DC, summer and winter WS mean, and autumn NO2 had influenced the changes of PM10 in the weather of Yazd city. These results showed the complex relationship between climatic parameters and PM10 in different seasons. Such behaviors had also been reported in some of the previous studies regarding PM10 and other air pollutants in different parts of the world. The production and destruction of O3 in Baghdad city was affected by different levels of solar radiation in different seasons48, and seasonal changes of air dust had different effects on surface solar radiation in Taklamakan desert 49, 50. Soil moisture followed by air pollution hours, soil heat flux, air pressure, vegetation cover, wind direction, and evaporation were identified as the most important environmental factors affecting PM10 changes in Isfahan 51. Although Isfahan is located in the vicinity of Yazd city, the relative importance of environmental factors affecting PM10 changes in these cities was different. Difference in the study period selected for Isfahan (2013 to 2019) and Yazd (2018 to 2022), as well as the study scale selected, which was annual for Isfahan and seasonal for Yazd, were the main reasons for the difference in the selection of some predictor variables of PM10 in these cities.
In general, the high contribution of DC in predicting PM10 concentrations in the spring and winter indicated the local origin of dust particles in these seasons in Yazd. One of these sources was the Yazd-Ardakan plain, where a the large amount of soil particles is separated from the soil surface every year due to the phenomenon of wind erosion 52, and because the direction of the prevailing winds is from the side of this plain towards the city of Yazd 53, a large part these particles are transferred to this city. Considering that the increase in DC was caused by the increase in the frequency of dust events and dusty days, and the fact that there was a significant relationship between seasonal changes in PM10 and dusty days in Birjand city 54, with relatively similar climatic conditions to Yazd, the findings of another researcher confirmed the findings of the present research. Probably, the cold weather and increase in the traffic of automobiles, followed by the increase in the consumption of fossil fuels in the autumn, were the reasons for the increase in sulfur dioxide gas and the greater contribution of this factor in predicting the PM10 in this season. The significant contribution of WD in predicting PM10 concentrations in summer indicated that regional sources had a great impact on PM concentration in the dusty city of Yazd in this season. Guerra and Lane 55 pointed out a significant relationship between the wind direction and the concentration of PMs, which to some extent confirmed findings of this study. Although the mentioned variables were influential in predicting the concentration of PM10 in the study area, the influence of other climatic parameters and air pollutants should not be ignored. For example, in winter, one of the reasons for increasing air pollution was air temperature inversions 56, and this might be the reason why this variable appeared in the list of the main variables predicting winter PM10 concentration. The effect of temperature fluctuations on PM10 concentration variations had been proven in a study conducted by Plocoste and Calif 15. Wind speed affected PM10 concentration by affecting the horizontal dissemination 57. Moreover, on warm days, the wind can increase the concentration of particular matters by removing soil particles and road dust 58. In several studies, sunshine duration had been identified as a key factor affecting the concentration of PM10, which was consistent with the findings of this research. Solar radiation was also identified as another major factor affecting the change in PM10 concentration in different seasons because its changes were a function of changes in the concentration of airborne particles, cloud covers, and anthropogenic pollutions in the urban areas 59, 60. In agreement with the findings of this study, the important role of some drivers such as wind speed, wind direction, air pressure, relative humidity, air temperature, rainfall, and O3 in PMs changes had also been reported by Duarte and Schneider 61 for urban environments. Most of these variables were successfully used to predict PM10 concentration in previous studies 18, 29 and confirmed outcomes of this study.
Conclusion
Analysis of daily changes in air pollutants concentrations and determining the factors influencing these changes can help to better manage air quality in urban environments. In this work, the efficiency of RF and Xgboost models for predicting daily PM10 concentrations in different seasons was evaluated, and the most important factors affecting it were identified by the best fitted model. The major findings of this study are as follows:
- According to the accuracy metrics, the RF model is better than the Xgboost model for predicting the daily PM10 concentration in all seasons.
- DC has the greatest relative importance in predicting daily PM10 concentrations in spring and winter.
- WD and SO2 are identified as the most important factors affecting the daily changes of PM10 concentration in summer and autumn, respectively.
One of the limitations of this study was the lack of access to information on air pollutants in all the stations of Yazd city in a similar and long-term period. Furthermore, the incompleteness of PM10 information in other air pollution monitoring stations in Yazd province was one of the limitations of this study regarding the analysis of spatial variations of PM10 at a larger scale. Having such information provides a broader insight into areas with a higher risk of pollution in the province. Therefore, it is suggested that if PM10 information is recorded in the coming years for all the air pollution measuring stations in Yazd province, they should be used for spatial analysis of PM10 changes at the provincial scale.
Acknowledgments
This research was supported by the University of Jiroft under grant NO: 4812-01-1.The authors would like to thank the vice-chancellor of education and research in the university for his support and the department of environment and the meteorological organization of Yazd province for providing the required information on air pollutants and meteorological parameters.
Conflict of Interest
The authors declared no conflict of interest.
Funding
This research was supported by the University of Jiroft under grant NO: 4812-01-1.
Ethical considerations
The authors fully addressed ethical issues including plagiarism, informed consent, misconduct, data fabrication and/or falsification, double publishing and/or submission, redundancy, etc.
Code of ethics
This study was carried out with the aim of determining weather parameters and air pollutants affecting seasonal changes of particulate matter of less than 10 microns using meteorological data and air pollutants measures. The ethics code is not applicable for this research.
Authors' contributions
Zohre Ebrahimi-Khusfi and Ali Reza Nafarzadegan contributed to the study’s conception and design. Data collection, statistical analysis and data processing were performed by Zohre Ebrahimi-Khusfi, Mohsen Ebrahimi-Khusfi, and Ali Reza Nafarzadegan. The first draft of the manuscript was written by Zohre Ebrahimi-Khusfi and Mojtaba Soleimani Sardoo. Zohre Ebrahimi-Khusfi and Ali Reza Nafarzadegan were responsible for reviewing and revising the manuscript and supervising the project. All the authors read and approved the final manuscript.
This is an Open-Access article distributed in accordance with the terms of the Creative Commons Attribution (CC BY 4.0) license, which permits others to distribute, remix, adapt, and build upon this work for commercial use.
References
1. Taheri Shahraiyni H, Sodoudi S. Statistical modeling approaches for PM10 prediction in urban areas; A review of 21st-century studies. Atmosphere. 2016;7(2):15.
2. Kchih H, Perrino C, Cherif S. Investigation of desert dust contribution to source apportionment of PM10 and PM2.5 from a southern Mediterranean coast. Aerosol Air Qual Res. 2015;15(2):454-64.
3. Kermani M, Arfaeinia H, Masroor K, et al. Health impacts and burden of disease attributed to long-term exposure to atmospheric PM10/PM2.5 in Karaj, Iran: effect of meteorological factors. Int J Environ Anal Chem. 2020;102(18):6134-50.
4. Lee S, Hong YC, Park H, et al. Combined effects of multiple prenatal exposure to pollutants on birth weight: The Mothers and Children's Environmental Health (MOCEH) study. Environ Res. 2020;181:108832.
5. Tzima K, Analitis A, Katsouyanni K, et al. Has the risk of mortality related to short-term exposure to particles changed over the past years in Athens, Greece?. Environment International. 2018;113:306-12.
6. Sicard P, Khaniabadi YO, Perez S, et al. Effect of O3, PM10 and PM2.5 on cardiovascular and respiratory diseases in cities of France, Iran and Italy. Environ Sci Pollut Res. 2019;26(31): 32645-65.
7. Karimi B, Samadi S. Mortality and hospitalizations due to cardiovascular and respiratory diseases associated with air pollution in Iran: A systematic review and meta-analysis. Atmos Environ. 2019;198:438-47.
8. Tahery N, Geravandi S, Goudarzi G, et al. Estimation of PM10 pollutant and its effect on total mortality (TM), hospitalizations due to cardiovascular diseases (HACD), and respiratory disease (HARD) outcome. Environ Sci Pollut Res. 2021;28(17):22123-30.
9. Kumar S, Dwivedi S. Impact on particulate matters in India's most polluted cities due to long-term restriction on anthropogenic activities. Environ Res. 2021;200:111754.
10. Millán-Martínez M, Sánchez-Rodas D, de la Campa AMS, et al. Contribution of anthropogenic and natural sources in PM10 during North African dust events in Southern Europe. Environ Pollut. 2021;290:118065.
11. Alifa M, Bolster D, Mead M, et al. The influence of meteorology and emissions on the spatio-temporal variability of PM10 in Malaysia. Atmos Res. 2020;246:105107.
12. Hassan H, Latif MT, Juneng L, et al. Interaction of PM10 concentrations with local and synoptic meteorological conditions at different temporal scales. Atmos Res. 2020;241:104975.
13. Arregocés HA, Rojano R, Restrepo G. Meteorological factors contributing to organic and elemental carbon concentrations in PM10 near an open-pit coal mine. Environ Sci Pollut Res. 2022;29(19):28854-65.
14. Kliengchuay W, Worakhunpiset S, Limpanont Y, et al. Influence of the meteorological conditions and some pollutants on PM10 concentrations in Lamphun, Thailand. J Environ Health Sci Eng. 2021;19(1):237-49.
15. Plocoste T, Calif R. Is there a causal relationship between Particulate Matter (PM10) and air Temperature data? An analysis based on the Liang–Kleeman information transfer theory. Atmos Pollut Res. 2021;12(10):101177.
16. Chuchro M, Sarlej W, Grzegorczyk M, et al. Application of Photo texture analysis and weather data in assessment of air quality in terms of airborne PM10 and PM2.5 Particulate Matter. Sensors. 2021;21(16):5483.
17. Bozdağ A, Dokuz Y, Gökçek ÖB. Spatial prediction of PM10 concentration using machine learning algorithms in Ankara, Turkey. Environ Pollut. 2020;263:114635.
18. Choubin B, Abdolshahnejad M, Moradi E, et al. Spatial hazard assessment of the PM10 using machine learning models in Barcelona, Spain. Sci Total Environ. 2020;701:134474.
19. Tella A, Balogun AL, Adebisi N, et al. Spatial assessment of PM10 hotspots using Random Forest, K-Nearest Neighbour and Naïve Bayes. Atmos Pollut Res. 2021;12(10):101202.
20. Park H, Kim T, Yang M. The effect of meteorological factors on PM10 depletion in the atmosphere and evaluation of rainwater quality. Korean Journal of Remote Sensing. 2020;36(63):1733-41.
21. Zalewska T, Biernacik D, Marosz M. Correlations between 7Be, 210Pb, dust and PM10 concentrations in relation to meteorological conditions in northern Poland in 1998–2018. J Environ Radioact. 2021;228: 106526.
22. Draxler RR, Gillette DA, Kirkpatrick JS, et al. Estimating PM10 air concentrations from dust storms in Iraq, Kuwait and Saudi Arabia. Atmos Environ. 2001;35(25):4315-30.
23. Hussein T, Li X, Al-Dulaimi Q, et al. Particulate matter concentrations in a middle eastern city–an insight to sand and dust storm episodes. Aerosol Air Qual Res. 2020;20:2780-92.
24. Guan Q, Luo H, Pan N, et al. Contribution of dust in northern China to PM10 concentrations over the Hexi corridor. Sci Total Environ. 2019;660:947-58.
25. Rezazadeh M, Irannejad P, Shao Y. Climatology of the middle east dust events. Aeolian Res. 2013;10:103-9.
26. Ebrahimi-Khusfi Z, Mirakbari M, Soleimani-Sardo M. Aridity index variations and dust events in Iran from 1990 to 2018. Ann Am Assoc Geogr. 2022;112(1):123-40.
27. Jalili M, Ehrampoush MH, Mokhtari M, et al. Ambient air pollution and cardiovascular disease rate an ANN modeling: Yazd-Central of Iran. Scientific Reports. 2021;11(1):16937.
28. Czernecki B, Marosz M, Jędruszkiewicz J. Assessment of machine learning algorithms in short-term forecasting of PM10 and PM2.5 concentrations in selected Polish agglomerations. Aerosol Air Qual Res. 2021;21:200586.
29. Mallet MD. Meteorological normalisation of PM10 using machine learning reveals distinct increases of nearby source emissions in the Australian mining town of Moranbah. Atmos Pollut Res. 2021;12(1):23-35.
30. Kursa MB, Rudnicki WR. Feature selection with the Boruta package. J Stat Softw. 2010;36(11):1-13.
31. O’Loingsigh T, McTainsh G, Tews E, et al. The Dust Storm Index (DSI): a method for monitoring broadscale wind erosion using meteorological records. Aeolian Res. 2014;12:29-40.
32. Shao Y, Yang Y, Wang J, et al. Northeast Asian dust storms: Real‐time numerical prediction and validation. Journal of Geophysical Research. 2003;108(D22):4691.
33. Zhou T, Geng Y, Ji C, et al. Prediction of soil organic carbon and the C:N ratio on a national scale using machine learning and satellite data: A comparison between Sentinel-2, Sentinel-3 and Landsat-8 images. Sci Total Environ. 2021;755:142661.
34. Bradley B, Brandon G. Hands-On Machine Learning with R. CRC Press, USA. 2020.
35. Ebrahimi-Khusfi Z, Dargahian F, Nafarzadegan AR. Predicting the dust events frequency around a degraded ecosystem and determining the contribution of their controlling factors using gradient boosting-based approaches and game theory. Environ Sci Pollut Res. 2022;29(24):36655-73.
36. Chen X, Li X. Estimating aerosol optical extinction across eastern China in winter during 2014–2019 using the random forest approach. Atmos Environ. 2022;269:118864.
37. Ruiz-Aĺvarez M, Gomariz-Castillo F, Alonso-Sarría F. Evapotranspiration response to climate change in semi-arid areas: using random forest as multi-model ensemble method. Water 2021;13(2):222.
38. Pouyan S, Rahmanian S, Amindin A, et al. Spatial and seasonal modeling of the land surface temperature using random forest. Computers in Earth and Environmental Sciences. 2022:221-34.
39. Lagomarsino D, Tofani V, Segoni S, et al. A tool for classification and regression using random forest methodology: Applications to landslide susceptibility mapping and soil thickness modeling. Environ Model Assess. 2017;22(3):201-14.
40. Polewko-Klim A, Lesiński W, Golińska AK, et al. Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken. Poult Sci. 2020;99(12):6341-54.
41. Elith J, Leathwick JR, Hastie T. A working guide to boosted regression trees. J Anim Ecol. 2008;77(4):802-13.
42. Cui Y, Cai M, Stanley HE. Comparative analysis and classification of cassette exons and constitutive exons. Biomed Res Int. 2017;2017:1-9.
43. Sahour H, Gholami V, Torkaman J, et al. Random forest and extreme gradient boosting algorithms for streamflow modeling using vessel features and tree-rings. Environ Earth Sci. 2021;80:1-14.
44. Masoudi M, Gerami S. Assessment of PM10 concentration and its prediction using meteorological parameters in the air of Isfahan, Iran. The Jordan Journal of Earth and Environmental Sciences. 2018;9(2):75-80.
45. Masoudi M, Asadifard E, Rastegar M. Status of PM10 as an air pollutant and its prediction using meteorological parameters in Ahvaz, Iran. Eur Respir Rev. 2018;6(2):163-74.
46. Stafoggia M, Bellander T, Bucci S, et al. Estimation of daily PM10 and PM2.5 concentrations in Italy, 2013–2015, using a spatiotemporal land-use random-forest model. Environment International. 2019;124:170-9.
47. Breiman L. Random forests. Machine Learning. 2001;45:5-32.
48. Nassif WG, Al-Taai OT, Abbood ZM. The influence of solar radiation on ozone column weight over Baghdad city. IOP Conf Ser Mater Sci Eng. 2020;928(7):072089.
49. Fei Y, Xia X, Che H. Dust aerosol drives upward trend of surface solar radiation during 1980–2009 in the Taklimakan Desert. Atmos Sci Lett. 2014;15(4):282-7.
50. Gehlot S, Minnett PJ, Stammer D. Impact of sahara dust on solar radiation at cape verde islands derived from MODIS and surface measurements. Remote Sens Environ. 2015;166:154-62.
51. Ebrahimi-Khusfi Z, Taghizadeh-Mehrjardi R, Kazemi M, et al. Predicting the ground-level pollutants concentrations and identifying the influencing factors using machine learning, wavelet transformation, and remote sensing techniques. Atmos Pollut Res. 2021;12(5): 101064.
52. Ekhtesasi M, Sepehr A. Investigation of wind erosion process for estimation, prevention, and control of DSS in Yazd–Ardakan plain. Environ Monit Assess. 2009;159:267-80.
53. Ebrahimi Khusfi Z, Roustaei F, Ebrahimi Khusfi M, et al. Investigation of the relationship between dust storm index, climatic parameters, and normalized difference vegetation index using the ridge regression method in arid regions of Central Iran. Arid Land Res Manag. 2020;34(3):239-63.
54. Ebrahimi A, Ahmadizadeh SR, Rashki A. Variation of PM10 and its relationship with dust and climate in Birjand, Iran. Desert. 2022;27(1):97-114.
55. Guerra SA, Lane DD, Marotz GA, et al. Effects of wind direction on coarse and fine particulate matter concentrations in southeast Kansas. J Air Waste Manag Assoc. 2006;56(11): 1525-31.
56. Li X, Miao Y, Ma Y, et al. Impacts of synoptic forcing and topography on aerosol pollution during winter in Shenyang, Northeast China. Atmos Res. 2021;262:105764.
57. Galindo N, Varea M, Gil-Moltó J, et al. The influence of meteorology on particulate matter concentrations at an urban Mediterranean location. Water Air Soil Pollut. 2011;215(1):365-72.
58. Kukkonen J, Pohjola M, Sokhi RS, et al. Analysis and evaluation of selected local-scale PM10 air pollution episodes in four European cities: Helsinki, London, Milan and Oslo. Atmos Environ. 2005;39(15):2759-73.
59. Jin K, Qin P, Liu C, et al. Impact of urbanization on sunshine duration from 1987 to 2016 in Hangzhou City, China. Atmosphere. 2021;12(2):211.
60. Chakchak J, Cetin NS. Investigating the impact of weather parameters selection on the prediction of solar radiation under different genera of cloud cover: A case-study in a subtropical location. Measurement. 2021;176: 109159.
61. Duarte AL, Schneider IL, Artaxo P, et al. Spatiotemporal assessment of particulate matter (PM10 and PM2.5) and ozone in a Caribbean urban coastal city. Geosci Front. 2022;13(1): 101168.