| تعداد نشریات | 127 |
| تعداد شمارهها | 7,196 |
| تعداد مقالات | 77,227 |
| تعداد مشاهده مقاله | 157,218,011 |
| تعداد دریافت فایل اصل مقاله | 118,403,670 |
Machine Learning Interpretability Methods to Delineate the Aerosol Formation in the Arabian Sea Near Kerala Coast | ||
| Pollution | ||
| دوره 12، شماره 2، مرداد 2026، صفحه 601-613 اصل مقاله (1.15 M) | ||
| نوع مقاله: Original Research Paper | ||
| شناسه دیجیتال (DOI): 10.22059/poll.2026.409240.3256 | ||
| نویسندگان | ||
| Sherin Babu* 1؛ Marina Aloysius2؛ Sana S Navas1؛ Binu Thomas3 | ||
| 1Department of Computer Science, Assumption College Autonomous, Changanassery, Kottayam, Kerala, India | ||
| 2Department of Physics, Assumption College Autonomous, Changanassery, Kottayam, Kerala, India | ||
| 3Department of Computer Applications, Marian College, Kuttikanam, Idukki, Kerala, India | ||
| چکیده | ||
| Atmospheric aerosols have a significant function in atmospheric systems and hence play a crucial role in climatic changes. Machine learning (ML) models are highly preferred for aerosol estimation because of their exceptional predictive capability. However, it is challenging to justify and understand the predictions made by these ML models. The purpose of this research is to show how model-agnostic interpretation methods - permutation feature importance (PFI) and SHapley Additive exPlanations (SHAP) can be used to enhance and clarify machine learning model prediction of aerosols in the Arabian Sea region near the Kerala coast. Initially, the performance of 3 ML models, Polynomial regression, Bayesian ridge regression and Support Vector Regression (SVR) models are analyzed for estimating the aerosol optical depth (AOD). The study employed Pearson correlation to investigate the relationships between AOD and the various input features and to find the best features for building the ML models. Mean Squared Error (MSE) and Coefficient of Determination (R2) are the performance metrics used to assess these models' performance. Results indicated that SVR model (with R2 = 0.7933 and MSE = 0.0063) provided better predictive performance. Then the predictions of the most accurate model are explained by PFI and SHAP. The ML interpretability analysis showed that the main factors strongly associated with aerosol formation are aerosol radiative forcing at the top of the atmosphere (ARF_TOA), radiative forcing at the surface of the atmosphere (ARF_SURF), sea salt and temperature profile at 250hPa. | ||
| کلیدواژهها | ||
| AOD؛ Interpretable ML؛ PFI؛ SHAP؛ SVR | ||
| مراجع | ||
|
Agarwal, N., & Das, S. (2020). Interpretable Machine Learning Tools: A Survey. 2020 IEEE Symposium Series on Computational Intelligence (SSCI), 1528–1534. https://doi.org/10.1109/SSCI47803.2020.9308260 Ahmed, S., Shamim Kaiser, M., Hossain, M. S., & Andersson, K. (2024). A Comparative Analysis of LIME and SHAP Interpreters with Explainable ML-Based Diabetes Predictions. IEEE Access, 1–1. IEEE Access. https://doi.org/10.1109/ACCESS.2024.3422319 Alam, K., Khan, R., Blaschke, T., & Mukhtiar, A. (2014). Variability of aerosol optical depth and their impact on cloud properties in Pakistan. Journal of Atmospheric and Solar-Terrestrial Physics, 107, 104–112. https://doi.org/10.1016/j.jastp.2013.11.012 Babu, S., & Thomas, B. (2023). A survey on air pollutant PM2.5 prediction using random forest model. Environmental Health Engineering And Management Journal, 10(2), 157–163. https://doi.org/10.34172/EHEM.2023.18 Babu, S., & Thomas, B. (2025). Daily PM10 Prediction of Thiruvananthapuram City and Interpretability Analysis of Influencing factors. Pollution, 11(2), 525–537. https://doi.org/10.22059/poll.2024.382674.2561 Boiyo, R., Kumar, K., & Zhao, T. (2018). Spatial variations and trends in AOD climatology over East Africa during 2002-2016: A comparative study using three satellite data sets. International Journal of Climatology, 38. https://doi.org/10.1002/joc.5446 Breiman, L. (2001). Random Forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324 Casalicchio, G., Molnar, C., & Bischl, B. (2019). Visualizing the Feature Importance for Black Box Models. Machine Learning and Knowledge Discovery in Databases, 655–670. https://doi.org/10.1007/978-3-030-10925-7_40 Chaibi, M., Benghoulam, E. M., Tarik, L., Berrada, M., & Hmaidi, A. E. (2021). An Interpretable Machine Learning Model for Daily Global Solar Radiation Prediction. Energies, 14(21), Article 21. https://doi.org/10.3390/en14217367 Chatterjee, A., Anil, G., & Shenoy, L. R. (2022). Marine heatwaves in the Arabian Sea. Ocean Science, 18(3), 639–657. https://doi.org/10.5194/os-18-639-2022 Comito, C., & Pizzuti, C. (2022). Artificial intelligence for forecasting and diagnosing COVID-19 pandemic: A focused review. Artificial Intelligence in Medicine, 128, 102286. https://doi.org/10.1016/j.artmed.2022.102286 de Leeuw, G., Kang, H., Fan, C., Li, Z., Fang, C., & Zhang, Y. (2023). Meteorological and anthropogenic contributions to changes in the Aerosol Optical Depth (AOD) over China during the last decade. Atmospheric Environment, 301, 119676. https://doi.org/10.1016/j.atmosenv.2023.119676 Elshora, M. (2023). Evaluation of MODIS combined DT and DB AOD retrievals and their association with meteorological variables over Qena, Egypt. Environmental Monitoring and Assessment, 195(4), 483. https://doi.org/10.1007/s10661-023-11118-8 Feng, D.-C., Wang, W.-J., Mangalathu, S., & Taciroglu, E. (2021). Interpretable XGBoost-SHAP Machine-Learning Model for Shear Strength Prediction of Squat RC Walls. Journal of Structural Engineering, 147(11), 04021173. https://doi.org/10.1061/(ASCE)ST.1943-541X.0003115 Fisher, A., Rudin, C., & Dominici, F. (2019). All Models are Wrong, but Many are Useful: Learning a Variable’s Importance by Studying an Entire Class of Prediction Models Simultaneously (arXiv:1801.01489). arXiv. https://doi.org/10.48550/arXiv.1801.01489 Gilpin, L. H., Bau, D., Yuan, B. Z., Bajwa, A., Specter, M., & Kagal, L. (2018). Explaining Explanations: An Overview of Interpretability of Machine Learning. 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA), 80–89. https://doi.org/10.1109/DSAA.2018.00018 Imane, M., Aoula, E.-S., & Achouyab, E. H. (2022). Using Bayesian Ridge Regression to predict the Overall Equipment Effectiveness performance. 2022 2nd International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET), 1–4. https://doi.org/10.1109/IRASET52964.2022.9738316 Jaafari, A. (2024). Landslide susceptibility assessment using novel hybridized methods based on the support vector regression. Ecological Engineering, 208, 107372. https://doi.org/10.1016/j.ecoleng.2024.107372 Jabal, M. S., Joly, O., Kallmes, D., Harston, G., Rabinstein, A., Huynh, T., & Brinjikji, W. (2022). Interpretable Machine Learning Modeling for Ischemic Stroke Outcome Prediction. Frontiers in Neurology, 13. https://doi.org/10.3389/fneur.2022.884693 Jebli, I., Belouadha, F.-Z., Kabbaj, M. I., & Tilioua, A. (2021a). Prediction of solar energy guided by pearson correlation using machine learning. Energy, 224, 120109. https://doi.org/10.1016/j.energy.2021.120109 Jebli, I., Belouadha, F.-Z., Kabbaj, M. I., & Tilioua, A. (2021b). Prediction of solar energy guided by pearson correlation using machine learning. Energy, 224, 120109. https://doi.org/10.1016/j.energy.2021.120109 Karimian, H., Li, Y., Chen, Y., & Wang, Z. (2023). Evaluation of different machine learning approaches and aerosol optical depth in PM2.5 prediction. Environmental Research, 216, 114465. https://doi.org/10.1016/j.envres.2022.114465 Khan, M., Tariq, S., & Haq, Z. U. (2023). Variations in the aerosol index and its relationship with meteorological parameters over Pakistan using remote sensing. Environmental Science and Pollution Research, 30(16), 47913–47934. https://doi.org/10.1007/s11356-023-25613-5 Kim, Y., & Oh, H. (2021). Comparison between Multiple Regression Analysis, Polynomial Regression Analysis, and an Artificial Neural Network for Tensile Strength Prediction of BFRP and GFRP. Materials, 14(17), Article 17. https://doi.org/10.3390/ma14174861 Li, H., & Yamamoto, S. (2016). Polynomial regression based model-free predictive control for nonlinear systems. 2016 55th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE), 578–582. https://doi.org/10.1109/SICE.2016.7749264 Liu, Y., Sun, L., Du, C., & Wang, X. (2020). Near-infrared prediction of edible oil frying times based on Bayesian Ridge Regression. Optik, 218, 164950. https://doi.org/10.1016/j.ijleo.2020.164950 Liu, Z., Huang, X., & Wang, X. (2024). PM2.5 prediction based on modified whale optimization algorithm and support vector regression. Scientific Reports, 14(1), 23296. https://doi.org/10.1038/s41598-024-74122-z Lundberg, S., & Lee, S.-I. (2017). A Unified Approach to Interpreting Model Predictions (arXiv:1705.07874). arXiv. https://doi.org/10.48550/arXiv.1705.07874 Matthew, E., & Adeyinka, O. (2020). Application of Hierarchical Polynomial Regression Models to Predict Transmission of COVID-19 at Global Level. International Journal of Clinical Biostatistics and Biometrics, 6(1). https://doi.org/10.23937/2469-5831/1510027 Molnar, C., Casalicchio, G., & Bischl, B. (2020). Interpretable Machine Learning – A Brief History, State-of-the-Art and Challenges. In I. Koprinska, M. Kamp, A. Appice, C. Loglisci, L. Antonie, A. Zimmermann, R. Guidotti, Ö. Özgöbek, R. P. Ribeiro, R. Gavaldà, J. Gama, L. Adilova, Y. Krishnamurthy, P. M. Ferreira, D. Malerba, I. Medeiros, M. Ceci, G. Manco, E. Masciari, … J. A. Gulla (Eds.), ECML PKDD 2020 Workshops (pp. 417–431). Springer International Publishing. https://doi.org/10.1007/978-3-030-65965-3_28 Musolf, A. M., Holzinger, E. R., Malley, J. D., & Bailey-Wilson, J. E. (2022). What makes a good prediction? Feature importance and beginning to open the black box of machine learning in genetics. Human Genetics, 141(9), 1515–1528. https://doi.org/10.1007/s00439-021-02402-z Najah, A., Merwe, R. van der, & Al Shehhi, M. R. (2025). Review of tropical cyclones impacting the Western Arabian Sea and Oman. Journal of Operational Oceanography, 18(1), 21–39. https://doi.org/10.1080/1755876X.2024.2444753 Oh, S. (2022). Predictive case-based feature importance and interaction. Information Sciences, 593, 155–176. https://doi.org/10.1016/j.ins.2022.02.003 Quan, Q., Hao, Z., Xifeng, H., & Jingchun, L. (2022). Research on water temperature prediction based on improved support vector regression. Neural Computing and Applications, 34(11), 8501–8510. https://doi.org/10.1007/s00521-020-04836-4 Song, S., Kang, Y., & Im, J. (2023). Estimation of geostationary satellite-based hourly daytime and nighttime AOD using machine learning. EGU-12334. https://doi.org/10.5194/egusphere-egu23-12334 Tatachar, A. V. (2021). Comparative Assessment of Regression Models Based On Model Evaluation Metrics. 08(09). Ullah, I., Liu, K., Yamamoto, T., Zahid, M., & Jamal, A. (2023). Modeling of machine learning with SHAP approach for electric vehicle charging station choice behavior prediction. Travel Behaviour and Society, 31, 78–92. https://doi.org/10.1016/j.tbs.2022.11.006 Wei, X., Chang, N.-B., Bai, K., & Gao, W. (2020). Satellite remote sensing of aerosol optical depth: Advances, challenges, and perspectives. Critical Reviews in Environmental Science and Technology, 50(16), 1640–1725. https://doi.org/10.1080/10643389.2019.1665944 Yeom, J.-M., Jeong, S., Ha, J.-S., Lee, K.-H., Lee, C.-S., & Park, S. (2022). Estimation of the Hourly Aerosol Optical Depth From GOCI Geostationary Satellite Data: Deep Neural Network, Machine Learning, and Physical Models. IEEE Transactions on Geoscience and Remote Sensing, 60, 1–12. IEEE Transactions on Geoscience and Remote Sensing. https://doi.org/10.1109/TGRS.2021.3107542 Yousefi, R., Wang, F., Ge, Q., & Shaheen, A. (2020). Long-term aerosol optical depth trend over Iran and identification of dominant aerosol types. Science of The Total Environment, 722, 137906. https://doi.org/10.1016/j.scitotenv.2020.137906 Zhang, F., & O’Donnell, L. J. (2020). Chapter 7—Support vector regression. In A. Mechelli & S. Vieira (Eds.), Machine Learning (pp. 123–140). Academic Press. https://doi.org/10.1016/B978-0-12-815739-8.00007-9 Zheng, G., Zhang, Y., Yue, X., & Li, K. (2023). Interpretable prediction of thermal sensation for elderly people based on data sampling, machine learning and SHapley Additive exPlanations (SHAP). Building and Environment, 242, 110602. https://doi.org/10.1016/j.buildenv.2023.110602 | ||
|
آمار تعداد مشاهده مقاله: 68 تعداد دریافت فایل اصل مقاله: 9 |
||