An Ensemble Learning Approach for Estimating High Spatiotemporal Resolution of Ground-Level Ozone in the Contiguous United States

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

An Ensemble Learning Approach for Estimating High Spatiotemporal Resolution of Ground-Level Ozone in the Contiguous United States. / Requia, Weeberb J.; Di, Qian; Silvern, Rachel; Kelly, James T.; Koutrakis, Petros; Mickley, Loretta J.; Sulprizio, Melissa P.; Amini, Heresh; Shi, Liuhua; Schwartz, Joel.

In: Environmental Science & Technology, Vol. 54, No. 18, 2020, p. 11037-11047.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Requia, WJ, Di, Q, Silvern, R, Kelly, JT, Koutrakis, P, Mickley, LJ, Sulprizio, MP, Amini, H, Shi, L & Schwartz, J 2020, 'An Ensemble Learning Approach for Estimating High Spatiotemporal Resolution of Ground-Level Ozone in the Contiguous United States', Environmental Science & Technology, vol. 54, no. 18, pp. 11037-11047. https://doi.org/10.1021/acs.est.0c01791

APA

Requia, W. J., Di, Q., Silvern, R., Kelly, J. T., Koutrakis, P., Mickley, L. J., Sulprizio, M. P., Amini, H., Shi, L., & Schwartz, J. (2020). An Ensemble Learning Approach for Estimating High Spatiotemporal Resolution of Ground-Level Ozone in the Contiguous United States. Environmental Science & Technology, 54(18), 11037-11047. https://doi.org/10.1021/acs.est.0c01791

Vancouver

Requia WJ, Di Q, Silvern R, Kelly JT, Koutrakis P, Mickley LJ et al. An Ensemble Learning Approach for Estimating High Spatiotemporal Resolution of Ground-Level Ozone in the Contiguous United States. Environmental Science & Technology. 2020;54(18):11037-11047. https://doi.org/10.1021/acs.est.0c01791

Author

Requia, Weeberb J. ; Di, Qian ; Silvern, Rachel ; Kelly, James T. ; Koutrakis, Petros ; Mickley, Loretta J. ; Sulprizio, Melissa P. ; Amini, Heresh ; Shi, Liuhua ; Schwartz, Joel. / An Ensemble Learning Approach for Estimating High Spatiotemporal Resolution of Ground-Level Ozone in the Contiguous United States. In: Environmental Science & Technology. 2020 ; Vol. 54, No. 18. pp. 11037-11047.

Bibtex

@article{399f5faf6d3440ae8ac1b1c64bbc5509,
title = "An Ensemble Learning Approach for Estimating High Spatiotemporal Resolution of Ground-Level Ozone in the Contiguous United States",
abstract = "In this paper, we integrated multiple types of predictor variables and three types of machine learners (neural network, random forest, and gradient boosting) into a geographically weighted ensemble model to estimate the daily maximum 8 h O3 with high resolution over both space (at 1 km × 1 km grid cells covering the contiguous United States) and time (daily estimates between 2000 and 2016). We further quantify monthly model uncertainty for our 1 km × 1 km gridded domain. The results demonstrate high overall model performance with an average cross-validated R2 (coefficient of determination) against observations of 0.90 and 0.86 for annual averages. Overall, the model performance of the three machine learning algorithms was quite similar. The overall model performance from the ensemble model outperformed those from any single algorithm. The East North Central region of the United States had the highest R2, 0.93, and performance was weakest for the western mountainous regions (R2 of 0.86) and New England (R2 of 0.87). For the cross validation by season, our model had the best performance during summer with an R2 of 0.88. This study can be useful for the environmental health community to more accurately estimate the health impacts of O3 over space and time, especially in health studies at an intra-urban scale.",
author = "Requia, {Weeberb J.} and Qian Di and Rachel Silvern and Kelly, {James T.} and Petros Koutrakis and Mickley, {Loretta J.} and Sulprizio, {Melissa P.} and Heresh Amini and Liuhua Shi and Joel Schwartz",
year = "2020",
doi = "10.1021/acs.est.0c01791",
language = "English",
volume = "54",
pages = "11037--11047",
journal = "Environmental Science & Technology",
issn = "0013-936X",
publisher = "American Chemical Society",
number = "18",

}

RIS

TY - JOUR

T1 - An Ensemble Learning Approach for Estimating High Spatiotemporal Resolution of Ground-Level Ozone in the Contiguous United States

AU - Requia, Weeberb J.

AU - Di, Qian

AU - Silvern, Rachel

AU - Kelly, James T.

AU - Koutrakis, Petros

AU - Mickley, Loretta J.

AU - Sulprizio, Melissa P.

AU - Amini, Heresh

AU - Shi, Liuhua

AU - Schwartz, Joel

PY - 2020

Y1 - 2020

N2 - In this paper, we integrated multiple types of predictor variables and three types of machine learners (neural network, random forest, and gradient boosting) into a geographically weighted ensemble model to estimate the daily maximum 8 h O3 with high resolution over both space (at 1 km × 1 km grid cells covering the contiguous United States) and time (daily estimates between 2000 and 2016). We further quantify monthly model uncertainty for our 1 km × 1 km gridded domain. The results demonstrate high overall model performance with an average cross-validated R2 (coefficient of determination) against observations of 0.90 and 0.86 for annual averages. Overall, the model performance of the three machine learning algorithms was quite similar. The overall model performance from the ensemble model outperformed those from any single algorithm. The East North Central region of the United States had the highest R2, 0.93, and performance was weakest for the western mountainous regions (R2 of 0.86) and New England (R2 of 0.87). For the cross validation by season, our model had the best performance during summer with an R2 of 0.88. This study can be useful for the environmental health community to more accurately estimate the health impacts of O3 over space and time, especially in health studies at an intra-urban scale.

AB - In this paper, we integrated multiple types of predictor variables and three types of machine learners (neural network, random forest, and gradient boosting) into a geographically weighted ensemble model to estimate the daily maximum 8 h O3 with high resolution over both space (at 1 km × 1 km grid cells covering the contiguous United States) and time (daily estimates between 2000 and 2016). We further quantify monthly model uncertainty for our 1 km × 1 km gridded domain. The results demonstrate high overall model performance with an average cross-validated R2 (coefficient of determination) against observations of 0.90 and 0.86 for annual averages. Overall, the model performance of the three machine learning algorithms was quite similar. The overall model performance from the ensemble model outperformed those from any single algorithm. The East North Central region of the United States had the highest R2, 0.93, and performance was weakest for the western mountainous regions (R2 of 0.86) and New England (R2 of 0.87). For the cross validation by season, our model had the best performance during summer with an R2 of 0.88. This study can be useful for the environmental health community to more accurately estimate the health impacts of O3 over space and time, especially in health studies at an intra-urban scale.

U2 - 10.1021/acs.est.0c01791

DO - 10.1021/acs.est.0c01791

M3 - Journal article

C2 - 32808786

AN - SCOPUS:85091125941

VL - 54

SP - 11037

EP - 11047

JO - Environmental Science & Technology

JF - Environmental Science & Technology

SN - 0013-936X

IS - 18

ER -

ID: 249764343