Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Ambient air pollution poses significant health and environmental challenges.
Exposure to high concentrations of PM$_{2.5}$ have been linked to increased
respiratory and cardiovascular hospital admissions, more emergency department
visits and deaths. Traditional air quality monitoring systems such as
EPA-certified stations provide limited spatial and temporal data. The advent of
low-cost sensors has dramatically improved the granularity of air quality data,
enabling real-time, high-resolution monitoring. This study exploits the
extensive data from PurpleAir sensors to assess and compare the effectiveness
of various statistical and machine learning models in producing accurate hourly
PM$_{2.5}$ maps across California. We evaluate traditional geostatistical
methods, including kriging and land use regression, against advanced machine
learning approaches such as neural networks, random forests, and support vector
machines, as well as ensemble model. Our findings enhanced the predictive
accuracy of PM2.5 concentration by correcting the bias in PurpleAir data with
an ensemble model, which incorporating both spatiotemporal dependencies and
machine learning models.