Abstract
Extreme heat has become an increasingly severe issue in many urban areas, with local governments struggling to predict heat events accurately using existing satellite-based or institutional systems. This study investigates whether a cost-effective sensor network—consisting of DHT22 sensors connected to ESP32 microcontrollers—can be used to collect real-time data to accurately predict localized heat distributions using Long Short-Term Memory (LSTM) models. Data was collected over a two-day period from three sensor stations arranged in a triangular formation in Upland, California. Each station stored data locally on an SD card, while additional vegetation and color information was recorded using an iOS application to provide supplementary context, though not used directly in the prediction model. The resulting time series dataset included temperature, humidity, and NDVI data, which were used to train and evaluate three models: LSTM, Linear Regression, and Decision Tree Regressor. Performance was assessed using Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and R² score. The LSTM model demonstrated the highest predictive accuracy, achieving a lower MAE and RMSE across all three sensor locations. This research supports the feasibility of deploying low-cost, decentralized sensor networks to generate localized and reliable heat forecasts. It also confirms that LSTM models can outperform traditional regression techniques when fed sensor-based environmental data. The project demonstrates a scalable and accessible solution for communities and researchers seeking real-time heat prediction systems without reliance on expensive institutional infrastructure.