Cooling load prediction of a double-story terrace house using ensemble learning techniques and genetic programming with SHAP approach
Access
info:eu-repo/semantics/openAccess
Date
2024
Author
Çakıroğlu, Celal
Aydın, Yaren
Bekdaş, Gebrail
Işıkdağ, Ümit
Sadeghifam, Aidin Nobahar
Abualigah, Laith
Citation
Çakıroğlu, C., Aydın, Y., Bekdaş, G., Işıkdağ, Ü., Sadeghifam, A. N., & Abualigah, L. (2024). Cooling load prediction of a double-story terrace house using ensemble learning techniques and genetic programming with SHAP approach. Energy & Buildings, 315.
Abstract
Cooling systems account for a significant portion of energy consumption in buildings in hot climates, so accurately predicting the cooling load and accounting for it in building design matters for both the economy and the environment. This study aimed to maximize energy efficiency by appropriately selecting the building features that affect the cooling load. To this end, data-driven, accurate, and accessible tools were developed that enable practitioners to predict the cooling load of a building. The study involves simulating the energy consumption of a mid-rise, double-story terrace house in Malaysia using building information modeling (BIM) and estimating the cooling load using ensemble machine learning models and genetic programming. Categorical Boosting (CatBoost), eXtreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Random Forest (RF) models were developed and made available through an online interactive graphical user interface on the Streamlit platform. Furthermore, symbolic regression was used to obtain a closed-form equation that predicts the cooling load. The dataset used to train the predictive models comprised 94,310 data points with 10 input variables and the cooling load as the output variable. The coefficient of determination (R²), root mean squared error (RMSE), and mean absolute error (MAE) were used to measure model performance. The machine learning models predicted the cooling load successfully, with the CatBoost model achieving the highest score (R² = 0.9990) among the four ensemble models and the predictive equation. The SHAP analysis identified the aspect ratio of the building as the most impactful feature.
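For readers unfamiliar with the three performance metrics named in the abstract, the following is a minimal sketch of how R², RMSE, and MAE are conventionally computed. It is not the authors' code: the function names and the sample values are hypothetical and chosen only for illustration, and the numbers do not come from the study's dataset.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error: average magnitude of the prediction errors."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error: square root of the mean squared error."""
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - (residual SS / total SS)."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical cooling-load values (illustrative only, not study data).
y_true = [10.0, 12.0, 14.0, 16.0]
y_pred = [10.5, 11.5, 14.5, 15.5]

print("MAE :", mae(y_true, y_pred))
print("RMSE:", rmse(y_true, y_pred))
print("R2  :", r2(y_true, y_pred))
```

An R² close to 1, such as the 0.9990 reported for CatBoost, means the residual sum of squares is very small relative to the total variance of the observed cooling load.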