Özkan, RecepTokdemir, Onur BehzatToğan, VedatKadıoğlu, FethiMostofi, Fatemeh2026-02-052026-02-052026Özkan, R., Mostofi, F., Kadıoğlu, F., Toğan, V., Tokdemir, O. B. (2026). Strategic Energy Project Investment Decisions Using RoBERTa: A Framework for Efficient Infrastructure Evaluation. Buildings, 16(3), 547.https://www.mdpi.com/2075-5309/16/3/547https://hdl.handle.net/20.500.12846/2124The task of identifying high-value projects from vast investment portfolios presents a major challenge in the construction industry, particularly within the energy sector, where decision-making carries high financial and operational stakes. This complexity is driven by both the volume and heterogeneity of project documentation, as well as the multidimensional criteria used to assess project value. Despite this, research gaps remain: large language models (LLMs) as pretrained transformer encoder models are underutilized in construction project selection, especially in domains where investment precision is paramount. Existing methodologies have largely focused on multi-criteria decision-making (MCDM) frameworks, often neglecting the potential of LLMs to automate and enhance early-phase project evaluation. However, deploying LLMs for such tasks introduces high computational demands, particularly in privacy-sensitive, enterprise-level environments. This study investigates the application of the robustly optimized BERT model (RoBERTa) for identifying high-value energy infrastructure projects. Our dual objective is to (1) leverage RoBERTa’s pre-trained language architecture to extract key information from unstructured investment texts and (2) evaluate its effectiveness in enhancing project selection accuracy. We benchmark RoBERTa against several leading LLMs: BERT, DistilBERT (a distilled variant), ALBERT (a lightweight version), and XLNet (a generalized autoregressive model). All models achieved over 98% accuracy, validating their utility in this domain. RoBERTa outperformed its counterparts with an accuracy of 99.6%. DistilBERT was fastest (1025.17 s), while RoBERTa took 2060.29 s. XLNet was slowest at 4145.49 s. In conclusion, RoBERTa can be the preferred option when maximum accuracy is required, while DistilBERT can be a viable alternative under computational or resource constraints.eninfo:eu-repo/semantics/openAccessproject portfolio selectioninformed investment decisionenergy construction projectvalue managementrisk managementRoBERTaStrategic Energy Project Investment Decisions Using RoBERTa: A Framework for Efficient Infrastructure EvaluationArticle16310.3390/buildings16030547547547