Translate this page into:
Analyzing the relationship between three-dimensional architectural landscapes and urban carbon emissions using machine learning approaches
* Corresponding author E-mail address: windzpf@163.com (P. Zhang)
-
Received: ,
Accepted: ,
Abstract
Detecting the relationship between urban architectural landscapes and carbon emissions is crucial for achieving China’s carbon-peaking and carbon-neutrality goals. This study aims to investigate the relationship between 3D architectural landscapes and carbon emissions in Qingdao City, based on building 3D information extracted from high-resolution satellite images and carbon emission data from the Center for Global Environmental Research for the year 2020. First, key architectural landscape factors impacting carbon emissions were identified utilizing the Pearson correlation test and Random Forest (RF). A predictive relationship model between architectural landscapes and carbon emissions was built using Support Vector Machine Regression (SVR) and further optimized through the Chaotic Particle Swarm Optimization (PSO) algorithm. The results revealed strong correlations between carbon emissions and factors such as building density, building number, shape, and height. Floor area ratio had the highest impact on carbon emissions, contributing 45.5%, followed by building number, landscape shape index, building coverage ratio, Shannon’s diversity index, and building shape coefficient (BSC). The optimized PSO-SVR model achieved a higher coefficient of determination (R2) in the training dataset (77.32%) and test dataset (76.14%) compared to the SVR model (70% and 64.48%), along with lower mean absolute error (MAE) and mean relative error (MRE). Overall, the PSO-SVR model demonstrated enhanced accuracy in predicting carbon emissions and provided valuable insights for carbon reduction through targeted urban planning and architectural design.
Keywords
Particle swarm optimization
Relationship
Support vector machine regression
Three-dimensional architectural landscapes
Urban carbon emissions
1. Introduction
Carbon emissions are widely regarded as a primary driver of global warming, leading to numerous natural disasters (Zeng et al., 2022). According to the Intergovernmental Panel on Climate Change (IPCC) fifth assessment report, urban areas are responsible for 71–76% of total global emissions. In China, cities contribute 85% of the nation’s emissions (Xu et al., 2021). Rapid urbanization and buildings’ vertical expansion have significantly contributed to the rise in urban carbon emissions. Thus, understanding the relationship between architectural landscapes and carbon emissions is essential for the development of low-carbon cities.
Economic activities, population growth, energy structure, transportation, and buildings were widely recognized as major drivers of urban carbon emissions. Previous studies have employed the logarithmic mean divisia index (LMDI) approach (Zeng et al., 2020), the pooled mean group-auto regressive distribution lag (PMG-ARDL) framework (Liu et al., 2023), spatial panel Durbin models (Yi et al., 2022), and panel threshold models (Wei et al., 2023). Additionally, dynamic ARDL simulation techniques (Ali et al., 2022) and stochastic impacts by regression on population, affluence and technology (STIRPAT) models (Yu et al., 2020) to explore the impacts of economic growth, energy consumption, industrial structure, and population dynamics on carbon emissions. In parallel, other studies have explored the influences of urban form on carbon emissions, found that the complexity (Wang et al., 2019) and compactness (Meng et al., 2023) of urban form were associated with variations in carbon emissions. More recent studies have examined the role of architectural landscapes using numerical simulation (Quan et al., 2020), neural network regression (Ye et al., 2018), random forest (RF) (Lin et al., 2021), support vector regression (Gao et al., 2023), and Urban Modeling Interface platform (Wang et al., 2023). These findings have highlighted that building coverage ratio, spatial congestion, floor area ratio, building height and shape coefficient were the significant contributors to carbon emissions.
At present, RF, support vector regression, and chaotic particle swarm optimization (PSO) have been widely applied in carbon emission prediction (Khajavi and Rastgoo, 2023), industrial pollution and carbon reduction (Wen et al., 2024), and urban carbon peak optimization (Zeng et al., 2023). Such applications underscored the strong predictive power and flexibility of these models in capturing complex nonlinear relationships.
Although substantial research has investigated the factors of urban carbon emissions, several limitations remain: (1) Most studies have focused on population, economy, energy, and urban form, while often overlooking the role of architectural landscapes. (2) Existing research on architectural landscapes tends to emphasize the building sector or specific building types, rather than accounting for all buildings in a comprehensive urban context. (3) Few studies have integrated advanced machine learning models and optimization algorithms to fully capture the complex relationship between architectural landscapes and carbon emissions. Qingdao is a major coastal city in eastern China, features a diverse urban form and active participation in national low-carbon pilot programs. And its urbanization patterns and carbon emissions are broadly aligned with those of many Chinese cities. So, it was selected as a representative research area. Key architectural landscape factors affecting carbon emissions in Qingdao City for the year 2020 were identified using the RF. A predictive model describing the relationship between architectural landscapes and carbon emissions was constructed using Support Vector Machine Regression (SVR), and optimized using PSO. This research provides valuable insights for promoting low-carbon urban development and offers practical guidance for urban planning in China.
2. Material and Methods
2.1 Study area
Qingdao located in the southern part of Shandong province, China, and covers an area of 11293 km2. The city has undergone rapid urbanization and includes seven municipal districts (Shinan, Shibei, Licang, Laoshan, West Coast, Chengyang, and Jimo) and three county-level cities (Jiaozhou, Pingdu, and Laixi). Plains and basins make up approximately 59.4% of the city. In comparison, mountains comprise 17.6% of the area, mainly in the southeast and northern regions (Fig. 1). From 2010 to 2020, construction land increased by 56.6 km2, while energy consumption per unit of GDP and carbon emissions decreased by 21.5% and 27%, respectively. Therefore, investigating the relationship between architectural landscapes and carbon emissions is essential for promoting low-carbon cities and guiding the design of energy-efficient buildings. However, rapid urban growth has driven up energy consumption and carbon emissions annually, the architectural landscape significantly influences urban carbon emissions. Therefore, investigating the relationship between architectural landscapes and urban carbon emissions is essential for promoting low-carbon cities.

- Study area and buildings’ three-dimensional information.
2.2 Data and processing
The buildings’ 3D information (height and contour) was extracted from high-resolution satellite images of Qingdao in 2020 using Barista software (Fig. 1). The Open-Data Inventory for Anthropogenic Carbon dioxide (ODIAC) data (Fig. 2) was obtained from the Center for Global Environmental Research (CGER) at https://db.cger.nies.go.jp/dataset/ODIAC/. This dataset provides monthly carbon emission estimates at a spatial resolution of 1km × 1km (Oda et al., 2018), making it highly reliable for scientific purposes, and widely used for analyzing urban carbon emission assessments and carbon flux inversion (Dissanayake et al., 2018).

- Carbon emissions and fishnet in Qingdao City.
The size of rectangular grids was optimized through iterative testing to balance spatial resolution with data completeness. A coarse grid size resulted in a loss of important spatial detail, while overly small grids may lead to a high proportion of empty building grids, particularly in rural areas. So, a grid of size 5km × 5km was selected, yielding 424 spatial units across Qingdao City (Fig. 2). Various architectural landscape indices, as well as carbon emissions, were calculated for each grid using Fragstats and ArcGIS software. Finally, the relationships between architectural landscapes and carbon emissions were analyzed, and a regression model was developed at the grid scale utilizing RF, SVR, and PSO algorithm.
2.3. Methods
2.3.1 Pearson correlation analysis
Pearson correlation is a fundamental statistical tool used to assess the strength and direction of a linear relationship between two continuous variables. The correlation coefficient quantifies this relationship on a scale from -1 to 1, where values indicate a negative or positive linear relationship. Due to its simplicity and effectiveness in measuring linear relationships, Pearson correlation is widely used to identify potential relationships among variables. This study applies Pearson correlation analysis to examine the relationship between architectural landscapes and carbon emissions.
2.3.2 Random forest
Random forest (RF), developed by (Breiman, 2001) as an improvement over the classification and regression tree algorithm, has been further developed and analyzed in subsequent studies (Scornet et al., 2015; Biau and Scornet, 2016). RF enhances prediction accuracy and effectively manages complex relationships within data due to its strong classification capabilities (Zheng and Zhang, 2023). RF is an ensemble algorithm that constructs multiple decision trees using bootstrap aggregating of the training dataset and combines their outputs. During training, bootstrap re-sampling randomly selects k samples to build k decision trees, each grown without pruning. Each tree is trained on a random vector that is independently and identically distributed. The final prediction is obtained by averaging or majority voting (Fig. 3). RF is particularly effective in handling nonlinear relationship between input and output variables, making it suitable for regression tasks with strong generalization ability and feature importance evaluation. In this study, RF was implemented in Python to explore the contribution of architectural landscapes to carbon emissions.

- Workflow of RF.
2.3.3 Support vector regression
Support vector regression (SVR) is a machine learning algorithm based on statistical learning theory, proposed by Vapnik. SVR has proven highly capable of capturing nonlinear data patterns and effectively addresses challenges posed by small sample sizes, multifactor scenarios, high dimensionality, and nonlinear regression. It is resistant to overfitting and high predictive accuracy and stability, and widely used in energy prediction, where kernel functions are critical for modeling nonlinear relationships in high-dimensional feature spaces. The linear, Gaussian, and polynomial kernels are commonly applied in nonlinear regression, the Gaussian kernel’s free parameter allows SVR to approximate a broader range of nonlinear responses than the polynomial kernel (Roy and Chakraborty, 2023). The SVR model’s vector field structure is shown in Fig. 4 (Zhong et al., 2019). So, this study built a regression model of the key factors and carbon emissions using support vector regression in Python. SVR hyperparameters, such as penalty coefficient, degree of the polynomial kernel function, and kernel function parameters are crucial to defining support vectors in the regression model.

- Structure of the vector field-based SVR (Zhong et al., 2019).
2.3.4 Particle swarm optimization algorithm
The Particle swarm optimization (PSO) algorithm is an intelligent optimization method inspired by birds’ foraging behavior. PSO randomly initializes a population of particles that iteratively seeks individual and global optima to update their position and locate the optimal solution (Eberhart and Kennedy, 1995). In each iteration, each particle’s objective function is evaluated, recording the optimal solution (Pbest) and the optimal solution (Gbest) of the current population. The particle’s velocity and position are then updated based on these two optima, with the fitness calculated for each particle to determine the individual and global optimal values (Zeng et al., 2023). PSO effectively identifies optimal parameter combinations in high-dimensional spaces without being constrained by local optima. By balancing computational efficiency, global search capability, and implementation simplicity, POS is well-suited for optimizing SVR models in capturing the complex relationship between architectural landscapes and carbon emissions.
This study identified the key factors of architectural landscapes on carbon emissions using RF, then built a regression model of the key factors and carbon emissions using Support Vector Regression, and optimized the SVR model using Particle Swarm Optimization in Python. During these processes, we applied standardized normalization and outlier detection procedures for the dataset prior to model training. And a 10-fold cross-validation strategy and incorporated a grid-search mechanism for hyperparameter optimization were implemented to enhance the model’s generalization capability and prevent overfitting. The model framework is shown in Fig. 5.

- Model framework for architectural landscape indices and urban carbon emissions.
To evaluate the accuracy of the SVR and POS-SVR regression model, the coefficient of determination (R2), mean absolute error (MAE) (Eq. 1), and mean relative error (MRE) (Equation 2) were used. A higher R2, lower MAE, and MRE indicate a better fit and greater prediction accuracy.
where n is the sample size; yi and are the actual and predicted carbon emissions for the i-th grid, respectively.
2.4 Indices
In this study, 10 2D and 5 3D architectural landscape indicators were selected (Table 1) to examine their relationship with urban carbon emissions. These indicators represent aspects of building number, area, shape, connectivity, aggregation, and diversity. The values of these indices were calculated within a 5km × 5km grid using Fragstats 4.2 (Liu et al., 2017) and ArcGIS.
Metric | Calculation | Description |
---|---|---|
BN | Indicates the number of buildings per unit area. | |
BCR | Indicates the proportion of total building footprint area to land area (%). | |
LPI | Indicates the degree building concentration; higher values suggest greater buildings integration (%). | |
LSI | Indicates building shape regularity and aggregation. | |
Perimeter Area Fractal Dimension (PAFRAC) | Indicates the complexity of building boundaries; lower values suggest simpler shapes. | |
COHESION | Indicates spatial connectivity and aggregation among buildings (%). | |
PLADJ | Measures the aggregation degree of a specific building type (%); higher values indicate stronger clustering. | |
AI | Measures the degree of aggregation among architectural patches (%); higher values indicate stronger aggregation. | |
DIVISION | Indicates the level of landscape fragmentation (%). | |
Shannon’s diversity index (SHDI) | Reflects the richness and distribution of different architectural landscape types. | |
MBH | Indicates the average building height (m). | |
SDBH | Indicates the variation in building height (m). | |
FAR | Indicates the ratio of total floor area to land area. | |
Building shape coefficient (BSC) | Quantifies the relationship between building surface area and volume. | |
Spatial congestion degree (SCD) | Indicates the total building volume relative to the overall urban volume. |
Where B is the total number of buildings; Hi, Vi and Ai denote the height (m), volume (m3), and footprint area (m2) of the i-th building, respectively; A is the area of the grid cell (m2); Fi is the number of floors in the i-th building; Pi is the perimeter of the i-th building (m); β is the slope of the regression line obtained from a log-transformed regression of building area and perimeter; pk is the frequency of occurrence of patch type k in the landscape; n is the number of patch types in the landscape; E is the total boundary length of all building patches (m); Ab is the total area of the building landscape (m2); gi is the number of like adjacencies between pixels of patch type i, calculated using the single-count method; Bj is the number of buildings in the j-th grid; gii is the number of adjacencies within patch category i; and gik is the number of adjacencies between patch categories i and k.
3. Results
3.1 Pearson correlation between architectural landscapes and carbon emissions
Pearson linear correlation was calculated for architectural landscapes and carbon emissions (Fig. 6). All the architectural landscape indicators were significantly associated with carbon emissions at the 0.01 significance level. Indicators such as largest patch index (LPI), proportion of like adjacency (PLADJ), patch cohesion index (COHESION), and aggregation index (AI) showed a negative correlation with carbon emissions, while the other indicators exhibited positive correlations. A correlation coefficient with an absolute value greater than 0.6 indicates a strong correlation. Several indicators, including landscape shape index (LSI), building number (BN), building coverage ratio (BCR), mean building height (MBH), standard deviation of building height (SDBH), and floor area ratio (FAR), meet this threshold. Indicators such as SHDI, perimeter area fractal dimension (PAFRAC), landscape division index (DIVISION), building shape coefficient (BSC), PLADJ, COHESION, LPI, and AI, with correlation coefficients between 0.2 and 0.6, demonstrated moderate relevance to carbon emissions. Indicators with values below 0.2, such as SCD, showed a weak correlation and were excluded from further analysis.

- Correlation coefficients between architectural landscape indices and carbon emissions.
The linear relationship indicates that carbon emissions have a strong correlation with building number, density, shape, and height, while showing moderate relevance to building aggregation, diversity, and boundary complexity.
3.2 Factor contributions to carbon emissions by RF
The contribution of architectural landscape indices to carbon emissions has been shown in Fig. 7. It presents the importance ranking of variables from highest to lowest, a higher importance indicates a greater influence on carbon emissions. FAR exerted the most influence, contributing 45.5% to carbon emissions. This predominance suggests that an increase in FAR directly escalates energy consumption and population concentration, thereby amplifying carbon emissions. BN (18.4%) and LSI (13.3%) constituted secondary critical factors, with denser building clusters and complex geometric forms further intensifying emissions. Moderately influential parameters included BCR, SHDI, and BSC, each exceeding 2.8% contributions, while remaining indicators ranged from 0.7% to 2.5%. Cumulatively, these six metrics (FAR, BN, LSI, BCR, SHDI, and BSC) explained 86% of emissions, highlighting the critical interplay between architectural landscapes (building density, number, shape, and diversity) and carbon emissions. Consequently, these six indices were selected as predictors in the SVR model to quantify architectural landscapes impacts on carbon emissions.

- Variable importance ranking for urban carbon emissions from the RF model.
3.3 Regression model of architectural landscapes and carbon emissions by SVR
The relationship model of architectural landscapes (FAR, BN, LSI, BCR, SHDI, and BSC) and carbon emissions was built using SVR in Python. The dataset was partitioned into training and test sets through randomized sampling, with 90% for training and 10% for testing. Three kernel configurations of linear, polynomial, and Gaussian kernels were systematically evaluated in the SVR model. The regression model was selected according to the prediction accuracy (Fig. 8) of three kernel configurations.

- Comparison of accuracy for different kernel functions in SVR models.
The polynomial kernel function achieved the highest coefficient of determination (R2) for both training and test sets, with values of 70% and 68.58%, respectively. The polynomial kernel function also exhibited the lowest MRE and MAE values across both sets, at 44.75% and 40.82%, 325.47 and 456.7, respectively, outperforming the linear and Gaussian kernel functions. These results indicated that the polynomial kernel function exhibited superior predictive performance in the SVR model.
Therefore, the SVR model with a polynomial kernel function was selected for further analysis, as shown in Eq. (3):
Where C is the total carbon emissions (unit: t); k is the coefficient (with k=±10); n is the number of grids; is a kernel function that satisfies Mercer’s condition; represents the architectural landscape index in the i-th grid, Xi∈(FAR, BN, LSI, BCR, SHDI, BSC).
3.4 Optimization of the SVR model by PSO
PSO was applied to optimize the hyperparameters of the SVR model with a polynomial kernel, using the same random sampling approach. The optimization process iteratively adjusted the penalty coefficient, kernel coefficient, and kernel function degree to maximize prediction accuracy. The initial PSO parameters were carefully determined based on a combination of prior empirical studies and sensitivity analysis (Table 2).
Parameters | C1 | C2 | p | D | K | c | g |
---|---|---|---|---|---|---|---|
Values | 1.9 | 2 | 0.3 | 100 | 300 | [0.01,100] | [0.01,1000] |
The optimal parameters of the PSO-SVR model, obtained through iterative computation in Python, were a penalty factor (c) of 2.17, a kernel function degree of 3, and a kernel function parameter gamma of 5.4. Then optimized regression model (PSO-SVR), which captures the relationship between architectural landscape indices and urban carbon emissions, is presented in Eq. (4):
Where represents total carbon emissions (unit: t); k is the coefficient (with k=±2.1); n is the number of grids; is a kernel function satisfying Mercer’s condition; and represents the architectural landscapes index in the i-th grid, where X i∈(FAR, BN, LSI, BCR, SHDI, BSC).
3.5 Prediction accuracy of SVR and PSO-SVR models
Fig. 9 displays the observed and predicted carbon emissions for each model across training and test datasets, with absolute errors shown in blue. Each dataset is presented separately in sub-graphs. Error analysis revealed that the variance of prediction error was higher in the training dataset than in the test dataset. Specifically, both R2 and MRE values were higher for the training dataset, while MAE was lower for both SVR and PSO-SVR models (Table 3). These results indicated that the models are reliable for predicting urban carbon emissions.

- Observed and predicted carbon emissions using SVR and POS-SVR models.
Models | Training set | Test set | ||||
---|---|---|---|---|---|---|
R2 | MRE | MAE | R2 | MRE | MAE | |
SVR | 70.00% | 44.75% | 325.47 | 64.58% | 45.40% | 456.70 |
PSO-SVR | 77.32% | 42.59% | 265.20 | 76.14% | 40.82% | 404.92 |
As shown in Table 3, the PSO-SVR model achieved an R2 of 77.32% on the training dataset and 76.14% on the test dataset, representing improvements of 7.32% and 11.56%, respectively, over the standard SVR model. In addition, the PSO-SVR model reduced MAE by 60.27 on the training dataset and 51.78 on the test dataset. It also lowered MRE by 2.16% and 4.58%, respectively. These results collectively demonstrated that the PSO-SVR model significantly outperforms the SVR model in predicting carbon emissions based on architectural landscape indices. Therefore, optimizing the SVR model with PSO notably enhances model accuracy and generalization capability.
3.6 Relative errors of carbon emissions predictions
The relative errors (RE) between observed and predicted carbon emissions from the SVR and PSO-SVR models were categorized into seven levels. Fig. 10 compares the proportion of grid cells at each RE level for both models. For the SVR model, the distribution was as follows: RE≤5% (25.4%), 5%<RE≤10% (6.5%), 10%<RE≤20% (12.6%), 20%<RE ≤30% (11.7%), 30%<RE≤40% (13.9%), 40%<RE≤50% (5.0%), and RE>50% (25%). In comparison, the PSO-SVR model showed a 2.22% increase in the proportion of grids with RE≤5%, a 0.9% reduction in RE levels between 5% and 30%, and a 1.3% reduction in RE>30%, including a 1.67% decline in the RE>50% category. These results indicated that the PSO optimization effectively improved the predictive accuracy of the SVR model.

- RE comparison for carbon emissions prediction between SVR and POS-SVR models.
Fig. 11 illustrates the spatial distribution and variation in RE for carbon emissions predictions generated by the SVR and PSO-SVR models using ArcGIS. In both models (Figs. 11a and b), RE values below 30% were mainly concentrated in central urban grids with high building density. In contrast, RE values exceeding 30% were mainly distributed in suburban, rural, and green areas characterized by sparse building coverage, which negatively affected prediction accuracy.

- RE of carbon emissions using the SVR model.

- RE of carbon emissions using the POS-SVR model.
Fig. 11(c) compares the changes in RE between the two models. Specifically, 40.7% of grid cells were unchanged, 31.7% experienced reduced errors, and 27.6% exhibited increased errors. Grids with unchanged or improved accuracy were primarily located in city centers, while those with increased errors were predominantly found in suburban, rural, and green areas. These findings suggested that PSO effectively enhanced the SVR model by reducing errors in dense urban zones. The PSO-SVR model provides more accurate and spatially stable estimates of carbon emissions in Qingdao City.

- RE changes from SVR to POS-SVR.
4. Discussion
4.1 The correlation between architectural landscapes and urban carbon emissions
Compact urban development is generally more sustainable than urban sprawl. However, unchecked vertical expansion of buildings has led to environmental challenges, including the urban heat island effect (Lin et al., 2021), poor pollutant dispersion, and increased carbon emissions. Unlike previous studies, this research emphasized the role of both 2D and 3D architectural landscape indices in explaining urban carbon emissions. The findings revealed significant correlations between urban carbon emissions and architectural landscape indicators, including FAR, BN, LSI, BCR, and building height. These findings align with previous studies: Higher FAR is linked to increased energy, water, and gas consumption, greater waste production (Pu et al., 2022), and higher carbon emissions. Higher BN and BCR reduce carbon sequestration in green areas and amplify emissions (Xu et al., 2021). LSI and SHDI impacted carbon emissions by altering energy efficiency (Nutkiewicz et al., 2018; Quan et al., 2020) and modifying block-level wind and thermal environment (Wei et al., 2022). These findings underscore the importance of optimizing FAR, BN, LSI, BCR, SHDI, and BSC in urban planning and building design to enhance energy efficiency and support low-carbon city development.
4.2 Comparison of SVR and PSO-SVR models
A comparative analysis of the SVR and PSO-SVR models revealed that the PSO effectively optimized the SVR model. The PSO-SVR model significantly improved the forecasting accuracy of carbon emissions across all test cases, indicating its superior generalization capacity and enhanced capability to capture complex nonlinear dependencies in emission patterns. These improvements could have tangible implications for urban carbon management. The enhanced prediction accuracy enables more reliable identification of high-emission zones, supports targeted mitigation interventions, and strengthens data-driven decision-making for localized policy development.
While this study focuses on SVR and PSO-SVR due to their proven effectiveness in modeling spatially heterogeneous and nonlinear emission data, alternative machine learning models such as XGBoost and artificial neural networks (ANNs) also show promise. XGBoost has been recognized for its high predictive accuracy and efficiency, particularly for structured datasets with moderate sample sizes. However, its performance depends heavily on hyperparameter tuning, and its decision rules can lack interpretability. ANNs offer strong capabilities in capturing complex nonlinear interactions and are well-suited for large-scale, high-dimensional datasets. However, their reliance on extensive data preprocessing and computational demands may limit their practical deployment in urban planning. Future studies could integrate a broader model comparison to enhance model selection rigor and robustness of conclusions.
4.3 Limitations
While the PSO-SVR model effectively demonstrated the relationship between three-dimensional architectural landscapes and urban carbon emissions in Qingdao City, three limitations require attention:
(1) Dataset limitations: The urban carbon emissions data from ODIAC may underrepresent the impact of architectural landscape indices (Gao et al., 2022; Sun et al., 2024), potentially affecting the prediction accuracy of both the SVR and PSO-SVR models. Future studies should use detailed building-level carbon emission data to improve model precision.
This study relies solely on 2020 data for Qingdao, which limits the generalizability of findings. Architectural styles, industrial structures, and social-economic conditions vary across cities, potentially altering the observed relationships. Although our results are valid for Qingdao and similar Chinese cities, multi-city analysis with longitudinal dataset would provide more comprehensive insights.
(2) Computational scale limitation: This study employed a 5 x 5 km2 grid resolution determined through iterative testing to balance data density and emission variability. While this approach ensured computational feasibility, finer-scale differences in carbon emissions may be masked, especially in heterogeneous urban environments. Coarse grid size introduced spatial averaging effects, resulting in a loss of detail important spatial detail. Future studies should consider using the adaptive mesh methods or multi-resolution spatial frameworks that dynamically adjust grid sizes based on urban density or functional zoning. Such approaches may help capture spatial heterogeneity more precisely while maintaining computational feasibility.
(3) Limitations of Machine Learning Methods: Although we applied standardized normalization and 10-fold cross-validation strategy for model training and hyperparameter optimization to enhance generalization capability and prevent overfitting. The PSO-SVR model outperformed the conventional SVR in terms of prediction accuracy of carbon emissions. It is essential to recognize the inherent limitations of such machine learning approaches. Such as the necessity for comprehensive data pretreatment, overfitting, and overlook complex causal relationships. Future studies should consider incorporating advanced preprocessing frameworks, such as principal component analysis or mutual information-based filtering to improve model, and integrate machine learning with domain knowledge for more comprehensive insights.
5. Conclusion
Detecting the relationship between urban architectural landscapes and carbon emissions is essential for sustainable urban planning and the design of low-carbon buildings. This study identified key architectural landscape factors impacting carbon emissions utilizing RF. A predictive model describing the relationship between architectural landscapes and carbon emissions was built using SVR and further optimized with the PSO algorithm. The following conclusions were drawn based on the findings of the study:
(1) A strong relationship was observed between building density, number, shape, diversity, and urban carbon emissions. FAR was the most influential contributor, accounting for 45.5% of the total impact. Other significant factors included BN (18.4%), LSI (13.3%), BCR, SHID, and BSC, with a cumulative contribution reaching 86%.
(2) The SVR model effectively captured the relationship between architectural landscape indices and carbon emissions. After optimization with the PSO algorithm, the PSO-SVR model demonstrated higher predictive accuracy, confirming the effectiveness of the optimization process.
This study contributes to urban carbon reduction efforts by offering both theoretical insight and practical guidance for low-carbon urban planning and building design. The proposed methodological framework can also be extended to other cities to identify key architectural factors and develop predictive models for urban carbon emissions.
Acknowledgment
This work research was supported by China Scholarship Council (202306450148); the Project of Strategic Research Program on Technological Innovation in Qingdao City (24-1-7-zlyj-18-zhc); Shandong Province Natural Science Foundation (ZR2018QD001) and National Natural Science Foundation of China (41301198).
CRediT authorship contribution statement
Peifeng Zhang: Conceptualization, Resources, Writingoriginal draft. Yudi Fu: Formal analysis, Visualization, Software. Beibei Jia: Data curation, Methodology. Tadesse Zelele: Review & editing, Validation. Mohamed Al-Hussein: Conception and design of the study.
Declaration of Generative AI and AI-assisted technologies in the writing process
The authors confirm that there was no use of artificial intelligence (AI)-assisted technology for assisting in the writing or editing of the manuscript and no images were manipulated using AI.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References
- The impact of renewable and non-renewable energy consumption on carbon emission intensity in China: Fresh evidence from novel dynamic ARDL simulations. J Environ Manage. 2022;320:115782. https://doi.org/10.1016/j.jenvman.2022.115782
- [Google Scholar]
- A random forest guided tour. TEST. 2016;25:197-227. https://doi.org/10.1007/s11749-016-0481-7
- [Google Scholar]
- Random forests. Machine Learning. 2001;45:5-32. https://doi.org/10.1023/A:1010933404324
- [Google Scholar]
- Impact of Urban surface characteristics and socio-economic variables on the spatial variation of land surface temperature in Lagos City, Nigeria. Sustainability. 2019;11:25. https://doi.org/10.3390/su11010025
- [Google Scholar]
- A new optimizer using particle swarm theory[C]. MHS’95. Proceedings of the sixth international symposium on micro machine and human science Nagoya, Japan, pp. 39-43.
- A review of building carbon emission accounting and prediction models. Buildings. 2023;13:1617. https://doi.org/10.3390/buildings13071617
- [Google Scholar]
- Spatiotemporal dynamics and driving forces of city-level CO2 emissions in China from 2000 to 2019. J Cleaner Prod. 2022;377:134358. https://doi.org/10.1016/j.jclepro.2022.134358
- [Google Scholar]
- Predicting the carbon dioxide emission caused by road transport using a Random Forest (RF) model combined by meta-heuristic algorithms. Sustainable Cities and Society. 2023;93:104503. https://doi.org/10.1016/j.scs.2023.104503
- [Google Scholar]
- Analyzing the impact of three-dimensional building structure on CO2 emissions based on random forest regression. Energy. 2021;236:121502. https://doi.org/10.1016/j.energy.2021.121502
- [Google Scholar]
- Linkage among Urbanization, energy consumption, economic growth and carbon emissions panel data analysis for China using ARDL model. Fuel. 2023;332:126122. https://doi.org/10.1016/j.fuel.2022.126122
- [Google Scholar]
- Toward integrated governance of urban CO2 emissions in China: Connecting the “codes” of global drivers, local causes, and indirect influences from a multi-perspective analysis. Cities. 2023;134:104181. https://doi.org/10.1016/j.cities.2022.104181
- [Google Scholar]
- Energy modeling of urban informal settlement redevelopment: Exploring design parameters for optimal thermal comfort in Dharavi, Mumbai, India. Applied Energy. 2018;231:433-445. https://doi.org/10.1016/j.apenergy.2018.09.002
- [Google Scholar]
- The Open-source Data Inventory for Anthropogenic CO2, version 2016 (ODIAC2016): a global monthly fossil fuel CO2 gridded emissions data product for tracer transport simulations and surface flux inversions. Earth Syst Sci Data. 2018;10:87-107. https://doi.org/10.5194/essd-10-87-2018
- [Google Scholar]
- Driving effects of urbanization on city-level carbon dioxide emissions: From multiple perspectives of urbanization. Int J Urban Sci. 2022;26:108-128. https://doi.org/10.1080/12265934.2020.1803105
- [Google Scholar]
- An exploration of the relationship between density and building energy performance. Urban Design International. 2020;25:92-112. https://doi.org/10.1057/s41289-020-00109-7.
- [Google Scholar]
- Support vector machine in structural reliability analysis: A review. Reliability Engineering & System Safety. 2023;233:109126. https://doi.org/10.1016/j.ress.2023.109126
- [Google Scholar]
- Consistency of random forests. Ann Statist. 2015;43 https://doi.org/10.1214/15-aos1321
- [Google Scholar]
- Does environmental carbon pressure lead to low-carbon technology innovation? Empirical evidence from Chinese cities based on satellite remote sensing and machine learning. Computers & Industrial Engineering. 2024;189:109948. https://doi.org/10.1016/j.cie.2024.109948
- [Google Scholar]
- Evaluation of the impact of urban morphology on commercial building carbon emissions at the block scale – A study of commercial buildings in Beijing. J Cleaner Prod. 2023;408:137191. https://doi.org/10.1016/j.jclepro.2023.137191
- [Google Scholar]
- Estimating the impacts of urban form on CO2 emission efficiency in the Pearl River Delta, China. Cities. 2019;85:117-129. https://doi.org/10.1016/j.cities.2018.08.009
- [Google Scholar]
- Variations in outdoor thermal comfort in an urban park in the hot-summer and cold-winter region of China. Sustainable Cities and Society. 2022;77:103535. https://doi.org/10.1016/j.scs.2021.103535
- [Google Scholar]
- Investigating the impact of multi-dimensional urbanization and FDI on carbon emissions in the belt and road initiative region: Direct and spillover effects. J Cleaner Prod. 2023;384:135608. https://doi.org/10.1016/j.jclepro.2022.135608
- [Google Scholar]
- Impact factors of industrial pollution and carbon reduction under the “Dual Carbon” target: A case study of urban aggregation in the pearl river delta and yangtze river delta. Sustainability. 2024;16:1879. https://doi.org/10.3390/su16051879
- [Google Scholar]
- Investigating the impacts of three-dimensional spatial structures on CO2 emissions at the urban scale. Sci Total Environ. 2021;762:143096. https://doi.org/10.1016/j.scitotenv.2020.143096
- [Google Scholar]
- Modeling energy-related CO2 emissions from office buildings using general regression neural network. Resources, Conservation and Recycling. 2018;129:168-174. https://doi.org/10.1016/j.resconrec.2017.10.020
- [Google Scholar]
- Effects of digital economy on carbon emission reduction: New evidence from China. Energy Policy. 2022;171:113271. https://doi.org/10.1016/j.enpol.2022.113271
- [Google Scholar]
- How urban agglomeration improve the emission efficiency? A spatial econometric analysis of the Yangtze River Delta urban agglomeration in China. J Environ Manage. 2020;260 https://doi.org/10.1016/j.jenvman.2019.110061.
- [Google Scholar]
- Analysis of influencing factors and trend forecast of CO2 emission in chengdu-chongqing urban agglomeration. Sustainability. 2022;14:1167. https://doi.org/10.3390/su14031167
- [Google Scholar]
- Sequential data-driven carbon peaking path simulation research of the Yangtze River Delta urban agglomeration based on semantic mining and heuristic algorithm optimization. Energy. 2023;285:129415. https://doi.org/10.1016/j.energy.2023.129415
- [Google Scholar]
- Prediction and urban adaptivity evaluation model based on carbon emissions: A case study of six coastal city clusters in China. Sustainability. 2023;15:3202. https://doi.org/10.3390/su15043202
- [Google Scholar]
- Vector field-based support vector regression for building energy consumption prediction. Applied Energy. 2019;242:403-414. https://doi.org/10.1016/j.apenergy.2019.03.078
- [Google Scholar]
- Identifying the key factors influencing Chinese carbon intensity using machine learning, the random forest algorithm, and evolutionary analysis. Acta Geographica Sinica. 2019;74:2592-2603. https://doi.org/10.11821/dlxb201912012
- [Google Scholar]
- Comparative study on CO2 emissions from fossil energy consumption and its influencing factors in typical cities of China. Acta Ecologica Sinica. 2020;40:3315-3327. http://doi.org/10.5846/stxb201901290215
- [Google Scholar]