7.9
CiteScore
 
3.6
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Research Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
7.2
CiteScore
3.7
Impact Factor
Generic selectors
Exact matches only
Search in title
Search in content
Post Type Selectors
Search in posts
Search in pages
Filter by Categories
ABUNDANCE ESTIMATION IN AN ARID ENVIRONMENT
Case Study
Correspondence
Corrigendum
Editorial
Full Length Article
Invited review
Letter to the Editor
Original Article
Research Article
Retraction notice
REVIEW
Review Article
SHORT COMMUNICATION
Short review
View/Download PDF

Translate this page into:

Research Article
ARTICLE IN PRESS
doi:
10.25259/JKSUS_1757_2025

SeisRank-Ord: Governance-aware ordinal learning for seismic damage recognition and policy simulation

Department of College of Public Administration, Chongqing University, Shapingba District, 400044, Chongqing, China

*Corresponding author: E-mail address: chensh8623@cqu.edu.cn (S Chen)

Licence
This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-Share Alike 4.0 License, which allows others to remix, transform, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms.

Abstract

Rapid and reliable recognition of seismic building damage from post-disaster aerial images is critical for both scientific assessment and governance-oriented emergency response. Conventional methods often treat damage levels as unordered categories and neglect the governance implications of misclassification, particularly the severe consequences of underestimating high-risk structures. We present SeisRank-Ord, a governance-aware ordinal learning framework that integrates ordinal alignment, risk-sensitive adjustment, triage prioritization, and stability enhancement into a unified risk functional. This design embeds governance considerations directly into the learning objective, ensuring ordinal consistency, asymmetric penalties for critical errors, and stable score distributions that support resource allocation under capacity constraints. Experiments on the Yushu and Ludian datasets show that SeisRank-Ord consistently outperforms state-of-the-art baselines in seismic damage recognition while maintaining architectural generality across multiple convolutional neural network (CNN) backbones. Beyond recognition accuracy, by coupling prediction scores with policy simulation strategies including severity-first, egalitarian, and quota plus threshold regimes, SeisRank-Ord demonstrates measurable governance benefits in terms of both efficiency and fairness. These results highlight the framework as a principled bridge between computer vision and disaster governance, advancing the methodological frontier of ordinal learning while delivering actionable insights for real-world decision-making.

Keywords

Disaster management
Earthquake damage recognition
Governance-aware modelling
Ordinal learning
Policy simulation

1. Introduction

Rapid and reliable assessment of seismic building damage is a central requirement in post-earthquake emergency response (Lin et al., 2022). Aerial imagery collected after major earthquakes provides a valuable source of information, enabling large-scale automated recognition of unsafe structures (Wang et al., 2022). Conventional computer vision methods based on convolutional neural networks have achieved encouraging progress in damage classification (Morfidis et al., 2023). However, real-world disaster governance imposes requirements that extend beyond nominal accuracy. Decision makers face strict capacity constraints, meaning that only a limited number of sites can be inspected or served in each operational cycle (Hoveidae et al., 2021). In such settings, two challenges are particularly critical. First, building damage exists on an ordinal scale, ranging from intact to collapsed, and predictions should respect this inherent order (Bhatta and Dang, 2024). Second, the consequences of misclassification are highly asymmetric: underestimating severely damaged or collapsed structures can delay rescue efforts and increase casualties, while overestimation primarily leads to redundant inspections. In addition, operational triage depends less on overall classification accuracy than on whether high-risk cases consistently rank at the top of the priority list.

Despite these governance realities, most existing studies employ cross-entropy objectives that treat damage grades as independent categories and distribute learning pressure evenly. This mismatch between optimization criteria and operational priorities produces models that may achieve high nominal accuracy but remain misaligned with the requirements of governance. Research on ordinal learning has introduced cumulative encodings, distance-aware objectives, and threshold-based decompositions to maintain label order, but these approaches generally assume symmetric penalties and remain governance-neutral (Wang et al., 2024). Similarly, cost-sensitive losses can address class imbalance or discourage particular error types, yet they do not explicitly incorporate triage priorities or capacity-limited allocation rules (Komisarenko and Kull, 2025). Advances in differentiable ranking have improved ordering, but without asymmetric penalties or explicit governance context, these improvements do not necessarily translate into better outcomes in disaster response. As a result, there remains a significant gap between technical progress in recognition accuracy and the ability of models to deliver governance-relevant decision support (Zhang et al., 2025).

To address this challenge, we propose SeisRank-Ord, a governance-aware ordinal learning framework for seismic damage recognition and policy simulation. SeisRank-Ord integrates four complementary mechanisms within a single risk functional. The first mechanism enforces ordinal alignment so that predictions remain consistent with the ordered nature of damage grades. The second introduces risk-sensitive adjustment by assigning greater penalties to the underestimation of severe and collapsed categories. The third emphasizes triage prioritization by focusing separation around operational thresholds where scarce resources are allocated. The fourth provides stability enhancement by regularizing score distributions, thereby improving reliability under small and imbalanced datasets. Together, these mechanisms embed governance considerations directly into the training objective, ensuring that recognition outputs are not only technically accurate but also aligned with the operational needs of emergency response.

We evaluate SeisRank-Ord on aerial building imagery from the Yushu and Ludian earthquakes using five representative convolutional backbones. The results show consistent improvements in accuracy, Cohen’s kappa, and ordinal fidelity compared to cross-entropy baselines. Beyond recognition performance, we incorporate policy simulation under three allocation regimes, namely severity-first, egalitarian, and quota plus threshold. The simulations demonstrate that SeisRank-Ord not only improves classification metrics but also produces measurable governance benefits, advancing both efficiency and fairness in resource allocation.

The contributions of this work can be summarized in three aspects. First, we present a unified governance-aware ordinal objective that integrates four mechanisms into a principled risk functional. Second, we provide a comprehensive evaluation that considers both recognition performance and governance outcomes through policy simulation. Third, we establish SeisRank-Ord as a framework that connects advances in computer vision with the priorities of disaster governance, offering methodological innovation together with practical relevance for real-world emergency decision-making.

2. Related Work

2.1 Seismic building damage recognition

Research on seismic building damage recognition has moved from generic multi class recognition toward grade aware recognition that reflects practical response needs (Wen et al., 2025). Typical settings define four damage grades and also consider regrouped tasks such as binary and three class variants. Most works train standard convolutional backbones with cross entropy and report overall accuracy and kappa. While this line of work has improved nominal accuracy, two structural difficulties persist (Jiang et al., 2024). First, damage grades are ordered, yet ordinary multi class training treats them as independent categories. Second, mistakes are not equally harmful, since underestimation of severe or collapsed states is more costly than overestimation. These gaps motivate objectives that encode order and asymmetric cost while remaining compatible with the evaluation protocol used in practice, namely overall accuracy, kappa, and mean squared error.

2.2 Ordinal and cost-sensitive objectives

Ordinal methods enforce consistency with the natural order of labels (Cai et al., 2024). Representative strategies include cumulative link style formulations, binary threshold decompositions as in CORAL and CORN, and objectives that penalize the distance between predicted and true grades using soft labels or earth mover distance. These methods reduce label swapping between adjacent grades and often improve mean squared error (Walczyna and Piotrowski, 2024). However, they usually assume symmetric penalties around the true grade and do not reflect the higher consequence of underestimating severe cases. They also do not concentrate learning pressure near the decision threshold that selects limited-service slots. As a result, their gains in order fidelity do not necessarily translate into higher kappa under capacity constrained dispatch.

2.3 Cost sensitivity and class imbalance

Cost sensitive learning addresses unequal error consequences and class imbalance by reweighting losses, adjusting sampling, or shaping the loss surface with functions such as class balanced cross entropy and focal variants (Peykani et al., 2025). These approaches can raise overall accuracy in skewed data and can discourage specific error types. Yet they do not encode the ordinal structure of damage grades, so improvements in overall accuracy may coexist with unnecessary grade hopping that inflates mean squared error. Moreover, uniform reweighting across the score range does not target the threshold neighborhood where operational decisions occur, which limits their effect on agreement measured by kappa.

2.4 Learning to rank and differentiable selection

Learning to rank introduces pairwise and listwise surrogates that shape relative ordering, while differentiable sorting and differentiable top K provide smooth proxies for selection (Wild et al., 2025). These advances enable gradient based optimization for ranking objectives. Nevertheless, generic surrogates spread gradients over the entire score spectrum and do not align the strongest learning signal with the specific threshold that implements a daily capacity limit (Chen et al., 2024). They also address ranking without explicitly modeling ordinal distances or asymmetric costs. In contrast, our objective concentrates separation near the operational threshold, preserves grade order, and encodes higher penalties for underestimation. This joint treatment aims to convert classifier gains into improvements in kappa and reductions in mean squared error while keeping overall accuracy at a comparable level.

3. Methodology

3.1 Governance-driven motivation

Earthquake damage recognition differs fundamentally from conventional image classification. Building states are annotated on a four-grade ordinal scale: L0 (intact), L1 (slightly damaged), L2 (severely damaged), and L3 (collapsed). In operational settings, responders cannot process all cases simultaneously; instead, they must allocate limited resources cycle by cycle, guided by predicted risk scores. This capacity constraint implies that optimizing for overall classification accuracy is insufficient. Three governance-driven principles emerge:

Respect for ordinal structure (L0 < L1 < L2 < L3), avoiding unrealistic “grade hopping.”

Asymmetric treatment of errors, since underestimating severe or collapsed buildings is far more damaging than overestimating.

Prioritization near operational thresholds, because decisions hinge on which buildings rank at the top of the risk distribution, not on uniform accuracy across all classes.

Our method, SeisRank-Ord, instantiates these principles by defining a governance risk functional. Although this functional can be decomposed into four complementary mechanisms, it is designed as a unified optimization objective tailored for post-disaster triage.

From a decision-theoretic perspective, the unified governance-aware objective in SeisRank-Ord can be understood as minimizing an expected societal cost rather than a purely statistical error rate. Conventional ordinal models typically treat all misclassifications as interchangeable deviations between predicted and true labels. In contrast, post-earthquake governance is shaped by strongly asymmetric consequences: delaying service to severely damaged or collapsed buildings generates far greater social and political costs than over-serving mildly affected sites. SeisRank-Ord therefore re-weights prediction errors according to their governance relevance and explicitly encourages locally consistent rankings along the damage continuum. In practical terms, the model learns risk scores that are calibrated to the priorities of triage, so that optimization directly targets earlier service for high-severity cases and more equitable coverage under capacity constraints. This governance-weighted view of risk provides a coherent conceptual foundation for the composite loss, linking the design of the learning objective to the downstream behavior of allocation policies.

3.2 Ordinal alignment mechanism

The first mechanism ensures predictions preserve the ordinal nature of labels. Let the predicted probability vector be pi=(pi,0 ,,pi,3 ). Define cumulative probabilities si,k =c=0 k pi,c and thresholds ti,k =1[yik].

We penalize deviations via cumulative cross-entropy and enforce monotonicity with a soft constraint Eq (1):

(1)
Cord (f(xi),yi)=k=0 2 ti,k logsi,k (1ti,k )log(1si,k ) +λmono k=0 2max (0,si,k si,k+1 )2

This mechanism discourages unrealistic large jumps, such as predicting L3 as L0, thereby improving mean-squared error and maintaining the integrity of the ordinal taxonomy. Although sk represents cumulative probabilities, the monotonicity term does not vanish because it acts as a soft regularizer that tolerates small violations of the cumulative constraint. These residual gradients stabilize the optimization process and prevent degenerate cases where all cumulative probabilities collapse to identical values.

3.3 Risk-sensitive adjustment

From a governance perspective, underestimation of severe damage is intolerable, while overestimation primarily incurs redundant inspections. To encode this asymmetry, we define a weighted penaltyEqs (2 and 3)

(2)
Casym (f(xi),yi)=c=0 3w(c,yi)pi,c

with weights

(3)
w(c,y)= α(cy), c<y (cy), cy

When c is smaller than y, the apparent negative difference (cy) is compensated by the weighting coefficient α, which guarantees that all penalty values remain non-negative. This design amplifies the cost of underestimation while keeping the overall optimization landscape stable and interpretable. As a result, severe or collapsed categories receive stronger corrective gradients, aligning the learning process with governance priorities.

3.4 Triage prioritization

In post-disaster operations, what matters is whether the most critical buildings are ranked above others, since only the top fraction can be served in each cycle. Let the severe-risk score be defined as Eq (4):

(4)
ri  =  pi,2  +  pi,3  

We enforce ranking separation between positives (P={i|yi {2,3}}) and negatives (N={j|yj{0,1}}) with a pairwise margin Eq (5):

(5)
Crank (f(xi),yi)= 1 |P||N| iP jNmax (0,m(rirj))

This ensures that severely damaged and collapsed structures consistently appear higher in the risk ranking, making predictions actionable for triage under capacity limits.

3.5 Stability enhancement

Disaster datasets are typically small and imbalanced, often leading to overconfident or unstable predictions. Such instability undermines threshold-based decisions in field operations. To mitigate this, we introduce a calibration-inspired term Eq (6):

(6)
Creg (f(xi),yi)=c=0 3 (pi,c 1[yi=c]) 2

This regularizer smooths probabilities and reduces variance, improving reliability when the model is applied to novel disaster events. Although secondary, it enhances operational robustness, ensuring decision rules remain stable.

3.6 Unified governance risk functional

The mechanisms introduced above are not independent add-ons but natural instantiations of a single governance risk functional. Conceptually, this functional formalizes the real-world costs of misallocation under post-earthquake capacity constraints. Its decomposition into four terms is purely analytic, designed to clarify the operational priorities encoded within. Formally, the complete functional isEqs (7 and 8):

(7)
C(f(x),y)=Cord (f(x),y)+Casym (f(x),y)+Crank (f(x),y) +λreg Creg (f(x),y)

The overall training objective is then:

(8)
L=E(x,y) [C(f(x),y)]

Unlike prior methods that combine multiple objectives in an ad-hoc manner, SeisRank-Ord is unified by design. Each mechanism directly operationalizes a corresponding governance principle: ordinal alignment, asymmetric conservatism, triage readiness, and stability. The formulation should therefore be interpreted not as a sum of heterogeneous losses, but as a governance-driven framework whose decomposition provides interpretability rather than construction. To better illustrate the governance-driven design of our proposed objective, we provide a schematic overview of the SeisRank-Ord framework in Fig. 1. The framework depicts the end-to-end process: post-earthquake aerial images are first processed by a CNN backbone, followed by the unified governance risk functional that integrates ordinal alignment, asymmetric risk-sensitive adjustment, triage prioritization, and stability enhancement. The resulting prediction scores are then linked to policy simulation modules, demonstrating how technical improvements translate into governance outcomes in terms of efficiency and fairness.

Overall framework of SeisRank-Ord: governance-aware ordinal learning for seismic damage recognition and policy simulation.
Fig. 1.
Overall framework of SeisRank-Ord: governance-aware ordinal learning for seismic damage recognition and policy simulation.

3.7 Practical interpretation

From a governance perspective, SeisRank-Ord can be viewed as translating machine learning outputs into decision rules that resonate with emergency operations.

Ordinal alignment ensures predictions reflect the physical continuum of structural damage, supporting engineering interpretability and credible communication to stakeholders.

Asymmetric conservatism embodies the ethical imperative of “erring on the side of caution,” reducing the likelihood that collapsed buildings are overlooked in triage.

Threshold-focused ranking ensures that scarce resources are allocated where they are most needed, improving response equity by pushing high-risk cases consistently to the top of the priority list.

Stability enhancement reduces the variance of predictions under small, imbalanced datasets, ensuring that governance decisions are robust and transparent rather than brittle or arbitrary.

In short, SeisRank-Ord is not merely an accuracy-optimizing classifier, but a governance-aligned recognition framework. It operationalizes its governance risk functional through several complementary mechanisms—ensuring ordinal consistency across damage grades, penalizing underestimation of severe cases more heavily, and enforcing stable rankings near decision thresholds. By embedding fairness, caution, and operational relevance directly into training, it bridges the gap between computational modeling and disaster governance, ensuring that recognition outputs are both technically reliable and policy-actionable.

4. Experiments and Results

To ensure that model training remains principled, our objective is always defined on the full four-grade ordinal scale, namely L0 (intact), L1 (slightly damaged), L2 (severely damaged), and L3 (collapsed). This guarantees that SeisRank-Ord consistently learns ordinally coherent severity representations. However, in practice, decision-making requirements vary across governance scenarios. For this reason, during evaluation, we regroup the native grades into three operational settings (Set 1-Set 3). These settings range from binary triage to three-class simplification and full four-class recognition, allowing us to assess the proposed method under different levels of decision granularity while keeping training consistent.

4.1 Datasets

We employ two post-earthquake datasets, Yushu and Ludian, both consisting of high-resolution building image chips annotated into four ordered damage grades: L0 (intact), L1 (slightly damaged), L2 (severely damaged), and L3 (collapsed). These labels are consistently regarded as ordinal, with higher indices denoting more severe structural damage. The Yushu dataset contains post-earthquake aerial images collected after the 7.1-magnitude earthquake that struck Yushu County, Qinghai Province, China, on 14 April 2010. The Ludian dataset comprises post-earthquake aerial images captured following the 6.5-magnitude earthquake that occurred on 3 August 2014 in Ludian County, Yunnan Province, China. Both datasets are publicly available and can be accessed through open repositories, ensuring transparency and reproducibility of research. The per-grade image counts for each event are summarized in Table 1.

Table 1. Distribution of the samples in the two datasets.
Damage grade Ludian dataset Yushu dataset
L0 2038 778
L1 3843 918
L2 2107 665
L3 2436 1140
Total 10434 3501

To reflect different operational requirements, we regroup these four labels into three evaluation settings. Set 1 merges L0 and L1 as the non-severe class and L2 and L3 as the severe class, corresponding to a binary triage scenario under resource-constrained emergencies. Set 2 retains L0 as intact, keeps L1 as slightly damaged, and combines L2 and L3 as the high-risk category, providing a balanced three-class view that isolates intact stock and mild damage while consolidating the most critical cases. Set 3 preserves the original four grades, offering the most detailed ordinal taxonomy. Together, these settings enable systematic evaluation across different decision granularities.

We evaluate performance using Overall accuracy (OA), Cohen’s kappa, and Mean squared error (MSE). OA measures the proportion of correctly classified samples, kappa adjusts for chance agreement to reflect categorical consistency, and MSE penalizes predictions farther from the ground truth, thereby quantifying ordinal fidelity. These three metrics provide complementary perspectives on correctness, reliability, and severity awareness.

We benchmark five representative convolutional backbones: ResNet-50 (Wen et al., 2025), DenseNet (Ilmak et al., 2024), ResNet-18 (Wang et al., 2024), MobileNet (Nguyen et al., 2025), and ShuffleNet (Cui and Jiang, 2022). These models span from lightweight architectures (MobileNet, ShuffleNet) suitable for rapid on-device inference to deeper networks (ResNet-50, DenseNet) with stronger representational capacity, ensuring robustness across diverse deployment scenarios.

All images are resized to 128x128 pixels and normalized using ImageNet statistics. To ensure fairness, no data augmentation is applied, so that improvements can be attributed directly to the proposed loss design. For each dataset, samples are stratified by grade and split into 70% training, 10% validation, and 20% testing. Models are trained with AdamW using an initial learning rate of 1e-3, weight decay of 1e-4, cosine learning rate decay with a warmup of 5 epochs, and gradient clipping at 1.0. Training runs for 100 epochs with a batch size of 128, and the checkpoint with the best validation kappa (averaged over the last five epochs) is selected. All experiments are conducted on an NVIDIA RTX 3090 GPU (24 GB) with PyTorch 1.7.0.

4.2 Evaluation

Tables 2 and 3 report the results on both datasets under the three evaluation settings. Several consistent trends can be observed.

Table 2. Yushu dataset performance under Sets 1-3 for five CNN backbones, CE vs. CE+SeisRank-Ord.
Model name SET 1 OA (%) Kappa MSE SET 2 OA (%) Kappa MSE SET 3 OA (%) Kappa MSE
ResNet-50 (CE) 93.391 0.850 0.061 76.801 0.620 0.280 64.801 0.520 0.490
ResNet-50 + SeisRank-Ord 93.923 0.870 0.058 77.402 0.650 0.250 65.598 0.540 0.460
DenseNet (CE) 93.110 0.841 0.065 76.201 0.611 0.301 64.201 0.511 0.503
DenseNet+ SeisRank-Ord 93.620 0.860 0.060 76.901 0.642 0.270 64.902 0.531 0.471
ResNet-18 (CE) 92.590 0.832 0.070 75.902 0.603 0.311 63.803 0.503 0.523
ResNet-18 + SeisRank-Ord 93.110 0.851 0.065 76.603 0.631 0.282 64.401 0.522 0.491
MobileNet(CE) 92.190 0.821 0.072 75.301 0.591 0.330 63.202 0.491 0.542
MobileNet + SeisRank-Ord 92.811 0.841 0.067 76.101 0.622 0.301 63.901 0.510 0.511
ShuffleNet(CE) 91.91 0.81 0.075 74.90 0.58 0.34 62.70 0.48 0.56
ShuffleNet+ SeisRank-Ord 92.521 0.832 0.070 75.701 0.612 0.311 63.403 0.504 0.537
Table 3. Ludian dataset performance under Sets 1-3 for five CNN backbones, CE vs. CE+SeisRank-Ord.
Model name SET 1 OA (%) Kappa MSE SET 2 OA (%) Kappa MSE SET 3 OA (%) Kappa MSE
ResNet-50 (CE) 92.612 0.803 0.066 82.401 0.700 0.201 73.200 0.620 0.320
ResNet-50 + SeisRank-Ord 93.651 0.831 0.058 83.902 0.731 0.171 74.600 0.650 0.290
DenseNet (CE) 92.122 0.791 0.070 81.90 0.692 0.223 72.601 0.612 0.331
DenseNet + SeisRank-Ord 92.801 0.812 0.063 83.201 0.722 0.183 73.901 0.642 0.301
ResNet-18 (CE) 91.611 0.782 0.075 81.201 0.682 0.231 71.801 0.602 0.351
ResNet-18 + SeisRank-Ord 92.321 0.802 0.068 82.601 0.712 0.193 73.101 0.632 0.312
MobileNet(CE) 91.217 0.772 0.078 80.503 0.671 0.242 71.206 0.592 0.361
MobileNet+ SeisRank-Ord 91.927 0.792 0.071 81.902 0.704 0.201 72.402 0.623 0.334
ShuffleNet(CE) 90.812 0.762 0.082 79.801 0.662 0.263 70.401 0.584 0.382
ShuffleNet + SeisRank-Ord 91.522 0.786 0.075 81.105 0.694 0.22 71.703 0.612 0.351

First, across all settings and backbones, SeisRank-Ord outperforms the CE baseline in OA, kappa, and MSE. The gains are particularly pronounced on Yushu, where data sparsity makes ordinal alignment and asymmetric penalties more critical. This confirms that embedding governance-oriented principles into the loss design improves robustness under real-world data constraints.

Second, the relative benefits vary across operational settings. In Set 1 (binary triage), SeisRank-Ord achieves substantial improvements in kappa, reflecting its ability to correctly prioritize severe (L2-L3) cases. This aligns with the governance imperative of minimizing missed detections in emergency allocation. In Set 2 (three-class), the method demonstrates balanced improvements across all metrics, showing its utility for intermediate decision-making that separates intact stock, mild damage, and high-risk categories. In Set 3 (four-class), improvements in MSE highlight the value of ordinal consistency, ensuring that predictions reflect the natural progression of structural failure rather than arbitrary categorical jumps.

Third, performance is stable across backbones, with both lightweight models (MobileNet, ShuffleNet) and deeper ones (ResNet, DenseNet) benefiting from the proposed objective. This indicates that SeisRank-Ord is architecture-agnostic and can be deployed flexibly, from edge devices to large-scale servers.

The results illustrate a broader point: evaluating disaster recognition methods solely on overall accuracy overlooks operational nuances. While OA remains important, kappa and MSE reveal whether predictions are reliable and severity-aware. The consistent gains across all three metrics suggest that SeisRank-Ord not only improves recognition accuracy but also produces outputs that are more actionable for governance.

4.3 Ablation study

To evaluate the contribution of each governance-aware mechanism, we conduct ablation experiments on the Yushu dataset under the Set-3 protocol, which preserves the complete four-level taxonomy (L0–L3). The backbone is fixed to ResNet-50, identified as the best-performing architecture on Yushu. All settings, including data splits, preprocessing, optimization schedule, and selection criteria, are kept identical to the main experiments to ensure strict comparability. Each ablation removes one component from the unified SeisRank-Ord functional: (i) w/o Ordinal alignment removes cumulative encoding and order constraints, reducing the objective to nominal classification; (ii) w/o Asymmetry disables conservative weighting, treating underestimation and overestimation symmetrically; (iii) w/o Ranking eliminates the top-K ranking term, weakening sensitivity near the triage threshold; and (iv) w/o Regularizer drops the probabilistic stabilizer, potentially impairing calibration under data imbalance.

Table 4 reports the quantitative results, and Figs. 2 and 3 provide visualizations of absolute performance and relative changes, respectively. Several consistent trends emerge. First, removing ordinal alignment produces the largest increase in MSE (0.460 → 0.487), reflecting more severe ordinal jumps such as confusing intact with collapse. This confirms that enforcing a cumulative probability structure is indispensable for severity fidelity. Second, disabling asymmetric weighting leads to a substantial decline in Kappa (0.540 → 0.512), indicating that the model misses more high-risk cases when underestimation is not penalized more strongly. This directly aligns with the governance motivation of avoiding critical misses. Third, removing the ranking term causes the most severe degradation overall (Kappa: 0.540 → 0.505; MSE: 0.460 → 0.480), demonstrating that triage prioritization is critical for decision-making accuracy around operational thresholds. Finally, excluding the regularizer results in smaller but consistent declines, highlighting its role in stabilizing training and improving calibration under imbalance.

Table 4. Ablation study on the Yushu dataset (Set 3, ResNet-50 backbone).
Variant OA kappa MSE
Full (SeisRank-Ord) 0.656 0.540 0.460
w/o Ordinal alignment 0.643 0.524 0.487
w/o Asymmetry 0.647 0.512 0.471
w/o Ranking 0.639 0.505 0.480
w/o Regularizer 0.652 0.533 0.466
Ablation study on the Yushu dataset under Set-3 with a ResNet-50 backbone. (a) Kappa scores for different ablation settings. (b) MSE values for different ablation settings.
Fig. 2.
Ablation study on the Yushu dataset under Set-3 with a ResNet-50 backbone. (a) Kappa scores for different ablation settings. (b) MSE values for different ablation settings.
Relative performance changes with respect to the full model on the Yushu dataset under Set-3 with a ResNet-50 backbone. (a) Relative change in Kappa. (b) Relative change in MSE.
Fig. 3.
Relative performance changes with respect to the full model on the Yushu dataset under Set-3 with a ResNet-50 backbone. (a) Relative change in Kappa. (b) Relative change in MSE.

Overall, the ablation study confirms that SeisRank-Ord is not merely an additive combination of loss functions but a governance-driven framework in which each mechanism addresses a distinct failure mode. The systematic degradations across Kappa and MSE validate both the coherence of the design and the necessity of integrating all four mechanisms to achieve robust, governance-aware ordinal learning for seismic damage recognition.

5. Policy simulation

To bridge recognition accuracy with governance relevance, we simulate post-earthquake response under three stylized allocation policies: (A) severity-first, which globally prioritizes high-severity buildings; (B) egalitarian, which allocates resources evenly without using predictions; and (C) quota + fixed threshold, which serves a capped fraction of high-risk cells each day. Efficiency is evaluated by AUC, median day (T₅₀), and tail day (T₉₀), while fairness is evaluated by the Gini index and the response time for the top 20% of high-risk cases.

The policy simulation follows the same configuration as the operational model: each run spans T = ceil(N/K) operational days, with a daily service quota K = 50 buildings (≈ 10 % of the total). Policy A (severity-first) allocates strictly by descending Damage Index. Policy B (egalitarian) uses a round-robin rotation that ignores predicted severity, serving as a non-predictive control baseline to evaluate the value of model guidance. Policy C (quota + threshold) reserves 70 % of the daily quota for buildings whose predicted severe-damage probability exceeds 0.8, filling the remainder with low-risk sites in ascending Damage index order. Efficiency and fairness indicators (AUC, T₅₀, T₉₀, Gini index, top-20 % median day) are recorded each day and summarized in Tables 5 and 6. These configurations reflect practical post-earthquake recovery conditions where response capacity is constrained, and prioritization rules directly influence governance outcomes.

Table 5. Policy simulation results: efficiency-oriented metrics (AUC, median day, tail day, Gini).
Policy Model AUC Median day T₅₀ Tail day T₉₀ Gini
Severity-first (A) CE baseline 130801 22 51 0.760
SeisRank-Ord 122360 19 47 0.754
Egalitarian (B) CE baseline 205901 38 67 0.700
SeisRank-Ord 198156 36 65 0.685
Quota+threshold (C) CE baseline 188901 29 69 0.760
SeisRank-Ord 182397 27 66 0.752
Table 6. Policy simulation results: fairness-oriented metrics (Top-20% median day, Gini).
Policy Model Top-20% median day Gini
Severity-first (A) CE baseline 9 0.760
SeisRank-Ord 8 0.754
Egalitarian (B) CE baseline 39 0.700
SeisRank-Ord 36 0.685
Quota+threshold (C) CE baseline 14 0.760
SeisRank-Ord 11 0.752

5.1 Efficiency outcomes

Table 5 summarizes efficiency-oriented results. Across all policies, SeisRank-Ord consistently outperforms the CE baseline. Under severity-first (Policy A), SeisRank-Ord reduces the AUC from 130,800.0 to 122,359.7 and advances the median response (T₅₀) from day 22 to day 19, indicating that critical buildings are typically served three days earlier. The tail coverage (T₉₀) also improves substantially (51→47), showing that even the hardest-to-reach cases are addressed sooner.

Under egalitarian allocation (Policy B), which does not explicitly leverage severity predictions, SeisRank-Ord still provides clear benefits: AUC drops from 205,900.0 to 198,156.0, and T₉₀ improves from 67 to 65. The two-day gain at the tail highlights that improved recognition accuracy positively impacts resource distribution, even when allocation is evenly enforced.

For quota + threshold (Policy C), the governance-aware model again demonstrates consistent advantages. SeisRank-Ord reduces AUC from 188,900.0 to 182,396.6 and shortens both the median response day (29→27) and the tail response (69→66). These improvements confirm that calibrated risk scores are particularly valuable in constrained regimes where both quotas and thresholds govern daily allocation. Fig. 4 (cumulative coverage curves) illustrates these efficiency gains: Policy A achieves the fastest coverage, Policy B is the slowest, while Policy C provides a balanced trade-off between responsiveness and quota fairness. Fig. 4 illustrates the cumulative coverage curves that characterize efficiency gains across the three allocation policies. Policy A achieves the fastest coverage, Policy B is the slowest, while Policy C provides a balanced trade-off between responsiveness and quota fairness. Baseline comparisons are summarized in Table 5.

Cumulative coverage curves of SeisRank-Ord under three allocation policies.
Fig. 4.
Cumulative coverage curves of SeisRank-Ord under three allocation policies.

5.2 Fairness outcomes

Having established efficiency improvements, we next examine fairness metrics. Table 6 highlights fairness-oriented results. Under Policy A, SeisRank-Ord advances the top-20% median response by one day (9 → 8) while slightly reducing inequality (Gini 0.760 → 0.754). This shows that severity-driven prioritization not only improves efficiency but also maintains fairness.

Policy B demonstrates the largest fairness improvement: the Gini index drops from 0.700 to 0.685, and top-20% high-risk buildings are reached three days earlier (39 →36). Even under egalitarian allocation, governance-aware objectives reduce response inequality and ensure that the most critical cases are not delayed.

In Policy C, SeisRank-Ord outperforms the baseline across both fairness metrics. The most critical 20% of buildings are served earlier (14 → 11), and the Gini index is reduced from 0.760 to 0.752. These improvements are significant because quota systems are frequently adopted in practice, and ensuring equitable allocation under such constraints directly strengthens governance legitimacy. Fig. 5 further highlights the fairness–efficiency trade-offs by showing the uncovered-damage trajectories across the three allocation policies. SeisRank-Ord consistently lowers residual damage throughout the response period, especially in the high-risk tail.

Uncovered-damage trajectories of SeisRank-Ord under three allocation policies.
Fig. 5.
Uncovered-damage trajectories of SeisRank-Ord under three allocation policies.

The simulations demonstrate that SeisRank-Ord delivers consistent benefits across all policies, advancing both efficiency (earlier median and tail coverage, lower uncovered-damage AUC) and fairness (lower Gini, earlier top-20% service). While Tables 5 and 6 provide detailed numerical comparisons against the CE baseline, Figs. 4 and 5 focus exclusively on SeisRank-Ord to clearly illustrate efficiency and fairness dynamics. Specifically, cumulative coverage curves (Fig. 4) emphasize improvements in median and tail response times, while uncovered-damage trajectories (Fig. 5) highlight systematic reductions in long-tail risk. Crucially, these gains emerge from aligning model training with governance principles rather than simply optimizing nominal accuracy, underscoring that governance-aware objectives can directly improve disaster response by serving high-risk structures earlier and distributing aid more equitably.

5.3 Governance insights

The policy simulations offer several actionable insights for earthquake disaster governance. First, severity-first prioritization proves most effective in accelerating the service of critically unsafe buildings. With SeisRank-Ord, the median response to high-risk structures is advanced by three days compared to the baseline, and the long-tail coverage is shortened by four days. In operational terms, this implies that search-and-rescue teams could identify and stabilize severely damaged or collapsed buildings earlier, reducing secondary casualties and enabling faster triage of affected communities. Second, even under egalitarian allocation, governance-aware modeling enhances equity without sacrificing responsiveness. Although resources are distributed broadly in this regime, SeisRank-Ord lowers inequality (Gini reduced by 0.015) and reaches the top 20% of high-risk cases three days earlier. This indicates that fairer allocation does not necessarily conflict with efficiency when prediction models are aligned with governance priorities. In practice, such improvements could support more transparent and publicly accepted disaster response strategies, particularly in regions where equity concerns are politically salient. Third, quota-based allocation benefits from calibrated severity scores. Many real-world disaster response systems impose quotas or thresholds due to logistical constraints. In our simulations, SeisRank-Ord improves both responsiveness and equity under these conditions: high-risk buildings are reached three days earlier, and the distribution of service becomes more equitable (Gini reduced by 0.008). These improvements are especially relevant to earthquake-prone developing regions where limited capacity amplifies the consequences of misallocation. By complementing institutional mechanisms, governance-aware recognition ensures that quota rules do not inadvertently disadvantage the most vulnerable groups. Taken together, these findings emphasize that technical refinements in recognition models can translate directly into governance outcomes. By embedding operational principles into the learning objective, SeisRank-Ord enables disaster response systems to become not only more accurate but also more just, transparent, and aligned with societal priorities. These findings underscore that efficiency–fairness trade-offs are not merely numerical differences but policy choices. In real emergency response, prioritizing speed may reduce residual damage, while prioritizing fairness may strengthen institutional legitimacy and social stability. SeisRank-Ord thus serves as a bridge between algorithmic recognition and governance strategy, supporting more transparent and accountable disaster management decisions.

6. Conclusions

This study proposed SeisRank-Ord, a governance-aware objective for seismic building damage recognition. Unlike conventional approaches that optimize nominal accuracy, SeisRank-Ord integrates three operational principles: preserving the ordinal nature of damage grades, penalizing underestimation more than overestimation, and concentrating learning pressure near decision thresholds. By unifying these mechanisms within a single loss, the model produces severity scores that are not only accurate but also aligned with disaster governance priorities.

Extensive experiments on the Yushu and Ludian earthquake datasets demonstrate the effectiveness of the approach. Across multiple backbones and task settings, SeisRank-Ord consistently improves overall accuracy, Cohen’s kappa, and mean squared error relative to cross-entropy. Ablation studies further confirm the contribution of each component, showing that the full objective provides the strongest alignment with the ordinal and asymmetric characteristics of seismic damage recognition.

Policy simulations extend these improvements to governance outcomes. Under severity-first, egalitarian, and quota-based allocation regimes, SeisRank-Ord advances both efficiency by reducing AUC and shortening median and tail response times, and fairness by lowering Gini indices and accelerating the service of high-risk buildings. These gains translate into actionable governance insights: critical structures can be identified and served earlier, resource allocation becomes more equitable, and institutional rules such as quotas are supported rather than undermined.

Overall, the results underscore the importance of embedding governance principles directly into model design. SeisRank-Ord demonstrates that technical innovation in recognition systems can yield measurable improvements in both operational efficiency and social fairness, bridging the gap between machine learning performance and disaster governance impact. Future work may extend this framework to other hazards and multi-modal data sources, broadening the scope of governance-aware AI for crisis management.

CRediT authorship contribution statement

An Zhang: Writing - original draft, conceptualization. Zihao Wang: Writing - review & editing, visualization. Sheng Chen: Writing - review & editing.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data availability

Data will be provided upon reasonable request.

Declaration of generative AI and AI-assisted technologies in the writing process

The authors confirm that they have used artificial intelligence (AI)-assisted technology for assisting in the writing or editing of the manuscript or image creations.

Funding

This work was partially supported by the National Natural Science Foundation of China (Grant No.72274026), the National Social Science Fund of China Major Project (Grant No. 24ZDA036), and the Funda-mental Research Funds for the Central Universities (Project Nos. 2020CDJSK01WT07, 2021CDJSKPT05.and 2023CDJSKPT04).

References

  1. , . Machine learning-based classification for rapid seismic damage assessment of buildings at a regional scale. J Earthquake Eng. 2024;28:1861-1891. https://doi.org/10.1080/13632469.2023.2252521
    [Google Scholar]
  2. , , , . Hybrid amplitude ordinal partition networks for ECG morphology discrimination: An application to PVC recognition. IEEE Trans Instrum Meas. 2024;73:1-13. https://doi.org/10.1109/tim.2024.3400307
    [Google Scholar]
  3. , , , , . Decoupled differentiable graph neural architecture search. Inf Sci. 2024;673:120700. https://doi.org/10.1016/j.ins.2024.120700
    [Google Scholar]
  4. , . Shuffle-CDNet: A lightweight network for change detection of bitemporal remote-sensing images. Remote Sensing. 2022;14:3548. https://doi.org/10.3390/rs14153548
    [Google Scholar]
  5. , , . Seismic damage assessment of a historic masonry building under simulated scenario earthquakes: A case study for Arge-Tabriz. Soil Dyn Earthq Eng. 2021;147:106732. https://doi.org/10.1016/j.soildyn.2021.106732
    [Google Scholar]
  6. , , . Deep learning-based scene classification of very high-resolution satellite imagery for post-earthquake damage assessment: A case study of the 2023 Kahramanmaraş earthquakes. Int Arch Photogramm Remote Sens Spatial Inf Sci XLVIII-4/W9-2024 2024:249-256. https://doi.org/10.5194/isprs-archives-xlviii-4-w9-2024-249-2024
    [Google Scholar]
  7. , , , . Video comprehension-based approach for seismic damage recognition of freestanding non-structural components. Eng Struct. 2024;308:118034. https://doi.org/10.1016/j.engstruct.2024.118034
    [Google Scholar]
  8. , . Cost-sensitive classification with cost uncertainty: do we need surrogate losses? Mach Learn. 2025;114:132. https://doi.org/10.1007/s10994-024-06634-8
    [Google Scholar]
  9. , , , , , . Transfer learning for improving seismic building damage assessment. Remote Sensing. 2022;14:201. https://doi.org/10.3390/rs14010201
    [Google Scholar]
  10. , , . A rapid seismic damage assessment (RASDA) tool for RC buildings based on an artificial intelligence algorithm. Appl Sci. 2023;13:5100. https://doi.org/10.3390/app13085100
    [Google Scholar]
  11. , , . A digital twin framework with MobileNetV2 for damage detection in slab structures. Fracture and Structural Integrity. 2025 https://doi.org/10.3221/IGF-ESIS.72.09
    [Google Scholar]
  12. , , , , . Evaluation of cost-sensitive learning models in forecasting business failure of capital market firms. Mathematics. 2025;13:368. https://doi.org/10.3390/math13030368
    [Google Scholar]
  13. , . Fast fake: Easy-to-train face swap model. Appl Sci. 2024;14:2149. https://doi.org/10.3390/app14052149
    [Google Scholar]
  14. , , , , , . HOPE: Hybrid-granularity ordinal prototype learning for progression prediction of mild cognitive impairment. IEEE J Biomed Health Inform. 2024;28:6429-6440. https://doi.org/10.1109/JBHI.2024.3357453
    [Google Scholar]
  15. , , , , , . Identification of the surface cracks of concrete based on ResNet-18 depth residual network. Appl Sci. 2024;14:3142. https://doi.org/10.3390/app14083142
    [Google Scholar]
  16. , , , , , . A two-stage seismic damage assessment method for small, dense, and imbalanced buildings in remote sensing images. Remote Sensing. 2022;14:1012. https://doi.org/10.3390/rs14041012
    [Google Scholar]
  17. , , , . Disaster recognition and classification based on improved ResNet-50 neural network. Appl Sci. 2025;15:5143. https://doi.org/10.3390/app15095143
    [Google Scholar]
  18. , , , , , . Seismic damage recognition of structural and non-structural components based on convolutional neural networks. J Build Eng. 2025;102:112012. https://doi.org/10.1016/j.jobe.2025.112012
    [Google Scholar]
  19. , , , , . Automatic feature selection and weighting in molecular systems using differentiable information imbalance. Nat Commun. 2025;16:270. https://doi.org/10.1038/s41467-024-55449-7
    [Google Scholar]
  20. , , , , , , . Physics-aware parameter diffusion for spatio-temporal continuum field modeling for governance. Neurocomputing. 2025;655:131262. https://doi.org/10.1016/j.neucom.2025.131262
    [Google Scholar]
Show Sections