Introduction

A considerable number of patients infected with SARS-CoV-2 develop severe symptoms leading to hospital admission, Intensive Care Unit (ICU) treatment and death from coronavirus disease 2019 (COVID-19) [1, 2]. Many studies have identified risk factors for a variety of outcomes. Older age, male sex and a higher burden from comorbidities [3,4,5], particularly chronic renal failure [6], turned out to be of major importance, in addition to generic factors such as treatment limitations [7], biomarkers, or radiological findings [8, 9] including their changes over time during hospital stay [5, 10,11,12,13]. The respiratory viral load, as an indicator of the magnitude of the challenge by the virus, could also be a risk factor. This load is obtained by nasopharyngeal swabs and commonly quantified via the Ct (cycle threshold) value needed in the reverse transcription polymerase chain reaction (RT-PCR) of specific genes of SARS-CoV-2 to achieve a pre-determined signal strength [14]. The higher is the Ct value, the lower the virus load, and vice versa [15]. A number of studies found the Ct value to be predictive for mortality and ICU admission [16,17,18]. However, respiratory virus load is known to change during the course of infection, indicating the degree of virus shedding in an individual patient. Based on these limitations as well as methodological concerns [19], it has been suggested that the value in predicting the severity of COVID-19 is limited [20], while computed tomography and laboratory markers have been shown to be other important predictors [5, 21]. To avoid the difficulties arising from the time course of the Ct value, we focused on the value obtained as early as possible at hospital admission.

For the prediction of clinical outcomes of COVID-19, several models have been proposed [22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39]. As far as an initial tentative assessment is envisaged, scores should be simple and rely only on information available at the time of admission. At later stages during the hospital stay, information about changes in biomarkers or radiological findings might become more important [40,41,42]. To our knowledge, none of the proposed simple scoring systems for early assessment includes the Ct value obtained at the time of hospital admission, although it might be expected that its inclusion improves the accuracy of prediction. Thus, we analyzed data from patients hospitalized due to COVID-19, to develop a simple scoring system for mortality risk based on information available at the time of hospital admission including the Ct value. For comparison, we also determined the association of the predictors with other clinical outcomes, including the need for high-flow nasal oxygen (HFNO) therapy, requirement of ICU therapy, and invasive ventilation (IV).

Methods

Study population

Data were collected in the RoMed hospitals located in Rosenheim, Bad Aibling, Prien am Chiemsee and Wasserburg. Patients admitted to one of these hospitals between February 1st and May 31st 2021 (primary study population) and giving informed consent to the study protocol were enrolled. For validation, we used data from patients admitted between December 15th 2020 and January 31st 2021. The study protocol was approved by the Ethical Committee of the University Hospital of the LMU Munich (number 21–0452).

Assessments

Patients were characterized according to age, sex and vital parameters such as body temperature, heart rate, blood pressure, oxygen saturation from pulse oximetry (SpO2) and respiratory rate upon admission. Comorbidities were recorded following the Charlson comorbidity index (CCI) without age, as age was used as a separate predictor. For patient characterization, we also used standard laboratory parameters, including leukocyte count, C-reactive protein (CRP), procalcitonin (PCT), hemoglobin, lactate dehydrogenase (LDH), creatinine, troponin, N-terminal pro b-type natriuretic peptide (NT-proBNP), D-dimer, alanine transaminase (ALT), aspartate aminotransferase (AST), gamma-glutamyltransferase (gamma-GT), lactate, in addition to the blood gas parameters of pH, arterial pressures of oxygen (pO2) and carbon dioxide (pCO2), and standard base excess (BE). The estimated glomerular filtration rate (eGFR) was determined from creatinine values according to the Chronic Kidney Disease-Epidemiology Collaboration (CKD-EPI) [43]. The presence of SARS-CoV-2 infection was determined via RT-PCR from a nasopharyngeal swab taken at admission in the Clinical Laboratory of the Rosenheim Hospital, using the E gene and standardized procedures, and the number of cycles required to achieve a pre-determined threshold was quantified by the Ct value. Samples were taken into PCR buffer containing guanidine hydrochloride, using an aliquot for RNA isolation by means of the MagMAX™ Viral/Pathogen Nucleic Acid Isolation Kit (ThermoFisher, Life Technologies, Schwerte, Germany) and a KingFisher™ Flex Nukleic Acid Extraction System. The reverse transcription PCR (RT-PCR) for the detection of SARS-CoV-2 RNA was performed via the ampliCube Coronavirus SARS-CoV-2 test (Mikrogen Diagnostik, Neuried, Germany). If sufficient material was available, the type of variant (wild type versus WHO alpha variant (B.1.1.7)) was determined by PCR kits for VirSNIP SARS-CoV-2 Spike del H69/V70 and VirSNIP SARS-CoV-2 Spike N501Y (TIB MOLBIOL, Berlin, Germany).

We also recorded medical treatment including the potential administration of systemic steroids, anti-coagulation therapy and platelet aggregation inhibitors, including ICU admission, HFNO therapy and IV. All patients were followed until discharge from a RoMed hospital, discharge from another hospital after transfer (as far as compatible with data security requirements), or death.

Outcomes

The primary study outcome was death, additionally stratified for ICU and non-ICU stay. Secondary outcomes were the need for HFNO, ICU treatment and the clinical requirement of IV.

Statistical analysis

Numbers and percentages, or median values and quartiles were used for the description of the study population. To assess relationships with outcome measures, contingency tables and Chi-square statistics, as well as multiple binary logistic regression analysis were used. We aimed to define all predictors as binary variables to facilitate their translation into a score. The cutoff values of continuous variables were determined by ROC (receiver operator characteristics) analysis and the Youden criterion for each of the outcomes. All analyses were performed with the statistical software SPSS (Version 26, Armonk, NY, USA), and the level of statistical significance (two-sided) was assumed at p < 0.05.

Results

Study population

The population was comprised of 289 patients (46.0% female), with median [quartiles] age of 66.0 [51.0; 78.5] years (Table 1). Further data on laboratory parameters including Ct values are shown in the Supplemental Table S1, while data for medication and treatment modalities are given in Table 2. The median (quartiles) duration of hospitalization was 8.9 (5.0; 14.9) days, and that of time to death 10.1 (5.0; 17.1) days. Regarding the variant of SARS-CoV-2, measurements were available in 73.4% of patients, showing that 7.3% of patients had the wild type and 66.1% the alpha variant.

Table 1 Basic characteristics of patients
Table 2 Treatment of patients

Mean (95% CI) Ct values for wild type, alpha variant and unknown were 26.8 (23.9; 29.8), 26.1 (25.2; 26.9) and 31.8 (30.2; 33.4) cycles, respectively. The time course of variants is illustrated in the Supplemental Figure S1, demonstrating that the dominance of the alpha variant was already established during the initial phase of the time period analyzed.

Analysis of risk factors

We analyzed the data for the outcomes HFNO, ICU admission, IV and death (total, ICU, non-ICU), using only information easily available at the time of hospital admission or shortly afterwards. As such information we considered age, sex, SpO2, the presence of oxygen supply, renal function quantified via eGFR, and the Ct value (for other predictors see sensitivity analyses). Age, eGFR and SpO2 had already been established as risk factors in previous studies [7], while the importance of Ct values for mortality is illustrated in Fig. 1.

Fig. 1
figure 1

Ct values of PCR for SARS-CoV-2 plotted against age, with patients deceased from COVID-19 marked by black rhombi. It is evident that the Ct value was predictive for death especially in patients of age >70 years. Ct values were taken as provided by the laboratory irrespective of possibly exceeding extreme values of 40. This was not relevant for the statistical analysis, since binary variables with cutoff values much below this extreme value were used

These predictors were used as binary scores for transformation into a scoring system, using optimal cutoff values determined by ROC analysis. Cutoff values of age (> 70 years), SpO2 (≤ 90%) and eGFR (≤ 60 mL/min) were the same for the outcomes death, HFNO, ICU admission and IV. Regarding the Ct value, the optimal cutoff value was ≤ 30 for the outcomes HFNO and ICU treatment, and ≤ 26 for IV and death (independent of ICU stay). The logistic regression analyses included these categorical variables, additionally the presence/absence of oxygen supply as predictors. Results are given in Table 3.

Table 3 Results of logistic regression analyses for the primary study population

The rounded integer values of the regression coefficients obtained for mortality (Table 3) were then used to define a novel additive risk score (Table 4). Using the coefficients for the number of all deceased patients, the following score points were chosen. We assigned 2 points for age > 70 years, 2 points for SpO2 ≤ 90%, 1 point for oxygen supplementation, 1 point for eGFR < 60 mL/min, and 2 points for a Ct value ≤ 26, each determined upon hospital admission. Table 4 also shows the numbers and percentages of patients, in whom the respective criterion was met, as well as sensitivity and specificity regarding death for each of the 5 items. For each patient, the score points could be summed up to obtain a risk score for mortality ranging from 0 to 8.

Table 4 Score points for each of the five categorical predictors of mortality showing the numbers and percentages of patients in whom the respective criterion was satisfied

The distribution in relation to mortality of this 5-item score is depicted in Fig. 2, indicating a markedly increasing risk for death at score values ≥ 4 or ≥ 5. It was virtually absent for score values lower than 4, but very high for values ≥ 6. Sum scores of ≥ 4 were associated with a sensitivity of 95.0% and a specificity of 72.5%, and scores ≥ 5 with values of 82.5% and 81.7% (p < 0.001 each). After computing a 4-item sum score without the Ct value, the distribution shown in the Supplemental Figure S2 was obtained, corresponding to a sensitivity and specificity of 90.2% and 68.2%, respectively, for a value ≥ 3, and of 68.3% and 87.7% for a value ≥ 4. For the 5-item score and cutoff values of ≥ 4 or ≥ 5, odds ratios for death for the 5-item score were 50 and 21, respectively, whereby the latter number reflects that fact that the number of false-positive predictions increased with higher cutoff value (see Fig. 2). This comparison of the 5-item and the 4-item score was valid, as logistic regression analysis for death without the Ct value did not result in markedly altered coefficients for the other predictors, thus the same score points could be taken. The superiority in prediction was confirmed by logistic regression analysis including both sum scores as simultaneous predictors; the score including the Ct value was predictive (p < 0.0001) that without Ct value not (p > 0.5).

Fig. 2
figure 2

Distribution of sum score values for mortality risk from five parameters available at the time of admission, including the Ct value (Table 3). It is evident that the cutoff value for prediction of mortality risk that yields high sensitivity and specificity can be chosen as 4 or 5 (for numeric values see “Results”). The patient with a score of 1 who died after 28 days of hospitalization with ICU admission, was a male ex-smoker of age 70 years with an BMI of 28 kg/m2, the comorbidities diabetes and systemic hypertension, and a Ct value of 35 at the time of admission; the single score point originated from the need for supplemental oxygen at the time of admission

Sensitivity analyses and validation

The CCI was omitted from the regression analysis as it would require a comprehensive assessment of clinical history which is often difficult upon admission. When performing additional analyses including the CCI, its predictive value was found to be absent or low (only relevant in case of HFNO). We also omitted sex as predictor since it turned out to be not relevant when added as a predictor to the five selected predictors. Furthermore, routine laboratory measures (Supplemental Table S1) were found not to be significant additional predictors when added as predictors to the five predictors given in Table 3, and the same was true for the blood gas parameters (Supplemental Table S1), as was true for vital signs such as heart rate and respiratory rate (Table 1). However, when analyzed separately without taking into account the five items of the score, the majority of laboratory parameters given in the Supplemental Table S1 differed between survivors and non-survivors (see legend to this table). Introducing the variant of SARS-CoV-2 (Supplemental Figure S1) as an additional categorical variable into the logistic regression analysis revealed that it was neither a significant predictor nor did it alter the significance level or the order of magnitude of the five basic predictors. The variant did, therefore, not interfere with the accuracy of prediction of death/survival.

To assess whether the score was critically dependent on the data used, we used data from patients admitted between December 15th 2020 and January 31st 2021. These patients showed characteristics very similar to those of the time period analyzed in the present study, and in addition it could be assumed that the pattern of virus variants was not much different from that observed in the primary study population (see Supplemental Figure S1). Data from 215 patients could be evaluated (47.9% female, median age [quartiles] 76 [59, 84] years). The coefficients of logistic regression analysis were similar to those of Table 3, except for the fact that the coefficient referring to the use of oxygen supplementation upon admission was smaller and no longer statistically significant. When applying the score system derived from the primary study population to these data, the distribution shown in the Supplemental Figure S3 was obtained. It can be seen that the score was again predictive in these patients. A score ≥ 4 was associated with a sensitivity of 85.7% and a specificity of 57.2%, and a score ≥ 5 with values of 71.4% and 70.5%, respectively.

Discussion

We analyzed data from patients hospitalized between February 1st and May 31st 2021 due to COVID-19 in southern Germany, aiming to define a simple risk score that could be calculated from basic parameters available at hospital admission. The primary outcome studied was death. It turned out that the PCR Ct value for the presence of SARS-CoV-2, obtained from nasopharyngeal swabs, was an integral element of this score, in addition to age above 70 years, SpO2 of 90% or lower, the presence of oxygen supplementation, and impaired renal function as indicated by eGFR ≤ 60 ml/min upon hospital admission. The relevant Ct value was 26 or less, indicating a high respiratory viral load. The proposed score for mortality risk (range 0–8) is simple to compute and yielded high predictive accuracy when using cutoff values of 4 or 5. Regarding the additional outcomes HFNO and admission to ICU, a Ct value of 30 or less became predictive.

Several risk scores for mortality from COVID-19 or the need for treatment modalities have been proposed so far [23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39], but none of them explicitly included the Ct value as a measure of respiratory viral load. The outcomes that have been analyzed ranged from ICU admission and respiratory failure to mortality. In accordance with our findings, age was identified as a significant predictor across studies [23,24,25,26,27,28, 30,31,32,33,34,35,36,37, 39]. If its influence was quantified via cutoff values, these mostly ranged between 60 and 75 years [25, 26, 30,31,32, 36, 37] and thus were not markedly different from the value of 70 years which we found adequate for all outcomes. Studies also included SpO2 as a predictor [24, 27, 30, 37, 39]. The present cutoff value of 90% upon admission was compatible with reported values under 94% [24, 26, 33, 37]. A number of patients already required supplementary oxygen before admission, thus ambient-air saturation values were not available. We found that the presence of oxygen supplementation upon admission could be easily incorporated as a predictor. While a saturation of > 90% was not counted in the score if achieved under supplementary oxygen, the fact that supplementary oxygen was administered was counted. Conversely, if SpO2 was ≤ 90% even with supplementary oxygen, the presence of supply increased the risk score. The two criteria were additive in the logistic regression analysis, without interaction between them, thus enabling them to be incorporated as additive contributions to the sum score.

In previous studies, elevated values of creatinine have been identified as relevant [24, 25, 27, 30], in accordance with our findings regarding eGFR. We favored the use of eGFR over creatinine as it includes a standardization for age and sex. In most previous analyses, the dependence of risk on creatinine was determined without giving explicit cutoff values or score points. The cutoff value of ≤ 60 ml/min that we identified as optimal is in line with clinical recommendations [43], moreover similar to cutoff values that can be derived from a proposed cutoff value of 1.2 mg/dl for creatinine [27], assuming men with age 65–70 years.

Regarding Ct values, we found different cutoff values for the prediction of HFNO and ICU admission compared to IV and death. The observation that the optimal cutoff for HFNO and ICU admission was 30 seemed to reflect the fact that values above 30 can be considered as indicating a low risk. Conversely, we found Ct values of 26 or below, reflecting a high virus load, to be most informative for IV and especially death from COVID-19. This is illustrated in Fig. 1 indicating that mortality risk was very low in patients with high Ct values. In addition, the prediction of death/survival was more precise when including the Ct value (Fig. 2 vs. Supplemental Fig. 2), indicating that in the range of high sum scores, prediction was better when including the Ct value. When comparing the odds ratios from adjusted logistic regression analysis with those from unadjusted contingency tables (Table 3), their similarity suggested little interference of Ct values with the other determinants of the outcomes, thereby supporting their additive contribution. This was favored by the use of the logarithmic coefficients for the scoring system, as these are naturally additive, in contrast to odds ratios which are multiplicative.

Different findings about the predictive value of the Ct value have been reported. There are systematic reviews [16, 18] indicating a predictive role for morbidity and mortality, but studies have also pointed to a number of weaknesses, such as inter-individual differences in sample-taking, which may result in varying outcomes for specimens [19]. Moreover, Ct values change over the course of the disease, which renders them dependent on the time of sampling [23, 44]. We only took values assessed at an early stage upon hospital admission, and the result supports the assumption that initial values were already sufficient for a satisfactory prediction of outcomes, especially death. Despite this, consecutive PCR tests could further improve the predictive power.

When using data from a 1.5-month time period immediately prior to that covered by the present study to assess the validity of the proposed score, its predictive value turned out still to be high, thereby indicating that it was robust and applicable beyond the cohort from which it was derived. The score proposed did not include standard laboratory parameters or blood gases assessed at admission. When these were added as predictors to the regression analyses, they were not significant and the prediction was not improved. If analyzed separately without taking into account other parameters or any adjustment, most of them (see Supplemental Table S1) were associated with the outcome death. As a consequence, our findings are fully compatible with those of other studies in which parameters such as CRP, LDH and D-dimer were found to be linked to COVID-19 outcomes; they only underline the comprehensive nature of the five items comprised in the clinical score proposed by us. We also found the virus variant to be not relevant for the score regarding any of the outcomes analyzed, however, this analysis was limited by the fact that the number of patients infected with wild-type SARS-CoV-2 decreased considerably during the study period.

Limitations

The present analysis was based on data of COVID-19 patients admitted between February 1st and May 31st 2021 in southern Germany, a period which was dominated by the alpha variant of SARS-CoV-2, and it is not clear to which extent the score can be extrapolated to other regions and virus variants. The analysis of the second sample in principle confirmed the predictive value of the score and the rank order of predictors but was limited by the small sample size that was a consequence of the time restriction required to establish comparability between samples. Occurrence of novel variants over time is to be expected and might require adjustment of the Ct cutoff value and/or the score points attributed to the PCR results. The fact, however, that the PCR information was additive relative to the information provided by the other four predictors, renders it likely that such adjustments, if needed, can easily be performed without disturbance of the scoring system. One might argue that the chosen secondary outcomes depend on clinical decisions and are not necessarily based on the same criteria in different locations. As we found secondary outcomes to be dependent on the same set of predictors as mortality risk, although with different coefficients, the present score might also be helpful to estimate the need for resource allocation upon hospital admission.

Conclusion

Using data from patients hospitalized due to COVID-19 between February 1st and May 31st 2021, we defined a simple mortality risk score from basic information available at the time of hospital admission. The proposed sum score for mortality risk ranges from 0 to 8 and is based on five simple predictors. Age > 70 years, SpO2 ≤ 90%, the need for supplemental oxygen, and an eGFR ≤ 60 ml/min were significant predictors; prediction was improved when additionally taking a Ct value ≤ 26 for SARS-CoV-2 into account. The five items corresponded to 2, 2, 1, 1, and 2 score points, respectively. Using threshold values of 4 or 5 for the sum score, sensitivity/specificity was 95.0%/72.5% and 82.5%/81.7%, respectively, with odds ratios for death > 20. The scoring system provides a quick and simple tool that may be used for initial risk assessment, prior to the availability of further data obtained during the hospital stay.