Abstract
INTRODUCTION. Acute cholangitis (AC) is associated with a high mortality even among treated patients. Therefore, early diagnosis and treatment are important for improving prognosis. We investigated the discriminatory ability of the Tokyo Guidelines 2018 (TG18) regarding the presence of inflammation (A criterion) and affected liver parameters (B criterion) as a guide to diagnose AC among unselected patients in the emergency department (ED).
METHODS. The study was designed as a cohort study of adult ED patient visits in the Region of Southern Denmark. The cohort included patient visits collected from seven EDs between 1 January 2016 and 20 March 2018. We examined the diagnostic accuracy of the TG18 A and B criteria. This project was approved by the Danish Patient Safety Authority (identifier 3-3013-2272/1). The Region of Southern Denmark authorised data storage (identifier 17/24904, amendment identifier 20/24502).
RESULTS. We included 202,881 ED patient visits, of which 19,816 met the TG18 A and B criteria. A total of 517 patient visits had a discharge diagnosis compatible with AC. The TG18 had a sensitivity of 85.1% (95% CI: 81.7-88.1%) and a specificity of 90.4% (95% CI: 90.3-90.6%).
CONCLUSIONS. The TG18 A and B criteria demonstrated high diagnostic accuracy for predicting AC, even among unselected adult patients in the ED, and were effective at identifying patients who required radiological imaging to confirm or rule out suspected AC.
FUNDING. None.
TRIAL REGISTRATION. Not relevant.
Acute cholangitis (AC) is an infection of the bile ducts, often triggered by gallstone obstruction or biliary strictures. Detecting AC in the emergency department (ED) is important because mortality may approach 10% despite treatment [1, 2]. Late endoscopic retrograde cholangiopancreatography (ERCP) past 24-48 hours is associated with longer hospitalisation, persistent organ failure, increased risk of death and readmission within 30 days [3, 4].
The Tokyo Guidelines 2018 (TG18) define AC and share similarities with Charcot’s triad [4, 5]. The TG18 are based on three categories: A) Systemic inflammation: presence of fever or inflammatory response. B) Cholestasis: jaundice or abnormal liver-function blood samples. C) Imaging: biliary dilatation or evidence of aetiology on imaging. The diagnosis should be considered if at least one criterion from group A is present along with an additional criterion from either group B or C [5, 6]. Therefore, early diagnostic imaging may be important for patients who meet the A and B criteria.
The TG18 has demonstrated high diagnostic accuracy for identifying AC among selected populations at high risk of AC, but its diagnostic accuracy among unselected patients in the ED remains unknown [7-9].
The aim of this study was to investigate the discriminative ability of the TG18 criteria A and B for predicting AC among unselected ED patients, with the aim of clarifying the need for radiological imaging. The research objectives were as follows: Firstly, to investigate how many ED patient visits meeting the TG18 A and B criteria result in an AC diagnosis at discharge; secondly, to examine the discriminatory ability of the TG18 A and B criteria in the subgroup of patients whose presenting symptom was abdominal pain; thirdly, to assess the discriminatory ability of the TG18 A and B criteria in the subgroup of patients whose presenting symptom was fever.
Methods
Study design and setting
This was a register-based, multicentre, cohort study of ED patient visits in the Region of Southern Denmark (RSD), covering a population of 1.2 million citizens [10].
The study utilised a pre-existing dataset from seven EDs, encompassing patient visits between 1 January 2016 and 20 March 2018. The EDs offered 24-hour care, and patients arrived by ambulance, were referred by primary care physicians or presented as self-addressers. Patients arriving in the EDs were registered in different categories based on their primary presenting symptoms from a limited number of predefined possibilities [2]. In Denmark, patients' discharge diagnoses are recorded as International Classification of Diseases, tenth version (ICD-10) codes. Further information about the database is provided in Arving et al. [2]. Data are presented according to the Standards for Reporting of Diagnostic Accuracy Studies [11].
Participant selection
We included patients ≥ 18 years who arrived at a hospital ED in the RSD. We excluded patients without laboratory-based tests (blood samples) with at least one liver parameter (concentrations of plasma bilirubin, alkaline phosphatase (ALK phos), alanine aminotransferase (ALT), aspartate aminotransferase, γ-glutamyl transferase (GGT)) or at least one infection parameter (CRP) or white blood cell count (WBC)). Registered patient visits without a civil registration number (CRN) were excluded (Figure 1).
Definition of the Tokyo Guidelines 2018 criteria
In this study, the TG18 criterion A was considered positive if the body temperature was ≥ 38 °C, WBC < 4 × 109/l or > 10 × 109/l or CRP > 10 mg/l [6, 9]. Criterion B was met if the patients presented with jaundice, defined as a plasma bilirubin ≥ 2 mg/dl or abnormal liver values, which was defined as either one of the following liver parameters: ALK phos, ALT, aspartate aminotransferase (AST) or gamma-glutamyl transferase (GGT) > 1.5 × upper normal limit. We used the following cut-off values: ALT > 105 U/L, ALK phos > 105 U/l and GGT > 172 U/l [6, 8].
Variables and data sources
Patient data about presenting symptoms, vital parameters, ERCP and percutaneous transhepatic cholangiography (PTC) (Supplementary - Appendix B) and intensive care unit (ICU) stay were all retrieved from the regional electronic patient journal. Using the Danish Civil Registration System and the Danish National Patient Registry, we gathered data on length of hospital stay, 30-day mortality, discharge diagnoses including AC, and comorbidity using ICD-10 diagnoses. (Supplementary - Appendix A, C) [12, 13]. The level of comorbidity was quantified using the Charlson Comorbidity Index, which included data collected in the 10 years preceding the index date [14]. Information about laboratory-based tests was drawn from the Laboratory Automation Systems.
Statistical methods
All data were summarised as patient visits, unless otherwise stated. Proportions were presented with 95% CI. Continuous variables were presented as medians or means. A two-sided p-value < 0.05 was considered statistically significant. Patient characteristics and outcomes were compared between patients meeting the TG18 A and B criteria (TG18+) and patients who did not meet these criteria (TG18–). We calculated the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and positive/negative likelihood ratios (LR+ and LR–) for the TG18 A and B criteria. The presence of an ICD-10 code for AC at hospital discharge was considered the gold standard for the AC diagnosis (Supplementary - Appendix A).
We performed two sensitivity analyses describing the discriminative ability of the TG18 A and B criteria for predicting AC in patients presenting with abdominal pain and fever, respectively. Missing data were handled using complete case analysis.
Ethics
This project was approved by the Danish Patient Safety Authority (identifier 3-3013-2272/1). The RSD authorised data storage (identifier 17/24904, amendment identifier 20/24502).
All data were stored, secured and managed in accordance with the General Data Protection Regulation and the Danish Data Protection Act [2, 15]. According to the Danish Act on Research Ethics Review of Health Research Projects, register-based studies are exempt from approval from the research ethics committee system [16].
Trial registration: not relevant.
Results
Study subject characteristics
During the inclusion period, 443,953 ED patient visits were registered. Following exclusion of patient visits without CRN (n = 67), no liver parameter in laboratory tests (n = 240,151), or no inflammatory parameters in laboratory tests (n = 854), a total of 202,881 patient visits were included for analysis (Figure 1). The median age was 65 years. A total of 517 (0.25%) patient visits had an AC diagnosis at discharge (Table 1).
Main results
A total of 19,816 patient visits met the TG18 A and B criteria. TG18+ had a sensitivity of 85.1% (95% CI: 81.7-88.1%), a specificity of 90.4% (95% CI: 90.3-90.6%), a PPV of 2.2% (95% CI: 2.0-2.4%) and a NPV of 99.96% (95% CI: 99.95-99.97%) for detecting AC (Table 2). Thirty-day mortality was 15%. A total of 440 (2.22%) visits had a diagnosis compatible with AC (Table 1).
For the sub-group of TG18+ patient visits presenting mainly with abdominal pain, the sensitivity of TG18+ was 83.5% (95% CI: 78.3-87.9%) and the specificity 88.5% (95% CI: 88.2-88.9%). Among these patient visits, fulfilment of both the TG18 A and B criteria was associated with a PPV of 5.0% (95% CI: 4.3-5.7%) and a NPV of 99.87% (95% CI: 99.82-99.90%) for predicting AC (Table 2).
For the sub-group of TG18+ patient visits mainly presenting with fever, the sensitivity of TG18+ was 90.9% (95% CI: 83.4-95.8%) and the specificity was 85.9% (95% CI: 85.3-86.4%). Among these patients, meeting both the TG18 A and B criteria was associated with a PPV of 3.8% (95% CI: 3.0-4.6%) and a NPV of 99.94% (95% CI: 99.88-99.97%) (Table 2).
The study found 183,065 patient visits that did not meet the TG18 A and B criteria (TG18–). These patients had a thirty-day mortality of 4.1%. In this group, 77 (0.04%) patient visits had a diagnosis compatible with AC (Table 1).
Discussion
We investigated the discriminatory ability of the TG18 A and B criteria in predicting AC to aid in identifying patients who require radiological imaging. Our results indicate that the positive TG18 A and B criteria are effective in identifying patients with AC, both in unselected patients in the ED and in patients presenting with abdominal pain or fever. For all patient groups, the TG18 A and B criteria had high levels of sensitivity, specificity and LR+. Our findings suggest that the TG18 A and B criteria are highly valuable for identifying patients who require radiological imaging for suspected AC, but also for ruling out AC in patients presenting with abdominal pain or fever who do not meet the TG18 A and B criteria. We therefore find it relevant to use the TG18 A and B criteria as a guide to decide whether to perform radiological imaging in patients who meet the TG18 A and B criteria. Almost 10% of the included patient visits meet the TG18 A and B criteria. This does not mean that everyone should be referred for imaging diagnostics, but rather that when these criteria are met, the diagnosis should be kept in mind and clinical judgment should be used, as a positive findings make the diagnosis significantly more likely.
Because our dataset did not include radiological findings, we were unable to evaluate the discriminative ability of TG18 when including all three criteria. It is highly likely that inclusion of the C criterion would improve the accuracy as reported in previous studies [5, 7, 8, 17]. However, the A and B criteria represent the only information readily available during the initial evaluation of patients in the ED.
We did not assess the individual predictive values of TG18 criteria A or B, which were considered clinically irrelevant and likely to yield low predictive values.
Previous studies have reported a lower sensitivity for Charcot’s triad and the Tokyo Guidelines 2007 (TG07) than for the TG18, supporting the use of TG18 over previous algorithms [7, 18-20].
The present study found a 30-day mortality rate of 9.3% in AC, which is comparable to previous reports [1, 2, 7]. Hou et al. and Navaneethan et al. reported that prognosis in terms of hospitalisation time, ICU admission and readmission within 30 days improved with prompt treatment with ERCP < 48 hours from admission [4, 5]. These findings underscore the importance of early diagnosis of AC to improve prognosis. The study found a 30-day mortality of 15.0% among the group that met the TG18 A and B criteria, suggesting that this group requires increased attention (Table 1).
We found a higher PPV for TG18+ in patients presenting with abdominal pain than among non-selected patients in the ED (5% versus 2.2%). Presence of abdominal pain was included in the TG07 but was excluded in the TG18 [5]. This study indicates that abdominal pain slightly increases the pre-test probability of having AC, which is why this symptom is still important to note in the patient assessment. Furthermore, this study found a significant difference between TG18+ and TG18– regarding abdominal pain and fever (p < 0.001).
There is likely significant overlap between the groups of patient visits with abdominal pain and with fever. This, however, is difficult to clarify because our database only records the patient’s primary presenting symptom upon arrival.
The register-based study design had limitations, including that the population was selected from a database and then analysed, which increased the risk of sampling bias. Furthermore, the limited number of patient visits with missing CRN could lead to potential selection bias. Yet these account for only 67 (< 0.02%) of the excluded visits. Therefore, a missing CRN is unlikely to affect the main results of the study (Figure 1). We did not exclude patients who were readmitted during the inclusion period, which could result in problems with dependent observations.
An additional limitation of this study, compared with others, is the lack of a definitive gold standard for AC. As described by Kiriyama et al, AC, unlike other diseases, lacks a specific organ or tissue finding that permits definitive pathological diagnosis [7]. This makes it more difficult to compare different studies on AC, as they do not necessarily all use the same standard.
Although the cohort is dated, the study remains relevant as the TG18 have not changed and the treatment method remains the same. Even with the limitations listed above, the very large number of patient visits with prospective and consecutive inclusion is a strength of this study.
Conclusions
The TG18 A and B criteria perform well and are a straightforward tool for identifying unselected patients at risk of AC in the ED. We believe the TG18 A and B criteria are very useful for identifying patients requiring imaging diagnostics to confirm a diagnosis of AC. Furthermore, the very high NPV (99.8%) provides a safe rule-out strategy for AC.
Correspondence Stig Borbjerg Laursen. E-mail: stig.laursen@rsyd.dk
*) Shared first authorship
Accepted 6 January 2026
Published 19 March 2026
Conflicts of interest none. All authors have submitted the ICMJE Form for Disclosure of Potential Conflicts of Interest. These are available together with the article at ugeskriftet.dk/dmj
Acknowledgements Niels Ibsgaard Agerbek, data manager at the Research Unit of Emergency Medicine, Odense, Denmark, played an essential role in data extraction. The authors take this opportunity to acknowledge the Open Patient Data Explorative Network (OPEN), where the data are registered and secured
References can be found with the article at ugeskriftet.dk/dmj
Cite this as Dan Med J 2026;73(4):A09250722
doi 10.61409/A09250722
Open Access under Creative Commons License CC BY-NC-ND 4.0
Supplementary material: https://content.ugeskriftet.dk/sites/default/files/2026-01/a09250722-supplementary.pdf
Referencer
- Tan M, Schaffalitzky de Muckadell OB, Laursen SB. Timing af endoskopisk retrograd kolangiopankreatografi ved akut kolangitis. Ugeskr Læger. 2018;180:V10170806
- Arvig MD, Mogensen CB, Skjøt-Arkil H, et al. Chief complaints, underlying diagnoses, and mortality in adult, non-trauma emergency department visits: a population-based, multicenter cohort study. West J Emerg Med. 2022;23(6):855-863. https://doi.org/10.5811/westjem.2022.9.56332
- Hou LA, Laine L, Motamedi N, et al. Optimal timing of endoscopic retrograde cholangiopancreatography in acute cholangitis. J Clin Gastroenterol. 2017;51(6):534-538. https://doi.org/10.1097/MCG.0000000000000763
- Navaneethan U, Gutierrez NG, Jegadeesan R, et al. Delay in performing ERCP and adverse events increase the 30-day readmission risk in patients with acute cholangitis. Gastrointest Endosc. 2013;78(1):81-90 https://doi.org/10.1016/j.gie.2013.02.003
- Gravito-Soares E, Gravito-Soares M, Gomes D, et al. Clinical applicability of Tokyo Guidelines 2018/2013 in diagnosis and severity evaluation of acute cholangitis and determination of a new severity model. Scand J Gastroenterol. 2018;53(3):329-334. https://doi.org/10.1080/00365521.2018.1430255
- Ely R, Long B, Koyfman A. The emergency medicine–focused review of cholangitis. J Emerg Med. 2018;54(1):64-72. https://doi.org/10.1016/j.jemermed.2017.06.039
- Kiriyama S, Takada T, Strasberg SM, et al. New diagnostic criteria and severity assessment of acute cholangitis in revised Tokyo Guidelines. J Hepatobiliary Pancreat Sci. 2012;19(5):548-556. https://doi.org/10.1007/s00534-012-0537-3
- Mohan R, Goh SWL, Tan GW, et al. Validation of Tokyo Guidelines 2007 and Tokyo Guidelines 2013/2018 criteria for acute cholangitis and predictors of in-hospital mortality. Visc Med. 2021;37(5):434-442. https://doi.org/10.1159/000516424
- Mohan R, Goh SWL, Tan GW, et al. Validation of TG07 and TG13/TG18 criteria for acute cholangitis and predictors of in-hospital mortality in patients over 80 years old. Clin Exp Hepatol. 2021;7(4):396-405. https://doi.org/10.5114/ceh.2021.110996
- Danmarks Statistik. FOLK1A: Befolkningen den 1. i kvartalet efter område, køn, alder og civilstand. www.statistikbanken.dk/FOLK1A
- Cohen JF, Korevaar DA, Altman DG, et al. STARD 2015 guidelines for reporting diagnostic accuracy studies: explanation and elaboration. BMJ Open. 2016;6(11):e012799. https://doi.org/10.1136/bmjopen-2016-012799
- Schmidt M, Pedersen L, Sørensen HT. The Danish Civil Registration System as a tool in epidemiology. Eur J Epidemiol. 2014;29(8):541-549. https://doi.org/10.1007/s10654-014-9930-3
- Lynge E, Sandegaard JL, Rebolj M. The Danish National Patient Register. Scand J Public Health. 2011;39(7 suppl):30-33. https://doi.org/10.1177/1403494811401482
- Charlson ME, Pompei P, Ales KL, et al. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373-383. https://doi.org/10.1016/0021-9681(87)90171-8
- European Union. Regulation (EU) 2016/679 of the European Parliament and of the Council of April 27, 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data. European Union, 2016. https://eur-lex.europa.eu/eli/reg/2016/679/oj
- UN. Act No. 593 relative to ethical medical research. UN, 2011. https://leap.unep.org/countries/dk/national-legislation/act-no-593-relative-ethical-medical-research
- Kiriyama S, Takada T, Strasberg SM, et al. TG13 guidelines for diagnosis and severity grading of acute cholangitis (with videos). J Hepatobiliary Pancreat Sci. 2013;20(1):24-34. https://doi.org/10.1007/s00534-012-0561-3
- Dinc T, Kayilioglu SI, Coskun F. Evaluation and comparison of Charcot's triad and Tokyo Guidelines for the Diagnosis of Acute Cholangitis. Indian J Surg. 2017;79(5):427-430. https://doi.org/10.1007/s12262-016-1512-z
- Sun G, Han L, Yang Y, et al. Comparison of two editions of Tokyo Guidelines for the management of acute cholangitis. J Hepatobiliary Pancreat Sci. 2014;21(2):113-119. https://doi.org/10.1002/jhbp.9
- Yokoe M, Takada T, Mayumi T, et al. Accuracy of the Tokyo Guidelines for the diagnosis of acute cholangitis and cholecystitis taking into consideration the clinical practice pattern in Japan. J Hepatobiliary Pancreat Sci. 2011;18(2):250-257. https://doi.org/10.1007/s00534-010-0338-5