key: cord-0031953-lpbsl1h6
authors: Martinez-Sanchez, Luisa; Cobbaert, Christa M.; Noordam, Raymond; Brouwer, Nannette; Blanco-Grau, Albert; Villena-Ortiz, Yolanda; Thelen, Marc; Ferrer-Costa, Roser; Casis, Ernesto; Rodríguez-Frias, Francisco; den Elzen, Wendy P. J.
title: Indirect determination of biochemistry reference intervals using outpatient data
date: 2022-05-19
journal: PLoS One
DOI: 10.1371/journal.pone.0268522
sha: 4207f9468cfe580cb70f19a55e02bed629873268
doc_id: 31953
cord_uid: lpbsl1h6

The aim of this study was to determine reference intervals in an outpatient population from Vall d’Hebron laboratory using an indirect approach previously described in a Dutch population (NUMBER project). We used anonymized test results from individuals visiting general practitioners and analysed during 2018. Analytical quality was assured by EQA performance, daily average monitoring and by assessing longitudinal accuracy between 2018 and 2020 (using trueness verifiers from Dutch EQA). Per test, outliers by biochemically related tests were excluded, data were transformed to a normal distribution (if necessary) and means and standard deviations were calculated, stratified by age and sex. In addition, the reference limit estimator method was also used to calculate reference intervals using the same dataset. Finally, for standardized tests reference intervals obtained were compared with the published NUMBER results. Reference intervals were calculated using data from 509,408 clinical requests. For biochemical tests following a normal distribution, similar reference intervals were found between Vall d’Hebron and the Dutch study. For creatinine and urea, reference intervals increased with age in both populations. The upper limits of Gamma-glutamyl transferase were markedly higher in the Dutch study compared to Vall d’Hebron results. Creatine kinase and uric acid reference intervals were higher in both populations compared to conventional reference intervals. Medical test results following a normal distribution showed comparable and consistent reference intervals between studies. Therefore a simple indirect method is a feasible and cost-efficient approach for calculating reference intervals. Yet, for generating standardized calculated reference intervals that are traceable to higher order materials and methods, efforts should also focus on test standardization and bias assessment using commutable trueness verifiers.

Introduction Specialists in clinical chemistry should provide accurate and useful information into their clinical laboratory reports. Reference intervals are commonly presented together with the actual analytical results. Their correct evaluation is crucial due to their use as a clinical decision-making tool [1] . Most manufacturers provide reference intervals in their technical documentation. According to ISO15189:2012, it is the responsibility of the laboratory to either validate them, find reference intervals from other sources or calculate the appropriate reference intervals for their method and population. Two different approaches to calculate reference intervals could be used: (a) The procedure recommended by the International Federation of Clinical Chemistry (IFCC), known as the direct method and [2, 3] (b) an alternative approach, known as the indirect method [4] .

The direct approach uses a bottom-up strategy. In this sense, the reference population will be analysed in detail in order to unravel their characteristics and then a realistic "model" will be constructed to derive the distribution of the reference population and the reference intervals. This methodology has been widely used and standardized [2] , but it is laborious and expensive. In addition, it struggles with selection bias, in combination with subjective terms as "reference population" and "health" [5] . As an alternative approach, the indirect method uses a top-down approach. It starts by acquiring a general overview of the total population by analysing clinical data from the laboratory information system (LIS) and, from this, filtering to uncover the distribution of the reference population and the reference intervals. This approach has several advantages, since 'big' analytical data is more accessible nowadays [4] . Automation has increased in clinical laboratories. This has resulted in the centralization of medical tests from a big geographical area around Vall d'Hebron into a single LIS, which guarantees a common diagnostic test process and a similar data structure for extraction [6] .

As a result of differences between reference intervals provided by different manufacturers and individual efforts to verify or select them from the literature, reference intervals vary per laboratory potentially resulting in unequal treatment and patient harm [7] . Standardization and harmonization efforts, which are currently successfully employed in several countries, are necessary to improve presentation and interpretation of laboratory results [8] [9] [10] [11] [12] [13] [14] . In the Netherlands, we previously determined nationally standardized reference intervals for clinical chemistry tests using an indirect "big data" approach [14] . A simple and straightforward workflow using the same approach is presented in this work. First, we determined indirect reference intervals using the NUMBER approach in a dataset of routine clinical chemistry values of the Vall d'Hebron laboratory population in Barcelona. The clinical laboratory Vall d'Hebron is the result of a fusion between three laboratories of the Catalan Institute of Health in Barcelona in 2014. It processes between 15,000 and 18,000 samples a day and covers a population of 1.2 million people, resulting in a very large amount of medical test results a year. This provided us with a unique opportunity to use only the data of a single laboratory using one single method to establish reference intervals, which is very important given the lack of harmonization in Spain [15] . Secondly, for those tests that are internationally standardized and produce test results traceable to standards and/or methods of higher order, we compared the reference intervals obtained from this study with the results published in the first NUMBER project in the Netherlands [14] . Finally, the reference intervals for creatinine kinase and uric acid were investigated, since no consensus was obtained yet in the NUMBER project [14] .

We extracted anonymized medical test results from individuals visiting general practitioners, analysed from January 1 st 2018 until and including 31 st of December 2018 in the Clinical Laboratory Vall d'Hebron from the LIS. The presented study was considered suitable from the point of view of ethics and science by the corresponding Clinical Research Ethics Comittee.

We included test results from patients visiting primary care centres, employees analytical control centres, sexual and reproduction centres and geriatric centres. Test results were excluded when phlebotomy was performed in the hospital (inpatients), drug addiction centres, mental health centres, external emergency centres, the prison women centre, or at home (e.g. when primary care patients could not visit the laboratory due to illness) since we expected substantial differences in health status in these settings that can add noise to the data [4] . We performed sensitivity analyses to compare the distribution between all the included centres, showing no signs of sample or sex bias between centres (results not shown).

Samples were collected from 62 blood collection centres and were transported via 8 different routes to the laboratory. Serum tubes for biochemistry tests included separating gel and coagulation activator (BD Vacutainer 1 ). Phlebotomy order of draw was always performed as advised by the EFLM pre-analytical workgroup [16] . The samples were transported to the laboratory in cool boxes with a temperature monitoring system. After arriving in the laboratory, the samples were centrifuged either 12 minutes at 3,500 rpm (2,438 g) when handled manually or 10 min at 3,000 rpm (2,113 g) when on the track.

Eighteen biochemistry tests were measured on three parallel AU5800 chemistry analysers (Beckman Coulter 1 ). Detailed descriptions of the methods and the recommended reference intervals (calculated by direct approaches) according to Beckman's IFU are presented in S1 Table. Tests included: albumin (CRM470 traceable), calcium (NIST-SRM-909bL1 traceable), creatinine (NIST-SRM-967 L1 traceable), lactate dehydrogenase (LDH) (not traceable to higher order reference material (NTRM)), magnesium (NIST-SRM-909bL2 traceable), anorganic phosphate (NTRM), total bilirubin (NIST-SRM-916a traceable), total protein (NIST-SRM-927c traceable), uric acid (traceable to isotope dilution Mass Spectrometry), urea (NIST-SRM-909bL1 traceable), chloride (NIST-SRM-919 traceable), potassium (NIST-SRM-918 traceable), sodium (NIST SRM-919 traceable), alkaline phosphatase (ALP) (NTRM), alanine aminotransferase (ALT) (NTRM), aspartate aminotransferase (AST) (NTRM), gamma glutamyltransferase (GGT) (traceable to IFCC reference method) and creatine kinase (CK) (traceable to IFCC reference method).

To assure the outpatient data quality, we first examined the monthly results from external quality control scheme from the Spanish Society of Clinical Chemistry (SEQC), basic biochemistry scheme. In this scheme, the results obtained in our laboratory are compared with the average calculated from every laboratory participating in the program using the same analytical method and/or instrument. When our result was within one time the standard deviation from other laboratories participating in the scheme using the same method, data from this particular month and test were accepted as valid. When our result was above or below three standard deviations, we excluded the data from that particular test and instrument for that month. When the result was between the second and third standard deviation, we analysed the daily average outpatient results for the particular test and month.

Daily averages were investigated to ensure longitudinal accuracy of the results over time. Averages were calculated per batch of 200 results a day and were compared with the average per month and year. Plots were visually inspected in order to decide whether the analytical quality was sufficient using the biological variation of the monthly and yearly mean as a reference and comparing it visually with the daily mean.

Finally, due to the lack of commutable trueness verifiers in 2018, we further validated the quality of the obtained reference intervals by using a new data extraction of test results from 2020. In 2020, our laboratory participated in the fortnightly EQA scheme from the Dutch EQA organizer Stichting Kwaliteitsbewaking Medische Laboratoriumdiagnostiek (SKML) which uses commutable and value-assigned trueness verifiers [17] . In all EQA reports, the Multi sample evaluation (MUSE) scores for all tests were > = 1 (meaning a total allowable error sigma value over 2), indicating adequate performance for all tests [18] . To verify the calculated reference intervals deduced from the 2018 data, outpatient data from July to October 2020 were selected, considering the same analytical and pre-analytical considerations explained previously for the main data. To that end, we designed an algorithm that computed 2,000 random samples of 200 test results each time. Next, for each random sample of 200 test results, we calculated the proportion of cases residing within the reference intervals deduced from the 2018 dataset. Then we calculated the mean of these 2000 proportions for each test. When the mean of the proportions (Prop.2020) was higher than 95 %, we considered the reference interval as valid. This protocol was based on the CLSI EP28-A3C for reference intervals transference modifying the sample number from 20 to 200 and repeating the protocol 2,000 times [2] .

To avoid pre-analytical issues that could confound the reference intervals, results from hemolyzed, lipemic and icteric samples were excluded when indices were > = 2 on a 0-5 scale (Beckman Coulter 1 AU5800, S2 Table) . In addition, since the icteric index could also be a good indicator for liver dysfunction, samples with icteric indices > = 1 were also excluded for total bilirubin, ALT, AST, ALP and GGT.

For the calculations on CK, individuals with AST results higher than decision limits in Vall d'Hebron laboratory (50 U/L in men and 35 U/L in women) were excluded, in order to exclude patients with skeletal muscle injury [19] .

Reference intervals were calculated per test using an automatic calculator programmed in R [20] (version 3.6.1), following the workflow presented in Fig 1. Firstly, we used the Tukey method [21] to identify and discard outliers. The lower and upper cut-offs for outlier exclusion were defined as Q1-(1.5xIQR) and Q3+(1.5xIQR), respectively, being Q1 the lower sample quartile, Q3 the upper sample quartile and IQR the interquartile range (Q3-Q1). The same workflow and outlier exclusion procedures were used as the ones described in the NUMBER project [14] , where outliers from biochemically related tests based on defined groups were excluded. Defined groups were:

• Electrolytes: calcium, chloride, potassium, sodium • Bone: calcium, magnesium, phosphate • Liver: alkaline phosphatase, GGT, ALT, AST, (total) bilirubin • Kidney: creatinine, urea • Proteins: albumin, total protein For calcium, two groups of tests were considered biochemically related. The histograms were visually inspected, and formal tests were performed (Z score for Skewness and Kurtosis) to determine the presence of a normal Gaussian distribution. Given the large numbers of test results, the formal tests of normality were very sensitive to a deviation from normality [22] . In such cases, visual inspection was considered decisive. If a normal distribution was absent, we performed a log transformation on the original data.

The reference intervals were calculated as mean plus/minus two times the standard deviation (mean ± 2SD) both for the total dataset and per subgroup when a minimum of 120 test 

results per group were available. Also 90% confidence intervals for the lower and upper limit were calculated. We used pre-defined subgroups analogous to the NUMBER project [14] : In addition, in order to test a recently published hypothesis [23] stating that certain differences between indirect studies may be due to diverse age representations into the age groups, sensitivity analyses with additional age categories were performed for ALT and GGT.

Per test and per group boxplots were visually inspected after outlier elimination in order to decide whether or not subgroup differentiated reference intervals were necessary. In addition, reference intervals results were compared with the reference limit estimator method employed by the group of Haeckel, Wosniok and Arzideh [24] using the same dataset.

Lastly, flagging rates were calculated to verify the clinical suitability of the reference intervals using an independent dataset (January-June 2019). The percentages of measurements below and above the lower and upper reference limits were calculated per test.

We extracted anonymized test results from a total of 530,778 clinical requests for a period of one year from the laboratory system of the Clinical Laboratory Vall d'Hebron University Hospital. After filtering by phlebotomy centre, 3.01% clinical requests were excluded. We discarded an additional 0.70% of the clinical requests because of hemolysis, 0.02% because of icteria, and 0.35% because of lipemia. The final dataset consisted of 509,408 requests.

Analytical performance, based on monthly external quality controls was adequate for SEQC material for all tests, except for ALP in December 2018. For this period, ALP results were excluded from the analyses. Daily average results showed stable performance over the year for all tests. In the S1 Fig we show an example for calcium.

Outlier exclusion by biochemically related tests ranged from 1.27 to 16.50%. Albumin, total protein, magnesium, phosphate, calcium, sodium, potassium and chloride followed a Gaussian distribution; for all other tests we obtained a Gaussian distribution after log transformation. The calculated reference intervals by the indirect approach are presented in Table 1 , stratified for sex and age categories, if necessary. Results from the reference interval quality verification protocol, tested with the new dataset from 2020 (110,237 clinical requests), are also presented in Table 1 , showing acceptable results (>95%) for all tests except for some age groups, particularly for creatinine and magnesium. Confidence intervals for the lower and upper limits per test are presented in S3 Table. In Table 2 , the obtained Vall d'Hebron reference intervals from the normally distributed tests are compared with results from the Dutch NUMBER project [14] . The kidney and liver parameters for both studies are graphically displayed in different age categories for men and women in Fig 2. Similar results for GGT were found when we increased the number of age categories (S2 Fig). In addition, results from the calculated reference intervals for creatine kinase and uric acid for the Vall d'Hebron hospital and the Dutch project are presented in Fig 3. The obtained Vall d'Hebron reference intervals for the normally distributed tests, compared with results from the Dutch NUMBER project, stratified for sex and age categories, if necessary. 

Application of big data to healthcare has been a matter of interest in recent years [25] . Consequently, in laboratory medicine, where quantitative data is generated every day, machine learning, data mining, business intelligence and related concepts are starting to be used for different purposes including analytical and quality management [25] . For the determination of reference intervals, for which classical (direct) recommendations are laborious and expensive, various statistical (indirect) methods have been developed using big data [4] . It is important to remark that some specialists are concerned about the possible bias due to the presence of unhealthy individuals in the dataset. Standard and detailed protocols following this approach are not available yet. However, the IFCC committee on Reference Intervals and Decision Limits (c-RIDL) recently recommended and promoted the development and assessment of indirect methods, stimulating future consensus for a harmonized indirect approach [26] .

In the present study, we calculated reference intervals in an outpatient population from Vall d'Hebron laboratory using the NUMBER approach created for calculating nationally standardized reference intervals for clinical chemistry tests in The Netherlands [14] . The normally distributed tests (Table 2) showed similar reference intervals between both studies and other previous projects such as the Canadian project CALIPER (direct method) [27] , the Australian and New Zeeland project ARIA (direct method) [8] , or the German projects (indirect methods) [23, 24] . This suggests that standardized tests allow global and common use of reference intervals and a straightforward indirect method could be a valuable approach for these normally distributed tests. The comparison of the results from this study with the reference limit estimator method (S4 Table and In this project, the upper reference limits for liver enzymes from the Dutch project were always substantially higher than the upper reference limits from Vall d'Hebron laboratory. We previously already hypothesized about potential explanations for the higher upper limits in the Netherlands [14] , as a result of the Dutch lifestyle and diet. The only IFCC-standardized method for liver parameters in our study was GGT and the differences for this test between Vall d'Hebron results in Barcelona and the NUMBER project could support this hypothesis. Alcohol consumption and increased body mass index have been related with higher ALT, GGT and AST results in the population from the Nordic Reference Interval Project (NORIP) [28] . Interestingly, in 2009, Strømme and colleagues, using data from the NORIP project, showed reference intervals results for ALT in northern Europe which are similar to our Dutch results [29] . They already highlighted the differences observed between the Nordic reference intervals and the reference intervals calculated for the Italian population, which in their turn are similar to the calculated reference intervals in our study for the population in Vall d'Hebron [29, 30] . In a recent publication, Wosniok et al. addressed these differences in calculated reference intervals from different studies for liver parameters [23] . They proposed it may be due to diverse age representations in the age groups. In order to test this hypothesis, we repeated the analyses for GGT, applying more age categories, in both the Vall d'Hebron and NUMBER datasets (S2 Fig). Since these results showed the same tendency, we consider differences in lifestyle a potential alternative hypothesis. In addition, the reference intervals for GGT are only significantly higher in the adult Dutch population (when diet or alcohol do start to play a role) and not in children, indicating a lifestyle component. The Mediterranean diet has been associated with favourable health outcomes [31] , and with decreasing levels of ALT, AST and GGT in patients with non-alcoholic fatty liver disease, supporting this hypothesis [32] . It is important to remark that the reference intervals for the liver parameters that were calculated using the reference limit estimator method (S4 Table and S3 Fig) were not as high in the Vall d'Hebron population as with NUMBER method, but were still higher than the reference intervals that are now commonly applied in clinical laboratories. This supports the idea that, for skewed distributions, it is still necessary to further explore the best indirect method for references interval calculation. For creatinine and urea, similar age distributions were found in the Vall d'Hebron outpatient sample compared to the Dutch national sample, even though the methodology for creatinine differed (Jaffe vs enzymatic, Fig 2) , which support earlier studies on the age related decline in renal function [33] .

Interestingly, for reasons yet unclear, in age group 19-50 years, for albumin, ALP, ALT, creatinine and urea, the resulting reference interval is usually smaller in male patients and the Prop. 2020 is always lower (<90%) when comparing to the results in female patients. No explanation was found for the significantly elevated reference intervals for CK and uric acid in the NUMBER project [14] , as the calculated reference intervals were substantially higher than those currently applied in the participating laboratories. In the Vall d'Hebron sample, we confirmed the Dutch observations and also found reference intervals higher than currently used and recommended for these tests. Nevertheless, compared with the Duch results, the upper limits of the reference intervals calculated in Vall d'Hebron laboratory were lower for all age groups for both CK and uric acid (Fig 3) . For CK, differences between currently used and calculated reference intervals are particularly extreme, which has been already observed in other studies [34, 35] . This finding might be explained by the high incidence of some related comorbidities such as metabolic syndrome or high blood pressure [36] together with the use of statins. For uric acid, the obtained higher limits in both studies are also higher than cut-off values associated with worse progression of kidney disease [37] and higher than the cut-off defined by the solubility limit of uric acid [14] .

Our analyses show important differences in flagging rates between the currently used reference intervals in Vall d'Hebron and the new calculated reference intervals in an independent dataset. In general terms, too much flagging is noted for currently used reference intervals. This highlights the need for establishing adequate reference intervals, as frequent flagging may distract attention from true pathological results [38] . In addition to that, we found, in general, higher flagging in our study compared to the Dutch NUMBER study which may be explained by the additional pre-analytical and clinical criteria used in the current study.

For some of the calculated reference intervals the confidence intervals for lower and upper limits (S3 Table) included only the reported limit, due to the large sample size, emphasizing the robustness of the presented results.

It is also important to remark that, in general, the results calculated with the NUMBER method and the Reference Limit Estimator method (S4 Table and S3 Fig) show in a great extent similar results across age group, but for a few laboratory tests there are some remarkable differences that deserve further study. Lower reference intervals were found with the Reference Limit Estimator method for GGT, creatinine and CK.

Our study has several strengths. First, compared to the direct method of establishing reference intervals, the applied automatic indirect approach is cost-efficient and avoids collecting and analysing material from healthy control donors. Second, it mimics preanalytical conditions of real samples. In addition, we had the unique opportunity to experiment with the Dutch NUMBER approach and to do head-to-head comparisons between the reference intervals obtained for the Dutch population with the reference intervals calculated in the Vall d'Hebron population for standardized tests. Lastly, results using the NUMBER method were also compared with the reference limit estimator method [24] using the same dataset.

We also acknowledge several limitations. First, since we used anonymous laboratory test results, clinical information was not available. Although we tried to select a healthy population as much as possible, test results from unhealthy persons may have been included in our datasets. Second, because of our completely anonymized databases, we did not exclude individuals visiting practitioners more than once a year leading to a possible bias. Third, structural monitoring with commutable, value-assigned trueness verifiers (type 1 EQA-materials) was not available in 2018. However, blinded type 1 EQA materials from the Dutch SKML were used in 2020, which is essential for proving metrological traceability of results from standardized test. By using a random sampling method with a dataset from 2020 we confirmed adequate analytical performance and verified the reference intervals calculated in the 2018 dataset. The COVID-19 pandemic and the resulting differences in patient population hampered us in using a dataset from 2020 to calculate the reference intervals. Fourth, we selected one statistical method (NUMBER method) to calculate reference intervals, and compared these with the reference limit estimator method [24] . Several statistical methods have been proposed so far but no consensus or official recommendations about 'which method to use when' are available yet [4] . We recommend that, on an international level, indirect (statistical) reference interval methods are compared, in order to reach consensus on criteria to decide which statistical method should be applied for which test. Given the comparable results between studies applying indirect methods to establish reference intervals, indirect methods are a promising tool for laboratories to develop cheap, specific and updated reference intervals.

In conclusion, using an indirect approach, we determined population-specific reference intervals for 16 biochemistry tests from the Vall d'Hebron region, some being more sex and age specific than in the product inserts. Reference intervals of normally distributed biochemical tests were comparable to those found in a Dutch outpatient sample, indicating that the indirect method is an appropriate approach for deducing reference intervals. In order to verify the applicability of SI-traceable reference intervals obtained by indirect methods across outpatient populations, equivalence of test results from SI-standardizable tests must be verified thoroughly using type 1 EQA-materials. To conclude, adequate implementation of common, metrologically traceable reference intervals is the ultimate goal for guaranteeing safe and clinically effective use of medical tests, as required by the upcoming EU IVD Regulation 2017/746. As a first step, method (Beckman)-and population (Vall d'Hebron region)-specific refined reference intervals were derived for biochemistry tests. Table were made just when the number of data per both methods were higher than 500. � Reference interval results calculated with less data than the recommended by the RLE method (4.000). (PDF) S1 

Evidence-based approach to harmonised reference intervals

EP28-A3c: Defining, Establishing, and Verifying Reference Intervals in the Clinical Laboratory; Approved Guideline-Third Edition

Protocol and standard operating procedures for common use in a worldwide multicenter study on reference values

Big data and reference intervals: rationale, current practices, harmonization and standardization prerequisites and future perspectives of indirect determination of reference intervals using routine data

Provisional recommendation on the theory of reference values (1978). Part 1. The concept of reference values

The role of total laboratory automation in a consolidated laboratory network

Pathology Harmony; a pragmatic and scientific approach to unfounded variation in the clinical laboratory

Harmonising adult and paediatric reference intervals in Australia and New Zealand: an evidence-based approach for establishing a first panel of chemistry analytes

Standardization in laboratory medicine: Adoption of common reference intervals to the Croatian population

Establishment of reference intervals of clinical chemistry analytes for the adult population in Saudi Arabia: a study conducted as a part of the IFCC global study on reference values

The Nordic Reference Interval Project 2000: recommended reference intervals for 25 common biochemical properties

Establishing reference intervals for major biochemical analytes for the Russian population: a research conducted as a part of the IFCC global study on reference values

Pediatric and adult reference interval harmonization in Canada: an update

NUMBER: Standardized reference intervals in the Netherlands using a "big data" approach

Impact of implementing a category 1 external quality assurance scheme for monitoring harmonization of clinical laboratories in Spain

Order of blood draw: Opinion Paper by the European Federation for Clinical Chemistry and Laboratory Medicine (EFLM) Working Group for the Preanalytical Phase (WG-PRE)

The quest for equivalence of test results: the pilgrimage of the Dutch Calibration 2.000 program for metrological traceability

Expressing analytical performance from multi-sample evaluation in laboratory EQA

R: a language and environment for statistical computing

An appraisal of statistical procedures used in derivation of reference intervals

Normality tests for statistical analysis: a guide for non-statisticians

A new indirect estimation of reference intervals: truncated minimum chi-square (TMC) approach

A plea for intra-laboratory reference limits. Part 2. A bimodal retrospective concept for determining reference limits from intralaboratory databases demonstrated by catalytic activity concentrations of enzymes

Concurrence of big data analytics and healthcare: a systematic review

Indirect methods for reference interval determination-review and recommendations

Biochemical marker reference values across pediatric, adult, and geriatric ages: establishment of robust pediatric and adult reference intervals on the basis of the Canadian Health Measures Survey

Evaluation of reference intervals for biomarkers sensitive to alcohol consumption, excess body weight and oxidative stress

Reference intervals for eight enzymes in blood of adult females and males measured in accordance with the International Federation of Clinical Chemistry reference system at 37 degrees C: part of the Nordic Reference Interval Project

Updated definitions of healthy ranges for serum alanine aminotransferase levels

Nordic diet, Mediterranean diet, and the risk of chronic diseases: the EPIC-Potsdam study

Intestinal permeability after Mediterranean diet and low-fat diet in non-alcoholic fatty liver disease

Structural and functional changes with the aging kidney

Are the currently used reference intervals for creatine kinase (CK) reflecting the general population? The Tromsø Study

Caveats in determining reference intervals for serum creatine kinase

Creatine kinase is associated with blood pressure

Serum uric acid and progression of kidney disease: a longitudinal analysis and mini-review

Opinion paper: deriving harmonized reference intervals-Global activities

We would like to thank Dr. Farhad Arzideh for his advice and assistance in the use of reference limit estimator.