Skip to main content

Econometric estimation of WHO-CHOICE country-specific costs for inpatient and outpatient health service delivery



Policy makers require information on costs related to inpatient and outpatient health services to inform resource allocation decisions.


Country data sets were gathered in 2008–2010 through literature reviews, website searches and a public call for cost data. Multivariate regression analysis was used to explore the determinants of variability in unit costs using data from 30 countries. Two models were designed, with the inpatient and outpatient models drawing upon 3407 and 9028 observations respectively. Cost estimates are produced at country and regional level, with 95% confidence intervals.


Inpatient costs across 30 countries are significantly associated with the type of hospital, ownership, as well as bed occupancy rate, average length of stay, and total number of inpatient admissions. Changes in outpatient costs are significantly associated with location, facility ownership and the level of care, as well as to the number of outpatient visits and visits per provider per day.


These updated WHO-CHOICE service delivery unit costs are statistically robust and may be used by analysts as inputs for economic analysis. The models can predict country-specific unit costs at different capacity levels and in different settings.


Health planners concerned with evidence-informed decision making and resource allocation rely on high quality information regarding the resources needed to implement investment strategies, and comparing these against current and future budgetary constraints. Information on costs is essential to inform discussions around value for money and efficiency. Unfortunately cost data is sparse in many settings, especially in low- and middle income countries, where ambitious health targets are now being set for the 2030 Sustainable Development Goals (SDGs). Challenges include insufficient allocation and/or inefficient allocation of resources towards health priorities [1]. Having access to accurate and reliable information on the cost of health services can serve various purposes including discussions on affordability and financial sustainability, budgeting, cost-effectiveness and cost–benefit analysis [2].

Literature related to estimates on the so-called ‘unit costs’ of specific health interventions is a growing field, yet the transferability of such findings from one setting to another is limited [2, 3]. For example, the Access, Bottlenecks, Costs and Equity (ABCE) initiative from the University of Washington’s Institute for Health Metrics and Evaluation has been collecting primary data on health care facility costs. However, the focus has been limited to Ghana, Kenya, Uganda and Zambia and there is no intention to create generalized global estimates [4,5,6,7]. Similarly, the recently established Global Health Cost Consortium at the University of Washington ( intends to produce unit cost estimates that can be adapted to local settings, but its scope is limited to TB and HIV services, and will not provide cost estimates for general inpatient or outpatient care.

To our knowledge, the WHO-CHOICE (CHOosing Interventions that are Cost Effective) project is the only programme seeking to collect and standardise estimates of the costs related to facility-based service delivery, to compare these across countries, and to provide country-specific estimates of facility service costs as a global public good. WHO-CHOICE estimates are produced for all countries and can therefore be used in settings where no local data is available. Estimates are based on modeling of primary and secondary data, and are derived from a model that provides the best fit according to global data. Such models will not yield perfect predictions for all countries, and it is therefore maintained that whenever good quality country unit cost data is available from a representative sample, this should be used rather than using the CHOICE predictions.

Under WHO-CHOICE, WHO has collated facility cost data from countries since 2000. These data have served to inform estimates for country-specific costs related to health service utilization—with estimates for cost per inpatient day and outpatient visit. The service delivery unit cost estimates are used to support cost-effectiveness analysis within the WHO-CHOICE project, among other things.

  • Inpatient day The estimated cost of a hospital bed-day reflects only the “hotel” component of the hospital cost—i.e. it excludes the cost of drugs and diagnostic tests but includes costs such as personnel, capital infrastructure and equipment, laboratory, maintenance and other operational costs of the hospital, as well as food costs. The intent is to produce a measure that covers those components that are assumed to be standardised across different diseases and treatments.

  • Outpatient visit Similarly, WHO-CHOICE outpatient costs include components not specific to the disease or treatment, but those largely assumed to be standardised across disease conditions for which the care is provided: namely personnel, capital infrastructure and equipment, laboratory, maintenance and other operational costs of the health facility. In recognition of the fact that costs for equipment, maintenance etc., may vary depending on the setting in which care is provided, inpatient and outpatient care costs are estimated for different types of providers and levels of the health system.

The use of standardised estimates for the service delivery component of intervention costs ensures that interventions can be compared fairly, using consistent price assumptions. Inputs into the service production process (including technology, prices and production efficiency) change over time, and WHO-CHOICE seeks to regularly provide updated estimates. WHO-CHOICE estimates are particularly useful for low- and middle income countries that may not have such data readily available.

In this paper we describe a process whereby country data sets were gathered and multivariate regression analysis was used to explore the determinants of variability in unit costs, in order to produce updated WHO-CHOICE estimates.

The previous round of WHO-CHOICE results for facility cost are henceforth referred to as “the first round analysis” [3, 8, 9]. Estimates are publicly available ( and have been widely used by researchers, academics and analysts at both global and country level [10, 11]—for example publications see


The “Methods” section first describes the data collection process followed by the econometric analysis.

Data collection

As a first step a literature review was undertaken with the aim of identifying variables and methods that should be taken into account during data collection and analysis [12].

Experience from prior work suggested that on the ground ‘bottom-up’ facility-level estimation is not a cost-effective approach for producing large data sets, given the large costs involved with primary data collection. Instead we made use of existing data for secondary analysis. Cost data were gathered 2009–2010 through three mechanisms:

  1. a.

    Authors identified through the literature review were directly contacted;

  2. b.

    A public call went out for cost data; and

  3. c.

    Websites of public institutions were searched for publicly available data.

The majority of data was gathered from respondents to the public call. Most of the sources identified through the literature review did not report facility-specific unit costs estimates, but instead presented only the sample average. Therefore, data from only two studies identified though the literature review were ultimately included in the estimation dataset (Additional file 1: Annex S1). In-depth examination of available online public data bases revealed limited usefulness of such data as they generally lacked several of the variables required to inform the analysis. Only one database found on-line was considered useful for extracting data [13].

The public call for cost data was released and widely disseminated on leading academic and international development websites in early 2009. The public call asked respondents to provide datasets of unit costs from at least 30 health facilities at any level (primary, secondary, tertiary or a mix). The data request covered a range of indicators, including: average unit costs (per bed day, admission and outpatient visit); the breakdown of each unit cost estimate by input category (salary, drugs, other supplies, capital), the proportion of recurrent to total costs, the proportion of drugs to recurrent costs, the proportion of ancillary costs to recurrent costs; average and recurrent unit costs of laboratory tests and diagnostic procedures; various determinants of costs and efficiency (e.g. average length of stay, occupancy rate, bed turn over, number of medical staff per bed, number of outpatient visits per medical staff per day, number of beds); and utilization data (e.g. number of bed days, number of admissions, number of outpatient visits and number of ancillary services by type of service).

Respondents to the public call were sent a scoping questionnaire to assess the type of information available. Out of 60 proposals, and considering geographic and income-level representation, a total of 30 respondents were sent the final data collection form of which 27 provided final data sets and were remunerated. Most respondents worked at public institutions and had access to data on resource use per facility, which had originally been collected to inform provider payment schemes and/or to monitor health system performance.

A standard template was used for extracting data. The range of variables collected drew upon earlier work, [3, 8, 9] and included facility size, level of care (Box 1), public/private affiliation (Box 1), number of available beds, number of outpatient visits, number of staff (by category), reference year for cost data, and a breakdown of costs into various components including medicines, salaries, laboratory and soforth; as well as the estimated split in costs between outpatient care and inpatient care, if available (see Additional file 1: Annex S2 for a complete list of variables).

Data was collected with a specific intention to assess capacity utilization as an explanatory variable [3]. Higher capacity utilization should result in lower predicted unit cost, as fixed costs are spread across a greater number of outputs. For inpatient care we collected data on the percentage of beds occupied, while for outpatient care, we collected data on total number of patient visits and total number of staff, in order to calculate the number of visits per provider per day as a capacity measure.

Respondents’ files were screened for data quality and consistency with the requested data format. Unit costs were extracted from the data files. Quality control mechanisms included recalculation of the service delivery inpatient and outpatient cost in accordance with the research protocol and instructions sent to data providers. Data cleaning comprised consistency checks and when needed was followed by discussions with data suppliers to ensure that the data submitted corresponded to standard definitions and requested specification. Some of the missing variables were directly derived, when possible, from other variables from the same observation point (e.g. occupancy rate calculated from number of beds and number of bed-days).

Data was gathered for a total of 30 countries. Sample size ranged from 9 to 6725 (median 36) observations for health centres per country-specific dataset, and from 1 to 4938 observations for hospitals (median 42, see Additional file 1: Annex S1 for details). One observation represents one facility in a given year.

Data collection and cleaning resulted in a dataset that was significantly (six times) larger compared to the previous WHO-CHOICE facility cost database. The majority of new cost data referred to year 2007. Costs collected in local currency units were converted to 2007 International dollars by means of Gross Domestic Product (GDP) deflators [14] and purchasing-power-parity exchange rates.

Econometric analysis

STATA software was used for data analysis [16]. Comparison with data previously collected (i.e. for the first round analysis) revealed that the new data had very different characteristics. Some of these differences were due to different variables having been collected, as a result of further development of data collection methods and instruments. Moreover, there was strong evidence of statistical heterogeneity in those variables that could be directly compared. As a result the old and the new datasets were not pooled, and regression analysis was performed using only the new dataset. Findings from the original literature review were also excluded unless authors had responded to the invitation to make available their datasets for inclusion in the analysis.

We adopted an approach derived from the economic literature on ‘hybrid cost functions’ [17, 18].

A log-cost function faced by a health facility is assumed to depend on:

  • A log-additive vector of input prices, and

  • An unknown function of:

    • A set of output indicators, and

    • A set of variables indicating the facility type.

Drawing upon previous work, various logarithmic models were tried and tested [3, 8, 9]. Variables were chosen based on the following criteria:

  • The variable is a known determinant of unit cost,

  • Measurement data for the variable are readily available,

  • The variable performs well in regression models.

Previous experience suggested that some variables are more capable of influencing the outcome of the analysis than others [3, 8, 9]. For instance, the level of the health facility (i.e. primary, secondary or tertiary) is a main determinant of cost. Other variables, such as the proportion of emergency admissions, did not prove to be as important. Experimentation with different variables showed that a restricted list was preferable.

Unfortunately several country datasets had key variables missing, in particular regarding the breakdown of costs into components (e.g. salaries, drugs, lab tests and other costs), which affected the variables that could be used for the regressions. We explored methods for imputing data, but were unable to find a model specification with imputed values that performed sensibly in either the regressions or with respect to simple descriptive and summary statistics. Observations with missing essential data were therefore dropped. All 30 data sets were included but observations used in the final analysis were reduced from a total of 19,008—3407 (outpatient) and 9028 (inpatient)—see Additional file 1: Annex S1.

Model specification: inpatient care

The relationship between the outpatient/inpatient unit cost and explanatory variables was explored using multiple regression analysis—Ordinary Least Squares (OLS). The dependent variables and the continuous explanatory variables were transformed into natural logarithms. This has the advantage of coefficients being readily interpreted as elasticities. Country dummies were included in the models to address the impact of large data sets from Brazil and Colombia.

For the inpatient unit cost model the dependent variable is one bed-day. The functional specification may be written as:

$$\ln IUC_{i} = a_{0} + a_{i} \cdot\mathop \sum \limits_{i = 1}^{n} \ln X_{i } + e_{i} , \quad i = 1 \ldots n$$

where ln IUCi is the natural log (ln) of cost per inpatient day in 2007 US$ in the ith facility; α0 and α1…n are the estimated parameters; the Xi are the explanatory variables transformed into natural logarithms for continuous variables; and e represents the error term.

Table 1 lists the explanatory variables included in the final regression.

Table 1 Descriptive statistics of sample used for final inpatient care estimates (N = 3407)

Model specification: outpatient care

For the outpatient unit cost model the dependent variable is one outpatient visit. The best performing approach for outpatient visits examined a pooled sample of health centers and hospitals.

The algebraic form of the outpatient unit cost model is:

$$\ln OUC_{i} = a_{0} + a_{i} \cdot\mathop \sum \limits_{i = 1}^{n} \ln X_{i } + e_{i} , \quad i = 1 \ldots n$$

where ln OUCi is the natural log (ln) of cost per outpatient visit in 2007 I$ in the ith facility; a0 and a1 are the estimated parameters; the Xi are the explanatory variables; and e represents the error term.

Table 2 lists the explanatory variables included in the final outpatient care regression. This includes an additional dummy specifically for Brazilian level 3 facilities, given that data from Brazil constituted a significant share of the sample for this level of hospitals.

Table 2 Descriptive statistics of sample used for final outpatient care estimates (N = 9028)


The tests used for judging model validity and the goodness of fit included the Breusch–Pagan/Cook–Weisberg test for heteroskedasticity [19], Ramsey’s regression specification-error test for omitted variables, the tolerance test and its reciprocal variance inflation factor, plots of the residuals versus the fitted values, plots of the residuals versus the independent variables, plots of the predicted values versus the continuous independent variables, estimates of adjusted R-squared, the Akaike information criterion, the Bayesian information criterion, and F-statistics of the regression model.

Robust estimation methods were used (i.e. the Stata command “robust”), in order to control for the effect on the estimate of standard errors caused by ‘clustering’ (i.e. the inclusion of multiple observations per country).

Predicted values and uncertainty analysis

WHO-CHOICE draws upon the prediction models described above to derive cost estimates for all member states. For the prediction of unit costs for inpatient days and outpatient visits, we use country-specific values where possible (i.e. GDP, country-specific dummy variables). Other explanatory variables, particularly those related to capacity utilization (e.g. occupancy rate, average length of stay), rely on representative ‘average values’ which can be set to normative values, sample medians, or to other values as appropriate. These ‘average values’ have desirable properties that may not always be possessed by the sample observations for that country [3]. Lacking appropriate norms, we applied the 80th percentile (p80) values from the sample of facilities used for the final regression models (Table 3). This is consistent with previous rounds of WHO-CHOICE unit cost estimates for which a 80% capacity level has been assumed.

Table 3 Values of variables used for prediction of the unit cost

As a first step, regression results (coefficients and variance–covariance matrix) were used to predict country-specific unit costs within a 95% uncertainty interval (UI). For the generic set of WHO-CHOICE estimates we assume public provision at all facility levels (however the model can also be set to predict private provider costs). The initial analysis based variables (GDP I$ per capita) on year 2007, since the majority of data in our dataset (92%) referred to this year. We have since applied the same model in order to derive estimates for 2010, which is a year consistent with the Global Burden of Disease Study (GBD) 2010 data [20], and thus useful for cost-effectiveness analysis drawing upon the GBD 2010 estimates. Explanatory variables were based on data for 2010 (i.e. GDP per capita in 2010 I$) and other values described in Table 3.

Next, we undertook adjustments to ensure that drug costs were not included. Data providers had reported challenges relating to apportioning drug costs between inpatient and outpatient care. This was handled by applying a proportional adjustment ratio, derived from previous work on inpatient care [8], to the reported cost estimates, thus adjusting these downwards in order to capture only the general, standardized components of the facility visit and bed-day, as described above. We set drug dummy variables to 0 and 1 respectively in order to estimate the average estimated contribution of drug costs across countries (47.5%) in previous work [8]. A similar approach can be adopted for separating out food costs from the reported inpatient costs (9.9%) in the first round analysis. Due to lack of data regarding adjustment ratios for outpatient care related drug costs, we applied the ratio derived from previous inpatient care models to outpatient visits as well. The adjustment ratios were uniformly applied to the regression results across all country estimates.

As described in previous analysis, [8] the re-transformation of predicted log unit costs (iuci) gives the median and not the mean of the distribution. One widely-used solution to this retransformation problem is Duan’s ‘smearing’ method [21]). The method is non-parametric, because it does not require that the regression error have any specific distribution. A smearing factor can be estimated following three steps: (i) Estimation of regression residuals, ri. (ii) Exponentiation of regression residuals to the power e, exp(ri). (iii) Averaging of the exponentiated residuals 1/n *∑exp(ri). The smearing factor is then multiplied by the re-transformed log unit costs (i.e. exp(ci)). This is the method applied in the Excel-based WHO CHOICE tools, which are available for countries to make their own predictions.

Unfortunately, Duan’s method requires that the distribution of the errors be homoscedastic. To test the sensitivity of our best estimates to this assumption and to construct 95% UIs, we employed a Bayesian approach. For each log prediction, we used the mean and standard error to draw 1000 random values, exponentiated each of these 1000 values and then extracted the mean, standard deviation and 2.5th and 97.5th centile values. For outpatient care, the two retransformation methods produced unit cost estimates that were equivalent on average. For inpatient care, the Bayesian approach produced unit cost estimates that were 4% higher on average. Given that the 95% UIs overlapped with estimates produced by Duan’s method, we present country-specific estimates obtained using the Bayesian approach.


Descriptive statistics

Tables 1 and 2 show the variable names, description, mean and standard error for the data sets on inpatient and outpatient care respectively.

Explanatory power

Tables 4 and 5 show the final regression models for inpatient and outpatient unit costs with 95% confidence intervals. All variables are significant, with most variables highly significant with p < 0.001. The inpatient cost model is slightly better performing with an adjusted R squared of 0.760 compared to 0.658 for the outpatient care model.

Table 4 Regression coefficients and 95% confidence interval: natural log of cost per inpatient bed day expressed in 2007 I $
Table 5 Regression coefficients and 95% confidence interval: natural log of cost per outpatient visit expressed in 2007 I $

Signs of the coefficients are consistent with expectations in both models, with more detailed interpretation below. In both models, GDP per capita is a highly significant proxy for price level but also for the level of technology.

Level and location of facilities

As expected, costs are higher in higher level, and urban, facilities. The level of facility is significant for both inpatient and outpatient care costs, with p < 0.001 for all levels. With regards to inpatient care, specialist and teaching hospitals (levels 4 and 5) have higher estimated unit cost than district hospitals (level 3). Urban/rural location is significant for outpatient care costs, with higher unit costs in urban settings.

Size of facilities

The size of facilities providing inpatient care is measured by the number of admissions (used instead of number of beds as the latter was highly collinear with the occupancy rate). This parameter has a very small, but still significant (p < 0.05) positive effect on inpatient costs. The small effect on cost can be said to result from mixed effects because, on the one hand, higher admissions could lead to lower overhead cost per patient and greater efficiency while, on the other hand, greater size could also indicate more specialist care with a larger proportion of complicated cases and thus a higher unit cost. The proper identification of the effect size would require more information of the capacity at which the facility is operating, and more detailed data on resource use at the level of the specialized units within each facility. For outpatient costs, the number of visits per year was used as an indicator of size, and here the results align with our expectations that larger facilities would have lower costs, likely due to operating with more efficient provision of care (significant at p < 0.05).


Compared to facilities run by not for profit private providers (i.e. missions or non-governmental organizations), unit costs are predicted to be lower for both outpatient and inpatient care in public facilities (p < 0.01 in both models). Both models also indicate higher costs in private for profit facilities (highly significant at p < 0.01 for inpatient care, and somewhat less significant at p < 0.05 for outpatient care). These results align with our expectations [22, 23].

Capacity utilization

The models confirm measures of capacity utilization as important explanatory variables of cost [3]. A higher bed occupancy rate should result in lower inpatient bed day cost, as fixed costs are spread across a greater number of outputs. This effect is confirmed by the inpatient model (p < 0.01). A longer average length of stay (ALOS) is significantly associated with lower inpatient bed day cost (p < 0.001), presumably because fixed costs are spread over a greater number of days. A greater number of visits per provider per day significantly reduces the outpatient cost (p < 0.001), more so than the number of visits per facility.

Country dummies

Country dummies were included only where their effect was significant, which is the case for both inpatient costs where Brazil observations constitute 58% of the entire sample, and outpatient costs where Brazil makes up as much as 62% of observations included in the final model. For outpatient care we also included a dummy variable for Colombia. All dummies for Brazil and Colombia are highly significant at p < 0.001.

Comparison with earlier models

The regression model for inpatient care differs from the earlier WHO-CHOICE prediction model in that variables for food and drug costs are not included [8]. On the other hand we included other variables that have been reported in hospital cost function estimation literature, such as the average length of stay [24, 25].

The model for outpatient care is significantly different from the previous WHO-CHOICE models, in that the new model differentiates between levels of care.


For internal validation purposes we produced scatter graphs of actual and predicted observations by facility type and examined these closely for every country. As described above, predicted values were based on the 80th percentile of variables used for prediction (Table 3).

Figure 1 plots the predicted values from the outpatient model against the unit cost data and the level of GDP per capita, for all level 1 facilities for the countries included in the sample. The line represents the predicted values of the cost per visit (in natural logs), estimated for a public facility using assumptions outlined in Table 3.

Fig. 1

Predicted values (regression lines) for outpatient service delivery costs in level 1 facilities, 2007 I$, plotted against the natural log of GDP per capita (X axis). (Y-axis shows the raw data for cost per visit in natural logs) N = 4750. Natural logarithm of outpatient unit costs expressed in 2007 I$. ARM Armenia, BFA Burkina Faso, BRA Brazil, COL Colombia, GEO Georgia, GHA Ghana, ECU Ecuador, IDN Indonesia, MDA Moldova, NGA Nigeria, MNG Mongolia, PAK Pakistan, PHL Phillippines, RWA Rwanda, SRL Sri Lanka, USA USA

Figure 2 similarly plots the predicted values for inpatient service delivery costs against the country supplied estimates and the level of GDP per capita, for level 3 facilities. The predicted costs reflect the regression analysis and as such represent an average relationship between cost determinants, whereas the actual country values demonstrate a significant spread due to facility-specific conditions.

Fig. 2

Predicted values (regression lines) for inpatient service delivery costs in level 3 facilities, 2007 I$, plotted against the natural log of GDP per capita (X axis). (Y-axis shows the raw data for cost per inpatient bed day in natural logs) N = 5037. Natural logarithm of inpatient unit costs expressed in 2007 I$. ARM Armenia, BEN Benin, BFA Burkina Faso, BRA Brazil, COL Colombia, CMR Cameroon, GEO Georgia, GHA Ghana, ECU Ecuador, IDN Indonesia, KGZ Kyrgyzstan, LBN Lebanon, MDA Moldova, MNG Mongolia, NGA Nigeria, NLD Netherlands, PAK Pakistan, PHL Philippines, RWA Rwanda, SRL Sri Lanka, SRB Serbia, THA Thailand, UGA Uganda, USA USA, ZMB Zambia

The figures confirm that the models have a reasonable fit with the data and illustrate the considerable variability in the observed unit costs within individual countries (each column of dots represents a country with a specific GDP per capita).

Moreover, we attempted to validate the predicted CHOICE cost estimates against cost data on TB and HIV-related health services (outpatient visits) which had been collected through country-specific research studies by WHO’s TB department and UNAIDS. While methodological differences between studies makes a direct comparison challenging, the comparison indicated that estimates were in the same range.

The models outlined here are used to derive predictions at country and WHO region level. Table 6 presents the predicted values for public hospitals in selected countries using Table 3 assumptions and 95% uncertainty intervals. The estimates are presented in 2010 I$, based on the 2010 GDP per capita in I$. Table 6 and Fig. 2 illustrate the comparatively lower estimates derived from the data samples from Brazil and the United States of America. Additional predictions are available from the WHO-CHOICE website:

Table 6 Predicted service delivery cost per bed-day (i) for selected countries (2010 I$)

We examined our data set for the reported shares of drug costs which were 10.6% for outpatient care (N = 5478 out of 9028) and 5.3% for inpatient care (2688 out of 3407). While these are likely underestimates, we ran a sensitivity analysis using these shares, with results reported in Table 7.

Table 7 Predicted service delivery cost per bed-day and outpatient visit for selected countries, using different ratios for drug cost adjustment


This paper describes the most recent effort by WHO to develop models to predict country-specific costs for outpatient visits and inpatient bed days. The database of country-specific cost estimates is a public good provided by WHO which serves a unique purpose at global, regional, and country level, allowing analysts to easily access data and apply these within a range of analytical settings—including economic evaluation, cost of illness studies, investment cases and resource needs appraisals.

The models presented in this paper were informed by data gathered through a thorough search for databases that reported costs at a range of facilities. Data imputation techniques were explored but not used, given that they reduced the performance of the models according to statistical and validation tests when compared to models not including imputed values. This probably indicates that data sets with missing values are less reliable than other data, which further suggests that a ‘missing at random’ assumption is inappropriate for this sample.

The variables included in the models have high statistical significance and the signs of the coefficients are consistent with results from previously published models. While there is a significant variation at country level (Figs. 1, 2), the model clarifies how such variation may be due to type/level of facility, facility size, ownership, and current capacity utilization. While a substantial portion of the observed variability can be explained by the specified determinants, some unexplained variability remained, possibly due to factors that we could not measure such as case mix (variation in diagnosis), quality of care, and incentive structures.

The majority of data points that the model draws upon refer to year 2007. While technology may have since evolved, the revised WHO CHOICE estimates presented here provide a valuable resource and more recent estimates compared to previous analysis. It is recommended that new rounds of data collection be conducted in the future to guide further model updates. In the interim however, the current model provides researchers and analysts with a set of comparative cost estimates not found elsewhere. Moreover, additional future work will be needed to improve cost estimates for outreach and community service delivery platforms which are important in particular for prevention activities.

Comparison with estimates derived from previous WHO-CHOICE prediction models

As expected, when compared with the previous round of WHO-CHOICE models, [3, 8, 9] the models presented here result in predicted costs that are higher in most cases, particularly for outpatient care. The background technical report provides more information on comparative statistics between the past and current set of models [26]. Higher costs from the current set of models are expected due to changes in technology as well as general price inflation.

The updated estimates are based on data from fewer countries than the previous WHO-CHOICE regression analysis (30 compared to 80). However, when considering the considerable variation in unit costs reported within countries the recommendation is that a sufficiently large number of facilities within a country is required to ensure representativeness [9]. Therefore, if one considers only datasets with 10 or more observations, the number of datasets in the two analyses are similar (30 in the new analysis compared to 33 in the first round) and can thus be considered similarly representative of between-country variation. With a higher average number of facilities per country, the new dataset is more representative of within-country variation. A significant limitation in the new round of analysis is however the lack of outpatient cost data from high-income countries. The effect of this feature of the data on the comparability of cost estimates across both hospitals and health centers is unknown.


Unit cost estimates are sensitive to the method used for cost allocation [27, 28]. Most data collectors reported using a bottom up approach (53% of sample, with 14% using a top-down approach and 33% not providing information, data not shown). The list of variables collected varied across settings, which is expected as costs would have been collected for different purposes. Nevertheless this posed challenges for our aggregate analysis. A particular challenge concerned overhead costs, where analysts included different components, which makes it likely that some respondents underestimated overhead costs. Another challenge was health worker salaries, where analysts reported encountering difficulties locating relevant cost data, particularly in public sector settings where managers have limited information on salaries of their fellow co-workers. A few respondents reported allocation between inpatient and outpatient care based on revenue generation rather than resource use. Estimates were pooled even when revenue generation was reported.

Data providers should be better equipped to extract commodity costs from their estimates. The use of a uniform ratio across countries to remove food- and/or drug related costs is a limitation, as the ratio of drugs relative to other costs would presumably vary across settings [29, 30]. Moreover, the average contribution of drug costs derived from previous work (47.5%) is higher than expected. The World Health Report 2010 reported that pharmaceuticals account for 20–30% of all global health spending [1]. Nevertheless, the 47.5% assumption is retained for this round of CHOICE estimates. Table 7 provides a sensitivity analysis highlighting the impact of this assumption.

Similarly, measures around performance of health facilities and overall capacity should be incorporated during cost data collection processes. The above challenges are a result of using secondary data sources but nevertheless point to lack of standardised data collection and reporting processes.

User-defined parameters

Our choice of admissions as the size measure for hospitals may be critiqued in that this reflects short-run output volume variations while hospital beds would better reflect installed capacity [31].

Moreover, in order to derive a standard set of WHO-CHOICE cost estimates (Table 4) we applied specific assumptions derived from the data sample, for facilities operating at the 80% percentile of a sample of similar such facilities in terms of capacity utilization and output. It is possible the 80% percentile values do not correspond to performance at 80% capacity, but unfortunately the data does not allow for verification of this. On the assumption that many facilities in low and middle income countries operate at low capacity, our prediction values may be too low, which would overestimate costs for a system operating at 80% capacity. A spreadsheet model is available through the WHO-CHOICE website for those who wish to adjust the parameters used and generate their own country specific estimates (, for example in relation to facilities that are private for profit, or facilities located in rural areas. Different assumptions for the predictor variables (e.g. average length of stay, number of patients seen per provider per day, etc.) may be used, although it is recommended that unit costs for economic evaluations should reflect technically and economically efficient service provision [32].


WHO-CHOICE health service delivery unit costs are unique and considered a standard data source for economic analysis, used by analysts all over the world. With the new models, unit cost estimates have been produced for all 14 WHO epidemiological sub-regions (and 21 regions used by the IHME for Global Burden of Disease Project) [33], using population-weighted GDP per capita. The models can predict country-specific unit costs at different capacity levels and in different settings. Country and region estimates are available through the WHO-CHOICE website, along with tools that allow users to adjust variables and make their own predictions [34].

With the SDGs there is a call to action to maximise the efficiency of data collection, and for informed and accountable decision making [35]. Resource allocation should be based on informed analysis with regards to costs and benefits. It is therefore critically important that national and local governments as well as the international community strengthen efforts to collect, analyse, and make use of information on health system resource use and efficiency. Efforts should be made to integrate cost information in general data collection efforts. For example, the World Bank’s Service Delivery Indicators program has been collecting facility-level data on performance and quality of service delivery indicators for a limited number of countries in Africa (Kenya, Senegal, Tanzania and Uganda) but to our knowledge has not focused on service costs [36]. Similarly with the District Health Information Software (DHIS2) software platform being used in a large number of countries, there is now ongoing work to assess how information on facility level resource use can be regularly captured and used to inform decision making. Countries should develop systematic data collection systems to store, transfer and produce robust and up to date strategic financial information for stakeholders at local, sub-national and national levels [2].

There is a need to strengthen capacity to understand and make use of data at country level, in particular in low-income countries where resources are limited and the use of economic and financial data for evaluating current system performance could lead to considerable efficiency gains [1]. The WHO-CHOICE project makes tools, models and datasets available for county users and has recently invested in the enhanced user-friendliness of the suite of cost-effectiveness tools through their incorporation into the Spectrum platform, and a built-in link to the OneHealth Tool—a joint UN resource projection tool used in over 40 countries [37].

While the WHO-CHOICE project provides robust defaults for countries to assess costs at five levels of service delivery, future modelling work should consider additional delivery modes such as community based delivery and outreach. More research may be needed regarding the assumptions used for continuous independent predictor variables in order to derive country predicted values, and guidance regarding setting norms for capacity utilization.


  1. 1.

    WHO. World health report 2010; 2010.

  2. 2.

    Beck EJ, et al. Counting the cost of not costing hiv health facilities accurately. pay now, or pay more later. Pharmacoeconomics. 2012;30(10):887–902.

    Article  PubMed  Google Scholar 

  3. 3.

    Adam, et al. Capacity utilization and the cost of primary care visits: implications for the costs of scaling up health interventions. Cost Effectiveness Resour Alloc. 2008;6(22):2008.

    Google Scholar 

  4. 4.

    Institute for Health Metrics and Evaluation (IHME). Health service provision in Ghana: assessing facility capacity and costs of care. Seattle: IHME; 2015.

    Google Scholar 

  5. 5.

    Institute for Health Metrics and Evaluation (IHME). Health service provision in Kenya: assessing facility capacity, costs of care, and patient perspectives. Seattle: IHME; 2014.

    Google Scholar 

  6. 6.

    Institute for Health Metrics and Evaluation (IHME). Health service provision in Zambia: assessing facility capacity, costs of care, and patient perspectives. Seattle: IHME; 2014.

    Google Scholar 

  7. 7.

    Institute for Health Metrics and Evaluation (IHME). Health service provision in Uganda: assessing facility capacity, costs of care, and patient perspectives. Seattle: IHME; 2014.

    Google Scholar 

  8. 8.

    Adam, et al. Econometric estimation of country-specific hospital costs. Cost Effectiveness Resour Alloc. 2003;1:3.

    Article  Google Scholar 

  9. 9.

    Adam T, Evans DB. Determinants of variation in the cost of inpatient stays versus outpatient visits in hospitals: a multi-country analysis. Soc Sci Med. 2006;63(7):1700–10.

    Article  PubMed  Google Scholar 

  10. 10.

    Disease Control Priorities in Developing Countries (DCPP). Accessed 10 Feb 2017.

  11. 11.

    Ha DA, Chisholm D. Cost-effectiveness analysis of interventions to prevent cardiovascular disease in Vietnam. Health Policy Plan. 2011;26(3):210–22.

    Article  PubMed  Google Scholar 

  12. 12.

    Stanciole AE, Tan-Torres Edejer T, Gkountouras G. Unit costs of health care services of general utilization: a review of the international literature and current estimates. Geneva: WHO; 2009.

    Google Scholar 

  13. 13.

    Centres for Medicare & Medicaid services. Accessed 26 Apr 2011.

  14. 14.

    International Monetary Fund.

  15. 15.

    Barnum H, Kutzin J. Public hospitals in developing countries. Baltimore: Hopkins; 1993.

    Google Scholar 

  16. 16.

    StataCorp. Stata statistical software: release 11. College Station: StataCorp LP; 2009.

    Google Scholar 

  17. 17.

    Pauly MV. Estimating hospital costs. J Health Econ. 1986;5:107–27.

    Article  PubMed  Google Scholar 

  18. 18.

    Iversen T. A theory of hospital waiting lists. J Health Econ. 1993;12:55–71.

    CAS  Article  PubMed  Google Scholar 

  19. 19.

    Breusch TS, Pagan AR. A simple test for heteroscedasticity and random coefficient variation. Econometrica. 1979;47(5):1287–94.

    Article  Google Scholar 

  20. 20.

    Lozano R, Naghavi M, Foreman K, et al. Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet. 2012;380:2095–128.

    Article  PubMed  Google Scholar 

  21. 21.

    Duan N. Smearing estimate: a nonparametric retransformation method. J Am Stat Assoc. 1983;78(383):605–10.

    Article  Google Scholar 

  22. 22.

    Omar AO, Komakech W, Hassan AH, Singh CH, Imoko J. Costs, resource utilisation and financing of public and private hospitals in Uganda. East Afr Med J. 1995;72:591–8.

    CAS  PubMed  Google Scholar 

  23. 23.

    Basu S, Andrews J, Kishore S, et al. Comparative performance of private and public healthcare systems in low- and middle-income countries: a systematic review. Plos Med. 2012;9:e1001244.

    Article  PubMed  PubMed Central  Google Scholar 

  24. 24.

    Wagstaff A. Econometric studies in health economics: a survey of the British literature. J Health Econ. 1989;8:1–51.

    CAS  Article  PubMed  Google Scholar 

  25. 25.

    Rego G, Costa JNR. The challenge of corporatisation: the experience of Portuguese public hospitals. Eur J Health Econ. 2010;11:367–81.

    Article  PubMed  Google Scholar 

  26. 26.

    WHO 2011, Estimation of unit costs for general health services: updated WHO-CHOICE estimates technical background report. Final version July March 2011.

  27. 27.

    Riewpaiboon A, et al. Effect of costing methods on unit cost of hospital medical services. Trop Med Int Health. 2007;12(4):554–63.

    Article  PubMed  Google Scholar 

  28. 28.

    Wordsworth S, Ludbrook A, Caskey F, Macleod A. Collecting unit cost data in multicentre studies: creating comparable methods. Eur J Health Econ. 2005;6(1):38–44.

    Article  PubMed  Google Scholar 

  29. 29.

    Vander Plaetse B, Hlatiwayo G, Van Eygen L, Meessen B, Criel B. Costs and revenue of health care in a rural Zimbabwean district. Health Policy Plan. 2005;20(4):243–51.

    CAS  Article  PubMed  Google Scholar 

  30. 30.

    Flessa S, Dung NT. Costing of services of Vietnamese hospitals: identifying costs in one central, two provincial and two district hospitals using a standard methodology. Int J Health Plan Manage. 2004;19(1):63–77.

    Article  Google Scholar 

  31. 31.

    Vitaliano DF. On the estimation of hospital cost functions. J Health Econ. 1987;6(4):305–18.

    CAS  Article  PubMed  Google Scholar 

  32. 32.

    Grieve R, Cairns J, Thompson SG. Improving costing methods in multicentre economic evaluation: the use of multiple Imputation for unit costs. Health Econ. 2010;19:939–54.

    Article  PubMed  Google Scholar 

  33. 33. Accessed 10 Jan 2018.

  34. 34. Accessed 10 Jan 2018.

  35. 35.

    Handley K, Boerma T, Victora C, Evans TG. An inflection point for country health data. Lancet Global Health. 3(8): e437–8.

  36. 36. Accessed 10 Jan 2018.

  37. 37. Accessed 10 Jan 2018.

Download references

Authors’ contributions

AS initiated the data collection process, which was then managed by KS. GG and KS undertook data cleaning, and developed the model jointly with JL. GG performed the regression analysis with JL supervision and KS inputs. CF undertook uncertainty analysis and derived estimates for 2010 I$. KS drafted the first version of the manuscript. All authors contributed to the writing. All authors read and approved the final manuscript.


The authors express their gratitude to Tessa-Tan Torres Edejer, David Evans, Eline Korenromp and Kirsi Viisainen for their input in the development of methods used and comments in relation to presentation and interpretation of results.

We are grateful to the following for assistance with collection of country level data: (in alphabetical order) Baktygul Akkazieva, Fanny Esi Atta-Peters, Osmat Azzam, Aaron Beaston-Blaakman, Carmelita Canila, Mihail Ciocanu, Charles Ezenduka, Steffen Flessa, Milena Gajic-Stevanovic, Ursula Giedion, Ketevan Goginashvili, Marco Aurelio Guerrero Figueroa, Samuel Kharazyan, Miguel Linares, Mshilla Maghanga, Felix Masiye, Juan Diego Misas, Alejandro Moline, Mardiati Nadjib, Eduardo Jose Navas Coutinho, Zakariaou Njoumemi, Walaiporn Patcharanarumol, Ravindra P Rannan-Eliya, Mohamed Samai, Mariyam Sarfraz, Bill Stomfay, Siok Swan Tan, Tsolmongerel Tsilaajav and Kirsi Vitikainen.

Funding for the project was provided by the Global Fund to Fight AIDS, Tuberculosis and Malaria.

Qin Wen provided support to data cleaning and data analysis.

The work represents the views of the authors and not necessarily those of the organization they represent.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

Country and region-specific estimated costs are available through the WHO-CHOICE website (

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information



Corresponding author

Correspondence to Karin Stenberg.

Additional file

Additional file 1: Annex S1.

Countries and number of unit cost observations included in the inpatient and outpatient cost prediction models. Annex S2. Data characteristics collected through standard template.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Stenberg, K., Lauer, J.A., Gkountouras, G. et al. Econometric estimation of WHO-CHOICE country-specific costs for inpatient and outpatient health service delivery. Cost Eff Resour Alloc 16, 11 (2018).

Download citation


  • Cost
  • Regression analysis
  • Estimates
  • Inpatient
  • Outpatient