The evidence gap on gendered impacts of performance-based financing among family physicians for chronic disease care: a systematic review reanalysis in contexts of single-payer universal coverage
Human Resources for Health volume 18, Article number: 69 (2020)
Although pay-for-performance (P4P) among primary care physicians for enhanced chronic disease management is increasingly common, the evidence base is fragmented in terms of socially equitable impacts in achieving the quadruple aim for healthcare improvement: better population health, reduced healthcare costs, and enhanced patient and provider experiences. This study aimed to assess the literature from a systematic review on how P4P for diabetes services impacts on gender equity in patient outcomes and the physician workforce.
A gender-based analysis was performed of studies retrieved through a systematic search of 10 abstract and citation databases plus grey literature sources for P4P impact assessments in multiple languages over the period January 2000 to April 2018, following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. The study was restricted to single-payer national health systems to minimize the risk of physicians sorting out of health organizations with a strong performance pay component. Two reviewers scored and synthesized the integration of sex and gender in assessing patient- and provider-oriented outcomes as well as the quality of the evidence.
Of the 2218 identified records, 39 studies covering eight P4P interventions in seven countries were included for analysis. Most (79%) of the studies reported having considered sex/gender in the design, but only 28% presented sex-disaggregated patient data in the results of the P4P assessment models, and none (0%) assessed the interaction of patients’ sex with the policy intervention. Few (15%) of the studies controlled for the provider’s sex, and none (0%) discussed impacts of P4P on the work life of providers from a gender perspective (e.g., pay equity).
There is a dearth of evidence on gender-based outcomes of publicly funded incentivizing physician payment schemes for chronic disease care. As the popularity of P4P to achieve health system goals continues to grow, so does the risk of unintended consequences. There is a critical need for research integrating gender concerns to help inform performance-based health workforce financing policy options in the era of the Sustainable Development Goals.
Governments and healthcare service organizations around the world have increasingly adopted financial incentives to stimulate guideline-based practice for the prevention, diagnostics, and treatment of prevalent diseases. Such incentives, also known as pay-for-performance (P4P), may be offered as added rewards to healthcare practitioners for changes in clinical behaviors in terms of time, services delivered, patients reached, quantity or quality of care, continuity of care, or other established targets to achieve health system goals [1, 2]. The World Health Organization advocates that health system efficiencies could be achieved in countries at all levels of economic development through better incentives for primary care providers and other means of focused financing . Performance incentive schemes have been implemented in several high-income countries and introduced in many low- and middle-income countries, the latter often as donor-supported pilot projects [3, 4]. However, it remains unclear to what extent, if at all, financial incentives positively influence the delivery of care in terms of equitable outcomes by gender or other personal characteristics of either patients or providers [1, 5]. The risk of potential unintended consequences of P4P schemes has tended to be overlooked in the available literature .
Enhancing the efficiency and effectiveness of healthcare investments is important in many countries; incentivizing physician payments to improve chronic disease management—versus relying on traditional fee-for-service, capitation, or bundled payments—is an area of increasing attention [2, 7]. Several systematic reviews have examined the impacts of P4P among medical practitioners on different indicators of healthcare processes, costs, and patient outcomes across different contexts and different systems of healthcare financing [1, 2, 4, 5, 7,8,9,10,11,12,13]. However, heterogeneity of incentive schemes and evaluation methods has meant there are fragmentation and general deficiency in the evidence base to support the use (or non-use) of incentive reimbursements among physicians to improve primary care for diabetes and other chronic non-communicable diseases (NCDs). Some research has found that physicians may react to incentives differently depending on whether they were for acute or chronic illness . Investing in better management to lessen the impact of chronic NCDs is critical, given that these diseases account for 71% of the total mortality burden worldwide . Moreover, much of the evidence on the impacts of P4P for NCDs pertains to diabetes . Reducing the number of diabetes-related premature deaths is one of the key targets of the international Sustainable Development Goals (SDG) agenda (target 3.4.1). Diabetes and its complications place a substantial long-term burden on health budgets . The greater susceptibility of patients with pre-existing diabetes to COVID-19 has further highlighted the cruciality of addressing diabetes management in health emergencies .
While the number of P4P policies continues to increase, along with the number of studies on P4P effects, it is uncertain whether and how P4P is related to better equity in patient outcomes. Some limited research has suggested that certain patient groups, notably older patients and those with multiple chronic NCDs, may benefit less from incentivized care compared to their younger and healthier counterparts . At the same time, rising global prevalence of NCDs and other health challenges run the risk of fueling gender-related health inequalities . Despite the evidence of biological and psychosocial differences between female and male patients in the progression of diabetes and related complications, clinical care guidelines tend not to differentiate by sex or consider gender-sensitive approaches to improve adherence to therapy .
Specifically, we are unaware of any reviews evaluating P4P schemes that consider a measure of better gender equity in patient outcomes. Achieving gender equality through strengthened policies and public allocations is another key SDG indicator (target 5.c). Health systems can make important contributions to this SDG by tracking gender inequalities and addressing underlying structural issues, including gender-based assessments of approaches to budgeting . While it is increasingly acknowledged that monitoring sex-specific impacts of health interventions is a critical starting point, sex and gender reporting remains inadequate in health research [19, 20]. Petkovic et al.’s study of recent systematic reviews documented that less than 30% of reviews reported on sex or gender in the results . There is growing recognition that, unless explicit attention is paid in health financing to gender, movement towards meeting population needs can fail to achieve gender balance or improve equity and may even exacerbate gender inequity . This knowledge gap led us to our first research question: Do incentive reimbursements for primary care physicians reflect or even exacerbate gender inequalities in patient-oriented diabetes outcomes, compared to the absence of incentivizing remuneration?
We are further unaware of any P4P schemes adjusted for physicians’ gender or other individual characteristics (aside from practice location), or reviews that consider performance pay in regard to gender wage gaps or other workforce equity measures . Males, including those in medical and other high-paying occupations, have long earned more than their female counterparts. The gender-related pay gaps have not been readily explained by objective labor market characteristics, including educational attainments . Studies from different countries have indicated that female physicians continue to earn on average 13% less than male physicians, after controlling for factors such as specialty and working hours [23, 24]. While health systems are often considered insufficiently responsive to women’s specific health needs, they are also highly dependent on women as providers of care . Women are increasingly predominant in the physician workforce, and specifically the primary care physician workforce, in many countries [24, 26]. As healthcare organizations strive to enhance patient experiences, improve population health, and reduce per capita costs of care, there is also growing recognition that achieving the ultimate goal of a high-performing health system requires improving the work life of service providers—collectively known as the Quadruple Aim for healthcare improvement [27, 28]. The World Health Organization acknowledges that health workforce gender imbalances, including wage differences, are a major challenge for health policymakers to enhance system efficiencies . For one, Hedden et al.’s systematic review presented evidence that female primary care physicians present different clinical practice patterns compared to their male counterparts, including spending more time with each patient and dealing with multiple health issues during a given visit . How differences in physician remuneration mechanisms and financing policies across jurisdictions over time may influence the differences between male and female physicians in observed practice patterns is an important area for a new investigation. This gap incited us to raise our second research question: Do incentive reimbursements reflect or even exacerbate gender inequalities in physician remuneration?
To address these questions, we conducted a reanalysis of a systematic review of the literature on impacts of P4P among primary care physicians for diabetes management and analyzed the evidence concentrating on the extent to which patients’ and/or physicians’ sex/gender is considered or influential in the results to achieve the Quadruple Aim for healthcare improvement. The aim was to enhance the understanding as to whether increasing numbers of women in medicine may drive change in clinical practice patterns without P4P, whether “gender-blind” P4P schemes have a different impact on male versus female patients, and whether such schemes are contributing to gender inequities in professional earnings among providers.
A reanalysis was conducted using a gender-based analysis approach of the authors’ previously published and unpublished data from a systematic review of P4P evaluation studies for the management of diabetes and other NCDs in publicly funded national health systems . The scope of the review focuses on the contexts of single-payer universal health coverage, thus minimizing the risk of unintended consequences of P4P from physicians gaming the payment system, that is, from physicians potentially moving between health organizations within a jurisdiction to benefit from an incentive, or avoiding high-risk patients altogether to not upset clinical performance metrics . This approach also discounts the specific effects of female medical practitioners potentially sorting out of health organizations with a strong performance pay component or having other characteristics that may be less attractive to women . Substantively, government-funded health systems further have the responsibility in the SDG era to ensure gender-responsive human resources for health (HRH) budgeting, as an important measure to realizing their international commitments to achieving gender equality.
In accordance with other systematic review reanalyses and subanalyses, this study was designed to reconsider a previously published systematic review from a distinct implementation and reporting perspective, thereby allowing for new research questions to be examined in detail while avoiding unwarranted research duplication. The protocol for the present study was published in the PROSPERO prospective register of systematic reviews (registration number CRD42018090021) . Whereas the authors’ original review focused on patient-oriented outcomes before and after the introduction of P4P (e.g., patient morbidity, avoidable hospitalization, premature death) , for this study, the primary outcomes of interest are gender equity in P4P effects from the patient and also provider perspectives. The review aligns with the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines .
Studies were eligible to be included in the systematic review if they addressed the question of whether the introduction of physician practice incentives for diabetes management in primary and community care led to improved population health and health system outcomes through some sort of evaluative component. This could include incentives for diabetes-specific care or management of multiple morbidities, from all countries with single-payer health insurance systems.
Ten abstract and citation databases were searched: ABI Inform, Business Source Premier, Canadian Business and Current Affairs, Cochrane Library, EconLit, PAIS, PubMed, Scopus, SocIndex, and Sociological Abstracts. Free text and formal search terms and filters were translated to respect database-specific requirements, with the advice and assistance of library professionals. Several Medical Subject Headings (MeSH) terms and combinations were used to identify the intervention [including “pay#for#performance,” “incentive reimbursement*,” “value#based purchasing,” “performance pay*,” “merit pay*.” and related nomenclature] and the health condition of interest [“diabetes mellitus,” “diabetes,” “hyperglyc*,” “prediabetes,” “dysglyc*,” and related nomenclature]. Reference lists of systematic reviews on the topic that were found during the database searches [1, 2, 4, 7,8,9,10,11,12,13] as well as of selected global health literature sources were further hand-searched .
Eligible studies included those published in English, French, Portuguese, or Spanish between 1 January 2000 and 30 April 2018. Two reviewers independently screened a sample of eligible abstracts and in turn full-text articles, to identify and secure consensus on studies for review inclusion. The country and its health financing arrangement, characteristics of the incentive scheme, study objective, provider and patient populations, data gathering techniques, comparison groups, and outcomes measured were recorded. The full eligibility criteria and search strategy, which were guided by a Population, Intervention, Comparison, Outcomes, and Study (PICOS) design framework, are available elsewhere in the original review and related protocol [5, 34].
For this analysis, we developed gender-based analysis grading criteria for the retrieved records. Each study’s contents were vetted distinguishing between “sex” (a biological/physiological characteristic distinguishing males from females) and “gender” (the roles, behaviors, activities, and attributes that a given society may construct or consider appropriate for men and women) . Studies were categorized by five items based on the level of inclusion and reporting of sex and gender data and analysis, pertaining to both patients and providers (Table 1). Simple mentions of the terms sex or gender as statistical control variables were assigned lower scores, while discussions of gender perspectives in the narrative of the results were given higher scores.
Two reviewers independently extracted and graded sex and gender reporting information from a sample of eligible full-text articles, with any disagreements resolved by consensus. Articles that received a non-zero score in terms of analyzing P4P from a gender perspective (items 3 and 5) were included in the narrative synthesis of the results.
Building on the authors’ previous work, the quality of the evidence reported in the studies was evaluated following the Grading of Recommendations, Assessment, Development and Evaluations (GRADE) approach for complex social interventions , with a letter grade assigned to each study based on two predetermined criteria. The evidence was narratively synthesized in terms of the following:
Outcome relevance: the study measured different dimensions for achieving the Quadruple Aim, notably in terms of improvement of outcomes in relation to patient-oriented care (e.g., fewer complications of chronic disease and other measures that matter to patients), population health (e.g., lower rate of onset of major chronic diseases, fewer premature deaths), healthcare costs (e.g., fewer hospital bed days), and/or work life of providers (e.g., pay equity, fewer burnouts, fewer early retirements) [27, 35].
Methodological rigor: the study utilized population-generalizable data and assessment techniques accounting for potential selection bias and unobservables (e.g., models for analyzing endogenous treatment effects of guideline-based diabetes care) .
Because of the heterogeneity of the outcomes and analytical approaches under review, performing a meta-analysis was not possible .
Article retrieval and inclusion
A total of 2218 records were initially retrieved: 2155 records from the ten electronic databases plus 63 records from hand searches. In the first step, 2128 duplicates and other records were removed based on the title and abstract screening. Following this screening, 90 articles were retained for full-text review, of which 51 were eventually screened from further consideration. This process left for analysis 39 articles evaluative of introducing P4P among physicians for diabetes and NCD management in primary and community care . A PRISMA depiction of the flow of information is found in Fig. 1.
The studies covered eight unique P4P interventions in seven countries with single-payer health insurance: Australia, Canada (two provincial-level schemes), Denmark, Italy, Sweden, Taiwan, and the United Kingdom . The characteristics of the eight schemes are described in Table 2. Many of the studies used administrative health data sources for the evaluation analyses, typically considered complete and population-representative given the focus on single-payer systems. The full references of the 39 articles reviewed are listed in the Appendix.
Reporting of sex/gender in P4P assessments
Of the 39 studies retained for narrative analysis, 31 (79%) reported that the study considered sex/gender of the patient and/or provider (Fig. 2). Only one substantively detailed that the results would be disaggregated by sex/gender as an integral component of the design. Among the 31 studies indicating any consideration of sex/gender of the patient, two thirds (20 studies or 65%) included only sex-disaggregated descriptives of the patient population among other general demographic characteristics, with the other one third (11 studies or 35%) further reporting sex-disaggregated data in the results of the statistical model assessing the impacts of P4P on patient outcomes. Twelve studies narratively described the sex-disaggregated results, of which nine limited the discussion to the descriptives and three substantively discussed the results in terms of gender-based patient outcomes from an equity perspective.
While most of the studies controlled statistically for the patient’s sex as a demographic variable, few (15%) controlled for the provider’s sex. Of the six studies that did, four presented sex-disaggregated results of the P4P evaluation model. Two discussed the data in terms of sex-specific patterns of provider behaviors. None substantively discussed gender equity from the provider perspective as part of the P4P assessment.
In terms of being able to address our first research question on P4P and gender equity in patient-oriented diabetes outcomes, the 12 studies that narratively discussed sex-disaggregated patient data covered four different P4P schemes: Canada (province of New Brunswick), Italy (Emilia-Romagna region), Taiwan, and the United Kingdom. Eight (67%) of these studies were from Taiwan. For Taiwan, to reduce the risk of bias from multiple reporting of effects of the same intervention, we retained for reporting only the one study classifying gender differences as an integral component of the design as well as the two most recent publications. Table 3 presents the characteristics of the eight studies retained for further analysis following the PICOS framework [6, 36,37,38,39,40,41,42].
In terms of addressing our second research question on P4P and gender equity in physician remuneration, the two studies that substantively discussed sex-disaggregated provider data in relation to the P4P assessment results were both from Italy. The characteristics of both studies are found in Table 3 [6, 37].
Among the eight studies narratively discussing sex/gender results among patients and/or providers, the number of records on diabetes patients totaled more than 800 000 (ranging from a survey sample of 1173 to a whole-population assessment of 396 838) (Table 4). Most (63%) of the studies did not report the number of providers captured in the data.
Impacts of P4P on gender equity in patient outcomes
Based on the quality assessment grid, three of the retained studies discussing sex-disaggregated patient data can be considered full evaluations yielding high-quality evidence on the impacts of P4P on health system outcomes (Table 4). Examples of the narratives describing sex/gender issues in these studies can be found in Table 5. Lippi Bruni et al. reported that patients’ age, insulin dependence, and frequency of visits to diabetes outpatient clinics—but not sex—were the most important determinants of emergency hospitalizations, with the findings robust to different specifications of physician financial incentives in an Italian jurisdiction . In relation to Taiwan’s P4P scheme, Hsiesh et al. reported that all-cause and diabetes-related mortality were lower among patient participants compared to non-participants and that, in terms of confounding factors, female patients with diabetes tended to have a lower risk of cancer mortality than males . Pan et al. reported that patient participants had higher physician continuity than non-participants and that, based on the multiple regression analyses, female patients had significantly higher continuity of care and lower hazard of mortality than male patients . None of the studies discussed sex-specific differences in patient-oriented outcomes by physicians’ P4P uptake.
Among the results of the partial evaluations, Yuan et al. systemically disaggregated patient data by sex in their assessment of an outpatient diabetes quality improvement plan operating within Taiwan’s P4P scheme . The authors found that male patients in the plan tended to have better glycemic control, but that age and socioeconomics were more important drivers of reported patient outcomes. In a Canadian province, LeBlanc et al. indicated no sex-specific difference in the likelihood of patients receiving the guideline-based number of A1c tests between patients followed by physicians who claimed the P4P incentive for diabetes management compared to those followed by physicians who had never claimed the incentive over the period of observation . Reporting on the United Kingdom’s Quality and Outcomes Framework (QOF), Millett et al. indicated that female patients with diabetes were more likely to have multiple comorbid conditions and that diabetes patients with comorbid conditions seemingly benefited more from the introduction of P4P in terms of achievement of established targets for blood glucose and cholesterol than those without comorbidity . Crawley et al. did not report results by patients’ sex in their statistical analysis, which focused on the differences across social class groups but substantively discussed the increasing evidence of inequities in care by socioeconomic status and the limited number of studies using individual-level data in the United Kingdom that consider gender and other characteristics potentially related to persistent inequitable outcomes after P4P introduction .
Impacts of P4P on gender equity in provider outcomes
The two full evaluations that narratively discussed sex-disaggregated HRH data in the P4P assessments were both from Italy (Tables 4 and 5). Lippi Bruni et al. reported that higher shares of practitioners’ income received through P4P was associated with significantly reduced adverse outcomes among their patients, but only under schemes requiring adherence to clinical guidelines . The authors also reported that physicians’ sex, but not their age or postgraduate qualifications, was significantly associated with patients’ risk of emergency hospitalization and notably that patients of female physicians had a significantly lower risk. Iezzi et al. also reported a lower risk of potentially avoidable hospitalization for patients followed by practitioners receiving a higher share of their pay through P4P but that practitioners’ sex and other individual characteristics did not produce systematic effects contributing to the risk . Neither of the studies discussed the impacts of physicians’ P4P uptake on sex-specific differences in professional earnings or other work life indicators.
In their partial evaluation of a low-powered scheme, LeBlanc et al. described that female physicians were more likely than their male counterparts to order the guideline-informed number of A1c tests for their patients, independent of P4P participation . Greene noted that 66% of general practitioners included in the Australian study’s sample were male, similar to the national demographic for all GPs .
Pay-for-performance among primary care physicians is increasingly used to enhance guideline-based care practices for diabetes mellitus and other prevalent NCDs. As the number of P4P schemes continues to grow, the potential for unintended consequences may also rise , which may possibly include exacerbated gender inequalities in health. This review of P4P impact evaluations in single-payer national health insurance systems revealed that the analysis and reporting of sex and gender in P4P assessments remains inadequate. Of the 39 studies narratively reviewed, most (79%) indicated consideration of the sex/gender of the patient and/or provider in the study design, but only one split all the analyses by patients’ sex as an integral component. One quarter (11 or 28%) of the 39 studies reported sex-disaggregated data in the results of the statistical models assessing influences of P4P on patient outcomes, and three (8%) substantively discussed the results. None (0%) included an interaction term of patients’ sex with the P4P treatment variable, thereby precluding interpretation of gendered impacts of the intervention itself. The already limited discussions concentrated on the presence or absence of sex differentials in the patient-level clinical goals (e.g., glycemic control) rather than in the policy option under investigation.
Consideration of gendered outcomes in the physician workforce was even less extensive. Six (15%) of the 39 studies reported controlling statistically for the providers’ sex. None (0%) included an interaction term of physicians’ sex with the P4P treatment variable or considered an outcome relevant to gender equity in the work life of providers.
In other words, we were unable to answer our original research questions as to whether P4P contributes to gender equity in patient and provider outcomes due to a lack of comprehensive consideration of the issue in the available literature. This finding highlights a critical evidence gap to support physician workforce financing policy decisions that may lead to unintentionally aggravated pre-existing gender inequalities. Some limited research, for example, Boeckxstaens et al.’s review of the United Kingdom’s QOF , has suggested that male patients may have benefited more from P4P in terms of quality of care than female patients. A descriptive analysis of physician service billings data from a Canadian province indicated that female family physicians have been under-represented in performance-based payments compared to their male counterparts, potentially exacerbating gender pay gaps . The social, cultural, and psychological reasons why women may respond less to P4P remain largely unknown [31, 46, 47]. Overall, P4P impact assessments focusing on gender and other equity dimensions have been substantially less common compared to those investigating cost-effectiveness .
The results of this review were consistent with Petkovic et al.’s examination of systematic reviews extracted from the Campbell and Cochrane Libraries, which revealed inadequate reporting of sex and gender in health research and, specifically, a large gap between the mention of sex/gender in studies’ methods section (51–83%) versus reporting on sex/gender in the results section (less than 30%) . Similarly to Petkovic et al. , we did not assess whether the terms “sex” (biological) and “gender” (sociocultural) were used appropriately by the studies’ authors, given the challenge of evolving terminology that is often used interchangeably. In contrast, since we did not restrict any of our database searches using sex/gender search terms, our approach was less likely to have potentially missed instances of sex/gender reporting. It is possible, however, that some studies were missed altogether in our searches given the range in terminology for P4P .
The lack of acknowledgment of gender bias in scientific publishing could help explain the knowledge and evidence gaps on gendered impacts of performance-based HRH financing. Gender-blindness in health research and across the sciences is increasingly documented as potentially contributing to reinforce existing gender inequalities, related to a wide range of factors, including bias against research on gender bias [25, 48,49,50]. For instance, while social science research is often seen as central to enhance understanding of equity in health systems , a review of bibliometrics in the social sciences found that articles focusing on gender bias were more often published in journals with a lower impact factor than those considering other dimensions of social discrimination . Some peer-reviewed journals have taken a stance to promote research to help inform actions to address persistent gender inequalities and mitigate gender bias in publication processes [25, 52]; however, avoidance of the identification and reduction of bias remains a seemingly acceptable occurrence. Not all published studies included in this review used gender-inclusive language throughout (e.g., referring to physicians’ characteristics as “the GP himself” [Table 5]). Pervasive (unconscious) gender bias has been quantified in peer review and editorial decision-making outcomes, with men reportedly less likely than women to acknowledge the existence of such a bias [49, 53]. Gender imbalances have also been documented in processes of clinical and public health guidelines development, which may impact the attention given to sex- and gender-specific differences in assessing the value of the evidence .
Strengths and limitations of the study
This study presented a critical interpretation of previously reviewed research from the unique and prospectively planned perspective of gender-based analysis. With the growing number of systematic reviews being published every year, the approach contributed to the literature aiming to optimize the use of identified studies on a given issue where there remained considerable unexplained heterogeneity and unreported information to help support decision making (for example, [55, 56]). The study design was intended to shed light on whether publicly funded primary care physician financing policies for chronic disease care were aligned with international commitments for gender-responsive budgeting for gender equality. The dearth of high-quality evidence suggests that research mechanisms to assess government’s accountability in delivering on gender equality remain insufficient.
The present reanalysis, however, inherited some of the limitations of the original review. Most notably, it was restricted to single-payer national health systems, which meant that relatively few countries were included, none of which were low-income or middle-income countries . This design choice was intended to minimize the risk of measuring physicians’ ability to “game” the payment system rather than true performance; however, such concerns have also been raised in the United Kingdom, as regards P4P potentially reflecting distorted “embellishing” of patient diagnosis codes over the quality of care . Performance pay as a mechanism to improve quality of care first emerged in high-income countries, and much of the research on P4P still tends to be siloed by income setting . Given the proliferation of P4P schemes in low- and middle-income countries, coupled with weaker information systems and the more limited research on P4P effectiveness in many of these contexts [4, 59, 60], rigorous empirical assessments are needed of the relationships (if any) between the allocation of limited resources to performance-based payments and consequences for gender equity from countries at all levels of development.
This systematic review reanalysis through a sex and gender lens weighed the evidence on how publicly funded performance-based physician remuneration policies may be contributing, positively or negatively, to gender equity in health system outcomes—in this case, in the health outcomes among patients living with diabetes and/or in the work environments among physicians providing diabetes care. Performance-based HRH financing is typically conceptualized as a means to strengthen health systems; however, its implementation and evaluation inadequately consider equity issues . The issue of gender equity has been neglected altogether. Despite the growing recognition of the importance of integrating sex and gender in health research, its practice remains uneven . Gender blindness in health systems and health workforce benchmarking and evidence may miss significant opportunities for gender equity promotion . This review underscored that consideration of gendered impacts in either patient-oriented outcomes or work life of providers is largely overlooked in the P4P literature. Measuring and evaluating the inequitable distribution of power and resources by gender and other social strata, as prerequisites to addressing the problem, remain important on the international health agenda, even if national interests may have waned . Our analysis was consistent with the findings elsewhere revealing a paucity of gendered analyses of health financing arrangements . While it is acknowledged that P4P will exercise different impacts on quality and costs of care depending on the structure of the scheme , the evidence base on how such payment models may attenuate or exacerbate gender inequities remains surprisingly weak. Research is needed on HRH financing options to better understand how P4P and other physician payment models may have unintended consequences in terms of gender-specific patient and provider outcomes in the longer term.
Availability of data and materials
Not applicable. No datasets were generated or analyzed.
Continuity of Care Index
Human resources for health
Population, Intervention, Comparison, Outcomes, Study design
Preferred Reporting Items for Systematic Reviews and Meta-Analyses
Quality and Outcomes Framework
Sustainable Development Goals
Flodgren G, Eccles MP, Shepperd S, Scott A, Parmelli E, Beyer FR. An overview of reviews evaluating the effectiveness of financial incentives in changing healthcare professional behaviours and patient outcomes. Cochrane Database Syst Rev. 2011;7:CD009255.
de Bruin SR, Baan CA, Struijs JN. Pay-for-performance in disease management: a systematic review of the literature. BMC Health Serv Res. 2011;11:272.
World Health Organization. The World Health Report—health systems financing: the path to universal coverage. Geneva: World Health Organization; 2010.
Turcotte-Tremblay AM, Spagnolo J, De Allegri M, Riddle V. Does performance-based financing increase value for money in low- and middle-income countries? A systematic review. Heal Econ Rev. 2016;6(1):30.
Gupta N, Ayles HM. Effects of pay-for-performance for primary care physicians on diabetes outcomes in single-payer health systems: a systematic review. Eur J Health Econ. 2019;20:1303–15. https://doi.org/10.1007/s10198-019-01097-4.
Iezzi E, Lippi Bruni M, Ugolini C. The role of GP’s compensation schemes in diabetes care: evidence from panel data. J Health Econ. 2014;34:104–20.
Emmert M, Eijkenaar F, Kemter H, Esslinger AS, Schöffski O. Economic evaluation of pay-for-performance in health care: a systematic review. Eur J Health Econ. 2012;13:755–67.
Chaix-Couturier C, Durand-Zaleski I, Jolly D, Durieux P. Effects of financial incentives on medical practice: results from a systematic review of the literature and methodological issues. Int J Qual Health Care. 2000;12:133–42.
Scott A, Sivey P, Ait Ouakrim D, Willenberg L, Naccarella L, Furler J, Young D. The effect of financial incentives on the quality of health care provided by primary care physicians. Cochrane Database Syst Rev. 2011;9:CD008451.
Van Herck P, De Smedt D, Annemans L, Remmen R, Rosenthal MB, Sermeus W. Systematic review: effects, design choices, and context of pay-for-performance in health care. BMC Health Serv Res. 2010;10:247.
Tao W, Agerholm J, Burström B. The impact of reimbursement systems on equity in access and quality of primary care: a systematic literature review. BMC Health Serv Res. 2016;16:542.
Jia L, Yuan B, Meng Q, Scott A. Payment methods for ambulatory care health professionals. Cochrane Database Syst Rev. 2015;9:CD011865.
Forbes LJ, Marchand C, Doran T, Peckham S. The role of the Quality and Outcomes Framework in the care of long-term conditions: a systematic review. Br J Gen Pract. 2017;67(664):e775–84.
World Health Organization. World Health Statistics 2020: Monitoring Health for the Sustainable Development Goals (SDGs). Geneva: World Health Organization; 2020.
Chan M. Obesity and diabetes: the slow-motion disaster. Milbank Q. 2017;95(1):11–4. https://doi.org/10.1111/1468-0009.12238.
Kontopantelis E, Springate DA, Ashcroft DM, et al. Associations between exemption and survival outcomes in the UK’s primary care pay-for-performance programme: a retrospective cohort study. BMJ Qual Saf. 2016;25(9):657–70.
Kautzky-Willer A, Harreiter J. Sex and gender differences in therapy of type 2 diabetes. Diabetes Res Clin Pract. 2017;131:230–41. https://doi.org/10.1016/j.diabres.2017.07.012.
Payne S. How can gender equity be addressed through health systems? Health Systems and Policy Analysis Policy Brief No. 12. Copenhagen: World Health Organization and European Observatory on Health Systems and Policies; 2009.
Day S, Mason R, Lagosky S, Rochon PA. Integrating and evaluating sex and gender in health research. Health Res Policy Syst. 2016;14(1):75. https://doi.org/10.1186/s12961-016-0147-7.
Petkovic J, Trawin J, Dewidar O, Yoganathan M, Tugwell P, Welch V. Sex/gender reporting and analysis in Campbell and Cochrane systematic reviews: a cross-sectional methods study. Syst Rev. 2018;7(1):113. https://doi.org/10.1186/s13643-018-0778-6.
Witter S, Govender V, Ravindran TKS, Yates R. Minding the gaps: health financing, universal health coverage and gender. Health Policy Plan. 2017;32(suppl 5):v4–v12. https://doi.org/10.1093/heapol/czx063.
International Labour Organization. Global Wage Report 2018/19: What lies behind gender pay gaps. Geneva: International Labour Organization; 2018.
Esteves-Sorenson C, Snyder J. The gender earnings gap for physicians and its increase over time. Econ Lett. 2012;116(1):37–41.
Boniol M, McIsaac M, Xu L, et al. Gender equity in the health workforce: analysis of 104 countries. Geneva: World Health Organization; 2019.
Gupta N. Research to support evidence-informed decisions on optimizing gender equity in health workforce policy and planning. Hum Resour Health. 2019;17:46. https://doi.org/10.1186/s12960-019-0380-6.
Hedden L, Barer ML, Cardiff K, McGrail KM, Law MR, Bourgeault IL. The implications of the feminization of the primary care physician workforce on service supply: a systematic review. Hum Resour Health. 2014;12(32):1–11. https://doi.org/10.1186/1478-4491-12-32.
Whittington JW, Nolan K, Lewis N, Torres T. Pursuing the Triple Aim: the first 7 years. Milbank Q. 2015;93(2):263–300.
Rathert C, Williams ES, Linhart H. Evidence for the Quadruple Aim: a systematic review of the literature on physician burnout and patient outcomes. Med Care. 2018;56(12):976–84.
World Health Organization. Delivered by women, led by men: a gender and equity analysis of the global health and social workforce. Human Resources for Health Observer Series No. 24. Geneva: World Health Organization; 2019.
Doran T, Fullwood C, Doran T, Reeves D, Gravelle H, Roland M. Exclusion of patients from pay-for-performance targets by English physicians. N Engl J Med. 2008;359:274–84.
Bandiera O, Fischer G, Prat A, Ytsma E. Do women respond less to performance pay? Building evidence from multiple experiments. CEPR Discussion Paper No. DP11724. London: Centre for Economic Policy Research; 2017.
Gupta N, Ayles H. Implications of feminization of the primary care medical workforce on pay-for-performance for chronic disease management. In: Protocol Registration No. CRD42018090021. PROSPERO International prospective register of systematic reviews; 2018. https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42018090021.
Liberati A, Altman DG, Tetzlaff J, Mulrow C, Gøtzsche PC, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. PLoS Med. 2009;6(7):e1000100.
Gupta N, Ayles H. Systematic review protocol: examining the effects of introducing pay-for-performance for primary care physicians in diabetes outcomes in single-payer healthcare systems. Diabetes Population Health and Health Services Research Working Paper No. 2017-01. Fredericton: University of New Brunswick; 2017.
Movsisyan A, Melendez-Torres GJ, Montgomery P. Users identified challenges in applying GRADE to complex interventions and suggested an extension to GRADE. J Clin Epidemiol. 2016;70:191–9.
LeBlanc E, Bélanger M, Thibault V, et al. Influence of a pay-for-performance program on glycemic control in patients living with diabetes by family physicians in a Canadian province. Can J Diabetes. 2017;1(2):190–6.
Lippi Bruni M, Nobilio L, Ugolini C. Economic incentives in general practice: the impact of pay-for-participation and pay-for-compliance programs on diabetes care. Health Policy. 2009;90:140–8.
Yuan SP, Huang CN, Liao HC, et al. Glycemic control outcomes by gender in the pay-for-performance system: a retrospective database analysis in patients with type 2 diabetes mellitus. Int J Endocrinol. 2014;2014:575124.
Hsieh HM, Chiu HC, Lin YT, Shin SJ. A diabetes pay-for-performance program and the competing causes of death among cancer survivors with type 2 diabetes in Taiwan. Int J Qual Health Care. 2017;29(4):512–20.
Pan CC, Kung PT, Chiu LT, Liao YP, Tsai WC. Patients with diabetes in pay-for-performance programs have better physician continuity of care and survival. Am J Manag Care. 2017;23(2):e57–66.
Crawley D, Ng A, Mainous AG, et al. Impact of pay for performance on quality of chronic disease management by social class group in England. J R Soc Med. 2009;102(3):103–7.
Millett C, Bottle A, Ng A, et al. Pay for performance and the quality of diabetes management in individuals with and without co-morbid medical conditions. J R Soc Med. 2009;102(9):369–77.
Greene J. An examination of pay-for-performance in general practice in Australia. Health Serv Res. 2013;48(4):1415–32.
Nicholson S, Pauly MV, Wu AYJ, Murray JF, Teutsch SM, Berger ML. Getting real performance out of pay-for-performance. Milbank Q. 2008;86(3):435–57.
Boeckxstaens P, Smedt DD, Maeseneer JD, et al. The equity dimension in evaluations of the quality and outcomes framework: a systematic review. BMC Health Serv Res. 2011;11(209). https://doi.org/10.1186/1472-6963-11-209.
Gupta N, Lavallée R, Ayles J. Gendered effects of pay for performance among family physicians for chronic disease care: an economic evaluation in a context of universal health coverage. Hum Resour Health. 2019;17(4). https://doi.org/10.1186/s12960-019-0378-0.
Xiu L, Gunderson M. Performance pay in China: gender aspects. Br J Ind Relat. 2013;51(1):124–47. https://doi.org/10.1111/j.1467-8543.2011.00887.x.
Cislak A, Formanowicz M, Saguy T. Bias against research on gender bias. Scientometrics. 2018;115(1):189–200. https://doi.org/10.1007/s11192-018-2667-0.
Fox CW, Paine CET. Gender differences in peer review outcomes and manuscript impact at six journals of ecology and evolution. Ecol Evol. 2019;9(6):3599–619. https://doi.org/10.1002/ece3.4993.
Bernard C. Gender bias in publishing: double-blind reviewing as a solution? eNeuro. 2018;5(3). https://doi.org/10.1523/ENEURO.0225-18.2018.
Greenhalgh T. What have the social sciences ever done for equity in health policy and health systems? Int J Equity Health. 2018;17(124). https://doi.org/10.1186/s12939-018-0842-9.
The Lancet. Feminism is for everybody. Lancet. 2019;393:493.
Helmer M, Schottdorf M, Neef A, Battaglia D. Gender bias in scholarly peer review. eLife. 2017;6:e21718. https://doi.org/10.7554/eLife.21718.
Bohren MA, Javadi D, Vogel JP. Gender balance in WHO panels for guidelines published from 2008 to 2018. Bull World Health Organ. 2019;97:477–85.
Melendez-Torres GJ, Thomas J, Lorenc T, et al. Just how plain are plain tobacco packs: re-analysis of a systematic review using multilevel meta-analysis suggests lessons about the comparative benefits of synthesis methods. Syst Rev. 2018:7(153). https://doi.org/10.1186/s13643-018-0821-7.
Gentles SJ, Stacey D, Bennett C, et al. Factors explaining the heterogeneity of effects of patient decision aids on knowledge of outcome probabilities: a systematic review sub-analysis. Syst Rev. 2013:2(95). https://doi.org/10.1186/2046-4053-2-95.
Woolhandler S, Ariely D, Himmelstein DU. Why pay for performance may be incompatible with quality improvement. BMJ. 2012;345:e5015.
Anselmi L, Borghi J, Brown GW, et al. Pay for performance: a reflection on how a global perspective could enhance policy and research. Int J Health Policy Manag. 2020;9(9):365–9.
Soucat A, Dale E, Mathauer I, Kutzin J. Pay-for-performance debate: not seeing the forest for the trees. Health Syst Reform. 2017;3(2):74–9. https://doi.org/10.1080/23288604.2017.1302902.
Witter S, Fretheim A, Kessy FL, Lindahl AK. Paying for performance to improve the delivery of health interventions in low- and middle-income countries. Cochrane Database Syst Rev. 2012;2:CD007899. https://doi.org/10.1002/14651858.CD007899.pub2.
Ridde V, Gautier L, Turcotte-Tremblay AM, et al. Performance-based financing in Africa: time to test measures for equity. Int J Health Serv. 2018;48(3):549–61. https://doi.org/10.1177/0020731418779508.
Percival V, Dusabe-Richards E, Wurie H, et al. Are health systems interventions gender blind? Examining health system reconstruction in conflict affected states. Glob Health. 2018;14:90. https://doi.org/10.1186/s12992-018-0401-6.
Farrer L, Marinetti C, Cavaco YK, Costongs C. Advocacy for health equity: a synthesis review. Milbank Q. 2015;93(2):392–437.
The authors acknowledge and thank Barry Cull and Richelle Witherspoon, from the University of New Brunswick Libraries, for the assistance with developing the search tools. Some of the findings of this research were presented at the Canadian Health Workforce Conference (Gatineau, Canada, 3–5 October 2018) and the International Health Workforce Collaborative Conference (Ottawa, Canada, 22–24 October 2019).
Financial support for this study was received from the Diabetes Canada, the New Brunswick Health Research Foundation, and the University of New Brunswick. The funders had no role in the study design, data analysis, manuscript writing, or decision to submit for publication.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
List of studies included in the systematic review:
Greene J. An examination of pay-for-performance in general practice in Australia. Health Serv Res 2013; 48(4):1415–32.
Scott A, Schurer S, Jensen PH, Sivey P. The effects of an incentive program on quality of care in diabetes management. Health Econ 2009;18(9):1091-108.
LeBlanc E, Bélanger M, Thibault V, Babin L, Greene B, Halpine S, Mancuso M. Influence of a pay-for-performance program on glycemic control in patients living with diabetes by family physicians in a Canadian province. Can J Diabetes, 2017; 1(2):190–96.
Lavergne MR, Law MR, Peterson S, Garrison S, Hurley J, Cheng L, McGrail K. A population based analysis of incentive payments to primary care physicians for the care of patients with complex disease. CMAJ 2016; 188(15):e375–e383.
Hollander MJ, Kadlec H. Incentive-based primary care: cost and utilization analysis. Perm J 2015; 19(4):46-56.
Rudkjøbinga A Vrangbaek K, Birk HO, Andersen JS, Krasnik A. Evaluation of a policy to strengthen case management and quality of diabetes care in general practice in Denmark. Health Policy, 2015; 119(8):1023–30.
Iezzi E, Lippi Bruni M, Ugolini C. The role of GP’s compensation schemes in diabetes care: evidence from panel data. J Health Econ, 2014; 34:104–20.
Lippi Bruni M, Nobilio L, Ugolini C. Economic incentives in general practice: the impact of pay-for-participation and pay-for-compliance programs on diabetes care. Health Policy 2009; 90:140-48.
Ödesjö H, Anell A, Gudbjörnsdottir S, Thorn J, Björck S. Short-term effects of a pay-for-performance programme for diabetes in a primary care setting: an observational study. Scand J Prim Health Care, 2015; 33(4):291–97.
Hsieh HM, He JS, Shin SJ, Chiu HC, Lee CT. A diabetes pay-for-performance program and risks of cancer incidence and death in patients with type 2 diabetes in Taiwan. Prev Chronic Dis. 2017; 14:E88.
Hsieh HM, Chiu HC, Lin YT, Shin SJ. A diabetes pay-for-performance program and the competing causes of death among cancer survivors with type 2 diabetes in Taiwan. Int J Qual Health Care 2017; 29(4):512–20.
Pan CC, Kung PT, Chiu LT, Liao YP, Tsai WC. Patients with diabetes in pay-for-performance programs have better physician continuity of care and survival. Am J Manag Care 2017; 23(2): e57–e66.
Chen CC, Cheng SH. Does pay-for-performance benefit patients with multiple chronic conditions? Evidence from a universal coverage health care system. Health Policy Plan 2016; 31(1):83-90.
Chen YC, Lee CT, Lin BJ, Chang YY, Shi HY. Impact of pay-for-performance on mortality in diabetes patients in Taiwan: a population-based study. Medicine 2016; 95(27):e4197.
Chi MJ, Chou KR, Pei D, et al. Effects and factors related to adherence to a diabetes pay-for-performance program: analyses of a national health insurance claims database. J Am Med Dir Assoc 2016; 17(7):613-19.
Hsieh HM, Lin TH, Lee IC, et al. The association between participation in a pay-for-performance program and macrovascular complications in patients with type 2 diabetes in Taiwan: a nationwide population-based cohort study. Preventive Medicine 2016; 85:53–59.
Hsieh HM, Shin SJ, Tsai SL, Chiu HC. Effectiveness of pay-for-performance incentive designs on diabetes care. Med Care 2016; 54(12):1063-69.
Huang YC, Lee MC, Chou YJ, Huang N. Disease-specific pay-for-performance programs: do the P4P effects differ between diabetic patients with and without multiple chronic conditions? Med Care 2016; 54(11):977-83.
Lin TY, Chen CY, Huang YT, et al. The effectiveness of a pay for performance program on diabetes care in Taiwan: a nationwide population-based longitudinal study. Health Policy 2016; 120(11):1313-21.
Lo HY, Yang SL, Lin HH, Bai KJ, Lee JJ, Lee TI, Chiang CY. Does enhanced diabetes management reduce the risk and improve the outcome of tuberculosis? Int J Tuberc Lung Dis 2016; 20(3):376-82.
Yen SM, Kung PT, Sheen YJ, Chiu LT, Xu XC, Tsai WC. Factors related to continuing care and interruption of P4P program participation in patients with diabetes. Am J Manag Care 2016; 22(1):e18-e30.
Hsieh HM, Gu SM, Shin SJ, et al. Cost-effectiveness of a diabetes pay-for-performance program in diabetes patients with multiple chronic conditions. PLoS ONE 2015; 10(7):e0133163.
Hsieh HM, Tsai SL, Shin SJ, Mau LW, Chiu HC. Cost-effectiveness of diabetes pay-for-performance incentive designs. Med Care 2015; 53(2):106-15.
Tan EC, Pwu RF, Chen DR, Yang MC. Is a diabetes pay-for-performance program cost-effective under the National Health Insurance in Taiwan? Qual Life Res 2014; 23(2):687-96.
Yu HC, Tsai WC, Kung PT. Does the pay-for-performance programme reduce the emergency department visits for hypoglycemia in type 2 diabetic patients? Health Policy Plan 2014; 29(6):732-41.
Yuan SP, Huang CN, Liao HC, et al. Glycemic control outcomes by gender in the pay-for-performance system: a retrospective database analysis in patients with type 2 diabetes mellitus. Int J Endocrinol 2014; 2014:575124.
Lai CL, Hou YH. The association of clinical guideline adherence and pay-for-performance among patients with diabetes. J Chin Med Assoc 2013; 76(2):102-7.
Chang RE, Lin SP, Aron DC. A pay-for-performance program in Taiwan improved care for some diabetes patients, but doctors may have excluded sicker ones. Health Affairs 2012; 31(1):93-102.
Cheng SH, Lee TT, Chen CC. A longitudinal examination of a pay-for-performance program for diabetes care: evidence from a natural experiment. Med Care 2012; 50(2):109-16.
Lee TT, Cheng SH, Chen CC, Lai MS. A pay-for-performance program for diabetes care in Taiwan: a preliminary assessment. Am J Manag Care 2010; 16(1):65-69.
Kontopantelis E, Springate DA, Ashworth M, Webb RT, Buchan IE, Doran T. Investigating the relationship between quality of primary care and premature mortality in England: a spatial whole-population study. BMJ 2015; 350:h904.
Alshamsan R, Lee JT, Majeed A, et al. Effect of a UK pay-for-performance program on ethnic disparities in diabetes outcomes: interrupted time series analysis. Ann Fam Med 2012; 10(3):228-34.
Oluwatowoju I, Abu E, Wild SH, Byrne CD. Improvements in glycaemic control and cholesterol concentrations associated with the Quality and Outcomes Framework: a regional 2-year audit of diabetes care in the UK. Diabet Med 2010; 27(3):354-59.
Campbell SM, Reeves D, Kontopantelis E, et al. Effects of pay for performance on the quality of primary care in England. N Engl J Med 2009; 361(4):368-78.
Crawley D, Ng A, Mainous AG, et al. Impact of pay for performance on quality of chronic disease management by social class group in England. J R Soc Med 2009; 102(3):103-7.
Millett C, Bottle A, Ng A, et al. Pay for performance and the quality of diabetes management in individuals with and without co-morbid medical conditions. J R Soc Med 2009; 102(9):369-77.
Millett C, Netuveli G, Saxena S, Majeed A. Impact of pay for performance on ethnic disparities in intermediate outcomes for diabetes: a longitudinal study. Diabetes Care 2009; 32(3):404-9.
Vaghela P, Ashworth M, Schofield P, Gulliford MC. Population intermediate outcomes of diabetes under pay-for-performance incentives in England from 2004 to 2008. Diabetes Care 2009; 32(3):427-29.
Millet C, Gray J, Saxena S, et al. Ethnic disparities in diabetes management and pay-for-performance in the UK: the Wandsworth prospective study. PLoS Med 2007; 4(6):e191.
About this article
Cite this article
Gupta, N., Ayles, H.M. The evidence gap on gendered impacts of performance-based financing among family physicians for chronic disease care: a systematic review reanalysis in contexts of single-payer universal coverage. Hum Resour Health 18, 69 (2020). https://doi.org/10.1186/s12960-020-00512-9
- Physician reimbursement
- Gender-based analysis
- Health workforce financing
- Systematic review