![]() |
|
|
|
|
|
Systems Learning from Physician Performance Data | pdf >> By Patricia H Parkerton, PhD, MPH; Gary A Feldbau, MD; Hugh L Straley, MD
Introduction The Group Health Cooperative of Puget Sound (Group Health) comprises medical centers and networked physician practices in the State of Washington and selected counties in northern Idaho; Group Health Permanente (GHP) physicians work in these medical centers. From 1997 through 1998, GHP primary care physicians received quarterly reports of data collected on their performance. The physicians were working toward improved practice. However, results from practice and performance measures continued to vary widely within and between the 25 medical centers. The Associate Medical Director for Quality and Research and Director of the Sandy MacColl Institute for Health Care Improvement agreed to sponsor a doctoral student to assess the data and its value to improving primary care practice. The objectives were to evaluate reliability of current performance assessment measures and to determine if aggregation of these measures of physician performance was appropriate, to identify components or structures of physician practice that influence patient outcomes, and to extract ideas for practice improvement from the results of this research. The collaboration yielded new insights into physician performance assessment and practice structures, and articles on different aspects of the research were published in Medical Care, The Journal of General Internal Medicine, and Family Practice.1-3 Here we summarize the published research and results. Methods Interviews with 30 key physicians and administrators generated the following research questions, which guided the type of data collected and the data analyses.
Setting, Study Design,
and Data Collected The cross-sectional research design used existing administrative data integrated with additional practice and physician data that we collected. Quarterly reports distributed by the medical group to the physicians for two years before this study included measures of individual physician performance, appointment access, panel size and composition, and patient case-mix. Additional data included physicians' board certification, validation for specialty, and gender (all obtained from medical directories and from the American Medical Association Web site),4,5 and practice structure data gathered from the Human Resources department and practice leaders. These data were merged with the performance data using a random identifier to protect physician confidentiality. This research was approved by the Institutional Review Board associated with Group Health and its research center. Funding sources placed no constraints upon this research, and Group Health allowed the researcher access to the organization and its data without determining the topic of inquiry, its analysis, or interpretation. Physician Performance Measures
"Higher
rates correspond to better outcomes for cancer screening, diabetic management
compliance, and patient satisfaction, whereas the preferred cost outcome
is lower. These measures are widely used,[6] have sufficient
patient populations to provide reliable assessment,[7] and
represent different aspects of care. For cancer screening, which combines
rates of screening [in] different subpopulations, the aggregate measure
is the mean of component measures, both
of which summarize comparable screening rates for subgroups of each patient
panel. Assuming a direct relationship between physician services and patient
outcomes, these aggregate measures are indicators of both physician performance
and the cumulative quality of care received by their patients. The measures
of clinical processes, which included preventive services and disease
treatment, and outcome measures, which included patient satisfaction,
are included Independent Predictors
of Patient Outcomes Statistical Analyses Figure
1. Model of the statistical analysis used to determine influence of practice
variables on aggregated measures of physician performance Results Wide Practice Variation
for Each Performance Measure Physicians' Performance
Varies by Measure Are cancer screening, diabetic management, patient satisfaction, and ambulatory costs reasonable and reliable aggregate measures of physician performance? Yes, when the aggregate measures are each evaluated, and performance is evaluated for groups of physicians rather than individual physician performance. Each aggregate measure was reliable and independent, but loosely predictive, of the others. Each aggregate measure was significantly correlated with one or two of the other measures; high cancer screening rates correlated with close diabetic management and with high patient satisfaction scores; high diabetic management rates correlated with high cancer screening rates; and high patient satisfaction scores correlated with high cancer screening rates and high ambulatory costs. Further, physician performance was inconsistent across aggregate measures. More than 70% of the physicians ranked in the top third for at least one measure, but 80% of these same physicians ranked in the lowest third for a different measure. Sixty percent of the physicians ranked in the top third for one measure and in the bottom third for another.1 "Assessments of individual physicians with current performance measures may identify areas in which improvement is needed and facilitate provision of feedback to improve performance quality and efficiency. However, these performance measures, singly or as a unit, should be used cautiously to select, motivate, and reward physicians, or to encourage consumer assessment. There are relationships among physician performances in cancer screening, diabetes management, patient satisfaction and ambulatory costs. However these relationships are inconsistent across all physicians and unreliable for individual physicians."1 Part-time Practice Performance
Not Worse After adjusting for potential confounders, our analysis showed that as physician direct patient care hours decreased by 10%, the rate of cancer screening for women increased by 0.7% (p = .010), and the rate of diabetic management increased by 1.1% (p = .008). No association existed between physician direct patient care hours and patient satisfaction (p = .212) or ambulatory costs (p = .323). Although the data supported the analysis of continuous data at a minimum of three FTE (ten hours), no "threshold" of performance was found.3 Figure
2. Ranges of aggregate physician performance measures Adapted
and reproduced with permission of the
Practice Structures Coordinate
and Improve Care Discussion Two of our objectives involved testing HEDIS measures for reliability when used to assess physician performance. We found that each individual measure was reliable over time and that reliability increased when individual measures were grouped with correlated measures into four aggregate measures. However, physician performance was not consistent across measures, a finding that was also reported recently in Boston area clinics.10 Although some specific measures are significantly related, overall predictive value of any single measure is low. The aggregate performance measures appear to assess different aspects of practice; therefore, blending their results may mislead to conclusions. "Because these aggregate measures are not strongly correlated, an overall measure, or using [one] as a proxy for all, is not recommended. Care should be taken in assessing physicians based on narrow performance measures resulting from current inconsistency in performance and the evolution of quality measures."1 Our efforts to identify influences on patient outcomes associated with physician practice organization were productive, although the results were not as anticipated. We found that part-time physicians performed as well or better on the aggregate measures--including the measure of patient satisfaction--than those who worked more hours. As appointment hours decreased, performance either held constant or improved. Contrary to expectations, the trend toward reduced clinical hours merited attention but was not a current problem. Moreover, physician continuity, which we encourage, did not reach higher levels of physician performance. In fact, patient satisfaction declined as continuity increased. We believe that the explanation for this lies in coordinating structures around the physician-patient communication mechanisms other than the traditional visit (eg, telephone, e-mail, team members). Each of three practice structures were positively associated with some patient outcome measures: shared practice, larger medical centers, and clinical team tenure of 4 to 15 years. Because these practice structures were selected for availability (ie, convenience), they may not be the most influential structures.
Adapted
and reproduced with permission of the
Our final objective was to identify ideas for practice improvement. Although we considered the composition and roles within primary care teams to be potentially influential, gathering team data was difficult. We attempted to assess nursing and team pharmacist roles, but no administrative source was available, and our data were incomplete and lacked sufficient power to show statistically significant effects. Better understanding of team member roles and their optimization would be useful. Limitations of this research include its focus on physicians working in a medical group within a health maintenance organization, in teams with other practitioners, and within a single organization. Generalizability of these findings is also limited by the reduced reliability of performance measures for physicians with smaller patient panels or who provide care, for example, for few diabetic patients.7 In addition, the physicians included in our research cared for patients who had comprehensive health insurance benefits, and the physicians functioned as gatekeepers to specialty services. Therefore, our analyses implicitly controlled for specialty, organization, health benefits, payment, access to service, and designation of primary physician. Conclusions Reduced physician hours and physician continuity did not reduce the four aggregate measures of patient outcome, and some primary care practice structures (shared practice, larger medical centers, clinical team tenure of 4 to 15 years) benefited patient outcomes. Interviews with key leaders helped us to formulate useful research questions and to increase access to data. Individual primary care physician performance data yielded collective clinical practice information. Our analyses led to conclusions which differed from popular opinion and thus redirected some planning efforts. Analyzing physician performance data can help us to identify effective primary care practice structures and processes and can ultimately benefit patient care. However, constructive use of physician performance data requires acknowledgment of both positive and negative performance by individual physicians so that accolades are supported and poor practices are not masked. Performance data on a population of physicians can also allow their efforts to be tracked to improve the practice environment.
Acknowledgments This research received financial support from The Blue Cross Blue Shield of Michigan Foundation. The Sandy MacColl Institute for Healthcare Improvement and Group Health provided resource support for data collection. Dean Smith, PhD, Edward H Wagner, MD, MPH; and Mary Richardson, PhD; provided academic guidance. Staff of Group Health providing insight and data access included Brian Austin; Susan Crissman, RN, MNEd, MPH; Matthew Handley, MD; Michael Wanderer, MD; and Nirmala Sandhu, MPH. Dr Parkerton received support for her doctoral program as an Agency for Healthcare Research and Quality (AHRQ) Fellow. References
|
|
|
|