Europe PMC

This website requires cookies, and the limited processing of your personal data in order to function. By using the site you are agreeing to this as outlined in our privacy notice and cookie policy.

Abstract 


Purpose

The aim of this study was to evaluate the concordance between claims records in the National Health Insurance Research Database and patient self-reports on clinical diagnoses, medication use, and health system utilization.

Methods

In this study, we used the data of 15,574 participants collected from the 2005 Taiwan National Health Interview Survey. We assessed positive agreement, negative agreement, and Cohen's kappa statistics to examine the concordance between claims records and patient self-reports.

Results

Kappa values were 0.43, 0.64, and 0.61 for clinical diagnoses, medication use, and health system utilization, respectively. Using a strict algorithm to identify the clinical diagnoses recorded in claims records could improve the negative agreement; however, the effect on positive agreement and kappa was diverse across various conditions.

Conclusion

We found that the overall concordance between claims records in the National Health Insurance Research Database and patient self-reports in the Taiwan National Health Interview Survey was moderate for clinical diagnosis and substantial for both medication use and health system utilization.

Free full text 


Logo of plosoneLink to Publisher's site
PLoS One. 2014; 9(12): e112257.
PMCID: PMC4251897
PMID: 25464005

Concordance between Patient Self-Reports and Claims Data on Clinical Diagnoses, Medication Use, and Health System Utilization in Taiwan

Chi-Shin Wu, 1 , 2 , 3 Mei-Shu Lai, 2 Susan Shur-Fen Gau, 2 , 3 Sheng-Chang Wang, 4 and Hui-Ju Tsai 5 , 6 , 7 , *
Gabriele Multhoff, Editor

Associated Data

Supplementary Materials
Data Availability Statement

Abstract

Purpose

The aim of this study was to evaluate the concordance between claims records in the National Health Insurance Research Database and patient self-reports on clinical diagnoses, medication use, and health system utilization.

Methods

In this study, we used the data of 15,574 participants collected from the 2005 Taiwan National Health Interview Survey. We assessed positive agreement, negative agreement, and Cohen's kappa statistics to examine the concordance between claims records and patient self-reports.

Results

Kappa values were 0.43, 0.64, and 0.61 for clinical diagnoses, medication use, and health system utilization, respectively. Using a strict algorithm to identify the clinical diagnoses recorded in claims records could improve the negative agreement; however, the effect on positive agreement and kappa was diverse across various conditions.

Conclusion

We found that the overall concordance between claims records in the National Health Insurance Research Database and patient self-reports in the Taiwan National Health Interview Survey was moderate for clinical diagnosis and substantial for both medication use and health system utilization.

Introduction

The use of automated claims databases is a widely-used method for obtaining information for use in epidemiological studies. The advantages of claims-based studies compared with collecting information from patient reports or medical chart reviews include the inexpensiveness, large sample size, and lower risk of non-response [1], [2]. Taiwan's National Health Insurance Research Database (NHIRD), derived from the single-payer, compulsory National Health Insurance (NHI) program in Taiwan [3], is one of the largest available claims databases. As of 2007, about 22.6 million (98%) Taiwanese were enrolled. The NHI program covers ambulatory care, hospitalization and dental services, as well as preventive services. Patients' demographic characteristics, diagnoses, prescriptions, hospitalizations, and medical expenditures are recorded in the NHIRD. The number of published scientific papers using the NHIRD has increased dramatically in recent years [4]. However, since claims records are mainly generated to capture information for reimbursement purposes, physicians might record a diagnosis to receive payment when it is actually only just being considered. In contrast, a diagnosis which is not related to reimbursement might be under-recorded.

Patient self-report is a common method that is used to gather health information and that usually can provide other important data, such as the patient's perspective regarding their own general health conditions, lifestyle, and health behaviors. However, self-report has been shown to be affected by measurement error as the result of recall bias and the social desirability effect [5].

Both measurement methods have strengths and limitations. A linkage between claims data and patient self-reports could complement each other and enhance the utilization of both data sources in epidemiology and health services research. Despite the potential advantages, the concordance between claims data and patient self-report is still under-investigated in Taiwan. To date, only one study has examined the agreement between self-report and claims data on health system utilization, and the findings showed a fairly good concordance in the general population [6]. However, the concordance of clinical diagnoses and medication use in Taiwan remains unclear.

In this study, we used self-reported data collected from a nationwide representative survey, the 2005 Taiwan National Health Interview Survey (NHIS), to evaluate the concordance of clinical diagnoses, medication use and health system utilization, separately, between the self-reported data in the NHIS and claims records in the NHIRD. In addition, factors associated with disconcordance between the self-reported data and the claims records were also explored.

Materials and Methods

Study population

This study utilized data from the 2005 Taiwan National Health Interview Survey (NHIS), which is a nationwide cross-sectional survey used to investigate the health status of non-institutionalized residents in Taiwan. Detailed information related to the study design of the 2005 Taiwan NHIS can be found elsewhere [7]. In brief, households were randomly selected using a multistage stratified systematic sampling scheme. All members of these households were interviewed by trained interviewers using structured questionnaires. There were three versions of the questionnaires, one for each of the three age groups (<12, 12–64, and [greater, double equals]65 years, separately). The questionnaire contained several domains, including sociodemographic characteristics, personal health status, health system utilization, and occupation and economic status. Most participants in the survey were interviewed from April to July 2005. A total of 30,680 residents was selected and 24,726 (80.6%) agreed to participate in the 2005 Taiwan NHIS. In this study, we included only those aged 12 years and above, yielding 20,826 participants. Of this group, 15,574 (75%) participants gave informed consent to allow us to link their NHIS data with the data of Taiwan's NHIRD for research purposes. After linkage, any information that could be used to identify the participants was anonymized and de-identified prior to analysis. The study was approved by the ethics committee of the National Health Research Institutes.

Self-reported measures

We used information from the NHIS to measure self-reports in three domains, including clinical diagnoses, medication use, and health system utilization. Fourteen common clinical diagnoses, including hypertension, diabetes mellitus, dyslipidemia, malignancy, stroke, asthma, chronic pulmonary diseases, gout, osteoporosis, arthritis, renal disease, heart diseases, chronic hepatitis, and psychiatric disorders were explored in the versions of the questionnaires for the 12–64 and [greater, double equals]65 years age groups. The participants were asked two sequential questions, “Do you have this disease?” and “Were you told by a doctor or a nurse that you have this disease?” For some diseases with an episodic course, such as asthma, the participants were asked a third question, “Did you have this disease during the last year?” If the answers to these questions were all ‘Yes', the patient was classified as having this disease. If the patient had any negative or uncontained answers for a question, we classified the patient as not having this disease.

For patients with any of the above-mentioned diseases, self-reported medication use was explored by asking, “Have you received any medication for treatment in the last year?” Given that some diseases were treated without specific medications, we analyzed the concordance in medication use by focusing only on five types of drugs, including antihypertensive drugs, antidiabetic drugs, lipid-lowering agents, anti-asthmatic drugs, and anti-gout drugs.

With regard to health system utilization, we asked, “Were you hospitalized during the last year?”; “Did you visit the emergency department during the last year?”; “Did you utilize dental services during the last year?”; and “Did you have a health examination though the NHI program during the last year?”

Claims-recorded measures

The clinical diagnoses in the claims records were coded using the clinicians' judgment based on clinical history, diagnostic criteria, image, or biochemical data. The present analysis used NHIRD claims records for the one year preceding the 2005 Taiwan NHIS. We set May 15, 2005, which was approximately the median date of the survey, as the date for interviewing all participants. Both ambulatory and inpatient claims records for the period from May 16, 2004 to May 15, 2005 were extracted from the NHIRD for subsequent analysis. We used ICD9-CM codes in any diagnostic position, rather than only the primary position, to identify 14 clinical diagnoses from the ambulatory and inpatient claims in the NHIRD. Claims records from all types of healthcare settings were included. Among outpatient claims records during the study period, approximately 82.4% were Western Medicine, 7.8% were dental services, and 9.8% were Chinese Medicine. The target medications were identified using the Anatomical Therapeutic Chemical (ATC) classification system [8]. Given that a medication can be used for different indications, we defined a participant as a medication user only if the patient had a prescription record for a targeted medication with a corresponding diagnosis. For example, diuretics can be prescribed for hypertension or renal diseases; therefore, only those participants who received a prescribed diuretic and had a diagnosis of hypertension were defined as antihypertensive users. Regarding health system utilization, we assessed the type of medical services provided for each claims record in the NHIRD. The detailed ICD9-CM codes for clinical diagnoses and ATC codes for medication are provided in Table S1.

Statistical analysis

The percentages for each item, clinical diagnoses, medication use, and health system utilization, based on self-reports or claims records were calculated by dividing the number of participants who had the presence of each item, separately, by the total number of participants. In terms of the overall percentage of each domain, the percentage was calculated as the average value for each item within the same domain. For the concordance between self-reports and claims records, we calculated the positive, negative, and total agreement and Cohen's kappa statistics, a chance-adjusted agreement [9], [10]. The interpretation for kappa was based on Landis and Koch's classifications: <0.2 as slight, 0.21–0.40 as fair, 0.41–0.60 as moderate, 0.61–0.80 as substantial, and 0.81–1.00 as almost perfect [11].

Given that physicians might initially code a tentative diagnosis and then change it later if further evidence did not support this initial diagnosis, the use of a strict algorithm, via increasing the number of claims records used to identify clinical diagnoses, is a common strategy to improve accuracy [8]. Therefore, we explored the concordance between self-reports and claims records using three algorithms to define the clinical diagnoses: [greater, double equals]1 outpatient or [greater, double equals]1 discharge claims, [greater, double equals]2 outpatient or [greater, double equals]1 discharge claims, and [greater, double equals]3 outpatient or [greater, double equals]1 discharge claims. The interval between two claims records was not specified; however, if two or more outpatient records were found on the same day, only one claims record was calculated.

Furthermore, we explored the factors of disconcordance between self-reports and claims records for diseases, medication use, and health system utilization. For each comparison, the outcome was dichotomized into agreement and disagreement between self-report and claims record. Explanatory variables included grossly trisected age groups (<30, 30–49, and [greater, double equals]50), sex, educational level (≤6, 7–12, and>12), marital status, and urbanization level of residence (urban, suburban, and rural). Since the analyses were based on patient-condition pairs, we estimated the odds ratios and 95% confidence intervals using the multivariate generalized estimating equation with an unstructured covariance matrix to eliminate the effect of multiple representations of each patient. In addition to the above-mentioned general factors, we also explored the effect of disease-specific factors on certain clinical diagnoses. For example, obesity is strongly associated with hypertension, diabetes, and dyslipidemia. As such, we extract information on body mass index (BMI) from Taiwan's NHIS and categorized participants into normal (BMI<24), overweight (24≤BMI<27), and obese (BMI[greater, double equals]27). Using a logistic regression model, the effect of overweight and obesity on the agreement between hypertension, diabetes, and dyslipidemia were investigated.

SAS version 9.2 was used for all statistical analyses (SAS Institute, Cary, NC).

Results

A total of 15,574 participants aged 12 years and above who consented to link their NHIS data with the NHIRD were included in the analysis. The mean age of participants was 41.2±18.7 years, 47.8% were females, 57.6% were married, 76.9% had more than 6 years of education, and 44.6% lived in urban areas. Compared with those who did not consent to link their NHIS data with the NHIRD, the consenters tended to be younger, more likely to be males, more highly educated, and more likely living in an urban area. However, the difference between consenters and nonconsenters was small (Table 1).

Table 1

Characteristics of study subjects with and without consent to data linkage.
Total sampleConsentNo consent
(N = 20,826)(N = 15,574)(N = 5,252)
Age, years
12–296,910 (33.2)5,424 (34.8)1,486 (28.3)
30–497,439 (35.7)5,672 (36.4)1,767 (33.6)
≥506,477 (31.1)4,478 (28.8)1,999 (38.1)
Gender
Female10,191 (48.9)7,442 (47.8)2,749 (52.3)
Male10,635 (51.1)8,132 (52.2)2,503 (47.7)
Education, years
≤65,291 (25.4)3,600 (23.1)1,691 (32.2)
7–129,969 (47.9)7,623 (48.9)2,346 (44.7)
≥135,550 (26.6)4,344 (27.9)1,206 (23.0)
Marital status
Single7,364 (35.4)5,701 (36.6)1,663 (31.7)
Married/living as married12,263 (58.9)8,977 (57.6)3,286 (62.6)
Divorced/widowed/separated1,156 (5.6)862 (5.5)294 (5.6)
Residential area
Urban9,313 (44.7)6,944 (44.6)2,369 (45.1)
Suburban8,410 (40.4)6,204 (39.8)2,206 (42.0)
Rural3,103 (14.9)2,426 (15.6)677 (12.9)
Note:
All p-values for the chi-square tests between patients with or without consent were <0.001.
A total of 16 participants had missing data for educational level and 43 participants had missing data for marital status.

The prevalence of most clinical diagnoses, medication use, and health system utilization based on claims records was higher than that based on self-report measures, except for dyslipidemia, osteoporosis, use of lipid lowering agents, and utilization of dental services (Table 2). The kappa varied across conditions, ranging from 0.18 for chronic pulmonary diseases to 0.85 for anti-diabetes drug use. The positive agreement ranged from 0.20 for chronic pulmonary diseases to 0.85 for anti-diabetes drug use. The total and negative agreement was high (all >0.8).

Table 2

Concordance between self-reports and claims records, by disease diagnoses, medication use, and health system utilization.
Self-reports (%)Claims records (%)In claims records, in self-reports (%)In self-reports only (%)In claims records only (%)Not in claims records, not in self-reports (%)Total agreementPositive agreementNegative agreementKappa
Diagnoses
Hypertension11.612.78.82.73.884.60.930.730.960.69
Diabetes4.75.74.00.71.793.60.980.770.990.76
Dyslipidemia11.66.93.58.03.385.10.890.380.940.32
Malignancy0.81.60.70.10.998.30.990.570.990.57
Stroke1.22.50.80.41.797.10.980.430.990.43
Asthma2.12.80.91.11.996.00.970.380.980.37
Chronic pulmonary diseases2.45.30.81.64.593.10.940.200.970.18
Gout3.54.21.91.62.394.20.960.500.980.48
Osteoporosis3.71.90.82.91.195.20.960.290.980.27
Arthritis3.37.01.61.75.491.20.930.310.960.28
Renal diseases3.64.11.32.32.793.60.950.340.970.32
Heart diseases3.98.22.61.35.690.50.930.430.960.40
Chronic hepatitis3.27.12.01.25.191.70.940.390.970.36
Psychiatric disorders1.66.81.00.65.892.60.940.230.970.21
Overall4.15.52.21.93.392.60.950.460.970.43
Medication use
Anti-hypertensives9.711.37.81.93.486.90.950.750.970.72
Anti-diabetes4.14.43.60.50.895.10.990.850.990.85
Lipid lowering agents3.73.21.62.11.694.70.960.460.980.44
Anti-asthmatics1.42.40.80.71.796.90.980.390.990.38
Anti-gout drugs2.93.01.41.61.795.40.970.450.980.44
Overall4.44.93.01.31.893.80.970.660.980.64
Health system utilization
Hospitalization7.08.05.21.82.890.20.950.690.980.67
Emergence room visit13.416.49.14.37.379.30.880.610.930.54
Dentistry services38.537.228.310.28.952.60.810.750.850.59
Health examination6.78.03.23.54.888.50.920.440.960.39
Overall16.417.411.44.96.077.70.890.680.930.61

Using a strict algorithm for identifying clinical diagnoses in the claims records would have decreased the estimated prevalence. However, the effect on positive agreement and kappa was diverse across various conditions (Figure. 1).

An external file that holds a picture, illustration, etc.
Object name is pone.0112257.g001.jpg
Comparison of the concordance between claims records and self-reports using different algorithms.

In terms of factors associated with disagreement between self-reports and claims records, we found that old age was associated with more disagreement in clinical diagnosis, medication use, and health system utilization. Subgroup analyses for the three different age groups are shown in Tables S2S4. Briefly, the prevalence, as well as the kappa, of most items increased with age; however, the total agreement decreased with age. Male gender was associated with disconcordance in medication use, but less disconcordance in health system utilization than in their female counterparts. There was less disagreement between self-reports and claims records of patients who were single or who had a high level of education. The level of urbanization of residency had little effect on disconcordance (Table 3). In terms of the effect of disease-specific factors on the concordance of clinical diagnosis, we found that obesity was associated with disagreement in hypertension, diabetes mellitus, and dyslipidemia but not in psychiatric disorders (Table S5).

Table 3

Factors associated with disconcordance between self-reports and claims records.
Disease diagnosisMedication useHealth system utilization
Odds ratio (95% CI)Odds ratio (95% CI)Odds ratio (95% CI)
Age group (vs. 12–29)
30–492.26(2.02, 2.54)3.17(2.49, 4.04)0.85(0.77, 0.94)
[greater, double equals]505.75(5.08, 6.51)9.53(7.38, 12.32)1.21(1.08, 1.36)
Gender
Male vs. female1.04(1.03, 0.99)1.51(1.38, 1.65)0.94(0.89, 0.99)
Education (vs. ≤6 years)a
7–120.71(1.03, 0.66)0.60(0.53, 0.67)0.82(0.76, 0.89)
[greater, double equals]130.76(1.04, 0.70)0.49(0.42, 0.57)0.78(0.72, 0.85)
Marital statusa (vs. single)
Married/living as married1.35(1.06, 1.21)1.15(0.95, 1.41)1.32(1.19, 1.46)
Divorced/widowed/separated1.25(1.07, 1.08)1.09(0.84, 1.41)1.18(1.02, 1.37)
Residence (vs. urban)
Sub-urban0.97(1.03, 0.91)0.92(0.84, 1.02)1.03(0.97, 1.09)
Rural1.05(1.04, 0.98)1.01(0.89, 1.14)1.08(1.00, 1.16)

Discussion

In this study, using a nationwide representative survey to explore the concordance between self-reports and claims records for clinical diagnoses, medication use, and health system utilization, the results from kappa statistics showed that the overall concordance was moderate for clinical diagnosis and substantial for medication use and health system utilization. In addition, negative agreement, an analogue of specificity, was quite high for most conditions.

We found that the concordance of chronic diseases, such as hypertension and diabetes mellitus, was substantial. These two diseases had clear diagnostic criteria and are well-known in the general population. Thus, the concordance of these two diseases, as well as those of anti-hypertensive and anti-diabetes drug use, was quite high. However, the concordance of stroke, malignancy, and gout were only moderate. The concordance for serious diseases, such as malignancy or stroke, was lower than our expectation. There are several possible explanations for this. The low prevalence of these serious diseases might affect kappa statistics [9], and the lack of an exact timeframe in the interview questions might also be responsible for the observed low concordance. Patients might have had these serious diseases in the past and recovered from them in recent years; therefore, they may have not visited clinics in the past year because of these medical conditions. We also observed that the concordance of gout was only moderate. Gout causes sudden and severe pain; however, given the episodic nature of gout, patients might not recall exactly.

Furthermore, we found that the concordance of dyslipidemia, asthma, chronic pulmonary diseases, osteoporosis, arthritis, renal diseases, heart diseases, chronic hepatitis, and psychiatric disorders were only fair. In fact, we found that the concordance of dyslipidemia was lower than our expectation. The prevalence of dyslipidemia based on self-reports (11.9%) was similar to the finding for the blood lipid examination in the Nutrition and Health Survey in Taiwan, which showed that the prevalence of hypercholesterolemia was 10.2% in men and 11.2% in women [12]. A possible explanation is that the NHI program only reimbursed for lipid-lowering agents among patients who had severe hyperlipidemia or who were comorbid with other cardiometabolic disease. Physicians might provide health information for those with mild dyslipidemia but not code the diagnosis if they did not prescribe lipid-lowering agents. Furthermore, only the first three diagnostic codes for ambulatory claims would be included in the NHIRD. Patients with dyslipidemia also usually have other serious comorbid conditions. Thus, the diagnosis of dyslipidemia might be coded in the original claims but not included in the NHIRD. The concordance of lipid-lowering agents was moderate, which is better than that for the diagnosis of dyslipidemia. Similar findings were also noted for osteoporosis, which was diagnosed via routine screening and patients were usually given lifestyle interventions initially. Thus, physicians might not code the diagnosis for most mild cases.

For composite diseases, such as renal diseases and heart diseases, the concordances were only fair. Lack of a specific definition of these clinical diagnoses might increase the discrepancy between self-reports and claims records. Of note, the prevalence of psychiatric illnesses based on self-reports was much lower than that based on claims records. A previous comprehensive survey using the Chinese-Modified Diagnostic Interview Schedule for psychiatric epidemiology found that the prevalence of major depressive disorder, bipolar disorder, and anxiety disorders was 1.14%, 0.17%, and 7.75%, respectively [13]. The estimated prevalence was similar to those based on claims records, rather than self-reports. Patients might lack insight or hesitate to disclose their psychiatric illness during a general survey. Thus, the concordance was only fair and the positive agreement was low.

The concordance of medication use was substantial, except for anti-asthmatic medications. Given that asthma attacks are episodic in nature, the use of anti-asthmatic medications would also be intermittent rather than regular. Therefore, patients might have difficulty recalling their exact use. The concordance of prescription records in the NHIRD was generally better and less varied than that of clinical diagnoses. Previous studies using different health claims databases also found that the concordance of medication use in claims records was better than that of other data sources [5], [14], [15].

We found that the concordance of health system utilization was generally substantial, which was compatible with a previous validity study of health system utilization [6]. In this study, we further explored the concordance of a preventive service, the routine health examination. We found that the concordance for a health examination was only moderate. Since a routine health examination by the NHI program is very convenient and can be performed in a clinical setting, participants might not even be aware that they received a routine health examination.

Individuals using private health insurance might under-estimate the prevalence of certain health conditions based on claims records. In Taiwan, approximately 64.8% have private health insurance [16]. However, almost all individuals with private health insurance are also enrolled in the NHI program. The role of private health insurance in Taiwan is only supplemental. Therefore, the effect of private health insurance on estimating prevalence-based claims records is mild.

We further conducted subgroup analyses for the concordance stratified by three different age groups. Generally, the concordance among participants aged 30–49 and [greater, double equals]50 years were grossly consistent with the findings among the whole study sample. However, we found that the kappa values of most clinical diagnoses among participants aged between 12–29 years were relatively small. It should be noted that the prevalence of most examined diseases was quite low among this age group. Since kappa is sensitive to the prevalence of studied items, a low kappa might be attributed to the paradoxical effect of low prevalence.

Strategies for improving the concordance

In this study, the prevalence of most clinical diagnoses based on self-reports was lower than that based on claims records, which was defined by the occurrence of one ambulatory or one inpatient claim. This could be overestimated because the physician might code a tentative diagnosis initially and change it later if further evidence did not support this initial diagnosis. Thus, using strict definitions would reduce the estimated prevalence based on claims records. However, whether the kappa could provide accurate estimated concordance would depend on the disease of interest. Generally, using strict definitions could improve the concordance in chronic and/or severe diseases such as diabetes, malignancy, and stroke. However, for diseases that are episodic in nature such as gout or asthma, using strict definitions would have no overt benefit on the concordance. For potentially underestimated diseases in the claims records, such as dyslipidemia and osteoporosis, using a broad definition to reflect both treated and untreated patients with these conditions might be better. As such, since there is no fixed algorithm which can be applied to all diseases, the use of strict or broad definitions should be cautiously considered based on the aims of the study and the trade-off between sensitivity and specificity [17].

Factors associated with disconcordance

In our analysis of the participant characteristics associated with disconcordance, we found that age has an important impact on disconcordance in diseases and medication use. Compared with young adults, the elderly have more complex diseases and a greater vulnerability to memory impairment; therefore, their self-reports are more likely to be discrepant with the claims records. These findings are consistent with previous studies showing that those at an older age or with a complex comorbidity were more likely to have discordance between self-reports and claims records [18], [19]. In addition, a low level of education was related to poor health knowledge and physician-patient communication, and thereby had a negative effect on the agreement between self-reports and claims records [20]. Single status was associated with more actual agreement. In Taiwan, patient family members actively participate in medical treatment decision-making. If the patient has a serious disease, such as a malignancy, the physician might discuss this with family members first, before the patient himself/herself. The family members might even decide to conceal this information to avoid possible negative emotional reactions of the patient.

To explore the effect of disease-specific factors on the concordance in clinical diagnosis, we assessed the effect of obesity on the concordance in hypertension, diabetes mellitus, and dyslipidemia. Obesity is strongly associated with these cardiometabolic disorders but not with psychiatric illness. While we found that obesity was associated with disagreement between claims records and self-reports in hypertension, diabetes mellitus, and dyslipidemia, obesity was not related to that in psychiatric illness. A possible explanation is that patients with obesity are more likely to fall in a gray area of definitive diagnosis; therefore, they might be confused about whether or not they have obesity-related cardiometabolic disorders. In contrast, the effect of obesity on the concordance in psychiatric illness was not significant. In summary, patients with strong risk factors for cardiometabolic disorders might increase the proportion of disagreement between claims record and self-reports.

Validity of the NHIRD and NHIS

Despite a dramatic increase in the publication of scientific papers using the NHIRD, the validity of NHIRD is still under-investigated. One study conducted a medical chart review of 372 stroke patients and found that the accuracy of the discharge diagnosis for ischemic stroke in the NHIRD was high; in particular, the positive predictive value was 97.9% [21]. Another study reviewed 354 randomly selected medical charts and found that the accuracy of the discharge diagnoses of acute coronary syndrome was 100% [22]. In addition, Lin et al. looked at patient self-report on a mailed-in questionnaire and found that the rate of agreement with a diabetes diagnosis in either ambulatory or hospital claims was 74.6% [8]. To date, only one study validated the self-reported health services utilization of the NHIS by using the claims records as reference standards [6]. Our study explored the concordance between the NHIS and NHIRD and provided important information regarding the validity of both data sources. Further validation studies of the NHIRD and NHIS should focus on specific conditions and develop optimal definitions based on the nature of the clinical conditions being assessed and the aims of the study.

Comparison of the concordance of self-reports and claims records in other claims data

We found that the concordance of the NHIRD was comparable to that of other claims databases throughout the world. One study explored the concordance of hypertension diagnoses between a patient survey and an insurance claims database in the U.S. and found that the proportion of agreement was 0.96 [23]. In addition, the kappa statistic for hypertension between the Manitoba Heart Health Survey and physician service claims files was 0.56 [18]. These findings are similar to our findings for hypertension (the total agreement was 0.93; kappa was 0.69). In terms of the concordance of diabetes diagnoses, the kappa statistic between self-reports and claims records was 0.74 in U.S Medicare claims [24] and 0.80 in Medicare Australian data [25]. The kappa statistics of medication use between self-reports and claims data ranged from 0.69 to 0.80, depending on the type of medication use [14], [26], [27].

Limitations and strengths of the study

Several limitations for this study should be considered. First, the date of interview was not available in the 2005 Taiwan NHIS. As such, we used the approximate median date of the survey as the date of interview for all participants. Since the study timeframe could not be more precisely defined. Lack of an exact timeframe would underestimate the degree of concordance and the proportion of agreement, especially for those conditions that are episodic in nature. Second, 25.2% of participants, or those who did consent to link the NHIRD, were not included in this study. Compared with consenters, those excluded from the analysis were slightly elder and less educated, and were less likely to be living in an urban area, which were characteristics that were associated with high disconcordance. [6], [18], [20] Thus, the concordance between self-reports and claims records might be overestimated. Third, since disease-specific factors could provide more information regarding each examined disease, we further examined the effect of obesity on hypertension, diabetes, dyslipidemia, and psychiatric disorders, respectively. However, we did not have additional collected data for assessing disease-specific factors for the other examined diseases. Further investigation on disease-specific factors for these other examined diseases is warranted.

Despite the above-mentioned limitations, this is the first study to pervasively and comprehensively assess the concordance between NHIRD claims records and a nationwide representative survey on clinical diagnosis and medication use. Given that details regarding socio-demographic and lifestyle data in the NHIRD and comprehensive information on medication exposure and procedures in the NHIS were not available, the linkage between the NHIS and NHIRD could complement the strength of each system, and enhance the utilization of both data sources in epidemiology and health services research.

Conclusions

The overall concordance between the claims records and self-report survey results was moderate in clinical diagnoses. However, the concordance varied markedly across different diagnoses. In summary, the concordance in common chronic disease was substantial. The concordance in severe diseases was moderate, which might be attributable to low prevalence. Regarding diseases with episodic nature, as well as composite diseases without clear definitions, the concordance was only fair. In terms of medication use and health system utilization, the concordance was substantial. In terms of research implications, the linkage between the NHIS and NHIRD could enhance the utilization of both datasets.

Supporting Information

Table S1

Claims records for clinical diagnoses and medication use in the NHIRD.

(DOC)

Table S2

Concordance between self-report and claims record, by diagnoses, medication use, and health system utilization among participants aged 12–29 years.

(DOC)

Table S3

Concordance between self-report and claims record, by diagnoses, medication use, and health system utilization among participants aged 30–49 years.

(DOC)

Table S4

Concordance between self-report and claims record, by diagnoses, medication use, and health system utilization among participants aged 50 years and older.

(DOC)

Table S5

Factors associated with disagreement between self-report and claims record in hypertension, diabetes mellitus, dyslipidemia, and psychiatric disorders.

(DOC)

Acknowledgments

Dr. Wu is supported in part by a grant from the National Science Council, Taiwan (PI: Wu, NSC 102-2314-B-418 -002). Dr. Tsai is supported in part by grants from the National Health Research Institutes, Taiwan (PI: Tsai, PH-100-PP-14, PH-102-PP-14, PH-101-SP-14, PH-102-SP-5). We thank Tami R. Bartell at Ann & Robert H. Lurie Children's Hospital of Chicago, Stanley Manne Children's Research Institute for English editing.

This study is based, in part, on data from the National Health Insurance Research Database provided by the Bureau of National Health Insurance of the Department of Health, Taiwan, and managed by the National Health Research Institutes, Taiwan. The interpretation and conclusions contained in this article do not represent those of the Bureau of National Health Insurance, the Department of Health, or the National Health Research Institutes.

Funding Statement

No potential financial conflicts of interest relevant to this study were reported. Dr. Wu is supported in part by a grant from National Science Council, Taiwan (PI: Wu, NSC 102-2314-B-418 -002). Dr. Tsai is supported in part by grants from the National Health Research Institutes, Taiwan (PI: Tsai, PH-100-PP-14, PH-102-PP-14, PH-101-SP-14, PH-102-SP-5). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Data Availability

The authors confirm that, for approved reasons, some access restrictions apply to the data underlying the findings. The data utilized in this study cannot be made available in the manuscript, the supplemental files, or in a public repository due to the “Personal Information Protection Act” executed by Taiwan's government, starting from 2012. Requests for data can be sent as a formal proposal to obtain approval from the ethics review committee of the appropriate governmental department in Taiwan. Specifically, links regarding contact info for which data requests may be sent to are as follows: http://nhird.nhri.org.tw/en/Data_Subsets.html#S3 and http://nhis.nhri.org.tw/point.html.

References

1. Schneeweiss S, Avorn J (2005) A review of uses of health care utilization databases for epidemiologic research on therapeutics. Journal of clinical epidemiology 58:323–337. [Abstract] [Google Scholar]
2. Suissa S, Garbe E (2007) Primer: administrative health databases in observational studies of drug effects–advantages and disadvantages. Nature clinical practice Rheumatology 3:725–732. [Abstract] [Google Scholar]
3. Hennessy S, Bilker WB, Knauss JS, Margolis DJ, Kimmel SE, et al. (2002) Cardiac arrest and ventricular arrhythmia in patients taking antipsychotic drugs: cohort study using administrative data. Bmj 325:1070. [Europe PMC free article] [Abstract] [Google Scholar]
4. Chen YC, Wu JC, Chen TJ, Wetter T (2011) Reduced access to database. A publicly available database accelerates academic production. BMJ 342:d637. [Abstract] [Google Scholar]
5. Tisnado DM, Adams JL, Liu H, Damberg CL, Chen WP, et al. (2006) What is the concordance between the medical record and patient self-report as data sources for ambulatory care? Medical care 44:132–140. [Abstract] [Google Scholar]
6. Yu ST, Chang HY, Lin MC, Lin YH (2009) Agreement between self-reported and health insurance claims on utilization of health care: A population study. Journal of clinical epidemiology 62:1316–1322. [Abstract] [Google Scholar]
7. Maclure M (1991) The case-crossover design: a method for studying transient effects on the risk of acute events. American journal of epidemiology 133:144–153. [Abstract] [Google Scholar]
8. Lin CC, Lai MS, Syu CY, Chang SC, Tseng FY (2005) Accuracy of diabetes diagnosis in health insurance claims data in Taiwan. Journal of the Formosan Medical Association 104:157–163. [Abstract] [Google Scholar]
9. Cicchetti DV, Feinstein AR (1990) High agreement but low kappa: II. Resolving the paradoxes. Journal of clinical epidemiology 43:551–558. [Abstract] [Google Scholar]
10. Chen G, Faris P, Hemmelgarn B, Walker RL, Quan H (2009) Measuring agreement of administrative data with chart data using prevalence unadjusted and adjusted kappa. BMC medical research methodology 9:5. [Europe PMC free article] [Abstract] [Google Scholar]
11. Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174. [Abstract] [Google Scholar]
12. Chang HY, Yeh WT, Chang YH, Tsai KS, Pan WH (2002) Prevalence of dyslipidemia and mean blood lipid values in Taiwan: results from the Nutrition and Health Survey in Taiwan (NAHSIT, 1993–1996). The Chinese journal of physiology 45:187–197. [Abstract] [Google Scholar]
13. Hwu HG, Yeh EK, Chang LY (1989) Prevalence of psychiatric disorders in Taiwan defined by the Chinese Diagnostic Interview Schedule. Acta psychiatrica Scandinavica 79:136–147. [Abstract] [Google Scholar]
14. Kwon A, Bungay KM, Pei Y, Rogers WH, Wilson IB, et al. (2003) Antidepressant use: concordance between self-report and claims records. Medical care 41:368–374. [Abstract] [Google Scholar]
15. Boudreau DM, Daling JR, Malone KE, Gardner JS, Blough DK, et al. (2004) A validation study of patient interview data and pharmacy records for antihypertensive, statin, and antidepressant medication use among older women. American journal of epidemiology 159:308–317. [Abstract] [Google Scholar]
16. Liu TC, Chen CS (2002) An analysis of private health insurance purchasing decisions with national health insurance in Taiwan. Social science & medicine 55:755–774. [Abstract] [Google Scholar]
17. Tu K, Campbell NR, Chen ZL, Cauch-Dudek KJ, McAlister FA (2007) Accuracy of administrative databases in identifying patients with hypertension. Open medicine: a peer-reviewed, independent, open-access journal 1:e18–26. [Europe PMC free article] [Abstract] [Google Scholar]
18. Muhajarine N, Mustard C, Roos LL, Young TK, Gelskey DE (1997) Comparison of survey and physician claims data for detecting hypertension. Journal of clinical epidemiology 50:711–718. [Abstract] [Google Scholar]
19. Rhodes AE, Lin E, Mustard CA (2002) Self-reported use of mental health services versus administrative records: should we care? International journal of methods in psychiatric research 11:125–133. [Europe PMC free article] [Abstract] [Google Scholar]
20. Haapea M, Miettunen J, Lindeman S, Joukamaa M, Koponen H (2010) Agreement between self-reported and pharmacy data on medication use in the Northern Finland 1966 Birth Cohort. International journal of methods in psychiatric research 19:88–96. [Europe PMC free article] [Abstract] [Google Scholar]
21. Cheng CL, Kao YH, Lin SJ, Lee CH, Lai ML (2011) Validation of the National Health Insurance Research Database with ischemic stroke cases in Taiwan. Pharmacoepidemiology and drug safety 20:236–242. [Abstract] [Google Scholar]
22. Wu CY, Chan FK, Wu MS, Kuo KN, Wang CB, et al. (2010) Histamine2-receptor antagonists are an alternative to proton pump inhibitor in patients receiving clopidogrel. Gastroenterology 139:1165–1171. [Abstract] [Google Scholar]
23. Quam L, Ellis LB, Venus P, Clouse J, Taylor CG, et al. (1993) Using claims data for epidemiologic research. The concordance of claims-based criteria with the medical record and patient survey for identifying a hypertensive population. Medical care 31:498–507. [Abstract] [Google Scholar]
24. Sakshaug JW, Weir DR, Nicholas LH (2014) Identifying diabetics in Medicare claims and survey data: implications for health services research. BMC health services research 14:150. [Europe PMC free article] [Abstract] [Google Scholar]
25. Comino EJ, Tran DT, Haas M, Flack J, Jalaludin B, et al. (2013) Validating self-report of diabetes use by participants in the 45 and Up Study: a record linkage study. BMC health services research 13:481. [Europe PMC free article] [Abstract] [Google Scholar]
26. Curtis JR, Westfall AO, Allison J, Freeman A, Kovac SH, et al. (2006) Agreement and validity of pharmacy data versus self-report for use of osteoporosis medications among chronic glucocorticoid users. Pharmacoepidemiology and drug safety 15:710–718. [Abstract] [Google Scholar]
27. Pit SW, Byles JE, Cockburn J (2008) Accuracy of telephone self-report of drug use in older people and agreement with pharmaceutical claims data. Drugs & aging 25:71–80. [Abstract] [Google Scholar]

Articles from PLOS ONE are provided here courtesy of PLOS

Citations & impact 


Impact metrics

Jump to Citations

Citations of article over time

Article citations


Go to all (146) article citations

Data 


Data behind the article

This data has been text mined from the article, or deposited into data resources.