Published on in Vol 4, No 1 (2016): Jan-Mar

Improving Inpatient Surveys: Web-Based Computer Adaptive Testing Accessed via Mobile Phone QR Codes

Improving Inpatient Surveys: Web-Based Computer Adaptive Testing Accessed via Mobile Phone QR Codes

Improving Inpatient Surveys: Web-Based Computer Adaptive Testing Accessed via Mobile Phone QR Codes

Authors of this article:

Tsair-Wei Chien1, 2 Author Orcid Image ;   Weir-Sen Lin2 Author Orcid Image

Original Paper

1Chi Mei Medical Center, Taiwan, Research Department, Chi Mei Medical Center, Taiwan, Tainan, Taiwan

2Chia-Nan University of Pharmacy and Science, Department of Hospital and Health Care Administration, Chia-Nan University of Pharmacy and Science, Tainan, Taiwan

Corresponding Author:

Weir-Sen Lin, MPH, PhD

Chia-Nan University of Pharmacy and Science

Department of Hospital and Health Care Administration

Number 60, Section 1, Erren Road, Rende District

Tainan City 71710

Taiwan (R.O.C.)

Phone: 886 +886 6 2664911

Fax:886 +886 6 2664911

Email: weirsen@mail.cnu.edu.tw


Background: The National Health Service (NHS) 70-item inpatient questionnaire surveys inpatients on their perceptions of their hospitalization experience. However, it imposes more burden on the patient than other similar surveys. The literature shows that computerized adaptive testing (CAT) based on item response theory can help shorten the item length of a questionnaire without compromising its precision.

Objective: Our aim was to investigate whether CAT can be (1) efficient with item reduction and (2) used with quick response (QR) codes scanned by mobile phones.

Methods: After downloading the 2008 inpatient survey data from the Picker Institute Europe website and analyzing the difficulties of this 70-item questionnaire, we used an author-made Excel program using the Rasch partial credit model to simulate 1000 patients’ true scores followed by a standard normal distribution. The CAT was compared to two other scenarios of answering all items (AAI) and the randomized selection method (RSM), as we investigated item length (efficiency) and measurement accuracy. The author-made Web-based CAT program for gathering patient feedback was effectively accessed from mobile phones by scanning the QR code.

Results: We found that the CAT can be more efficient for patients answering questions (ie, fewer items to respond to) than either AAI or RSM without compromising its measurement accuracy. A Web-based CAT inpatient survey accessed by scanning a QR code on a mobile phone was viable for gathering inpatient satisfaction responses.

Conclusions: With advances in technology, patients can now be offered alternatives for providing feedback about hospitalization satisfaction. This Web-based CAT is a possible option in health care settings for reducing the number of survey items, as well as offering an innovative QR code access.

JMIR Med Inform 2016;4(1):e8

doi:10.2196/medinform.4313

Keywords



Patient reports are central to the evaluation of medical care, both in terms of treatment outcomes (ie, patient-reported outcomes and in terms of experiences of quality of care (ie, patient-reported experience measures) [1]. A quality standard for patient experience in the United Kingdom’s National Health Service (NHS) has been developed by the National Institute for Health and Care Excellence [2,3]. The UK National Adult Inpatient Survey—UK NHS 70-item questionnaire—has been in use in Great Britain since 2002, gathering data from over 620,000 patients every year [4].

The first public reporting of the US equivalent of such surveys, the Hospital Consumer Assessment of Healthcare Provider and Systems (HCAHPS), began in 2008. HCAHPS collects more than 3.0 million completed surveys from 3912 hospitals every year. On average, more than 28,000 patients are surveyed every day about their recent hospital experience, and more than 8400 patients (approximately 30% response rate) complete the HCAHPS inpatient survey every day [5].

Research Questions

A concern reported in the literature is the burden on patients of answering all survey questions at one time [6-9]. More than 3.6 million patients completed mail-in surveys from the UK NHS (600,000 patients). The US survey (3.0 million respondents) required approximately 6 hours of time per month and cost approximately US $70 per month [3] to examine people’s health service experiences [10]. However, the UK NHS 70-item questionnaire is significantly longer than the US HCAHPS 25-item survey [11,12]. To reduce patient burden, it is first necessary to shorten the item length of the UK NHS inpatient questionnaire to increase response rates without compromising its assessment reliability [2,13].

Many studies [6-9] have reported that item response theory (IRT)-based computer adaptive testing (CAT) has the advantages of both long-form and short-form questionnaires [14-16] in precision and efficiency. Since many patients (or their guardians) already own mobile phones, which they are comfortable using, it makes sense to use them in hospitals and for hospital surveys. At this time, no studies have been published reporting online CAT via mobile phones in medical fields.

However, many skip items (see Multimedia Appendices 1 and 2) exist in the UK NHS 70-item questionnaire, which can be confusing and may perplex researchers on CAT implementation. Thus, our second aim was to tackle the problem of skip items in the UK NHS questionnaire and to implement the online CAT.

Rasch Partial Credit Model Applied to the Item Response Theory–Based Computer Adaptive Testing

Many researchers have contributed to the dichotomous [6] and polytomous [7-9] formats used by CAT. The UK NHS questionnaire comprises items with different categories (eg, 3 and 6 categories for Items 40 and 41; see Multimedia Appendix 1). It is suited for applying the Rasch partial credit model (PCM), that is, items with a different number of responses and with an equal discrimination parameter [17], or the generalized partial credit model, that is, items with a different number of responses and with unequal discrimination parameters [18], if those items form a unidimensional construct. None was jointly available for a comparison of precision and efficiency differences of CAT estimation with the aforementioned methods commonly used in literature, such as PCM, answering all items (AAI), and the randomized selection method (RSM).

Further, as mobile phones have become ubiquitous in the health care setting [19], it is important to offer an alternative online Rasch PCM-CAT assessment to gather hospitalization experience feedback from patients. We propose access to the questionnaire using a quick response (QR) code via mobile phone.

Aims of this Study

The aims of the current study were to investigate whether CAT can (1) be efficient with item reduction and (2) be used with QR codes used for mobile phones.


Study Data

The UK NHS 70-item questionnaire regarding patient experience was downloaded from the NHS official website [11]. The item and its threshold difficulties (lower summation scores for an item imply that it was more difficult for examinees to respond) were roughly determined by hand computation according to the key findings report for the 2008 inpatient survey [20]. We simulated an interactive metric of 1000 persons (following a normal distribution [~N(0,1)], called true scores) and 70 items (estimated with aforementioned item difficulties) using the Rasch PCM model [17,21]. Nine items originally designed to automatically select different paths were set with different probabilities by the authors. The remaining 61 items were allocated different weighted scores (see Multimedia Appendix 1). A set of 24 items (ie, regarding sections of the ward, doctors, nurses, patient care, and treatment) was extracted from the UK NHS 70-item questionnaire to be the CAT item pool (see Multimedia Appendix 2). We assumed that the set of 24 items is unidimensional based on the report from the previous study paper [13]. Because these 9 conditional selection path items make CAT difficult to design for a computer, they were excluded from the CAT item pool. Multimedia Appendix 3 shows the file layout and fields we designed for use with the datasets.

Unidimensionality

The Rasch model, named after Georg Rasch [22], is a psychometric model for analyzing categorical data as a mathematical function of the trade-off interaction between (1) the respondent’s latent trait (eg, hospitalization perception level in this study) and (2) the item difficulties. The dichotomous Rasch model and its extensions (eg, family models: rating scale model [23], PCM [17]) are successfully used in other areas, including the health profession [24] and market research [25], because of their general applicability [26].

The study data need to meet the following criteria to fit the Rasch model: the infit and outfit mean square errors (MNSQ) of all items are ˂1.5 for unidimensionality and ˃0.5 for local independence [27]. Simulation data were generated fitting to the Rasch PCM model [21].

Task 1: Investigating Computer Adaptive Testing Efficiency and Accuracy

Three scenarios were designed to compare their efficiency and accuracy on the UK NHS 70-item questionnaire: (1) the AAI (answering all items on those 24 items), (2) the RSM (randomized selection method to draw 12 items), and (3) the CAT (at least 5 items and stop at person reliability of 0.80) responding to the 24-item pool.

We applied CAT stop rules, such as when person reliability reaches 0.80 (=[1 − SEMpi]×[1 − SEMpi], where SEMpi=person standard error of measurement on item i=1/variancepi=1/informationpi), and when the last 5 average consecutive person estimation change is less than 0.05 after the minimum necessary completed number of items is ≥5.

In addition, we ran an author-made VBA (Visual Basic for Applications) module in Microsoft Excel to conduct a simulation study (see Multimedia Appendix 4). Another Web-based CAT was programmed for use on mobile phones. The maximum likelihood estimation algorithm [28] (see Multimedia Appendix 4) was used to (1) estimate person measures on the three scenarios, (2) compute correlation coefficients between estimated person measures among the three scenarios and the original true scores to verify CAT accuracy, and (3) analyze CAT efficiency of item length shortened by CAT compared with the other two scenarios (ie, AAI and RSM) (see Figure 1).

Figure 1. Study flowchart.
View this figure

Task 2: An Online Assessment Using Mobile Phones

The Conditional Path Skip Items Designed on UK NHS-70

The path skip item was automatically redirected to the next according to the respective route designed in the field of the item dataset (see Multimedia Appendix 3). To illustrate Item 39 in Multimedia Appendix 1, two route fields were filled with Items 40 and 41 in response to the respective answer code (eg, 1 or 2). In contrast, the route fields for those ordinary non-skip items were kept empty (or a null value).

An Online Computer Adaptive Testing Routine for Gathering Feedback From Patients

An online routine was designed for patients to report their perceptions of their inpatient hospitalization experience. The UK NHS 70-item questionnaire (see Multimedia Appendix 2) was uploaded to website. The first CAT item is randomly selected from the item pool (ie, Items 15-38) after Item 14 is answered. The next item to be answered is the item with the maximal variance in the remaining items according to the provisional person ability [7,29]. Multimedia Appendix 5 shows details on the item selection rules and the Excel VBA codes for the conditional path items. All the responses are then automatically saved on the study website (see Multimedia Appendix 3).


Items Fit to the Rasch Model

The set of 24 items (see Table 1) was taken as unidimensional due to simulation data fitting the Rasch model’s requirement with values of infit and outfit MNSQ between 0.5 and 1.5 [21]. Each item has its own threshold difficulties (see Multimedia Appendix 4).

Table 1. The 24 items selected from the UK NHS 70-item questionnaire.
No.ItemThresholdaDifficultybIn MNSQOut MNSQ
15Were you ever bothered by noise at night from other patients?12.170.961.00
16Were you ever bothered by noise at night from hospital staff?11.841.031.11
17In your opinion, how clean was the hospital room or ward that you were in?3-0.560.970.98
18How clean were the toilets and bathrooms that you used in the hospital?3-2.200.970.98
19Did you feel threatened during your stay in the hospital by other patients or visitors?11.880.970.98
20Were hand-wash gels available for patients and visitors to use?21.331.021.02
21How would you rate the hospital food?3-1.720.961.02
22Were you offered a choice of food?2-0.040.980.98
23Did you get enough help from staff to eat your meals?20.981.021.01
24When you had important questions to ask a doctor, did you get answers that you could understand?2-1.011.031.01
25Did you have confidence and trust in the doctors treating you?2-0.861.061.07
26Did doctors talk in front of you as if you weren’t there?2-1.150.961.00
27When you had important questions to ask a nurse, did you get answers that you could understand?21.480.990.99
28Did you have confidence and trust in the nurses treating you?21.261.011.01
29Did nurses talk in front of you as if you weren’t there?2-1.461.041.05
30In your opinion, were there enough nurses on duty to care for you in the hospital?2-1.400.940.92
31Sometimes in a hospital, a member of staff will say one thing and another will say something quite different. Did this happen to you?2-1.220.960.95
32Were you involved as much as you wanted to be in decisions about your care and treatment?20.561.021.02
33Did you have confidence in the decisions made about your condition or treatment?2-0.780.960.96
34How much information about your condition or treatment was given to you?21.481.041.04
35Did you find someone on the hospital staff to talk to about your worries and fears?2-0.771.041.04
36Do you feel you got enough emotional support from hospital staff during your stay?2-0.750.960.95
37Were you given enough privacy when discussing your condition or treatment?21.580.980.98
38Were you given enough privacy when being examined or treated?2-0.651.011.02

aThreshold denotes the number of categories on each item, for example, 2 for three categories and 1 for two categories.

bDifficulty represents item difficulty in a unit of logit (=log odds).

Task 1: Investigating Computer Adaptive Testing Efficiency and Accuracy

Table 2 indicates that the CAT relates the true scores (r=.97 in column 2) and the AAI (r=.97 in columns 3 and 4) to a high association, indicating that the CAT earns an equivalent accuracy compared to the AAI and a higher accuracy than the RSM (in column 5 of the estimation section). The summation scores have a higher correlation (r=.98) to the within counterparts (eg, summation RSM scores vs estimated RSM logit scores) and a slightly lower correlation (r=.92-.97) to the between counterparts (eg, summation RSM scores vs estimated AAI or CAT logit scores), implying that the raw summation scores have a high correlation (r=.98) with the estimated logit scores shown in the last 4 columns of Table 2. The bottom row of Table 2 shows that the CAT earns the shortest item length, indicating the CAT has advantages in efficiency over AAI and RSM.

Table 2. Comparisons of efficiency and accuracy among the AAI, RSM, and CAT.

Estimated logit scoresSummation scores

True scoreCATAAIRSMRSMAAI
True score0.970.960.910.920.95
Estimation

CAT0.97[0.98]0.930.940.97

AAI0.96[0.98]0.950.950.98

RSM0.910.930.95[0.98]0.94
Summation

RSM0.920.940.95[0.98]0.96

AAI0.950.97[0.98]0.940.96

Mean0.030.01-0.06-0.054.644.59

Standard deviation1.021.070.930.741.191.44
Item lengtha
40.5057.3145.4245.4257.31

aItem length denotes those items, excluding the 9 automatic-selection-to-different-path items.

Task 2: Online Computer Adaptive Testing Assessment

By scanning the QR code (see Figure 2), the CAT icon appears on the patient’s mobile phone. The mobile CAT survey procedure was demonstrated item-by-item in action (see Figure 3). Person fit (ie, infit and outfit MNSQ) statistics showed the respondent behaviors. Person theta is the provisional ability estimated by the CAT module.

The standard error in Figure 3 was generated by the following formula (see Multimedia Appendix 5): 1/√(Σ variance(i)), where i refers to the CAT finished items responded to by a person [30]. In addition, the residual (resi) in Figure 3 was the average of the last five change differences between the pre-and post-estimated abilities on each CAT step. CAT will stop if the residual value is less than 0.05. “Corr” refers to the correlation coefficient between the CAT estimated measures and its step series numbers using the last five estimated theta (=person measure) values. The flatter the theta trend, the higher the probability that the person measure is convergent with a final estimation.

Figure 2. A snapshot of a QR code and the CAT item.
View this figure
Figure 3. The process of CAT estimated scores.
View this figure

Principal Findings

We verified that computer adaptive testing can be (1) efficient with questionnaire item reduction and (2) used with QR codes on mobile phones.

The CAT item pool was designed using Items 15-38 of the UK NHS 70-item questionnaire. We found that CAT can be more efficient for answering questions than both AAI and RSM without compromising its measurement accuracy, which is consistent with previous studies [6-9]. Our online CAT inpatient survey for gathering satisfaction responses from patients was accessed by scanning a QR code on a mobile phone, which has never been demonstrated previously.

Many studies have discussed patient perceptions about hospitals and the benefit of listening to other patient experiences when choosing a hospital [31,32]. There has been a rapid increase in the number of websites that allow patients to rate their hospitals [33,34]. Almost all health care providers have been explicitly required to conduct surveys of their patients’ health care experiences. However, those surveys often use an individual item-by-item approach to disclose patient views on hospital service quality, which does not provide hospital staff with information to make further improvements without considering the overall hospital performance [13].

Implications and Future Considerations

We demonstrated that an NHS inpatient experience questionnaire with shortened items can be used with an IRT-based CAT technique without compromising its measurement accuracy. Using a CAT approach with such complex question structure jointly with item pools and conditional path skip items is rare. Our online CAT module used by scanning a QR code on a mobile phone can be extended to many dimensions simultaneously in a survey. For example, the Clinical Dementia Rating scale [35] used in patients with dementia consists of six domains. We could design a module using CAT through several procedures in a common questionnaire in the future.

Strengths

Hospital staff must consider both the efficiency and utility of assessment for the selection of the CAT items [36]. The traditional survey collects all feedback from patients through particular sets of questions to assess what causes patient difficulty or dissatisfaction. The assessment results help hospital managers determine where improvements can be made [36]. We can use the Rasch simulation technique to overcome the problem in questionnaires of unanswered items (ie, which do not provide hospital staff with information to make further improvement). This Rasch simulation technique [21] can be used to fill in the expected responses to those unanswered CAT items according to the final person theta (ability) and the specified item difficulties. Thus, the CAT can provide efficient assessments and the full information needed to make improvements.

Furthermore, the person outfit mean square in CAT is also saved in our database (see Multimedia Appendix 3). An outfit mean square of 2.0 or greater for a patient indicates a possibly aberrant response pattern [37], such as cheating, careless responding, lucky guessing, creative responding, or random responding [38], which makes it hard to reveal valuable information using the traditional survey method.

Limitations

Six limitations of this study are addressed. First, the study was based on the assumption of unidimensionality across those 24 CAT items. Although several articles have supported the notion that the UK NHS 70-item questionnaire can construct a one-dimension domain [13,31], those items cannot be generalized to the 24 CAT items used in different countries or by different groups. Future studies should further verify those 24 items to make the CAT module valid and feasible in health care practice.

Second, the first CAT item was selected from a randomized item pool. The CAT selection rule for the first item can be redesigned referring to the previously completed items and inferring a provisional theta (ie, person measure) to select the first item with the maximum information (ie, variance) in the item pool so that the questionnaire length could be shorter (see Multimedia Appendix 6).

Third, only one CAT module was designed in the NHS inpatient questionnaire due to the conditional selection path skip items that existed in non-CAT items. Future studies are recommended to overcome this barrier and to design a CAT-by-CAT approach in the questionnaire so as to reduce more item length in a questionnaire.

Fourth, we have not discussed the issue of participation options using traditional postal mail or email. Because not all patients possess a mobile phone, specifically a smartphone, and 3G/4G WiFi communication, all options (mail or email) must be offered to patients when invited to participate in the survey. Readers have found the email option useful to answer questions either by connecting to the Web, or by scanning a QR code on a mobile phone (eg, Figure 2) if applying the CAT demonstrated in this study. Future studies are needed to further explore and improve the processes of the CAT survey.

Fifth, we conducted a simulation of 1000 patients’ true scores followed by a standard normal distribution. This might contradict the general experience of satisfaction surveys with ceiling effects that impede standard normal distribution. Future studies are needed to sample from a negatively skewed population to further verify whether the CAT can be efficient on item reduction over AAI and RSM.

Sixth, the unidimensionality of the 24 CAT items may be questioned given the different realms of hospital ward, doctors, nurses, patient care, and treatment. It might be implausible to assume that these 24 items are unidimensional. Future studies are required to further investigate the issue.

Conclusion

With advances in technology, we can now offer patients alternative ways via mobile phones to gather their feedback on hospitalization satisfaction. The online CAT can reduce the number of survey items for patients to respond to, as well as be accessed via mobile phone using a QR code.

Authors' Contributions

All authors have read and approved the final manuscript. TW developed the study concept and design. TW and WP analyzed and interpreted the data. TW drafted the manuscript, and all authors have provided critical revisions for important intellectual content. The study was supervised by TW.

Conflicts of Interest

None declared.

Multimedia Appendix 1

A format of skip items in a questionnaire.

PDF File (Adobe PDF File), 224KB

Multimedia Appendix 2

The UK NHS 70-item questionnaire.

PDF File (Adobe PDF File), 269KB

Multimedia Appendix 3

The study item dataset.

XLS File (Microsoft Excel File), 14MB

Multimedia Appendix 4

Excel VBA module used for CAT simulation.

XLS File (Microsoft Excel File), 7MB

Multimedia Appendix 5

Excel module for simulation on CAT, AAI, and RSM saving to spreadsheet NAT.

TXT File, 32KB

Multimedia Appendix 6

Introduction to CAT.

PDF File (Adobe PDF File), 405KB

  1. Jenkinson C, Coulter A, Reeves R, Bruster S, Richards N. Properties of the Picker Patient Experience questionnaire in a randomized controlled trial of long versus short form survey instruments. J Public Health Med 2003 Sep;25(3):197-201 [FREE Full text] [Medline]
  2. National Clinical Guideline Centre. Patient Experience in Adult NHS Services: Improving the Experience of Care for People Using Adult NHS Services: Patient Experience in Generic Terms. London: Royal College of Physicians; 2012.
  3. Debra DS. The Evidence Center. 2013. Measuring patient experience   URL: http://www.health.org.uk/sites/default/files/MeasuringPatientExperience.pdf [accessed 2016-01-30] [WebCite Cache]
  4. CareQuality Commission. Inpatient survey 2013. 2015.   URL: http://www.cqc.org.uk/content/inpatient-survey-2014 [accessed 2016-01-30] [WebCite Cache]
  5. HCAHPS. Hospital Consumer Assessment of Healthcare Providers and Services. 2015.   URL: http://www.hcahpsonline.org/home.aspx [accessed 2016-01-30] [WebCite Cache]
  6. Chien T, Wu H, Wang W, Castillo RV, Chou W. Reduction in patient burdens with graphical computerized adaptive testing on the ADL scale: tool development and simulation. Health Qual Life Outcomes 2009;7:39 [FREE Full text] [CrossRef] [Medline]
  7. Chien T, Wang W, Huang S, Lai W, Chow JC. A web-based computerized adaptive testing (CAT) to assess patient perception in hospitalization. J Med Internet Res 2011;13(3):e61 [FREE Full text] [CrossRef] [Medline]
  8. Ma S, Chien T, Wang H, Li Y, Yui M. Applying computerized adaptive testing to the Negative Acts Questionnaire-Revised: Rasch analysis of workplace bullying. J Med Internet Res 2014;16(2):e50 [FREE Full text] [CrossRef] [Medline]
  9. De Beurs Derek Paul DP, de Vries AL, de Groot MH, de Keijser J, Kerkhof AJ. Applying computer adaptive testing to optimize online assessment of suicidal behavior: a simulation study. J Med Internet Res 2014;16(9):e207 [FREE Full text] [CrossRef] [Medline]
  10. Lu CY, Roughead E. Determinants of patient-reported medication errors: a comparison among seven countries. Int J Clin Pract 2011 Jul;65(7):733-740. [CrossRef] [Medline]
  11. NHS Survey. 2015. Scored Questionnaire: Inpatients 2013   URL: http://www.nhssurveys.org/survey/1354 [accessed 2016-01-31] [WebCite Cache]
  12. Survey Instruments: view or download the English survey materials. 2015.   URL: http://www.hcahpsonline.org/surveyinstrument.aspx [accessed 2016-01-31] [WebCite Cache]
  13. Jenkinson C, Coulter A, Bruster S. The Picker Patient Experience Questionnaire: development and validation using data from in-patient surveys in five countries. Int J Qual Health Care 2002 Oct;14(5):353-358 [FREE Full text] [Medline]
  14. Eack SM, Singer JB, Greeno CG. Screening for anxiety and depression in community mental health: the beck anxiety and depression inventories. Community Ment Health J 2008 Dec;44(6):465-474. [CrossRef] [Medline]
  15. Ramirez BM, Bostic JQ, Davies D, Rush AJ, Witte B, Hendrickse W, et al. Methods to improve diagnostic accuracy in a community mental health setting. Am J Psychiatry 2000 Oct;157(10):1599-1605. [CrossRef] [Medline]
  16. Shear MK, Greeno C, Kang J, Ludewig D, Frank E, Swartz HA, et al. Diagnosis of nonpsychotic patients in community clinics. Am J Psychiatry 2000 Apr;157(4):581-587. [CrossRef] [Medline]
  17. Masters G. A Rasch model for partial credit scoring. Psychometrika 1982;47:149-174 [FREE Full text]
  18. Muraki E. A generalized partial credit model: Application of an EM algorithm. Appl Psychol Meas 1992;16:159-176.
  19. Tran K, Morra D, Lo V, Quan SD, Abrams H, Wu RC. Medical students and personal smartphones in the clinical environment: the impact on confidentiality of personal health information and professionalism. J Med Internet Res 2014;16(5):e132 [FREE Full text] [CrossRef] [Medline]
  20. NHS. Key findings report: Inpatient Survey Results 2008. 2015.   URL: http://www.nhssurveys.org/survey/738 [accessed 2016-01-31] [WebCite Cache]
  21. Linacre J. How to simulate Rasch data. Rasch Measurement Transactions 2007;21(3):1125 [FREE Full text]
  22. Rasch G. Probabilistic models for some intelligence and attainment tests. Chicago: University of Chicago Press; 1960.
  23. Andrich. Rating scale formulation for ordered response categories. Psychometrika 1978;43:561.
  24. Bezruczko N. Rasch measurement in health sciences. Maple Grove, MN: Jam Press; 2005.
  25. Bechtel G. Generalizing the Rasch model for consumer rating scales. Marketing Science 1985;4(1):62-73.
  26. Wright BD. Solving measurement problems with the Rasch model. Journal of Educational Measurement 1977;14(2):97-116.
  27. Linacre J. WINSTEPS computer program user guide. Chicago, IL   URL: http://www.winsteps.com/index.htm [accessed 2016-01-30] [WebCite Cache]
  28. Birnbaum A, Lord FM, Novick MR, Birnbaum A. Some latent ability models and their use in inferring an examinee's ability. In: Statistical Theories of Mental Test Scores. Reading, MA: Addison-Wesley; 1968.
  29. Embretson S, Reise S. Measuring persons: Scoring examinees with IRT models. In: Item Response Theory for Psychologists. Mahwah, NJ: Erlbaum; 2000:158-186.
  30. Linacre J. Computer-adaptive tests (CAT), standard errors and stopping rules. Rasch Measurement Transactions 2006;20(2):1062 [FREE Full text]
  31. Chien T, Wang W, Wang H, Lin H. Online assessment of patients' views on hospital performances using Rasch model's KIDMAP diagram. BMC Health Serv Res 2009;9:135 [FREE Full text] [CrossRef] [Medline]
  32. Hao H. The development of online doctor reviews in China: an analysis of the largest online doctor review website in China. J Med Internet Res 2015;17(6):e134 [FREE Full text] [CrossRef] [Medline]
  33. Wang OJ, Wang Y, Lichtman JH, Bradley EH, Normand ST, Krumholz HM. “America's Best Hospitals” in the treatment of acute myocardial infarction. Arch Intern Med 2007 Jul 9;167(13):1345-1351. [CrossRef] [Medline]
  34. O'Brien SM, Peterson E. Identifying high-quality hospitals: consult the ratings or flip a coin? Arch Intern Med 2007 Jul 9;167(13):1342-1344. [CrossRef] [Medline]
  35. Hughes CP, Berg L, Danziger WL, Coben LA, Martin RL. A new clinical scale for the staging of dementia. Br J Psychiatry 1982 Jun;140:566-572. [Medline]
  36. Hsueh I, Chen J, Wang C, Hou W, Hsieh C. Development of a computerized adaptive test for assessing activities of daily living in outpatients with stroke. Phys Ther 2013 May;93(5):681-693 [FREE Full text] [CrossRef] [Medline]
  37. Linacre JM. Optimizing rating scale category effectiveness. J Appl Meas 2002;3(1):85-106. [Medline]
  38. Karabatsos G. Comparing the aberrant response detection performance of thirty-six person-fit statistics. Applied Measurement in Education 2003;16(4):277-298.


AAI: answering all items
CAT: computer adaptive testing
HCAHPS: Hospital Consumer Assessment of Healthcare Providers and Services
IRT: item response theory
MNSQ: mean square
NHS: National Health Service
PCM: partial credit model
QR: quick response
RSM: randomized selection method
SE: standard error
SEM: standard error measurement
VBA: Visual Basic for Applications


Edited by G Eysenbach; submitted 03.02.15; peer-reviewed by D de Beurs, S Bidmon; comments to author 26.05.15; revised version received 27.05.15; accepted 24.06.15; published 02.03.16

Copyright

©Tsair-Wei Chien, Weir-Sen Lin. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 02.03.2016.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.