I. Field of the Disclosure
The present disclosure relates to computer adaptive testing. More specifically, but not exclusively, the present disclosure relates to methods for improving ability estimation accuracy and item selection efficiency in a Computer Adaptive Test (CAT).
II. Description of the Prior Art
In educational assessments, a test battery is usually composed of several related sections based on content categories. The inter-relationships among different sections can be used to improve the test efficiency in a Computer Adaptive Test (CAT) scenario.
Therefore, a primary object, feature, or advantage of the present disclosure is to use an examinee's ability estimate from an earlier section to inform the selection of the initial item and the subsequent items in a later section.
To date, a number of studies have been conducted on selecting the initial item based on an examinee's scores from earlier related tests using a variety of methods. However, none of the methods used in these studies has provided satisfactory results. Furthermore, using prior ability estimates to set the starting point for a subsequent test section has not been widely practiced.
Therefore, it is another object, feature, or advantage of the present disclosure to provide a method for using the ability estimates from a previous test or test section(s) as prior information to inform the selection of the initial item of a subsequent test or section of a test.
Another object, feature, or advantage of the present disclosure is to improve methods for item selection and ability estimation in a CAT for accurately measuring an examinee's ability.
A still further object, feature, or advantage of the present disclosure is to provide methods for initial item selection based on ability estimates from a previous section.
One or more of these and/or other objects, features or advantages of the present disclosure will become apparent from the specification and claims that follow.
The present disclosure improves test efficiency and accuracy in a Computer Adaptive Test (CAT).
One exemplary method is for test item selection. One example of such a method includes a computer and a computer implemented test battery having a plurality of test sections. Each test section has a set of test items selected from a plurality of test items. An item selection process is also provided. At least one section from the plurality of test sections is administered to an examinee using the computer. The computer is configured to receive the examinee's responses to the set of test items in the one section. The item selection process for at least one subsequent test section is informed by scores from the one test section or previous test sections.
According to another aspect, a method for test item selection is provided. A computer and computer implemented test battery are included along with at least two or more test sections having a plurality of test items. One test section of the at least two or more test sections is administered to an examinee using the computer. The computer is adapted to receive the examinee's responses to a set of the plurality of test items for the one test section. An initial ability estimate for the examinee's responses to the set of the plurality of test items in the one test section is calculated. One or more test items from the plurality of test items are selected to include in a subsequent test section to the one test section of the at least two or more test sections based upon the initial ability estimate from at least the one previous test section.
According to still another aspect, a method for test item selection is provided. The item selection method includes a computer and a computer implemented test battery having at least two or more test sections with a plurality of test items. One section of the at least two or more test sections is administered to an examinee using the computer. The computer receives the examinee's responses to a set of the plurality of test items for the one test section. An initial ability estimate is calculated for the examinee's responses to the set of the plurality of test items in the one test section. The plurality of test items is reduced to a subset of test items based upon the initial ability estimate from at least the one previous test section. A next test item is selected from the subset of test items for the subsequent test section.
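By way of a non-limiting sketch, the reduction of the item pool to a subset and the selection of a next item from that subset might be expressed as follows. The example assumes a two-parameter logistic (2PL) item response model, a subset defined as the `k` items most informative at the initial ability estimate, and a random draw from that subset. The model, the function names `narrow_pool` and `pick_first_item`, and the item parameters are hypothetical and are chosen only for illustration.

```python
import math
import random


def information(theta, a, b):
    """Fisher information of one item at ability theta (2PL model assumed)."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)


def narrow_pool(theta0, pool, k=5):
    """Return the ids of the k items most informative at theta0.

    pool maps a hypothetical item id to its (discrimination, difficulty) pair.
    """
    ranked = sorted(
        pool,
        key=lambda item_id: information(theta0, *pool[item_id]),
        reverse=True,
    )
    return ranked[:k]


def pick_first_item(theta0, pool, k=5, rng=None):
    """Draw the first item of the later section at random from the subset."""
    if rng is None:
        rng = random.Random()
    return rng.choice(narrow_pool(theta0, pool, k))


# Illustrative pool: equal discrimination, difficulties from -2 to 2.
example_pool = {
    "b-2": (1.0, -2.0),
    "b-1": (1.0, -1.0),
    "b0": (1.0, 0.0),
    "b1": (1.0, 1.0),
    "b2": (1.0, 2.0),
}
# Initial ability estimate of 1.2 carried over from an earlier section.
subset = narrow_pool(1.2, example_pool, k=2)
first_item = pick_first_item(1.2, example_pool, k=2, rng=random.Random(0))
```

Drawing at random from a small informative subset, rather than always taking the single most informative item, is one possible way to vary the first item across examinees who share similar initial ability estimates.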
Illustrated embodiments of the present disclosure are described in detail below with reference to the attached drawing figures, which are incorporated by reference herein, and where:
The present disclosure provides for various computer adaptive testing methods. One exemplary method includes a method for item selection and ability estimation in a computer adaptive test. The accuracy and efficiency of a computer adaptive test are improved by making the test more succinct and/or accurate, and thus more effective. What results is a testing platform using a computer adaptive test that can estimate an examinee's ability using the examinee's response information relating to a specific set of operational test items.
In educational assessments, a test battery is usually composed of several related sections based on content categories. The inter-relationships among different sections can be used to improve the test efficiency in a computer adaptive testing (CAT) scenario. For example, an examinee's ability estimate from an earlier section can inform the selection of an initial item and the subsequent items in a later section. The methods used for item selection and ability estimation in CAT are essential for accurately measuring an examinee's ability.
a. Illustrative Embodiments for Item Selection in a CAT for a Test Battery
Using, for example, a test script administration process or application, a section or item selector is used to select a first test section from a test battery. The selected test section may be displayed using one of the aforementioned interface pieces or other like electronic device, whereby, for example, a plurality of test items from a first selected section are displayed. Through a workstation, computer network, or other like electronic device, examinees' input, answers, or responses are received in response to the plurality of test items from a first selected section of a test battery being presented using a test script administration process or application. Operably configured on a workstation, such as the one illustrated in
One comparative method selects a first item of a later section randomly from a pool of items most informative near the initial fixed theta value and then uses one or more estimation methods to estimate an examinee's interim ability. For example, the maximum likelihood (ML) estimator can be used to estimate an examinee's interim ability upon responding to each item, which is then used for selecting the next items in subsequent sections of the test battery. Other exemplary methods herein use a Bayesian approach. The ability estimates from the first section may be used as prior information to select the first item of a subsequent or second section in a test battery. The expected a posteriori (EAP) estimation method can also be used at every step to obtain an examinee's interim ability, which may then be used for selecting subsequent items in subsequent sections of a test battery.
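As a minimal sketch of the Bayesian approach described above, the following example computes an EAP interim ability estimate and then selects the next item. It assumes a two-parameter logistic (2PL) item response model, a normal prior centered on the ability estimate carried over from the earlier section, a fixed quadrature grid from -4 to 4, and maximum-information item selection. These choices, the function names `eap_estimate` and `select_most_informative`, and the item parameters are illustrative assumptions and are not prescribed by the disclosure.

```python
import math


def p_correct(theta, a, b):
    """Probability of a correct response under the assumed 2PL model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of one 2PL item at ability theta."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)


def eap_estimate(responses, prior_mean=0.0, prior_sd=1.0, n_points=81):
    """Posterior mean of theta on a fixed grid.

    responses is a list of (a, b, u) tuples, where u is 1 for a correct
    answer and 0 otherwise. prior_mean would come from the earlier section.
    """
    grid = [-4.0 + 8.0 * k / (n_points - 1) for k in range(n_points)]
    numerator = 0.0
    denominator = 0.0
    for theta in grid:
        # Unnormalized normal prior density at this grid point.
        weight = math.exp(-0.5 * ((theta - prior_mean) / prior_sd) ** 2)
        # Multiply in the likelihood of each observed response.
        for a, b, u in responses:
            p = p_correct(theta, a, b)
            weight *= p if u == 1 else (1.0 - p)
        numerator += theta * weight
        denominator += weight
    return numerator / denominator


def select_most_informative(theta, pool, administered):
    """Pick the unused item with the highest information at theta."""
    candidates = [item_id for item_id in pool if item_id not in administered]
    return max(candidates, key=lambda item_id: item_information(theta, *pool[item_id]))


# Illustrative use: the earlier section suggested an ability near 1.0.
example_pool = {"easy": (1.0, -2.0), "mid": (1.0, 0.0), "hard": (1.0, 1.2)}
first_item = select_most_informative(1.0, example_pool, set())
# After a correct answer to that item, update the interim estimate.
a, b = example_pool[first_item]
interim_theta = eap_estimate([(a, b, 1)], prior_mean=1.0, prior_sd=1.0)
```

With no responses yet, the estimate stays close to the prior mean, so the first item of the later section is driven by the earlier section's estimate; each additional response then pulls the interim estimate toward the examinee's observed performance.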
b. Illustrative Application(s)
In practice, existing CAT programs involving multiple test sections usually use the same initial ability for all examinees without considering the interrelatedness between the test sections. This practice is limited because it involves arbitrary choices of ability values and items for each examinee. If the initial ability estimate is far from the true ability of the examinee, the CAT procedure will have an inaccurate start. For example, starting the test with a mid-level difficulty item for a high- or low-proficiency examinee means the procedure takes longer to arrive at an accurate estimate of the examinee's ability. This, to a large degree, affects the efficiency of item selection. Moreover, initialization at the same ability estimate for all examinees leads to first items in the test that are always chosen from the same subset in the pool. Hence, these items are overexposed (Van der Linden & Pashley, 2010).
In the present disclosure, using an initial ability estimate predicted by each examinee's performance on earlier related test section(s) provides an individualized initialization for adaptive tests. That is, different examinees will start at different initial ability estimates. Further, using an initial theta predicted by each examinee's performance on earlier section(s) provides an individualized prior that is located near each examinee's true ability. The use of a predicted initial theta and an individualized prior that is continuously improved during the test using additional information obtained from the individual examinee will improve item selection and speed up convergence of the ability estimates. Finally, the present invention helps improve item exposure. The empirical initialization of the test leads to a variable entry point to the pool, and hence offers a more even exposure of its items (Van der Linden & Pashley, 2010). The item selection and ability estimation procedure proposed in the present invention can be integrated with the current algorithm for delivering the CAT tests.
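One possible way to obtain such a predicted initial theta and individualized prior is sketched below. The example assumes that abilities on the earlier and later sections follow a bivariate normal distribution with a known correlation `rho`, so that the prior for the later section is the conditional distribution given the earlier estimate. This regression-based prediction, the function name `individualized_prior`, and the numeric values are assumptions made for illustration; the disclosure does not limit the prediction to this form.

```python
import math


def individualized_prior(theta_prev, rho,
                         mean_prev=0.0, sd_prev=1.0,
                         mean_next=0.0, sd_next=1.0):
    """Prior mean and SD for the later section given the earlier estimate.

    Assumes bivariate normal section abilities with correlation rho.
    The measurement error of theta_prev is ignored in this sketch.
    """
    if not -1.0 < rho < 1.0:
        raise ValueError("rho must lie strictly between -1 and 1")
    # Regression of the later-section ability on the earlier estimate.
    prior_mean = mean_next + rho * (sd_next / sd_prev) * (theta_prev - mean_prev)
    # Conditional spread shrinks as the sections become more related.
    prior_sd = sd_next * math.sqrt(1.0 - rho ** 2)
    return prior_mean, prior_sd


# An examinee estimated at 1.5 on the earlier section, with sections
# correlated at 0.8, starts the later section near 1.2 rather than at 0.
start_theta, start_sd = individualized_prior(1.5, 0.8)  # roughly (1.2, 0.6)
```

Under this assumption, examinees with different earlier estimates enter the pool at different points, and a stronger correlation between sections yields a tighter prior around the predicted starting ability.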
II. Other Embodiments and Variations
The present disclosure is not to be limited to the particular embodiments described herein. In particular, the present disclosure contemplates numerous variations in the ways in which embodiments of the disclosure may be applied to computer adaptive testing. The foregoing description has been presented for purposes of illustration and description. It is not intended to be an exhaustive list or to limit any of the disclosure to the precise forms disclosed. It is contemplated that other alternatives or exemplary aspects are included in the disclosure. The description provides merely examples of embodiments, processes, or methods of the invention. For example, the methods used to select an initial item in a subsequent section based upon ability estimates from a previous section are not limited to those disclosed herein. It is understood that any other modifications, substitutions, and/or additions may be made, which are within the intended spirit and scope of the disclosure. From the foregoing, it can be seen that the disclosure accomplishes at least all of the intended objectives.
This application claims priority under 35 U.S.C. §119 to provisional application Ser. No. 61/912,774, filed Dec. 6, 2013, which is hereby incorporated in its entirety.
| Number | Date | Country |
|---|---|---|
| 61912774 | Dec 2013 | US |