Bibliography on Computerized
Adaptive Testing (CAT)
(including related
literature on sequential testing)
Updated March 26, 2011
Compiled by David J. Weiss,
djweiss@umn.edu
Filed
manuscripts are identified by # followed by a manuscript number
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
#AB95-01 (also see
#SP95-01). Abdel-Fattah, A.. A, Lau,
C.-M. A., & Spray, J. A. (1995, June).
The effect of model misspecification on classification decisions made
using a computerized test: UIRT versus MIRT. Paper presented at the annual
meeting of the Psychometric Society,
Abdel-Fattah, A.. A, Lau,
C.. A., & Spray, J. A. (1996, April).
Effect of altering passing score in CAT when unidimensionality is
violated. Paper presented at the annual
meeting of the American Educational Research Association,
#AB00-01. Abdullah, S. C. & Cooley, R. E. (2000). Using constraints to develop and deliver
adaptive tests. Paper presented at the
Computer-Assisted Testing Conference. {PDF file, 46 KB}
#AC87-13. Ackerman, T. A.
(1987). The use of unidimensional parameter estimates of multidimensional items
in adaptive testing (ACT Research Report series 87-13).
Ackerman, T. A. (1991). The use of unidimensional parameter estimates
of multidimensional items in adaptive testing. Applied Psychological
Measurement, 15, 13-24.
Adams, R. J. (1987). Adaptive
testing, information, and the partial credit model.
Adema, J. J. (1990). The
construction of customized two-staged tests. Journal of Educational
Measurement, 27, 241-253.
Allred, L. A. & Green,
B. F. (1984). Analysis of experimental
CAT ASVAB data.
Almond, R. G. & Mislevy,
R. J. (1999. Graphical models and computerized adaptive
testing. Applied Psychological
Measurement, 23, 223-237. (Also
Educational Testing Service Research Report 98-4).
#AM95-01. American Council on Education. (1995). Guidelines for computer-adaptive test
development and use in education.
Anastasi, A. (1953). An empirical study of the applicability of
sequential analysis to item selection.
Educational and Psychological Measurement, 13, 3-13.
Anderson, D. (ETS). (1999, April). Use of conditional item exposure methodology
for an operational CAT. Paper presented
at the annual meeting of the National Council on Measurement in Education,
Andrich, D. (1995). Review of the book Computerized Adaptive
Testing: A Primer. Psychometrika, 4?,
615-620.
Angoff, W. H. &
Huddleston, E. M. (1958). The
multi-level experiment: A study of a two-level test system for the College
Board Scholastic Aptitude Test.
(Statistical Report 58-21).
Ariel, A., Veldkamp, B. P.,
& van der Linden, W. J. (2002). Constructing
rotating item pools for constrained adaptive testing. Submitted for
publication.
Archer, R.P.,
#AR03-01. Ariel, A., Veldkamp, B., & van der Linden, W. J. (2003, April). Constructing
rotating item pools for constrained adaptive testing. Paper presented at the Annual meeting of the
National Council on Measurement in Education,
Armitage, P. (1950). Sequential analysis with more than two
alternative hypotheses, and its relation to discriminant function
analysis. Journal of the Royal
Statistical Society, 12, 137-144.
Armstrong, R.D. & Edmonds,
J.J. (2003). The assembly of multiple stage adaptive tests with discrete items.
#AR04-01.
Armstrong, R. D. &
Armstrong, R. D. &
Jones, D. H. (1998). Computer adaptive
testing – Approaches for item selection and measurement. (Research report).
Armstrong, R. D., Jones, D.
H., & Berliner, N. (1998,
June). Computerized adaptive testing
with multiple form structures. Paper
presented at the annual meeting of the Psychometric Society,
#AR04147. Armstrong, R.D., Jones, D. H., Koppel, N .B.,
& Pashley, P. J. (2004).
Computerized adaptive testing with multiple-form structures. Applied Psychological Measurement, 28,
147-164.
#AR03-02. Armstrong, R. D. & Little, J.
(2003). The assembly of multiple form
structures. Paper presented at the
annual meeting of the National Council on Measurement in Education,
#AR03-03. Armstrong, R. D. & Roussos, L. (2003).
A method to determine targets for multi-stage adaptive tests. Unpublished manuscript. {PDF file,
207 KB}
Arrowwood, V. E.
(1994). Effects of computerized adaptive
test anxiety on nursing licensure examinations.
Dissertation Abstracts International, A (Humanities and Social
Sciences), 54 (9-A), 3410.
Assessment Systems
Corporation (1984). User’s manual for
the MicroCAT Testing System.
Assessment Systems
Corporation (1988). User’s manual for
the MicroCAT Testing System, Version 3.
Assessment Systems Corporation. (1996). User’s manual for the MicroCAT testing system, Version 3.5.
Assessment Systems
Corporation (2001). The FastTEST Professional Testing System, Version 1.6. [Computer software].
Auger, R. (1989). Étude de
praticabilité du testing adaptatif de maîtrise des apprentissages scolaires au
Québec : une expérimentation en éducation économique secondaire 5. Thèse de
doctorat non publiée. Montréal : Université du Québec à Montréal.
Auger, R. & Séguin, S.P.
(1992). Le testing adaptatif avec interprétation critérielle, une expérience de
praticabilité du TAM pour l’évaluation sommative des apprentissages au Québec. Mesure et évaluation en
éducation, 15-1 et 2, 10
Babcock,
B. & Weiss, D. J. (2009).
Termination criteria in computerized adaptive tests: Variable-length
CATs are not biased. In D. J. Weiss (Ed.), Proceedings
of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF File, 281 KB}
Baek, S. G. (1995).
Computerized adaptive attitude testing using the partial credit model.
Dissertation Abstracts International-A, 55(7-A), 1922 (UMI No. AAM9430378).
Baek, S.G. (1997). Computerized adaptive testing using the
partial credit model for attitude measurement.
In M. Wilson, G. Engelhard Jr., & K. Draney (Eds.), Objective measurement: Theory into practice, volume 4.
Baghi, H.,
Baghi, H., Gabrys, R., &
#BA01191. Ban, J.-C., Hanson. B. H., Wang, T., Ti, Q.,
& Harris, D. J. (2001). A
comparative study of on-line pretest item calibration-scaling methods in
computerized adaptive testing. Journal
of Educational Measurement,38, 191-212. (Also ACT Research Report 2002-11).
See #BA02207. Ban, J., Hanson, B.A., Yi, Q., & Harris,
D. (2001, April). Data sparseness and online pretest
calibration/scaling methods in CAT.
Paper presented at the annual meeting of the American Educational
Research Association, Seattle. (Also ACT Research Report 2002-1)
#BA02207. Ban, J-C., Hanson, B.A., Yi, Q., &
Harris, D. J. (2002). Data sparseness and online pretest item
calibration/scaling methods in CAT.
Journal of Educational Measurement,39, 207-218.
Ban, J.-C., Hanson, B., Wang, T., Yi,
Q. & Harris, D. (2000). A comparative study of online pretest item
calibration/scaling methods in CAT. American
Educational Research Association.
#BA99-01. Ban, J., Wang, T., & Yi, Q. (1999,
June). Comparison of the a-stratified
method, the Sympson-Hetter method, and the matched difficulty method in CAT
administration. Paper presented at the
annual meeting of the Psychometric Society,
#BA00-01. Ban, J. C., Wang, T., Yi, Q., & Harris,
D. J. (2000, April). Effects of nonequivalence of item pools on
ability estimates in CAT. Paper
presented at the annual meeting of the National Council on Measurement in
Education,
Barrada, J. R., Abad, F. J., & Veldkamp, B. P.
(2009). Comparison of methods for controlling maximum exposure rates in
computerized adaptive testing. Psicothema, 21, 313-320. {PDF file, 94 KB}
Barrada. J. R., Mazuela, P., & Olea, J. (2006). Maximum Information Stratification method for controlling item exposure in Computerized Adaptive Testing. Psicothema, 18, 156-159. {PDF file, 57 KB)
Barrada,
J. R., Olea, J., & Abad., F. J. (2008). Rotating item banks versus
restriction of maximum exposure rates in computerized adaptive testing. Spanish
Journal of Psychology, 11, 618-625. {PDF file, 267 KB}
Barrada,
J. R., Olea, J., & Ponsoda, V. (2007). Methods for restricting maximum exposure
rate in computerized adaptive testing. Methodology, 3, 14-23. {PDF file, 399KB}
Barrada, J. R., Olea, J.,
Ponsoda, V., & Abad, F. J. (2008). Incorporating randomness in the Fisher
information for improving item-exposure control in CATs. British Journal of
Mathematical and Statistical Psychology, 61, 493-513.
Barrada, J. R., Olea, J., Ponsoda, V., & Abad, F.
J. (2009). Item selection rules in computerized adaptive testing: Accuracy and
security. Methodology, 5, 7-17. (PDF
file, 445 KB)
Barrada, J., Olea, J., Ponsada,
V., & Abad, F. (2009). Test
overlap rate and item exposure rate as indicators of test security in
CATs. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on
Computerized Adaptive Testing.{PDF
File, 261 KB}
Barrada, J. R., Veldkamp,
B. P., & Olea, J. (2006, July). Multiple maximum exposure rates in computerized
adaptive testing. Paper presented at the SMABS-EAM Conference, Budapest,
Hungary.
Baumer, M., Roded, K.,
& Gafni, N. (2009). Assessing the
equivalence of Internet-based vs. paper-and-pencil psychometric tests. In D. J.
Weiss (Ed.), Proceedings of the 2009 GMAC
Conference on Computerized Adaptive Testing.{PDF File, 142 KB}
Bayliss, M.S., Dewey J.E.,
Dunlap,
Bayroff, A. G. (1964,
November). Feasibility of a programmed
testing machine.
#BA69-01. Bayroff, A. G. (1969, September). Psychometric problems with branching
tests. Paper presented at the annual
meeting of the American Psychological Association.
#BA74-01. Bayroff, A. G., Ross, R. M., & Fischl, M.
A. (1974). Development of a programmed
testing system (Technical Paper 259).
Bayroff, A. G. & Seeley,
L. C. (1967). An exploratory study of branching tests
(Technical Research Note 188).
Bayroff, A. G., Thomas, J.
J., & Anderson, A. A. (1960).
Construction of an experimental sequential item test (Research
Memorandum 60-1).
Bejar,
Bejar,
Bejar,
Bejar,
Bejar,
#BE02-01. Bejar,
#BE03-03. Bejar,
Bejar,
Bejar,
Bejar,
Belov,
Belov, D.I., Armstrong,
R.D. (2008). A Monte Carlo approach to the design, assembly, and evaluation of
multistage adaptive tests. Applied
Psychological Measurement,32,119–137.
Belov,
#BE98-45.
Bennett, R. E., Morley, M., & Quardt, D. (1998). Three response types for broadening the
conception of mathematical problem solving in computerized-adaptive tests
(Research Report 98-45).
Bennett, R. E., Steffen, M.,
Singley, M. K., Morley, M., & Jacquemin, D. (1997). Evaluating an
automatically scorable, open-ended response type for measuring mathematical
reasoning in computer-adaptive tests. Journal
of Educational Measurement, 34, 162-176.
Ben-Porath, Y. S., Waller,
N. G., Slutske, W. S. & Butcher, J. N.
(1988, August). A comparison of
two methods for the adaptive administration of the MMPI-2 content scales. Paper presented at the 86th Annual
Convention of the American Psychological Association,
Ben-Porath, Y. S., Slutske,
W. S., & Butcher, J. N. (1989). A
real-data simulation of computerized adaptive administration of the MMPI. Psychological Assessment: A Journal of Consulting and Clinical
Psychology, 1, 18-22.
Ben-Porath, Y. S. &
Roper, B. L. (1992, May). Computerized
adaptive testing with the MMPI-2:
Reliability, validity, and comparability to paper and pencil
administration. Paper presented at the
27th Annual Symposium on Recent Developments in the MMPI/MMPI-2,
Ben-Porath, Y. S., Roper, B.
L., & Butcher, J. N. (1990, June).
An empirical study of the computer adaptive MMPI-2. Paper presented at the 25th Annual
Symposium on recent developments in the MMPI/MMPI-2,
#BE94141.
Berger, M. P. F. (1994). A general approach to algorithmic design of
fixed-form tests, adaptive tests, and testlets.
Applied Psychological Measurement, 1994, 141-153.
Berger, M. P. F., &
Veerkamp, W. J. J. (1997). Some new item
selection criteria for adaptive testing.
Journal of Educational and Behavioral Statistics, 22, 203-226.
Bergstrom, B. (1992, April). Ability measure equivalence of computer
adaptive and paper and pencil tests: A
research synthesis. Paper presented at the
annual meeting of the American Educational Research Association,
#BE92-01. Bergstrom, B. (1992). Computer adaptive versus paper-and-pencil
tests. Unpublished doctoral
dissertation,
Bergstrom, B. A. (1992).
Confidence in pass/fail decisions for computer adaptive and paper and pencil
examinations. Evaluation and The Health Professions, 15(4),
435-464.
Bergstrom B. A. (1996). Computerized adaptive testing for the national certification examination. AANA.J, 64, 119-24.
Bergstrom, B. A. (1996).
Computerized adaptive testing for the national certification
examination. AANA Journal, 64, 119-24. (American Association of Nurse
Anesthetists)
Bergstrom, B. & Gershon,
R. (1992, April). Comparison of item
targeting strategies for pass/fail adaptive tests. Paper presented at the annual meeting of the
American Educational Research Association,
Bergstrom, B. & Gershon,
R. (1994, April). Computerized adaptive
testing exploring examinee response time using hierarchical linear
modeling. Paper presented at the annual
meeting of the American Educational Research Association,
Bergstrom, B. A
& Gershon, R. C. (1994). Computerized adaptive testing for licensure
and certification. CLEAR Exam Review, Winter 1994, 25-27.
Bergstrom, B. A., &
Bergstrom, B. B., &
Lunz, M. E. (1991, April). Confidence in
pass/fail decisions for computer adaptive and paper and pencil
examinations. Paper presented at the
annual meeting of the American Educational Research Association,
Bergstrom, B. A.. &
Lunz, M. E. (1991, July). Comparisons of
computer adaptive and pencil and paper tests.
Bergstrom, B. & Lunz, M.
E. (1992). Confidence in pass/fail decisions, for
computer adaptive and paper-and-pencil examinations. Evaluation and the Health Professions, 15(4),
453-464.
Bergstrom, B.
Bergstrom, B. A., Lunz, M.
E., & Gershon, R. C. (1992). Altering the level of difficulty in computer
adaptive testing. Applied Measurement in
Education, 5, 137-149.
Bergstrom, B.A. & Stahl,
J. A. (1992). Assessing existing item
bank depth for computer adaptive testing. ERIC Document No. TM022404
#BE75-01. Betz, N. E.
(1975). New types of information and
psychological implications. In D. J.
Weiss (Ed.), Computerized adaptive trait measurement: Problems and Prospects (Research Report
75-5), pp. 32-43.
Betz, N. E. (1977). Effects of immediate knowledge of results and
adaptive testing on ability test performance.
Applied Psychological Measurement, 2, 259-266.
Betz, N. E. & Weiss, D.
J. (1973). An empirical study of
computer-administered two-stage ability testing (Research Report 73-4).
#BE74-4. Betz, N. E. & Weiss, D. J. (1974). Simulation studies of two-stage ability
testing (Research Report 74-4).
Betz, N. E. & Weiss, D.
J. (1975). Empirical and simulation
studies of flexilevel ability testing
(Research Report 75-3).
Betz, N. E. & Weiss, D.
J. (1976, June). Effects of immediate
knowledge of results and adaptive testing on ability test performance (Research
Report 76-3).
Betz, N. E. & Weiss, D.
J. (1976, June). Psychological effects
of immediate knowledge of results and adaptive ability testing (Research Report
76-4).
#BI01069. Bickel, P., Buyske, S., Chang, H.-H., &
Ying, Z. (2001). On maximizing item
information and matching difficulty with ability. Psychometrika, 66, 69-77.
#BI84-01. Bill, B. C.
(1984). A comparison of the
maximum likelihood strategy and stradaptive test on a micro-computer. Unpublished M. S. thesis,
Binet, A., & Simon, Th. A. (1905). Méthode
nouvelle pour le diagnostic du niveau intellectuel des anormaux. L'Année
Psychologique, 11, 191-244. (also cited
as: Applications des methods nouvelles au diagnostic du niveau intellectual
chez des enfants normaux et anourmaux d’hospice et d’ecole primaire, 245-336.)
Binet, A. & Simon, T.
(1908). Le development de l’intelligence
chez les enfants. L’Anee Psychologique,
14, 1-94.
Binet, A. & Simon, T. (1915). A method of measuring the development
of the intelligence of young children.
#BJ04-02. Bjorner, J.B. (2004, June). Developing tailored instruments: Item banking
and computerized adaptive assessment.
Paper presented at the conference “Advances
in Health Outcomes Measurement: Exploring the
Bjorner, J.B., Chang, C.H.,
Thissen, D., Reeve, B.B. (2007). Developing tailored instruments: Item banking
and computerized adaptive assessment. Quality
of Life Research, 16(Suppl 1, 95–108.
#BJ03913. Bjorner, J. B., Kosinski, M. & Ware, J. E., Jr. (2003). Calibration of an item pool for assessing the burden of headaches: An application of item response theory to the Headache Impact Test (HIT). Quality of Life Research 12: 913–933. {PDF file, 286 KB}
#BJ04-01. Bjorner, J. B., Kosinski, M., & Ware, J. E., Jr. (2004, in press). Computerized adaptive testing and item
banking. In P. M. Fayers and R. D. Hays (Eds.) Assessing Quality of Life.
Blais, J.G. (2002). Historique et
concepts propres au testing adaptatif [Adaptive testing: Historical accounts and concepts]. Presented at the 69th Congress of the Acfas. Sherbrooke: Association
canadienne française pour l’avancement des sciences (Acfas). [In French]
#BL02-01. Blais, J-.G
& Raiche, G. (2002,
April). Some features of the sampling
distribution of the ability estimate in computerized adaptive testing according
to two stopping rules. Paper presented
at the annual meeting of the International Objective Measurement Workshops-XI,
Blais, J.-G. & Raîche,
G. (submitted). Features of the estimated sampling distribution
of the ability estimate in computerized adaptive testing according to two
stopping rules. In D. G. Englehard (Eds.), Objective measurement: Theory into practice. Volume 6.
Bloxom, B. M. & Vale, C.
D. (undated). An adaptive method of
multidimensional trait estimation.
Unpublished manuscript.
Bloxom, B. M. & Vale,C.
D. (1987, June). Multidimensional
adaptive testing: A procedure for sequential estimation of the posterior centroid
and dispersion of theta. Paper presented
at the annual meeting of the Psychometric Society,
Bochner, J., Garrison, W.,
Palmer, L., MacKenzie, D., & Braveman, A. (1997). A computerized adaptive testing system for
speech discrimination measurement: The Speech Sound Pattern Discrimination
Test. Journal of the Acoustic Society of
#BO75-01. Bock, R. D. (1975). Discussion.
In D. J. Weiss (Ed.), Computerized adaptive trait measurement: Problems and Prospects (Research Report
75-5), pp. 46-49.
#BO82431. Bock, B. D., & Mislevy, R. J. (1982).
Adaptive EAP estimation of ability in a microcomputer environment. Applied Psychological Measurement, 6, 431-444
Bock, R. D., Muraki, E.,
& Pfeiffenberger, W. (1988). Item
pool maintenance in the presence of item parameter drift. Journal of Educational Measurement,25, 275-285.
Bock, R. D., & Zimowski,
M. F. (1998). Feasibility studies of two-stage testing in large-scale
educational assessment: Implications for NAEP. American Institutes for
Research, CA.
Bontempo, B., & Julian,
E. R., & Gorham, J. L. (1997, March).
Assessing speededness in variable-length computer-adaptive tests. Paper presented at the annual meeting of the
National Council on Measurement in Education,
#BO01965. Borman, W. C., Buck, D. E., Hanson, M. A.,
Montowidlo, S. J., Stark, S., & Drasgow, F.
(2001). An examination of the
comparative reliability, validity, and accuracy of performance ratings made
using computer adaptive rating scales.
Journal of Applied Psychology, 86, 965-973.
Borman, W. C., Hanson, M.A.,
Kubisiak, U. C., & Buck, D. E. (2000).
Computerized adaptive rating scales (CARS): Development and evaluation of the concept.
(Institute Rep. No. 350).
Borman, W. C., Hanson, M.
A., Montowidlo, S. J., Drasgow, F., Foster, L., & Kubisiak, U. C.
(1998). Computerized adaptive rating
scales that measure contextual performance.
Paper presented at the 3th annual conference of the Society for
Industrial and Organizational Psychology,
Bouchard, J. (1990). Future directions for the National Council:
The Computerized Adaptive Testing Project. Issues, 11, 1-5. (National Council of State Boards of Nursing)
Bowers,
D. R. (1992). Computer-based adaptive
testing in music research and instruction.
Psychomusicology, 10, 49-63.
Bowles,
R. (2001). An examination of item review
on computer adaptive tests. Manuscript in preparation,
#BO01-01. Bowles, R., &
Pommerich, M. (2001, April). An
examination of item review on a CAT using the specific information item selection
algorithm. Paper presented at the annual
meeting of the National Council on Measurement in Education,
#BO03-01. Boyd. A. M.
(2003). Strategies for
controlling testlet exposure rates in computerized adaptive testing
systems. Unpublished Ph.D. Dissertation,
The
#BO03-02. Boyd, A. M., Dodd, B. G., & Fitzpatrick,
S. J. (2003, April). A comparison of
exposure control procedures in CAT systems based on different measurement
models for testlets using the verbal reasoning section of the MCAT. Paper presented at the Annual meeting of the
National Council on Measurement in Education,
Bradlow, E.T., Wainer, H.,
and Wang, X (1999). A Bayesian random effects model for testlets,
Psychometrika, 64, 153-168.
#BR01085. Bradlow, E. T. & Weiss, R. E. (2001).
Outlier measures and norming methods for computerized adaptive
tests. Journal of Educational and
Behavioral Statistics, 26, 85-104.
Bradlow, E. T., Weiss, R. E., Cho, M. (1998). Bayesian identification of outliers in computerized adaptive testing. Journal of the American Statistical Association, 93, 910-919.
#BR04-01.
Breithaupt, K., Ariel, A., & Veldkamp, B. (2004). Automated
Simultaneous Assembly of Multi-Stage Testing for the Uniform CPA Examination. Paper
presented at the annual meeting of the National Council on Measurement in
Education,
#BR00-01. Bridgeman, B. & Cline, F. (2000). Variations in mean response times for
questions on the computer-adaptive GRE general test: Implications for fair
assessment (GRE Board Professional Report No. 96-20P: Educational Testing
Service Research Report 00-7).
#BR02-01. Bridgeman, B. & Cline, F. (2002, April). Fairness issues in adaptive tests with strict
time limits. Paper presented at the
annual meeting of the American Educational Research Association,
#BR03-01. Bridgeman, B,. Cline, F., & Hessinger, J.
(2003). Effect of extra time on GRE®
Quantitative and Verbal Scores (Research Report 03-13).
Bridgeman, B. &
Schaeffer, G. A. (1995, April). A comparison of gender differences on
paper-and-pencil and computer-adaptive versions of the Graduate Record
Examination. Paper presented at the
annual meeting of the American Educational Research Association,
Brooks. S. (1977).
A comparison of the classification of students by two methods of
administration of a mathematics placement test.
Unpublished doctoral dissertation,
#BR78415. Brooks, S. & Hartz, M. A. (1978).
Predictive ability of a branching test.
Educational and Psychological Measurement, 38, 415-419.
#BR77-6. Brown, J. M., & Weiss, D. J. (1977). An
adaptive testing strategy for achievement test batteries (Research Rep. No.
77-6).
Bryson, R. (1971, December). A comparison of four methods of selecting
items for computer-assisted testing (Technical Bulletin STB 72-5).
Buhr, D. C., & Legg, S.
M. (1989, March). Investigating the
validity of a computerized adaptive test for different examinee groups. Paper presented at the annual meeting of the
American Educational Research Association,
Bunderson, C. V., Inouye, D.
K., & Olsen, J. B. (1988). The four generations of computerized educational
measurement. Research Report 98-35.
Bunderson, C. V., Inouye, D.
K., & Olsen, J. B. (1986). The four generations of computerized educational
measurement. In R. L. Linn (Ed.), Educational
Measurement (3rd ed., pp. 367-407).
Burke, M. J., Normand, J.,
& Raju, N. M. (1987). Computerized psychological testing: Overview and critique. Professional Psychology: Research and Practice, 1, 42-51.
#BU03-01. Burt, W. M., Kim. S.-J., Davis, L. L., &
Dodd, B. G. (2003, April). A comparison
of item exposure control procedures using a CAT system based on the generalized
partial credit model. Paper presented at
the annual meeting of the American Educational Research Association,
Buyske,
S. G. (1998). Optimal design for item calibration in computerized adaptive
testing. Unpublished doctoral dissertation,
Candell, G. L. (1988).
Application of appropriateness measurement to a problem in computerized
adaptive testing. Unpublished
doctoral dissertation,
Carey, P. A. (ETS) (1999, April). The use of linear-on-the-fly testing for
TOEFL Reading. Paper presented at the
annual meeting of the National Council on Measurement in Education,
Carlson, R. (1994). Computer adaptive testing: A shift in the evaluation paradigm. Educational Technology Systems, 22 (3),
213-224.
Carlson, S. (2000). ETS finds flaws in the
way online GRE rates some students. Chronicle
of Higher Education, 47, a47.
Case, S. M. & Luecht, R.
M. (1997, March). Computer assembly of tests so that content
reigns supreme. Paper presented at the
annual meeting of the National Council on Measurement in Education,
Cella, D., Gershon, R., Lai, J. S., & Choi, S. (2007). The
future of outcomes measurement: Item banking, tailored short-forms, and
computerized adaptive assessment. Quality of Life Research, 16(Suppl. 1),
133-141.
#CH95-01. Chae, S. (1995). Item equivalence from paper-and-pencil to
computer adaptive testing. Unpublished
doctoral dissertation,
Chajewski,
M. & Lewis, C. (2009). Optimizing
item exposure control algorithms for polytomous computerized adaptive tests
with restricted item banks. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive
Testing. {PDF File, 923 KB}
#CH04-01. Chang, C.-H. (2004, June). Developing
tailored instruments: Item banking and computerized adaptive assessment. Paper
presented at the conference “Advances
in Health Outcomes Measurement: Exploring the
#CH00-04. Chang, C.-Y., Kalohn. J. C., Lin, C.-J. &
Spray, J. (2000). Estimating item
parameters from classical indices for item pool development with a computerized
classification test (ACT Research 2000-4).
Iowa City IA, ACT, Inc.
Chang, H. (1995, April). A
global information approach to computerized adaptive testing. Paper presented at the annual meeting of the
National Council on Measurement in Education,
Chang, H. (1996,
April). A model for score maximization
within a computerized adaptive testing environment. Paper presented at the annual meeting of the
NMCE,
Chang, H. H. (2004). Understanding
computerized adaptive testing: From Robbins-Munro to Lord and beyond. In D.
Kaplan (Ed.), The Sage handbook of quantitative
methodology for the social sciences (pp.
117-133). New York: Sage.
Chang, H.-H., Qian, J., & Ying, Z. (1999). a-stratified
multistage computerized adaptive testing. Applied Psychological
Measurement, 23, 211–222.
Chang, H., Qian, J., &
Ying, Z. (2001). a-stratified
multistage computerized adaptive testing with b-blocking. Applied Psychological Measurement, 25, 333-341
(also presented at National Council on Measurement in Education, 2000).
Chang, H. & van der
Linden. (2001). Implementing content
constraints in a-stratified adaptive testing
using a shadow test approach (Research Report 01-001).
#CH03262. Chang, H.-H.
& van der Linden, W. J.
(2003) Optimal stratification of
item pools in a-stratified computerized
adaptive testing. Applied Psychological
Measurement, 27, 262-274.
Chang, H. & Ying,
Z. (1999). a-stratified multistage computerized adaptive
testing. Applied Psychological Measurement, 23, 211-222.
Chang, H., & Ying, Z.
(in press, 1997?). Nonlinear sequential designs
for logistic item response theory models with applications to computerized
adaptive tests. The Annals of Statistics.
Chang, H.-H., & Ying, Z.
(1996). A global information approach to computerized adaptive testing. Applied
Psychological Measurement, 20, 213-229. (also presented at National Council on
Measurement in Education, 1997)
Chang, H.-H., & Ying, Z.
(1996, June). Building a statistical foundation for computerized adaptive
testing. Paper presented at the annual meeting of the Psychometric Society,
Chang, H.-H., & Ying, Z.
(1996, in preparation). Recursive
maximum likelihood estimation, sequential design, and computerized adaptive
testing.
Chang, H.-H. & Ying, Z.
(1997, June). Multi-stage CAT with
stratified design. Paper presented at
the annual meeting of the Psychometric Society.
Chang,
H.-H., & Ying, Z. (1999). a-stratified
multistage computerized adaptive testing. Applied Psychological Measurement,
23, 211-222.
#CH02-01. Chang, H. H. & Ying, Z. (2002, April). To weight or not to weight – balancing
influence of initial and later items in CAT. Paper presented at the annual
meeting of the National Council on Measurement in Education,
Chang,
H. H. & Ying, Z. (2003, April). Test-score comparability, ability estimation,
and item-exposure control in computerized adaptive testing. Paper presented at the Annual meeting of the
National Council on Measurement in Education,
Chang, H.-H.., & Zhang,
J. (2002). Hypergeometric family and test overlap rates in computerized
adaptive testing. Psychometrika, 67, 387-398.
(Also presented at the annual meeting of the Psychometric Society,
Lawrence KS, 1999.)
Chang, H.-H. & Zhang, J.
(2002, April). Identify the lower bounds
for item sharing and item pooling in computerized adaptive testing. Paper presented at the annual meeting of the
American Educational Research Association,
Chang, H.-H. & Zhang, J.
(2003, April). Assessing CAT security
breaches by the item pooling index.
Paper presented at the annual meeting of the National Council on
Measurement in Education,
#CH02387. Chang, H., & Zhang, J. (2002). Hypergeometric family and test overlap rates
in computerized adaptive testing. Psychometrika, 67, 387-298.
#CH00-01. Chang, S., Ansley, T., & Lin, S. (2000,
April). Performance of item exposure control methods in computerized adaptive
testing: Further explorations. Paper
presented at the Annual Meeting of the American Educational Research
Association,
#CH03071. Chang, S.-H. & Ansley, T. (2003). A comparative study of item exposure control
methods in computerized adaptive testing. Journal of Educational Measurement,40, 1,
71-103.
Chang, S. W. (1998). A
comparative study of item exposure control methods in computerized adaptive
testing. Unpublished doctoral dissertation,
#CH02-02. Chang, S.-W. & Harris, D. J. (2002, April). Redeveloping the exposure control parameters
of CAT items when a pool is modified.
Paper presented at the annual meeting of the American Educational
Research Association,
Chang, S.W. & Harris, D. (2002, April). Redeveloping the exposure control parameters of CAT items when a pool is modified. Paper presented at the Annual Meeting of the American Educational Research Association, New Orleans.
#CH98-03. Chang, S. W. & Twu, B. Y. (1998).
A comparative study of item exposure control methods in computerized
adaptive testing. Research Report Series
98-3.
#CH01-02. Chang, S.-W. & Twu, B.-Y. (2001).
Effects of changes in the examinees’ ability distribution on the
exposure control methods in CAT. Paper
presented at the annual meeting of the American Educational Research Association,
Chen, P. H. (2009).
Comparison of adaptive Bayesian estimation and weighted Bayesian
estimation in multidimensional computerized adaptive testing. In D. J. Weiss
(Ed.), Proceedings of the 2009 GMAC
Conference on Computerized Adaptive Testing. {PDF file, 308KB}
Chen, S. (1998). A
comparison of maximum likelihood estimation and expected a posteriori
estimation in computerized adaptive testing using the generalized partial
credit model. (Doctoral Dissertation,
Chen, S.-K. (2007). The comparison of maximum likelihood
estimation and expected a posteriori in CAT using the graded response model. Journal
of Elementary Education. 19, 339-371.
#CH01-01. Chen, S.-Y. (2001). A new approach to simulation studies in
computerized adaptive testing. Paper presented
at the annual meeting of the American Educational Research Association,
Chen, S.-Y., &
Ankenmann, R. D. (2004). Effects of practical constraints on item selection
rules at the early stages of computerized adaptive testing. Journal of
Educational Measurement, 41, 149-174. (Also presented at American Educational Research
Association, 1999).
Chen, S.-Y., Ankenmann,
R.D., & Chang, H.-H. (2000). A comparison of item selection rules at the
early stages of computerized adaptive testing. Applied Psychological Measurement, 24, 241-255.
Chen, S., Ankenmann, R. D.,
& Spray, J. A. (1999, April). Exploring the relationship between item
exposure rate and test overlap rate in computerized adaptive testing. Paper
presented at the annual meeting of the National Council on Measurement in
Education,
Chen, S., Ankenmann, R. D.,
& Spray, J. A. (2003). The
relationship between item exposure and test overlap in computerized adaptive
testing. Journal of Educational
Measurement, 40, 129-145.
#CH98569. Chen, S., Hou, L., & Dodd, B.G.
(1998). A comparison of maximum
likelihood estimation and expected a posteriori estimation in CAT using the
partial credit model. Educational and Psychological Measurement, 58, 569-595.
Chen, S., Hou, L.,
Fitzpatrick, S. J., & Dodd, B. G. (1995, April). The effect of population
distribution and methods of theta estimation on CAT using the rating scale
model. Paper presented at the annual meeting of the American Educational
Research Association,
#CH97422.
Chen, S., Hou, L. Fitzpatrick, S. J., & Dodd, B. (1997). The effect of population distribution and
methods of theta estimation on computerized adaptive testing (CAT) using the
rating scale model. Educational and Psychological Measurement, 57, 422-439.
Chen, S.-Y., Ankenmann, R.
D., & Chang, H.-H. (2000). A comparison of item selection rules at the
early stages of computerized adaptive testing.
Applied Psychological Measurement, 24, 241-255.
Chen, S.-Y., Ankenmann, R.
D., & Spray, J. A. (1999). Exploring
the relationship between item exposure rate and test overlap rate in
computerized adaptive testing (ACT Research Report series 99-5).
#CH03129. Chen, S.-Y., Ankenmann, R. D., & Spray,
J. A. (2003). The relationship between
item exposure and test overlap in computerized adaptive testing. Journal of Educational Measurement,40,
129-145.
#CH03-01. Chen, S.-Y. & Doong, H. (2003). Predicting item exposure parameters
in computerized adaptive testing. Paper
presented at the annual meeting of the American Educational Research
Association,
Chen, S.Y. & Lei, P.W. (2005). Controlling item exposure and test overlap in computerized adaptive testing. Applied Psychological Measurement, 29(2), 204–217.
Cheng, P. E. & Liou, M. (2000). Estimation of trait levels in computerized adaptive testing. Applied Psychological Measurement, 24, 257-265.
#CH03204. Cheng, P. E. & Liou, M. (2003).
Computerized adaptive testing using the nearest-neighbors
criterion. Applied Psychological Measurement, 27, 204-216.
Chen,
Y.-Y., & Ankenmann, R. D. (2004) Effects of practical constraints on item
selection rules at the early stages of computerized adaptive testing. Journal of Educational Measurement, 41,
149-174.
Cheng,
Y. (2009). Computerized adaptive testing
for cognitive diagnosis. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive
Testing. {PDF File, 308 KB}
Cheng, Y. & Chang,
H.-H. (2007). The modified maximum global discrimination index method for
cognitive diagnostic computerized adaptive testing. In D. J. Weiss (Ed.).
Proceedings of the 2007 GMAC Conference on Computerized Adaptive
Testing. {PDF file, 172 KB}
Cheng, Y.,
Chang, H.-H.,
Cheng, Y., Chang, H. H.,
& Wang, X. B. (2006, April). Constraints-weighted information method for
item selection of severely constrained computerized adaptive testing. Paper
presented at the annual meeting of the National Council on Measurement in
Education, San Francisco.
Cheng, Y., Chang, H., &
Yi, Q. (2007). Two-phase item selection procedure for flexible content
balancing in CAT. Applied Psychological. Measurement, 3, 467–482.
Choi, S.W. (2009). Firestar: Computerized adaptive testing
simulation program for polytomous IRT models. Applied Psychological Measurement, 33, 644–645.
Choi,
S. W., Reise, S. P, Pilkonis, P.A., Hays, R. D., & Cella, D. (2010).
Efficiency
of static and computer adaptive short forms compared to full-length measures of
depressive symptoms. Quaity of Life
Research, 19(1),
125–136.
Choi, S.W. & Swartz,
R.J.. (2009). Comparison of CAT item
selection criteria for polytomous items. Applied
Psychological Measurement, 33, 419–440.
Cito.
(1999). WISCAT. Een computergestuurd
toetspakket voor rekenen en wiskunde. [A computerized test package for
arithmetic and mathematics]. Cito:
#CI04-01. Cizek, G. J. (2004). Protecting the integrity of computer-adaptive
licensure tests: Results of a legal challenge.
Paper presented at the annual meeting of the American Educational
Research Association,
#CL76-01.
Cleary, T. A., Linn, R. L.,
& Rock, D. A. (1968). Reproduction
of total test score through the use of sequential programmed tests. Journal of Educational Measurement, 5, 183-187.
#CL69345. Cleary, T. A., Linn, R. L., & Rock, D. A.
(1969). An exploratory study of
programmed tests. Educational and
Psychological Measurement, 28, 345-360.
Cliff, N. (1975).
A basic test theory generalizable to tailored testing (Technical Report
No. 1).
Cliff, N. (1975). Complete orders from incomplete data: Interactive ordering and tailored
testing. Psychological Bulletin, 82,
2859-302.
#CL76-02. Cliff, N. (1976). Elements of a basic test theory generalizable
to tailored testing. Unpublished
manuscript.
#CL75-01. Cliff,
N. (1976). Incomplete orders and
computerized testing. In C. K. Clark
(Ed.), Proceedings of the First
Conference on Computerized Adaptive Testing (pp. 18-23).
#CL77375. Cliff, N. A. (1977). A theory of consistency ordering generalizable to tailored testing. Psychometrika, 375-399.
#CL77-04. Cliff, N., Cudeck, R., & McCormick,
D. (1978). Evaluations of implied orders as a basis for
tailored testing using simulations (Technical Report No. 4).
#CL78-06. Cliff, N., Cudeck, R., & McCormick,
D. (1978). Implied orders as a basis for tailored
testing (Technical Report No. 6).
Cliff, N. A., Cudeck, R.
& McCormick, D. (1977). An empirical evaluation of implied orders as a basis for tailored
testing. In D. J. Weiss (Ed.),
Proceedings of the 1977 Computerized Adaptive Testing Conference.
Cliff, N. A., Cudeck, R.
& McCormick, D. (1979). Evaluation of
implied orders as a basis for tailored testing with simulation data. Applied Psychological Measurement, 3,
495-514.
#CO03-01.
Cook, K. F., Roddey, T. S., Gartsman, G. M., & Olson, S. L.
(2003). Development and psychometric
evaluation of the Flexilevel Scale of Shoulder Function (FLEX-SF). Medical Care (in press). {PDF file,
607 KB}
Collins, J. A. (1996).
Adaptive testing with granularity. Master’s thesis, University of Saskatchewan,
Department of Computer Science.
Collins,
J. A., Greer, J. E., & Huang, S. X. (1996). Adaptive assessment using
granularity hierarchies and Bayesian nets.
In Frasson, C., Gauthier, G., and Lesgold, A. (Eds.) Intelligent
Tutoring Systems, Third International Conference, ITS'96, Montréal, Canada, June 1996
Proceedings. Lecture Notes in Computer Science 1086.
Cordova, M. J. (1997).
Optimization methods in computerized adaptive testing. Unpublished doctoral
dissertation, Rutgers University,
Cordova, M. J. (1998). Applications of network flows to computerized
adaptive testing. Dissertation,
#CO75-01. Cory, C. H. (1976). Using computerized tests to add new
dimensions to the measurement of abilities which are important for on-job
performance: An exploratory study. In C. K. Clark (Ed.), Proceedings of the First Conference on
Computerized Adaptive Testing (pp. 64-74).
Costa, D. R., Karino, C.
A., Moura, F. A. S., & Andrade, D. F. (2009). A comparison of three methods of item
selection for computerized adaptive testing. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on
Computerized Adaptive Testing.{PDF
file, 531 KB}
Cowden, D. J. (1946). An application of sequential sampling to
testing students. Journal of the
American Statistical Association, 41, 547-556.
Crichton, L. I. (1981).
Effect of error in item parameter estimates on adaptive testing (Doctoral
dissertation,
#CR82-52. Croll, P. R. (1982). Computerized adaptive testing system
design: Preliminary design
considerations (Tech. Report 82-52).
Croll, P. R. & Urry, V.
W. (1975). Tailored testing: Maximizing validity and utility for job
selection. Paper presented at the 86th
Annual Convention of the American Psychological Association.
Cronbach, L. J. (1966). New light on test strategy from decision
theory. In A. Anastasi (Ed.). Testing problems in perspective.
Cudeck, R. (1985). A
structural comparison of conventional and adaptive versions of the ASVAB Multivariate Behavioral Research, 20, 305-322.
Cudeck, R. A., Cliff, N.,
& Kehoe, J. (1977). TAILOR:
A FORTRAN procedure for interactive tailored testing. Educational and Psychological Measurement,
37, 767-769.
#CU76-02. Cudeck, R. A., Cliff, N., Reynolds, T. J.,
& McCormick, D. J. (1976).
Cudeck, R., McCormick, D.
J., & Cliff, N. (1979).
Cudeck, R., McCormick, D.,
& Cliff, N. (1980). Implied orders tailored testing: Simulation
with the Stanford-Binet. Applied
Psychological Measurement, 4, 157-163.
Curran, L. T., & Wise,
L. L. (1994, August). Evaluation and implementation of CAT-ASVAB. Paper
presented at the annual meeting of the American Psychological Association,
Davey, T., & Fan, M.
(2000, April). Specific information item selection for adaptive testing. Paper
presented at the annual meeting of the National Council on Measurement in
Education,
Davey, T., Godwin, J., &
Mittelholz, D. (1997). Developing and scoring an innovative computerized
writing assessment. Journal of Educational Measurement, 34, 21-41.
Davey, T., & Nering, M.
L. (1998, June). Evaluating and insuring
measurement precision in adaptive testing.
Paper presented at the annual meeting of the Psychometric Society,
Davey, T., & Nering, M.
L. (1998, September). Controlling item exposure and maintaining item security.
Paper presented at an Educational Testing Service-sponsored colloquium entitled
“Computer-based testing: Building the foundations for future assessments,”
Davey, T., & Nering, M.
(2002). Controlling item exposure and maintaining item security. In C. N.
Mills, M. T. Potenza, & J. J. Fremer (Eds.), Computer-Based Testing: Building the
Foundation for Future Assessments (pp. 165-191).
Davey, T., Nering, M., &
Thompson, T. (1997, June). Realistic simulation procedures for item
response data. In T. Miller (Chair), High-dimensional
simulation of item response data for CAT research. Symposium presented at the annual meeting of
the Psychometric Society,
Davey, T. & Parshall, C.
G. (1995, April). New algorithms for item selection and exposure control with
computerized adaptive testing. Paper presented at the annual meeting of the
American Educational Research Association,
Davey, T., & Pitoniak, M. J. (2006). Designing
computerized adaptive tests. In S.M. Downing & T. M. Haladyna (Eds.), Handbook of test development.
Davey, T., Pommerich, M.
& Thompson, D. T. (1999). Pretesting
alongside an operational CAT. Paper presented at the annual meeting of the
National Council on Measurement in Education,
Davey, T. & Thomas,
L. (1996, April). Constructing adaptive tests to parallel
conventional programs. Paper presented
at the annual meeting of the American Educational Research Association,
David, L. A. & Lewis, C.
(1996, April). Person-fit indices and
their role in the CAT environment. Paper
presented at the Annual meeting of the National Council on Measurement in
Education,
Davis, K. M., Chang, C.
-H., Lai, J. -S., & Cella, D. (2002).
Feasibility and acceptability of computerized adaptive testing (CAT) for
fatigue monitoring in clinical practice. Quality of Life Research, 11(7), 134.
#DA02-01. Davis, L. L. (2002). Strategies for
controlling item exposure in computerized adaptive testing with polytomously
scored items. Unpublished doctoral dissertation,
#DA03-01. Davis, L. L.
(2003, April). Strategies for
controlling item exposure in computerized adaptive testing with the generalized
partial credit model. Paper presented at
the annual meeting of the National Council on Measurement in Education,
#DA04165. Davis, L. L.
(2004). Strategies for controlling item
exposure in computerized adaptive testing with the generalized partial credit
model. Applied Psychological Measurement,
28, 165-185.
#DAxx-01. Davis, L. L. & Dodd, B. G. (Undated). An examination of testlet scoring
and item exposure constraints in the Verbal Reasoning section of the MCAT. (See 2001 monograph below). {PDF file,
653 KB}
Davis, L. L. & Dodd, B.
G. (2003). Item exposure constraints for
testlets in the verbal reasoning section of the MCAT. Applied Psychological
Measurement, 27, 335-356.
Davis, L. & Dodd, B. (March 2005). Strategies for controlling item exposure in computerized adaptive testing with the partial credit model. Pearson Educational Measurement Research Report 05-01.
Davis, L. L. & Dodd, B. G. (2008). Strategies for
controlling item exposure in computerized adaptive testing with the partial
credit model. Journal of Applied Measurement, 9, 1-17.
Davis, L. L., Pastor, D. A.,
Dodd, B. G., Chiang, C., & Fitzpatrick, S. (2000). An examination of
exposure control and content balancing restrictions on item selection in CATs
using the partial credit model. Paper
presented at the annual meeting of the American Educational Research
Association,
#DA03024. Davis, L. L., Pastor, D. A., Dodd, B. G.,
Chiang, C., & Fitzpatrick, S. J.
(2003). An examination of exposure control and content balancing restrictions
on item selection in CATs using the partial credit model. Journal of Applied Measurement, 4, 24-42.
De Ayala, R. J. (1989). A
comparison of the nominal response model and the three-parameter logistic model
in computerized adaptive testing. Educational and Psychological Measurement,
49, 789-805.
De Ayala, R. J. (1992). The
nominal response model in computerized adaptive testing. Applied Psychological
Measurement, 16, 327-343.
De Ayala, R. J. (1992). The influence of dimensionality on CAT
ability estimation. Educational and Psychological Measurement, 52, 513-528.
De Ayala, R.
J., Dodd, B G., & Koch, W. R. (1990). A simulation and comparison of flexilevel and
Bayesian computerized adaptive testing. Journal of Educational Measurement,
27, 227-239.
Diones, R. & Everson, H. (1994). Computer adaptive testing: Assessment of the
future. Curriculum/Technology Quarterly, 4 (2), 1-3.
De Ayala, R. J., & Koch,
W. R. (1985). ALPHATAB: A lookup table
for Bayesian computerized adaptive testing.
Applied Psychological Measurement, 9, 326.
De Ayala, R. J., & Koch,
W. R. (1987, April). Computerized adaptive
testing: A comparison of the nominal response model and the three-parameter
logistic model. Paper presented at the
annual meeting of the National Council on Measurement in Education,
De Ayala, R. J., Dodd, B.
G., & Koch, W. R. (1992). A comparison of the partial credit and graded
response models in computerized adaptive testing. Applied Measurement in
Education, 5, 17-34.
#deBE00-01. De Beer, M.
(2000). Learning Potential Computerised
Adaptive Test (LPCAT): Technical Manual.
#deBE00-02. De Beer, M.
(2000). Learning Potential Computerised
Adaptive Test (LPCAT): User's Manual.
De Beer, M. (2000). The construction and evaluation of a dynamic computerised adaptive test
for the measurement of learning potential. Unpublished D .Litt et Phil
dissertation.
De Beer, M. (2002, June). Utility
of Learning Potential Computerised Adaptive Test (LPCAT) scores in predicting
academic performance of bridging students: A comparison with other predictors.
Paper presented at the 5th Annual Society for Industrial and
Organisational Psychology Congress,
#deBE03-01. De Beer, M.
(2003, June). A comparison of learning potential results at various
educational levels. Paper presented at the 6th Annual Society for
Industrial and Organisational Psychology of South Africa (SIOPSA) conference,
25-27 June 2003. {PDF file, 391 KB}
#deBE03-02. De Beer, M. (2003). Development of the
Learning Potential Computerised Adaptive Test (LPCAT). Unpublished
manuscript. {PDF file, 563 KB}
De Beer, M. (2007) Use of CAT in dynamic testing. In
D. J. Weiss (Ed.), Proceedings of the 2007 GMAC Conference on Computerized Adaptive
Testing. {PDF file, 133 KB}
De la Torre, R. (1991). The development and evaluation of a system
for computerized adaptive testing.
Unpublished doctoral dissertation,
De la Torre, R. & Vispoel, W. P. (1991, April). The development and evaluation of a
computerized adaptive testing system.
Paper presented at the annual meeting of the American Educational
Research Association,
de Gruijter, D. N. (1987). Wilcox' closed sequential testing
procedure in stratified item domains. Methodika, 1(1), 3-12.
#deGR77-01. De Gruijter, D. N. M. (1977).
A two-stage testing procedure (Memorandum 403-77).
#DE03-01. Deng, H. & Ansley, T. (2003, April). To stratify or not: An investigation of CAT item selection
procedures under practical constraints.
Paper presented at the Annual meeting of the National Council on
Measurement in Education,
#DE01-01.
Deng, H. & Chang, H.-H. (2001). a-stratified computerized adaptive
testing with unequal item exposure across strata. Paper presented at the annual
meeting of the American Educational Research Association,
Desmarais, M. C. & Pu,
X (no date). Computer Adaptive Testing With Bayesian Networks: A Comparison
with IRT.
Desmarais,
M. C., Pu, X, & Blais, J.-G. (2007).
Partial
order knowledge structures for CAT applications. In D. J. Weiss (Ed.), Proceedings
of the 2007 GMAC Conference on Computerized Adaptive Testing. {PDF
file, 475 KB}
De Witt, J. J. & Weiss, D. J. (1974). A computer software system for adaptive
ability measurement (Research Report 74-1).
#DE76104.
DeWitt, L. J. & Weiss, D. J.
(1976). Hardware and software
evolution of an adaptive ability measurement system. Behavior Research Methods and
Instrumentation, 8, 104-107.
Diao,
Q., and Reckase, M. (2009). Comparison
of ability estimation and item selection methods in multidimensional
computerized adaptive testing. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive
Testing. {PDF File, 342 KB}
#DI86-189. Divgi, D. R. (1986). Determining the
sensitivity of CAT-ASVAB scores to changes in item response curves with the
medium of administration (Report No. 86-189).
#DI87-161. Divgi, D. R. (1987, August). Properties of some Bayesian scoring
procedures for computerized adaptive tests (Research Memorandum CRM
87-161).
Divgi, D. R. (1991,
September). An analysis of CAT-ASVAB
scores in the Marine Corps JPM data (CRM- 91-161).
#DO04-01.
Do, B.-R., Chuah, S. C., & Drasgow, F. (2004). Item parameter recovery with adaptive
tests. Paper presented at the annual
meeting of the National Council on Measurement in Education,
Dodd, B. G. (1987, April).
Computerized adaptive testing with the rating scale model. Paper presented at
the Fourth International Objective Measurement Workshop, Chicago.
Dodd, B. G. (1990). The
effect of item selection procedure and stepsize on computerized adaptive
attitude measurement using the rating scale model. Applied Psychological
Measurement, 14, 355-366.
#DO95005. Dodd, B. G., De
Ayala, R. J., & Koch, W. R. (1995). Computerized adaptive testing with
polytomous items. Applied Psychological Measurement, 19, 5-22.
Dodd, B. G., &
Fitzpatrick, S. J. (1998, September). Alternatives for scoring computerized
adaptive tests. Paper presented at an Educational Testing Service-sponsored
colloquium entitled Computer-based testing: Building the foundations for future
assessments,
Dodd, B. G., Koch, W. R.,
& De Ayala, R. J. (1988, April). Computerized adaptive attitude
measurement: A comparison of the graded response and rating scale models. Paper
presented at the annual meeting of the American Educational Research
Association,
Dodd, B. G., Koch, W. R.,
& De Ayala, R. J. (1989). Operational characteristics of adaptive testing
procedures using the graded response model. Applied Psychological Measurement,
13, 129-143.
#DO9361. Dodd, B. G., Koch, W. R., & De Ayala, R.
J. (1993). Computerized adaptive testing using the partial credit model:
Effects of item pool characteristics and different stopping rules. Educational and Psychological Measurement,
53, 61-77.
Dolan, S. (1993). A comparison of computer adaptive test
administration methods. Unpublished
doctoral dissertation,
Doucette, D. (Ed.). (1988). Computerized adaptive testing: The state of the art in assessment at three
community colleges.
Dowling,
C. E., Hockemeyer, C., & Ludwig, A .H. (1996) Adaptive assessment and
training using the neighbourhood of knowledge states. In Frasson, C., Gauthier, G., & Lesgold,
A. (eds.) Intelligent Tutoring Systems, Third International Conference, ITS'96,
Montréal, Canada, June 1996 Proceedings. Lecture Notes in Computer Science
1086.
Dowling,
C.E. and Kaluscha, R. (1995, August).
Prerequisite relationships for the adaptive assessment of
knowledge. In Greer, J. (Ed.)
Proceedings of AIED'95, 7th World Conference on Artificial Intelligence in
Education,
Drasgow, F., &
Olson-Buchanan, J. B. (Eds.). (1999). Innovations in computerized assessment.
#DU93181. Du,
Y., Lewis, C. & Pashley, P. J. (1993).
Computerized mastery testing using fuzzy set decision theory. Applied Measurement in Education, 6,
181-193. (Also Educational Testing
Service Research Report 94-37)
See #DU93181.
Du, Y., Lewis, C., Pashley, P. J.
(1994). Computerized mastery
testing using fuzzy set decision theory (Research Report 94-37).
Dunkel, P. A. (1997). Computer-adaptive testing of listening
comprehension: A blueprint of CAT Development. The Language Teacher Online
21, no. 10.
<http://langue.hyper.chubu.ac.jp/jalt/pub/tlt/97/oct/dunkel.html>.
Dunkel, P. (1999). Research and development of a
computer-adaptive test of listening comprehension in the less-commonly taught
language Hausa. In M. Chalhoub-Deville (ed). Issues in
computer-adaptive testing of reading proficiency.
Economides, A.A. (2005). Adaptive orientation
methods in computer adaptive testing. Proceedings E-Learn 2005 World Conference
on E-Learning in Corporate, Government, Healthcare, and Higher Education, pp.
1290-1295, Vancouver, Canada, AACE, October 2005.
Economides, A.A. (2005). Computer adaptive
testing quality requirements. Proceedings E-Learn 2005 World Conference on
E-Learning in Corporate, Government, Healthcare, and Higher Education, pp.
288-295, Vancouver, Canada, AACE, October 2005.
Economides,
A.A. (2005). Personalized feedback in CAT. WSEAS Transactions on Advances in
Engineering Education, Issue 3, Volume 2, 174-181, July 2005.
Educational
Testing Service. (1993). The GRE computer adaptive testing program (CAT):
Integrating convenience, assessment, and technology.
Edwards,
M. C. & Thissen, D. (2007). Exploring potential designs for multi-form
structure computerized adaptive tests with uniform item exposure. In D. J.
Weiss (Ed.), Proceedings of the 2007 GMAC
Conference on Computerized Adaptive Testing.
{PDF file, 295 KB}
Egberink, I. J. L. & Veldkamp, B. P. (2007). The development of a computerized adaptive
test for integrity. In D. J. Weiss (Ed.), Proceedings
of the 2007 GMAC Conference on Computerized Adaptive Testing. {PDf file, 290 KB}
Eggen, T. J. H. M. (1998). Item selection in adaptive testing with the sequential probability
ratio test. Measurement and Research Department Report, 98-1.
Eggen, T. J. H. M.
(1999). Item selection in adaptive
testing with the sequential probability ratio test. Applied Psychological Measurement, 23,
249-261. [Reprinted as Chapter 6 in #EG04-01]
#EG01-1. Eggen, T. J. H. M. (2001).
Overexposure and underexposure of items in computerized adaptive testing
(Measurement and Research Department Reports 2001-1).
#EG04-01 Eggen, T. J. H. M. (2004). Contributions to the theory and practice of
computerized adaptive testing.
Eggen, T. J. H. M.
(2007). Choices in CAT models in the context of educational testing. In D.
J. Weiss (Ed.), Proceedings of the 2007
GMAC Conference on Computerized Adaptive Testing. {PDF
file, 123 KB}
#EG96-3. Eggen, T. J. H. M, & Straetmans, G. J. J.
M. (1996). Computerized adaptive testing for classifying examinees into three
categories (Measurement and Research Department Rep. 96-3).
#EG00713. Eggen, T. J. H. M,
& Straetmans, G. J. J. M. (2000). Computerized adaptive testing for
classifying examinees into three categories.
Educational and Psychological Measurement, 60, 713-734. [Reprinted as
Chapter 5 in #EG04-01]
#EG03-01. Eggen, T. & Verschoor, A. (2003, October). Optimal testing with easy items in
computerized adaptive testing. Paper
presented at the conference of the International Association for Educational
Assessment,
#EG03-02. Eggen, T. J. H. M. & Verschoor, A. J.
(2004). Optimal testing with easy items
in computerized adaptive testing (Measurement and Research Department Report
2004-2).
#EI93-55. Eignor, D. R. (1993). Deriving comparable scores for computer
adaptive and conventional tests: An example using the SAT. (ETS Research Report RR-93-5).
Eignor, D. R., Folk, V. G.,
Li, M.-Y., & Stocking, M. L. (1994, April).
Pinpointing PRAXIS I CAT characteristics through simulation
procedures. Paper presented at the annual
meeting of the National Council on Measurement in Education,
Eignor, D. R. &
Schaffer, G.A. (1995, April).
Comparability studies for the GRE CAT General Test and the NCLEX using
CAT. Paper presented at the annual meeting
of the National Council on Measurement in Education,
#EI93-56. Eignor, D. R., Stocking, M. L., Way, W. D.,
& Steffen, M. (1993). Case studies
in computer adaptive test design through simulation (Research Report RR-93-56).
Eignor, D. R., Way, W. D.,
& Amoss, K.E. (1994, April).
Establishing the comparability of the NCLEX using CAT with traditional
NCLEX examinations. Paper presented at the annual meeting of the National
Council on Measurement in Education,
Eignor, D. A., Way, W. D.,
Stocking, M., & Steffen, M. (1993).
Case studies in computerized adaptive test design through simulation
(Research Report 93-56).
Elwood, D. L. (1969). Automation of psychological testing. American Psychologist, 24, 287-289.
Elwood, D. L. &
Embretson,
S. E. (1999). Generating items during testing: Psychometric issues and
models. Psychometrika, 64, 407-433.
Engdahl, B. (1992). Computerized adaptive assessment of cognitive abilities among disabled adults. ERIC Document No. ED301274
#EN77158. English, R. A., Reckase, M. D., &
Patience, W. M. (1977). Application of
tailored testing to achievement measurement.
Behavior Research Methods and Instrumentation, 9, 158-161.
Epstein, K. I. & Knerr, C. S.
(1978). Applications of
sequential testing procedures to performance testing. In D. J. Weiss (Ed.), Proceedings of the 1977
Computerized Adaptive Testing Conference.
Fan, M. & Hsu, Y. (1995, June).
The effect of ability estimation for polytomous CAT in different item
selection procedures. Paper presented at
the Annual meeting of the Psychometric Society,
#FA96-02. Fan, M., & Hsu, Y. (1996, April). Multidimensional computer adaptive
testing. Paper presented at the Annual
Meeting of the American Educational Research Association,
#FA96-01. Fan, M., & Hsu, Y. (1996, April). Utility
of Fisher information, global information and different starting abilities in
mini CAT. Paper presented at the Annual Meeting of the National Council on
Measurement in Education,
#FA99-01. Fan, M., Thompson, T.,
& Davey, T. (1999, April). Constructing adaptive tests to parallel
conventional programs. Paper presented at the annual meeting of the National
council on Measurement in Education,
#FA02-01. Fan, M. &
Zhu. (2002, April). A further study on adjusting CAT item
selection starting point for individual examinees. Paper presented at the annual meeting of the
American Educational Research Association,
Fayers, P. (2007). Applying item
response theory and computer adaptive testing: The challenges for health
outcomes assessment. Quality of Life Research. 16:187–194.
Featherman, C. M., Subhiyah,
R. G., & Hadadi, A. (1996, April).
Effects of randomesque item selection on CAT item exposure rates and
proficiency estimation under 1- and 2-PL models. Paper presented at the annual meeting of the
American Educational Research Association,
Featherman, C. M., Subhiyah,
R. G., & Hadadi, A. (1996, April).
New algorithms for item selection and exposure and proficiency
estimation under 1- and 2-PL models.
Paper presented at the annual meeting of the American Educational
Research Association,
#FE69-49.
#FE70-01.
#FE70025.
#FE71-01.
#FE73-01.
Fields, F. A. (1992). Computerized adaptive testing for NCLEX-PN.
Journal of Practical.Nursing, 42, 8-10.
Finkelman,
M., Weiss, D. J., & Kim-Kang, G.
(2009). Item election and
hypothesis testing for the adaptive measurement of change. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive
Testing. {PDF File, 228 KB}
Finkelman, M., Nering, M.
L., & Roussos, L. A. (2009). A
conditional exposure control method for multidimensional adaptive testing. Journal of Educational Measurement, 46, 84-103.
Finney, S. J., Smith, R. W.,
& Wise, S. L. (1999, April). The
effects of judgment-based stratum classifications on the efficiency of stratum
scored CATs. Paper presented at the
annual meeting of the National Council on Measurement in Education,
Fischer, G. H. & Pendl,
P. (1980). Individualized testing on the
basis of the Rasch model. In . J. Th.
Van der Kamp, W. F. Langerak, & D. N. M. de Gruijter (Eds.). Psychometrics for educational debates.
Flaugher, R. (2000).
Item pools. In Wainer, H. (2000).
Computerized adaptive
testing: a primer.
Fliege,
H., Becker, J., Walter, O. B., Bjorner, J. B., Klapp, B. F., & Rose, M.
(2005). Development of a computer-adaptive test for depression (D-CAT). Quality of Life Research, 14, 2277–2291.
Folk, V. G. (1990,
April). Adaptive testing and item
difficulty order effects. Paper
presented at the annual meeting of the American Educational Research
Association,
Folk, V. & Golub-Smith,
M. (1996) Calibration of on-line pretest data using BILOG. Paper presented at
the annual meeting of National Council on Measurement in Education,
Folk, V.G., & Green, B.
F. Adaptive estimation when the
unidimensionality assumption of IRT is violated. Applied Psychological Measurement, 13,
373-389.
Folk, V. G. & Wingersky,
M. (1999, April). Fixed length CATs, or CATs in need of
fixing. Paper presented at the annual
meeting of the National Council on Measurement in Education,
Forbey, J. D., & Ben-Porath, Y. S. (2007). Computerized adaptive personality testing: a review and illustration with the MMPI-2 Computerized Adaptive Version. Psychological Assessment, 19(1), 14-24.
Forbey, J. D., Handel, R.
W., & Ben-Porath, Y.S. (June,
1996). Computerized adaptive
administration of the MMPI-A. Paper
presented at the 31st Annual Symposium and Recent Developments in
the use of the MMIP-2 and MMPI-A,
#FO00083. Forbey, J. D., Handel, R. W., &
Ben-Porath, Y.S. (2000). A real data simulation of computerized
adaptive administration of the MMPI-A.
Computers in Human Behavior, 16, 83-96.
Forker, J. E. &
McDonald, M. E. (1996). Methodologic
trends in the healthcare professions: Computer adaptive and computer simulation
testing. Nurse Education, 21,13-14.
#FR03-01. French, B. F. & Thompson, T. T. (2003,
April). The evaluation of exposure
control procedures for an operational CAT.
Paper presented at the annual meeting of the American Educational
Research Association,
Frick, T. W. (1988). A comparison of three decision models for adapting the length
of computer-based mastery tests.
Unpublished manuscript (submitted to Journal of Educational Computing
Research).
Frick, T. W. (1989).
Bayesian adaptation during computer-based tests and computer-guided
practice exercises. Journal of
Educational Computing Research, 5(1), 89-114.
Frick, T.W. (1989).
A comparison of an expert systems approach to computerized adaptive
testing and an IRT model. Unpublished
manuscript (submitted to American Educational Research Journal).
Frick,
T. J. (1990). A comparison of three
decision models for adapting the length of computerized mastery tests. Journal
of Educational Computing Research, 6(4), 479-513.
Frick,
T. W. (1992). Computerized adaptive mastery
tests as expert systems. Journal of Educational Computing Research, 8(2),
187-213.
Frick, T. W., Plew, G.T.,
& Luk, H.-K. (1989). EXSPRT: An
expert systems approach to computer-based adaptive testing. Paper presented at the annual meeting of the
American Educational Research Association,
Friedman, D,. Steinberg, A,
& Ree, M. J. (1981). Adaptive
testing without a computer. Catalog of
Selected Documents in Psychology, Nov. 1981, 11, 74-75 (Ms. No. 2350). AFHRL Technical Report 80-66.
Gafni, N., Cohen, Y., Roded, K., Baumer, M., &
Moshinsky, A. (2009). Applications of
CAT in admissions to higher education in
Gallagher, A. Bridgeman, B.,
(ETS) & Calahan, C. (Fordham) (1999,
April). Fairness in computer-based
testing. Paper presented at the annual
meeting of the National Council on Measurement in Education,
#GA02812.
Gardner, W., Shear, K.,
Kelleher, K., Pajer, K., Mammen, O., Buysse, D., et al. (2004). Computerized adaptive measurement of
depression: A simulation study. BMC
Psychiatry, 4(1),13.
Garrison, W. M. (1985). Monitoring item calibrations from data
yielded by an adaptive testing procedure.
Educational Research Quarterly, 10, 9-12.
#GA82-01. Garrison, W. M. & Baumgarten, B. S. (1982, March). Assessing mathematics achievement with a
tailored testing program. Paper
presented at the annual meeting of the American Educational Research
Association,
Garrison, W. M. &
Baumgarten, B. S. (1986). An application
of computer adaptive testing with communication handicapped examinees. Educational and Psychological Measurement,
46, 23-25.
Georgiadou, E.,
Triantafillou, E. & Economides, A.A. (2006). Evaluation parameters for
computer adaptive testing. British Journal of Educational Technology, Vol. 37,
No 2, 261-278, March 2006.
Georgiadou, E. G., Triantafillou, E., &
Economides, A. A. (2007). A review of item exposure control strategies for computerized
adaptive testing developed from 1983 to 2005. Journal of Technology, Learning, and Assessment, 5(8). Retrieved 25 July 2007 from http://www.jtla.org. {PDF file, 326 KB}
#GE01-01. Geranpayeh. A. (2001). CB BULATS: Examining
the reliability of a computer based test using test-retest method.
Gershon, R. (1989). CAT administrator [Computer program].
Gershon, R. C. (1994). CAT software system [computer program.]
Gershon, R. C. (year?). The effect of
individual differences variables on the assessment of ability for computerized
adaptive testing. Dissertation Abstracts
International, Section B: The Sciences and Engineering, 57 (6-B), 4085.
Gershon, R. C. (2004) The
ABCs of Computerized Adaptive Testing. In T. M. Wood & W. Zhi (Eds.),
Measurement issues and practice in physical activity. Champaign, IL: Human
kinetics.
Gershon, R. C. (2005). Computer adaptive testing. Journal of Applied Measurement 6:109-27.
Gershon, R.C. &
Bergstrom, B. (1991, April). Individual differences in computer adaptive
testing: Anxiety, computer literacy, and
satisfaction. Paper presented at the
annual meeting of the National Council on Measurement in Education.
Gershon, R.C. &
Bergstrom, B. (1995, April). Does cheating on CAT pay: Not. Paper presented at the annual meeting of the
American Educational Research Association,
#GI79-06. Gialluca, K. A., & Weiss, D. J. (1979). Efficiency of an adaptive
inter-subtest branching strategy in the measurement of classroom achievement
(Research Report 79-6).
Gibbons, R. D., Weiss,
D. J., Kupfer, D. J., Frank, E., Fagiolini, A., Grochocinski, V. J., Bhaumik,
D., K., Stover, A., Bock, R. D., & Immekus, J. C. (2008). Using
computerized adaptive testing to reduce the burden of mental health assessment.
Psychiatric Services, 59(4), 361-368. {PDF
file, 107 KB}
Gierl,
M. J. & Jiawen Zhou, J. (2008). Computer
adaptive-attribute testing: A new approach to cognitive diagnostic
assessment. Zeitschrift für
Psychologie / Journal of Psychology, 216(1), 29–39.
Giouroglou, H. &
Economides, A.A. (2003) Cognitive CAT in foreign language assessment.
Proceedings 11th International PEG Conference, Powerful ICT Tools for Learning
and Teaching, PEG '03, CD-ROM, 2003.
Giouroglou, H. &
Economides, A.A. (2004). State-of-the-art and adaptive open-closed items in
adaptive foreign language assessment. Proceedings 4th Hellenic Conference with
International Participation: Informational and Communication Technologies in
Education, Athens, 747-756, 2004.
Giouroglou, H. &
Economides, A.A. (2005). An implemented theoretical framework for a common
European foreign language adaptive assessment. Proceedings ICODL 2005, 3rd
International Conference on Open and Distance Learning 'Applications of
Pedagogy and Technology', 339-350, Greek Open University, Patra,
Greece, 2005.
Giouroglou, H. &
Economides, A.A. (2005). The development of the adaptive item language
assessment (AILA) for mixed-ability students. Proceedings E-Learn 2005 World
Conference on E-Learning in Corporate, Government, Healthcare, and Higher
Education, 643-650, Vancouver, Canada, AACE, October 2005.
Glas, C. A. W. (1998). Quality control of on-line calibration in
computerized adaptive testing (Research Report 98-03). Enschede, The
Glas, C. A. W. (1988).
The Rasch model and multi-stage testing.
Journal of Educational and Behavioral Statistics, 13, 45-52.
Glas, C. A. W. (2000). Item calibration and parameter drift. In W. J. van der linden & C. A. W. Glas
(Eds.). Computerized adaptive teting: Theory and practice (pp.183-199). Norwell MA: Kluwer Academic.
Glas, C. A. W., Meijer, R.
R., & van Krimpen-Stoop, E. M. L. A. (1997). Statistical tests for person misfit in
computerized adaptive testing (Research Report RR 97-08). Enschede, The
Glas, C. A. W. & Van der
Linden, W. J. (2001). Modeling variability in item parameters in
CAT. Paper presented at the Annual
Meeting of the National Council on Measurement in Education,
#GL03247. Glas, C. A. W. & Van der Linden, W. J. (2003).
Computerized adaptive testing with item cloning. Applied Psychological Measurement, 27,
247-261. (Also Research Report 01-10, Univerity of Twente.)
Glas, C. A. W., &
Veerkamp, W. J. J. (1999). Item calibration and parameter drift. In W. J. van
der Linden & C. A. W. Glas (Eds.), Computer adaptive testing: Theory and
practice. Norwell MA: Kluwer.
#GL98-01. Glas, C. A. W., Meijer, R. R., & van
Krimpen-Stoop, E. M. L. A. (1998).
Statistical tests for person misfit in computerized adaptive testing
(Research Report 98-01). Enschede, The
Glas, C.A.W., Wainer,
H., & Bradlow, E.T. (2000). MML and
EAP estimation in testlet-based adaptive testing. Dans W.J. van der Linden et
C.A.W. Glas (Es) : Computerized
adaptive testing: Theory and practice.
#GL98-15. Glas, C. A. W. & Vos, H. J. (1998). Adaptive mastery testing using the
Rasch model and Bayesian sequential decision theory (Research Report
98-15). Enschede, The
#GL00-01. Glas, C. A. W. & Vos, H. J. (2000). Adaptive mastery testing using a
multidimensional IRT model and Bayesian sequential decision theory (Research
Report 00-06). Enschede, The
Gorham,
W. A. ( 1976). Opening remarks. In W. H. Gorham (Chair), Computers and testing: Steps toward the inevitable conquest (PS
76-1). Symposium presented at the 83rd
annual convention of the American Psychological Association,
Gorin, J., Dodd, B. G., Fitzpatrick, S. J., & Shieh, Y. Y.
(2005). Computerized adaptive testing with the partial credit model: Estimation
procedures, population distributions, and item pool characteristics. Applied
Psychological Measurement, 29, 533-546.
#GO80-01. Gorman, S.
(1980). A comparative evaluation
of two Bayesian adaptive ability estimation procedures. Unpublished doctoral dissertation, the
Catholic University of America.
#GO80-02. Gorman, S. (1980). A comparison of the
accuracy of Bayesian adaptive and static tests using a correction for
regression. In D. J. Weiss (Ed.),
Proceedings of the 1979 Computerized Adaptive Testing Conference (pp.
35-50).
#GR01-01. Grabovsky,
Greaud, V. A., & Green, B.
F. (1984). Analysis of speeded test data from experimental CAT system.
Greaud, V. A., & Green,
B. F. (1986). Equivalence of conventional and computer presentation of speed
tests. Applied Psychological
Measurement, 10, 23-34.
#GR70184. Green, B. F. (1970). Comments on tailored testing. In W. H. Holtzman, (Ed.), Computer-assisted
instruction, testing, and guidance (pp. 184-197).
#GR75-01. Green, B. F. (1976). Discussion.
In C. K. Clark (Ed.), Proceedings
of the First Conference on Computerized Adaptive Testing (pp. pp.
118-119).
Green, B. F. (1983).
The promise of tailored tests. In H. Wainer & S. Messick (Eds.).,
Principals of modern psychological
measurement (pp. 69-80).
Green, B. F. (1983). Adaptive testing by computer. In R. B. Ekstrom (ed.), Measurement,
technology, and individuality in education.
New directions for testing and measurement, Number 17.
Green, B. F. (1988). Construct validity of computer-based
tests. In H. Wainer and H. Braun (Eds.),
Test validity (pp. 77-103).
#GR88223. Green, B. F. (1988). Critical problems in computer-based
psychological measurement, Applied
Measurement in Education, 1, 223-231.
Green, B. F. (1997,
March). Alternate methods of scoring
computer-based adaptive tests. Paper
presented at the annual meeting of the National Council on Measurement in
Education,
Green, B. F., Bock, R. D.,
Humphreys, L. G., Linn, R. L., & Reckase, M. D. (1984). (11982, May). Evaluation plan for the computerized adaptive
vocational aptitude battery (Research Report 82-1).
Green, B. F., Bock, R. D.,
Humphreys, L. G., Linn, R. L., & Reckase, M. D. (1984). Technical
guidelines for assessing computerized adaptive tests. Journal of Educational
Measurement, 21, 347-360.
Green, B. F., Bock, R. D.,
Linn, R. L., Lord, F. M., & Reckase, M. D. (1984). A plan for scaling the
computerized adaptive Armed Services Vocational Aptitude Battery. Journal of
Educational Measurement, 21, 347-360.
Green, B. F. & Thomas,
T. J. (1990). Utility of predicting
starting abilities in sequential computer-based adaptive tests (Research Report
90-1).
Grist, S., Rudner, L. M. & Wise, L. L.
Computerized adaptive tests. ERIC Clearinghouse on Tests, Measurement, and
Evaluation, no. 107.
Gu, L. & Reckase, M.D. (2007).
Designing optimal item pools for computerized adaptive tests with
Sympson-Hetter exposure control. In D.
J. Weiss (Ed.), Proceedings of the 2007
GMAC Conference on Computerized Adaptive Testing. {PDF
file, 1.13 MB}
#GU75-01. Gugel, J. F. Schmidt, F. L., & Urry, V.
W. (1976). Effectiveness of the
ancillary estimation procedure. In C. K.
Clark (Ed.), Proceedings of the First
Conference on Computerized Adaptive Testing (pp. 103-106).
#GU02-01. Guille, R.
Lipner, R. S., & Norcini, J. J. (2002, April). Content-stratified random item selection in
computerized classification testing.
Paper presented at the annual meeting of the National Council on
Measurement in Education,
Guo, F. (1999, April). Managing CAT item development in the face of
uncertainty. Paper presented at the annual
meeting of the National Council on Measurement in Education,
Guo, F.
(2007). CAT Security: A
practitioner’s perspective. In D. J.
Weiss (Ed.), Proceedings of the 2007
Guo, F. (2009). Quantifying the impact of compromised items
in CAT. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on
Computerized Adaptive Testing. {PDF
File, 438 KB}
#GU03-01. Guo, F. & Wang, G. (2003, April). Online calibration and scale stability of a
CAT program. Paper presented at the
annual meeting of the American Educational Research Association,
Guo, F. Stone, E. &
Cruz, D. (2001). On-line Calibration Using PARSCALE Item Specific Prior Method:
Changing Test Population and Sample Size.
Paper presented at National Council on Measurement in Education Annual Meeting,
Guo, F., Way, W. D., &
Reshetar, R. (2000, April). Test
security and the development of computerized tests. Paper presented at the National Council on
Measurement in Education invited symposium: Maintaining test security in
computerized programs--Implications for practice,
Gushta,
M. M. (2003). Standard-setting issues in computerized-adaptive testing. Paper
Prepared for Presentation at the Annual Conference of the Canadian Society for
Studies in Education,
Guyer, R. D.
(2008). Effect of early misfit in computerized adaptive testing on the recovery
of theta.
Unpublished Ph.D. dissertation,
Guyer, R. D. and Weiss, D. J. (2009). Effect of early misfit in computerized
adaptive testing on the recovery of theta. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on
Computerized Adaptive Testing. {PDF File, 212 KB}
- H -
Hadidi, A. & Luecht, R. M. (1997, March). Psychometric mode effects and fir issues with
respect to item difficulty estimates.
Paper presented at the annual meeting of the National Council on
Measurement in Education,
Haley, S. M., Coster, W. J., Andres, P.
L., Kosinski, M. & Ni, P. S. (2004). Score comparability of short-forms and
computerized adaptive testing: Simulation study with the activity measure for
post-acute care (am-pac). Archives
of Physical Medicine and Rehabilitation,
85, 661-666.
Haley, S. M., Ni, P., Hambleton, R. K.,
Slavin, M. D. & Jette, A. M. (2006). Computer adaptive testing improves
accuracy and precision of scores over random item selection in a physical
functioning item bank. Journal
of Clinical Epidemiology, 59, 1174-1182.
Halkitis, P. N. & Leahy,
J. M. (1993). Computerized adaptive
testing: The future is upon us. Nursing and Health Care, 14, 378-85.
Hambleton, R. H.
(1973). A review of testing and
decision-making procedures (Technical Bulletin No. 15).
Hambleton,
R. K. (1974). Testing and
decision-making procedures for selected individualized instruction
programs. Review of Educational Research, 10, 371-400.
Hambleton, R. K. (2002, April). Impact of item quality and item bank size on
the psychometric quality of computer-based credentialing exams. Paper presented at the annual meeting of the
National Council on Measurement in Education, New Orleans LA.
Hambleton, R. K. (2005). Applications
of item response theory to improve health outcomes assessment: Developing item
banks, linking instruments, and computer-adaptive testing. In J. Lipscomb, C.
C. Gotay, & C. Snyder (Eds.), Outcomes assessment in cancer (pp.445-464).
Cambridge, UK: Cambridge University Press.
See #JO02-01. Hambleton, R. K., Jodoin, M., & Zenisky,
A. (2002, April). Impact of selected factors on the
psychometric quality of credentialing examinations administered with a
sequential testlet design. Paper
presented at the annual meeting of the National Council on Measurement in
Education,
Hambleton , R. & Xing, D.
(2004). Computer-based test designs with optimal and non-optimal tests for
making pass-fail decisions. Research Report,
Hambleton, R. K., Zaal, J.
N., & Pieters, J. P. M. (1991). Computerized adaptive testing: Theory, applications, and standards. In R. K.
Hambleton & J. N. Zaal (Eds.), Advances in educational and psychological
testing: Theory and Applications (pp.
341-366).
Han,
N. (2003). Using moving averages to assess test and item security in
computer-based testing (Center for Educational Assessment Research Report No.
468).
Han, K. T. (2009).
A gradual maximum information ratio approach to item selection in
computerized adaptive testing. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive
Testing. {PDF file, 391 KB}
#HA04-01. Han, N. & Hambleton, R. K. (2004).
Detecting exposed test items in
computer-based testing. Paper presented
at the annual meeting of the National Council on Measurement in Education,
Handel, R. W. Ben-Porath,
Y.S., & Watt, M. (1997, June).
Comparability and validity of computerized adaptive testing with the
MMPI-2 using a clinical sample. Paper
presented at the 32nd Annual Symposium and Recent Developments in
the use of the MMPI-2 and MMPI-A.
#HA99369. Handel, R. W. Ben-Porath, Y.S., & Watt,
M. (1999). Computerized adaptive
assessment with the MMPI-2 in a clinical setting. Psychological Assessment, 11, 369-380.
Hankins, J. A. (1987).
The effects of variable entry on bias and information of the Bayesian
adaptive testing procedure. Dissertation
Abstracts International, 47 (8A), 3013.
Hankins, J. A. (1990).
The effects of variable entry on bias and information of the Bayesian
adaptive testing procedure. Educational
and Psychological Measurement, 50, 785-802.
Hansen, D. N. (1969).
An investigation of computer-based science testing. In R. C. Atkinson and H. A. Wilson (Eds.),
Computer-assisted instruction: A book of readings.
#HA75-01. Hansen, D. N. (1976). Reflections on adaptive testing. In C. K. Clark (Ed.), Proceedings of the First Conference on
Computerized Adaptive Testing (pp. 90-94).
#HA68-01. Hansen, D. N. & Schwarz, G. (1968,
March). An investigation of computer-based
science testing.
Hansen, D. N., Johnson, B.
F., Fagan, R. L., Tan, P., & Dick, W.
(1974/1975). Computer-based
adaptive testing models for the Air Force technical training environment: Phase I.
Development of a computerized measurement system for Air Force technical
Training. JSAS Catalogue of Selected
Documents in Psychology, 5, 1-86 (MS No. 882).
AFHRL Technical Report 74-48.
Hansen, D. N., Ross, S.,
& Harris, D. A. (1977). Flexilevel adaptive testing paradigm: Validation in technical training. AFHRL Technical Report 77-35 (I).
Hansen, D. N., Ross, S.,
& Harris, D. A. (1977). Flexilevel adaptive training paradigm: Hierarchical concept structures. AFHRL Technical Report 77-35 (II).
Hansen, D. N. & Schwarz,
G. An investigation of computer-based
science testing.
Hardwicke, S., Vicino, F.,
McBride, J.R., & Nemeth, C. (1984).
Evaluation of computerized adaptive testing of the ASVAB.
Hardwicke, S. & White,
K. E. (1983). Predictive utility evaluation of adaptive
testing: Results of the Navy research.
Harman, H. H., Helm, C. E.,
& Loye, D. E. (Eds.).
Computer-assisted testing.
#HA01-01. Harmes, J. C., Kromrey, J. D., &
Parshall, C. G. (2001, October). Online item
parameter recalibration: Application of missing data treatments to overcome the
effects of sparse data conditions in a computerized adaptive version of the
MCAT. Unpublished manuscript. {PDF file,
406 KB}
#HA03-01. Harmes, J. C., Parshall, C. G., &
Kromrey, J. D. (2003, April). Recalibration of IRT item parameters in CAT: Sparse data matrices and missing data
treatments. Paper presented at the
annual meeting of the National Council on Measurement in Education,
Harris, J. D. & Smith,
P. F. (1979). A comparison of a standard
and a computerized adaptive paradigm in Bekesy fixed-frequency audiometry.
Journal of Auditory Research, 19, 1-22.
Hart,
D. L., Cook, K. F., Mioduski, J. E., Teal, C. R., Crane, P. K. (2006).
Simulated computerized adaptive test for patients with shoulder impairments was
efficient and produced valid measures of function. Journal of Clinical Epidemiology, 59, 290–298.
Hart,
D. L., Mioduski, J. E., &
Hart, D., Mioduski, J., Werenke, M.
& Stratford, P. (2006). Simulated computerized adaptive test for patients
with lumbar spine impairments was efficient and produced valid measures of
function. Journal of Clinical Epidemiology, 59, 947-956
#HA01249. Hau, K.-T. & Chang, H.-H. (2001).
Item selection in computerized adaptive testing: Should more discriminating items be used
first? Journal of Educational
Measurement, 38, 249-266. (Also presented at American Educational Research Association,
1998)
Haynie, K.A., & Way,
W.D. (1994, April). The effects of item pool depth on the accuracy of pass/fail
decisions for NCLEX using CAT. Paper presented at the annual meeting of the
National Council on Measurement in Education,
Haynie, K.A., & Way,
W.D. (1995). An investigation of item
calibration procedures for a computerized licensure examination. Paper presented at the annual meeting of the
National Council on Measurement in Education,
Hau, K. T. & Chang, H.
H. (1998). Item selection in
computerized adaptive testing: Should more discriminating items be used first?
Paper presented at the annual meeting of
the American Educational Research Association,
Hau, K. T. & Chang, H.
H. (1998). Item selection in
computerized adaptive testing: Should more discriminating items be used
first? Journal of Educational
Measurement,38, 249-266.
Hendrickson, A. B. &
Kolen, M. J. (1992, April). Scaling of two-stage adaptive test
configurations for achievement testing.
Paper presented at the annual meeting of the National Council on
Measurement in Education, New Orleans LA.
#HE07044. Hendrickson, A. (2007). An NCME instructional module on multistage
testing. Educational Measurement: Issues
and Practice, 26(2), 44-52.
Henly, S. J., Klebe, K. J.,
McBride, J. R., & Cudeck, R. (1989). Adaptive and conventional versions of
the DAT: The first complete test battery comparison. Applied Psychological
Measurement, 13, 363-371.
Hetter, R. D., Segall, D.
O., & Bloxom, B. (1992,
October). Need title . Paper presented at the annual
conference of the Military Testing Association,
Hetter, R. D., Bloxom, B. M., & Segall, D. O.
(1993). Item Calibration:
Medium-of-administration effect on computerized adaptive scores (TR-93-9).
Navy Personnel Research and
Hetter, R. D., Segall, D.
O., & Bloxom, B. M. (1994). A comparison of item calibration media in
computerized adaptive tests. Applied Psychological
Measurement, 18, 197-204.
Hetter, R.D., Segall, D.O. & Bloxom, B.M. (1997). Evaluating item
calibration medium in computerized adaptive testing. In W.A. Sands, B.K. Waters & J.R.
McBride, Computerized adaptive testing:
From inquiry to operation (pp. 161-168).
Hetter, R. D., &
Sympson, J. B. (1997). Item exposure control in CAT-ASVAB. In W. A. Sands, B.
K. Waters, & J. R. McBride (Eds.), Computerized adaptive testing: From
inquiry to operation (pp. 141-144).
#HO89-01. Ho, R., & Hsu, T. C. (1989, March). A
comparison of three adaptive testing strategies using MicroCAT. Paper presented at the annual meeting of the
American Educational Research Association,
Ho, R.-G., & Yen, Y.-C. (2005). Design
and evaluation of an XML-based platform-independent computerized adaptive
testing system. IEEE Transactions on Education, 48(2), 230–237
Hockemeyer, C. (2002). A comparison of non-deterministic
procedures for the adaptive assessment of knowledge. Psychologische Beiträge, 44, 495–503.
Hogan, P.F., Dall, T. &
McBride, J.R. (1996) Preliminary
cost-effectiveness analysis of alternative ASVAB testing concepts at MET
sites. Interim report to
Hogan, P.F., McBride, J.R.
& Curran, L.T. (1995). An evaluation
of alternative concepts for administering the Armed Services Vocational
Aptitude Battery to applicants for enlistment.
DMDC Technical Report 95-013.
#HO06-01. Hol, A. M. (2006). A CAT with personality and attitude. Enschede, The
Hol, A. M.,
Hol, A. M., Vorst, H. C.
M., & Mellenbergh, G. J. (2005). A randomized experiment to compare
conventional, computerized, and computerized adaptive administration of ordinal
polytomous attitude items. Applied Psychological Measurement, 29, 159-183.
Hol, A. M.,
Holmes, R. M., & Segall, D. O. (DMDC) (1999, April).
Reducing item exposure without reducing precision (much) in computerized
adaptive testing. Paper presented at the
annual meeting of the National Council on Measurement in Education,
Holst, P. M., O’Donnell, A.
M., & Rocklin, T. R. (1992, April).
Effects of feedback during self-adapted testing on estimates of
ability. Paper presented at the annual
meeting of the American Educational Research Association,
#HO70198. Holtzman, W. H. (1970). Individually tailored testing:
Discussion. In W. H. Holtzman, (Ed.),
Computer-assisted instruction, testing, and guidance (pp.198-200).
Hontangas,
P., Olea, J., Ponsoda, V., Revuelta, J. & Wise, S.L. (2004). Assisted self-adapted
testing: A comparative study. European Journal of Psychological Assessment, 1,
2-9.
Hontangas, P., Ponsoda, V.,
Olea, J. & Wise, S.L. (2000). The choice of item difficulty in self adapted
testing. European Journal of Psychological Assessment, 16, 1, 3-12.
#HO77-01. Hornke, L. F. (1977, June). Four realizations of pyramidal adaptive
testing strategies. Paper presented at
the Third International Symposium on Educational Testing,
Hornke, L. F. (1979).
Four realizations of pyramidal adaptive testing. Programmed Larning and Educational
Technology, 16, 164-169.
Hornke, L. F. (1995). Item times in computerized testing—A new
differential information. European
Journal of Psychological Assessment, 11 (Suppl. 1) 108-109.
Hornke,
L. F. (1999). Benefits from computerized adaptive testing as seen in simulation
studies. European Journal of Psychological
Assessment, 15(2), 91-98.
#HO80-01. Hornke, L. F. & Sauter. M. B.
(1980). A validity study of an adaptive
test of reading comprehension. In D. J.
Weiss (Ed.), Proceedings of the 1979 Computerized Adaptive Testing Conference
(pp. 57-67).
Hou,
L., Chen, S., Dodd. B. G., & Fitzpatrick, S. J. (1996, April). The effects of methods of theta estimation,
prior distribution, and number of quadrature points on CAT using the graded
response model. Paper presented at the
annual meeting of the American Educational Research Association,
Hsu, T.-C. & Shermis, M. D. (1988). The development and evaluation of a
microcomputerized adaptive placement testing system for college
mathematics. Paper(s) presented at the
annual meeting(s) of the American Educational Research Association, 1986 (
Hsu, T. C. & Tseng, F. L. (1995). Using simulation to select an adaptive
testing strategy: An item bank
evaluation program. Unpublished manuscript,
Hsu, Y., Thompson, T.D., & Chen, W-H. (1998,
April). CAT item calibration. Paper
presented at the annual meeting of the National Council on Measurement in
Education,
Huang, C.-Y., Kalohn, J. C., Lin, C.-J., & Spray, J. (2000). Estimating item parameters from classical
indices for item pool development with a computerized classification test
(Research Report 2000-4).
Huang, S. X. (1996).
A content-balanced adaptive testing algorithm for computer-based
training systems. In Frasson, C., Gauthier, G., & Lesgold, A. (Eds.), Intelligent Tutoring
Systems, Third International Conference, ITS'96, Montréal, Canada, June 1996
Proceedings. Lecture Notes in Computer Science 1086.
Hubbard, J. P.
( 1966). Programmed testing in
the examinations of the National Board of Medical Examiners. In A. Anastasi (Ed.), Testing problems in
perspective.
Huebner, A., Wang, B., & Lee, S. (2009). Practical issues concerning the application of the DINA model to CAT data. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF file, 139 KB}
#HU01016. Huff, K. L. & Sireci, S. G. (2001).
Validity issues in computer-based testing. Educational Measurement: Issues and Practice, 20(3), 16-25.
Huisman,
J. M. E. (1999). Item nonresponse: Occurrence, causes and imputation of missing
answers to test items. (M & T Series No. 32).
Huisman,
J. M. E., & Molenaar,
Hutt,
M. L. (1947). A clinical study of “consecutive” and “adaptive” testing with the
revised Stanford-Binet. Jurnal of
Consulting Psychology, 11, 93-103.
Imai, S., Ito, S.,
Nakamura, Y., Kikuchi, K., Akagi, Y., Nakasono, H., Honda, A., & Hiramura,
T. (2009). Features of J-CAT (Japanese Computerized
Adaptive Test). In D. J. Weiss (Ed.), Proceedings
of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF
File, 655KB}
Immekus, J. C., Gibbons, R.D., & Rush, J. A.
(2007). Patient-reported outcomes measurement and computerized adaptive
testing: An application of post-hoc simulation to a diagnostic screening
instrument. In D. J. Weiss (Ed.). Proceedings of the 2007 GMAC Conference on Computerized
Adaptive Testing. {PDF file, 203
KB}
Ito,
K., Pommerich, M., & Segall, D. (2009).
An evaluation of a new procedure for computing information functions for
Bayesian scores from computerized adaptive tests. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on
Computerized Adaptive Testing. {PDF file, 571 KB}
Ito, K. & Sykes, R.C. (1994). The
effect of restricting ability distributions in the estimation of item
difficulties: Implications for a CAT
implementation. Paper presented at the
annual meeting of the National Council on Measurement in Education,
Iwamoto, C. K., Nungester,
R. J., & Luecht, R. M. (1999, April).
Study of methods to detect aberrant response patterns in computerized
testing. Paper presented at the annual
meeting of the National Council on Measurement in Education,
Jacobs-Cassuto, M.S. (2005). A comparison of adaptive mastery testing
using testlets with the 3-parameter logistic model. Unpublished doctoral dissertation, University
of Minnesota, Minneapolis, MN.
Jacobson, R. L. (1993,
September 13). New computer technique
seen producing a revolution in testing.
The Chronicle of Higher Education, p A22.
Jacobson, R. L. (1995, January 6). Shortfall of questions curbs use of
computerized graduate exam. The
Chronicle of Higher Education, A23.
Janczewski, D. & Lowe, P. (1992). The
Language Training Division's computer adaptive reading proficiency test.
#JE72-01. Jensema, C. J. (1972). An application of latent trait mental test
theory to the Washington Pre-College Testing Battery. Unpublished doctoral dissertation,
#JE74029. Jensema, C. J. (1974). An application of latent trait mental test
theory. British Journal of Mathematical
and Statistical Psychology, 27, 29-48.
Jensema, C. J. (1974). The validity of Bayesian tailored
testing. Educational and Psychological
Measurement, 34, 757-756.
#JE75-01. Jensema, C. J. (1976). Bayesian tailored testing and the influence
of item bank characteristics. In C. K.
Clark (Ed.), Proceedings of the First
Conference on Computerized Adaptive Testing (pp. 82-89).
Jensema, C. J. (1977). Bayesian tailored testing and the influence of item bank characteristics. Applied Psychological Measurement, 1, 111-120.
Jette, A., Haley, S., Tao, W., Ni, P.,
Moed, R., Meyers, D. & Zurek, M. (2007). Prospective evaluation of the
am-pac-cat in outpatient rehabilitation settings. Physical Therapy, 87, 385-398.
Jhu, Y.-J., & Chen,
S.-Y. (2008). Item exposure control in a-stratified computerized adaptive
testing. Psychological Testing, 55, 793-811.
#JI03-01. Jiao, H. & Lau, A. C. (2003, April). The effects of model misfit in computerized
classification test. Paper presented at
the annual meeting of the National Council on Measurement in Education,
#JI04-01. Jiao, H., Wang, S., & Lau, A.(2004). An investigation of two combination
procedures of SPRT for three-category decisions in computerized classification
test. Paper presented at the annual
meeting of the American Educational Research Association,
Jodoin,
M. G. (2002, June). Reliability and decision accuracy of linear parallel
form and multi stage tests with realistic and ideal item pools. Paper
presented at the International Conference on Computer-Based Testing and the
Internet,
Jodoin, M. (2003, April). A multidimensional IRT mechanism for better
understanding adaptive test behavior.
Paper presented at the annual meeting of the National Council on
Measurement in Education,
#JO02-01. Jodoin, M., Zenisky,
A., & Hambleton, R. (2002,
April). Comparison of the psychometric properties of several computer-based test
designs for credentialing exams. Paper presented at the annual meeting of
the National Council on Measurement in Education,
Johnson, J. L., Roos, L. L.,
Wise, S. L., & Plake, B. S. (1991).
Correlates of examinee item choice behavior in self-adapted
testing. Mid-Western Eduactional
Researcher, 4, 25-28.
#JO79-01. Johnson, M. J. (1979).
Student reaction to computerized adaptive testing in the classroom. Paper presented at the 87th annual
meeting of the American Psychological Association,
#JO80-01. Johnson, M. J. &
Weiss, D. J. (1980). Parallel forms
reliability and measurement accuracy comparison of adaptive and conventional
testing strategies. In D. J. Weiss
(Ed.), Proceedings of the 1979 Computerized Adaptive Testing Conference (pp.
16-34).
#JO73083. Jones, D. &
Weinman, J. (1973). Computer-based
psychological testing. In A. Elithorn
& D. Jones (Eds.), Artificial and
human thinking (pp. 83-93).
Jones, D. H. (1997, March). Mathematical programming approaches to
computerized adaptive testing. Paper presented at the annual meeting of the
National Council on Measurement in Education,
Jones-Dickson, C., Dorsey,
D., Campbell-Warnock, J., & Fields F. (1993). Moving in a new direction: Computerized
adaptive testing (CAT). Nursing Management, 24, 80-82.
Kalisch, S. J. (1973). A tailored testing model employing the beta
distribution and conditional difficulties.
Journal of Computer-Based Instruction, 1, 111-120.
Kalisch, S. J. (1974). A tailored testing model employing the beta
distribution (unpublished manuscript).
Kalisch, S. J. (1974). A tailored testing model employing the beta
distribution and conditional difficulties.
Journal of Computer-Based Instruction, 1, 22-28.
Kalisch, S. J. (1974). The comparison of two tailored testing models
and the effects of the models’ variables on actual loss. Unpublished doctoral dissertation,
#KA80-01. Kalisch, S. J. (1980).
A model for computerized adaptive testing related to instructional
situations. In D. J. Weiss (Ed.). Proceedings of the 1979 Computerized Adaptive
Testing Conference (pp. 101-119).
Kalisch, S. J. (1980,
February). Computerized instructional adaptive
testing model: Formulation and
validation (AFHRL-TR-79-33, Final Report).
Brooks Air Force
Kalohn, J. C. & Spray,
J. A. (1998, April). Effect of item
selection on item exposure rates within a computerized classification
test. Paper presented at the annual
meeting of the National Council on Measurement in Education,
#KA99047. Kalohn, J. C. & Spray, J. A. (1999). The effect of model misspecification on
classifications decisions made using a computerized test. Journal of Educational Measurement,36, 47-59.
Kalohn, J. (2000). Test security and item exposure
control for computer-based … Paper
presented at the annual meeting of the National Council on Measurement in
Education,
Kappauf, W. E. (1969). Use of an on-line computer for psychological
testing with the up-and-down method.
American Psychologist, 24, 207-211.
Karino, C. A., Costa, D. R., & Laros, J. A. (2009). Adequacy
of an item pool measuring proficiency in English language to implement a CAT
procedure. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on
Computerized Adaptive Testing. {PDF File, 160 KB}
Kiely, G. L., Zara, A. R.,
& Weiss, D. J. (1983). Alternate
forms reliability and concurrent validity of adaptive and conventional tests
with military recruits.
Killcross, M. C. (1974,
August). A tailored testing system for selection
and allocation in the British Army.
Paper presented at the 18th International Congress of Applied
Psychology,
Killcross, M. C.
(1976). A review of research in tailored
testing (Report APRE No. 9/76).
Farnborough, Hants, U. K.: Ministry of Defence, Army Personnel Research
Establishment.
Kim, J. (1993).
Individual differences in computerized adaptive testing. Paper presented at the annual meeting of the
Mid-South Educational Research Association,
Kim, J. & McLean, J.
E. (1995, April). The influence of examinee test-taking
behavior motivation in computerized adaptive testing. Paper presented at the annual meeting of the
National Council on Measurement in Education,
Kim, H., & Plake, B. S.
(1993, April).
Kim-Kang, G. & Weiss, D. J. (2007). Comparison of computerized adaptive testing
and classical methods for measuring individual change. In D. J. Weiss (Ed.). Proceedings
of the 2007 GMAC Conference on Computerized Adaptive Testing. {PDF file, 347 KB}
Kim-Kang, G. & Weiss. D. J. (2008). Adaptive measurement of individual change. Zeitschrift für Psychologie / Journal of Psychology, 216, 49-58. {PDF file, 568 KB}
Kingsbury, G. G.
(1984). Adaptive self-referenced testing
as a procedure for the measurement of individual change in instruction: A comparison of the reliabilities of change
estimates obtained from conventional and adaptive testing procedures. Unpublished doctoral dissertation, Univerity
of Minnesota,
Kingsbury, G. G. (1985).
Adaptive self-referenced testing as a procedure for the measurement of
individual change: A comparison of the
reliabilities of change estimates obtained from conventional and adaptive
testing procedures. Dissertation
Abstracts International, 45 (9-B), 3057.
Kingsbury, G. . (1986). Computerized adaptive testing: A pilot
project. In W. C. Ryan (ed.),
Proceedings: NECC ’86, National Educational Computing Conference (pp.172-176).
Kingsbury, G. G. et al. (1988).
Computerized adaptive testing: A
four-year-old pilot study shows that CAT can work. Technological Horizons in Education, 16 (4),
73-76.
#KI90003. Kingsbury, G. G. (1990). Adapting adaptive testing: Using the MicroCAT testing in a local school
district. Educational Measurement: Issues and Practice,9 (2), 3-6, 29.
Kingsbury, G. G. (1991). A
comparison of procedures for content-sensitive item selection. Applied Measurement in Education, need page numbers.
Kingsbury, G. G. (1996,
April). Item review and adaptive
testing. Paper presented at the annual
meeting of the National Council on Measurement in Education,
Kingsbury, G. G. (1997, March). Item pool development and maintenance. Paper presented at the annual meeting of the
National Council on Measurement in Education,
Kingsbury, G. G. (1997, March). Some questions that must be addressed to
develop and maintain an item pool for use in an adaptive test. Paper presented at the annual meeting of the
National Council on Measurement in Education,
Kingsbury, G. G. (1999,
April). Standard errors of proficiency
estimates in stratum scored CAT. Paper
presented at the annual meeting of the National Council on Measurement in
Education,
#KI02-01. Kingsbury, G. G. (2002, April). An empirical comparison of achievement level
estimates from adaptive tests and paper-and-pencil tests. Paper presented at the annual meeting of the
American Educational Research Association,
Kingsbury, G. G. (2009). Adaptive item calibration: A process for estimating
item parameters within a computerized adaptive test. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on
Computerized Adaptive Testing. {PDF
File, 286 KB} {PDF File, 286 KB}
#KI88-01. Kingsbury, G. G.,
& Houser, R. L. (1988, April). A comparison of achievement level estimates
from computerized adaptive testing and paper-and-pencil testing. Paper presented at the annual meeting of the
American Educational Research Association,
#KI89-01. Kingsbury, G. G., & Houser, R. L. (1989,
March). Assessing the impact of using
item parameter estimates obtained from paper-and-pencil testing for
computerized adaptive testing. Paper
presented at the annual meeting of the National Council on Measurement in
Education,
Kingsbury, G. G. &
Houser, R. L. (1990, April). Assessing the utility of item response models:
Computerized adaptive testing. A paper presented to the annual meeting of the
National Council of Measurement in Education,
#KI93-01. Kingsbury, G. G., & Houser, R. L. (1993,
April). A practical examination of the
use of free-response questions in computerized adaptive testing. Paper presented to the annual meeting of the
American Educational Research Association:
Kingsbury, G. G., &
Houser, R. L. (1993). Assessing the utility of item response models:
Computerized adaptive testing. Educational Measurement: Issues and Practice,
12, 21-27, 39.
Kingsbury, G. G. &
Houser, R.L. (1999). Developing
computerized adaptive tests for school children. In F. Drasgow & J. B. Olson-Buchanan
(Eds.), Innovations in computerized assessment (pp. 93-115).
#KI04-01.
Kingsbury, G. G. & Hauser, C. (2004). Computer adaptive testing and the No Child
Left Behind Act. Paper presented at the
annual meeting of the American Educational Research Association,
Kingsbury, G. G. &
Houser, R. L. (2007). ICAT: An adaptive
testing procedure to allow the identification of idiosyncratic knowledge
patterns. In D. J. Weiss (Ed.). Proceedings of the 2007 GMAC Conference on
Computerized Adaptive Testing. {PDF
file, 161 KB}
Kingsbury,
G. G. & Houser, R/ L. (2008). ICAT:
An adaptive testing procedure for the identification of idiosyncratic knowledge
patterns. Zeitschrift für Psychologie / Journal of Psychology, 216(1), 40–48.
#KI79-05. Kingsbury, G. G.
& Weiss, D. J. (1979). An adaptive
testing strategy for mastery decisions (Research Report 79-5).
#KI80-04. Kingsbury, G. G., & Weiss, D. J. (1980).
A comparison of adaptive, sequential, and conventional testing strategies for
mastery decisions (Research Report 80-4).
#KI80-05. Kingsbury, G. G.,
& Weiss, D. J. (1980). An alternate-forms reliability and concurrent
validity comparison of Bayesian adaptive and conventional ability tests
(Research Report 80-5).
#KI80-01. Kingsbury, G. G. & Weiss, D. J. (1980).
A comparison of ICC-based adaptive mastery testing and the Waldian
probability ratio method. In D. J. Weiss
(Ed.). Proceedings of the 1979
Computerized Adaptive Testing Conference (pp. 120-139).
#KI81-03. Kingsbury, G. G. & Weiss, D. J.
(1981). A validity comparison of
adaptive and conventional strategies for mastery testing (Research Report
81-3).
Kingsbury, G. G. & Weiss, D. J. (1983). A
comparison of IRT-based adaptive mastery testing and a sequential mastery
testing procedure. In D. J. Weiss (Ed.), New
horizons in testing: Latent trait test
theory and computerized adaptive testing (pp. 257-283).
Kingsbury, G. G., &
Zara, A. R. (1989). Procedures for selecting items for computerized adaptive
tests. Applied Measurement in Education, 2, 359-375.
Kingsbury, G. G., &
Zara, A. R. (1991). A comparison of procedures for content-sensitive item
selection in computerized adaptive tests. Applied Measurement in Education, 4,
241-261.
#KI99-1. Kingsbury, G. G., & Zara, A. R. (1999,
April). A comparison of conventional and
adaptive testing procedures for making single-point decisions. Paper presented at the annual meeting of the
National Council on Measurement in Education,
Kingsbury, G. G., &
Zara, A. R. (1999, April). A procedure
to compare conventional and adaptive testing procedures for making single-point
decisions. Paper presented at the annual
meeting of the National Council on Measurement in Education,
Kirisci, L. & Hsu, T.-C.
(1988, April). A predictive analysis
approach to adaptive testing. Paper
presented at the annual meeting of the American Educational Research
Association,
Kirisci, L. (1992).
Estimation of ability level by using only observable quantities in adaptive
testing. Paper presented at the annual
meeting if the American Educational Research Association,
Koch, W. R. & Patience,
W. M. (1977). Student attitudes toward
tailored testing. In D. J. Weiss (Ed.),
Proceedings of the 1977 Computerized Adaptive Testing Conference.
Koch, W. R., & Dodd, B.
G. (1985, April). Computerized adaptive attitude measurement. Paper presented
at the annual meeting of the American Educational Research Association,
Chicago.
#KO86-01. Koch, W. R. & Dodd. B. G. (1986,
April). Operational characteristics of
adaptive testing procedures using partial credit scoring. Paper presented at the annual meeting of the
American Educational Research Association,
Koch, W. R., & Dodd, B.
G. (1989). An investigation of procedures for computerized adaptive testing
using partial credit scoring. Applied Measurement in Education, 2, 335-357.
Koch, W. R., & Dodd, B.
G. (1995). An investigation of procedures for computerized adaptive testing
using the successive intervals Rasch model. Educational and Psychological
Measurement, 55, 976-990.
Koch, W. R., Dodd, B. G.,
& Fitzpatrick, S. J. (1990). Computerized adaptive measurement of
attitudes. Measurement and Evaluation in Counseling and Development, 23, 20-30.
Koch, W. J. & Reckase,
M. D. (1978). A live tailored testing
comparison study of the one- and three-parameter logistic models (Research Report 78-1).
Koch, W. J. & Reckase,
M. D. (1979). Problems in application of
latent-trait models to tailored testing (Research Report 79-1).
Kolen, M. J. (1999-2000). Threats to score comparability with
applications to performance assessments and computerized adaptive tests. Educational Assessment, 6, 73-96.
Krass,
#KR98-01. Krass,
#KR00-01. Krass,
#KR01-01. Krass,
Krass,
#KR03-01. Krass,
Krathwohl, D. (1959).
Progress report on the sequential item test.
Krathwohl, D. R. &
Huyser, R. J. (1956). The sequential
item test. American Psychologist, 2,
419.
Kreiter, C. D.,
Kreitzberg, C. B. (1978). Computerized adaptive testing: Principles and directions. Computers and Education, 2 (4), 319-329.
Kreitzberg, C. B. &
Jones, D. J. (1980). An empirical study
of a broad range test of verbal ability.
Kreitzberg, C. B., Stocking,
M., & Swanson, L. (1978).
Computerized adaptive testing: Principles and directions. Computers and Education, 2, 319-329.
Krimpen-Stoop, E. M. L.A.
van and Meijer, R. R. (1999a).
CUSUM-based person-fit statistics for adaptive testing. Technical Report RR 99-05, Univeristy of
Twente, Enschede, The
Krimpen-Stoop, E. M. L.A.
van and Meijer, R.R. (2000). The null
distribution of person-fit statistics for conventional and adaptive tests. Applied Psychological Measurement, 23,
327-345.
Krimpen-Stoop, E. M. L.A. van and Meijer, R. R.. (2000). Detecting person misfit in adaptive testing using statistic