Bibliography on Computerized Adaptive Testing (CAT)

(including related literature on sequential testing)

 

Updated March 26, 2011

Compiled by David J. Weiss, University of Minnesota

djweiss@umn.edu

 

Filed manuscripts are identified by # followed by a manuscript number

 

A  B  C  D  E  F  G  H  I  J  K  L  M  N  O  P  Q  R  S  T  U  V  W  X  Y  Z

- A -

#AB95-01 (also see #SP95-01).  Abdel-Fattah, A.. A, Lau, C.-M. A., & Spray, J. A. (1995, June).  The effect of model misspecification on classification decisions made using a computerized test: UIRT versus MIRT. Paper presented at the annual meeting of the Psychometric Society, Minneapolis MN.

Abdel-Fattah, A.. A, Lau, C.. A., & Spray, J. A. (1996, April).  Effect of altering passing score in CAT when unidimensionality is violated.  Paper presented at the annual meeting of the American Educational Research Association, New York NY. 

#AB00-01.  Abdullah, S. C. &  Cooley, R. E. (2000).  Using constraints to develop and deliver adaptive tests.  Paper presented at the Computer-Assisted Testing Conference.  {PDF file, 46 KB}

#AC87-13.  Ackerman, T. A. (1987). The use of unidimensional parameter estimates of multidimensional items in adaptive testing (ACT Research Report series 87-13).  Iowa City IA:  The American College Testing Program.

Ackerman, T. A. (1991). The use of unidimensional parameter estimates of multidimensional items in adaptive testing. Applied Psychological Measurement, 15, 13-24.

Adams, R. J. (1987).  Adaptive testing, information, and the partial credit model.  Melbourne, Australia:  University of Melbourne, Center for the Study of Higher Education.

Adema, J. J. (1990). The construction of customized two-staged tests. Journal of Educational Measurement, 27, 241-253.

Allred, L. A. & Green, B. F. (1984).  Analysis of experimental CAT ASVAB data.  Baltimore MD:  Johns Hopkins University, Department of  Psychology.

Almond, R. G. & Mislevy, R. J.  (1999.  Graphical models and computerized adaptive testing.  Applied Psychological Measurement, 23, 223-237.  (Also Educational Testing Service Research Report 98-4). 

#AM95-01.  American Council on Education. (1995).  Guidelines for computer-adaptive test development and use in education.  Washington DC: Author.

Anastasi, A. (1953).  An empirical study of the applicability of sequential analysis to item selection.  Educational and Psychological Measurement, 13, 3-13.

Anderson, D. (ETS).  (1999, April).  Use of conditional item exposure methodology for an operational CAT.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

Andrich, D. (1995).  Review of the book Computerized Adaptive Testing: A Primer.  Psychometrika, 4?, 615-620.

Angoff, W. H. & Huddleston, E. M. (1958).  The multi-level experiment: A study of a two-level test system for the College Board Scholastic Aptitude Test.  (Statistical Report 58-21).  Princeton NJ:  Educational Testing Service.

Ariel, A., Veldkamp, B. P., & van der Linden, W. J. (2002). Constructing rotating item pools for constrained adaptive testing. Submitted for publication.

Archer, R.P., Tirrell, C.A., & Elkins, D.E. (2001).  Evaluation of an MMPI--a short form: Implications for adaptive testing. Journal of  Personality Assessment,76, 76-89

#AR03-01.  Ariel, A., Veldkamp, B., &  van der Linden, W. J.  (2003, April). Constructing rotating item pools for constrained adaptive testing.  Paper presented at the Annual meeting of the National Council on Measurement in Education, Chicago IL.  {PDF file, 395 KB}

Armitage, P. (1950).  Sequential analysis with more than two alternative hypotheses, and its relation to discriminant function analysis.  Journal of the Royal Statistical Society, 12, 137-144.

Armstrong, R.D. & Edmonds, J.J. (2003). The assembly of multiple stage adaptive tests with discrete items. Newtown, PA: Law School Admission Council Report.

#AR04-01.  Armstrong, R. D. & Edmonds, J.  (2004, April).  A study of multiple stage adaptive test designs.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego CA.  {PDF file, 288 KB}

Armstrong, R. D. & Jones, D. H. (1998).  Computer adaptive testing – Approaches for item selection and measurement.  (Research report).  Rutgers Center for Operations Research, New Brunswick NJ.

Armstrong, R. D., Jones, D. H., & Berliner, N.  (1998, June).  Computerized adaptive testing with multiple form structures.  Paper presented at the annual meeting of the Psychometric Society, Urbana, IL.

#AR04147.  Armstrong, R.D., Jones, D. H., Koppel, N .B., & Pashley, P. J. (2004).  Computerized adaptive testing with multiple-form structures.  Applied Psychological Measurement, 28, 147-164.

#AR03-02.  Armstrong, R. D. & Little, J. (2003).  The assembly of multiple form structures.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.  {PDF file, 418 KB}

#AR03-03.  Armstrong, R. D. & Roussos, L.  (2003).  A method to determine targets for multi-stage adaptive tests.  Unpublished manuscript.  {PDF file, 207 KB}

Arrowwood, V. E. (1994).  Effects of computerized adaptive test anxiety on nursing licensure examinations.  Dissertation Abstracts International, A (Humanities and Social Sciences), 54 (9-A), 3410.

Assessment Systems Corporation (1984).  User’s manual for the MicroCAT Testing System.  St. Paul MN: Author.

Assessment Systems Corporation (1988).  User’s manual for the MicroCAT Testing System, Version 3.  St. Paul MN: Author.

Assessment Systems Corporation. (1996). User’s manual for the MicroCAT testing system, Version 3.5.  St Paul MN: Assessment Systems Corporation.

Assessment Systems Corporation (2001). The FastTEST Professional Testing System, Version 1.6.  [Computer software].  St. Paul MN: Author.

Auger, R. (1989). Étude de praticabilité du testing adaptatif de maîtrise des apprentissages scolaires au Québec : une expérimentation en éducation économique secondaire 5. Thèse de doctorat non publiée. Montréal : Université du Québec à Montréal.

Auger, R. & Séguin, S.P. (1992). Le testing adaptatif avec interprétation critérielle, une expérience de praticabilité du TAM pour l’évaluation sommative des apprentissages au Québec. Mesure et évaluation en éducation, 15-1 et 2, 10

- B -

Babcock, B. & Weiss, D. J. (2009).  Termination criteria in computerized adaptive tests: Variable-length CATs are not biased. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF File, 281 KB}

 

Baek, S. G. (1995). Computerized adaptive attitude testing using the partial credit model. Dissertation Abstracts International-A, 55(7-A), 1922 (UMI No. AAM9430378).

Baek, S.G. (1997).  Computerized adaptive testing using the partial credit model for attitude measurement.  In M. Wilson, G. Engelhard Jr., & K. Draney (Eds.), Objective measurement:  Theory into practice, volume 4.  Norwood NJ: Ablex.

Baghi, H., Ferrara, S. F., & Gabrys, R. (1992, April).  Student attitudes toward computer-adaptive test administration.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco CA.

Baghi, H., Gabrys, R., & Ferrara, S. (1991, April).  Applications of computer-adaptive testing in Maryland.  Paper presented at the annual meeting of the American Educational Research Association, Chicago IL.

#BA01191.  Ban, J.-C., Hanson. B. H., Wang, T., Ti, Q., & Harris, D. J. (2001).  A comparative study of on-line pretest item calibration-scaling methods in computerized adaptive testing.  Journal of Educational Measurement,38, 191-212. (Also ACT Research Report 2002-11).

See #BA02207.  Ban, J., Hanson, B.A., Yi, Q., & Harris, D.  (2001, April).  Data sparseness and online pretest calibration/scaling methods in CAT.  Paper presented at the annual meeting of the American Educational Research Association, Seattle. (Also ACT Research Report 2002-1)

#BA02207.  Ban, J-C., Hanson, B.A., Yi, Q., & Harris, D. J.  (2002).  Data sparseness and online pretest item calibration/scaling methods in CAT.  Journal of Educational Measurement,39, 207-218.

 

Ban, J.-C., Hanson, B., Wang, T., Yi, Q. & Harris, D. (2000). A comparative study of online pretest item calibration/scaling methods in CAT. American Educational Research Association.

#BA99-01.  Ban, J., Wang, T., & Yi, Q. (1999, June).  Comparison of the a-stratified method, the Sympson-Hetter method, and the matched difficulty method in CAT administration.  Paper presented at the annual meeting of the Psychometric Society, Lawrence KS.

#BA00-01.  Ban, J. C., Wang, T., Yi, Q., & Harris, D. J.  (2000, April).  Effects of nonequivalence of item pools on ability estimates in CAT.  Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans LA. {PDF file, 657 KB}

Barrada, J. R., Abad, F. J., & Veldkamp, B. P. (2009). Comparison of methods for controlling maximum exposure rates in computerized adaptive testing. Psicothema, 21, 313-320. {PDF file, 94 KB}

Barrada. J. R., Mazuela, P., & Olea, J. (2006). Maximum Information Stratification method for controlling item exposure in Computerized Adaptive Testing. Psicothema, 18, 156-159. {PDF file, 57 KB)

 

Barrada, J. R., Olea, J., & Abad., F. J. (2008). Rotating item banks versus restriction of maximum exposure rates in computerized adaptive testing. Spanish Journal of Psychology, 11, 618-625.  {PDF file, 267 KB}

 

Barrada, J. R., Olea, J., & Ponsoda, V. (2007). Methods for restricting maximum exposure rate in computerized adaptive testing. Methodology, 3, 14-23. {PDF file, 399KB}

 

Barrada, J. R., Olea, J., Ponsoda, V., & Abad, F. J. (2008). Incorporating randomness in the Fisher information for improving item-exposure control in CATs. British Journal of Mathematical and Statistical Psychology, 61, 493-513.

Barrada, J. R., Olea, J., Ponsoda, V., & Abad, F. J. (2009). Item selection rules in computerized adaptive testing: Accuracy and security. Methodology, 5, 7-17. (PDF file, 445 KB)

Barrada, J., Olea, J., Ponsada, V., & Abad, F.  (2009).  Test overlap rate and item exposure rate as indicators of test security in CATs.  In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.{PDF File, 261 KB}

 

Barrada, J. R., Veldkamp, B. P., & Olea, J. (2006, July). Multiple maximum exposure rates in computerized adaptive testing. Paper presented at the SMABS-EAM Conference, Budapest, Hungary.

Baumer, M., Roded, K., & Gafni, N. (2009).  Assessing the equivalence of Internet-based vs. paper-and-pencil psychometric tests. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.{PDF File, 142 KB}

Bayliss, M.S., Dewey J.E., Dunlap, I., et al. (2003). A study of the feasibility of Internet administration of a computerized health survey: The Headache Impact Test (HIT). Quality of Life Research,  12,  953–961.

Bayroff, A. G. (1964, November).  Feasibility of a programmed testing machine.  U.S. Army Personnel Research Office Research Study 64-3.

#BA69-01.  Bayroff, A. G. (1969, September).  Psychometric problems with branching tests.  Paper presented at the annual meeting of the American Psychological Association.

#BA74-01.  Bayroff, A. G., Ross, R. M., & Fischl, M. A. (1974).  Development of a programmed testing system (Technical Paper 259).  Arlington VA:  U. S. Army Research Institute for the Behavioral and Social Sciences.  NTIS No. AD A001534)

Bayroff, A. G. & Seeley, L. C.  (1967).  An exploratory study of branching tests (Technical Research Note 188).  Washington DC: U.S. Army Behavioral Science Research Laboratory.  (NTIS No. AD 655263)

Bayroff, A. G., Thomas, J. J., & Anderson, A. A. (1960).  Construction of an experimental sequential item test (Research Memorandum 60-1).  Washington DC, Personnel Research Branch, Department of the Army.

Bejar, I. I. (1977).  Applications of adaptive testing in measuring achievement and performance In D. J. Weiss (Ed.), Applications of computerized adaptive testing (Research Report 77-1).  Minneapolis MN:  University of Minnesota, Department of Psychology, Psychometric Methods Program.  {In #WE77-01, PDF file 3.228 MB}

Bejar, I. I. (1977).  A comparison of conventional and adaptive achievement testing. In D. J. Weiss (Ed.), Proceedings of the 1977 Computerized Adaptive Testing Conference.  Minneapolis MN:  University of Minnesota, Department of Psychology, Psychometric Methods Program. 

Bejar, I. I. (1986). Final report: Adaptive testing of spatial abilities (ONR 150 531). Princeton, NJ: Educational Testing Service.

Bejar, I. I. (1995).  From adaptive testing to automated scoring of architectural simulations.  IN L. E. Mancall & P. G. Bashook (Eds.), Assessing clinical reasoning: The oral examination and alternative methods (pp. 115-130.  Evanston IL: The American Board of Medical Specialities.

Bejar, I. I. (1999, April).  On-the-fly adaptive tests:  An application of generative modeling to quantitative reasoning.  Symposium presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

#BE02-01.  Bejar, I. I., Lawless, R. R., Morley, M. E., Wagner, M. E., Bennett R. E., & Revuelta, J.   (2002, July).  A feasibility study of on-the-fly item generation in adaptive testing (GRE Board Report No. 98-12; Educational Testing Service RR02-23).  Princeton NJ:  Educational Testing Service. {PDF file, 193 KB}

#BE03-03.  Bejar, I. I., Lawless, R. R., Morley, M. E., Wagner, M. E., Bennett R. E., & Revuelta, J.   (2003).  A feasibility study of on-the-fly item generation in adaptive testing.  Journal of Technology, Learning and Assessment, 2 (3). {PDF file, 427 KB}

Bejar, I. I. & Weiss, D. J. (1978).  A construct validation of adaptive achievement testing (Research Report 78-4).  Minneapolis MN:  University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.

Bejar, I. I., Weiss, D. J.,  & Gialluca, K. A.  (1977, October). An information comparison of conventional and adaptive tests in the measurement of classroom achievement (Research Report 77-7).  Minneapolis:  Department of Psychology, Psychometric Methods Program.

Bejar, I. I., Weiss, D. J.,  & Kingsbury, G. G. (1977, October). Calibration of an item pool for the adaptive measurement of achievement (Research Report 77-5).  Minneapolis:  Department of Psychology, Psychometric Methods Program. 

 

Belov, D. I. & Armstrong, R. D. (2009).  Direct and inverse problems of item pool design for computerized adaptive testing. Educational and Psychological Measurement, 69, 533-547.

 

Belov, D.I., Armstrong, R.D. (2008). A Monte Carlo approach to the design, assembly, and evaluation of multistage adaptive tests. Applied Psychological Measurement,32,119–137.

Belov, D. I. , Armstrong, R. D.  & Weissman, A. (2008). A monte carlo approach for adaptive testing with content constraints. Applied Psychological Measurement, 32, 431-446.

#BE98-45.  Bennett, R. E., Morley, M., & Quardt, D. (1998).  Three response types for broadening the conception of mathematical problem solving in computerized-adaptive tests (Research Report 98-45).  Princeton NJ :  Educational Testing Service. (Also presented at National Council on Measurement in Education, 1998)

Bennett, R. E., Steffen, M., Singley, M. K., Morley, M., & Jacquemin, D. (1997). Evaluating an automatically scorable, open-ended response type for measuring mathematical reasoning in computer-adaptive tests. Journal of Educational Measurement, 34, 162-176.

Ben-Porath, Y. S., Waller, N. G., Slutske, W. S. & Butcher, J. N.  (1988, August).  A comparison of two methods for the adaptive administration of the MMPI-2 content scales.  Paper presented at the 86th Annual Convention of the American Psychological Association, Atlanta GA.

Ben-Porath, Y. S., Slutske, W. S., & Butcher, J. N. (1989).  A real-data simulation of computerized adaptive administration of the MMPI.  Psychological Assessment:  A Journal of Consulting and Clinical Psychology, 1, 18-22.

Ben-Porath, Y. S. & Roper, B. L. (1992, May).  Computerized adaptive testing with the MMPI-2:  Reliability, validity, and comparability to paper and pencil administration.  Paper presented at the 27th Annual Symposium on Recent Developments in the MMPI/MMPI-2, Minneapolis MN.

Ben-Porath, Y. S., Roper, B. L., & Butcher, J. N. (1990, June).  An empirical study of the computer adaptive MMPI-2.  Paper presented at the 25th Annual Symposium on recent developments in the MMPI/MMPI-2, Minneapolis MN.

#BE94141.  Berger, M. P. F.  (1994).  A general approach to algorithmic design of fixed-form tests, adaptive tests, and testlets.  Applied Psychological Measurement, 1994, 141-153.

Berger, M. P. F., & Veerkamp, W. J. J. (1997).  Some new item selection criteria for adaptive testing.  Journal of Educational and Behavioral Statistics, 22, 203-226.

Bergstrom, B. (1992, April).  Ability measure equivalence of computer adaptive and paper and pencil tests:  A research synthesis.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco.

#BE92-01.  Bergstrom, B. (1992).  Computer adaptive versus paper-and-pencil tests.  Unpublished doctoral dissertation, University of Chicago.

Bergstrom, B. A. (1992). Confidence in pass/fail decisions for computer adaptive and paper and pencil examinations. Evaluation and The Health Professions, 15(4), 435-464.

Bergstrom B. A. (1996).  Computerized adaptive testing for the national certification examination. AANA.J, 64, 119-24.

Bergstrom, B. A. (1996).  Computerized adaptive testing for the national certification examination. AANA Journal, 64, 119-24. (American Association of Nurse Anesthetists)

Bergstrom, B. & Gershon, R. (1992, April).  Comparison of item targeting strategies for pass/fail adaptive tests.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco CA (ERIC NO. ED 400 287).

Bergstrom, B. & Gershon, R. (1994, April).  Computerized adaptive testing exploring examinee response time using hierarchical linear modeling.  Paper presented at the annual meeting of the American Educational Research Association, New Orleans LA. (ERIC No. ED 400 286).

Bergstrom, B. A  & Gershon, R. C. (1994). Computerized adaptive testing for licensure and certification. CLEAR Exam Review, Winter 1994, 25-27.

Bergstrom, B. A., & Lunz, M.E. (1990, June).  The stability of Rasch pencil and paper item calibrations on computer adaptive tests.  Paper presented at the Midwest Objective Measurement Seminar, Chicago IL.

Bergstrom, B. B., & Lunz, M. E. (1991, April).  Confidence in pass/fail decisions for computer adaptive and paper and pencil examinations.  Paper presented at the annual meeting of the American Educational Research Association, Chicago IL.

Bergstrom, B. A.. & Lunz, M. E. (1991, July).  Comparisons of computer adaptive and pencil and paper tests.  Chicago IL: American Society of Clinical Pathologists.  Unpublished manuscript.

Bergstrom, B. & Lunz, M. E.  (1992).  Confidence in pass/fail decisions, for computer adaptive and paper-and-pencil examinations.  Evaluation and the Health Professions, 15(4), 453-464.

Bergstrom, B. A. & Lunz, M.E. (1999).  CAT for certification and licensure.  In F. Drasgow & J. B. Olsen (Eds.), Innovations in computerized assessment (pp. 67-91).  Mahwah NJ: Erlbaum.

Bergstrom, B. A., Lunz, M. E., & Gershon, R. C.  (1992).  Altering the level of difficulty in computer adaptive testing.  Applied Measurement in Education, 5, 137-149.

Bergstrom, B.A. & Stahl, J. A. (1992).  Assessing existing item bank depth for computer adaptive testing. ERIC Document No. TM022404

#BE75-01. Betz, N. E. (1975).  New types of information and psychological implications.  In D. J. Weiss (Ed.), Computerized adaptive trait measurement:  Problems and Prospects (Research Report 75-5), pp. 32-43.  Minneapolis: University of Minnesota, Department of Psychology, Psychometric Methods Program. {PDF file, 609 KB}

Betz, N. E. (1977).  Effects of immediate knowledge of results and adaptive testing on ability test performance.  Applied Psychological Measurement, 2, 259-266.

Betz, N. E. & Weiss, D. J. (1973).  An empirical study of computer-administered two-stage ability testing (Research Report 73-4). Minneapolis:  Department of Psychology, Psychometric Methods Program.

#BE74-4.  Betz, N. E. & Weiss, D. J. (1974).  Simulation studies of two-stage ability testing (Research Report 74-4). Minneapolis:  Department of Psychology, Psychometric Methods Program.  {PDF file, 2.92 MB}

Betz, N. E. & Weiss, D. J. (1975).  Empirical and simulation studies of  flexilevel ability testing (Research Report 75-3). Minneapolis:  Department of Psychology, Psychometric Methods Program.

Betz, N. E. & Weiss, D. J. (1976, June).  Effects of immediate knowledge of results and adaptive testing on ability test performance (Research Report 76-3). Minneapolis:  University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.

Betz, N. E. & Weiss, D. J. (1976, June).  Psychological effects of immediate knowledge of results and adaptive ability testing (Research Report 76-4). Minneapolis:  University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.

#BI01069.  Bickel, P., Buyske, S., Chang, H.-H., & Ying, Z. (2001).  On maximizing item information and matching difficulty with ability.  Psychometrika, 66, 69-77.

#BI84-01.  Bill, B. C.  (1984).  A comparison of the maximum likelihood strategy and stradaptive test on a micro-computer.  Unpublished M. S. thesis, University of Wisconsin, Madison.

Binet, A., & Simon, Th. A. (1905). Méthode nouvelle pour le diagnostic du niveau intellectuel des anormaux. L'Année Psychologique, 11, 191-244.   (also cited as: Applications des methods nouvelles au diagnostic du niveau intellectual chez des enfants normaux et anourmaux d’hospice et d’ecole primaire, 245-336.)

Binet, A. & Simon, T. (1908).  Le development de l’intelligence chez les enfants.  L’Anee Psychologique, 14, 1-94.

Binet, A. & Simon, T. (1915). A method of measuring the development of the intelligence of young children. Chicago: Chicago Medical Book Co.

#BJ04-02.  Bjorner, J.B. (2004, June).  Developing tailored instruments: Item banking and computerized adaptive assessment.  Paper presented at the conference “Advances in Health Outcomes Measurement: Exploring the Current State and the Future of Item Response Theory, Item Banks, and Computer-Adaptive Testing,” Bethesda MD.  {PDF file, 406 KB}

 

Bjorner, J.B., Chang, C.H., Thissen, D., Reeve, B.B. (2007). Developing tailored instruments: Item banking and computerized adaptive assessment. Quality of Life Research, 16(Suppl 1, 95–108.

#BJ03913.  Bjorner, J. B., Kosinski,  M.  & Ware, J. E.,  Jr. (2003).  Calibration of an item pool for assessing the burden of headaches: An application of item response theory to the Headache Impact Test (HIT).  Quality of Life Research 12: 913–933.   {PDF file, 286 KB}

#BJ04-01.  Bjorner, J. B.,  Kosinski, M., &  Ware, J. E., Jr. (2004, in press).  Computerized adaptive testing and item banking. In P. M. Fayers and R. D. Hays (Eds.) Assessing Quality of Life.  Oxford: Oxford University Press.  {PDF file 371 KB}

Blais, J.G. (2002). Historique et concepts propres au testing adaptatif  [Adaptive testing: Historical accounts and concepts]. Presented at the 69th Congress of the Acfas. Sherbrooke: Association canadienne française pour l’avancement des sciences (Acfas). [In French]

#BL02-01.  Blais, J-.G  & Raiche, G.  (2002, April).  Some features of the sampling distribution of the ability estimate in computerized adaptive testing according to two stopping rules.  Paper presented at the annual meeting of the International Objective Measurement Workshops-XI, New Orleans, LA.  {PDF file, 38 KB} 

Blais, J.-G. & Raîche, G. (submitted). Features of the estimated sampling distribution of the ability estimate in computerized adaptive testing according to two stopping rules. In D. G. Englehard (Eds.), Objective measurement:  Theory into practice. Volume 6.

Bloxom, B. M. & Vale, C. D. (undated).  An adaptive method of multidimensional trait estimation.  Unpublished manuscript.

Bloxom, B. M. & Vale,C. D. (1987, June).  Multidimensional adaptive testing: A procedure for sequential estimation of the posterior centroid and dispersion of theta.  Paper presented at the annual meeting of the Psychometric Society, Montreal, Canada.

Bochner, J., Garrison, W., Palmer, L., MacKenzie, D., & Braveman, A. (1997).  A computerized adaptive testing system for speech discrimination measurement: The Speech Sound Pattern Discrimination Test. Journal of the Acoustic Society of  America, 101, 2289-2298.

#BO75-01.  Bock, R. D. (1975).  Discussion.  In D. J. Weiss (Ed.), Computerized adaptive trait measurement:  Problems and Prospects (Research Report 75-5), pp. 46-49. Minneapolis: University of Minnesota, Department of Psychology, Psychometric Methods Program. {PDF file, 414 KB}

#BO82431.  Bock, B. D., & Mislevy, R. J.  (1982).  Adaptive EAP estimation of ability in a microcomputer environment.  Applied Psychological Measurement, 6, 431-444

Bock, R. D., Muraki, E., & Pfeiffenberger, W. (1988).  Item pool maintenance in the presence of item parameter drift.  Journal of Educational Measurement,25, 275-285.

Bock, R. D., & Zimowski, M. F. (1998). Feasibility studies of two-stage testing in large-scale educational assessment: Implications for NAEP. American Institutes for Research, CA.

Bontempo, B., & Julian, E. R., & Gorham, J. L. (1997, March).  Assessing speededness in variable-length computer-adaptive tests.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL. 

#BO01965.  Borman, W. C., Buck, D. E., Hanson, M. A., Montowidlo, S. J., Stark, S., & Drasgow, F.  (2001).  An examination of the comparative reliability, validity, and accuracy of performance ratings made using computer adaptive rating scales.  Journal of Applied Psychology, 86, 965-973.

Borman, W. C., Hanson, M.A., Kubisiak, U. C., & Buck, D. E. (2000).  Computerized adaptive rating scales (CARS):  Development and evaluation of the concept. (Institute Rep. No. 350).  Tampa FL:  Personnel Decisions Research Institute.

Borman, W. C., Hanson, M. A., Montowidlo, S. J., Drasgow, F., Foster, L., & Kubisiak, U. C. (1998).  Computerized adaptive rating scales that measure contextual performance.  Paper presented at the 3th annual conference of the Society for Industrial and Organizational Psychology, Dallas TX.

Bouchard, J. (1990).  Future directions for the National Council: The Computerized Adaptive Testing Project. Issues, 11, 1-5.  (National Council of State Boards of Nursing)

Bowers, D. R. (1992).  Computer-based adaptive testing in music research and instruction.  Psychomusicology, 10, 49-63.

Bowles, R. (2001). An examination of item review on computer adaptive tests. Manuscript in preparation, University of Virginia.

#BO01-01. Bowles, R., & Pommerich, M. (2001, April).  An examination of item review on a CAT using the specific information item selection algorithm.  Paper presented at the annual meeting of the National Council on Measurement in Education, Seattle WA. {PDF file, 325 KB}

#BO03-01.  Boyd. A. M.  (2003).  Strategies for controlling testlet exposure rates in computerized adaptive testing systems.  Unpublished Ph.D. Dissertation, The University of Texas at Austin.  {PDF file, 485 KB}

#BO03-02.  Boyd, A. M., Dodd, B. G., & Fitzpatrick, S. J. (2003, April).  A comparison of exposure control procedures in CAT systems based on different measurement models for testlets using the verbal reasoning section of the MCAT.  Paper presented at the Annual meeting of the National Council on Measurement in Education, Chicago IL.  {PDF file, 405 KB}

Bradlow, E.T., Wainer, H., and Wang, X (1999). A Bayesian random effects model for testlets, Psychometrika, 64, 153-168.

#BR01085.  Bradlow, E. T. & Weiss, R. E.  (2001).  Outlier measures and norming methods for computerized adaptive tests.  Journal of Educational and Behavioral Statistics, 26, 85-104.

Bradlow, E. T., Weiss, R. E., Cho, M. (1998).  Bayesian identification of outliers in computerized adaptive testing.  Journal of the American Statistical Association, 93, 910-919.

#BR04-01.   Breithaupt, K., Ariel, A., & Veldkamp, B. (2004).  Automated Simultaneous Assembly of Multi-Stage Testing for the Uniform CPA Examination. Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego CA.  {PDF file, 201 KB}

#BR00-01.  Bridgeman, B. & Cline, F. (2000).  Variations in mean response times for questions on the computer-adaptive GRE general test: Implications for fair assessment (GRE Board Professional Report No. 96-20P: Educational Testing Service Research Report 00-7).  Princeton NJ:  Educational Testing Service.

#BR02-01.  Bridgeman, B. & Cline, F.  (2002, April).  Fairness issues in adaptive tests with strict time limits.  Paper presented at the annual meeting of the American Educational Research Association, New Orleans LA.  {PDF file, 1.287 MB}

#BR03-01.  Bridgeman, B,. Cline, F., & Hessinger, J. (2003).  Effect of extra time on GRE® Quantitative and Verbal Scores (Research Report 03-13).  Princeton NJ:  Educational Testing service.  {PDF file, 88 KB}

Bridgeman, B. & Schaeffer, G. A.  (1995, April).  A comparison of gender differences on paper-and-pencil and computer-adaptive versions of the Graduate Record Examination.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco CA.

Brooks. S.  (1977).  A comparison of the classification of students by two methods of administration of a mathematics placement test.  Unpublished doctoral dissertation, Syracuse University, 1977.

#BR78415.  Brooks, S. & Hartz, M. A.  (1978).  Predictive ability of a branching test.  Educational and Psychological Measurement, 38, 415-419.

#BR77-6.  Brown, J. M., & Weiss, D. J. (1977). An adaptive testing strategy for achievement test batteries (Research Rep. No. 77-6). Minneapolis: University of Minnesota, Department of Psychology, Psychometric Methods Program. {PDF file, 2.40 MB}

Bryson, R.  (1971, December).  A comparison of four methods of selecting items for computer-assisted testing (Technical Bulletin STB 72-5).  San Diego:  Naval Personnel and Training Research Laboratory.

Buhr, D. C., & Legg, S. M. (1989, March).  Investigating the validity of a computerized adaptive test for different examinee groups.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco CA.

Bunderson, C. V., Inouye, D. K., & Olsen, J. B. (1988). The four generations of computerized educational measurement. Research Report 98-35. Princeton NJ: Educational Testing Service.

Bunderson, C. V., Inouye, D. K., & Olsen, J. B. (1986). The four generations of computerized educational measurement. In R. L. Linn (Ed.), Educational Measurement (3rd ed., pp. 367-407).  New York: Macmillan.

Burke, M. J., Normand, J., & Raju, N. M.  (1987).  Computerized psychological testing:  Overview and critique.  Professional Psychology:  Research and Practice, 1, 42-51.

#BU03-01.  Burt, W. M., Kim. S.-J., Davis, L. L., & Dodd, B. G. (2003, April).  A comparison of item exposure control procedures using a CAT system based on the generalized partial credit model.  Paper presented at the annual meeting of the American Educational Research Association, Chicago IL.  {PDF file, 265 KB}

 

Buyske, S. G. (1998). Optimal design for item calibration in computerized adaptive testing. Unpublished doctoral dissertation, Rutgers University, New Brunswick, NJ.

- C -

Candell, G. L.  (1988).  Application of appropriateness measurement to a problem in computerized adaptive testing.  Unpublished doctoral dissertation, University of Illinois.

Carey, P. A.  (ETS) (1999, April).  The use of linear-on-the-fly testing for TOEFL Reading.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

Carlson, R. (1994).  Computer adaptive testing:  A shift in the evaluation paradigm.  Educational Technology Systems, 22 (3), 213-224.

 

Carlson, S. (2000). ETS finds flaws in the way online GRE rates some students. Chronicle of Higher Education, 47, a47.

Case, S. M. & Luecht, R. M.  (1997, March).  Computer assembly of tests so that content reigns supreme.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.

 

Cella, D., Gershon, R., Lai, J. S., & Choi, S. (2007). The future of outcomes measurement: Item banking, tailored short-forms, and computerized adaptive assessment. Quality of Life Research, 16(Suppl. 1), 133-141.

#CH95-01.  Chae, S. (1995).  Item equivalence from paper-and-pencil to computer adaptive testing.  Unpublished doctoral dissertation, University of Chicago.  

 

Chajewski, M. & Lewis, C. (2009).  Optimizing item exposure control algorithms for polytomous computerized adaptive tests with restricted item banks. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF File, 923 KB}

 

#CH04-01.  Chang, C.-H. (2004, June). Developing tailored instruments: Item banking and computerized adaptive assessment. Paper presented at the conference “Advances in Health Outcomes Measurement: Exploring the Current State and the Future of Item Response Theory, Item Banks, and Computer-Adaptive Testing,” Bethesda MD.  {PDF file, 181 KB}

#CH00-04.  Chang, C.-Y., Kalohn. J. C., Lin, C.-J. & Spray, J. (2000).  Estimating item parameters from classical indices for item pool development with a computerized classification test (ACT Research 2000-4).  Iowa City IA, ACT, Inc.

Chang, H. (1995, April). A global information approach to computerized adaptive testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco CA.

Chang, H. (1996, April).  A model for score maximization within a computerized adaptive testing environment.  Paper presented at the annual meeting of the NMCE, New York NY.

Chang, H. H. (2004). Understanding computerized adaptive testing: From Robbins-Munro to Lord and beyond. In D. Kaplan (Ed.), The Sage handbook of quantitative methodology for the social sciences (pp. 117-133). New York: Sage.

 

Chang, H.-H., Qian, J., & Ying, Z. (1999). a-stratified multistage computerized adaptive testing. Applied Psychological Measurement, 23, 211–222.

Chang, H., Qian, J., & Ying, Z.  (2001).  a-stratified multistage computerized adaptive testing with b-blocking.  Applied Psychological Measurement, 25, 333-341 (also presented at National Council on Measurement in Education, 2000).

Chang, H. & van der Linden.  (2001). Implementing content constraints in a-stratified adaptive testing using a shadow test approach (Research Report 01-001).  University of Twente, Department of Educational Measurement and Data Analysis.

#CH03262.  Chang, H.-H.  & van der Linden, W. J.  (2003)  Optimal stratification of item pools in a-stratified computerized adaptive testing.  Applied Psychological Measurement, 27, 262-274.

Chang, H. & Ying, Z.  (1999).  a-stratified multistage computerized adaptive testing. Applied Psychological Measurement, 23, 211-222.

Chang, H., & Ying, Z. (in press, 1997?). Nonlinear sequential designs for logistic item response theory models with applications to computerized adaptive tests. The Annals of Statistics.

Chang, H.-H., & Ying, Z. (1996). A global information approach to computerized adaptive testing. Applied Psychological Measurement, 20, 213-229. (also presented at National Council on Measurement in Education, 1997)

Chang, H.-H., & Ying, Z. (1996, June). Building a statistical foundation for computerized adaptive testing. Paper presented at the annual meeting of the Psychometric Society, Banff, Alberta, Canada.

Chang, H.-H., & Ying, Z. (1996, in preparation).  Recursive maximum likelihood estimation, sequential design, and computerized adaptive testing.  Princeton NJ: Educational Testing Service.

Chang, H.-H. & Ying, Z. (1997, June).  Multi-stage CAT with stratified design.  Paper presented at the annual meeting of the Psychometric Society.  Gatlinburg TN.

Chang, H.-H., & Ying, Z. (1999). a-stratified multistage computerized adaptive testing. Applied Psychological Measurement, 23, 211-222.

#CH02-01.  Chang, H. H. & Ying, Z.  (2002, April).  To weight or not to weight – balancing influence of initial and later items in CAT. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans LA.  {PDF file, 252 KB}

Chang, H. H. & Ying, Z.  (2003, April).  Test-score comparability, ability estimation, and item-exposure control in computerized adaptive testing.  Paper presented at the Annual meeting of the National Council on Measurement in Education, Chicago IL.

Chang, H.-H.., & Zhang, J. (2002). Hypergeometric family and test overlap rates in computerized adaptive testing. Psychometrika, 67, 387-398.  (Also presented at the annual meeting of the Psychometric Society, Lawrence KS, 1999.) 

Chang, H.-H. & Zhang, J. (2002, April).  Identify the lower bounds for item sharing and item pooling in computerized adaptive testing.  Paper presented at the annual meeting of the American Educational Research Association, New Orleans LA. (Not available from author; will be replaced by a later paper).

Chang, H.-H. & Zhang, J. (2003, April).  Assessing CAT security breaches by the item pooling index.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.

#CH02387.  Chang, H., & Zhang, J. (2002).  Hypergeometric family and test overlap rates in computerized adaptive testing.  Psychometrika, 67, 387-298.

 

#CH00-01.  Chang, S., Ansley, T., & Lin, S. (2000, April).  Performance of item exposure control methods in computerized adaptive testing: Further explorations.  Paper presented at the Annual Meeting of the American Educational Research Association, New Orleans , LA.

#CH03071.  Chang, S.-H. & Ansley, T. (2003).  A comparative study of item exposure control methods in computerized adaptive testing.  Journal of Educational Measurement,40, 1, 71-103.

Chang, S. W. (1998).  A comparative study of item exposure control methods in computerized adaptive testing.  Unpublished doctoral dissertation, University of Iowa , Iowa City IA.

#CH02-02.  Chang, S.-W. & Harris, D. J.  (2002, April).  Redeveloping the exposure control parameters of CAT items when a pool is modified.  Paper presented at the annual meeting of the American Educational Research Association, New Orleans LA.   {PDF file, 1.113 MB}

Chang, S.W. & Harris, D. (2002, April). Redeveloping the exposure control parameters of CAT items when a pool is modified. Paper presented at the Annual Meeting of the American Educational Research Association, New Orleans.

#CH98-03.  Chang, S. W. & Twu, B. Y.  (1998).  A comparative study of item exposure control methods in computerized adaptive testing.  Research Report Series 98-3.  Iowa City: American College Testing.

#CH01-02.  Chang, S.-W. & Twu, B.-Y.  (2001).  Effects of changes in the examinees’ ability distribution on the exposure control methods in CAT.  Paper presented at the annual meeting of the American Educational Research Association, Seattle WA.  {PDF file, 695 KB}

Chen, P. H. (2009).  Comparison of adaptive Bayesian estimation and weighted Bayesian estimation in multidimensional computerized adaptive testing. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF file, 308KB}

Chen, S. (1998). A comparison of maximum likelihood estimation and expected a posteriori estimation in computerized adaptive testing using the generalized partial credit model. (Doctoral Dissertation, University of Texas).  Dissertation Abstracts International: Section B: the Sciences & Engineering, Vol. 58(1-B), Jul 1997, 0453.

 

Chen, S.-K. (2007). The comparison of maximum likelihood estimation and expected a posteriori in CAT using the graded response model. Journal of Elementary Education. 19, 339-371.

#CH01-01.  Chen, S.-Y. (2001).  A new approach to simulation studies in computerized adaptive testing.  Paper presented at the annual meeting of the American Educational Research Association, Seattle WA.  {PDF file, 251 KB}

 

Chen, S.-Y., & Ankenmann, R. D. (2004). Effects of practical constraints on item selection rules at the early stages of computerized adaptive testing. Journal of Educational Measurement, 41, 149-174. (Also presented at American Educational Research Association, 1999).

Chen, S.-Y., Ankenmann, R.D., & Chang, H.-H. (2000). A comparison of item selection rules at the early stages of computerized adaptive testing. Applied Psychological Measurement, 24, 241-255.

Chen, S., Ankenmann, R. D., & Spray, J. A.  (1999, April).  Exploring the relationship between item exposure rate and test overlap rate in computerized adaptive testing. Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada. (Also ACT Research Report 99-5).   (Also presented at American Educational Research Association, 1999)

Chen, S., Ankenmann, R. D., & Spray, J. A. (2003).  The relationship between item exposure and test overlap in computerized adaptive testing.  Journal of Educational Measurement, 40, 129-145.

#CH98569.  Chen, S., Hou, L., & Dodd, B.G. (1998).  A comparison of maximum likelihood estimation and expected a posteriori estimation in CAT using the partial credit model. Educational and Psychological Measurement, 58, 569-595.

Chen, S., Hou, L., Fitzpatrick, S. J., & Dodd, B. G. (1995, April). The effect of population distribution and methods of theta estimation on CAT using the rating scale model. Paper presented at the annual meeting of the American Educational Research Association, San Francisco.

#CH97422.   Chen, S., Hou, L. Fitzpatrick, S. J., & Dodd, B. (1997).  The effect of population distribution and methods of theta estimation on computerized adaptive testing (CAT) using the rating scale model.  Educational and Psychological Measurement, 57, 422-439.

Chen, S.-Y., Ankenmann, R. D., & Chang, H.-H. (2000). A comparison of item selection rules at the early stages of computerized adaptive testing.  Applied Psychological Measurement, 24, 241-255.

Chen, S.-Y., Ankenmann, R. D., & Spray, J. A. (1999).  Exploring the relationship between item exposure rate and test overlap rate in computerized adaptive testing (ACT Research Report series 99-5).  Iowa City IA: ACT, Inc. (also National Council on Measurement in Education paper, 1999).

#CH03129.  Chen, S.-Y., Ankenmann, R. D., & Spray, J. A. (2003).  The relationship between item exposure and test overlap in computerized adaptive testing.  Journal of Educational Measurement,40, 129-145.

#CH03-01.  Chen, S.-Y. & Doong, H.  (2003). Predicting item exposure parameters in computerized adaptive testing.  Paper presented at the annual meeting of the American Educational Research Association, Chicago IL.  {PDF file, 239 KB}

 

Chen, S.Y. & Lei, P.W. (2005). Controlling item exposure and test overlap in computerized adaptive testing. Applied Psychological Measurement, 29(2), 204–217.

Cheng, P. E. & Liou, M.  (2000).  Estimation of trait levels in computerized adaptive testing.  Applied Psychological Measurement, 24, 257-265.

#CH03204.  Cheng, P. E. & Liou, M.  (2003).  Computerized adaptive testing using the nearest-neighbors criterion.  Applied Psychological Measurement, 27, 204-216.

 

Chen, Y.-Y., & Ankenmann, R. D. (2004) Effects of practical constraints on item selection rules at the early stages of computerized adaptive testing. Journal of Educational Measurement, 41, 149-174.

Cheng, Y. (2009).  Computerized adaptive testing for cognitive diagnosis. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.  {PDF File, 308 KB}

 

Cheng, Y. & Chang, H.-H. (2007). The modified maximum global discrimination index method for cognitive diagnostic computerized adaptive testing.  In D. J. Weiss (Ed.).  Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing. {PDF file, 172 KB}

 

Cheng, Y., Chang, H.-H., Douglas, J., & Guo, F.  (2009). Constraint-weighted a-stratification for computerized adaptive testing with nonstatistical constraints: Balancing measurement efficiency and exposure control.  Educational and Psychological Measurement, 69, 35-49.

 

Cheng, Y., Chang, H. H., & Wang, X. B. (2006, April). Constraints-weighted information method for item selection of severely constrained computerized adaptive testing. Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco.

 

Cheng, Y., Chang, H., & Yi, Q. (2007). Two-phase item selection procedure for flexible content balancing in CAT. Applied Psychological. Measurement, 3, 467–482.

 

Choi, S.W. (2009).  Firestar: Computerized adaptive testing simulation program for polytomous IRT models. Applied Psychological Measurement, 33, 644–645.

 

Choi, S. W., Reise, S. P, Pilkonis, P.A., Hays, R. D., & Cella, D.  (2010).  Efficiency of static and computer adaptive short forms compared to full-length measures of depressive symptoms. Quaity of Life Research, 19(1), 125–136.

 

Choi, S.W. & Swartz, R.J.. (2009).  Comparison of CAT item selection criteria for polytomous items. Applied Psychological Measurement, 33, 419–440.

Cito. (1999). WISCAT. Een computergestuurd toetspakket voor rekenen en wiskunde. [A computerized test package for arithmetic and mathematics]. Cito: Arnhem.

#CI04-01.  Cizek, G. J. (2004).  Protecting the integrity of computer-adaptive licensure tests: Results of a legal challenge.  Paper presented at the annual meeting of the American Educational Research Association, San Diego CA.  {PDF file, 191 KB}

#CL76-01. Clark C. K. (1976).   Proceedings of the first conference on computerized adaptive testing.  Washington DC: U.S. Government Printing Office. {Complete document: PDF file, 7.494 MB; Table of contents and separate papers}

Cleary, T. A., Linn, R. L., & Rock, D. A. (1968).  Reproduction of total test score through the use of sequential programmed tests.  Journal of Educational Measurement, 5, 183-187.

#CL69345.  Cleary, T. A., Linn, R. L., & Rock, D. A. (1969).  An exploratory study of programmed tests.  Educational and Psychological Measurement, 28, 345-360.

Cliff, N.  (1975).  A basic test theory generalizable to tailored testing (Technical Report No. 1).  Los Angeles CA:  University of Southern California, Department of Psychology. (See CL76-01 for a later version).

Cliff, N. (1975).  Complete orders from incomplete data:  Interactive ordering and tailored testing.  Psychological Bulletin, 82, 2859-302.

#CL76-02.  Cliff, N. (1976).  Elements of a basic test theory generalizable to tailored testing.  Unpublished manuscript.

#CL75-01.  Cliff, N. (1976).  Incomplete orders and computerized testing.   In C. K. Clark (Ed.),  Proceedings of the First Conference on Computerized Adaptive Testing (pp. 18-23).  Washington DC: U.S. Government Printing Office.  {PDF file, 373 KB}

#CL77375.  Cliff, N. A. (1977).  A theory of consistency ordering generalizable to tailored testing.  Psychometrika, 375-399.

#CL77-04.  Cliff, N., Cudeck, R., & McCormick, D.  (1978).  Evaluations of implied orders as a basis for tailored testing using simulations (Technical Report No. 4).  Los Angeles CA:  University of Southern California, Department of Psychology.

#CL78-06.  Cliff, N., Cudeck, R., & McCormick, D.  (1978).  Implied orders as a basis for tailored testing (Technical Report No. 6).  Los Angeles CA:  University of Southern California, Department of Psychology.

Cliff, N. A., Cudeck, R. & McCormick, D.  (1977).  An empirical evaluation of  implied orders as a basis for tailored testing.  In D. J. Weiss (Ed.), Proceedings of the 1977 Computerized Adaptive Testing Conference.  Minneapolis MN:  University of Minnesota, Department of Psychology, Psychometric Methods Program.

Cliff, N. A., Cudeck, R. & McCormick, D.  (1979).  Evaluation of  implied orders as a basis for tailored testing with simulation data.  Applied Psychological Measurement, 3, 495-514.

#CO03-01.  Cook, K. F., Roddey, T. S., Gartsman, G. M., & Olson, S. L. (2003).  Development and psychometric evaluation of the Flexilevel Scale of Shoulder Function (FLEX-SF).  Medical Care (in press).  {PDF file, 607 KB}

 

Collins, J. A. (1996). Adaptive testing with granularity. Master’s thesis, University of Saskatchewan, Department of Computer Science.

Collins, J. A., Greer, J. E., & Huang, S. X. (1996). Adaptive assessment using granularity hierarchies and Bayesian nets.  In Frasson, C., Gauthier, G., and Lesgold, A. (Eds.) Intelligent Tutoring Systems, Third International Conference,  ITS'96, Montréal, Canada, June 1996 Proceedings. Lecture Notes in Computer Science 1086. Berlin Heidelberg: Springer-Verlag 569-577.

Cordova, M. J. (1997). Optimization methods in computerized adaptive testing. Unpublished doctoral dissertation, Rutgers University, New Brunswick NJ.

Cordova, M. J. (1998).  Applications of network flows to computerized adaptive testing.  Dissertation, Rutgers Center for Operations Research (RUTCOR), Rutgers University, New Brunswick NJ.

#CO75-01.  Cory, C. H. (1976).  Using computerized tests to add new dimensions to the measurement of abilities which are important for on-job performance:  An exploratory study.  In C. K. Clark (Ed.),  Proceedings of the First Conference on Computerized Adaptive Testing (pp. 64-74).  Washington DC: U.S. Government Printing Office.  {PDF file, 632 KB}

Costa, D. R., Karino, C. A., Moura, F. A. S., & Andrade, D. F. (2009).  A comparison of three methods of item selection for computerized adaptive testing. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.{PDF file, 531 KB}

Cowden, D. J. (1946).  An application of sequential sampling to testing students.  Journal of the American Statistical Association, 41, 547-556.

Crichton, L. I. (1981). Effect of error in item parameter estimates on adaptive testing (Doctoral dissertation, University of Minnesota). Dissertation Abstracts International, 42, 06-B. (University Microfilms No. AAD81-25946)

#CR82-52.  Croll, P. R. (1982).  Computerized adaptive testing system design:  Preliminary design considerations (Tech. Report 82-52).  San Diego CA:  Navy Personnel Research and Development Center. (AD A118 495)

Croll, P. R. & Urry, V. W. (1975).  Tailored testing:  Maximizing validity and utility for job selection.  Paper presented at the 86th Annual Convention of the American Psychological Association.  Toronto, Canada.

Cronbach, L. J. (1966).  New light on test strategy from decision theory.  In A. Anastasi (Ed.).  Testing problems in perspective.  Washington DC:  American Council on Education.

Cudeck, R. (1985). A structural comparison of conventional and adaptive versions of the ASVAB  Multivariate Behavioral Research, 20,  305-322.

Cudeck, R. A., Cliff, N., & Kehoe, J.  (1977).  TAILOR:  A FORTRAN procedure for interactive tailored testing.  Educational and Psychological Measurement, 37, 767-769.

#CU76-02.  Cudeck, R. A., Cliff, N., Reynolds, T. J., & McCormick, D. J.  (1976).  Monte carlo results from a computer program for tailored testing (Technical Report No. 2).  Los Angeles CA:  University of California, Department of Psychology. 

Cudeck, R., McCormick, D. J., & Cliff, N.  (1979).  Monte carlo evaluation of implied orders as a basis for tailored testing.  Applied Psychological Measurement, 3, 6 5-74.

Cudeck, R., McCormick, D., & Cliff, N.  (1980).  Implied orders tailored testing: Simulation with the Stanford-Binet.  Applied Psychological Measurement, 4, 157-163.

Curran, L. T., & Wise, L. L. (1994, August). Evaluation and implementation of CAT-ASVAB. Paper presented at the annual meeting of the American Psychological Association, Los Angeles.

- D -

 

Davey, T., & Fan, M. (2000, April). Specific information item selection for adaptive testing. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans.

Davey, T., Godwin, J., & Mittelholz, D. (1997). Developing and scoring an innovative computerized writing assessment. Journal of Educational Measurement, 34, 21-41.

Davey, T., & Nering, M. L. (1998, June).  Evaluating and insuring measurement precision in adaptive testing.  Paper presented at the annual meeting of the Psychometric Society, Urbana, IL.

Davey, T., & Nering, M. L. (1998, September). Controlling item exposure and maintaining item security. Paper presented at an Educational Testing Service-sponsored colloquium entitled “Computer-based testing: Building the foundations for future assessments,” Philadelphia PA.

Davey, T., & Nering, M. (2002). Controlling item exposure and maintaining item security. In C. N. Mills, M. T. Potenza, & J. J. Fremer (Eds.), Computer-Based Testing: Building the Foundation for Future Assessments (pp. 165-191). Mahwah, NJ: Lawrence Erlbaum Associates, Inc.

Davey, T., Nering, M., & Thompson, T.  (1997, June).  Realistic simulation procedures for item response data.  In T. Miller (Chair), High-dimensional simulation of item response data for CAT research.  Symposium presented at the annual meeting of the Psychometric Society, Gatlinburg TN.

Davey, T. & Parshall, C. G. (1995, April). New algorithms for item selection and exposure control with computerized adaptive testing. Paper presented at the annual meeting of the American Educational Research Association, San Francisco CA.

Davey, T., & Pitoniak, M. J. (2006). Designing computerized adaptive tests. In S.M. Downing & T. M. Haladyna (Eds.), Handbook of test development. New Jersey: Lawrence Erlbaum Associates.

Davey, T., Pommerich, M. & Thompson, D. T. (1999).  Pretesting alongside an operational CAT. Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada. 

Davey, T. & Thomas, L.  (1996, April).  Constructing adaptive tests to parallel conventional programs.  Paper presented at the annual meeting of the American Educational Research Association, New York.

David, L. A. & Lewis, C. (1996, April).  Person-fit indices and their role in the CAT environment.  Paper presented at the Annual meeting of the National Council on Measurement in Education, New York NY.

 

Davis, K. M., Chang, C. -H., Lai, J. -S., & Cella, D. (2002).  Feasibility and acceptability of computerized adaptive testing (CAT) for fatigue monitoring in clinical practice. Quality of Life Research, 11(7), 134.

#DA02-01.  Davis, L. L. (2002). Strategies for controlling item exposure in computerized adaptive testing with polytomously scored items. Unpublished doctoral dissertation, University of Texas, Austin. {PDF file, 1.83 MB}

#DA03-01. Davis, L. L. (2003, April).  Strategies for controlling item exposure in computerized adaptive testing with the generalized partial credit model.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL. {PDF file, 620 KB} [See published version, #DA04165]

#DA04165. Davis, L. L. (2004).  Strategies for controlling item exposure in computerized adaptive testing with the generalized partial credit model.  Applied Psychological Measurement, 28, 165-185.

#DAxx-01.  Davis, L. L. & Dodd, B. G.  (Undated). An examination of testlet scoring and item exposure constraints in the Verbal Reasoning section of the MCAT.  (See 2001 monograph below).  {PDF file, 653 KB}

Davis, L.L., & Dodd, B.G. (2001). An examination of testlet scoring and item exposure constraints in the verbal reasoning section of the MCAT.  MCAT Monograph Series: Association of American Medical Colleges.

Davis, L. L. & Dodd, B. G. (2003).  Item exposure constraints for testlets in the verbal reasoning section of the MCAT. Applied Psychological Measurement, 27, 335-356.

 

Davis, L. & Dodd, B. (March 2005). Strategies for controlling item exposure in computerized adaptive testing with the partial credit model. Pearson Educational Measurement Research Report 05-01.

 

Davis, L. L. & Dodd, B. G. (2008). Strategies for controlling item exposure in computerized adaptive testing with the partial credit model. Journal of Applied Measurement, 9, 1-17.

Davis, L. L., Pastor, D. A., Dodd, B. G., Chiang, C., & Fitzpatrick, S. (2000). An examination of exposure control and content balancing restrictions on item selection in CATs using the partial credit model.  Paper presented at the annual meeting of the American Educational Research Association, New Orleans, LA.

#DA03024.  Davis, L. L., Pastor, D. A., Dodd, B. G., Chiang, C.,  & Fitzpatrick, S. J. (2003). An examination of exposure control and content balancing restrictions on item selection in CATs using the partial credit model.  Journal of Applied Measurement, 4, 24-42.

De Ayala, R. J. (1989). A comparison of the nominal response model and the three-parameter logistic model in computerized adaptive testing. Educational and Psychological Measurement, 49, 789-805.

De Ayala, R. J. (1992). The nominal response model in computerized adaptive testing. Applied Psychological Measurement, 16, 327-343.

De Ayala, R. J. (1992).  The influence of dimensionality on CAT ability estimation. Educational and Psychological Measurement, 52, 513-528.

De Ayala, R.  J., Dodd, B G., & Koch, W. R. (1990).  A simulation and comparison of flexilevel and Bayesian computerized adaptive testing. Journal of Educational Measurement, 27, 227-239.

Diones, R. & Everson, H. (1994).  Computer adaptive testing: Assessment of the future. Curriculum/Technology Quarterly,  4 (2), 1-3.

De Ayala, R. J., & Koch, W. R. (1985).  ALPHATAB: A lookup table for Bayesian computerized adaptive testing.  Applied Psychological Measurement, 9, 326.

De Ayala, R. J., & Koch, W. R. (1987, April).  Computerized adaptive testing: A comparison of the nominal response model and the three-parameter logistic model.  Paper presented at the annual meeting of the National Council on Measurement in Education, Washington DC.

De Ayala, R. J., Dodd, B. G., & Koch, W. R. (1992). A comparison of the partial credit and graded response models in computerized adaptive testing. Applied Measurement in Education, 5, 17-34.

#deBE00-01.  De Beer, M. (2000).  Learning Potential Computerised Adaptive Test (LPCAT): Technical Manual.  Pretoria: UNISA.

#deBE00-02.  De Beer, M. (2000).  Learning Potential Computerised Adaptive Test (LPCAT): User's Manual.  Pretoria: UNISA.

De Beer, M. (2000). The construction and evaluation of a dynamic computerised adaptive test for the measurement of learning potential. Unpublished D .Litt et Phil dissertation. University of South Africa, Pretoria.

De Beer, M. (2002, June).  Utility of Learning Potential Computerised Adaptive Test (LPCAT) scores in predicting academic performance of bridging students: A comparison with other predictors. Paper presented at the 5th Annual Society for Industrial and Organisational Psychology Congress, Pretoria, South Africa.

#deBE03-01.  De Beer, M.  (2003, June). A comparison of learning potential results at various educational levels. Paper presented at the 6th Annual Society for Industrial and Organisational Psychology of South Africa (SIOPSA) conference, 25-27 June 2003.  {PDF file, 391 KB}

#deBE03-02.  De Beer, M. (2003). Development of the Learning Potential Computerised Adaptive Test (LPCAT). Unpublished manuscript.  {PDF file, 563 KB}

De Beer, M. (2007) Use of CAT in dynamic testing. In D. J. Weiss (Ed.),  Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing.  {PDF file, 133 KB}

De la Torre, R. (1991).  The development and evaluation of a system for computerized adaptive testing.  Unpublished doctoral dissertation, University of Iowa.

De la Torre, R.  & Vispoel, W. P. (1991, April).  The development and evaluation of a computerized adaptive testing system.  Paper presented at the annual meeting of the American Educational Research Association, Chicago IL. (ERIC No. ED 338 711)

de Gruijter, D. N. (1987).  Wilcox' closed sequential testing procedure in stratified item domains. Methodika, 1(1), 3-12.

#deGR77-01.  De Gruijter, D. N. M.  (1977).  A two-stage testing procedure (Memorandum 403-77).  University of Leyden, The Netherlands, Educational Research Center.

#DE03-01.  Deng, H. & Ansley, T. (2003, April).  To stratify or not:  An investigation of CAT item selection procedures under practical constraints.  Paper presented at the Annual meeting of the National Council on Measurement in Education, Chicago IL.  {PDF file, 186 KB} 

#DE01-01.  Deng, H. & Chang, H.-H.  (2001).  a-stratified computerized adaptive testing with unequal item exposure across strata. Paper presented at the annual meeting of the American Educational Research Association, Seattle WA.

 

Desmarais, M. C. & Pu, X (no date). Computer Adaptive Testing With Bayesian Networks: A Comparison with IRT.

Desmarais, M. C., Pu, X, & Blais, J.-G. (2007).  Partial order knowledge structures for CAT applications. In D. J. Weiss (Ed.), Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing.  {PDF file, 475 KB}

De Witt, J. J. & Weiss, D. J. (1974).  A computer software system for adaptive ability measurement (Research Report 74-1).  Minneapolis MN:  University of Minnesota, Department of Psychology, Computerized Adaptive Testing Laboratory.

#DE76104.  DeWitt, L. J. & Weiss, D. J.  (1976).  Hardware and software evolution of an adaptive ability measurement system.  Behavior Research Methods and Instrumentation, 8, 104-107.

Diao, Q., and Reckase, M. (2009).  Comparison of ability estimation and item selection methods in multidimensional computerized adaptive testing. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.  {PDF File, 342 KB}

#DI86-189.   Divgi, D. R. (1986). Determining the sensitivity of CAT-ASVAB scores to changes in item response curves with the medium of administration (Report No. 86-189). Alexandria VA: Center for Naval Analyses.

#DI87-161.  Divgi, D. R. (1987, August).  Properties of some Bayesian scoring procedures for computerized adaptive tests (Research Memorandum CRM 87-161).  Alexandria VA:  Center for Naval Analyses.

Divgi, D. R. (1991, September).  An analysis of CAT-ASVAB scores in the Marine Corps JPM data (CRM- 91-161).  Alexandria VA:  Center for Naval Analysis.

#DO04-01.  Do, B.-R., Chuah, S. C., & Drasgow, F. (2004).  Item parameter recovery with adaptive tests.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego CA.  {PDF file, 379 KB}

Dodd, B. G. (1987, April). Computerized adaptive testing with the rating scale model. Paper presented at the Fourth International Objective Measurement Workshop, Chicago.

Dodd, B. G. (1990). The effect of item selection procedure and stepsize on computerized adaptive attitude measurement using the rating scale model. Applied Psychological Measurement, 14, 355-366.

#DO95005. Dodd, B. G., De Ayala, R. J., & Koch, W. R. (1995). Computerized adaptive testing with polytomous items. Applied Psychological Measurement, 19, 5-22.

Dodd, B. G., & Fitzpatrick, S. J. (1998, September). Alternatives for scoring computerized adaptive tests. Paper presented at an Educational Testing Service-sponsored colloquium entitled Computer-based testing: Building the foundations for future assessments, Philadelphia PA.

Dodd, B. G., Koch, W. R., & De Ayala, R. J. (1988, April). Computerized adaptive attitude measurement: A comparison of the graded response and rating scale models. Paper presented at the annual meeting of the American Educational Research Association, New Orleans.

Dodd, B. G., Koch, W. R., & De Ayala, R. J. (1989). Operational characteristics of adaptive testing procedures using the graded response model. Applied Psychological Measurement, 13, 129-143.

#DO9361.  Dodd, B. G., Koch, W. R., & De Ayala, R. J. (1993). Computerized adaptive testing using the partial credit model: Effects of item pool characteristics and different stopping rules. Educational and Psychological Measurement, 53, 61-77.

Dolan, S. (1993).  A comparison of computer adaptive test administration methods.  Unpublished doctoral dissertation, University of Chicago.

Doucette, D. (Ed.).  (1988).  Computerized adaptive testing:  The state of the art in assessment at three community colleges.  Laguna Hills CA:  League for Innovation in the Community College.

Dowling, C. E., Hockemeyer, C., & Ludwig, A .H. (1996) Adaptive assessment and training using the neighbourhood of knowledge states.  In Frasson, C., Gauthier, G., & Lesgold, A. (eds.) Intelligent Tutoring Systems, Third International Conference, ITS'96, Montréal, Canada, June 1996 Proceedings. Lecture Notes in Computer Science 1086. Berlin Heidelberg: Springer-Verlag 578-587.

Dowling, C.E. and Kaluscha, R. (1995, August).  Prerequisite relationships for the adaptive assessment of knowledge.  In Greer, J. (Ed.) Proceedings of AIED'95, 7th World Conference on Artificial Intelligence in Education, Washington, DC, AACE 43-50.

Drasgow, F., & Olson-Buchanan, J. B. (Eds.). (1999). Innovations in computerized assessment. Hillsdale NJ: Erlbaum.

#DU93181.  Du, Y., Lewis, C. & Pashley, P. J. (1993).  Computerized mastery testing using fuzzy set decision theory.  Applied Measurement in Education, 6, 181-193.  (Also Educational Testing Service Research Report 94-37)

See #DU93181.  Du, Y., Lewis, C., Pashley, P. J.  (1994).  Computerized mastery testing using fuzzy set decision theory (Research Report 94-37).  Princeton NJ:  Educational Testing Service.

Dunkel, P. A. (1997).  Computer-adaptive testing of listening comprehension: A blueprint of CAT Development. The Language Teacher Online 21, no. 10. <http://langue.hyper.chubu.ac.jp/jalt/pub/tlt/97/oct/dunkel.html>.

Dunkel, P. (1999). Research and  development of a computer-adaptive test of listening comprehension in the less-commonly taught language Hausa. In M. Chalhoub-Deville (ed). Issues in computer-adaptive testing of reading proficiency. Cambridge, UK : Cambridge University Press.

- E -

 

Economides, A.A. (2005). Adaptive orientation methods in computer adaptive testing. Proceedings E-Learn 2005 World Conference on E-Learning in Corporate, Government, Healthcare, and Higher Education, pp. 1290-1295, Vancouver, Canada, AACE, October 2005.

 

Economides, A.A. (2005). Computer adaptive testing quality requirements. Proceedings E-Learn 2005 World Conference on E-Learning in Corporate, Government, Healthcare, and Higher Education, pp. 288-295, Vancouver, Canada, AACE, October 2005.

 

Economides, A.A. (2005). Personalized feedback in CAT. WSEAS Transactions on Advances in Engineering Education, Issue 3, Volume 2, 174-181, July 2005.

 

Educational Testing Service. (1993). The GRE computer adaptive testing program (CAT): Integrating convenience, assessment, and technology. Princeton, NJ: Educational Testing Service.

Edwards, M. C. & Thissen, D.  (2007).  Exploring potential designs for multi-form structure computerized adaptive tests with uniform item exposure. In D. J. Weiss (Ed.), Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing.  {PDF file, 295 KB}

Egberink, I. J. L. & Veldkamp, B. P.  (2007).   The development of a computerized adaptive test for integrity. In D. J. Weiss (Ed.), Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing.  {PDf file, 290 KB}

Eggen, T. J. H. M. (1998). Item selection in adaptive testing with the sequential probability ratio test. Measurement and Research Department Report, 98-1. Arnhem: Cito. [see APM paper, 1999; also reprinted as Chapter 6 in #EG04-01.]

Eggen, T. J. H. M. (1999).  Item selection in adaptive testing with the sequential probability ratio test.  Applied Psychological Measurement, 23, 249-261. [Reprinted as Chapter 6 in #EG04-01]

#EG01-1.  Eggen, T. J. H. M.  (2001).  Overexposure and underexposure of items in computerized adaptive testing (Measurement and Research Department Reports 2001-1).  Arnhem, The Netherlands.  CITO Groep.  {PDF file, 276 KB}

#EG04-01  Eggen, T. J. H. M. (2004).  Contributions to the theory and practice of computerized adaptive testing.  Arnhem, The Netherlands:  Citogroep.

Eggen, T. J. H. M. (2007).  Choices in CAT models in the context of educational testing. In D. J. Weiss (Ed.), Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing.  {PDF file, 123 KB}

#EG96-3.  Eggen, T. J. H. M, & Straetmans, G. J. J. M. (1996). Computerized adaptive testing for classifying examinees into three categories (Measurement and Research Department Rep. 96-3). Arnhem, The Netherlands: Cito. [Reprinted in as Chapter 5 in #EG04-01]

#EG00713. Eggen, T. J. H. M, & Straetmans, G. J. J. M. (2000). Computerized adaptive testing for classifying examinees into three categories.  Educational and Psychological Measurement, 60, 713-734. [Reprinted as Chapter 5 in #EG04-01]

#EG03-01.  Eggen, T. & Verschoor, A.  (2003, October).  Optimal testing with easy items in computerized adaptive testing.  Paper presented at the conference of the International Association for Educational Assessment, Manchester UK. {PDF file, 216 KB} [See expanded version as Chapter 7 in #EG04-01]

#EG03-02.  Eggen, T. J. H. M. & Verschoor, A. J. (2004).  Optimal testing with easy items in computerized adaptive testing (Measurement and Research Department Report 2004-2).  Arnhem, The Netherlands:  Cito Group.

#EI93-55.  Eignor, D. R. (1993).  Deriving comparable scores for computer adaptive and conventional tests: An example using the SAT.  (ETS Research Report RR-93-5).  Princeton NJ:  Educational Testing Service. (Also presented at the 1993 National Council on Measurement in Education meeting in Atlanta GA.)

Eignor, D. R., Folk, V. G., Li, M.-Y., & Stocking, M. L. (1994, April).  Pinpointing PRAXIS I CAT characteristics through simulation procedures.  Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans, LA.

Eignor, D. R. & Schaffer, G.A. (1995, April).  Comparability studies for the GRE CAT General Test and the NCLEX using CAT.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco.

#EI93-56.  Eignor, D. R., Stocking, M. L., Way, W. D., & Steffen, M. (1993).  Case studies in computer adaptive test design through simulation (Research Report RR-93-56).  Princeton NJ:  Educational Testing Service.  (also presented at the 1993 National Council on Measurement in Education meeting in Atlanta GA)

Eignor, D. R., Way, W. D., & Amoss, K.E. (1994, April).  Establishing the comparability of the NCLEX using CAT with traditional NCLEX examinations. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans, LA.

Eignor, D. A., Way, W. D., Stocking, M., & Steffen, M. (1993).  Case studies in computerized adaptive test design through simulation (Research Report 93-56).  Princeton NJ:  Educational Testing Service.

Elwood, D. L. (1969).  Automation of psychological testing.  American Psychologist, 24, 287-289.

Elwood, D. L. & Griffin, H.R. (1972).  Individual intelligence testing without the examiner: reliability of an automated method. Journal of Consulting and Clinical Psychology, 38, 9-14.

Embretson, S. E. (1999). Generating items during testing: Psychometric issues and models.  Psychometrika, 64, 407-433.

Engdahl, B. (1992).  Computerized adaptive assessment of cognitive abilities among disabled adults. ERIC Document No. ED301274

#EN77158.  English, R. A., Reckase, M. D., & Patience, W. M. (1977).  Application of tailored testing to achievement measurement.  Behavior Research Methods and Instrumentation, 9, 158-161.

Epstein, K. I. & Knerr, C. S.  (1978).  Applications of sequential testing procedures to performance testing.  In D. J. Weiss (Ed.), Proceedings of the 1977 Computerized Adaptive Testing Conference.  Minneapolis MN: University of Minnesota, Department of Psychology, Psychometric Methods Program.

- F -

Fan, M. & Hsu, Y.  (1995, June).  The effect of ability estimation for polytomous CAT in different item selection procedures.  Paper presented at the Annual meeting of the Psychometric Society, Minneapolis MN..

#FA96-02.  Fan, M., & Hsu, Y. (1996, April).  Multidimensional computer adaptive testing.  Paper presented at the Annual Meeting of the American Educational Research Association, New York NY.

#FA96-01.  Fan, M., & Hsu, Y. (1996, April). Utility of Fisher information, global information and different starting abilities in mini CAT. Paper presented at the Annual Meeting of the National Council on Measurement in Education, New York NY.

#FA99-01.  Fan, M., Thompson, T., & Davey, T. (1999, April). Constructing adaptive tests to parallel conventional programs. Paper presented at the annual meeting of the National council on Measurement in Education, Montreal.

#FA02-01.  Fan, M. & Zhu.  (2002, April).  A further study on adjusting CAT item selection starting point for individual examinees.  Paper presented at the annual meeting of the American Educational Research Association, New Orleans LA.

 

Fayers, P. (2007). Applying item response theory and computer adaptive testing: The challenges for health outcomes assessment. Quality of Life Research. 16:187–194.

Featherman, C. M., Subhiyah, R. G., & Hadadi, A. (1996, April).  Effects of randomesque item selection on CAT item exposure rates and proficiency estimation under 1- and 2-PL models.  Paper presented at the annual meeting of the American Educational Research Association, New York.

Featherman, C. M., Subhiyah, R. G., & Hadadi, A. (1996, April).  New algorithms for item selection and exposure and proficiency estimation under 1- and 2-PL models.  Paper presented at the annual meeting of the American Educational Research Association, New York.

#FE69-49.  Ferguson, R. L. (1969). Computer-assisted criterion-referenced measurement (Working Paper No. 49). Pittsburgh PA: University of Pittsburgh, Learning and Research Development Center. (ERIC No. ED 037 089).

Ferguson, R. L. (1969). The development, implementation, and evaluation of a computer-assisted branched test for a program of individually prescribed instruction. Doctoral dissertation, University of Pittsburgh.  Dissertation Abstracts International, 30-09A, 3856. (University Microfilms No. 70-4530).

#FE70-01.  Ferguson, R. L. (1970, March).  A model for computer-assisted criterion-referenced measurement.  Paper presented at the annual meeting of the American Educational Research Association/National Council on Measurement in Education, Minneapolis MN.

#FE70025.  Ferguson, R. L. (1970).  A model for computer-assisted criterion-referenced measurement.  Education, 1970, 91, 25-31.

Ferguson, R. L. (1970).  Computer assistance for individualizing measurement.  Pittsburgh PA:  University of Pittsburgh, Learning Research and Development Center.

Ferguson, R. L. (1970).  Computer assistance for individualizing measurement.  Computers and Automation, March 1970, 19.

Ferguson, R. L. (1971).  A model for computer-assisted criterion-referenced measurement.  Education, 81, 25-31.

#FE71-01.  Ferguson, R. L. (1971, March).  Computer assistance for individualizing measurement.  Pittsburgh PA:  University of Pittsburgh R & D Center.

Ferguson, R. L. & Hsu, T. (1971). The application of item generators for individualizing mathematics testing and instruction (Report 1971/14). Pittsburgh PA: University of Pittsburgh Learning Research and Development Center.

#FE73-01.  Ferguson, R. L. & Novick, M. R. (1973).  Implementation of a Bayesian system for decision analysis in a program of individually prescribed instruction (Research Report No. 60).  Iowa City IA:  American College Testing Program.

Ferrara, S., Frances, A, Gilmartin, D., Knott, T., Michaels, H., Pollack, J. Schuder, T., Vaeth, R,. & Wise, S. (1996, April).  A qualitative study of the information examinees consider during item review on a computer-adaptive test.  Paper presented at the annual meeting of the National Council on Measurement in Education, New York.

Fields, F. A. (1992).  Computerized adaptive testing for NCLEX-PN. Journal of  Practical.Nursing, 42, 8-10.

Finkelman, M., Weiss, D. J., & Kim-Kang, G.  (2009).  Item election and hypothesis testing for the adaptive measurement of change.  In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF File, 228 KB}

Finkelman, M., Nering, M. L., & Roussos, L. A. (2009).  A conditional exposure control method for multidimensional adaptive testing.  Journal of Educational Measurement, 46, 84-103.

Finney, S. J., Smith, R. W., & Wise, S. L. (1999, April).  The effects of judgment-based stratum classifications on the efficiency of stratum scored CATs.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada (or New York?.)

Fischer, G. H. & Pendl, P. (1980).  Individualized testing on the basis of the Rasch model.  In . J. Th. Van der Kamp, W. F. Langerak, & D. N. M. de Gruijter (Eds.).  Psychometrics for educational debates.  New York: Wiley.

Flaugher, R.  (2000).  Item pools.  In Wainer, H.  (2000).  Computerized adaptive testing:  a primer.  Mahwah, NJ: Erlbaum.

 

Fliege, H., Becker, J., Walter, O. B., Bjorner, J. B., Klapp, B. F., & Rose, M. (2005). Development of a computer-adaptive test for depression (D-CAT). Quality of Life Research, 14, 2277–2291.

Folk, V. G. (1990, April).  Adaptive testing and item difficulty order effects.  Paper presented at the annual meeting of the American Educational Research Association, Boston MA.

Folk, V. & Golub-Smith, M. (1996) Calibration of on-line pretest data using BILOG. Paper presented at the annual meeting of National Council on Measurement in Education, Chicago.

Folk, V.G., & Green, B. F.  Adaptive estimation when the unidimensionality assumption of IRT is violated.  Applied Psychological Measurement, 13, 373-389.

Folk, V. G. & Wingersky, M.  (1999, April).  Fixed length CATs, or CATs in need of fixing.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

Forbey, J. D., & Ben-Porath, Y. S. (2007). Computerized adaptive personality testing: a review and illustration with the MMPI-2 Computerized Adaptive Version. Psychological Assessment, 19(1), 14-24.

Forbey, J. D., Handel, R. W., & Ben-Porath, Y.S.  (June, 1996).  Computerized adaptive administration of the MMPI-A.  Paper presented at the 31st Annual Symposium and Recent Developments in the use of the MMIP-2 and MMPI-A, Minneapolis MN.

#FO00083.  Forbey, J. D., Handel, R. W., & Ben-Porath, Y.S.  (2000).  A real data simulation of computerized adaptive administration of the MMPI-A.  Computers in Human Behavior, 16, 83-96.

Forker, J. E. & McDonald, M. E.  (1996). Methodologic trends in the healthcare professions: Computer adaptive and computer simulation testing. Nurse Education, 21,13-14.

#FR03-01.  French, B. F. & Thompson, T. T. (2003, April).  The evaluation of exposure control procedures for an operational CAT.  Paper presented at the annual meeting of the American Educational Research Association, Chicago IL.  {PDF file, 199 KB}

Frick, T. W. (1988).  A comparison of  three decision models for adapting the length of computer-based mastery tests.  Unpublished manuscript (submitted to Journal of Educational Computing Research).

Frick, T. W.  (1989).  Bayesian adaptation during computer-based tests and computer-guided practice exercises.  Journal of Educational Computing Research, 5(1), 89-114.

Frick, T.W.  (1989).  A comparison of an expert systems approach to computerized adaptive testing and an IRT model.  Unpublished manuscript (submitted to American Educational Research Journal).

Frick, T. J. (1990).  A comparison of three decision models for adapting the length of computerized mastery tests.  Journal of Educational Computing Research, 6(4), 479-513.

Frick, T. W. (1992).  Computerized adaptive mastery tests as expert systems.  Journal of Educational Computing Research, 8(2), 187-213.

Frick, T. W., Plew, G.T., & Luk, H.-K. (1989).  EXSPRT: An expert systems approach to computer-based adaptive testing.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco.

Friedman, D,. Steinberg, A, & Ree, M. J. (1981).  Adaptive testing without a computer.  Catalog of Selected Documents in Psychology, Nov. 1981, 11, 74-75 (Ms. No. 2350).  AFHRL Technical Report 80-66.

Fu, S. & Desmarais, M. (2006). Multidimensional computerized adaptive testing based on Bayesian theory. Education and Technology Conference, Calgary, 2006.

- G -

Gafni, N., Cohen, Y., Roded, K., Baumer, M., & Moshinsky, A. (2009).  Applications of CAT in admissions to higher education in Israel: Twenty-two years of experience. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.  {PDF file, 326 KB}

Gallagher, A. Bridgeman, B., (ETS) & Calahan, C. (Fordham)  (1999, April).  Fairness in computer-based testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

#GA02812.  Gardner, W., Kelleher, K. J., & Pajer, K. A. (2002).  Multidimensional adaptive testing for mental health problems in primary care. Medical Care, 40, 812-23. {PDF file, 132 KB}  [multidimensional polytomous CAT]

 

Gardner, W., Shear, K., Kelleher, K., Pajer, K., Mammen, O., Buysse, D., et al. (2004).  Computerized adaptive measurement of depression: A simulation study. BMC Psychiatry, 4(1),13.

Garrison, W. M. (1985).  Monitoring item calibrations from data yielded by an adaptive testing procedure.  Educational Research Quarterly, 10, 9-12.

#GA82-01.  Garrison, W. M. & Baumgarten, B. S.  (1982, March).  Assessing mathematics achievement with a tailored testing program.  Paper presented at the annual meeting of the American Educational Research Association, New York.

Garrison, W. M. & Baumgarten, B. S. (1986).  An application of computer adaptive testing with communication handicapped examinees.  Educational and Psychological Measurement, 46, 23-25. 

Georgiadou, E., Triantafillou, E. & Economides, A.A. (2006). Evaluation parameters for computer adaptive testing. British Journal of Educational Technology, Vol. 37, No 2, 261-278, March 2006.

Georgiadou, E. G., Triantafillou, E., & Economides, A. A. (2007). A review of item exposure control strategies for computerized adaptive testing developed from 1983 to 2005. Journal of Technology, Learning, and Assessment, 5(8). Retrieved 25 July 2007 from http://www.jtla.org.  {PDF file, 326 KB}

#GE01-01.  Geranpayeh. A. (2001). CB BULATS: Examining the reliability of a computer based test using test-retest method.  Cambridge ESOL Research Notes, Issue 5, July 2001, pp. 14-16.   {PDF file, 456 KB}

Gershon, R. (1989).  CAT administrator  [Computer program]. Chicago: Micro Connections.

Gershon, R. C. (1994).  CAT software system [computer program.]  Chicago IL: Computer Adaptive Technologies.  

Gershon, R. C. (year?). The effect of individual differences variables on the assessment of ability for computerized adaptive testing.  Dissertation Abstracts International, Section B: The Sciences and Engineering, 57 (6-B), 4085.

 

Gershon, R. C. (2004) The ABCs of Computerized Adaptive Testing. In T. M. Wood & W. Zhi (Eds.), Measurement issues and practice in physical activity. Champaign, IL: Human kinetics.

Gershon, R. C. (2005). Computer adaptive testing. Journal of Applied Measurement 6:109-27.

Gershon, R.C. & Bergstrom, B.  (1991, April).  Individual differences in computer adaptive testing:  Anxiety, computer literacy, and satisfaction.  Paper presented at the annual meeting of the National Council on Measurement in Education.

Gershon, R.C. & Bergstrom, B.  (1995, April).  Does cheating on CAT pay: Not.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco. (ERIC ED 392 844)

#GI79-06.  Gialluca, K. A., & Weiss,  D. J. (1979). Efficiency of an adaptive inter-subtest branching strategy in the measurement of classroom achievement (Research Report 79-6). Minneapolis: University of Minnesota, Department of Psychology, Psychometric Methods Program.  {PDF file, 2.782 MB}

Gibbons, R. D., Weiss, D. J., Kupfer, D. J., Frank, E., Fagiolini, A., Grochocinski, V. J., Bhaumik, D., K., Stover, A., Bock, R. D., & Immekus, J. C. (2008). Using computerized adaptive testing to reduce the burden of mental health assessment.  Psychiatric Services, 59(4), 361-368.  {PDF file, 107 KB}

 

Gierl, M. J. & Jiawen Zhou, J. (2008). Computer adaptive-attribute testing: A new approach to cognitive diagnostic assessment.  Zeitschrift für Psychologie / Journal of Psychology,  216(1), 29–39.

Giouroglou, H. & Economides, A.A. (2003) Cognitive CAT in foreign language assessment. Proceedings 11th International PEG Conference, Powerful ICT Tools for Learning and Teaching, PEG '03, CD-ROM, 2003.

Giouroglou, H. & Economides, A.A. (2004). State-of-the-art and adaptive open-closed items in adaptive foreign language assessment. Proceedings 4th Hellenic Conference with International Participation: Informational and Communication Technologies in Education, Athens, 747-756, 2004.

Giouroglou, H. & Economides, A.A. (2005). An implemented theoretical framework for a common European foreign language adaptive assessment. Proceedings ICODL 2005, 3rd International Conference on Open and Distance Learning 'Applications of Pedagogy and Technology', 339-350, Greek Open University, Patra, Greece, 2005.

Giouroglou, H. & Economides, A.A. (2005). The development of the adaptive item language assessment (AILA) for mixed-ability students. Proceedings E-Learn 2005 World Conference on E-Learning in Corporate, Government, Healthcare, and Higher Education, 643-650, Vancouver, Canada, AACE, October 2005.

Glas, C. A. W. (1998).  Quality control of on-line calibration in computerized adaptive testing (Research Report 98-03).  Enschede, The Netherlands:  University of Twente, Faculty of Educational Science and Technology, Department of Measurement and Data Analysis.

Glas, C. A. W.  (1988).  The Rasch model and multi-stage testing.  Journal of Educational and Behavioral Statistics, 13, 45-52.

Glas, C. A. W. (2000).  Item calibration and parameter drift.  In W. J. van der linden & C. A. W. Glas (Eds.).  Computerized adaptive teting:  Theory and practice (pp.183-199).  Norwell MA: Kluwer Academic.

Glas, C. A. W., Meijer, R. R., & van Krimpen-Stoop, E. M. L. A. (1997).  Statistical tests for person misfit in computerized adaptive testing (Research Report RR 97-08).  Enschede, The Netherlands:  University of Twente.

Glas, C. A. W. & Van der Linden, W. J.  (2001).  Modeling variability in item parameters in CAT.  Paper presented at the Annual Meeting of the National Council on Measurement in Education, Seattle WA.

#GL03247.  Glas, C. A. W. & Van der Linden, W. J.  (2003).  Computerized adaptive testing with item cloning.  Applied Psychological Measurement, 27, 247-261. (Also Research Report 01-10, Univerity of Twente.)

Glas, C. A. W., & Veerkamp, W. J. J. (1999). Item calibration and parameter drift. In W. J. van der Linden & C. A. W. Glas (Eds.), Computer adaptive testing: Theory and practice. Norwell MA: Kluwer.

#GL98-01.  Glas, C. A. W., Meijer, R. R., & van Krimpen-Stoop, E. M. L. A. (1998).  Statistical tests for person misfit in computerized adaptive testing (Research Report 98-01).  Enschede, The Netherlands :  University of Twente, Faculty of Educational Science and Technology, Department of Measurement and Data Analysis.

Glas, C.A.W., Wainer, H.,  & Bradlow, E.T. (2000). MML and EAP estimation in testlet-based adaptive testing. Dans W.J. van der Linden et C.A.W. Glas (Es) : Computerized adaptive testing: Theory and practice. Dordrecht : Kluwer.

#GL98-15.  Glas, C. A. W. & Vos, H. J.  (1998). Adaptive mastery testing using the Rasch model and Bayesian sequential decision theory (Research Report 98-15).  Enschede, The Netherlands:  University of Twente, Faculty of Educational Science and Technology, Department of Measurement and Data Analysis.

#GL00-01.  Glas, C. A. W. & Vos, H. J.  (2000). Adaptive mastery testing using a multidimensional IRT model and Bayesian sequential decision theory (Research Report 00-06).  Enschede, The Netherlands:  University of Twente, Faculty of Educational Science and Technology, Department of Measurement and Data Analysis.

 

Gorham, W. A.  ( 1976).  Opening remarks.  In W. H. Gorham (Chair),  Computers and testing:  Steps toward the inevitable conquest (PS 76-1).  Symposium presented at the 83rd annual convention of the American Psychological Association, Chicago IL.   Washington DC: U.S. Civil Service Commission, Personnel research and Developement Center.  (NTIS No. PB 261 694).

 

Gorin, J., Dodd, B. G., Fitzpatrick, S. J., & Shieh, Y. Y. (2005). Computerized adaptive testing with the partial credit model: Estimation procedures, population distributions, and item pool characteristics. Applied Psychological Measurement, 29, 533-546.

#GO80-01.  Gorman, S.  (1980).  A comparative evaluation of two Bayesian adaptive ability estimation procedures.  Unpublished doctoral dissertation, the Catholic University of America.

#GO80-02.  Gorman, S. (1980). A comparison of the accuracy of Bayesian adaptive and static tests using a correction for regression.  In D. J. Weiss (Ed.), Proceedings of the 1979 Computerized Adaptive Testing Conference (pp. 35-50).  Minneapolis MN:  University of Minnesota, Department of Psychology, Computerized Adaptive Testing Laboratory.  {PDF file, 735 KB}

#GR01-01.  Grabovsky, I., Chang, H.-H., & Ying, Z.  (2001, April).  Deriving a stopping rule for sequential adaptive tests.  Paper presented at the annual meeting of the American Educational Research Association, Seattle WA.  {PDF file, 111 KB} 

Greaud, V. A., & Green, B. F. (1984). Analysis of speeded test data from experimental CAT system.  Baltimore MD:  Johns Hopkins University, Department of Psychology.

Greaud, V. A., & Green, B. F. (1986). Equivalence of conventional and computer presentation of speed tests.  Applied Psychological Measurement, 10,  23-34.

#GR70184.  Green, B. F. (1970).  Comments on tailored testing.  In W. H. Holtzman, (Ed.), Computer-assisted instruction, testing, and guidance (pp. 184-197).  New York:  Harper & Row. [See #LO70139 and #HO70198]

#GR75-01.  Green, B. F. (1976).  Discussion.  In C. K. Clark (Ed.),  Proceedings of the First Conference on Computerized Adaptive Testing (pp. pp. 118-119).  Washington DC: U.S. Government Printing Office.  {PDF file, 347 KB}

Green, B. F.  (1983).  The promise of tailored tests. In H. Wainer & S. Messick (Eds.)., Principals of  modern psychological measurement (pp. 69-80).  Hillsdale NJ: Erlbaum.

Green, B. F. (1983).  Adaptive testing by computer.  In R. B. Ekstrom (ed.), Measurement, technology, and individuality in education.  New directions for testing and measurement, Number 17.  San Francisco: Jossey-Bass.

Green, B. F. (1988).  Construct validity of computer-based tests.  In H. Wainer and H. Braun (Eds.), Test validity (pp. 77-103).  Hillsdale NJ: Erlbaum.

#GR88223.  Green, B. F. (1988).  Critical problems in computer-based psychological measurement,  Applied Measurement in Education, 1, 223-231.

Green, B. F. (1997, March).  Alternate methods of scoring computer-based adaptive tests.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.

Green, B. F., Bock, R. D., Humphreys, L. G., Linn, R. L., & Reckase, M. D. (1984). (11982, May).  Evaluation plan for the computerized adaptive vocational aptitude battery (Research Report 82-1).  Baltimore MD: The Johns Hopkins University, Department of Psychology.

Green, B. F., Bock, R. D., Humphreys, L. G., Linn, R. L., & Reckase, M. D. (1984). Technical guidelines for assessing computerized adaptive tests. Journal of Educational Measurement, 21, 347-360.

Green, B. F., Bock, R. D., Linn, R. L., Lord, F. M., & Reckase, M. D. (1984). A plan for scaling the computerized adaptive Armed Services Vocational Aptitude Battery. Journal of Educational Measurement, 21, 347-360.

Green, B. F. & Thomas, T. J. (1990).  Utility of predicting starting abilities in sequential computer-based adaptive tests (Research Report 90-1).  Baltimore MD: Johns Hopkins University, Department of Psychology.

Greenwood, D. I. & Taylor, C. (1965).  Adaptive testing in an older population.  Journal of Psychology, 60, 193-198.

Grist, S., Rudner, L. M. & Wise, L. L. Computerized adaptive tests. ERIC Clearinghouse on Tests, Measurement, and Evaluation, no. 107.

Gu, L. & Reckase, M.D.  (2007).  Designing optimal item pools for computerized adaptive tests with Sympson-Hetter exposure control.  In D. J. Weiss (Ed.), Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing.  {PDF file, 1.13 MB}

#GU75-01.  Gugel, J. F. Schmidt, F. L., & Urry, V. W. (1976).  Effectiveness of the ancillary estimation procedure.  In C. K. Clark (Ed.),  Proceedings of the First Conference on Computerized Adaptive Testing (pp. 103-106).  Washington DC: U.S. Government Printing Office.  {PDF file, 252 KB}

#GU02-01.  Guille, R.  Lipner, R. S., & Norcini, J. J. (2002, April).  Content-stratified random item selection in computerized classification testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans LA. 

Guo, F.  (1999, April).  Managing CAT item development in the face of uncertainty.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada

Guo, F.  (2007).  CAT Security: A practitioner’s perspective.  In D. J. Weiss (Ed.), Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing.  {PDF file, 104 KB}

Guo, F. (2009).  Quantifying the impact of compromised items in CAT.  In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF File, 438 KB}

#GU03-01.  Guo, F. & Wang, G. (2003, April).  Online calibration and scale stability of a CAT program.  Paper presented at the annual meeting of the American Educational Research Association, Chicago IL.  {PDF file, 274 KB}

Guo, F. Stone, E. & Cruz, D. (2001). On-line Calibration Using PARSCALE Item Specific Prior Method: Changing Test Population and Sample Size. Paper presented at National Council on Measurement in Education Annual Meeting, Seattle, Washington.

Guo, F., Way, W. D., & Reshetar, R.  (2000, April).  Test security and the development of computerized tests.  Paper presented at the National Council on Measurement in Education invited symposium: Maintaining test security in computerized programs--Implications for practice, New Orleans.

Gushta, M. M. (2003). Standard-setting issues in computerized-adaptive testing. Paper Prepared for Presentation at the Annual Conference of the Canadian Society for Studies in Education, Halifax, Nova Scotia, May 30th, 2003.

Guyer, R. D. (2008).  Effect of early misfit in computerized adaptive testing on the recovery of theta.  Unpublished Ph.D. dissertation, University of Minnesota, Minneapolis MN. {PDF file, 1,004 KB}

Guyer, R. D. and Weiss, D. J. (2009).  Effect of early misfit in computerized adaptive testing on the recovery of theta. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF File, 212 KB}

                                         

- H -

Hadidi, A.  & Luecht, R. M. (1997, March).  Psychometric mode effects and fir issues with respect to item difficulty estimates.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.

 

Haley, S. M., Coster, W. J., Andres, P. L., Kosinski, M. & Ni, P. S. (2004). Score comparability of short-forms and computerized adaptive testing: Simulation study with the activity measure for post-acute care (am-pac). Archives of Physical Medicine and Rehabilitation, 85, 661-666.

 

Haley, S. M., Ni, P., Hambleton, R. K., Slavin, M. D. & Jette, A. M. (2006). Computer adaptive testing improves accuracy and precision of scores over random item selection in a physical functioning item bank. Journal of Clinical Epidemiology, 59, 1174-1182.

Halkitis, P. N. & Leahy, J. M. (1993).  Computerized adaptive testing: The future is upon us. Nursing and Health Care, 14, 378-85.

Hambleton, R. H. (1973).  A review of testing and decision-making procedures (Technical Bulletin No. 15).  Iowa City IA:  American College Testing Program.

Hambleton, R. K. (1974).  Testing and decision-making procedures for selected individualized instruction programs.  Review of Educational Research, 10, 371-400.

Hambleton, R. K.  (2002, April).  Impact of item quality and item bank size on the psychometric quality of computer-based credentialing exams.  Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans LA.

 

Hambleton, R. K. (2005). Applications of item response theory to improve health outcomes assessment: Developing item banks, linking instruments, and computer-adaptive testing. In J. Lipscomb, C. C. Gotay, & C. Snyder (Eds.), Outcomes assessment in cancer (pp.445-464). Cambridge, UK: Cambridge University Press.

See #JO02-01.  Hambleton, R. K., Jodoin, M., & Zenisky, A.  (2002, April).  Impact of selected factors on the psychometric quality of credentialing examinations administered with a sequential testlet design.  Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans LA.

Hambleton , R. & Xing, D. (2004). Computer-based test designs with optimal and non-optimal tests for making pass-fail decisions. Research Report, University of Massachusetts, Amherst, MA.

Hambleton, R. K., Zaal, J. N., & Pieters, J. P. M.  (1991).  Computerized adaptive testing:  Theory, applications, and standards. In R. K. Hambleton & J. N. Zaal (Eds.), Advances in educational and psychological testing:  Theory and Applications (pp. 341-366).  Boston: Kluwer.

Han, N. (2003). Using moving averages to assess test and item security in computer-based testing (Center for Educational Assessment Research Report No. 468). Amherst, MA: University of Massachusetts, School of Education.

Han, K. T. (2009).  A gradual maximum information ratio approach to item selection in computerized adaptive testing. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF file, 391 KB}

#HA04-01.  Han, N. & Hambleton, R. K.  (2004).  Detecting exposed test items in  computer-based testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego CA.  {PDF file, 1.245 MB}

Handel, R. W. Ben-Porath, Y.S., & Watt, M. (1997, June).  Comparability and validity of computerized adaptive testing with the MMPI-2 using a clinical sample.  Paper presented at the 32nd Annual Symposium and Recent Developments in the use of the MMPI-2 and MMPI-A.  Minneapolis MN.

#HA99369.  Handel, R. W. Ben-Porath, Y.S., & Watt, M. (1999).  Computerized adaptive assessment with the MMPI-2 in a clinical setting.  Psychological Assessment, 11, 369-380.

Hankins, J. A.  (1987).  The effects of variable entry on bias and information of the Bayesian adaptive testing procedure.  Dissertation Abstracts International, 47 (8A), 3013.

Hankins, J. A.  (1990).  The effects of variable entry on bias and information of the Bayesian adaptive testing procedure.  Educational and Psychological Measurement, 50, 785-802.

Hansen, D. N.  (1969).  An investigation of computer-based science testing.  In R. C. Atkinson and H. A. Wilson (Eds.), Computer-assisted instruction: A book of readings.  New York:  Academic Press.

#HA75-01.  Hansen, D. N. (1976).  Reflections on adaptive testing.  In C. K. Clark (Ed.),  Proceedings of the First Conference on Computerized Adaptive Testing (pp. 90-94).  Washington DC: U.S. Government Printing Office.  {PDF file, 464 KB}

#HA68-01.  Hansen, D. N. & Schwarz, G. (1968, March).  An investigation of computer-based science testing.  Tallahassee FL: Florida State University.  (See published version.)

Hansen, D. N., Johnson, B. F., Fagan, R. L., Tan, P., & Dick, W.  (1974/1975).  Computer-based adaptive testing models for the Air Force technical training environment:  Phase I.  Development of a computerized measurement system for Air Force technical Training.  JSAS Catalogue of Selected Documents in Psychology, 5, 1-86 (MS No. 882).  AFHRL Technical Report 74-48.

Hansen, D. N., Ross, S., & Harris, D. A.  (1977).  Flexilevel adaptive testing paradigm:   Validation in technical training.  AFHRL Technical Report 77-35 (I).

Hansen, D. N., Ross, S., & Harris, D. A.  (1977).  Flexilevel adaptive training paradigm:   Hierarchical concept structures.  AFHRL Technical Report 77-35 (II).

Hansen, D. N. & Schwarz, G.  An investigation of computer-based science testing.  Tallahassee: Institute of Human Learning, Florida State University, 1968.

Hardwicke, S., Vicino, F., McBride, J.R., & Nemeth, C. (1984).  Evalua­tion of com­pu­terized adaptive testing of the ASVAB.  San Diego, CA: Navy Personnel Research and Development Center, unpublished manuscript.

Hardwicke, S. & White, K. E.  (1983).  Predictive utility evaluation of adaptive testing: Results of the Navy research.  Falls Church VA: The Rehab Group Inc.

Harman, H. H., Helm, C. E., & Loye, D. E. (Eds.).  Computer-assisted testing. Princeton NJ:  Educational Testing Service, 1968.

#HA01-01.  Harmes, J. C., Kromrey, J. D., & Parshall, C. G. (2001, October).  Online item parameter recalibration: Application of missing data treatments to overcome the effects of sparse data conditions in a computerized adaptive version of the MCAT.  Unpublished manuscript.  {PDF file, 406 KB}

#HA03-01.  Harmes, J. C., Parshall, C. G., & Kromrey, J. D.  (2003, April).  Recalibration of IRT item parameters in CAT:  Sparse data matrices and missing data treatments.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.  (PDF file, 626 KB} 

Harris, J. D. & Smith, P. F. (1979).  A comparison of a standard and a computerized adaptive paradigm in Bekesy fixed-frequency audiometry. Journal of Auditory Research, 19, 1-22.

Hart, D. L., Cook, K. F., Mioduski, J. E., Teal, C. R., Crane, P. K. (2006). Simulated computerized adaptive test for patients with shoulder impairments was efficient and produced valid measures of function. Journal of Clinical Epidemiology, 59, 290–298.

Hart, D. L., Mioduski, J. E., & Stratford, P. W. (2005). Simulated computerized adaptive tests for measuring functional status were efficient with good discriminant validity in patients with hip, knee, or foot/ankle impairments. Journal of Clinical Epidemiology, 58, 629–638.

 

Hart, D., Mioduski, J., Werenke, M. & Stratford, P. (2006). Simulated computerized adaptive test for patients with lumbar spine impairments was efficient and produced valid measures of function. Journal of Clinical Epidemiology, 59, 947-956

#HA01249.  Hau, K.-T. & Chang, H.-H.  (2001).  Item selection in computerized adaptive testing:  Should more discriminating items be used first?  Journal of Educational Measurement, 38, 249-266. (Also presented at American Educational Research Association, 1998)

Haynie, K.A., & Way, W.D. (1994, April). The effects of item pool depth on the accuracy of pass/fail decisions for NCLEX using CAT. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans.

Haynie, K.A., & Way, W.D. (1995).  An investigation of item calibration procedures for a computerized licensure examination.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco, CA.

Hau, K. T. & Chang, H. H. (1998).  Item selection in computerized adaptive testing: Should more discriminating items be used first? Paper presented at the annual  meeting of the American Educational Research Association, San Diego, CA.

Hau, K. T. & Chang, H. H. (1998).  Item selection in computerized adaptive testing: Should more discriminating items be used first?  Journal of Educational Measurement,38, 249-266.

Hendrickson, A. B. & Kolen, M. J.  (1992, April).  Scaling of two-stage adaptive test configurations for achievement testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans LA.

#HE07044.  Hendrickson, A. (2007).  An NCME instructional module on multistage testing.  Educational Measurement: Issues and Practice, 26(2), 44-52.

Henly, S. J., Klebe, K. J., McBride, J. R., & Cudeck, R. (1989). Adaptive and conventional versions of the DAT: The first complete test battery comparison. Applied Psychological Measurement, 13, 363-371.

Hetter, R. D., Segall, D. O., & Bloxom, B.  (1992, October).  Need title . Paper presented at the annual conference of the Military Testing Association, San Diego CA.

Hetter, R. D., Bloxom, B. M., & Segall, D. O. (1993).  Item Calibration: Medium-of-administration effect on computerized adaptive scores (TR-93-9). Navy Personnel Research and Development Center.

Hetter, R. D., Segall, D. O., & Bloxom, B. M.  (1994).  A comparison of item calibration media in computerized adaptive tests.  Applied Psychological Measurement, 18, 197-204. 

Hetter, R.D., Segall, D.O. & Bloxom, B.M. (1997). Evaluating item calibration medium in computerized adaptive testing.  In W.A. Sands, B.K. Waters & J.R. McBride, Computerized adaptive testing: From inquiry to operation (pp. 161-168). Washington, DC: American Psychological Association.

Hetter, R. D., & Sympson, J. B. (1997). Item exposure control in CAT-ASVAB. In W. A. Sands, B. K. Waters, & J. R. McBride (Eds.), Computerized adaptive testing: From inquiry to operation (pp. 141-144). Washington DC: American Psychological Association.

#HO89-01.  Ho, R., & Hsu, T. C. (1989, March). A comparison of three adaptive testing strategies using MicroCAT.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco. (Tables and figures only.)

 

Ho, R.-G., & Yen, Y.-C. (2005). Design and evaluation of an XML-based platform-independent computerized adaptive testing system. IEEE Transactions on Education, 48(2), 230–237

 

Hockemeyer, C. (2002). A comparison of non-deterministic procedures for the adaptive assessment of knowledge. Psychologische Beiträge, 44, 495–503.

Hogan, P.F., Dall, T. & McBride, J.R. (1996)  Preliminary cost-effectiveness analysis of alternative ASVAB testing concepts at MET sites.  Interim report to Defense Manpower Data Center.  Fairfax, VA:  Lewin-VHI, Inc.

Hogan, P.F., McBride, J.R. & Curran, L.T. (1995).  An evaluation of alternative concepts for administering the Armed Services Vocational Aptitude Battery to applicants for enlistment.  DMDC Technical Report 95-013.  Monterey, CA:  Personnel Testing Division, Defense Manpower Data Center.

#HO06-01.  Hol, A. M. (2006).  A CAT with personality and attitude.  Enschede, The Netherlands:  PrintPartners Ipskamp B.V.

Hol, A. M., Vorst, H. C. M., & Mellenbergh, G. J. (2001).  [Application of a computerized adaptive test procedure on personality data].  Nederlands tijdschrift voor de psychologie, 56, 119-133).  In Dutch.

 

Hol, A. M., Vorst, H. C. M., & Mellenbergh, G. J. (2005). A randomized experiment to compare conventional, computerized, and computerized adaptive administration of ordinal polytomous attitude items. Applied Psychological Measurement, 29, 159-183.

Hol, A. M., Vorst, H. C. M. & Mellenbergh, G. J. (2007). Computerized adaptive testing for polytomous motivation items: Administration mode effects and a comparison with short forms. Applied Psychological  Measurement, 31, 412-429.

Holland, P. W. & Zwick, R.  (1991).  A simulation study of some simple approaches to the study of DIF for CATs.  Internal memorandum, Educational Testing Service.

Holmes, R. M.,  & Segall, D. O. (DMDC) (1999, April). Reducing item exposure without reducing precision (much) in computerized adaptive testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, CA.

Holst, P. M., O’Donnell, A. M., & Rocklin, T. R. (1992, April).  Effects of feedback during self-adapted testing on estimates of ability.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco.

#HO70198.  Holtzman, W. H. (1970).  Individually tailored testing: Discussion.  In W. H. Holtzman, (Ed.), Computer-assisted instruction, testing, and guidance (pp.198-200).  New York:  Harper & Row.  [see #LO70139 and #GR70184]

Hontangas, P., Olea, J., Ponsoda, V., Revuelta, J. & Wise, S.L. (2004). Assisted self-adapted testing: A comparative study. European Journal of Psychological Assessment, 1, 2-9.

Hontangas, P., Ponsoda, V., Olea, J. & Wise, S.L. (2000). The choice of item difficulty in self adapted testing. European Journal of Psychological Assessment, 16, 1, 3-12.

#HO77-01.  Hornke, L. F. (1977, June).  Four realizations of pyramidal adaptive testing strategies.  Paper presented at the Third International Symposium on Educational Testing, University of Leiden, The Netherlands.

Hornke, L. F.  (1979).  Four realizations of pyramidal adaptive testing.  Programmed Larning and Educational Technology, 16, 164-169.

Hornke, L. F. (1995).  Item times in computerized testing—A new differential information.  European Journal of Psychological Assessment, 11 (Suppl. 1) 108-109.

Hornke, L. F. (1999). Benefits from computerized adaptive testing as seen in simulation studies. European Journal of Psychological Assessment, 15(2), 91-98.

#HO80-01.  Hornke, L. F. & Sauter. M. B. (1980).  A validity study of an adaptive test of reading comprehension.  In D. J. Weiss (Ed.), Proceedings of the 1979 Computerized Adaptive Testing Conference (pp. 57-67).  Minneapolis MN:  University of Minnesota, Department of Psychology, Psychometric Methods Program.  {PDF file, 676 KB}

Hou, L., Chen, S., Dodd. B. G., & Fitzpatrick, S. J.  (1996, April).  The effects of methods of theta estimation, prior distribution, and number of quadrature points on CAT using the graded response model.  Paper presented at the annual meeting of the American Educational Research Association, New York NY.

Hsu, T.-C. & Shermis, M. D. (1988).  The development and evaluation of a microcomputerized adaptive placement testing system for college mathematics.  Paper(s) presented at the annual meeting(s) of the American Educational Research Association, 1986 (San Francisco CA) and 1987 (Washington DC).

Hsu, T. C. & Tseng, F. L. (1995).  Using simulation to select an adaptive testing strategy:  An item bank evaluation program.  Unpublished manuscript, University of Pittsburgh.

Hsu, Y., Thompson, T.D., & Chen, W-H. (1998, April).  CAT item calibration.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego.

Huang, C.-Y., Kalohn, J. C.,  Lin, C.-J., &  Spray, J. (2000).  Estimating item parameters from classical indices for item pool development with a computerized classification test (Research Report 2000-4).  Iowa City IA: ACT Inc.

Huang, S. X. (1996).  A content-balanced adaptive testing algorithm for computer-based training systems. In Frasson, C., Gauthier, G., &  Lesgold, A. (Eds.), Intelligent Tutoring Systems, Third International Conference, ITS'96, Montréal, Canada, June 1996 Proceedings. Lecture Notes in Computer Science 1086. Berlin Heidelberg: Springer-Verlag 306-314.

Hubbard, J. P.  ( 1966).  Programmed testing in the examinations of the National Board of Medical Examiners.  In A. Anastasi (Ed.), Testing problems in perspective.  Washington DC: American Council on Education. 

Huebner, A., Wang, B., & Lee, S. (2009).  Practical issues concerning the application of the DINA model to CAT data. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF file, 139 KB}

#HU01016.  Huff, K. L. & Sireci, S. G.  (2001).  Validity issues in computer-based testing.  Educational Measurement:  Issues and Practice, 20(3), 16-25.

Huisman, J. M. E. (1999). Item nonresponse: Occurrence, causes and imputation of missing answers to test items. (M & T Series No. 32). Leiden: DSWO Press.

Huisman, J. M. E., & Molenaar, I. W. (2001). Imputation of missing scale data with item response models. In A. Boomsma, M. A. J. van Duijn, & T. A. B. Snijders (Eds.), Essays on item response theory (pp. 222-244). New York: Springer-Verlag.

Hutt, M. L. (1947).  A clinical study of  “consecutive” and “adaptive” testing with the revised Stanford-Binet.  Jurnal of Consulting Psychology, 11, 93-103.

- I -

Imai, S., Ito, S., Nakamura, Y., Kikuchi, K., Akagi, Y., Nakasono, H., Honda, A., & Hiramura, T.  (2009).  Features of J-CAT (Japanese Computerized Adaptive Test). In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.  {PDF File, 655KB}

Immekus, J. C., Gibbons, R.D., & Rush, J. A. (2007). Patient-reported outcomes measurement and computerized adaptive testing: An application of post-hoc simulation to a diagnostic screening instrument. In D. J. Weiss (Ed.). Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing. {PDF file, 203 KB}

Ireland, C. M. (1977).  An application of the Rasch one-parameter logistic model to individual intelligence testing in a tailored testing environment.  Dissertation Abstracts International, 37 (9-A), 5766.

Ito, K., Pommerich, M., & Segall, D. (2009).  An evaluation of a new procedure for computing information functions for Bayesian scores from computerized adaptive tests. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.  {PDF file, 571 KB}

Ito, K. & Sykes, R.C. (1994).  The effect of restricting ability distributions in the estimation of item difficulties:  Implications for a CAT implementation.  Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans.

Iwamoto, C. K., Nungester, R. J., & Luecht, R. M. (1999, April).  Study of methods to detect aberrant response patterns in computerized testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

- J -

Jacobs-Cassuto, M.S. (2005). A comparison of adaptive mastery testing using testlets with the 3-parameter logistic model.  Unpublished doctoral dissertation, University of Minnesota, Minneapolis, MN.

Jacobson, R. L. (1993, September 13).  New computer technique seen producing a revolution in testing.  The Chronicle of Higher Education, p A22.

Jacobson, R. L.  (1995, January 6).   Shortfall of questions curbs use of computerized graduate exam.  The Chronicle of Higher Education, A23.

Janczewski, D. & Lowe, P. (1992). The Language Training Division's computer adaptive reading proficiency test. Provo, UT: Language Training Division, Office of Training and Education.

#JE72-01.  Jensema, C. J. (1972).  An application of latent trait mental test theory to the Washington Pre-College Testing Battery.  Unpublished doctoral dissertation, University of Washington.

#JE74029.  Jensema, C. J. (1974).  An application of latent trait mental test theory.  British Journal of Mathematical and Statistical Psychology, 27, 29-48. 

Jensema, C. J. (1974).  The validity of Bayesian tailored testing.  Educational and Psychological Measurement, 34, 757-756.

#JE75-01.  Jensema, C. J. (1976).  Bayesian tailored testing and the influence of item bank characteristics.  In C. K. Clark (Ed.),  Proceedings of the First Conference on Computerized Adaptive Testing (pp. 82-89).  Washington DC: U.S. Government Printing Office.  {PDF file, 370 KB}

Jensema, C. J. (1977).  Bayesian tailored testing and the influence of item bank characteristics.  Applied Psychological Measurement, 1, 111-120.

Jennings, J. A. (U TX, Austin), Dodd, B. G., & Fitzpatrick, S. J. (2001, April).  An investigation of the impact of items that exhibit mild DIF on ability estimation in CAT.  Paper presented at the annual meeting of the National Council on Measurement in Education, Seattle WA.

 

Jette, A., Haley, S., Tao, W., Ni, P., Moed, R., Meyers, D. & Zurek, M. (2007). Prospective evaluation of the am-pac-cat in outpatient rehabilitation settings. Physical Therapy, 87, 385-398.

 

Jhu, Y.-J., & Chen, S.-Y. (2008). Item exposure control in a-stratified computerized adaptive testing. Psychological Testing, 55, 793-811.

#JI03-01.  Jiao, H. & Lau, A. C.  (2003, April).  The effects of model misfit in computerized classification test.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.  {PDF file, 432 KB}

#JI04-01.  Jiao, H., Wang, S., & Lau, A.(2004).  An investigation of two combination procedures of SPRT for three-category decisions in computerized classification test.  Paper presented at the annual meeting of the American Educational Research Association, San Diego CA.  {PDF file, 649 KB}

Jodoin, M. G. (2002, June). Reliability and decision accuracy of linear parallel form and multi stage tests with realistic and ideal item pools. Paper presented at the International Conference on Computer-Based Testing and the Internet, Winchester, England.

Jodoin, M.  (2003, April).  A multidimensional IRT mechanism for better understanding adaptive test behavior.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.

#JO02-01.  Jodoin, M., Zenisky, A., & Hambleton, R.  (2002, April).  Comparison of the psychometric properties of several computer-based test designs for credentialing exams. Paper presented at the annual meeting of the National Council on Measurement in Education, New Orleans LA.  {PDF file, 261 KB}  

 Johnson, J. L., Roos, L. L., Wise, S. L., & Plake, B. S. (1991).  Correlates of examinee item choice behavior in self-adapted testing.  Mid-Western Eduactional Researcher, 4, 25-28.

#JO79-01.  Johnson, M. J.  (1979).  Student reaction to computerized adaptive testing in the classroom.  Paper presented at the 87th annual meeting of the American Psychological Association, New York.

#JO80-01.  Johnson, M. J. & Weiss, D. J. (1980).  Parallel forms reliability and measurement accuracy comparison of adaptive and conventional testing strategies.  In D. J. Weiss (Ed.), Proceedings of the 1979 Computerized Adaptive Testing Conference (pp. 16-34).  Minneapolis:  University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.  {PDF file, 918 KB}

#JO73083.  Jones, D. & Weinman, J.  (1973). Computer-based psychological testing.  In A. Elithorn & D. Jones (Eds.),  Artificial and human thinking (pp. 83-93).  San Francisco CA: Jossey-Bass.

Jones, D. H.  (1997, March).  Mathematical programming approaches to computerized adaptive testing. Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.

Jones-Dickson, C., Dorsey, D., Campbell-Warnock, J., & Fields F. (1993).  Moving in a new direction: Computerized adaptive testing (CAT). Nursing Management, 24, 80-82.

- K -

Kalisch, S. J. (1973).  A tailored testing model employing the beta distribution and conditional difficulties.  Journal of Computer-Based Instruction, 1, 111-120.

Kalisch, S. J. (1974).  A tailored testing model employing the beta distribution (unpublished manuscript).  Florida State University, Educational Evaluation and Research Design Program.

Kalisch, S. J. (1974).  A tailored testing model employing the beta distribution and conditional difficulties.  Journal of Computer-Based Instruction, 1, 22-28.

Kalisch, S. J. (1974).  The comparison of two tailored testing models and the effects of the models’ variables on actual loss.  Unpublished doctoral dissertation, Florida State University.

#KA80-01.  Kalisch, S. J.  (1980).  A model for computerized adaptive testing related to instructional situations.   In D. J. Weiss (Ed.).  Proceedings of the 1979 Computerized Adaptive Testing Conference (pp. 101-119).  Minneapolis MN:  University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.  {PDF file, 965 KB}

Kalisch, S. J. (1980, February).  Computerized instructional adaptive testing model:  Formulation and validation (AFHRL-TR-79-33, Final Report).  Brooks Air Force Base TX: Air Force Human Resources Laboratory.  Also Catalog of Selected Documents in Psychology, February 1981, 11, 20 (Ms. No, 2217).

Kalohn, J. C. & Spray, J. A. (1998, April).  Effect of item selection on item exposure rates within a computerized classification test.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego CA.

#KA99047.  Kalohn, J. C. & Spray, J. A. (1999).  The effect of model misspecification on classifications decisions made using a computerized test.  Journal of Educational Measurement,36, 47-59.

Kalohn, J.  (2000). Test security and item exposure control for computer-based …  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago.

Kamakura, W. A., & Balasubramanian, S. K. (1989). Tailored interviewing: An application of item response theory for personality measurement. Journal of Personality Assessment, 53, 502-519.

Kappauf, W. E. (1969).  Use of an on-line computer for psychological testing with the up-and-down method.  American Psychologist, 24, 207-211.

Karino, C. A., Costa, D. R., & Laros, J. A. (2009).  Adequacy of an item pool measuring proficiency in English language to implement a CAT procedure. In D. J. Weiss (Ed.),  Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing. {PDF File, 160 KB}

Kiely, G. L., Zara, A. R., & Weiss, D. J. (1983).  Alternate forms reliability and concurrent validity of adaptive and conventional tests with military recruits.  Minneapolis MN:  University of Minnesota, Department of Psychology, Computerized Adaptive Testing Laboratory.

Killcross, M. C. (1974, August).  A tailored testing system for selection and allocation in the British Army.  Paper presented at the 18th International Congress of Applied Psychology, Montreal Canada.

Killcross, M. C. (1976).  A review of research in tailored testing (Report APRE No. 9/76).  Farnborough, Hants, U. K.: Ministry of Defence, Army Personnel Research Establishment.

Kim, J.  (1993).  Individual differences in computerized adaptive testing.  Paper presented at the annual meeting of the Mid-South Educational Research Association, New Orleans LA.

Kim, J. & McLean, J. E.  (1995, April).  The influence of examinee test-taking behavior motivation in computerized adaptive testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco CA.  (ERIC No. ED392839)

Kim, H., & Plake, B. S. (1993, April). Monte Carlo simulation comparison of two-stage testing and computerized adaptive testing. Paper presented at the meeting of the National Council on Measurement in Education, Atlanta, GA.

Kim-Kang, G. & Weiss, D. J. (2007).  Comparison of computerized adaptive testing and classical methods for measuring individual change.  In D. J. Weiss (Ed.).  Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing. {PDF file, 347 KB}

Kim-Kang, G. & Weiss. D. J.  (2008). Adaptive measurement of individual change. Zeitschrift für Psychologie / Journal of Psychology,  216, 49-58. {PDF file, 568 KB}

Kingsbury, G. G. (1984).  Adaptive self-referenced testing as a procedure for the measurement of individual change in instruction:  A comparison of the reliabilities of change estimates obtained from conventional and adaptive testing procedures.  Unpublished doctoral dissertation, Univerity of Minnesota, Minneapolis.

Kingsbury, G. G.  (1985).  Adaptive self-referenced testing as a procedure for the measurement of individual change:  A comparison of the reliabilities of change estimates obtained from conventional and adaptive testing procedures.  Dissertation Abstracts International, 45 (9-B), 3057.

Kingsbury, G. . (1986).  Computerized adaptive testing: A pilot project.  In W. C. Ryan (ed.), Proceedings: NECC ’86, National Educational Computing Conference (pp.172-176).   Eugene OR: University of Oregon, International Council on Computers in Education.

Kingsbury, G. G. et al. (1988). Computerized adaptive testing:  A four-year-old pilot study shows that CAT can work.  Technological Horizons in Education, 16 (4), 73-76.

#KI90003.  Kingsbury, G. G. (1990).  Adapting adaptive testing:  Using the MicroCAT testing in a local school district.  Educational Measurement:  Issues and Practice,9 (2), 3-6, 29.

Kingsbury, G. G. (1991). A comparison of procedures for content-sensitive item selection.  Applied Measurement in Education, need page numbers.

Kingsbury, G. G. (1996, April).  Item review and adaptive testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, New York.

Kingsbury, G. G.  (1997, March).  Item pool development and maintenance.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.

Kingsbury, G. G.  (1997, March).  Some questions that must be addressed to develop and maintain an item pool for use in an adaptive test.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL.

Kingsbury, G. G. (1999, April).  Standard errors of proficiency estimates in stratum scored CAT.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

#KI02-01.  Kingsbury, G. G. (2002, April).  An empirical comparison of achievement level estimates from adaptive tests and paper-and-pencil tests.  Paper presented at the annual meeting of the American Educational Research Association, New Orleans LA.  {PDF file, 134 KB}

Kingsbury, G. G. (2009).  Adaptive item calibration: A process for estimating item parameters within a computerized adaptive test. In D. J. Weiss (Ed.), Proceedings of the 2009 GMAC Conference on Computerized Adaptive Testing.  {PDF File, 286 KB} {PDF File, 286 KB}

#KI88-01. Kingsbury, G. G., & Houser, R. L. (1988, April). A comparison of achievement level estimates from computerized adaptive testing and paper-and-pencil testing.  Paper presented at the annual meeting of the American Educational Research Association, New Orleans LA.  {PDF file, 43 KB}

#KI89-01.  Kingsbury, G. G., & Houser, R. L. (1989, March).  Assessing the impact of using item parameter estimates obtained from paper-and-pencil testing for computerized adaptive testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Francisco.

Kingsbury, G. G. & Houser, R. L. (1990, April). Assessing the utility of item response models: Computerized adaptive testing. A paper presented to the annual meeting of the National Council of Measurement in Education, Boston MA.

#KI93-01.  Kingsbury, G. G., & Houser, R. L. (1993, April).   A practical examination of the use of free-response questions in computerized adaptive testing.  Paper presented to the annual meeting of the American Educational Research Association: Atlanta GA.  {PDF file, 30 KB}

Kingsbury, G. G., & Houser, R. L. (1993). Assessing the utility of item response models: Computerized adaptive testing. Educational Measurement: Issues and Practice, 12, 21-27, 39.

Kingsbury, G. G. & Houser, R.L. (1999).  Developing computerized adaptive tests for school children.  In F. Drasgow & J. B. Olson-Buchanan (Eds.), Innovations in computerized assessment (pp. 93-115).  Mahwah NJ: Erlbaum.

#KI04-01.  Kingsbury, G. G. & Hauser, C. (2004).  Computer adaptive testing and the No Child Left Behind Act.  Paper presented at the annual meeting of the American Educational Research Association, San Diego CA.  {PDF file, 117 KB}

 

Kingsbury, G. G. & Houser, R. L. (2007).  ICAT: An adaptive testing procedure to allow the identification of idiosyncratic knowledge patterns. In D. J. Weiss (Ed.).  Proceedings of the 2007 GMAC Conference on Computerized Adaptive Testing. {PDF file, 161 KB}

 

Kingsbury, G. G. & Houser, R/ L. (2008). ICAT: An adaptive testing procedure for the identification of idiosyncratic knowledge patterns. Zeitschrift für Psychologie / Journal of Psychology, 216(1), 40–48.

#KI79-05. Kingsbury, G. G. & Weiss, D. J. (1979).  An adaptive testing strategy for mastery decisions (Research Report 79-5).  Minneapolis:  University of Minnesota, Department of Psychology, Psychometric Methods Program. {PDF file, 2.146 MB}

#KI80-04.  Kingsbury, G. G., & Weiss, D. J. (1980). A comparison of adaptive, sequential, and conventional testing strategies for mastery decisions (Research Report 80-4).  Minneapolis, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.  {PDF file, 1.905 MB}

#KI80-05. Kingsbury, G. G., & Weiss, D. J. (1980). An alternate-forms reliability and concurrent validity comparison of Bayesian adaptive and conventional ability tests (Research Report 80-5).  Minneapolis, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory. {PDF file, 1.11 MB}

#KI80-01.  Kingsbury, G. G. & Weiss, D. J.  (1980).  A comparison of ICC-based adaptive mastery testing and the Waldian probability ratio method.  In D. J. Weiss (Ed.).  Proceedings of the 1979 Computerized Adaptive Testing Conference (pp. 120-139).  Minneapolis MN:  University of Minnesota, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.  {PDF file, 1.351 MB}

#KI81-03.  Kingsbury, G. G. & Weiss, D. J. (1981).  A validity comparison of adaptive and conventional strategies for mastery testing (Research Report 81-3).  Minneapolis, Department of Psychology, Psychometric Methods Program, Computerized Adaptive Testing Laboratory.  {PDF file, 1.855 MB}

Kingsbury, G. G. & Weiss, D. J. (1983). A comparison of IRT-based adaptive mastery testing and a sequential mastery testing procedure. In D. J. Weiss (Ed.), New horizons in testing:  Latent trait test theory and computerized adaptive testing (pp. 257-283). New York: Academic Press.

Kingsbury, G. G., & Zara, A. R. (1989). Procedures for selecting items for computerized adaptive tests. Applied Measurement in Education, 2, 359-375.

Kingsbury, G. G., & Zara, A. R. (1991). A comparison of procedures for content-sensitive item selection in computerized adaptive tests. Applied Measurement in Education, 4, 241-261.

#KI99-1.  Kingsbury, G. G., & Zara, A. R. (1999, April).  A comparison of conventional and adaptive testing procedures for making single-point decisions.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

Kingsbury, G. G., & Zara, A. R. (1999, April).  A procedure to compare conventional and adaptive testing procedures for making single-point decisions.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

Kingston, N. M. & Dorans, J.J. (1984).  Item location effects and their implications for IRT equating and adaptive testing.  Applied Psychological Measurement, 8, 147-154.

Kirisci, L. & Hsu, T.-C. (1988, April).  A predictive analysis approach to adaptive testing.  Paper presented at the annual meeting of the American Educational Research Association, New Orleans LA. (ERIC No. ED295982).

Kirisci, L. (1992). Estimation of ability level by using only observable quantities in adaptive testing.  Paper presented at the annual meeting if the American Educational Research Association, Chicago.

Koch, W. R. & Patience, W. M. (1977).  Student attitudes toward tailored testing.  In D. J. Weiss (Ed.), Proceedings of the 1977 Computerized Adaptive Testing Conference.  Minneapolis MN:  University of Minnesota, Department of Psychology, Psychometric Methods Program. 

Koch, W. R., & Dodd, B. G. (1985, April). Computerized adaptive attitude measurement. Paper presented at the annual meeting of the American Educational Research Association, Chicago.

#KO86-01.  Koch, W. R. & Dodd. B. G. (1986, April).  Operational characteristics of adaptive testing procedures using partial credit scoring.  Paper presented at the annual meeting of the American Educational Research Association, San Francisco CA.

Koch, W. R., & Dodd, B. G. (1989). An investigation of procedures for computerized adaptive testing using partial credit scoring. Applied Measurement in Education, 2, 335-357.

Koch, W. R., & Dodd, B. G. (1995). An investigation of procedures for computerized adaptive testing using the successive intervals Rasch model. Educational and Psychological Measurement, 55, 976-990.

Koch, W. R., Dodd, B. G., & Fitzpatrick, S. J. (1990). Computerized adaptive measurement of attitudes. Measurement and Evaluation in Counseling and Development, 23, 20-30.

Koch, W. J. & Reckase, M. D. (1978).  A live tailored testing comparison study of the one- and three-parameter logistic models  (Research Report 78-1).  Columbia MO:  University of Missouri, Department of  Psychology.

Koch, W. J. & Reckase, M. D. (1979).  Problems in application of latent-trait models to tailored testing (Research Report 79-1).  Columbia MO:  University of Missouri, Department of  Psychology. (also presented at National Council on Measurement in Education, 1979:  ERIC No. ED 177 196)

Kolen, M. J. (1999-2000).  Threats to score comparability with applications to performance assessments and computerized adaptive tests.  Educational Assessment, 6, 73-96.

Krass, I. A. (1997, June). Getting more precision on computer adaptive testing. Paper presented at the 62nd Annual meeting of Psychometric Society, University of Tennessee, Knoxville, TN.

#KR98-01.  Krass, I. A. (1998, April). Application of direct optimization for on-line calibration in computerized adaptive testing.  Paper presented at the annual meeting of the National Council on Measurement in Education, San Diego CA.  {PDF file, 146 KB}

#KR00-01.  Krass, I. A. (2000). Change in distribution of latent ability with item position in CAT sequence. Paper presented at the annual meeting of the National Council on Measurement in Education in New Orleans, LA.  {PDF file, 103 KB} 

#KR01-01.  Krass, I. A.  (2001). Application of score information for CAT pool development and its connection with "likelihood test information.” Paper presented at the annual meeting of the National Council on Measurement in Education, Seattle WA.  {PDF file, 392 KB}

Krass, I. A. & Thomasson, G.L. (DMDC).  (1999, April).  Automated flawed item detection and graphical item used in on-line calibration of CAT-ASVAB.  Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.

#KR03-01.  Krass, I. A. & Williams, B.  (2003, April).  Calibrating CAT pools and online pretest items using nonparametric and adjusted marginal maximum likelihood methods.  Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago IL. {PDF file, 128 KB} 

Krathwohl, D.  (1959).  Progress report on the sequential item test.  East Lansing MI: Michigan State University, Bureau of Educational Research.

Krathwohl, D. R. & Huyser, R. J. (1956).  The sequential item test.  American Psychologist, 2, 419.

Kreiter, C. D., Ferguson, K., & Gruppen, L. D. (1999).  Evaluating the usefulness of computerized adaptive testing for medical in-course assessment. Academic Medicine, 74, 125-28.

Kreitzberg, C. B. (1978).  Computerized adaptive testing:  Principles and directions.  Computers and Education, 2 (4), 319-329.

Kreitzberg, C. B. & Jones, D. J. (1980).  An empirical study of a broad range test of verbal ability.  Princeton NJ:  Educational Testing Service.

Kreitzberg, C. B., Stocking, M., & Swanson, L. (1978).  Computerized adaptive testing: Principles and directions.  Computers and Education, 2, 319-329.

Krimpen-Stoop, E. M. L.A. van and Meijer, R. R. (1999a).  CUSUM-based person-fit statistics for adaptive testing.  Technical Report RR 99-05, Univeristy of Twente, Enschede, The Netherlands.

Krimpen-Stoop, E. M. L.A. van and Meijer, R.R. (2000).  The null distribution of person-fit statistics for conventional and adaptive tests.  Applied Psychological Measurement, 23, 327-345.

Krimpen-Stoop, E. M. L.A. van and Meijer, R. R.. (2000). Detecting person misfit in adaptive testing using statistic