The period from 2001 to 2002 was a crucial one in the new TOEFL® project. If the new test was to become a reality, Educational Testing Service® (ETS®) needed to begin operational planning for the production, delivery, and scoring of test forms to be administered worldwide to 700,000 or more candidates per year. To facilitate this planning, a concrete example of the new test was needed to demonstrate feasibility of such a test on a larger scale, to determine operational requirements, and to inform test users about impending changes. This called for a field study of the proposed new test. In this chapter, we discuss the empirical evaluation of two prototype test forms in a field study.